My question is about using VTune (Amplifier XE 2016 Update 2, build 444464) to profile C++ code on MICs.
I'm not sure how to describe this problem accurately without a screenshot, so here it is:
We found that the latest VTune can only do basic profiling on Intel's upcoming Atom APL platform. However, we need advanced profiling for platform performance analysis. How can we get VTune to support
I'm absolutely sure that this question has been addressed numerous times, but I can't find it in the search. But then I can never find anything using the Intel search engine.
I am trying to run VTune on a rather long test case: 2+ hours with 16 threads.
What are the suggested settings for this type of analysis?
When I try to profile my application, I get the following error:
I just installed Beta Update 1 (build 466719) of VTune Amplifier XE 2017 on Windows. I created a trivial program (below), but I can't see any Python function names in VTune.
I am trying to profile a process on Linux running on a single CPU on a Broadwell (model name: Intel(R) Xeon(R) CPU D-1540 @ 2.00GHz) and am getting a CPI rate of 2.828 with the default VTune config.
I have a piece of code that uses both OpenMP and MPI, and I wish to profile it in different configurations, e.g.
One Haswell node with 20 cores in the following configurations