You may have received an email inviting you to the Intel® Parallel Studio XE 2016 Beta. VTune Amplifier XE 2016 beta is part of the studio and adds OpenMP* parallelization inefficiency, imbalance and work sharing analysis to tune for more efficient use of parallel regions. It also now supports multi-rank analysis of MPI* compute nodes with or without OpenMP use. Various ease-of-use enhancements include confidence indicators in General Exploration analysis results, "super tiny" bird's-eye view timeline, and "Platform" tab replacing "Tasks and Frames" tab.
I am using Amplifier XE 2015 on Windows 7 and trying to profile 4xMPI processes running on my local machine. I get 3x of the above messages when running 4 MPI processes. Is that expected? That is it seems that XE is having problems profiling multiple MPI processes at the same time.
mpiexec -n 4 amplxe-cl -result-dir my_result_ah -collect hotspots -- <my_exe.exe>
We have spin locks from TBB (rw). I am interested to know who owns the lock. Yes, we have information who spins at particular object, but where is that guy who holds a lock? How to identify it?
I have Intel(R) Core(TM) i7-4800MQ CPU @ 2.70GHz which is Haswell based processor. I want to estimate FLOPS of an application. I am using Intel VTune Amplifier XE 2015. wondering if anybody knows how to find FLOPS?
I tried following steps on https://software.intel.com/en-us/articles/estimating-flops-using-event-b... but I don't find Processor Event Name on the pages in VTune. wondering if anybody has successfully done this on Haswell processor.
I've got troubles with VTune for collecting data on a native MIC application, after we upgraded MPSS to 3.5 and Intel Tools to 2015 update 3.
Here is a log of what happens and the debug logs that VTune created as an attachment.
Probably the key line is:
tb5gen_Add_Sampling_Data returned error "Unknown reason"
There is some data, but no info based on the call stack like "top/down", and the result is therefore not really useful.
I used Vtune2015 to profile an android project. It failed just after I attached project process ID. I attached the screen print. Has anyone known this?.
Hello, I am interested in using Vtune to profile a system. I have run a project and gathered the results. I am looking at the hardware event samples for a specific cpu. EG: All hardware events for CPU 0. The problem I am having is that I want to look at the results based on small time intervals. Basically I want to see the results for every 15ms.
On Wednesday, May 6 at 9AM Pacific yours truly will present: Latest Intel® VTune™ Amplifier XE Improvements.
Discover improvements to Intel® VTune™ Amplifier XE’s OpenMP analysis aimed at helping you achieve maximum scalability. Next, find out how Intel VTune Amplifier XE will support analysis of MPI+OpenMP hybrid applications. Explore Intel VTune Amplifier XE’s profiling capabilities for Intel® HD graphics with OpenCL* and Intel® Media Server SDK. Finally, see the latest ease-of-use improvements.
So I am using Vtune to profile my system while I use a generated workload. What I am trying to do is get the data for 15ms intervals for the entire run. Here are some of the things I am running:
General exploration -> Hardware event sample count viewpoint
Bottom up view
Core, HW context, Function, Callstack -> (I am filtering in by 1 cpu. So my data is only for 1 cpu)
Is there an Intel API for getting hardware counters from code? I'm talking about something like PAPI where you can start counters at the beginning of a function then stop the counters at the end and read them.
- Página 1