I would like to pre-allocate a number of buffers for later data transfers from CPU to MIC, using explicit offloading in C++.
We are having issues on windows 7 visual studio 2012/2013, using the intel compiler version 15.
Can VTune visualize execution of OpenMP for-loop chunks? For example, consider the parallel for-loop below:
In the following snippet, Term1 is part of a larger equation:
- 1 de 12318