I took an example of type overloading from a website and unfortunately on compilation with ifort I am getting an internal error.
The code is attached herewith to look at.
I have Xeon Phi 3120A and I wonder if I have to buy "Intel Parallel Studio XE 2015" or any other software package to make it work.
This code recipe describes how to get, build, and use the Quantum ESPRESSO code that includes support for the Intel® Xeon Phi™ coprocessor with Intel® Many-Integrated Core (MIC) architecture. This recipe focuses on how to run this code using explicit offload.
Does MIC only support 3 analysis types of vtune:General-exploration, advanced-hotspots, bandwith?
However I use snb-access-contention ,and also got a result.The result is believable?
I recently started using Xeon Phi cards for parallel programming, so I am still a newbie in this field.
I wrote this code as a simple example to start understanding this fascinating world, but I got surprised when I looked at the time of executions.
When I run the code on the host, execution time is 0,08 s. When I run the code adding the pragma offload and pragma omp parallel for, execution time increase up to 9s!
When I compiled the codes, I used -O3 optimization for both of them.
Is there something I am missing?
how to monitor mic's Cache Utilization when a program is running? use vtune or some other tools?