I want to solve Poisson problem in parallel. I use red-black ordering in order to introduce parallelism.
Here is the "Red" part of the code:
I am trying to use MKL libraries and OpenMP in a MSVS C++ application on Windows7. The application shows affinity for all 24 processors (2 nodes, 6 processors, HyperThreaded). omp_get_num_procs() also shows 24 processors. When I run the program only 1 node and 6 processors are accessed. This is confirmed when I use "KMP_AFFINITY=verbose,none". It ouputs "OMP: Info #179: KMP_AFFINITY: 1 packages x 6 cores/pkg x 1 threads/core (6 total cores)". I get no compiler or linker complaints.
In one of our programs, we have this piece of code:
Does the Intel compiler/ OpenMP runtime lib. support OpenMP 4.0 for accelerators?
Explicitly cache control is a one of important feature in Xeonphi (MIC). How could I use the L1 or L2 as scratchpad memory and also sharing them data between the cores?
In addition, is there any way to hack the MESI state of the cache line in the distributed tag directory (DTD)?
Thanks in advance.
- Pagina 1