Here is a Friday post that has a sufficient lack of information that will probably be impossible to answer. I have some older Fortran code I'm trying to improve the performance of.
Hi. I have a question about synchronization OpenMP thread teams. I would like to write code where only one threads team do caltulation at the time, eg: time 0 -> team 0
I have a program where at one point I call a function.
I am doing some fortran openmp code offloading in MIC.In compilation time I am getting the below error messages.Please help me.
Is it possible to implement BLAS library on Intel Phi in such a way that each thread in openMP calls a BLAS function on different data sets independently?
when I run my program on two nodes, and set the number of threads larger than 1 (i.e., 2-16 threads), I've encountered the segmentation error:
when I offload a parallel region, my OpenMP tasks are not executed, is this intended?
I am using a simple Merge Sort benchmark on the Xeon Phi. 78% of the total CPU time is consumed by "libiomp5.so"
Hi. I have some problem. I write aplication for Intel Xeon Phi (61 cores), which does stencil calculation using 2D matrix (five-point stencil). I would like to use OpenMP 4.0 teams.
I just wrote test code on KNL using openmp. But I feel really confused about the result.
The code is as below: