I'm running OpenMP code on 8 cores, compiled with the -mkl flag.
Each OpenMP thread runs its own job simultaneously, and each job calls an MKL function (dgemm).
I'd like to know how MKL creates its own threads when it is called from inside OpenMP threads.
For example, I use 8 OpenMP threads on 8 cores, so 8 jobs, and therefore 8 MKL calls, run simultaneously.
=> How many threads does MKL create in this case? (I know that MKL's threading is based on OpenMP.)
Is it only one MKL thread per OpenMP thread?
(I checked this with VTune, and it looks like MKL runs on only one thread per OpenMP thread in this case.)
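
In case it helps, here is a minimal sketch of the setup I'm describing. It is not my real code: `run_jobs` and `naive_dgemm` are hypothetical names, and `naive_dgemm` is only a stand-in for the MKL dgemm call so that the sketch builds without MKL. In the real program, calling `mkl_get_max_threads()` at the marked spot should show how many threads MKL intends to use inside the parallel region.

```c
#include <stddef.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Hypothetical stand-in for the MKL dgemm call made by each job:
 * computes C = A * B for n x n row-major matrices. */
static void naive_dgemm(int n, const double *a, const double *b, double *c)
{
    for (int i = 0; i < n; ++i) {
        for (int j = 0; j < n; ++j) {
            double sum = 0.0;
            for (int k = 0; k < n; ++k)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    }
}

/* Runs `jobs` independent multiplications, spread over up to 8 OpenMP
 * threads (the loop runs serially if OpenMP is not enabled at compile time).
 * Each job writes its n x n result into its own slice of c_all.
 * Returns the number of jobs completed. */
int run_jobs(int jobs, int n, const double *a, const double *b, double *c_all)
{
    int done = 0;

    #pragma omp parallel for num_threads(8) reduction(+:done)
    for (int job = 0; job < jobs; ++job) {
        /* In the real program this is where the MKL dgemm call sits,
         * and where mkl_get_max_threads() could be queried. */
        naive_dgemm(n, a, b, c_all + (size_t)job * n * n);
        done += 1;
    }
    return done;
}
```

With 8 jobs and 8 OpenMP threads, each thread ends up making exactly one dgemm call, which is the situation my question is about.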