Intel® Many Integrated Core Architecture (Intel MIC Architecture)

offload error: dlopen() failed


When I try to run my offload application, I get this error:

On the remote process, dlopen() failed. The error message sent back from the sink is /tmp/coi_procs/1/3640/load_lib/icpcoutmdD0mj: undefined symbol: _ZSt3maxIiERKT_S2_S2_
On the sink, dlopen() returned NULL. The result of dlerror() is "/tmp/coi_procs/1/3640/load_lib/icpcoutmdD0mj: undefined symbol: _ZSt3maxIiERKT_S2_S2_"
offload error: cannot load library to the device 0 (error code 20)
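
For reference, the undefined symbol in the dlopen() message is an Itanium C++ ABI mangled name. Below is a minimal sketch of decoding it, assuming a GCC- or Clang-compatible toolchain that provides abi::__cxa_demangle in <cxxabi.h>. The helper name demangle is hypothetical and not part of the original post.

```cpp
#include <cxxabi.h>
#include <cstdlib>
#include <string>

// Hypothetical helper: demangle an Itanium C++ ABI symbol name.
// Returns the input unchanged if the name cannot be demangled.
std::string demangle(const std::string& mangled) {
    int status = 0;
    // __cxa_demangle allocates the result with malloc when no buffer is given.
    char* out = abi::__cxa_demangle(mangled.c_str(), nullptr, nullptr, &status);
    if (status != 0 || out == nullptr) {
        return mangled;
    }
    std::string result(out);
    std::free(out);  // caller owns the malloc'd buffer
    return result;
}
```

Calling demangle("_ZSt3maxIiERKT_S2_S2_") gives int const& std::max<int>(int const&, int const&), so the library loaded on the coprocessor cannot resolve the std::max<int> template instantiation.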

My compiler version is:




I am going to run a CFD simulation whose memory usage will exceed 100 GB.
I use MKL PARDISO to solve the resulting linear system.

I recently learned about Intel Xeon Phi coprocessors and their capabilities.
To accelerate the solve, I was wondering whether PARDISO uses Intel Xeon Phi technology internally,
so that I would not have to modify my code.

Thank you in advance.

Multi-process service tool for Xeon Phi

I was wondering whether there is a resource management tool for Xeon Phi like CUDA MPS. MPS allows CUDA kernels from several processes to run concurrently on the same GPU, which can improve performance when a single application process underutilizes the GPU's compute capacity. Since that tool improves GPU utilization, is there an equivalent multi-process service tool for Xeon Phi?

Strange errors when resetting MIC configuration

Hi all,


I have installed MPSS 3.3.3 on my CentOS 7.2 machine. After rebuilding the kernel modules, I could install it without problems.

I previously had another MPSS configuration on the machine, which caused problems (it did not work at all). So I removed those packages and reinstalled.

When I try to reset the config or initialize the default config, I get these errors.

micinfo displays my Xeon Phi and I can access it via ssh.

I just want to know whether these errors affect my system, and how to fix them.


Offload transfer question

Hi, I have a question about transferring data from the host to the coprocessor. Look at the sample code below. Is the data transferred to the coprocessor asynchronously? I would like to overlap the transfer and the computation performed on the Intel Xeon Phi with computation carried out by the CPU. When I use a combination of offload_transfer signal() and offload wait(), performance is lower than with the code presented below.

Linux does not detect Xeon Phi card

Here is our setup:

Motherboard: Asus P9X79 WS BIOS version 4802 (Above 4G decoding is enabled)
CPU: Intel Core i7 4820K
OS: CentOS 7.1 with Linux 3.10.0-229

I have a Xeon Phi 31S1P. The card does not show up in the output
of lspci. I have tried putting it in a different PCI slot. I have
upgraded the motherboard BIOS to the latest version. I have
tried passing noapic and pci=realloc to the Linux kernel. Nothing
seems to work.

I do not have a Xeon processor. Could that be the problem?

Below is the complete dmesg output.
