Intel® Many Integrated Core Architecture

Get the Power Consumption Info When Running a Program


When I run a program in offload mode, I want to collect power consumption information for the MIC. Is there any way to do that?

I found that I can use micsmc to get the Total Power info from the command line or the GUI. Could I call a function from some library inside my program to collect the power consumption when using offload mode?

Remote procedure call: license for code that calls the SCIF library

We have developed a low-latency remote procedure call (RPC) framework for Xeon <--> Xeon Phi based on SCIF, which has been very successful for our applications. We think it could be helpful to other Intel Xeon Phi users, and we would like to share the code with the community as an open-source project. However, we do not know whether there is any problem with Intel's license or copyright, or any copyright infringement; we believe there is not. Would you please help us? Thanks, Minh

Explore Intel® AVX-512 Code Paths with Intel® Advisor XE while not Having Compatible Hardware

Many factors can make programs difficult to vectorize automatically. We will examine some of the factors that make vectorizing code problematic unless the compiler is given some additional hints. Vectorizing loops is critical for increasing your application's performance, and Intel Advisor XE is the tool that can guide you through the process of vectorization.

Intel Advisor XE 2016 is a dynamic analysis tool that now contains a Vectorization Advisor feature. Using Vectorization Advisor you can survey all the loops in your application and see:

offload error: cannot offload to MIC - device is not available

I am new to MIC.

I installed and configured the Phi following this link "". It seems that everything is OK. But when I run a program, I get this message: "offload error: cannot offload to MIC - device is not available". I source intel64 before running the program. My compiler version is "Intel(R) C Intel(R) 64 Compiler for applications running on Intel(R) 64, Version Build 20151021".

I ran "micctrl -s" and got the following output:

offload error: dlopen() failed


When I try to run my offload application, I get this error:

On the remote process, dlopen() failed. The error message sent back from the sink is /tmp/coi_procs/1/3640/load_lib/icpcoutmdD0mj: undefined symbol: _ZSt3maxIiERKT_S2_S2_
On the sink, dlopen() returned NULL. The result of dlerror() is "/tmp/coi_procs/1/3640/load_lib/icpcoutmdD0mj: undefined symbol: _ZSt3maxIiERKT_S2_S2_"
offload error: cannot load library to the device 0 (error code 20)

My compiler version is:




I am going to run a CFD simulation whose memory usage will exceed 100 GB.
I use MKL PARDISO to solve the resulting linear system.

I recently learned about the Intel Xeon Phi coprocessor and its capabilities.
To accelerate the solve, I was wondering whether PARDISO uses Intel Xeon Phi technology internally,
so that I would not have to modify my code.

Thank you in advance.

Multi-process service tool for Xeon Phi

I was wondering whether there is any resource management tool for Xeon Phi like CUDA MPS (it allows CUDA kernels from different processes to run concurrently on the same GPU, which can benefit performance when the GPU compute capacity is underutilized by a single application process). That tool improves GPU utilization, so is there any multi-process service tool for Xeon Phi?

Strange errors when resetting MIC configuration

Hi all,


I have installed MPSS 3.3.3 on my CentOS 7.2 machine. After rebuilding the kernel modules, I could install it without problems.

I had another MPSS configuration on the machine which caused problems (it did not work). So I removed those packages and reinstalled.

When I try to reset the config or initialize the default config, I get these errors.

micinfo displays my Xeon Phi and I can access it via ssh.

I just want to know whether these errors affect my system, and how to fix them.


Offload transfer question

Hi, I have a question about transferring data from the host to the coprocessor. Look at the sample code below. Is the data transferred asynchronously to the coprocessor? I would like to overlap the transfer and the computation performed on the Intel Xeon Phi with computation carried out by the CPU. When I use a combination of offload_transfer signal() and offload_wait wait(), performance is lower than with the code presented below.
