Trouble with Updating MPSS

My server has 4x Intel Xeon Phi 5110P accelerator cards. it runs Centos 6.5 with kernel version 2.6.32-431.29.2.el6.x86_64

When updating MPSS from 2.1 to 3.3.4 and 3.4.3, I receive the following error:

[root@XXXXX mpss-3.3.4]# /usr/bin/micflash -update -device all -smcbootloader
Error getting SCIF driver version
failed to open mic'0': /sys/class/mic/mic0/family: Knights Corner: not supported: Operation canceled

failed to open mic'1': /sys/class/mic/mic1/family: Knights Corner: not supported: Operation canceled

MPSS 3.5

Please note that the new MPSS 3.5 is just released at


This new version supports the following OS:


- Linux: RHEL* 6.4, 6.5, 6.6, 7.0 and 7.1 & SuSE SLES* 11 SP3 and SuSE 12.

- Microsoft Windows*: Windows* 7 Enterprise SP1, 8/8.1 Enterprise, Server 2008 R2 SP1, Server 2012 and Server 2012 R2.


Performance scale of the Intel Phi MIC


The attached is plot of execution time on Intel Phi with varying number of threads. The same program runs in native and offload modes.

The Phi device has 60 cores.

1) Why the timing steps don't occur at multiples of number of cores (i.e., multiple of 60s)?

2) Why the time drops substantially around 248 threads and increases again? (i.e., > 4x60)

adding offload pragma , performance drops

Hello ,

I am running a code in openMP which is like this:


#pragma omp parallel for default( none ) shared( X , Y ,V ,V ,H , W ,N ) private ( i,x,y ,Kx,Ky,initD ,T ) 

		for ( y = 0; y < H; y++ )
			for ( x = 0; x < W; x++ )

				initD = aValue;
				for ( i = 0; i < N; i++ )
				V[ x + y * Width ] = T;


Now , I want to run it on mic card , so when I just add the line:

ANSYS® and Intel Team Up to Shrink Simulation Timelines

By Mike Pearce, Ph.D., IDZ Server Community Manager

ANSYS, a world leader in simulation software, announced on March 12 that its premier engineering simulation software product, ANSYS* Mechanical APDL 16.0 (ANSYS Mechanical 16.0), will ship with built-in, optimized support for Intel® Xeon Phi™ coprocessors.

ANSYS® 与英特尔联合加快仿真解决方案上市速度

Mike Pearce 博士,IDZ Server 社区经理

全球仿真软件领导者 ANSYS 于 3 月 12 日宣布,其顶尖工程设计仿真软件产品 ANSYS* Mechanical APDL 16.0 (ANSYS Mechanical 16.0) 将随面向英特尔® 至强融核™ 协处理器的内置、优化支持一并发运。

借助 ANSYS Mechanical 16.0,结构工程师和设计师将能够充分利用高度并行化的多核处理技术的强大功能和性能,从而以经济高效的方式加快工程设计工作负载的速度。

ANSYS Mechanical 16.0 是一款领先的商业有限元分析 (FEA) 软件解决方案,可支持工程师借助仿真手段来测试和验证各种机械和结构设计方案。 在结构设计过程中,采用精确建模和仿真可避免在设计初期创建昂贵的原型或执行物理测试,从而显著提高设计效率、降低开发成本,并缩短开发周期。 ANSYS Mechanical 16.0 仿真具备卓越的品质和灵活性,可在多个真实场景中进行反复测试,从而有助于预测最终产品的行为与可靠性。

internal error: bad pointer

my code is this: ---------------------------------------------------------------------------------------------- #include class TEST{ public: double *A; public: TEST(double * _A){ A = _A; #pragma offload_transfer target(mic:0) nocopy(this : alloc_if(1) free_if(0)) in(A:length(2*3) alloc_if(1) free_if(0)) } void run(){ A[1] = 0; // double *B = A; std::cout<

timing is different each time

Hello ,

I wrote a simple application on cpu and I am using offload pragmas for the pieces I want to run on the coprocessors.

Since I am compiling on cpu and I use offloads , I am using :

<code>export MIC_ENV_PREFIX=MIC

in order to specify the threads number.

My problems:

1) Running the code , shows always 40 threads been used.

2) Running again and again the code without compiling , I am getting different time results.

Server abonnieren