Segfault: code adapted from MKL 11.3.0 example, LLS routine: lapacke_sgels_row.c

The attached code differs from the original example only in the matrix sizes and in how the matrices are initialized.
The sizes M, N and NRHS are 15000, 150 and 1.

When built with the following options:
gcc -DMKL_ILP64 -m64 -I${MKLROOT}/include lapacke_sgels_row.c -Wl,--no-as-needed -L${MKLROOT}/lib/intel64 -lmkl_intel_ilp64 -lmkl_core -lmkl_sequential -lpthread -lm
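One thing worth checking with these sizes, assuming the modified program still declares its matrices as local arrays the way the original example does: a 15000 x 150 single-precision matrix needs about 9 MB, which is more than the 8 MiB default stack limit on many Linux systems. The sketch below only does the size arithmetic and shows a heap allocation as an alternative; `matrix_bytes` and `alloc_matrix` are hypothetical helper names, not part of MKL or LAPACKE.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Sizes from the post. */
enum { M = 15000, N = 150, NRHS = 1 };

/* Bytes needed for a rows x cols single-precision matrix. */
size_t matrix_bytes(size_t rows, size_t cols)
{
    return rows * cols * sizeof(float);
}

/* Allocate a zero-initialized rows x cols row-major matrix on the heap
   instead of the stack. Returns NULL on failure; release with free(). */
float *alloc_matrix(size_t rows, size_t cols)
{
    assert(rows > 0 && cols > 0);
    return calloc(rows * cols, sizeof(float));
}
```

With 4-byte floats, `matrix_bytes(M, N)` is 9,000,000 bytes. Buffers obtained from `alloc_matrix(M, N)` and `alloc_matrix(M, NRHS)` could then be filled and passed to the solver in place of the local arrays. Whether this is the actual cause of the segfault cannot be confirmed from the excerpt.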

ippiFilterMin_16u_C1R crashed (IPP 8.2)

Could you please point out where I made a mistake calling ippiFilterMin_16u_C1R? Demo code below.

	int mskSizeX = 3;
	int mskSizeY = 3;
	IppiSize roi={(int)imageW - (mskSizeX - 1),(int)imageH - (mskSizeY -1)}, mask={mskSizeX,mskSizeY};
	IppiPoint anchor =  {mskSizeX/2, mskSizeY/2};

	int stepBytes;
	int stepBytesf;
	Ipp16u* d_image1 = ippiMalloc_16u_C1(imageW, imageH, &stepBytes);
	Ipp16u* d_image2 = ippiMalloc_16u_C1(imageW, imageH, &stepBytes);
	Ipp16u* d_image11 = ippiMalloc_16u_C1(imageW, imageH, &stepBytes);
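To make the ROI arithmetic in the snippet above concrete, here is a plain-C reference min filter. It is a sketch and not an IPP call: `min_filter_16u` is a hypothetical helper, and it assumes that each output pixel needs a full mask-sized neighbourhood of valid source pixels, which is why the ROI is shrunk by `mskSize - 1` in each dimension.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Plain-C reference min filter (hypothetical helper, not an IPP function).
   src points at the top-left pixel of the full image; steps are in bytes.
   Output pixel (x, y) is the minimum over the maskW x maskH window whose
   top-left corner is source pixel (x, y). With roiW = imageW - (maskW - 1)
   and roiH = imageH - (maskH - 1), every read stays inside the image. */
void min_filter_16u(const uint16_t *src, int srcStep,
                    uint16_t *dst, int dstStep,
                    int roiW, int roiH, int maskW, int maskH)
{
    assert(roiW > 0 && roiH > 0 && maskW > 0 && maskH > 0);
    for (int y = 0; y < roiH; ++y) {
        uint16_t *dstRow =
            (uint16_t *)((unsigned char *)dst + (size_t)y * dstStep);
        for (int x = 0; x < roiW; ++x) {
            uint16_t m = UINT16_MAX;
            for (int j = 0; j < maskH; ++j) {
                const uint16_t *srcRow = (const uint16_t *)
                    ((const unsigned char *)src + (size_t)(y + j) * srcStep);
                for (int i = 0; i < maskW; ++i) {
                    if (srcRow[x + i] < m) {
                        m = srcRow[x + i];
                    }
                }
            }
            dstRow[x] = m;
        }
    }
}
```

In this sketch the window starts at the source pointer itself. If a filter instead centres the window on the anchor, the source pointer passed in would have to be advanced by `anchor.y` rows and `anchor.x` pixels so that no read falls before the start of the buffer. It is also worth noting that the three allocations in the snippet write their step into the same `stepBytes` variable, which is only safe if all three images share the same pitch.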

Can't understand Intel VTune Amplifier XE 2016 license method

Hello, I'm Saar and I'm trying hard to find the way to activate Intel VTune Amplifier XE 2016 with a legal registered serial number.

After reading those pages:

Linking with cilk on OSX

On Mac OS X I link with


 At runtime it seems to expect


but I had expected it would require


However, there are versioned and non-versioned variants:




Have I misunderstood something?



CentOS 7.2 + MLNX_OFED 3.1-1 + MPSS 3.6.1

I'm trying to get MPSS running on our CentOS 7.2 cluster.

We're using kernel 3.10.0-327.4.4.el7.x86_64.

MLNX_OFED_LINUX-3.1- (OFED-3.1-1.0.3).

On Mellanox Technologies MT27500 Family [ConnectX-3] adapters.

Installation of both MOFED and MPSS runs flawlessly, except when I try to use them together.

So I can install MPSS, set up Ethernet networking, ssh to and from the Xeon Phi, and run code on the Xeon Phi, all without a problem.

I can install Mellanox OFED and use InfiniBand on the host (ibv_*_pingpong) without problems.

Poor speed in MIC

Dear All:

As a learning exercise, I tried to write a program that counts the prime numbers in a given range. The isprime function determines whether a number is prime. I added !$omp declare simd to vectorize that function. I do not know why, but the program runs about three times slower on the Intel Xeon Phi than on the host.


Host: 16 sec

MIC: 43 sec
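For reference, the kind of program described above might look like the following C sketch. The original is Fortran and is not shown in the post, so the function names, the trial-division method and the loop structure here are all assumptions. The sketch illustrates why `declare simd` may not help much: the inner loop has a data-dependent trip count and relies on integer remainder, so a compiler is not guaranteed to produce efficient vector code for it.

```c
#include <assert.h>

/* Trial-division primality test (assumed method; the post does not show
   the original isprime). Returns 1 if n is prime, 0 otherwise. */
#pragma omp declare simd
int is_prime(int n)
{
    if (n < 2) return 0;
    if (n % 2 == 0) return n == 2;
    /* The trip count depends on n, so SIMD lanes do different amounts of work. */
    for (int d = 3; (long long)d * d <= n; d += 2) {
        if (n % d == 0) return 0;
    }
    return 1;
}

/* Count the primes in the closed range [lo, hi]. The pragma only takes
   effect when the compiler's OpenMP support is enabled. */
long count_primes(int lo, int hi)
{
    assert(lo <= hi);
    long count = 0;
    #pragma omp parallel for simd reduction(+:count) schedule(static)
    for (int n = lo; n <= hi; ++n) {
        count += is_prime(n);
    }
    return count;
}
```

Calling `count_primes(1, 100)` counts the 25 primes below 100. Comparing the compiler's vectorization report for the host and coprocessor builds would show whether the loop was actually vectorized in each case.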

MODULEFILE creation the easy way

If you use Environment Modules (from Sourceforge, SGI, Cray, etc.) to set up and control your shell environment variables, we've created a new article on how to quickly and correctly create a modulefile. The technique is fast and produces a correct modulefile for any Intel Developer Products tool.

The article is here:


Rebuild ofed-driver-3.6.1-1.src.rpm MPSS installation issues


I'm installing MPSS 3.6.1 on two Xeon Phi nodes in a cluster connected to InfiniBand. The OS is CentOS 6.6 and the kernel version is 2.6.32-504.8.1.el6.x86_64, so I updated kernel-headers and kernel-devel and rebuilt the MPSS host drivers as the user guide says. So far so good, but the problem comes when I try to rebuild the OFED drivers with rpmbuild --rebuild ofed-driver-3.6.1-1.src.rpm. I get the following error message:
