Hi, I am having trouble using the _mm_load_si128 intrinsic on an Intel 3770K processor.
__m128i packed_image128 = _mm_load_si128((__m128i*)packed_image);
This line causes a segfault on this processor with both icpc 13.0 and g++ 4.6.3. I made sure that packed_image is 16-byte aligned.
My flags are -march=native -O0 -g -msse4.2. The same code works fine when compiled with -O2 instead of -O0.
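For reference, here is a minimal sketch of the kind of runtime alignment check that could sit right before the load. The helper names is_aligned16, checked_load128 and load128_any are hypothetical and not from my actual code. The only assumption is that the pointer refers to at least 16 readable bytes.

```cpp
#include <emmintrin.h>  // SSE2: _mm_load_si128, _mm_loadu_si128
#include <cassert>
#include <cstdint>

// Hypothetical helper: true if p sits on a 16-byte boundary.
static bool is_aligned16(const void* p) {
    return (reinterpret_cast<std::uintptr_t>(p) & 15u) == 0;
}

// Hypothetical helper: aligned load that fails with a readable assertion
// (in builds without NDEBUG) instead of a bare segfault.
static __m128i checked_load128(const void* p) {
    assert(is_aligned16(p) && "pointer is not 16-byte aligned");
    return _mm_load_si128(static_cast<const __m128i*>(p));
}

// Hypothetical helper: unaligned load, valid for any address.
static __m128i load128_any(const void* p) {
    return _mm_loadu_si128(static_cast<const __m128i*>(p));
}
```

Temporarily swapping the load for load128_any(packed_image) should show whether the crash really is an alignment fault, since _mm_loadu_si128 has no alignment requirement.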
Any ideas on how I can debug this further?