Intel® Cilk™ Plus

cilkview and gcc 4.9 cilkplus branch on 64 bit linux

Hello, I've downloaded the latest (I think) version fo cilkview from the cilkplus website. For some reason it is not working (even if I try to run it from the same folder, it doesn't find the program...) I'm thinking it has to do with some 32 vs 64-bit library. I exported the lib64 to no avail. Is there any kind of support for cilkview? If not, will you guys just release the source, so we can patch it? This would be great. Same with cilkprof.

Thanks for your time.

Is it possible to me to improve a simple O(n) algorithm with Cilk Plus?

Hi everybody!

I have a simple algorithm to print the very first unique ASCII character from a stream. In the worst case the algorithm scans through the stream twice, doing some comparisons and increments. So, roughly, my algorithm is O(n).

The algorithm0 do two main things:

vectorizing with an inline function?

I attached two code files mandel1.cpp and mandel2.cpp.

mandel1.cpp has a loop with all the code in the body

mandel2.cpp has equivalent code but instead of having the code in the body it calls an inline function

Compiling with intel c++ compiler 15 with "icc  -O3 -fp-model fast=2 -xCORE-AVX2 -fma -c -S", I can vectorize mandel1.cpp but not mandel2.cpp.

Is there I way I can vectorize mandel2.cpp and still have a separate function? It seems like the optimizer ought to just be able to inline and then apply the vectorization if it can vectorize mandel1.cpp.

How to compile cilk plus runtime source with Intel® C++ Composer XE 2013

Dear all,

I want to compile cilk plus runtime source with Intel® C++ Composer XE 2013. I build the cilk plus runtime according to the directions in the "readme" file (libtoolize; aclocal; automake --add-missing; autoconf; ./configure; make; make install). But in this way, gcc is used by default.

Please, could somebody give me some guidelines in order to compile cilk plus runtime source with Intel® C++ Composer XE 2013? 

Thanks a lot for your help.

Best Regards,

Yaqiong Peng

Thread local calculation of reducers?


I wonder how reducers work internally. So if a value is set into a reducer, does it block other threads each time a value is set?

I ask because normally I'm creating a local 'reducer', e.g. a local histogram on an image tile and on leaving the thread all the data is pushed at once into the global reducer. Just like local memory operations in OpenCL.

Why does the available number of workers changes execution for a 1 cilk_spawn program?

While optimizing a matrix manipulation code in C, I used CilkPlus to spawn a thread to execute in parallel two functions that are data independent and somewhat computationally intensive. Cilk_spawn is used in only one place in the code as follows:

Run-time exit function

I would like to understand run-time execution in Cilk a little better. 
I have downloaded Intel Cilk run-time release (cilkplus-rtl-003365 - released 3-May-2013).

On 09/09/2013 I had asked a question seeking to figure out which is the last function executed before Cilk run-time ends assuming execution went without any problems.

Barry suggested to look at “__cilkrts_c_return_from_initial()” in scheduler.c  and indeed that was what I needed at that time.

Intel® Cilk™ Plus abonnieren