The attachment is the code of the eigen sample.
I find something strange that when the program run on my windows laptop, the cilk version runs several times slower than the serial one, while the cilk version runs twice faster than the serial version on the Linux server.
I compare the environment as following:
OS Compiler Core Time cost
Windows 7 Version 14.0.1 Intel(R) Core(TM) i5 M560 @ 2.67GHz x2 Serial:1.817 sec
Red Hat Enterprise 6.0 version 14.0.1 (Cannot find 13) Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz x4
icc JCBMatrixEigen.cpp timer.cpp -O3
icl JCBMatrixEigen.cpp timer.cpp /O3
I profiled this code in Vtune ,find the 90%'s runtime is spent on the Cilk's reducer's wait and spin time.
However there are no problem when compiled and run on the Linux machine.