Intel® Cilk™ Plus

New Intel Cilk Plus runtime sources and SDK are now available

New Intel Cilk Plus runtime sources and SDK were just released and are available for download now.

  • Intel Cilk Plus runtime sources build 4420 contains minor scheduler improvements and a community-contributed Raspberry Pi* port
  • Intel Cilk Plus SDK build 4421 contains support for the latest Linux*, Windows*, and Mac OS X* operating systems

More information can be found at https://www.cilkplus.org/download.

Intel C++ Compiler 16.0 is now available in Intel Parallel Studio XE 2016

The new Intel C++ Compiler 16.0 is now available in Intel Parallel Studio XE 2016, which launched earlier this week. If your support license is current, you can download and install it at no additional charge from the Intel Registration Center.

You will find several blog posts about the new features and the new free tools.

Cilkscreen/Cilkview needs to be released as a 64-bit binary (Linux)

Hi, many modern 64-bit clusters do not provide 32-bit libraries. The cilkscreen/cilkview tools appear to be compiled as 32-bit applications, so they fail when run on a cluster without 32-bit libraries: /lib/ld-linux.so.2: bad ELF interpreter: No such file or directory

Can you please release both 32-bit and 64-bit binaries for cilkscreen and cilkview? This seems like negligible effort on your part for a huge benefit to your users.

Thanks,

Stephen

gcc cilkplus non-support of reducers (other than int type ?)

problem confirmed

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71473

I don't know the answer to the question about int vs. size_t for the min|max_ind reducers. Note that Intel Cilk™ Plus uses size_t in several contexts where the values must be cast to (int) to get satisfactory performance. The reducers in question don't perform well, so they may be using size_t internally.

Splitting array notation work across threads

Hi,

I have an operation on a large array written in array notation. Since the array is large, what I really want is the work to be split up across many cores, and each core to use SIMD units to perform its work. Is there an easy way to specify that the work should be divided up among however many threads there are in the machine?
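One common shape for this kind of decomposition is to cut the array into one contiguous chunk per thread and run a simple unit-stride loop inside each chunk, which is the part a compiler can map onto SIMD units. The following is a minimal portable sketch in standard C++ rather than Cilk Plus itself: `chunked_add` is a hypothetical helper name, element-wise addition is only an assumed stand-in for the real operation, and `std::thread` replaces the Cilk scheduler. In Cilk Plus the same idea would presumably be a `cilk_for` over chunk start indices with an array section such as `out[begin:len] = a[begin:len] + b[begin:len];` as the body.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical helper: computes out[i] = a[i] + b[i], giving each thread
// one contiguous chunk of the index range.
void chunked_add(const std::vector<float>& a, const std::vector<float>& b,
                 std::vector<float>& out, unsigned num_threads) {
    assert(a.size() == b.size() && a.size() == out.size());
    const std::size_t n = a.size();
    if (num_threads == 0) num_threads = 1;  // hardware_concurrency() may report 0

    // Ceiling division so that every element belongs to exactly one chunk.
    const std::size_t chunk = (n + num_threads - 1) / num_threads;

    std::vector<std::thread> workers;
    for (std::size_t begin = 0; begin < n; begin += chunk) {
        const std::size_t end = std::min(begin + chunk, n);
        workers.emplace_back([&a, &b, &out, begin, end] {
            const float* pa = a.data();
            const float* pb = b.data();
            float* po = out.data();
            // Unit-stride inner loop with no cross-iteration dependence:
            // a candidate for auto-vectorization.
            for (std::size_t i = begin; i < end; ++i) {
                po[i] = pa[i] + pb[i];
            }
        });
    }
    for (std::thread& w : workers) w.join();
}
```

Calling `chunked_add(a, b, out, std::thread::hardware_concurrency())` uses however many hardware threads the machine reports. Whether the inner loop is actually vectorized depends on the compiler and its flags.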

Vector of reducers that are not cache aligned

I am using Cilk and a custom reducer as described here: https://software.intel.com/en-us/node/522608. In the example, they use the reducer for an append operation on a linked list.

Now, I want to create a vector of reducers (using std::vector); however, I get the following runtime error: 

Reducer should be cache aligned. Please see comments following this assertion for explanation and fixes.
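A likely cause, stated here as an assumption rather than a confirmed diagnosis, is that the default `std::vector` allocator returns storage that is not aligned to a cache line, so the elements constructed in it are not either. The sketch below shows one way to give a vector cache-line-aligned storage with a small custom allocator. `AlignedAllocator`, `Slot`, and `is_cache_aligned` are hypothetical names, `Slot` is only a stand-in for a reducer type, the 64-byte line size is assumed, and `posix_memalign` limits the sketch to POSIX systems. It has not been tried with an actual Cilk reducer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <new>
#include <vector>

// Hypothetical allocator that hands out storage aligned to `Alignment` bytes.
template <typename T, std::size_t Alignment = 64>
struct AlignedAllocator {
    using value_type = T;

    // Explicit rebind is needed because of the non-type template parameter.
    template <typename U>
    struct rebind { using other = AlignedAllocator<U, Alignment>; };

    AlignedAllocator() = default;
    template <typename U>
    AlignedAllocator(const AlignedAllocator<U, Alignment>&) {}

    T* allocate(std::size_t n) {
        void* p = nullptr;
        if (posix_memalign(&p, Alignment, n * sizeof(T)) != 0) {
            throw std::bad_alloc();
        }
        return static_cast<T*>(p);
    }
    void deallocate(T* p, std::size_t) { std::free(p); }
};

template <typename T, typename U, std::size_t A>
bool operator==(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) {
    return true;
}
template <typename T, typename U, std::size_t A>
bool operator!=(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) {
    return false;
}

// Stand-in for a reducer: alignas(64) pads sizeof(Slot) to a multiple of 64,
// so every element is aligned once the buffer itself is.
struct alignas(64) Slot {
    long value = 0;
};

// True if the address is a multiple of the assumed 64-byte cache line.
bool is_cache_aligned(const void* p) {
    return reinterpret_cast<std::uintptr_t>(p) % 64 == 0;
}
```

A vector declared as `std::vector<Slot, AlignedAllocator<Slot>> slots(8);` then holds eight elements that each start on a 64-byte boundary. If the element type cannot be copied or moved, the vector has to be sized at construction and not resized afterwards.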

Hybrid Parallelism: A MiniFE* Case Study

This case study examines the situation where the problem decomposition is the same for threading as it is for Message Passing Interface* (MPI); that is, the threading parallelism is elevated to the same level as MPI parallelism.
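  • Professional
  • Professors
  • Students
  • Linux*
  • Modern Code
  • Server
  • C/C++
  • Intermediate
  • Intel® C++ Compiler
  • Intel® Cilk™ Plus
  • Intel® MPI Library
  • MiniFE*
  • Message Passing Interface (MPI)
  • OpenMP*
  • Academic
  • Cluster Computing
  • Intel® Many Integrated Core Architecture
  • Optimization
  • Parallel Computing
  • Threading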

Putting Your Data and Code in Order: Data and Layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1 to consider parallelism, both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
  • Students
  • Modern Code
  • Server
  • Windows*
  • C/C++
  • Fortran
  • Intermediate
  • Intel® Advisor
  • Intel® Cilk™ Plus
  • Intel® Threading Building Blocks
  • Intel® Advanced Vector Extensions
  • OpenMP*
  • Intel® Many Integrated Core Architecture
  • Optimization
  • Parallel Computing
  • Threading
  • Vectorization