Article

Intel® System Studio - Multicore Programming with Intel® Cilk™ Plus

Intel System Studio not only provides a variety of signal processing primitives via Intel® Integrated Performance Primitives (Intel® IPP), and Intel® Math Kernel Library (Intel® MKL), but also allows developing high-performance low-latency custom code (Intel C++ Compiler with Intel Cilk Plus). Since Intel Cilk Plus is built into the compiler, it can be used where it demands an efficient threading...
Authored by Hans P. (Intel) Last updated on 12/11/2017 - 10:48
Article

Cilk Plus Solver for a Chess Puzzle or: How I Learned to Love Fast Rejection

Intel® Cilk™ Plus enabled parallelizing a chess puzzle solver with a few changes.
Authored by Last updated on 06/07/2017 - 09:12
Article

Parallel sorts for Cilk Plus

This article describes the parallel sorts in the latest release of “Cilkpub”, an open-source library of utilities for Intel®

Authored by Last updated on 06/07/2017 - 10:29
Article

Parallel Programming with C#

By Bruno Sonnino

Authored by admin Last updated on 05/25/2018 - 15:30
Article

Programação Paralela com C#

Por Bruno Sonnino

Authored by admin Last updated on 05/25/2018 - 15:30
Article

Putting Your Data and Code in Order: Optimization and Memory – Part 1

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Authored by David M. Last updated on 12/12/2018 - 18:00
Article

A Runtime Generated FFT for Intel® Processor Graphics

Download the code

Authored by Dan Petre (Intel) Last updated on 07/06/2019 - 20:30
Article

Improve Vectorization Performance with Intel® AVX-512

See how the new Intel® Advanced Vector Extensions 512CD and the Intel AVX512F subsets (available in the Intel® Xeon Phi processor and in future Intel Xeon processors) lets the compiler automatically generate vector code with no changes to the code.
Authored by Alberto V. (Intel) Last updated on 07/08/2019 - 19:26
Article

Performance of Classic Matrix Multiplication Algorithm on Intel® Xeon Phi™ Processor System

Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple, it could be easily implemented in any programming language. This paper shows that performance significantly improves when different optimization techniques are applied.
Authored by Last updated on 10/15/2019 - 15:30
Article

Thread Parallelism in Cython*

Cython* is a superset of Python* that additionally supports C functions and C types on variable and class attributes. Cython generates C extension modules, which can be used by the main Python program using the import statement.
Authored by Nguyen, Loc Q (Intel) Last updated on 10/15/2019 - 16:40