Mensajes en el blog

Vectorization Series, Part 2- Who Can Use It?

In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi

Autor Shannon Cepeda (Blackbelt) Última actualización 14/06/2017 - 16:16
Article

Using Intel® AVX without Writing AVX

Intel® AVX is a new 256-bit instruction set extension to Intel® Streaming SIMD Extensions and is designed for applications that are floating point intensive. This paper discusses options to integrate Intel® AVX into an application via use of intrinsics.
Autor richard-hubbard (Intel) Última actualización 05/03/2019 - 22:08
Article

Intel® Cilk™ Plus – AOBench Sample

This is the AOBench example associated with the "Intel® Cilk™ Plus – The Simplest Path to Parallelism" how-to article.  It shows an Ambient Occlusion algorithm implemented as serial loops, one us
Autor Última actualización 25/05/2018 - 15:30
Mensajes en el blog

Parallel Universe Magazine #12: Advanced Vectorization

This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:

Autor Última actualización 03/07/2019 - 20:08
Article
Mensajes en el blog

Optimizing Big Data processing with Haswell 256-bit Integer SIMD instructions

Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most Intel AVX 128-bits integer SIMD instruction sets to 256-bits.

Autor gaston-hillar (Blackbelt) Última actualización 06/07/2019 - 17:00
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Autor David M. Última actualización 06/07/2019 - 16:40
Article

Improve Performance with Vectorization

This article focuses on the steps to improve software performance with vectorization. Included are examples of full applications along with some simpler cases to illustrate the steps to vectorization.
Autor David M. Última actualización 06/07/2019 - 16:40
Article

整理您的数据和代码: 数据和布局 - 第 2 部分

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Autor David M. Última actualización 06/07/2019 - 16:40
Article

Intel® System Studio Release Notes, System Requirements, and What's New

This page provides system requirements and release notes for Intel® System Studio.

Autor Jeffrey R. (Intel) Última actualización 21/08/2019 - 15:31