Mensajes en el blog

Vectorization Series, Part 3 - What are the Benefits?

This will be the final post in my planned short vectorization series. Although I reserve the right to post more on vectorization in the future!

Autor Shannon Cepeda (Blackbelt) Última actualización 14/06/2017 - 15:56
Mensajes en el blog

Vectorization Series, Part 2- Who Can Use It?

In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi

Autor Shannon Cepeda (Blackbelt) Última actualización 14/06/2017 - 16:16
Mensajes en el blog

Some Performance Advantages of Using a Task-Based Parallelism Model

As part of my focus on software performance, I also support and consult on implementing scalable parallelism in applications.

Autor Shannon Cepeda (Blackbelt) Última actualización 04/02/2019 - 10:40
Mensajes en el blog

Let's rename "for" to "serial_for"...

Proposal: rename for in C and C++ to serial_for No more incumbent "for." (it was voted off the island)

Autor James R. (Blackbelt) Última actualización 14/06/2017 - 16:06
Mensajes en el blog

Parallelism as a First Class Citizen in C and C++, the time has come

It is time to make Parallelism a full First Class Citizen in C and C++.  Hardware is once again ahead of software, and we need to close the gap so that application development is better able to uti

Autor James R. (Blackbelt) Última actualización 14/06/2017 - 16:04
Mensajes en el blog

Graduate Intern at Intel - Parallel Ray-Tracing

Ray-tracing is a classic example of an embarrassingly parallel algorithm; since each pixel is typically independent of the rest, theoretically every pixel can be done in parallel (given enough core

Autor Última actualización 14/06/2017 - 15:37
Mensajes en el blog

Graduate Intern at Intel - Parallel N-Body

The N-Body problem is a classic example used frequently to demonstrate parallelization and how it improves performance.

Autor Última actualización 14/06/2017 - 15:46
Mensajes en el blog

Parallel Universe Magazine #12: Advanced Vectorization

This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:

Autor Última actualización 03/07/2019 - 20:08
Mensajes en el blog

Slides da palestra sobre Computação Paralela no FISL14

A palestra "Como domar uma fera de 1 TFlop que cabe na palma da sua mão" foi apresentada em 3/7/13, no FISL14, por Luciano Palma - Community Manager da Intel para Servidores e Computação de Alto De

Autor Luciano Palma (Intel) Última actualización 03/10/2019 - 09:03
Mensajes en el blog

Optimizing Big Data processing with Haswell 256-bit Integer SIMD instructions

Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most Intel AVX 128-bits integer SIMD instruction sets to 256-bits.

Autor gaston-hillar (Blackbelt) Última actualización 15/10/2019 - 17:38