Article

Using Tasks Instead of Threads

Tasks are a lightweight alternative to threads that provide faster startup and shutdown times, better load balancing, more efficient use of available resources, and a higher level of abstraction.
Created by administrar Last updated on 05/07/2019 - 09:41
Article

Exploiting Data Parallelism in Ordered Data Streams

This article identifies some of the challenges of exploiting data parallelism in ordered data streams and illustrates strategies for addressing them while maintaining parallel performance.
Created by administrar Last updated on 05/07/2019 - 14:50
Blog post

Debug Intel® Transactional Synchronization Extensions

If printf or fprintf functions cause transaction aborts, use Intel® Processor Trace as a work-around.
Created by Roman Dementiev (Intel) Last updated on 04/07/2019 - 17:00
Article

The Ultimate Question of Programming, Refactoring, and Everything

Yes, you've guessed correctly - the answer is "42".

Created by Andrey Karpov (Blackbelt) Last updated on 20/06/2019 - 22:52
Article

The Evil within the Comparison Functions

Perhaps readers remember my article titled "Last line effect". It describes a pattern I once noticed: in most cases, programmers make an error in the last line of similar text blocks.

Created by Andrey Karpov (Blackbelt) Last updated on 12/12/2018 - 18:00
Article

Offload Computations from Servers with an Intel® Xeon Phi™ Processor

Learn how to use Offload over Fabric software for a server migration path.
Created by Jan Z. (Intel) Last updated on 15/10/2019 - 15:30
Article

Explicit Vector Programming in Fortran

No longer does Moore’s Law result in higher frequencies and improved scalar application performance; instead, higher transistor counts lead to increased parallelism, both through more cores and thr

Created by Martyn Corden (Intel) Last updated on 15/10/2019 - 15:30
Article

Improve Performance with Vectorization

This article focuses on the steps to improve software performance with vectorization. Included are examples of full applications along with some simpler cases to illustrate the steps to vectorization.
Created by David M. Last updated on 15/10/2019 - 15:30
Article

Measuring performance in HPC

This is the first in a series of articles about High Performance Computing with the Intel® Xeon Phi™ coprocessor.

Last updated on 15/10/2019 - 16:40
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1 to consider parallelism, covering both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
Created by David M. Last updated on 15/10/2019 - 16:40