Filters

Article

Threading Fortran Applications for Parallel Performance on Multi-Core Systems

Advice and background information is given on typical issues that may arise when threading an application using the Intel Fortran Compiler and other software tools, whether using OpenMP, automatic parallelization or threaded libraries.
Authored by Martyn Corden (Intel) Last updated on 12/12/2018 - 18:00
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 12/13/2018 - 13:52
Blog post

Wellington and Austin: programming lots of cores

A couple of back-to-back opportunities to see great talks about harness lots of cores, and to give talks about programming options and why we do not need to give up on programmability in our quest

Authored by James R. (Blackbelt) Last updated on 12/12/2018 - 18:08
Article

Superscalar Programming 101 (Matrix Multiply) Part 1 of 5

Part one of a five-part series, this article teaches a methodology to interpret statistics gathered during test runs and use those interpretations to improve parallel code.
Authored by jimdempseyatthecove (Blackbelt) Last updated on 02/08/2019 - 09:00
Article

Improving the Compute Performance of Video Processing Software Using AVX (Advanced Vector Extensions) Instructions

This paper describes a case study in which AVX instructions are used to enhance the performance of a de-saturation algorithm (a common video filter). The case study takes the algorithm from a non-SIMD state to AVX based SIMD.
Authored by Last updated on 12/12/2018 - 18:08
Article

Vectorization Toolkit

A toolkit that gives 6 Steps to Increase Performance Through Vectorization in Your Application
Authored by AmandaS (Intel) Last updated on 03/27/2019 - 13:34
Article

PAOS - Packed Array Of Structures

by Jim Dempsey

Authored by jimdempseyatthecove (Blackbelt) Last updated on 12/28/2018 - 11:03
Article

Programming and Compiling for Intel® Many Integrated Core Architecture

This article discussions parallelization and provides links that will help you understand your programming environment and evaluate the suitability of your app.
Authored by AmandaS (Intel) Last updated on 03/26/2019 - 10:07
Article

Cache Blocking Techniques

Cache Blocking Techniques Overview

Authored by AmandaS (Intel) Last updated on 03/26/2019 - 15:07
Article

Memory Layout Transformations

Memory Layout Transformations Overview
Authored by AmandaS (Intel) Last updated on 03/26/2019 - 14:54