File Wrapper

Parallel Universe Magazine - Issue 16, November 2013

Authored by admin Last updated on 12/12/2018 - 18:08
File Wrapper

Parallel Universe Magazine - Issue 18, June 2014

Authored by admin Last updated on 05/16/2019 - 11:39
Article

Introduction to the Intel® Xeon Phi™ Coprocessor

This tutorial introduces the basic hardware and software architecture of the Intel Xeon Phi coprocessor, describes their general features, and provides a first view of the various programming mode

Authored by admin Last updated on 06/14/2019 - 13:00
Article

Choosing the right threading framework

This is the second article in a series about high-performance computing with the Intel Xeon Phi.

Authored by Last updated on 07/06/2019 - 16:30
Article

Program Optimization through Loop Vectorization

Download Program Optimization through Loop Vectorization [PDF 617KB]

Authored by Last updated on 07/06/2019 - 16:30
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on the concepts discussed in Part 1 to consider parallelism, covering both vectorization (single instruction, multiple data, or SIMD) and shared memory parallelism (threading), as well as distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Hybrid Parallelism: A MiniFE* Case Study

This case study examines the situation where the problem decomposition is the same for threading as it is for Message Passing Interface* (MPI); that is, the threading parallelism is elevated to the same level as MPI parallelism.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Intel Guide for Developing Multithreaded Applications

Download this guide to developing multithreaded applications, which also covers general topics such as application threading and synchronization.
Authored by admin Last updated on 07/06/2019 - 16:40
Article

Improve Performance with Vectorization

This article focuses on the steps to improve software performance with vectorization. Included are examples of full applications along with some simpler cases to illustrate the steps to vectorization.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Superscalar Programming 101 (Matrix Multiply) Part 1 of 5

Part one of a five-part series, this article teaches a methodology to interpret statistics gathered during test runs and use those interpretations to improve parallel code.
Authored by jimdempseyatthecove (Blackbelt) Last updated on 07/04/2019 - 22:00