Article

Introduction to the Heterogeneous Streams Library

To efficiently utilize all available resources for the task concurrency application on heterogeneous platforms, designers need to understand the memory architecture, the thread utilization on each platform, the pipeline to offload the workload to different platforms. To relieve designers of the burden of implementing the necessary infrastructures, the Heterogeneous Streaming (hStreams) library...
Autor Nguyen, Loc Q (Intel) Última actualización 06/07/2019 - 16:40
Article

Improve Vectorization Performance with Intel® AVX-512

See how the new Intel® Advanced Vector Extensions 512CD and the Intel AVX512F subsets (available in the Intel® Xeon Phi processor and in future Intel Xeon processors) lets the compiler automatically generate vector code with no changes to the code.
Autor Alberto V. (Intel) Última actualización 08/07/2019 - 19:26
Article

Thread Parallelism in Cython*

Cython* is a superset of Python* that additionally supports C functions and C types on variable and class attributes. Cython generates C extension modules, which can be used by the main Python program using the import statement.
Autor Nguyen, Loc Q (Intel) Última actualización 06/07/2019 - 16:30
Article

Code Sample: Exploring MPI for Python* on Intel® Xeon Phi™ Processor

Learn how to write an MPI program in Python*, and take advantage of Intel® multicore architectures using OpenMP threads and Intel® AVX512 instructions.
Autor Nguyen, Loc Q (Intel) Última actualización 06/07/2019 - 16:30
Article

Intel® Xeon Phi™ Processor 7200 Family Memory Management Optimizations

This paper examines software performance optimization for an implementation of a non-library version of DGEMM executing on the Intel® Xeon Phi™ processor (code-named Knights Landing, with acronym K

Autor Última actualización 06/07/2019 - 16:30
Article

How to use the MPI-3 Shared Memory in Intel® Xeon Phi™ Processors

Code Sample included: Learn how to use MPI-3 shared memory feature using the corresponding APIs on the Intel® Xeon Phi™ processor.
Autor Nguyen, Loc Q (Intel) Última actualización 06/07/2019 - 16:30
Article

Performance of Classic Matrix Multiplication Algorithm on Intel® Xeon Phi™ Processor System

Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple, it could be easily implemented in any programming language. This paper shows that performance significantly improves when different optimization techniques are applied.
Autor Última actualización 14/06/2019 - 11:50
Article

Using Intel® MPI Library on Intel® Xeon Phi™ Product Family

This document is designed to help users get started writing code and running MPI applications using the Intel® MPI Library on a development platform that includes the Intel® Xeon Phi™ processor.
Autor Nguyen, Loc Q (Intel) Última actualización 21/03/2019 - 12:00
Article

Recipe: Monte Carlo European Option Pricing for Intel® Xeon Phi® Processor

This article covers the Monte Carlo Methods using a simple quasi random number generator.
Autor admin Última actualización 02/09/2019 - 15:25
Article

Recipe: Pricing Options Using Barone-Adesi Whaley Approximation

In this paper, we look at one of the successful efforts, pioneered by Barone-Adesi and Whaley, and apply the high performance parallel computing entailed in the modern microprocessors to create a program that can exceed our expectation for high performance with a suitable numerical result.
Autor admin Última actualización 02/09/2019 - 15:29