Article

Intel® Performance Counter Monitor - A Better Way to Measure CPU Utilization

The Intel® Performance Counter Monitor provides sample C++ routines and utilities to estimate the internal resource utilization of the latest Intel® Xeon® and Core™ processors and gain a significant performance boost.
Authored by Thomas Willhalm (Intel) Last updated on 10/01/2019 - 15:30
Article

A Matrix Multiplication Routine that Updates Only the Upper or Lower Triangular Part of the Result Matrix

Background

Intel® MKL provides the general-purpose BLAS* matrix multiply routines ?GEMM, defined as follows:
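The teaser is cut off before the definition itself. For orientation, the standard BLAS Level 3 definition of ?GEMM is sketched below; it is taken from the general BLAS specification rather than from the article text, so the symbol names are the conventional ones and not necessarily those the article uses.

```latex
% Standard BLAS ?GEMM operation (conventional notation, assumed here):
%   op(X) is one of X, X^T, or X^H
%   op(A) is m-by-k, op(B) is k-by-n, C is m-by-n
%   alpha and beta are scalars
\[
  C \leftarrow \alpha \, \mathrm{op}(A) \, \mathrm{op}(B) + \beta \, C
\]
```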

Authored by Zhang, Zhang (Intel) Last updated on 10/08/2019 - 18:20
Article

Intel® MKL Sparse BLAS Overview

Sparse BLAS routines are useful for implementing iterative methods that solve large sparse systems of equations or eigenvalue problems.
Last updated on 10/08/2019 - 18:20
Article

Run-to-Run Reproducibility of Floating-Point Calculations for Applications on Intel® Xeon Phi™ Coprocessors (and Intel® Xeon® Processors)

The Issue

If I rerun the identical program on the identical input data on an identical processor, will I get an identical result?

Authored by Martyn Corden (Intel) Last updated on 10/15/2019 - 15:30
Article

How to detect Knights Landing AVX-512 support (Intel® Xeon Phi™ processor)

The Intel® Xeon Phi™ processor, code named Knights Landing, is part of the second generation of Intel Xeon Phi products. Knights Landing supports Intel® AVX-512 instructions, specifically AVX-512F (foundation), AVX-512CD (conflict detection), AVX-512ER (exponential and reciprocal) and AVX-512PF (prefetch).
Authored by James R. (Blackbelt) Last updated on 10/15/2019 - 15:30
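The four feature flags named in the teaser above are reported by CPUID leaf 7 (sub-leaf 0) in the EBX register, and the operating system must also have enabled the AVX-512 register state in XCR0. The following is a minimal sketch of the decoding step only, not the article's own code: it assumes the raw `ebx` and `xcr0` values have already been read (for example by a small C or assembly helper that executes CPUID and XGETBV), and the function names `decode_avx512` and `has_knl_avx512_set` are hypothetical.

```python
# Bit positions in CPUID.(EAX=7, ECX=0):EBX for the AVX-512 subsets
# that the teaser lists for Knights Landing.
AVX512_EBX_BITS = {
    "AVX-512F": 16,   # foundation
    "AVX-512PF": 26,  # prefetch
    "AVX-512ER": 27,  # exponential and reciprocal
    "AVX-512CD": 28,  # conflict detection
}

# XCR0 state bits the OS must enable before AVX-512 is usable:
# SSE (1), AVX (2), opmask (5), ZMM_Hi256 (6), Hi16_ZMM (7).
XCR0_AVX512_MASK = 0b11100110


def decode_avx512(ebx, xcr0):
    """Map each AVX-512 subset name to True/False.

    Assumption: the caller has already confirmed OSXSAVE
    (CPUID.1:ECX bit 27) and obtained xcr0 via XGETBV; this
    function only interprets the raw register values.
    """
    os_ok = (xcr0 & XCR0_AVX512_MASK) == XCR0_AVX512_MASK
    return {
        name: os_ok and bool((ebx >> bit) & 1)
        for name, bit in AVX512_EBX_BITS.items()
    }


def has_knl_avx512_set(ebx, xcr0):
    """True only if all four subsets listed above are usable."""
    return all(decode_avx512(ebx, xcr0).values())
```

Keeping the decoding separate from the register read makes the logic easy to check with hand-built register values, since the CPUID and XGETBV instructions themselves cannot be issued from pure Python.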
Article

Introduction to the Intel® MKL Extended Eigensolver

Authored by Zhang, Zhang (Intel) Last updated on 10/15/2019 - 16:50
Article

Intel® AVX-512 Instructions

The latest Intel® Architecture Instruction Set Extensions Programming Reference includes the definition of Intel® Advanced Vector Extensions 512 (Intel® AV

Authored by James R. (Blackbelt) Last updated on 10/15/2019 - 20:39
Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction

A Brief History of Quicksort
Authored by Robert Ioffe (Intel) Last updated on 11/19/2019 - 13:39