Documentação

Intel® Math Kernel Library Cookbook de Intel® Math Kernel Library Cookbook

Última atualização em 28/05/2018 - 23:50
Article

Optimizing Memory Bandwidth in Knights Landing on Stream Triad

This document demonstrates the best methods to obtain peak memory bandwidth performance on Intel® Xeon Phi™ Processor (codenamed Knights Landing). This is done using STREAM* benchmarks, the de facto industry-standard benchmark for the measurement of computer memory bandwidth.
Criado por Karthik Raman (Intel) Última atualização em 29/07/2019 - 07:59
Article

Recipe: Building and Running YASK (Yet Another Stencil Kernel) on Intel® Processors

Yet Another Stencil Kernel (YASK), is a framework to facilitate design exploration and tuning of HPC kernels including vector folding, cache blocking, memory layout, loop construction, temporal wave-front blocking, and others.YASK contains a specialized source-to-source translator to convert scalar C++ stencil code to SIMD-optimized code.
Criado por Chuck Yount (Intel) Última atualização em 21/03/2019 - 12:00
Documentação

Frequent DRAM Accesses de Intel® VTune™ Amplifier Performance Analysis Cookbook

This recipe explores profiling a memory-bound matrix application using the Microarchitecture Exploration and Memory Access analyses of the Intel® VTune™ Amplifier to understand the cause of the frequent DRAM accesses.

Última atualização em 09/08/2019 - 12:50
Documentação

Poor Port Utilization de Intel® VTune™ Amplifier Performance Analysis Cookbook

This recipe explores profiling a core-bound matrix application using the Microarchitecture Exploration analysis (formerly, General Exploration) of the Intel® VTune™ Amplifier to understand the cause of the poor port utilization and Intel® Advisor to benefit from compiler vectorization.

Última atualização em 09/08/2019 - 12:50
Documentação

False Sharing de Intel® VTune™ Amplifier Performance Analysis Cookbook

This recipe explores profiling a memory-bound linear_regression application using the General Exploration and Memory Access analyses of the Intel® VTune™ Amplifier.

Última atualização em 09/08/2019 - 12:50
Documentação

Inefficient Synchronization de Intel® VTune™ Amplifier Performance Analysis Cookbook

This recipe shows how to locate inefficient synchronization in your code by running the Advanced Hotspots analysis of the Intel® VTune™ Amplifier with the stack collection enabled.

Última atualização em 09/08/2019 - 12:50
Documentação

Legal Information de Intel® VTune™ Amplifier Performance Analysis Cookbook

Última atualização em 09/08/2019 - 12:50