Forum topic

OpenMP KMP_AFFINITY with proclist

Hi All,

I am trying to understand how "proclist" works with KMP_AFFINITY. I run a benchmark with following environment variables:

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Forum topic

Intel Caffe OpenMP Threads on Xeon Phi

Hi All,

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Forum topic

OpenMP Threads - BVLC AlexNet vs Intel AlexNet Timing

Hi All,

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:10
File Wrapper

Parallel Universe Magazine - Issue 27, January 2017

Authored by admin Last updated on 03/21/2019 - 12:00
Article

Maximize Performance of Intel® Optimization of PyTorch*/Caffe2* Framework on CPU

This article describes what you need to consider in order to get a satisfying performance with PyTorch, with examples.
Authored by Jing X. (Intel) Last updated on 08/15/2019 - 12:50
Article

Tips to Improve Performance for Popular Deep Learning Frameworks on CPUs

This document provides optimization tips for TensorFlow*, Keras, and Caffe* on Intel® Xeon® processors.
Authored by Anju P. (Intel) Last updated on 09/24/2018 - 16:06
Article

Maximize TensorFlow* Performance on CPU: Considerations and Recommendations for Inference Workloads

This article will describe performance considerations for CPU inference using Intel® Optimization for TensorFlow*
Authored by Nathan Greeneltch (Intel) Last updated on 07/31/2019 - 12:11
Article

Caffe* Optimized for Intel® Architecture: Applying Modern Code Techniques

This paper demonstrates a special version of Caffe* — a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC) — that is optimized for Intel® architecture.
Authored by Last updated on 07/06/2019 - 16:40