Forum topic

OpenMP KMP_AFFINITY with proclist

Hi All,

I am trying to understand how "proclist" works with KMP_AFFINITY. I run a benchmark with following environment variables:

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Forum topic

Intel Caffe OpenMP Threads on Xeon Phi

Hi All,

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Forum topic

OpenMP Threads - BVLC AlexNet vs Intel AlexNet Timing

Hi All,

Authored by Chetan Arvind Patil Last updated on 05/17/2018 - 16:08
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:10
Article

Tips to Improve Performance for Popular Deep Learning Frameworks on CPUs

This document provides optimization tips for TensorFlow*, Keras, and Caffe* on Intel® Xeon® processors.
Authored by Anju P. (Intel) Last updated on 09/24/2018 - 16:06
Article

Maximize TensorFlow* Performance on CPU: Considerations and Recommendations for Inference Workloads

This article will describe performance considerations for CPU inference using Intel® Optimization for TensorFlow*
Authored by Nathan Greeneltch (Intel) Last updated on 04/01/2019 - 13:01
Article

Caffe* Optimized for Intel® Architecture: Applying Modern Code Techniques

This paper demonstrates a special version of Caffe* — a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC) — that is optimized for Intel® architecture.
Authored by Last updated on 07/06/2019 - 16:40
Article

Code Sample: Optimizing Binarized Neural Networks on Intel® Xeon® Scalable Processors

In the previous article, we discussed the performance and accuracy of Binarized Neural Networks (BNN). We also introduced a BNN coded from scratch in the Wolfram Language. The key component of this neural network is Matrix Multiplication.
Authored by Yash Akhauri Last updated on 03/21/2019 - 12:40
Article

Building and Probing Prolog* with Intel® Architecture

This article explores what happens when Intel solutions support functional and logic programming languages that are regularly used for Artificial Intelligence (AI) and proposes a Prolog interpreter recompilation using Intel® C++ Compiler and libraries in order to evaluate their contribution to logic based AI.
Authored by Flavio Luis de Mello Last updated on 01/24/2018 - 15:35