Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 07/31/2019 - 14:30
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:10
Article

Fine-Tuning Optimization for a Numerical Method for Hyperbolic Equations Applied to a Porous Media Flow Problem with Intel® Tools

This paper presents an analysis for potential optimization for a Godunov-type semi-discrete central scheme, for a particular hyperbolic problem implicated in porous media flow, using OpenMP* and Intel® Advanced Vector Extensions 2.
Authored by Last updated on 07/03/2019 - 20:00
Article
Article

Hybrid Parallelism: A MiniFE* Case Study

This case study examines the situation where the problem decomposition is the same for threading as it is for Message Passing Interface* (MPI); that is, the threading parallelism is elevated to the same level as MPI parallelism.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

3D Isotropic Acoustic Finite-Difference Wave Equation Code: A Many-Core Processor Implementation and Analysis

Finite difference is a simple and efficient mathematical tool that helps solve differential equations.

Authored by Sunny G. (Intel) Last updated on 07/06/2019 - 16:30
Article

Implementation of Classic Gram-Schmidt in a Reservoir Simulator

Introduction
Authored by Last updated on 07/06/2019 - 16:59
Article

在英特尔® 数学核心函数库中引入 DNN 基元

    深度神经网络 (DNN) 处于机器学习领域的前沿。这些算法在 20 世纪 90 年代后期得到了行业的广泛采用,最初应用于诸如银行支票手写识别等任务。深度神经网络在这一任务领域已得到广泛运用,达到甚至超过了人类能力。如今,DNN 已用于图像识别、视频和自然语言处理以及解决复杂的视觉理解问题,如自主驾驶等。DNN 在计算资源及其必须处理的数据量方面要求非常苛刻。

Authored by Vadim Pirogov (Intel) Last updated on 03/21/2019 - 12:08
Article

Performance of Classic Matrix Multiplication Algorithm on Intel® Xeon Phi™ Processor System

Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple, it could be easily implemented in any programming language. This paper shows that performance significantly improves when different optimization techniques are applied.
Authored by Last updated on 06/14/2019 - 11:50
Article

3D 同性声波有限差分波动方程代码:多核处理器实施与分析

有限差分是一个简单有效的数学工具,用于求解微分方程。在本文中,我们将利用明确的时间域有限差分求解同性声波 3D 波动方程。

Authored by Sunny G. (Intel) Last updated on 07/06/2019 - 16:30