Article

Preparing for Parallel Optimization

Optimizing your application for multi-core technology can result in big performance improvements, but it requires a plan of action that is well suited to your application. This article gives an overview of key steps to follow as you optimize your code.
作者: Diana B. (Intel) 最后更新时间: 2019/07/04 - 22:00
博客

Advanced Computer Concepts for the (Not So) Common Chef: The Home Kitchen

Since that brief aside on terminology is out of the way, let us continue with the kitchen analogy.

作者: 最后更新时间: 2019/07/06 - 17:10
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
作者: David M. 最后更新时间: 2019/07/06 - 16:40
Article

Classical Molecular Dynamics Simulations with LAMMPS Optimized for Knights Landing

LAMMPS is an open-source software package that simulates classical molecular dynamics. As it supports many energy models and simulation options, its versatility has made it a popular choice. It was first developed at Sandia National Laboratories to use large-scale parallel computation.
作者: WILLIAM B. (Intel) 最后更新时间: 2019/03/21 - 12:00
Article

Data Persistence in a Nutshell

Applications often use files to store data from one run to the next, but high-capacity, non-volatile memory devices make it possible to store data more effectively than using a disk-based file system. This article describes how to design your application to take advantage of these memory devices, thereby avoiding the need for files to serve as persistent memory.
作者: 最后更新时间: 2018/12/12 - 18:00
Article

Performance Improvement Opportunities with NUMA Hardware

Intel’s non-uniform memory access (NUMA) strategy is based on several new memory technologies that promise significant improvements in both capability and performance. This article provides information on Multi-Channel DRAM (MCDRAM) and High-Bandwidth Memory (HBM), Non-volatile dual inline-memory modules (NVDIMMs), and Intel® Omni-Path Fabric (Intel® OP Fabric).
作者: 最后更新时间: 2019/03/21 - 12:00
Article

NUMA Hardware Target Audience

Do you have a problem that Intel non-uniform memory access (NUMA) hardware and the related tools and strategies can solve? The answer depends on the problem you are facing and if you can make decisions about choosing/changing your hardware, your software, or both. This article walks you through the decision.
作者: 最后更新时间: 2019/03/21 - 12:00
Article

Hardware and Software Approach for Using NUMA Systems

Learn how to build an application that runs effectively on non-uniform memory access (NUMA) hardware. This article walks you through choosing the algorithm all the way through to measuring your application's performance.
作者: 最后更新时间: 2019/03/21 - 12:00
Article

Modern Memory Subsystems Benefits for Data Base Codes, Linear Algebra Codes, Big Data, and Enterprise Storage

This article describes and contrasts advantages different types of memory, including Multi-Channel DRAM (MCDRAM) and High-Bandwidth Memory (HBM), the future 3D XPoint™ memory devices, and Intel® Omni-Path Fabric (Intel® OP Fabric).
作者: 最后更新时间: 2019/03/21 - 12:00
博客

Debug Intel® Transactional Synchronization Extensions

If printf or fprintf functions cause transaction aborts, use Intel® Processor Trace as a work-around.
作者: Roman Dementiev (Intel) 最后更新时间: 2019/07/04 - 17:00