Webinars

Reproducible Results

With Intel® Performance Libraries, learn which conditions make it possible to build serial and parallel applications that deliver repeatable results. (61:11 min)

Dedicated Code Analysis

Find out how Intel® VTune™ Profiler monitors high performance computing (HPC) workloads and generates reports to optimize various platform components. (56 min)

Expertly Tune Your Application

See how to tune and accelerate compute-intensive performance with the features and architecture of Intel® Xeon® Scalable processors and Intel® Software Development Tools.

Uncover Common Issues with Threaded Performance

Locate performance and scalability issues, and identify whether imbalance, lock contention, creation overhead, or scheduling overhead cause them.

Find & Fix Common Performance Bottlenecks

Identify where code performance needs to improve and learn how to fix it.

Fix Critical Performance Hot Spots & Bottlenecks

Use key components in Intel® System Studio to correct hot spots, power inefficiencies, memory leaks, non-optimized threads, and other system issues.

Take a Snapshot of Application Performance

See a demonstration of the Application Performance Snapshot. Quickly discover untapped performance and make the best use of your computer hardware. (20:27 min)

A Performance Analysis of Python Applications

This video discusses the needs, advantages, and common tools and techniques used to profile Python applications. It includes a demo and code sample. (47:28 min)

How Non-Uniform Memory Access Affects Workloads

In multisocket non-uniform memory access (NUMA) systems, get the best performance through memory object placement on the memory subsystem. (58:39 min)

Give Your Hybrid Program a Performance Tune-Up

Find the root causes of why a hybrid application may not be performing as expected using this walk-through of uncovering and resolving each issue. (43:49 min)

Strategies for Tuning Multilevel Parallelism

Learn where to add parallelism to your application and determine how scalable it can be. (57:41 min)

Samples

Name Description
matrix
  • Description: Calculates matrix transformations and identifies general hardware issues in a C++ application
  • Performance issues addressed: Poor cycles per instruction (CPI) rate, cache misses, retire stalls, execution stalls, and others
  • Location: \samples\<locale>\C++\matrix

Product and Performance Information

1

Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804