Optimize Multithreaded Performance


In modern multicore systems, threaded performance is critical for exploiting the full potential of the processor. Intel® VTune™ Profiler helps you tune your software to make effective use of all cores.

Find Common Causes of Slow Threaded Code

The Locks and Waits analysis helps you focus your tuning efforts and envision potential improvements. Use it to identify synchronization objects (locks) that prevent effective processor utilization and to estimate the impact and wait time each lock has on application performance.

See a prioritized list of synchronization objects that negatively impact performance.

Tune Parallel Performance

Intel VTune Profiler has the built-in ability to discern parallel programming models (including OpenMP* 4.0 and Intel® Threading Building Blocks) making it easy to visualize and understand multithreading concepts such as a task beginning and ending, synchronizing, and waiting. Get the data you need to tune performance and see which parallel regions are inefficient and why (for example, imbalance, lock contention, and communication).

Detailed data for each OpenMP region highlights tuning opportunities.

Visually Spot Inefficient Threading

Use the timeline to spot patterns of inefficient threading (like coarse-grained locks). The image shows multiple threads, but only one thread (dark green) runs at a time. No work is done in parallel due to data sharing issues. The timeline lets you visually spot threading inefficiencies. In this example, there are four threads, but only one is running at any given time, so thread concurrency is very low.

See Lock Contention

Another common threading performance issue is when multiple threads contend for the same lock. This becomes obvious when the timeline is dominated by yellow transition lines. A high density of transitions may indicate lock contention and poor parallel performance.

More Effective OpenMP* Tuning

The summary report quickly delivers the top four answers you need to effectively improve OpenMP performance. For additional details on each region, under OpenMP Region, select the links.

Getting the right data makes tuning OpenMP much more effective.

Additional Capabilities

Single Thread

Optimize single-threaded performance.

System

See a system-level view of application performance.

Media & OpenCL™ Applications

Deliver high-performance image and video processing pipelines.

HPC & Cloud

Access specialized, in-depth analyses for HPC and cloud computing.

Memory & Storage Management

Diagnose memory, storage, and data plane bottlenecks.

Analyze & Filter Data

Mine data for answers.

Environment

Fits your environment and workflow.

Are you ready to try or purchase Intel VTune Profiler?

Informações de produto e desempenho

1

Os compiladores da Intel podem ou não otimizar para o mesmo nível de microprocessadores não Intel no caso de otimizações que não são exclusivas para microprocessadores Intel. Essas otimizações incluem os conjuntos de instruções SSE2, SSE3 e SSSE3, e outras otimizações. A Intel não garante a disponibilidade, a funcionalidade ou eficácia de qualquer otimização sobre microprocessadores não fabricados pela Intel. As otimizações que dependem de microprocessadores neste produto são destinadas ao uso com microprocessadores Intel. Algumas otimizações não específicas da microarquitetura Intel são reservadas para os microprocessadores Intel. Consulte os Guias de Usuário e Referência do produto aplicáveis para obter mais informações sobre os conjuntos de instruções específicos cobertos por este aviso.

Revisão do aviso #20110804