Optimize Single-Threaded Performance


Single-threaded performance remains important to well-tuned applications, even in today's multicore world. Collect accurate data with minimal overhead so you can focus your tuning effort where it has the most impact.

Find Hotspots

Identifying code that uses a lot of processor time is often the first step in single-threaded optimization. Hotspot analysis gives you a list of functions sorted by the amount of processor time they consume. Optimizing the functions at the top of that list usually gives you the biggest performance gain.

Hotspot analysis lists functions that consume a lot of processor time. Click + for the call stacks. To see the source, double-click a table entry. Colors show threading efficiency.
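The idea of a hotspot list can be illustrated with a minimal Python sketch. It is not how Intel VTune Profiler collects data: it assumes Python's standard `cProfile` module (an instrumenting profiler) as a stand-in, and the functions `slow_sum`, `fast_sum`, `workload`, and `rank_hotspots` are hypothetical names made up for the example. It only shows the shape of the result, which is a list of functions sorted by the time spent in them.

```python
import cProfile
import pstats


def slow_sum(n):
    # Hypothetical hotspot: a pure-Python loop.
    total = 0
    for i in range(n):
        total += i * i
    return total


def fast_sum(n):
    # Hypothetical cheap function: the work happens in a built-in.
    return sum(range(n))


def workload():
    slow_sum(200_000)
    fast_sum(200_000)


def rank_hotspots(func):
    """Profile func and return (function name, self time) pairs, hottest first."""
    profiler = cProfile.Profile()
    profiler.enable()
    func()
    profiler.disable()
    stats = pstats.Stats(profiler)
    # stats.stats maps (file, line, name) -> (cc, nc, tottime, cumtime, callers);
    # index 2 of the value is the time spent in the function itself.
    rows = [(key[2], value[2]) for key, value in stats.stats.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    for name, seconds in rank_hotspots(workload)[:5]:
        print(f"{name:45s} {seconds:.4f} s")
```

Reading the list from the top down tells you where tuning effort is most likely to pay off.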

View Profiling Results on Your Source Code

Once you find the time-consuming functions, the next step is to figure out which part of each function needs improvement. Double-clicking an entry in the hotspot list takes you directly to the source, showing the hottest spot in the function. Intel® VTune™ Profiler supports most standards-compliant compilers for native languages such as C, C++, and Fortran. No special compiler is required. Just use your normal production build and include symbols.

To view the hottest spot, double-click the function list.

Python*, C++, C & Fortran Profiling

Statistical sampling techniques capture accurate profiling data with minimal overhead.

  • Avoid the heavy instrumentation overhead of many Python* profilers.
  • Profile pure Python or Python plus native code extensions in C, C++, or Fortran.
  • Tune native code extensions, which Python-only profilers cannot see into.
  • Get source-line detail that includes call stacks.

View the profiling results on your Python source code.
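Statistical sampling can be sketched in a few lines of Python. This is an illustration of the general technique only, not Intel VTune Profiler's implementation: the `SamplingProfiler` class and `busy_loop` function are hypothetical, and the sketch relies on CPython's `sys._current_frames()` to look at the profiled thread from a background thread. Instead of instrumenting every call, it wakes up at intervals and records which function and source line the target thread is executing.

```python
import collections
import sys
import threading
import time


class SamplingProfiler:
    """Periodically sample the function and line the creating thread is running."""

    def __init__(self, interval=0.001):
        self.interval = interval              # seconds between samples (approximate)
        self.counts = collections.Counter()   # (function name, line number) -> samples
        self._target_id = threading.get_ident()
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def _run(self):
        # wait() returns False on timeout, so the loop runs until stop is set.
        while not self._stop.wait(self.interval):
            frame = sys._current_frames().get(self._target_id)
            if frame is not None:
                self.counts[(frame.f_code.co_name, frame.f_lineno)] += 1

    def by_function(self):
        """Aggregate the per-line samples into per-function totals."""
        totals = collections.Counter()
        for (name, _line), samples in self.counts.items():
            totals[name] += samples
        return totals

    def __enter__(self):
        self._thread.start()
        return self

    def __exit__(self, *exc_info):
        self._stop.set()
        self._thread.join()
        return False


def busy_loop(duration):
    # Hypothetical workload: spin for roughly `duration` seconds.
    deadline = time.perf_counter() + duration
    x = 0
    while time.perf_counter() < deadline:
        x += 1
    return x


if __name__ == "__main__":
    with SamplingProfiler() as profiler:
        busy_loop(0.5)
    for name, samples in profiler.by_function().most_common(5):
        print(f"{name:20s} {samples} samples")
```

Because the profiled code runs unmodified between samples, the overhead stays low, and the sample counts approximate where time is spent. The sketch sees only Python frames; profiling native extensions requires a tool that can also sample native code.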

Low-Overhead Java* Profiling

Analyze Java* or Java mixed with native code, and map results to the original Java source. Unlike some high-overhead Java profilers that instrument the code, Intel VTune Profiler uses low-overhead statistical sampling.

Analyze running Java services (such as a mail service), even when they run in a container. Profile Java daemons without restarting them.

View the profiling results on your Java source.

Analyze Faster with Highlighted Tuning Opportunities

Event-based sampling uses the hardware performance monitoring unit (PMU) built into Intel® processors. PMU events quickly pinpoint specific tuning opportunities, such as backend stalls or cache misses, and the tool highlights them to make analysis and optimization easier.

A pink cell indicates a potential tuning opportunity. Hover over it to get suggestions.

Additional Capabilities

Multithreaded

Effectively use all available cores.

System

See a system-level view of application performance.

Media & OpenCL™ Applications

Deliver high-performance image and video processing pipelines.

HPC & Cloud

Access specialized, in-depth analyses for HPC and cloud computing.

Memory & Storage Management

Diagnose memory, storage, and data plane bottlenecks.

Analyze & Filter Data

Mine data for answers.

Environment

Fits your environment and workflow.


Product and Performance Information

1

Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804