Diagnose Memory, Storage & Data Plane Bottlenecks


Not all workloads are compute-bound. Intel® VTune™ Profiler has specialized analyses for optimizing the use of memory and I/O bandwidth.

Optimize Bandwidth-Limited Software

Use the timeline to see the spikes in bandwidth used for DRAM and Intel® QuickPath Interconnect. To see which functions are consuming bandwidth at a specific time, select a spike in the timeline and filter on the selection. This lets you isolate the individual contributors to bandwidth consumption and tune effectively.

Functions that are significantly memory bound are highlighted in pink.

Identify Which Memory Objects Are Bottlenecks

A typical hotspot analysis shows code that is taking the most time. The Memory Access analysis offers a different perspective—it shows which memory objects cause performance issues, independent of where they are accessed. This can yield new insight on how to improve performance.

Available for Linux* targets only.

Tune Non-Uniform Memory Access (NUMA)

Some memory accesses can be slower than others. For example, on a two-socket system, latency is higher when a core in socket 0 accesses memory that is attached to socket 1. Memory Analysis in Intel VTune Profiler lets you identify frequently accessed data that is stored remotely and reconsider how you allocate memory. Memory access analysis shows both local memory access (which is fast) and remote memory access (which is slow). Changing your memory allocation to improve local access may improve performance.

Design & Optimize for Persistent Memory

Memory Access analysis also helps you decide which objects to allocate in Intel® Optane™ DC persistent memory. Place the hottest objects in DRAM, warm objects in persistent memory, and cold objects on an SSD or disk.

Available for Linux* targets only.

Uncover I/O Bottlenecks

Determine whether your application is I/O-bound or CPU-bound by exploring imbalance between I/O operations (synchronous and asynchronous) and compute. See when the CPU is waiting for I/O, and see storage accesses mapped to the source code.

Sliders on the histogram control the display of data in the grid and on the timeline, making data analysis easier.

Tune Polled I/O Using the Data Plane Development Kit (DPDK) & Storage Performance Development Kit (SPDK)

DPDK and SPDK are built for fast packet processing and high-performance storage. Both operate in polled mode instead of using interrupts. Applications check for more work from the NIC (DPDK) or the SSD (SPDK). The problem with most profilers is there is no way to tell if a thread is heavily loaded or lightly loaded because polling always puts the CPU use at 100 percent. But because Intel VTune Profiler can track cycles where no work is done, it can show you which threads are heavily loaded and which are not.

Analysis with SPDK goes beyond simple aggregate data for the I/O channel and details the data for each attached device. This gives you a more detailed picture of complex I/O workloads.

Set up I/O analysis with just a few clicks.

Data Plane Development Kit

Storage Performance Development Kit

Determine Which Systems Benefit from Faster Storage

Storage Performance Snapshot shows system storage bottlenecks for servers and workstations with directly attached storage. Easy to install, this tool helps you determine which workloads need further analysis and where faster storage improves performance. This snapshot comes with Intel VTune Profiler and is also available separately to facilitate a quick system check.

Get a quick view of:

  • I/O boundedness
  • Storage and network saturation
  • CPU utilization
  • Memory capacity saturation

Get system data while running workloads to see how migration to Serial ATA and PCIe* SSDs can offer better solutions, user experiences, and performance density.

Storage Performance Snapshot

Collect data on Windows* or Linux systems and view the results in a web browser.

Additional Capabilities

Single Thread

Optimize single-threaded performance.

Multithread

Effectively use all available cores.

System

See a system-level view of application performance.

Media & OpenCL™ Applications

Deliver high-performance image and video processing pipelines.

HPC & Cloud

Access specialized, in-depth analyses for HPC and cloud computing.

Analyze & Filter Data

Mine data for answers.

Environment

Fits your environment and workflow.

Are you ready to try or purchase Intel VTune Profiler?

Informações de produto e desempenho

1

Os compiladores da Intel podem ou não otimizar para o mesmo nível de microprocessadores não Intel no caso de otimizações que não são exclusivas para microprocessadores Intel. Essas otimizações incluem os conjuntos de instruções SSE2, SSE3 e SSSE3, e outras otimizações. A Intel não garante a disponibilidade, a funcionalidade ou eficácia de qualquer otimização sobre microprocessadores não fabricados pela Intel. As otimizações que dependem de microprocessadores neste produto são destinadas ao uso com microprocessadores Intel. Algumas otimizações não específicas da microarquitetura Intel são reservadas para os microprocessadores Intel. Consulte os Guias de Usuário e Referência do produto aplicáveis para obter mais informações sobre os conjuntos de instruções específicos cobertos por este aviso.

Revisão do aviso #20110804