User Guide

  • 2021.2
  • 05/21/2021
  • Public Content
  • Download as PDF
Contents

CPU / Memory Roofline Insights
Perspective

Visualize actual performance against hardware-imposed performance ceilings by running the
CPU / Memory Roofline Insights
perspective. It helps you determine the main limiting factor (memory bandwidth or compute capacity) and provides an ideal roadmap of potential optimization steps.
Use the
Roofline
chart to answer the following questions:
  • What is the maximum achievable performance with your current hardware resources?
  • Does your application work optimally on current hardware resources?
  • If not, what are the best candidates for optimization?
  • Is memory bandwidth or compute capacity limiting performance for each optimization candidate?

How It Works

The
CPU / Memory Roofline Insights
perspective includes the following steps:
  1. Collect loop/function timings using the
    Survey
    analysis.
  2. Collect floating-point and/or integer operations data, memory traffic data, and measure the hardware limitations of your machine using the
    FLOP
    analysis in the
    Characterization
    step.
    This collection can take three to four times longer than the Survey analysis.

CPU Roofline Report

The
Roofline
chart plots an application's
achieved performance
and
arithmetic intensity
against the machine's
maximum achievable performance
:
  • Arithmetic intensity (x axis) - measured in number of floating-point operations (FLOPs) and/or integer operations (INTOPs) per byte, based on the loop/function algorithm, transferred between CPU/VPU and memory
  • Performance (y axis) - measured in billions of floating-point operations per second (GFLOPS) and/or billions of integer operations per second (GINTOPS)
Example of a CPU Roofline report

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.