Intel® VTune™ Amplifier
Use the CPU/FPGA Interaction analysis to assess the balance between the CPU and FPGA on systems with a discrete Intel® Arria® 10 FPGA running OpenCL™ applications.
This is a PREVIEW FEATURE. A preview feature may or may not appear in a future production release. It is available for your use in the hopes that you will provide feedback on its usefulness and help determine its future. Data collected with a preview feature is not guaranteed to be backward compatible with future releases. Please send your feedback to parallel.studio.support@intel.com or to intelsystemstudio@intel.com.
Use the CPU/FPGA Interaction analysis to assess the FPGA time spent executing kernels, the overall time for memory transfers between the CPU and FPGA, and the impact of wait time on CPU and FPGA workloads.
To configure options for the CPU/FPGA Interaction analysis:
Prerequisites: Create a project.
Click the Configure Analysis button on the Intel® VTune™ Amplifier toolbar (standalone GUI or Visual Studio IDE).
The Configure Analysis window opens.
From the WHAT pane, click the Browse button and select Launch Application or Attach to Process.
From the HOW pane, click the Browse button and select CPU/FPGA Interaction.
Configure the collection options, including sampling interval and collection with or without call stacks.
You may generate the command line for this configuration using the Command Line button at the bottom.
Click the Start button to run the analysis.
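If you script collections instead of starting them from the GUI, the generated command line can be assembled programmatically. The following is a minimal sketch, not an official interface. It assumes the amplxe-cl command-line tool is on your PATH and uses its -collect and -result-dir options. The function name build_collect_command, the result directory r000fpga, and the application ./my_opencl_app are hypothetical, and the analysis type is left as a placeholder: copy the exact value from the command line that the Command Line button generates for your configuration.

```python
import shlex


def build_collect_command(analysis_type, result_dir, app_command):
    """Assemble an amplxe-cl collection command as an argument list.

    analysis_type: the value shown after -collect in the command line
                   generated by the Command Line button (not guessed here).
    result_dir:    directory where the result is written.
    app_command:   the application to launch and its arguments.
    """
    if not app_command:
        raise ValueError("app_command must name the application to launch")
    return [
        "amplxe-cl",
        "-collect", analysis_type,
        "-result-dir", result_dir,
        "--", *app_command,
    ]


# Placeholder values for illustration only (all hypothetical).
command = build_collect_command(
    "<analysis-type>",
    "r000fpga",
    ["./my_opencl_app", "--iterations", "10"],
)

# Print a shell-safe rendering of the command for review before running it.
print(" ".join(shlex.quote(arg) for arg in command))
```

The argument list can be passed to subprocess.run once the placeholder is replaced. Building a list rather than a single string avoids shell quoting problems in application arguments.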
The CPU/FPGA Interaction analysis results appear in the CPU/FPGA Interaction viewpoint, which consists of the following windows/panes:
Summary window displays statistics on the overall application execution, identifying CPU time and processor utilization, and execution time for FPGA OpenCL™ kernels.
Bottom-up window displays functions in the bottom-up tree, CPU time and CPU utilization per function.
Platform window displays over-time data for OpenCL kernels and memory transfers, CPU context switches, FPU utilization, and CPU threads with OpenCL kernels.
Use the FPGA Hotspots viewpoint to review the following:
FPGA Utilization: Look at the FPGA Top Compute Tasks on the Summary window for a list of OpenCL kernels running on the FPGA. The Bottom-up window shows the Total and Average execution time for every kernel.
Memory Transfers: Look at the Data Transferred column on the Bottom-up window or the Computing Queue rows on the Platform window to view the FPGA OpenCL kernels and memory transfers.
Workload Impact: The Context Switch Time metric on the Summary window shows how much time was spent in CPU context switches. Context switches can also be seen on the Platform window as they occurred during application execution.
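To make the Total and Average execution time metrics concrete, the sketch below aggregates per-kernel durations the same way in principle: one total and one average per kernel name. It is an illustration only and does not read VTune Amplifier result files. The record layout, the kernel names, the timing values, and the helper names summarize_kernels and fpga_busy_fraction are all hypothetical. The busy fraction additionally assumes that kernel executions do not overlap in time.

```python
from collections import defaultdict


def summarize_kernels(records):
    """Return {kernel_name: (total_seconds, average_seconds, call_count)}.

    records is an iterable of (kernel_name, duration_seconds) pairs,
    one pair per kernel execution.
    """
    durations = defaultdict(list)
    for name, seconds in records:
        durations[name].append(seconds)
    return {
        name: (sum(values), sum(values) / len(values), len(values))
        for name, values in durations.items()
    }


def fpga_busy_fraction(records, elapsed_seconds):
    """Fraction of elapsed time spent in kernels.

    Assumes kernel executions never overlap; with concurrent kernels
    the summed time would overstate how busy the FPGA was.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return sum(seconds for _, seconds in records) / elapsed_seconds


# Hypothetical measurements: (kernel name, execution time in seconds).
sample = [("vec_add", 0.5), ("matmul", 2.0), ("vec_add", 1.5)]

for kernel, (total, average, count) in sorted(summarize_kernels(sample).items()):
    print(f"{kernel}: total={total:.2f}s average={average:.2f}s calls={count}")

print(f"FPGA busy fraction: {fpga_busy_fraction(sample, 8.0):.2f}")
```

A kernel with a large total but small average is called often, while a large average points to a single long-running kernel. The two cases usually call for different tuning.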