runsa/runss Custom Command Line Analysis

Use the collect-with action to configure and run a custom analysis using any of the following data collectors:

runsa

The hardware event-based sampling collector of the VTune Amplifier profiles your application using the counter overflow feature of the Performance Monitoring Unit (PMU).

Syntax:

amplxe-cl -collect-with runsa [-knob <knobName=knobValue>] [--] <target>

Knobs:

analyze-loops enable-thread-affinity
analyze-mem-objects uncore-sampling-interval
analyze-openmp enable-user-tasks
chipset-event-config event-config
collect-io-waits event-mode
collect tsx-cycles ftrace-config
counting-mode gpu-counters-mode
dram-bandwidth-limits gpu-profiling-mode
enable-driverless-collection gpu-sampling-interval
enable-context-switches io-mode
enable-gpu-runtimes kernels-to-profile
enable-gpu-usage mem-object-size-min-thres
enable-parallel-fs-collection mrte-type
enable-thread-affinity sampling-interval
enable-stack-collection stack-size
enable-system-cswitch stack-type

Note

For the most current information on the available knobs (configuration options) for hardware event-based sampling, enter:

amplxe-cl -help collect-with runsa

To display a list of events available on the target PMU, enter:

amplxe-cl -collect-with runsa -knob event-config=? <target>

The command returns the names and short descriptions of the available events. For more information on the events, see the Intel Processor Events Reference.

Example 1:

This example runs a custom hardware event-based sampling collection for the sample application with the specified events:

amplxe-cl -collect-with runsa -knob event-config=CPU_CLK_UNHALTED.CORE,CPU_CLK_UNHALTED.REF,INST_RETIRED.ANY -- /home/test/sample
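The event-config value in Example 1 is a comma-separated list of event names. When the list is long or assembled by a script, it can help to build the value first. The following is a minimal sketch, not part of the product: join_events is a hypothetical helper, and the command is only printed, not run.

```shell
#!/bin/sh
# join_events (hypothetical helper): join event names, one per argument,
# into a comma-separated value suitable for the event-config knob.
join_events() {
    joined=""
    for ev in "$@"; do
        if [ -z "$joined" ]; then
            joined="$ev"
        else
            joined="$joined,$ev"
        fi
    done
    printf '%s\n' "$joined"
}

events=$(join_events CPU_CLK_UNHALTED.CORE CPU_CLK_UNHALTED.REF INST_RETIRED.ANY)

# Print the command instead of running it, so the sketch works without VTune Amplifier installed.
echo amplxe-cl -collect-with runsa -knob "event-config=$events" -- /home/test/sample
```

Remove the echo to run the collection on a system where amplxe-cl is available.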

Example 2:

This example configures and runs a custom hardware event-based sampling collection with the stack size limited to 8192 bytes, and uses the sa option to set a custom Sample After value for the CPU_CLK_UNHALTED.REF_TSC event:

amplxe-cl -collect-with runsa -knob enable-stack-collection=true -knob stack-size=8192 -knob event-config=CPU_CLK_UNHALTED.REF_TSC:sa=1800000,CPU_CLK_UNHALTED
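A Sample After value is the number of event occurrences counted between two samples, so for a clock event it can be estimated from the event rate and the desired time between samples. The sketch below illustrates the arithmetic only. The sample_after helper is hypothetical, and the 1.8 GHz reference clock and 1 ms interval are assumptions chosen for illustration; the documentation above does not state them.

```shell
#!/bin/sh
# sample_after FREQ_HZ INTERVAL_US (hypothetical helper)
# Prints the number of events counted between two samples:
#   events per sample = events per second * seconds per sample
sample_after() {
    echo $(( $1 * $2 / 1000000 ))
}

# Assumption: a 1.8 GHz reference clock and a 1000 us (1 ms) target interval.
sav=$(sample_after 1800000000 1000)

# Print the resulting knob value for the event used in Example 2.
echo "event-config=CPU_CLK_UNHALTED.REF_TSC:sa=$sav"
```

Under these assumptions the computed value is 1800000, the same Sample After value used in Example 2. A smaller value samples more often and increases collection overhead.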

runss

The user-mode sampling and tracing collector profiles an application's execution and takes snapshots of how that application utilizes the processors in the system. The collector interrupts a process, collects the values of all active instruction addresses, and captures a calling sequence for each sample.

Syntax:

amplxe-cl -collect-with runss [-knob <knobName=knobValue>] [--] <target>

Knobs:

analyze-loops gpu-sampling-interval
analyze-openmp gpu-profiling-mode
cpu-samples-mode ftrace-config
counting-mode io-mode
enable-gpu-runtimes kernels-to-profile
enable-gpu-usage mrte-type
enable-user-tasks no-altstack
enable-user-sync sampling-interval
enable-parallel-fs-collection signals-mode
enable-thread-affinity stackwalk-mode
event-config stack-stitching
gpu-counters-mode waits-mode

Note

For the most current information on the available knobs (configuration options) for user-mode sampling and tracing, enter:

amplxe-cl -help collect-with runss

Example:

This example runs a user-mode sampling and tracing collection for the sample application with loop analysis enabled:

amplxe-cl -collect-with runss -knob analyze-loops=true -- /home/test/sample
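Both collectors share the same syntax: the collector name follows -collect-with, each knob is passed as its own -knob name=value pair, and the target follows the -- separator. The sketch below assembles such a command line from its parts. build_collect_cmd is a hypothetical helper and only prints the command, so paths containing spaces would need additional quoting before the output is executed.

```shell
#!/bin/sh
# build_collect_cmd COLLECTOR TARGET [knobName=knobValue ...] (hypothetical helper)
# Prints an amplxe-cl command line; it does not run the collection.
build_collect_cmd() {
    collector=$1
    target=$2
    shift 2
    cmd="amplxe-cl -collect-with $collector"
    # Each knob becomes a separate "-knob name=value" pair.
    for knob in "$@"; do
        cmd="$cmd -knob $knob"
    done
    # The "--" separator marks the end of the options and the start of the target.
    printf '%s\n' "$cmd -- $target"
}

build_collect_cmd runss /home/test/sample analyze-loops=true
build_collect_cmd runsa /home/test/sample enable-stack-collection=true stack-size=8192
```

The first call prints the same command as the runss example above.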

What's Next

When the data collection is complete, do one of the following to view the result:
