User Guide

  • 2020
  • 06/18/2020
  • Public Content
Contents

QNX* Targets

Intel® VTune™
Profiler
supports collecting performance data on QNX* target systems.
Data collection is possible via command line interface from a host system running Windows* or Linux* to the target QNX system. The collected traces are transferred to the host system via ethernet and stored for review. After collection, the performance results can be imported and viewed in the
Intel VTune
Profiler
user interface.
The target collector can be integrated into the target QNX image during the image build process and requires only 1 MB of space on the target file system. Because the traces are transferred to the host system, collection can be done on target systems with limited storage capacity or with read-only file systems.

Prerequisites

  • Host System: Linux* or Windows* system with QNX BSP and
    VTune
    Profiler
    installed
  • Target System: Supported processor with QNX7 operating with instrumental kernel, connected to the host system via ethernet. Supported processors include Intel® Pentium®, Intel® Celeron®, or Intel Atom® processors formerly code named Apollo Lake or Intel Atom® processors formerly code named Denverton.
  • Turn off firewall restrictions for network connections between the host system and target system

Set up Your System

Complete the following steps on your host and target system to install collectors and enable performance analysis using
Intel VTune
Profiler
:
  1. Ensure that the host system is connected to the target QNX system via ethernet and log in to the target QNX system using a command window.
  2. Make the
    < install-dir >/target/qnx_x86_64/bin64/sep
    file on the host system available on the target QNX system by copying, mounting a network share, or integrating it into the target image.
  3. On the host system, launch the
    VTune
    Profiler
    user interface, click
    New Project
    , specify a project name, and click
    Create Project
    .
  4. Click
    Configure Analysis
    , select
    local host
    in the
    WHERE
    pane, and click
    Search Binaries
    .
  5. In the
    Binary/Symbol Search
    window, browse to the location of the kernel and application target modules on the host system, and click
    OK
    .

Run Analysis

Analysis is run using collectors previously installed on the target QNX system and a command invoked on the host Windows or Linux system. All result files are saved to the host system.
  1. On the target QNX system, run the following command:
    <sep-dir>/sep
    Where
    <
    sep-dir
    >
    is the location where the
    sep
    file was copied. The target collector loads and waits for the host system to connect.
  2. On the host system, run one of the following analysis commands.
    • Hotspots with call stacks
      :
      <install-dir>
      /bin64/sep -start -d
      <duration>
      -target-ip
      <target-ip-address>
      -target-port 9321 -lbr call_stack -out
      <filename>
      .tb7
      Example command:
      /opt/intel/vtune_
      profiler
      /bin64/sep -start -d 60 -target-ip 12.345.67.89 -target-port 9321 -lbr call_stack -out hotspots_callstacks.tb7
      Call stacks are hardware based and limited to a depth of 16 frames. Due to hardware limitations, the depth of the captured call stack can be less than 16 frames.
    • Custom CPU events
      :
      <install-dir>
      /bin64/sep.exe -start -d
      <duration>
      -target-ip
      <target-ip-address>
      -target-port 9321 -ec "
      <event-list>
      " -out
      <filename>
      .tb7
      Example command:
      /opt/intel/vtune_
      profiler
      /bin64/sep.exe -start -d 60 -target-ip 12.345.67.89 -target-port 9321 -ec "MEM_LOAD_UOPS_RETIRED.DRAM_HIT,MEM_LOAD_UOPS_RETIRED.HITM,MEM_LOAD_UOPS_RETIRED.L2_HIT" -out custom.tb7
      See the Sampling Enabling Product User's Guide for more information.
  3. After collection begins, run the application on the target QNX system or ensure that it is already running. The analysis collects system-wide data. Collection stops automatically when the specified duration is complete.
  4. After collection is complete, stop the application on the target QNX system if it is not already finished.

View and Interpret Results

After collection is complete, the
*.tb7
result file is available on the host system.
  1. On the host system, import the
    *.tb7
    file into the previously created project.
  2. Switch to the
    Hotspots
    viewpoint and review the performance data collected.
    • If you collected hotspots data, begin with the
      Summary
      window in the
      Hotspots
      viewpoint. The
      Top Hotspots
      list shows the top 5 functions that occupied the most CPU time. Double-click a function to be taken to the
      Bottom-up
      window where you can see aggregated performance data and a timeline showing activity over the entire collection. For more information, see Hotspots View .
    • If you collected CPU event data, begin with the
      Microarchitecture Exploration
      viewpoint. For more information, see Microarchitecture Exploration View .

Product and Performance Information

1

Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.

Notice revision #20110804