Intel® Trace Analyzer and Collector

Intel® Trace Analyzer and Collector 2017 Update 1 Readme

The Intel® Trace Analyzer and Collector for Linux* and Windows* is a low-overhead, scalable event-tracing library with graphical analysis that reduces the time it takes an application developer to enable maximum performance of cluster applications. This package is for users who develop on and build for Intel® 64 architectures on Linux* and Windows*, as well as customers running on Intel® Xeon Phi™. The package also includes an optional download for OS X*, for analysis only. You must have a valid license to download, install, and use this product.

  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 10
  • Microsoft Windows* 8.x
  • C/C++
  • Fortran
  • Intel® Trace Analyzer and Collector
  • Message Passing Interface (MPI)
  • Cluster Computing
  • Parallel Computing
  • Analyze Vector Instruction Set with Intel® VTune™ Amplifier XE

    Use Intel® VTune™ Amplifier XE to understand why the computation time of the 2/64 combination is worse than that of the 32/4 combination even though its elapsed time is much lower. The elapsed time for 32/4 cannot be reduced further because of the overhead of MPI deployments. As a result, it is better to focus on improving the computation time for the 2/64 combination instead.

    To analyze the application performance with VTune Amplifier:


    You have completed the Analyzing OpenMP* and MPI Applications tutorial with MPI Performance Snapshot, Intel® Trace Analyzer and Collector, and Intel® VTune™ Amplifier XE. Here are some important things to remember when working with your own hybrid application:

    Key Terms

    Baseline: A performance metric used as a basis for comparing application versions before and after optimization. A baseline should be measurable and reproducible.

    Computation Time: The time your application ran without any additional overhead (initialization time, finalization time, etc.). Computation time is included in Elapsed Time.
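To make the relationship between these terms concrete, here is a minimal sketch that derives computation time from elapsed time and compares an optimized run against a baseline. The function names and all timing values are hypothetical and chosen only for illustration; they are not taken from the tutorial or from any Intel tool.

```python
def computation_time(elapsed_s, init_s, finalize_s):
    """Return elapsed time minus initialization and finalization overhead."""
    overhead_s = init_s + finalize_s
    if overhead_s > elapsed_s:
        raise ValueError("overhead cannot exceed elapsed time")
    return elapsed_s - overhead_s


def speedup(baseline_s, optimized_s):
    """Ratio of the baseline measurement to the optimized one (>1 means faster)."""
    if optimized_s <= 0:
        raise ValueError("optimized time must be positive")
    return baseline_s / optimized_s


if __name__ == "__main__":
    # Hypothetical timings in seconds (assumed values, not measured data).
    baseline = computation_time(elapsed_s=100.0, init_s=5.0, finalize_s=3.0)
    optimized = computation_time(elapsed_s=54.0, init_s=5.0, finalize_s=3.0)
    print(f"baseline computation time:  {baseline:.1f} s")
    print(f"optimized computation time: {optimized:.1f} s")
    print(f"speedup over baseline:      {speedup(baseline, optimized):.2f}x")
```

Measuring the baseline the same way each time (same inputs, same process and thread counts) is what keeps a comparison like this meaningful.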

    Get a Performance Overview with MPI Performance Snapshot

    The first step in analyzing a hybrid MPI/OpenMP* application is to get an overview of its performance. The MPI Performance Snapshot (MPS) tool provides general performance information about your application, including MPI and OpenMP time, load balance, memory and disk usage, the most heavily used MPI operations, and more.
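As a rough illustration of what metrics such as MPI time and load balance express, the sketch below computes them from per-rank timings. The formulas are one common way to define these quantities and are an assumption here; they are not claimed to be what MPS computes internally. The function names and the sample numbers are hypothetical.

```python
def mpi_time_fraction(mpi_times_s, elapsed_s):
    """Average share of elapsed time that the ranks spent inside MPI calls."""
    if not mpi_times_s or elapsed_s <= 0:
        raise ValueError("need at least one rank and a positive elapsed time")
    return sum(mpi_times_s) / (len(mpi_times_s) * elapsed_s)


def load_imbalance(work_times_s):
    """(max - mean) / max of per-rank work time; 0.0 means perfectly balanced."""
    if not work_times_s:
        raise ValueError("need at least one rank")
    longest = max(work_times_s)
    if longest <= 0:
        raise ValueError("work times must be positive")
    mean = sum(work_times_s) / len(work_times_s)
    return (longest - mean) / longest


if __name__ == "__main__":
    # Hypothetical per-rank timings in seconds for a 4-rank run.
    elapsed = 10.0
    mpi_times = [2.0, 4.0, 6.0, 4.0]
    work_times = [elapsed - t for t in mpi_times]  # time outside MPI
    print(f"MPI time fraction: {mpi_time_fraction(mpi_times, elapsed):.0%}")
    print(f"load imbalance:    {load_imbalance(work_times):.0%}")
```

A high MPI time fraction combined with a high imbalance usually suggests that some ranks are waiting on others, which is the kind of pattern an overview tool helps surface before deeper tracing.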
