Инструментарии Intel® Trace Analyzer и Intel® Trace Collector

Tracing of Failing MPI Applications

Normally if an MPI application fails or is aborted, all trace data collected so far is lost: libVT needs a working MPI to write the trace file, but the MPI standard does not guarantee that MPI is still operational after a failure. In practice most MPI implementations just abort the application. To solve this problem, link the application against libVTfs instead of libVT, like this:

Signals

libVTcs uses the same techniques as fail-safe MPI tracing to handle failures inside the application, therefore it will generate a trace even if the application segfaults or is aborted with CTRL-C.

matched-vtf

Syntax

--matched-vtf

Default

off

Description

When converting from STF to ASCII-VTF communication records are usually split up into conventional VTF records. If this option is enabled, an extended format is written, which puts all information about the communication into a single line.

Intel® Trace Collector API

The Intel® Trace Collector library provides the user with a number of routines that control the profiling library and record user-defined activities, define groups of processes, define performance counters and record their values. Header files with the necessary parameter, macro and function declarations are provided in the include directory: VT.h for ANSI C and C++ and VT.inc for Fortran 77 and Fortran 90.

VT_Region Class Reference

Description

This is similar to VT_Function, but should be used to mark regions within a function.

The difference is that source code locations can be provided for the beginning and end of the region, and one instance of this class can be used to mark several regions in one function.

Constructor & Destructor Documentation

VT_Region()

Default constructor. Does not start a region.

CHECK-TIMEOUT

Syntax

CHECK-TIMEOUT <time>

Variable

VT_CHECK_TIMEOUT

Default

5s

Description

After stopping one process because it cannot or is not allowed to continue, the other processes are allowed to continue for this amount of time to see whether they run into other errors.

Подписаться на Инструментарии Intel® Trace Analyzer и Intel® Trace Collector