Intel® C++ Compiler

Timing Programs and Program Segments

You should measure performance to find and understand bottlenecks. Even small changes in a program can lead to large and sometimes surprising performance differences. The only reliable way to tune performance is to measure frequently—preferably on a mix of different systems. Use any tool or technique at your disposal, but only true measurements will determine if your optimizations are effective.

OpenMP* Pragmas Summary

This is a summary of the OpenMP* pragmas supported in the Intel® C++ Compiler. For detailed information about the OpenMP* API, see the OpenMP Application Program Interface Version TR4: Version 5.0 specification, which is available from the OpenMP* web site.

PARALLEL Pragma

Use this pragma to form a team of threads and execute those threads in parallel.

GAP Message (Diagnostic ID 30756)

Message

Split the structure '%s' into two parts to improve data locality. Frequently accessed fields are '%s'; performance may improve by putting these fields into one structure and the remaining fields into another structure. Alternatively, performance may also improve by reordering the fields of the structure. Suggested field order: '%s'.

Controlling the Coprocessor's Execution Environment

This topic only applies when targeting Intel® Many Integrated Core Architecture (Intel® MIC Architecture).

You can modify the coprocessor's execution environment using the following mechanisms:

  • Setting environment variables for the coprocessor on the CPU, and passing them to the coprocessor

  • calling functions from the CPU

In general, function calls override environment variables.

_GFX_offload

Puts the task into the in-order offload queue. This topic only applies when targeting Intel® Graphics Technology. This function replaces the deprecated function _GFX_enqueue.

Subscribe to Intel® C++ Compiler