Advanced Optimizations for Intel® MIC Architecture

Compiler Methodology for Intel® MIC Architecture

Advanced Optimizations


This chapter details some of the advanced compiler optimizations for performance on Intel® MIC Architecture AND most of these optimizations are also applicable to host applications. This chapter includes topics such as the floating-point model, prefetching, use of streaming-stores, etc.  This is a good chapter for users still not seeing their desired performance OR are looking for the last level of performance enhancements.

Goals and Topics

Goals for this chapter are to explore a variety of advanced optimizations to determine which may be useful for your application:


It is essential that you read this guide from start to finish using the built-in hyperlinks to guide you along a path to a successful port and tuning of your application(s) on Intel® Xeon Phi™architecture.  The paths provided in this guide reflect the steps necessary to get best possible application performance.

The next chapter, The Native and Offload Programming Models, presents a variety of programming models and data considerations to help you get the most performance out of The Intel® Many Integrated Core Architecture (Intel® MIC Architecture)

For more complete information about compiler optimizations, see our Optimization Notice.