This article is part of the Intel® Modern Code Developer Community documentation which supports developers in leveraging application performance in code through a systematic step-by-step optimization framework methodology. This article addresses: Thread level parallelization.
Intel is bringing to market, in anticipation of general availability of the Intel® Xeon Phi™ Processor (codenamed Knights Landing), the Developer Access Program (DAP). DAP is an early access program for developers worldwide to purchase an Intel Xeon Phi Processor based system.
This article discussions parallelization and provides links that will help you understand your programming environment and evaluate the suitability of your app.
Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple, it could be easily implemented in any programming language. This paper shows that performance significantly improves when different optimization techniques are applied.
Learn how to use Offload over Fabric software for a server migration path.
Modern server farms consist of a large number of heterogeneous, energy-efficient, and very high-performance computing nodes connected with each other through a high-bandwidth network interconnect. Such systems pose one of the biggest challenges for engineers and scientists today: how to solve complex, real-world problems by efficiently using the enormous computational horsepower available from...
Get a background on vectorization and learn different techniques to evaluate its effectiveness.