The MPI Performance Snapshot (MPS) is a scalable lightweight performance tool for MPI applications. It collects a variety of MPI application statistics (such as communication, activity, and load balance) and presents it in an easy-to-read format. The tool is not available separately but is provided as part of the Intel® Trace Analyzer and Collector installation. This article will serve as a quick getting started guide.
The general matrix-matrix multiplication (GEMM) is a fundamental operation in most scientific, engineering, and data applications. There is an everlasting desire to make this operation run faster. Optimized numerical libraries like Intel® Math Kernel Library (Intel® MKL) typically offer parallel high-performing GEMM implementations to leverage the concurrent threads supported by modern multi-core architectures. This strategy works well when multiplying large matrices because all cores are used efficiently.
On April 10th, 2015, I was fortunate to travel from Hillsboro, Oregon to San Francisco, California especially to take part in the International NASA Space App Challenge hosted at Constant Contact. The challenge was held for two days in 133 cities around the world focusing on 4 themes: Earth, outer space, humans, and robotics.