In this article, an OpenMP*-based implementation of the Ant Colony Optimization algorithm is analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016, and improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks are introduced to achieve efficient scaling across a four-socket system based on the Intel® Xeon® processor E7-8890 v4.
High-quality image and video processing has become an important part of many professional and consumer applications. This article shares insights and methods gained during joint work by HP* Labs and Intel on optimizing several imaging algorithms.
One of the Intel® Modern Code Developer Challenge winners, Daniel Falguera, describes many of the optimizations he implemented and why some didn't work.
Setting up a cloud environment is complicated and, depending on the need, involves multiple elements such as databases, network infrastructure, and security. How do you increase the p
This case study examines the situation where the problem decomposition is the same for threading as it is for Message Passing Interface* (MPI); that is, the threading parallelism is elevated to the same level as MPI parallelism.
Learn about a technique that deals with the load imbalance of heterogeneous distributed systems, plus get sample source code.
This case study discusses Intel® Advanced Vector Extensions (Intel® AVX) and gives an overview of Black-Scholes valuation.