This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
This article focuses on aspects of porting Fortran codes to the Intel® Xeon Phi™ coprocessor. Most of the documentation for the coprocessor is C/C++ centric.
We had an ask from one of the various "Birds of a Feather" meetings Intel® holds at venues such as at the Super Computing* (SC) and International Super Computing* (ISC) conferences.
I don’t know if any of you have noticed but Intel® has a tendency to emphasize its own homegrown tools. This isn’t bad as Intel has some of the best.
MPI Symmetric Mode is widely used in systems equipped with Intel® Xeon Phi™ coprocessors.
Running large scale Intel® MPI applications on Omni-Path or InfiniBand* clusters, one might have recognized an increasing time spend within the MPI_Init() routine.
The Intel® MPI Library includes a tool - mpitune - that can help to optimize the execution parameters of the Intel MPI Library itself.
VTune Amplifier uses SCP as a mechanism to copy files to and from the host and the target system, while performing a remote collection.