Tachyon ray tracer port on Intel® Xeon Phi™ coprocessor

Tachyon ray tracer port on Intel® Xeon Phi™ coprocessor

This webinar will present a practical case study of porting the Tachyon, an open source ray tracer, part of the SpecMPI suite, to Intel® Xeon Phi™ coprocessor. The Initial port revealed disappointing performance, e.g. the combined Intel® Xeon® processor and Intel Xeon Phi coprocessor version ran 2.6x slower than Xeon-only version. To achieve good performance some code modifications needed to be introduced improving both processor and coprocessor parts. Intel® Cluster Studio XE is used to pinpoint the problems and will highlight key code changes which helped achieve significant improvements (up to 7x vs from initial baseline, and 1.8x speed up vs improved Xeon version). The application exploits parallelism at multiple levels - symmetric MPI execution model, OpenMP-based multi-threading, and explicit SIMD (using SSE2/AVX/Xeon Phi instructions). Several software tools will be highlighted – Intel® Trace Analyzer and Collector, and Intel® VTune™ Amplifier XE in combination with MPI* and OpenMP* programming models, as well as a SIMD-enabled 3D vector operations library (reused and extended from Embree, the open source ray tracer by Intel Labs). Algorithmic changes include MPI-based dynamic scheduling, introduction of explicit intrinsics-based SIMD support, enabling greater OpenMP parallelism capacity.

View Slides

Download Video

1280X720 (151.01 MB)

For more complete information about compiler optimizations, see our Optimization Notice.