Intel® HPC Developer Conference 2017
Astronomical catalogs derived from wide-field imaging surveys are the quintessential tool for understanding the universe. We construct an astronomical catalog from imaging data using Celeste, a Bayesian variational inference code written entirely in the high-productivity programming language, Julia.
Keno Fischer, Julia Computing Inc
Jeff Regier, UC Berkeley
I/O is a key part of all applications, but is often neglected when performance optimization is considered. Also, as I/O is a shared resource on computational systems, the performance can vary significantly for the same application. New memory technology that can be used for I/O is on the horizon.
Adrian Jackson, EPCC, The University of Edinburgh
This talk shares design-efficient MPI+X runtimes to accelerate applications for emerging HPC systems with millions of processors and featuring next-generation interconnects. Energy-aware designs, co-design schemes, cloud environments, features, and performance numbers (MVAPICH2 libraries) are also addressed.
Dhabaleswar K (DK) Panda and Hari Subramoni, The Ohio State University
It is essential to design programming models with a good separation of domain science and parallel system. LFRic (an atmospheric model for the Met Office in the United Kingdom) leverages Psyclone* (a code generator tool by Hartree Centre) used for operational weather forecasting and climate modeling by 2020.
Sergi Siso, STFC, Hartree Centre
This presentation focuses on defining and developing a new model that simulates a large graph on HPC with a task parallel approach.
Babak Saravi, Mississippi State University
This talk focuses on the capabilities that OSPRay and OpenSWR software-defined visualization rendering libraries bring to VTK and ParaView*.
David DeMarle, Kitware Inc.
Unlike other parallel programming models, the OpenCL™ platform provides practical portability across a wide range of architectures, offering perks (runtime kernel-compilation and being library only). We show an interdisciplinary OpenCL platform-based workflow and evaluate the OpenCL platform and its implementations from an HPC perspective.
Matthias Noack, Zuse Institute Berlin
We demonstrate a high-level vector coding scheme using OpenMP* 4 to approach SIMD vectorization of complex code sections in a portable way.
Florian Wende, Zuse Institute Berlin
Structural biology is going through a revolution where cryo-electron microscopy (cryo-EM) now determine 3D structures from thousands of noisy images, but it relies on very large computations. This talk presents our work with Intel to accelerate the RELION program with x86 SIMD, Intel® Threading Building Blocks, and Intel® Math Kernel Library to provide outstanding performance.
Erik Lindahl, Stockholm University
Charles Congdon, Intel
Explore thread- and data-level performance optimizations of key computational routines of an unstructured computational aerodynamics application with irregular memory accesses on Intel® Xeon® and Intel® Xeon Phi™ processors.
Mohammed Al Farhan and David Keyes, King Abdullah University of Science and Technology
We discuss RckT, a scalable, physically accurate, spectral rendering system that builds on OSPRay (a high-fidelity visualization framework). Rckt is an extensible framework used to implement scalable ray-based rendering techniques for high-performance visual analysis tools across several domains.
Christiaan Gribble, SURVICE Engineering
With technical details of development and parallelization schemes, this session introduces an example of practical applications that shows the strength of manycore computing with Intel® Xeon Phi™ processors in terms of speed and energy-efficiency through a solid comparison to NVIDIA* P100 GPGPU devices.
Hoon Ryu, Intel, Korea Institute of Science and Technology Information
Traditional large HPC simulation codes rely on MPI or MPI plus OpenMP for their parallelization over clusters of more than 100,000 cores. This approach of task-based parallelism strategy is used in SPH With Interdependent Fine-Grained Tasking, or SWIFT. This open-source cosmological code makes use of vectorization, dynamic scheduling, task-based I/O, and more.
Matthieu Schaller, Institute for Computational Cosmology (ICC), Durham University
This talk presents MoonRay, a high-performance production rendering architecture using Monte Carlo path tracing developed at DreamWorks Animation. MoonRay is the first production path tracer designed to fully leverage SIMD vector units throughout.
Mark Lee, DreamWorks Animation
This session focuses on Flow Graph, an extension to the Intel® TBB interface that can be used as a coordination layer for heterogeneity that retains optimization opportunities and composes with existing models.
Pablo Reble, Intel
Intel® MPI Library over multi-endpoints allows a single multithreaded rank per node to achieve near-line rate on Intel® Omni-Path Architecture over a variety of communication patterns. It allows significant improvement in hybrid MPI plus OpenMP* applications when MPI is called from threads.
Michael Chuvelev, Intel
In this technical lecture, we describe our work instrumenting LAMMPS for interactive in-situ visualization with SENSEI and OSPRay. We show results of our implementation on supercomputers based on the Intel® Xeon Phi™ architecture, such as Stampede 2 at Texas Advanced Computing Center (TACC) and Theta at Argonne National Laboratory (ANL).
Will Usher and Valerio Pascucci, Scientific Computing and Imaging Institute, University of Utah
Silvio Rizzi and Joseph Insley, Argonne National Laboratory
Depth perception cues are critical for understanding biological structure and function. We present scientific visualization for bioscience data (cancer biophysics, molecular diffusion, and medical imaging) that have been rendered using OSPRay and VR to gain insights.
Ayat Mohammed, Anne Bowen, and Hadley Vaughn, Texas Advanced Computing Center (TACC)
This talk presents DASH, a C++ template library of distributed data structures and parallel algorithms. DASH makes available the memory of multiple compute nodes and offers a rich set of data distribution patterns and interoperability with the C++ standard template library (STL).
Karl Fuerlinger, Ludwig-Maximilian-Universtiy (LMU), Munich
Programming the complex memory subsystem of the Intel® Xeon Phi™ processor is challenging. We analyze the system in detail and provide some guidance for writing close-to-optimal applications.
Torsten Hoefler, ETH Zurich
We detail the evolution our previously GPU-centric visualization framework, MegaMol, to take advantage of software-defined visualization and make it ready for in-situ visualization. We show where visualization is a valuable addition to the scientific workflow and talk about interfacing simulation code and MegaMol.
Tobias Rau, VISUS University of Stuttgart
We share a case study on parallel tasking frameworks by comparing performance using Fast Multipole Method mini-app, which is used for a parallel task model of computation. The mini-app is ported to Intel® TBB, Intel Cilk™ Plus, OpenMP 4.0, OpmSs. See how different methods of synchronization can affect the performance.
Patrick Atkinson and Simon McIntosh-Smith, University of Bristol
Hydra is a BP* internal visualization application that is used both to improve image quality control and visualization for researchers and to enable quick implementation of new features. In this talk, we present the successful integration of OSPRay as a plug-in (named Probe) into Hydra.
Jim Ching-Rong Lin, BP
Learn about basic principles of rendering, Blender* software, and its rendering engine, Cycles. See how we ported this code to Intel® Xeon Phi™ processors and how hybrid parallelization combining OpenMP and MPI was used to utilize multiple nodes for rendering.
Lubomir Riha, IT4Innovations, VSB – Technical University of Ostrava
This talk presents an overview of the new features that Fortran 2015 provides to address future exascale computing challenges and will discuss our experience with using these features on Intel® Xeon Phi™ platforms.
Damian Rouson, Sourcery Institute
Alessandro Fanfarillo, National Center for Atmospheric Research