Explicit offload for Quantum ESPRESSO


This code recipe describes how to get, build, and use the Quantum ESPRESSO code that includes support for the Intel® Xeon Phi™ coprocessor with Intel® Many-Integrated Core (MIC) architecture. This recipe focuses on how to run this code using explicit offload.

  • Desarrolladores
  • Linux*
  • Servidor
  • Intermedio
  • Intel® MPI Library
  • Quantum ESPRESSO
  • Intel® Xeon Phi™ Coprocessor
  • Arquitectura Intel® para muchos núcleos integrados
  • Simple offloaded code, enormous time consuming

    Dear all,

    I recently started using Xeon Phi cards for parallel programming, so I am still a newbie in this field.

    I wrote this code as a simple example to start understanding this fascinating world, but I got surprised when I looked at the time of executions.

    When I run the code on the host, execution time is 0,08 s. When I run the code adding the pragma offload and pragma omp parallel for, execution time increase up to 9s!

    When I compiled the codes, I used -O3 optimization for both of them.

    Is there something I am missing?


    Suscribirse a Linux*