A Structured Performance Optimization Framework for Simultaneous Heterogeneous Computing

Heterogeneous computing platforms with a multicore host system and many-core accelerator devices have taken a major step forward in the mainstream HPC market this year with the announcement of the HP Apollo 6000 System's ProLiant XL250a server, which supports Intel® Xeon Phi™ coprocessors. Although many application developers attempt to use such platforms in the same way as GPGPU acceleration platforms, doing so forfeits the processing capability of the multicore host processors and introduces power inefficiency in corporate IT operations.

  • Developers
  • Linux*
  • C/C++
  • Intermediate
  • Intel® Math Kernel Library
  • Code Modernization
  • Financial Services Industry
  • Intel® Many Integrated Core Architecture
  • Parallel Computing
  • Parallelization
  • Vectorization
  • A quick peek at hardware-accelerated Android* emulation, using Intel® Hardware Accelerated Execution Manager (Intel® HAXM)

    A look at the advantages of using Intel® HAXM for Android* emulation. Intel® HAXM speeds up Android app emulation on a host development machine and is available as a feature of Intel® INDE or as a standalone release.

    offload error: cannot release buffer memory on device 0 (error code 14)

    I need your help. 

    I tried to run the K-means algorithm on Xeon Phi using offload mode.

    But when I tried to enter the offload region with the clause '#pragma offload ~~' (as in attached pic 1),

    I got the error 'offload error: cannot release buffer memory on device 0 (error code 14)'.

    I have no idea how to solve this problem, and I cannot find any previous example similar to my problem on Google.

    I looked at the offload report using 'export OFFLOAD_REPORT=3', but I couldn't get any hints.

    Please help me!


    TaeHyeok, Jang

    How to protect code from triggering MIC build on non-MIC nodes

    We have two clusters in-house, one with MIC cards and another without. When we build code with OpenMP 4.x pragmas or functions for devices, we get a compilation error on the cluster without MIC cards:

    icc: warning #10362: Environment configuration problem encountered.
    Please check for proper MPSS installation and environment setup.
    testomp.c(1): catastrophic error: *MIC* cannot open source file "stdio.h"
       #include <stdio.h> 
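One portable way to keep device code out of the build on the non-MIC cluster is to guard the device pragmas with a preprocessor macro that only the MIC build defines. The sketch below assumes a hypothetical, project-defined macro `USE_DEVICE_OFFLOAD` and a hypothetical function `sum_of_squares`; neither comes from the original post, and this is an illustration of the guarding idea rather than a confirmed fix for the error above.

```c
#include <assert.h>
#include <stddef.h>

/* USE_DEVICE_OFFLOAD is a hypothetical, project-defined macro (an assumption
   of this sketch). Define it, e.g. with -DUSE_DEVICE_OFFLOAD, only in the
   build for the cluster that has MIC cards. */
static double sum_of_squares(const double *x, int n)
{
    double sum = 0.0;

    assert(x != NULL && n >= 0);

#ifdef USE_DEVICE_OFFLOAD
    /* Device path: the compiler only sees this pragma when the macro is set. */
    #pragma omp target map(to: x[0:n]) map(tofrom: sum)
#endif
    /* Host path: with the macro undefined, this is an ordinary host-side
       parallel loop (or a serial loop if OpenMP is disabled). */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; ++i)
        sum += x[i] * x[i];

    return sum;
}
```

With the macro undefined, the source contains no device constructs, so the same file can be compiled on both clusters; calls to device runtime functions would need the same guard.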


    kmp_reduce non atomic


    I have studied the OpenMP reduction function kmp_reduce, and my conclusion is:

    if (lck!=NULL) ==> we can do CRITICAL version
    if (reduce_data!=NULL && reduce_func!=NULL) ==> we can do TREE version
    if ((loc->flags & KMP_IDENT_ATOMIC_REDUCE) == KMP_IDENT_ATOMIC_REDUCE) ==> we can do ATOMIC version

    So I have created 3 flags that test the conditions above (CRITICAL, TREE, ATOMIC).

    I have run some tests of OpenMP reductions to look at the values of these flags, but they always seem to be 1.
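The three conditions listed in the post can be written as independent tests, as in the minimal sketch below. All names here (`available_methods`, `CAN_CRITICAL`, `reduce_fn`, `add_int`) are hypothetical, and `SKETCH_ATOMIC_REDUCE_FLAG` is a placeholder standing in for `KMP_IDENT_ATOMIC_REDUCE`; the real constant and types live in the OpenMP runtime's internal headers. Because the tests do not exclude one another, all three flags can be set for the same reduction, which would be consistent with every flag reading 1.

```c
#include <assert.h>
#include <stddef.h>

/* Placeholder bit for illustration only; not taken from the runtime headers. */
#define SKETCH_ATOMIC_REDUCE_FLAG 0x10

/* One bit per reduction method whose precondition holds. */
enum {
    CAN_CRITICAL = 1 << 0,
    CAN_TREE     = 1 << 1,
    CAN_ATOMIC   = 1 << 2
};

/* Hypothetical signature for a reduction combiner. */
typedef void (*reduce_fn)(void *lhs, void *rhs);

/* Returns a bitmask of the methods allowed by the conditions in the post. */
static unsigned available_methods(int loc_flags, void *reduce_data,
                                  reduce_fn reduce_func, void *lck)
{
    unsigned methods = 0;

    if (lck != NULL)
        methods |= CAN_CRITICAL;   /* a lock is available */
    if (reduce_data != NULL && reduce_func != NULL)
        methods |= CAN_TREE;       /* data and combiner are available */
    if ((loc_flags & SKETCH_ATOMIC_REDUCE_FLAG) == SKETCH_ATOMIC_REDUCE_FLAG)
        methods |= CAN_ATOMIC;     /* compiler marked the atomic path usable */

    return methods;
}

/* Example combiner used only to exercise the check. */
static void add_int(void *lhs, void *rhs)
{
    *(int *)lhs += *(int *)rhs;
}
```

A test that distinguishes the cases needs inputs where at least one condition fails, for example a call with a null lock pointer or with the atomic bit cleared.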

    Attend IDF'15: HPC Developer Showcase

    Intel has just launched the Intel® Modern Code Developer Community to help HPC developers code for maximum performance on current and future hardware. The initiative is aimed at more than 400,000 HPC-focused developers and partners.

  • Developers
  • Professors
  • Linux*
  • Server
  • C/C++
  • Fortran
  • Intermediate
  • Message Passing Interface
  • OpenMP*
  • Code Modernization
  • Intel® Many Integrated Core Architecture