Experts

Late-initialization of frame descriptors in Cilk Plus/LLVM

The Intel® Cilk™ Plus C/C++ language extensions support the expression of portable and efficient task-parallel and vector-parallel programs. Cilk Plus/LLVM is an implementation of these extensions in the Clang frontend for LLVM. In this article we explain one of the optimizations that we have implemented in Cilk Plus/LLVM: late initialization of frame descriptors[1]. Along the way, we show how one of the Intel® Cilk™ keywords, cilk_spawn, is implemented under the hood.
  • Developers
  • Professors
  • Students
  • C/C++
  • Experts
  • Advanced
  • Intel® Cilk™ Plus
  • Cilk Plus
  • Choosing the right threading framework

    This is the second article in a series about High Performance Computing with the Intel Xeon Phi. The Intel Xeon Phi is Intel's first commercial product to incorporate the Many Integrated Core architecture. In this article I will present various frameworks for unleashing the power of multiple threads on the Xeon Phi. We will also look at the interesting properties, advantages, and disadvantages of each framework.

  • Developers
  • Professors
  • Students
  • Linux*
  • Server
  • C/C++
  • Experts
  • Advanced
  • Intel® C++ Composer XE
  • Intel® Cilk™ Plus
  • Intel® Threading Building Blocks
  • MIC
  • Xeon
  • Phi
  • performance
  • threading
  • openMP
  • Pthreads
  • C++11
  • boost
  • tbb
  • CilkPlus
  • OpenMP*
  • Cluster Computing
  • Parallel Computing
  • Threading
  • VTune Amplifier XE 2013 installation on clusters

    Using VTune Amplifier XE 2013 on clusters or other distributed environments that have an LDAP/NIS administration model and share file resources over NFS is as easy as using it on a local machine. The only tricky part is installing the kernel drivers.
  • Developers
  • Linux*
  • Experts
  • Intel® VTune™ Amplifier XE
  • VTune Amplifier XE
  • Development Tools
  • What's new? Update 7 - Intel® VTune™ Amplifier XE 2013

    Intel® VTune™ Amplifier XE 2013

    Intel® VTune™ Amplifier XE is an easy-to-use performance and thread profiler for C, C++, C#, Fortran, Java and MPI developers. No special recompiles are needed; just start profiling. Hotspots are highlighted in the source. A powerful timeline makes it easy to tune your application and scale performance on multicore processors.

  • Developers
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8
  • Unix*
  • Server
  • .NET*
  • C#
  • C/C++
  • Fortran
  • Java*
  • Experts
  • Beginners
  • Advanced
  • Intel® Fortran Composer XE
  • Intel® C++ Studio XE
  • Intel® Cluster Studio XE
  • Intel® Fortran Studio XE
  • Intel® Parallel Studio XE
  • Intel® VTune™ Amplifier XE
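  • Optimizing the Power Consumption of Multimedia Applications on Intel® Platforms

    After years of hardware optimization, platform power consumption has fallen year after year. CPUs have gained new low-power states, and display power consumption has dropped substantially. The power consumed by the software running on the platform, however, has become an increasingly visible problem. On the software side, the impact of multimedia applications on power consumption draws particular attention. In fact, studies show that an optimized multimedia application achieves more than twice the playback time of an unoptimized one. This white paper describes the factors to consider when designing and developing green multimedia applications, and how to analyze and optimize the power consumption of multimedia software on Intel® platforms. It is intended for ISVs, OEMs, and other technical professionals.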

  • Developers
  • Partners
  • Professors
  • Students
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8
  • Windows*
  • Experts
  • Static Analysis
  • Debugging
  • Media Processing
  • Microsoft Windows* 8 Desktop
  • Weird OpenMP Reduction

    Typical reductions in OpenMP* involve using an associative operator op to do local reductions, and then using a reduction clause to collect those local reductions. For example, the following code computes a dot product by computing local sums on each thread and then summing them.
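
    The teaser does not carry the article's own listing, so the block below is only a minimal sketch of the pattern described, not the author's code. The function name dot_product and its parameters are hypothetical. It assumes a compiler with OpenMP enabled (for example the -fopenmp flag in GCC or Clang); without it the pragma is ignored and the loop simply runs serially.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: dot product of two arrays of length n.
 * With OpenMP enabled, reduction(+:sum) gives each thread a private
 * copy of sum initialized to 0, lets it accumulate its share of the
 * iterations, and then adds the private copies into the shared sum. */
double dot_product(const double *a, const double *b, int n)
{
    /* Precondition: non-empty input requires valid pointers. */
    assert(n <= 0 || (a != NULL && b != NULL));

    double sum = 0.0;

    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; ++i) {
        sum += a[i] * b[i];
    }

    return sum;
}
```

    Because floating-point addition is not exactly associative, the parallel result may differ from a serial sum in the last bits when the order of the partial sums changes.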

  • Students
  • C/C++
  • Experts
  • OpenMP*
  • openMP
  • puzzle
  • Using Intel® MPI Library and Intel® Xeon Phi™ coprocessor tips

    1. Check prerequisites

    • Each host and each Intel® Xeon Phi™ coprocessor should have a unique IP address across the cluster;
    • ssh access between host(s) and Intel® Xeon Phi™ coprocessor(s) should be password-less;
    • Update the Intel® Manycore Platform Software Stack (Intel® MPSS) to the current version;
  • Developers
  • Students
  • Linux*
  • Experts
  • Advanced
  • Cluster Tools
  • Intel® MPI Library
  • Intel® Cluster Studio
  • Intel® Cluster Studio XE
  • Cluster Computing
  • Intel® Many Integrated Core Architecture
  • Parallel Computing