Intermediate

Improve Intel MKL Performance for Small Problems: The Use of MKL_DIRECT_CALL

One of the big new features introduced in the Intel MKL 11.2 is the greatly improved performance for small problem sizes. In 11.2, this improvement focuses on xGEMM functions (matrix multiplication). Out of the box, there is already a version-to-version improvement (from Intel MKL 11.1 to Intel MKL 11.2). But on top of it, Intel MKL introduces a new control that can lead to further significant performance boost for small matrices. Users can enable this control when linking with Intel MKL by specifying "-DMKL_DIRECT_CALL" or "-DMKL_DIRECT_CALL_SEQ".

  • Developers
  • Professors
  • Apple OS X*
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8.x
  • Unix*
  • Server
  • C/C++
  • Fortran
  • Advanced
  • Beginner
  • Intermediate
  • Intel® Math Kernel Library
  • small matrix
  • performance
  • Optimization
  • 在采用 TouchScript 的 Unity* 3D 中实施手势序列

    Download PDF

    By Lynn Thompson

    当配置触摸目标控制一个场景中的其他元素时,应最大限度地减少控制元素所占的屏幕空间。 这样,您可以将更多超极本™ 设备的可视屏幕区域用于显示视觉活动,更少用于用户交互。 实现它的一种方法是,让触摸目标能够处理多手势组合,从而无需在屏幕上设置更多触摸目标。 比如,使用连续点击图形用户界面 (GUI) widget 来触发炮楼旋转并开火,而非使用专门的 GUI widget 开火并使用另一个 widget 旋转炮楼(或 Unity* 3D 场景中的其他资产)。

    本文将介绍如何配置使用触摸目标的场景来控制第一人称控制器 (FPC)。 首先,为基本的 FPC 位置和旋转配置触摸目标;然后,将其扩大为附加功能。 附件功能可通过现有的 GUI widget 来实现,且不需要增加几何。 生成的场景将展示 Windows* 8 上运行的 Unity 3D 能够处理不同序列中使用的多手势。

  • Developers
  • Microsoft Windows* 8.x
  • Windows*
  • Unity
  • Intermediate
  • Perceptual Computing
  • windows
  • Game Development
  • Touch Interfaces
  • MVarandas Soluções em Tecnologia escolhe Crosswalk como fundação de seus aplicativos HTML5 para Android

    Usado em vários restaurantes como única forma de fazer um pedido, o "Menew Cardápio Digital", software desenvolvido por MVarandas Soluções em Tecnologia, está recebendo várias melhorias graças ao Crosswalk,  HTML5 runtime para Android desenvolvido pela Intel. O bom resultado também é fruto da colaboração entre as equipes de desenvolvimento de ambas as empresas.

  • Developers
  • Partners
  • Android*
  • Android*
  • HTML5
  • HTML5
  • Beginner
  • Intermediate
  • Development Tools
  • Open Source
  • What’s New in the Intel Compiler

    The list below summarizes new features in the Intel® C++ Compiler 15.0 and the Intel® Fortran Compiler 15.0. For more details about changes in the Intel compilers since the previous release, including a list of new options, please refer to the ‘What’s New’ section in the release notes (C++, Fortran).

  • Developers
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8.x
  • C/C++
  • Fortran
  • Advanced
  • Beginner
  • Intermediate
  • Intel® C++ Compiler
  • Intel® Fortran Compiler
  • OpenMP*
  • Development Tools
  • Intel® Core™ Processors
  • Intel® Many Integrated Core Architecture
  • Optimization
  • Parallel Computing
  • Threading
  • Vectorization
  • Intel® C++ Compiler 15.0 for Linux Sets “-ansi-alias” by Default, Noncompliant Code May Fail

    Beginning with version 15.0, the Intel® C++ Compiler for Linux enables -ansi-alias by default at optimization level 2 (-O2). The -ansi-alias option indicates that the code complies with ANSI aliasing rules.

    If the application code adheres to ANSI aliasing rules, this change allows the compiler to make aggressive optimizations. The effect of this change on runtime performance depends on the application, the optimization methods supported by the compiler, and the compiler switches used.

  • Developers
  • Linux*
  • C/C++
  • Intermediate
  • Intel® C++ Compiler
  • Intel® Many Integrated Core Architecture
  • Optimization
  • Vectorization
  • Using Intel® MPI Library 5.0 with MPICH based applications

    Why it is needed?

    Different MPI implementations have their specific benefits and advantages. So in the specific cluster environment the HPC application with the other MPI implementation can probably perform better.

     Intel® MPI Library has the following benefits:

  • Developers
  • Partners
  • Professors
  • Students
  • Linux*
  • Server
  • Advanced
  • Beginner
  • Intermediate
  • Intel® Cluster Toolkit
  • Intel® Trace Analyzer and Collector
  • Intel® MPI Library
  • Intel® Cluster Studio
  • Intel® Cluster Studio XE
  • Intel® Cluster Ready
  • Message Passing Interface
  • Cluster Computing
  • Development Tools
  • Data transfer of non-contiguous array elements using the Intel® Language Extensions for Offload (LEO) for the Intel® Xeon Phi™ coprocessor

    The Intel® Parallel Studio XE 2015 Composer Editions for C++ Windows* and Linux* have a feature enhancement supporting data transfer for non-contiguous array elements with the Intel® Language Extensions for Offload (LEO) for the Intel® Xeon Phi™ coprocessor.

    The feature adds support under the LEO offload data marshalling model for transferring non-contiguous array elements within an array variable reference (variable-ref) in the data transfer clauses (i.e. in, out, inout, nocopy) of the #pragma offload/offload_transfer statement.

  • Developers
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8.x
  • Server
  • C/C++
  • Intermediate
  • Intel® C++ Compiler
  • Intel® C++ Composer XE
  • Intel® Composer XE
  • Intel® Parallel Composer
  • Development Tools
  • Intel® Many Integrated Core Architecture
  • Subscribe to Intermediate