Tutorial

Tuning the Intel MKL DFT functions performance on Intel® Xeon Phi™ coprocessors

Overview

Intel® Math Kernel Library (Intel® MKL) includes the optimized DFT transform functions on Intel® Xeon Phi™ coprocessors. These functions are carefully vectorized and threaded to take advantage of the hardware features. This article provides some performance tuning tips on running MKL DFT function on Intel Xeon Phi coprocessors.  We will start with some simple example code.

Building the example code

  • Entwickler
  • Linux*
  • Server
  • C/C++
  • Fortran
  • Anfänger
  • Fortgeschrittene
  • Intel® Math Kernel Library
  • MIC
  • Xeon Phi
  • DFT
  • FFT
  • performance
  • offload
  • MKL
  • Intel® Streaming SIMD Extensions
  • Intel® Many Integrated Core Architektur
  • HTML5 前端框架 Bootstrap 使用教程

    1. 简介

    BootstrapTwitter推出的一个开源的前端框架。

    BootstrapTwitter的设计师Mark OttoJacob Thornton合作开发,由动态语言Less写成。它是一套“易用、优雅、灵活、可扩展”的前端工具集,提供了优雅的HTML/CSS规范。

  • Entwickler
  • Studenten
  • HTML5
  • JavaScript*
  • Windows* 8 Tutorial: Writing a Multithreaded Application for the Windows Store* using Intel® Threading Building Blocks.

    It is known that the Windows Store apps API is missing some commonly used thread functions, such as function CreateThread() and those that work with TLS keys. This is yet another great opportunity to move your application development from thread-based parallelism to task-based parallelism. This post shows step-by-step instructions for writing an  example that uses parallelism and that can pass validation by the Windows App Certification Kit (WACK). This example can be expanded upon to support clients for online games and more.

    Intel® System Studio - Multicore Programming with Intel® Cilk™ Plus

    Intel System Studio not only provides a variety of signal processing primitives via Intel® Integrated Performance Primitives (Intel® IPP), and Intel® Math Kernel Library (Intel® MKL), but also allows developing high-performance low-latency custom code (Intel C++ Compiler with Intel Cilk Plus). Since Intel Cilk Plus is built into the compiler, it can be used where it demands an efficient threading runtime in order to extract parallelism. Therefore it's possible to effectively introduce multicore parallelism even without introducing it into each of the important algorithms e.g., by employing a parallel pattern called pipeline. For custom code (e.g., code that's not reused via a library), one can rely (in addition to auto-vectorization) on an extended Array Notation incl. elemental functions (kernels) to explicitly vectorize at a higher level compared to ISA-specific intrinsic functions.
  • Entwickler
  • Studenten
  • Linux*
  • Yocto-Projekt
  • C/C++
  • Experten
  • Anfänger
  • Fortgeschrittene
  • Intel® C++-Compiler
  • Intel® Cilk™ Plus
  • Intel® Integrated-Performance-Primitives
  • Intel® Math Kernel Library
  • Intel® System Studio
  • embedded c programming
  • Embedded
  • Parallel Computing
  • Energieeffizienz
  • Threading
  • Vektorisierung
  • HTML5 前端框架 jQuery Mobile 使用教程

    1. 简介

    jQuery Mobile是由(MTMedia Temple联合多家移动设备厂商以及软件企业共同发起的针对触屏智能手机与平板电脑的Web应用的前端开发框架。

    jQuery Mobile构建于大名鼎鼎的jQuery 以及 jQuery UI类库之上,为前端开发人员提供了一个兼容所有主流移动设备平台的统一UI接口系统。拥有出色的弹性,轻量化以及渐进增强特性与可访问性。

  • Entwickler
  • Partner
  • Studenten
  • HTML5
  • JavaScript*
  • Anfänger
  • Seiten

    Tutorial abonnieren