Improving performance

How to use Intel® Advisor XE 2015 to model suitability on an Intel® Xeon Phi™ coprocessor

Introduction

Intel® Advisor XE 2015 now includes some new capabilities for analyzing Intel® Xeon Phi™ coprocessor applications. This article steps through this analysis on an Intel Xeon Phi coprocessor and also outlines some of the new capabilities.

Building the application

The application we are using is one of the samples included in the Intel Advisor XE. It is located in C:\Program Files (x86)\Intel\Advisor XE 2015\samples\en\C++\tachyon_Advisor.zip. To build the application on the Microsoft Windows* OS:

  • Developers
  • Intermediate
  • Intel® VTune™ Amplifier for Systems Usage Models

            Intel® VTune™ Amplifier for Systems is available through Intel(R) System Studio product. It is a software performance analysis tool for users developing serial and multithreaded applications on Embedded and Mobile system. VTune Amplifier supports multiple usage modes for various target systems depending on your development environment and target environment. In this article, we will describe the Vtune Amplifier usage modes and the recommended modes for different target systems.

  • Developers
  • Android*
  • Linux*
  • Tizen*
  • Yocto Project
  • Android*
  • C/C++
  • Java*
  • Beginner
  • Intel® System Studio
  • VTune Amplifier for systems
  • performance tuning
  • remote collection
  • Development Tools
  • How to analyze Intel® Xeon Phi™ coprocessor applications using Intel® VTune™ Amplifier XE 2015

     

    Introduction

     

    Intel® VTune™ Amplifier XE 2015 now includes some new capabilities for analyzing Intel® Xeon Phi™ coprocessor applications. This article will step through this analysis on a Intel® Xeon Phi™ coprocessor and also outline some of the new capabilities.

     

    Compiling and running the application

  • Intermediate
  • Useful links for Intel® VTune™ Amplifier for Systems

     

    Intel® VTune™ Amplifier  for Systems is part of the Intel® System Studio, http://software.intel.com/en-us/intel-system-studio, suite of embedded tools.

    Some useful articles and videos on using Intel® VTune™ Amplifier  for Systems:

    Videos

    Remote collection

  • Developers
  • Linux*
  • MeeGo*
  • Moblin*
  • Tizen*
  • Yocto Project
  • C/C++
  • Advanced
  • Beginner
  • Intermediate
  • Intel® System Studio
  • Анализ производительности Java на устройствах Android с помощью Intel® VTune™ Amplifier 2014 for Systems

    Intel® VTune™ Amplifier 2014 for Systems поддерживает анализ функций Java и доступ к JIT-ассемблеру, Java Source и Dex* для функций, динамически скомпилированных на рутованных устройствах Android*, где запущена виртуальная машина Java/Dalvik* с оснасткой.  Прочтите эту статью, чтобы узнать, как запустить будущую версию VTune Amplifier for Systems, чтобы включить анализ Java на ART* JVM.

    Если возникают следующие проблемы:

  • Developers
  • Android*
  • Android*
  • C/C++
  • Java*
  • Advanced
  • Intermediate
  • Intel® System Studio
  • Intel® VTune™ Amplifier
  • VTune Amplifier Java Dalvik Android
  • Development Tools
  • Mobility
  • Optimization
  • How to analyze OpenMP* applications using Intel® VTune™ Amplifier XE 2015

     

    Introduction

     

    Intel® VTune™ Amplifier XE 2015 now includes extensive capabilities for analyzing OpenMP applications. This article will step through this analysis on an Intel® Xeon Phi™ coprocessor.

     

    Compiling and running the application

     

    The application we will be using is one of the samples included in VTune Amplifier. It is located in /opt/intel/vtune_amplifier_xe_2015/samples/en/C++/matrix_vtune_amp_xe.tgz. To build the application on Linux*:

  • Developers
  • Linux*
  • C/C++
  • Fortran
  • Intermediate
  • Intel® Parallel Studio XE
  • Intel® VTune™ Amplifier
  • Development Tools
  • Parallel Computing
  • Threading
  • How Intel® AVX2 Improves Performance on Server Applications

    The latest Intel® Xeon® processor E5 v3 family includes a feature called Intel® Advanced Vector Extensions 2 (Intel® AVX2), which can potentially improve application performance related to high performance computing, databases, and video processing. Here we will explain the context, and provide an example of how using Intel® AVX2 improved performance for a commonly known benchmark.

  • Developers
  • Partners
  • Students
  • Linux*
  • Server
  • Intermediate
  • Intel® C++ Compiler
  • AVX2
  • AVX
  • SSE
  • server
  • High Performance Linpack
  • LINPACK Benchmark
  • Linpack
  • Enterprise
  • Parallel Computing
  • Threading
  • Vectorization
  • Improve Intel MKL Performance for Small Problems: The Use of MKL_DIRECT_CALL

    One of the big new features introduced in the Intel MKL 11.2 is the greatly improved performance for small problem sizes. In 11.2, this improvement focuses on xGEMM functions (matrix multiplication). Out of the box, there is already a version-to-version improvement (from Intel MKL 11.1 to Intel MKL 11.2). But on top of it, Intel MKL introduces a new control that can lead to further significant performance boost for small matrices. Users can enable this control when linking with Intel MKL by specifying "-DMKL_DIRECT_CALL" or "-DMKL_DIRECT_CALL_SEQ".

  • Developers
  • Professors
  • Apple OS X*
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Microsoft Windows* 8.x
  • Unix*
  • Server
  • C/C++
  • Fortran
  • Advanced
  • Beginner
  • Intermediate
  • Intel® Math Kernel Library
  • small matrix
  • performance
  • Optimization
  • Subscribe to Improving performance