Conserving Active Power
This article presents techniques to optimize applications to save power during active use. These techniques include multi-threading, batching of I/0 accesses, and reducing memory bandwidth. Some of the techniques are applicable for general program usage. Type: Technical Article |
|
05/04/2012
|
Intel® SDK for OpenCL* Applications Installation Guide
See the following links to the online resources and documents for the latest information regarding the Intel® SDK for OpenCL* Applications:
Intel® SDK for OpenCL* Applications at the Intel® Visual C ... Type: Technical Article |
openCL OpenCL SDK Intel OpenCL vcsource_domain_media vcsource_os_windows vcsource_platform_desktoplaptop vcsource_domain_graphics vcsource_product_oclsdk vcsource_domain_gamedev vcsource_index |
05/03/2012
|
How to build GCC* cilkplus branch in 64bit Ubuntu* 12.04
Intel® Cilk™ Plus is an open source project now. This article is about how to build the 'cilkplus' branch to support CilkPlus in GCC. Type: Technical Article |
ICC gcc Cilk Plus |
04/25/2012
|
Intel® OpenCL SDK - Frequently Asked Questions
To the Intel® SDK for OpenCL* Applications main page
Updated: April 23rd 2012
Table of Contents
Product Information
1. What is the Intel® SDK for OpenCL* Applications?
2. Is OpenCL* supported on ... Type: Technical Article |
openCL OpenCL SDK Intel OpenCL vcsource_domain_media vcsource_os_windows vcsource_platform_desktoplaptop vcsource_domain_graphics vcsource_product_oclsdk vcsource_domain_gamedev vcsource_index |
04/23/2012
|
The Full Check List for Optimized OpenCL* Application
To the Intel® SDK for OpenCL* Applications main page
Content Update: 23rd April 2012
Table of Contents
What is Intel SDK for OpenCL* Applications?
Why to optimize OpenCL* applications to Intel ... Type: Technical Article |
openCL OpenCL SDK Intel OpenCL vcsource_type_techarticle vcsource_os_windows vcsource_platform_desktoplaptop vcsource_domain_graphics vcsource_product_oclsdk vcsource_producttout_oclsdk vcsource_index |
04/23/2012
|
Using Intel® Power Gadget 2.0 to measure the energy performance of a compute-intensive application
Intel® Power Gadget 2.0 is a software-based power estimation tool enabled for Second Generation Intel® Core™ processors. Type: Technical Article |
|
04/23/2012
|
Case Study: Parallelizing a Recursive Problem with Intel® Threading Building Blocks
Intel worked closely with DreamWorks Animation engineers to improve the performance of a key rendering system library by up to 35X performance improvement in some cases. Type: Technical Article |
TBB Intel TBB rendering dreamworks vcsource_domain_graphics vcsource_product_tbb vcsource_type_casestudy vcsource_os_windows vcsource_platform_desktoplaptop |
04/13/2012
|
Intel® Performance Counter Monitor - A better way to measure CPU utilization
The Intel® Performance Counter Monitor provides sample C++ routines and utilities to estimate the internal resource utilization of the latest Intel® Xeon® and Core™ processors and gain a significant performance boost. Type: Technical Article |
monitoring Intel Performance Counter Monitor simultaneous multithreading out-of-order execution Intel® Performance Counter Monitor Intel® Xeon® Core™ processors multi-level caches pipelining |
04/13/2012
|
Designing Application Software for Energy-efficient Performance
Personal computers are designed to be in processor idle 75% of the time but in fact might more realistically be estimated to be idle in excess of 90% of the time because of the effects of imposed waits for user input, server response, and resource availab Type: Technical Article |
"Intel® Battery Life Analyzer" "Intel® Energy Checker SDK" |
04/11/2012
|
Unhandled Exceptions when Debugging OpenMP applications
When debugging OpenMP applications built with the Intel Compiler unhandled exceptions may occur. Type: Technical Article |
|
04/04/2012
|
Using Intel® Power Checker to measure the energy performance of a compute-intensive application
Intel® Power Checker provides developers with a quick and easy way to evaluate the idle power efficiency of their applications on mobile platforms with Intel® Core™ processor or Intel® Atom™ technology running the Microsoft Windows* operating system. Type: Technical Article |
"Intel Power Checker" |
03/15/2012
|
Using Intel Cluster Checker to check that MPI applications will properly run over InfiniBand
Using Intel Cluster Checker to check that MPI applications will properly run over InfiniBand. Type: Technical Article |
Intel Cluster Ready Intel Cluster Checker intel_mpi_rt_internode |
03/07/2012
|
Using Intel® Inspector XE 2011 to Find Data Races in Multithreaded Code
Intel Inspector XE 2011 automatically finds memory errors, deadlocks and other conditions that could lead to deadlocks, data races, thread . Some specific issues associated with debugging multithreaded applications will be discussed in this article. Type: Technical Article |
critical section OpenMP Debugger data races Intel Parallel Inspector threading |
02/03/2012
|
90 errors in open-source projects
There are actually 91 errors described in the article, but number 90 looks nicer in the title. The article is intended for C/C++ programmers, but developers working with other languages may also find it interesting. Type: Technical Article |
errors C++ open source bugs cpp PVS-Studio code review static code analyzer Security Community |
02/02/2012
|
Multi-threaded Rendering and Physics Simulation
by Rajshree Chabukswar, Adam T. Lake, and Mary R. Lee, Intel® Software Solutions Group
Introduction
Learn how to decouple rendering and physical simulation in a multi-threaded environment with a ... Type: Technical Article |
Multithreading physics visual computing |
01/12/2012
|
Introduction to OpenCL™
Open Compute Language (OpenCL™) provides a framework to write programs in C-like language that can run on heterogeneous cores such as CPUs, GPUs or specialized hardware. This white paper provides a br ... Type: Technical Article |
|
01/05/2012
|
OpenCL™ – Using Events
Introduction
This white paper is the fourth in a series of white papers on OpenCL describing how to set up and use events in multithreaded design. This white paper will go over various design choices ... Type: Technical Article |
|
12/21/2011
|
OpenCL™ - Programming for CPU Performance
This white paper is the third in a series of whitepapers on OpenCL™ describing how to best utilize underlying Intel hardware architecture using OpenCL. This white paper will go over programming conside ... Type: Technical Article |
|
12/21/2011
|
Avoiding AVX-SSE Transition Penalties
Avoiding AVX-SSE Transition Penalties (PDF 678 KB)
Transitioning between 256-bit Intel® AVX instructions and legacy Intel® SSE instructions within a program may cause performance penalties because the ... Type: Technical Article |
Intel AVX Sandy Bridge Intel® SSE |
11/10/2011
|
Use Non-blocking Locks When Possible
Non-blocking system calls allow the competing thread to return on an unsuccessful attempt to the lock, and allow useful work to be done, thereby avoiding wasteful utilization of execution resources at the same time. Type: Technical Article |
|
11/04/2011
|
Choosing Appropriate Synchronization Primitives to Minimize Overhead
Currently, there are a number of synchronization mechanisms available, and it is left to the application developer to choose an appropriate one to minimize overall synchronization overhead. Type: Technical Article |
atomic operations synchronization threading Win32 threads system overhead mutual exclusion PPGuide |
11/04/2011
|
Use Synchronization Routines Provided by the Threading API Rather than Hand-Coded Synchronization
Application programmers sometimes write hand-coded synchronization routines rather than using constructs provided by a threading API in order to reduce synchronization overhead or provide different functionality than existing constructs offer. Type: Technical Article |
Hyper-Threading OpenMP synchronization threading Pthreads Win32 threads spin-wait PPGuide |
11/04/2011
|
Managing Lock Contention: Large and Small Critical Sections
This topic introduces the concept of critical section size, defined as the length of time a thread spends inside a critical section, and its effect on performance. Type: Technical Article |
|
11/04/2011
|
Using AVX Without Writing AVX Code
Using AVX Without Writing AVX Code (PDF 260KB)
Abstract
Intel® Advanced Vector Extensions (Intel® AVX) is a new 256-bit instruction set extension to Intel® Streaming SIMD Extensions (Intel® SSE) and ... Type: Technical Article |
|
11/04/2011
|
Exploiting Data Parallelism in Ordered Data Streams
This article identifies some of these challenges and illustrates strategies for addressing them while maintaining parallel performance. Type: Technical Article |
data parallelism I/O threading order dependence PPGuide |
11/04/2011
|