Next Generation Intel® Processor: Software Developers Guide
Introduction
This guide discusses the Intel® Pentium® 4 processor with Hyper-Threading Technology (code named Prescott).
Prescott is the code name for a new generation of IA32 processors. The te ... |
08/24/2009
|
How to Vectorize Code Using Intrinsics on 32-Bit Intel® Architecture
Challenge
Vectorize code by means of intrinsics. Intrinsics provide the access to the ISA functionality using C/C++ style coding instead of assembly language. Consider the following simple loop:
... |
08/13/2009
|
How to Vectorize Code Automatically on 32-Bit Intel® Architecture
Challenge
Vectorize code automatically. Consider the following simple loop:
void add(float *a, float *b, float *c)
{
int i;
for (i = 0; i < 4; i++) {
c[i] = a[i] + b[i];
}
}
... |
08/13/2009
|
Intel® compiler options for SSE generation (SSE2, SSE3, SSSE3, SSE4) and processor-specific optimizations
Explains which Intel Compiler switches to use to target and optimize for a specific platform, cpu or processor. Type: Technical Article,Code |
07/13/2009
|
Block-Matching In Motion Estimation Algorithms
Introduction
The Streaming SIMD Extensions 2 (SSE2) technology introduces new Single Instruction Multiple Data (SIMD) double-precision floating-point instructions and new SIMD integer instruction ... Type: Code |
01/14/2009
|
Intel® Software Development Emulator Release Notes
Release notes for the Intel(R) Software Development Emulator Author: Mark Charney (Intel) Type: Download |
01/12/2009
|
Threading Basics for Games
by Jeff Andrews
Introduction
Take advantage of Intel® dual-core processors in your game development and learn how Intel® Compilers can optimize your code.
Even with the benefits provided by Hyp ... Author: Jeff Andrews (Intel) |
10/23/2008
|
Optimized Matrix Library for use with the Intel® Pentium® 4 Processor's SSE2 Instructions
Introduction
On January 2000, Intel published an optimized matrix library (4D single-precision matrix and vector classes) for use with Pentium® III Streaming SIMD (Single Instruction Multiple Data ... Type: Code |
10/22/2008
|
Optimizing for the Intel® Pentium® 4 Processor Using Assembly Language
by Khang NguyenIntel Corporation
Introduction
When talking about optimizing programs for the Intel® Pentium® 4 processor, people usually think about using Streaming SIMD Extensions (SSE) and Streamin ... Author: Khang Nguyen (Intel) |
10/20/2008
|
Developing for Speed: A Four-Step Approach
by George Walsh
Introduction
There's really no denying that application optimization yields performance benefits. The question in each case is whether time spent optimizing and resulting perform ... Author: gbwalsh Type: Technical Article |
10/20/2008
|
Optimizing Performance of the AES Algorithm for the Intel® Pentium® 4 Processor
About This Paper
This paper describes how to optimize the Advanced Encryption Standard (AES) algorithm for the Intel® Pentium® 4 processor. A brief overview of the original AES code precedes a des ... |
10/20/2008
|
Fast SIMD Integer Move for the Intel® Pentium® 4 Processor
Introduction
Several instructions are available on the Intel® Pentium® 4 Processor for moving integer data between SIMD registers. However, it may be more beneficial to use other instructions as a rep ... |
10/20/2008
|
Pentium® 4 Processor Faster at Math
by Andrew Binstock
Introduction
The Pentium® 4 processor's Streaming SIMD Extensions 2 (SSE-2) are new processor instructions designed to accelerate the performance of applications that use doubl ... Author: Andrew Binstock |
10/17/2008
|
Cross Intel® Architecture Development Tools
Introduction
Recent years have yielded an amazing number of new operating systems, new processors, and new platform capabilities that provide exciting opportunities for application developers. Con ... |
10/17/2008
|
Absolute-Difference Motion Estimation for Intel® Pentium® 4 Processors
Introduction
The media extensions to the Intel Architecture (IA) instruction set include single-instruction, multiple-data (SIMD) instructions. Streaming SIMD Extensions 2 (SSE2) instructions extend S ... |
10/17/2008
|
Introducing Intel® NetBurst® MicroArchitecture Optimization
Introduction
A Deeper Pipeline and New Cache Structure
The Intel NetBurst® microarchitecture is a new feature from Intel that was introduced in the Pentium® 4 and Intel® Xeon™ processors. Althoug ... |
10/17/2008
|
SSE2 Instructions in a Double-precision 3D Transform
Inroduction
The Streaming SIMD Extensions 2 (SSE2) technology introduces new Single Instruction Multiple Data (SIMD) double-precision floating-point instructions and new SIMD integer instructions ... Type: Code |
10/03/2008
|
Using Streaming SIMD Extensions 2 (SSE2)
Introduction
The Streaming SIMD Extensions 2 (SSE2) introduces new Single Instruction Multiple Data (SIMD) double-precision floating-point instructions and new SIMD integer instructions into the I ... Type: Code |
10/03/2008
|
Intel® Pentium® 4 Processor Optimization Reference Manual
Introduction
Developing high-performance applications for IA-32 Intel® architecture processors can be more efficient with better understanding of the Intel NetBurst™ microarchitecture and the Pentium® ... |
10/03/2008
|
Integrating Fast Math Libraries for the Intel Pentium® 4 Processor
Introduction
Simple math functions are known to be a bottleneck in a wide variety of floating-point applications, ranging from financial analytics to 2D image manipulation to 3D physics engines. Th ... |
10/03/2008
|
Streaming SIMD Extensions 3 Enabling for the MS .Net 2003
by James Rose, Sr. Application Engineer CSD/AET Client Enabling Technology
Introduction
The Streaming SIMD Extensions 3 instructions (also known as SSE3) add important new capabilities to the Int ... |
10/02/2008
|
Streaming SIMD Extensions 3 Enabling for the Microsoft .NET* Compiler 2003
Introduction
By James RoseSr. Application EngineerCSD/AET Client Enabling TechnologyThe Streaming SIMD Extensions 3 instructions (also known as SSE3) add important new capabilities to the Intel® Penti ... |
10/02/2008
|
Threading Methodology: Principles and Practice
Introduction
Today's operating systems strive to make the most efficient use of a computer's resources. Most of this efficiency is gained by sharing the machine's resources among several tasks (mu ... |
09/16/2008
|
Differences in Optimizing for the Pentium® 4 Processor vs. the Pentium® III Processor
by Bryan HayesHayes Technologies
Introduction
Abstract
The Pentium® 4 processor introduces a completely new micro-architecture for the IA32 instruction set, the NetBurst™ architecture. It is spe ... |
09/09/2008
|
Fast Random Number Generator on the Intel® Pentium® 4 Processor
by Kipp Owens, Applications Engineer &Rajiv Parikh, Sr. Applications EngineerSoftware Solutions Group, Intel Corporation.
Abstract
This paper shows how to speed up a commonly used pseudo-rand ... Author: Rajiv Parikh (Intel),kippowens |
09/09/2008
|