Intel® Pentium® Processors

Reducing the Impact of Misaligned Memory Accesses


Introduction

Misalignment of memory access is a problem commonly encountered when optimizing code with Streaming SIMD Extensions 2 (SSE2). An SSE2 algorithm often requires loading and storing data 16 bytes at a time to match the size of the XMM registers. If alignment cannot be guaranteed, some part of the performance gain achieved by processing multiple data elements in parallel will be lost because either the compiler or assembly programmer must use unaligned move instructions.

  • Developers
  • Intel® Streaming SIMD Extensions
  • SSE2
  • Graphics
  • Intel® Pentium® Processors
  • Optimizing Vertex Linear Interpolation with Intel® Pentium® III and Pentium® 4 Processors

    Introduction

    Increasing processor speeds have led to the introduction of more complex algorithms that provide games with a more dynamic player environment, resulting in a decline in the use of vertex linear interpolation for animations. Additionally, vertex processing in hardware is increasingly minimizing the need for software interpolation. However, most users do not have hardware vertex processors, so it's important to optimize your software interpolation to support this gaming segment.

  • visual computing
  • Game Development
  • Graphics
  • Intel® Pentium® Processors
  • How to Vectorize Code on 32-Bit Intel® Architecture


    Challenge

    Vectorize code for greater performance. The SIMD features of Streaming SIMD Extensions (SSE), Streaming SIMD Extensions 2 (SSE2) and MMX™ technology require new methods of coding algorithms. One of them is vectorization. Vectorization is the process of transforming sequentially executing, or scalar, code into code that can execute in parallel, taking advantage of the SIMD architecture parallelism.

  • SSE2
  • Intel® Pentium® Processors
  • Developing for Speed: A Four-Step Approach

    by George Walsh


    Introduction

    There's really no denying that application optimization yields performance benefits. The question in each case is whether time spent optimizing and resulting performance gains is worth the development time, effort and cost. As part of his work with the Intel® software team Eric Palmer works closely with ISVs, helping to boost performance of their applications. Palmer has developed a four-step approach--it works for him, and it will work for your applications too.

  • Developers
  • Server
  • Intel® Pentium® Processors
  • Optimization
  • Using htmobile Library Functions to Discover Processor Information


    Introduction

    The htmobile library provides a convenient means to retrieve feature and identification data to improve application performance.

    The variety of hardware platforms currently available requires applications to have access to increasingly large amounts of hardware information in order to run efficiently.

  • Developers
  • Intel® Pentium® Processors
  • Mobility
  • Threading
  • Linux* on Intel® Architecture Improved Performance and TCO


    Introduction

    During the design of the Pentium® 4 processor, Intel's design-engineering team reaped the rewards of migrating from legacy RISC*-based systems to Linux* on Intel architecture.

    Design-engineering teams at Intel have achieved substantial savings in Total Cost of Ownership (TCO) by migrating to Linux on Intel® architecture from legacy UNIX* systems on RISC-based architecture. At the same time, those teams have achieved substantial productivity gains from the performance benefits associated with the migration.

  • Developers
  • Linux*
  • Intel® Pentium® Processors
  • Open Source
  • Возможное будущее микропроцессорной индустрии

    В этой статье поговорим о возможном развитии микропроцессорной индустрии. А именно перепрограммирования микропроцессоров на лету.
  • Server
  • WhatIf Software
  • Graphics
  • Intel® Pentium® Processors
  • Intel® Itanium® Processors
  • Intel® Atom™ Processors
  • Subscribe to Intel® Pentium® Processors