英特尔® 奔腾® 处理器

Reducing the Impact of Misaligned Memory Accesses


Introduction

Misalignment of memory access is a problem commonly encountered when optimizing code with Streaming SIMD Extensions 2 (SSE2). An SSE2 algorithm often requires loading and storing data 16 bytes at a time to match the size of the XMM registers. If alignment cannot be guaranteed, some part of the performance gain achieved by processing multiple data elements in parallel will be lost because either the compiler or assembly programmer must use unaligned move instructions.

  • 开发人员
  • 英特尔® SIMD 流指令扩展
  • SSE2
  • 图形
  • 英特尔® 奔腾® 处理器
  • Optimizing Vertex Linear Interpolation with Intel® Pentium® III and Pentium® 4 Processors

    Introduction

    Increasing processor speeds have led to the introduction of more complex algorithms that provide games with a more dynamic player environment, resulting in a decline in the use of vertex linear interpolation for animations. Additionally, vertex processing in hardware is increasingly minimizing the need for software interpolation. However, most users do not have hardware vertex processors, so it's important to optimize your software interpolation to support this gaming segment.

  • visual computing
  • 游戏开发
  • 图形
  • 英特尔® 奔腾® 处理器
  • How to Vectorize Code on 32-Bit Intel® Architecture


    Challenge

    Vectorize code for greater performance. The SIMD features of Streaming SIMD Extensions (SSE), Streaming SIMD Extensions 2 (SSE2) and MMX™ technology require new methods of coding algorithms. One of them is vectorization. Vectorization is the process of transforming sequentially executing, or scalar, code into code that can execute in parallel, taking advantage of the SIMD architecture parallelism.

  • SSE2
  • 英特尔® 奔腾® 处理器
  • Developing for Speed: A Four-Step Approach

    by George Walsh


    Introduction

    There's really no denying that application optimization yields performance benefits. The question in each case is whether time spent optimizing and resulting performance gains is worth the development time, effort and cost. As part of his work with the Intel® software team Eric Palmer works closely with ISVs, helping to boost performance of their applications. Palmer has developed a four-step approach--it works for him, and it will work for your applications too.

  • 开发人员
  • 服务器
  • 英特尔® 奔腾® 处理器
  • 优化
  • Using htmobile Library Functions to Discover Processor Information


    Introduction

    The htmobile library provides a convenient means to retrieve feature and identification data to improve application performance.

    The variety of hardware platforms currently available requires applications to have access to increasingly large amounts of hardware information in order to run efficiently.

  • 开发人员
  • 英特尔® 奔腾® 处理器
  • 移动性
  • 线程
  • Linux* on Intel® Architecture Improved Performance and TCO


    Introduction

    During the design of the Pentium® 4 processor, Intel's design-engineering team reaped the rewards of migrating from legacy RISC*-based systems to Linux* on Intel architecture.

    Design-engineering teams at Intel have achieved substantial savings in Total Cost of Ownership (TCO) by migrating to Linux on Intel® architecture from legacy UNIX* systems on RISC-based architecture. At the same time, those teams have achieved substantial productivity gains from the performance benefits associated with the migration.

  • 开发人员
  • Linux*
  • 英特尔® 奔腾® 处理器
  • 开源
  • Возможное будущее микропроцессорной индустрии

    В этой статье поговорим о возможном развитии микропроцессорной индустрии. А именно перепрограммирования микропроцессоров на лету.
  • 服务器
  • "假设"软件
  • 图形
  • 英特尔® 奔腾® 处理器
  • 英特尔® 安腾® 处理器
  • 英特尔® 凌动™ 处理器
  • 订阅 英特尔® 奔腾® 处理器