Blog post

Visual Studio 2010 Built-in CPU Acceleration

Writing the sample code for this post I was amazed myself to see how simple it was to reach over 20 times performance improvement with so little effort.   

Authored by Last updated on 12/12/2018 - 18:00
Article

3D Vector Normalization Using 256-Bit Intel® Advanced Vector Extensions (Intel® AVX)

This article shows how to use 256-bit Intel® Advanced Vector Extensions (Intel® AVX) to normalize an array of 3D vectors. We describe a shuffle approach to convert between AOS & SOA on-the-fly in order to make data ready for up to 8-wide SIMD processing.
Authored by Last updated on 05/03/2019 - 14:05
Article

SOA Cloth Simulation with 256-bit Intel® Advanced Vector Extensions (Intel® AVX)

This white paper describes a code sample that uses Intel® AVX for computing mesh-based cloth simulation. A structure of arrays (SOA) implementation is used to maximize data parallelism enabling usage of 256-bit (8 float) SIMD processing. Code is provided.
Authored by Last updated on 05/03/2019 - 15:45
Article

Embree: Photo-Realistic Ray Tracing Kernels

Photo-realistic rendering requires accurate simulation of light propagation according to physics laws. The best known way to solve this problem is Monte Carlo ray tracing. We describe a state-of-the-art photo-realistic Monte Carlo rendering engine.
Authored by Sven Woop (Intel) Last updated on 08/02/2019 - 17:30
Article

Fast CPU DXT Compression

DXT compression is a lossy texture compression algorithm that can reduce texture storage requirements and decrease texture bandwidth. DXT decompression is typically hardware accelerated, which makes it very fast and efficient.
Authored by admin Last updated on 05/03/2019 - 13:57
Article

AVX Cloth - Retired

  AVX Cloth Intel Corporation
Authored by Jeffrey Mott (Intel) Last updated on 01/24/2018 - 12:12
Article

Software Occlusion Culling

This article details an algorithm and associated sample code for software occlusion culling which is available for download. The technique divides scene objects into occluders and occludees and culls occludees based on a depth comparison with the occluders that are software rasterized to the depth buffer. The sample code uses frustum culling and is optimized with Streaming SIMD Extensions (SSE)...
Authored by Kiefer Kuah (Intel) Last updated on 05/03/2019 - 15:54
Blog post

Intel® Graphics Virtualization Update

Traditional business models, built on graphics and visualization usages such as workstation remoting, VDI, DaaS, transcoding, media streaming, and on-line gaming, are beginning to draw open s

Authored by Last updated on 07/04/2019 - 19:24
Article

Easy SIMD through Wrappers

SIMD operations are widely used for 3D graphics applications. This tutorial provides new insights into SIMD by comparing SIMD lanes and CPU threads, and steps you through the process of creating a simple, straightforward SIMD implementation in your own code.
Authored by admin Last updated on 05/03/2019 - 14:05