Mensagem de blog

Visual Studio 2010 Built-in CPU Acceleration

Writing the sample code for this post I was amazed myself to see how simple it was to reach over 20 times performance improvement with so little effort.   

Criado por Última atualização em 12/12/2018 - 18:00

3D Vector Normalization Using 256-Bit Intel® Advanced Vector Extensions (Intel® AVX)

This article shows how to use 256-bit Intel® Advanced Vector Extensions (Intel® AVX) to normalize an array of 3D vectors. We describe a shuffle approach to convert between AOS & SOA on-the-fly in order to make data ready for up to 8-wide SIMD processing.
Criado por Última atualização em 03/05/2019 - 14:05

SOA Cloth Simulation with 256-bit Intel® Advanced Vector Extensions (Intel® AVX)

This white paper describes a code sample that uses Intel® AVX for computing mesh-based cloth simulation. A structure of arrays (SOA) implementation is used to maximize data parallelism enabling usage of 256-bit (8 float) SIMD processing. Code is provided.
Criado por Última atualização em 03/05/2019 - 15:45

Embree: Photo-Realistic Ray Tracing Kernels

Photo-realistic rendering requires accurate simulation of light propagation according to physics laws. The best known way to solve this problem is Monte Carlo ray tracing. We describe a state-of-the-art photo-realistic Monte Carlo rendering engine.
Criado por Sven Woop (Intel) Última atualização em 02/08/2019 - 17:30

Fast CPU DXT Compression

DXT compression is a lossy texture compression algorithm that can reduce texture storage requirements and decrease texture bandwidth. DXT decompression is typically hardware accelerated, which makes it very fast and efficient.
Criado por administrar Última atualização em 03/05/2019 - 13:57

AVX Cloth - Retired

  AVX Cloth Intel Corporation
Criado por Jeffrey Mott (Intel) Última atualização em 24/01/2018 - 12:12

Software Occlusion Culling

This article details an algorithm and associated sample code for software occlusion culling which is available for download. The technique divides scene objects into occluders and occludees and culls occludees based on a depth comparison with the occluders that are software rasterized to the depth buffer. The sample code uses frustum culling and is optimized with Streaming SIMD Extensions (SSE)...
Criado por Kiefer Kuah (Intel) Última atualização em 03/05/2019 - 15:54

Accelerating Texture Compression with Intel® Streaming SIMD Extensions

Improving ETC1 and ETC2 texture compression   What is texture compression?
Criado por Última atualização em 03/07/2019 - 18:44

Use the Intel® SPMD Program Compiler for CPU Vectorization in Games

Migrate highly vectorized GPU compute kernels to CPU code using the Intel® SPMD Program Compiler (commonly referred to in previous documents as ISPC). Includes a link to a Github code sample to help you utilize spare CPU cycles to create a richer gaming experience.
Criado por Jon Kennedy. (Intel) Última atualização em 02/05/2019 - 16:12