Forum topic

How to speed up this code?

Hello everyone,

Many thanks to all contributors to my past question.

Authored by Alexander L. Last updated on 01/19/2017 - 02:12

Software Occlusion Culling

This article details an algorithm for software occlusion culling, with associated sample code available for download. The technique divides scene objects into occluders and occludees: the occluders are software-rasterized to a depth buffer, and each occludee is culled based on a depth comparison against that buffer. The sample code uses frustum culling and is optimized with Streaming SIMD Extensions (SSE)...
Authored by Kiefer Kuah (Intel) Last updated on 01/17/2017 - 11:59
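
The depth comparison described in the occlusion culling entry above can be illustrated with a minimal scalar sketch. This is not the article's sample code: the `OccludeeBounds` struct, the function name, the row-major float depth buffer, and the convention that smaller depth values are closer to the camera are all assumptions made for illustration, and the SSE optimization is omitted.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical screen-space bounds of an occludee, in pixels.
   x0/y0 are inclusive, x1/y1 are exclusive. */
typedef struct {
    int x0, y0, x1, y1;
    float nearest_depth; /* depth of the occludee's closest point */
} OccludeeBounds;

/* Assumes the occluders were already rasterized into `depth` (row-major,
   width * height floats) and that smaller values are closer to the camera.
   Returns true if the occludee could be visible at any covered pixel. */
bool occludee_is_visible(const float *depth, int width, int height,
                         OccludeeBounds b)
{
    /* Clamp the rectangle to the buffer so indexing stays in range. */
    int x0 = b.x0 < 0 ? 0 : b.x0;
    int y0 = b.y0 < 0 ? 0 : b.y0;
    int x1 = b.x1 > width ? width : b.x1;
    int y1 = b.y1 > height ? height : b.y1;

    for (int y = y0; y < y1; ++y) {
        for (int x = x0; x < x1; ++x) {
            /* Visible if the occludee is at least as close as the stored
               occluder depth at this pixel. */
            if (b.nearest_depth <= depth[y * width + x]) {
                return true;
            }
        }
    }
    /* Every covered pixel is behind an occluder (or the rectangle is empty
       after clamping, in which case this sketch treats it as culled). */
    return false;
}

/* Small usage example on a 2x2 buffer filled with occluder depth 0.5. */
void occlusion_self_check(void)
{
    float depth[4] = {0.5f, 0.5f, 0.5f, 0.5f};
    OccludeeBounds behind = {0, 0, 2, 2, 0.8f};
    OccludeeBounds in_front = {0, 0, 2, 2, 0.3f};
    assert(!occludee_is_visible(depth, 2, 2, behind));
    assert(occludee_is_visible(depth, 2, 2, in_front));
}
```

Testing only the nearest depth of the bounds is conservative: it can report an occluded object as visible, but under the stated assumptions it does not cull an object that has any pixel in front of the occluders.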
Forum topic

mitigating permute costs in AVX 256?

Hello, I'm investigating conversion of a number of compute kernels from AVX 128 to AVX 256 and would appreciate any guidance which might be available on getting a small number of operations on port

Authored by Todd West Last updated on 01/15/2017 - 09:21
Forum topic

_mm_prefetch usage



Authored by Ioan H. Last updated on 01/15/2017 - 06:01
Forum topic

Is xend treated as a full memory barrier?

I've started attempting to learn RTM extensions. The most common examples I can find online are using them to implement a mutex or concurrent lock. Often they are similar to:

Authored by william laeder Last updated on 01/13/2017 - 08:05

Part 3: Expressing Parallelism with Vectors

Episode 3 of the “Hands-On Workshop (HOW) series on parallel programming and optimization with Intel® architectures” introduces data parallelism and automatic vectorization.

Authored by Last updated on 01/12/2017 - 14:45
Forum topic

Code scales poorly with AVX

This code scales poorly with AVX on my Sandy Bridge; how can I make it more vectorizer-friendly:

Authored by CommanderLake Last updated on 01/11/2017 - 18:32
Blog post

Resetting the lowest n set bits

A couple of years ago, the Bit Manipulation Instruction Set 1 (BMI1) introduced the BLSR instruction, which resets the lowest set bit.

Authored by Thomas Willhalm (Intel) Last updated on 01/10/2017 - 00:54
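
As a rough illustration of the BLSR entry above: the operation BLSR performs is the well-known identity `x & (x - 1)`. The sketch below writes it in portable C and repeats it to clear the lowest n set bits. The function names are hypothetical, and the plain loop is only a baseline; it is not presented as the approach taken in the blog post.

```c
#include <assert.h>
#include <stdint.h>

/* Portable equivalent of what BLSR computes: clear the lowest set bit.
   x - 1 flips the lowest set bit and every zero bit below it, so ANDing
   with x removes exactly that bit. Returns 0 when x is 0. */
uint32_t reset_lowest_set_bit(uint32_t x)
{
    return x & (x - 1u);
}

/* Hypothetical helper: clear the lowest n set bits by repeating the step.
   Stops early once no set bits remain. */
uint32_t reset_lowest_n_set_bits(uint32_t x, unsigned n)
{
    while (n > 0 && x != 0) {
        x = reset_lowest_set_bit(x);
        --n;
    }
    return x;
}

/* Small usage example. */
void reset_bits_self_check(void)
{
    assert(reset_lowest_set_bit(0x0Cu) == 0x08u);       /* 1100 -> 1000 */
    assert(reset_lowest_n_set_bits(0xF0u, 2) == 0xC0u); /* 11110000 -> 11000000 */
    assert(reset_lowest_n_set_bits(0x05u, 5) == 0u);    /* only two bits to clear */
}
```

The loop costs one iteration per cleared bit, which is fine for small n but is the kind of dependency chain a dedicated instruction sequence would aim to avoid.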
Forum topic

Parallelization + Vectorization using OpenMP in Sandy Bridge


I would like to ask a question about parallelization and vectorization:

Authored by Claudia W. Last updated on 01/09/2017 - 00:05
Forum topic

AVX512 suboptimal intrinsics compilation


I'm looking into what the Intel compiler generates from AVX512 intrinsics (latest Intel trial compiler, downloaded a few weeks ago).

Authored by jan v. Last updated on 01/05/2017 - 09:40