Intel® Advanced Vector Extensions

Links to instruction documentation

IIR Gaussian Blur Filter Implementation using Intel® Advanced Vector Extensions

This white paper proposes an implementation of the Infinite Impulse Response (IIR) Gaussian blur filter using Intel® Advanced Vector Extensions (Intel® AVX) instructions. For a 2048x2048 image, the Intel AVX implementation is about 2x faster than the SSE code.
  • Developers
  • C/C++
  • Intel® Advanced Vector Extensions
  • Intel® Streaming SIMD Extensions
  • visual computing
  • Gaussian blur filter
  • Graphics
  • Media Processing
  • Vectorization

Can AVX instructions be executed in parallel?

    Hi,

    Can two AVX instructions be executed in parallel?

    For example,

    Version1:

        a1 = _mm256_load_ps(Rin + offset);
        a2 = _mm256_load_ps(Gin + offset);
        a3 = _mm256_load_ps(Bin + offset);

        ac0 = _mm256_mul_ps(a1, in2outAvx_11);
        ac1 = _mm256_mul_ps(a2, in2outAvx_12);
        ac2 = _mm256_mul_ps(a3, in2outAvx_13);

        z0 = _mm256_add_ps(ac0, ac1);
        z1 = _mm256_add_ps(z0, ac2);
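
To make the data dependencies in Version1 easier to see, here is a minimal, self-contained sketch of the same load / multiply / add pattern. It is an illustration under assumptions, not the poster's actual code: the function name `mix_rgb_avx`, the scalar coefficients `c1`..`c3` (standing in for `in2outAvx_11`..`in2outAvx_13`, assumed here to be broadcast constants), the output buffer, and the scalar tail loop are all hypothetical. It uses unaligned loads and stores so it does not depend on 32-byte-aligned buffers, and the `target("avx")` attribute assumes GCC or Clang. The three loads are independent of one another, as are the three multiplies, so an out-of-order core may be able to overlap them; the second add depends on the result of the first, so those two cannot overlap with each other.

```c
#include <immintrin.h>
#include <stddef.h>

/* Hypothetical helper: out[i] = c1*r[i] + c2*g[i] + c3*b[i].
 * Processes 8 floats per iteration with AVX, then a scalar tail. */
__attribute__((target("avx")))
void mix_rgb_avx(const float *r, const float *g, const float *b,
                 float c1, float c2, float c3, float *out, size_t n)
{
    /* Assumption: the coefficients are the same for every pixel,
     * so each one is broadcast to all 8 lanes once. */
    const __m256 k1 = _mm256_set1_ps(c1);
    const __m256 k2 = _mm256_set1_ps(c2);
    const __m256 k3 = _mm256_set1_ps(c3);

    size_t i = 0;
    for (; i + 8 <= n; i += 8) {
        /* Three loads with no dependency on each other. */
        __m256 a1 = _mm256_loadu_ps(r + i);
        __m256 a2 = _mm256_loadu_ps(g + i);
        __m256 a3 = _mm256_loadu_ps(b + i);

        /* Three multiplies; each depends only on its own load. */
        __m256 ac0 = _mm256_mul_ps(a1, k1);
        __m256 ac1 = _mm256_mul_ps(a2, k2);
        __m256 ac2 = _mm256_mul_ps(a3, k3);

        /* Dependency chain: z1 needs z0, so these adds are serialized. */
        __m256 z0 = _mm256_add_ps(ac0, ac1);
        __m256 z1 = _mm256_add_ps(z0, ac2);

        _mm256_storeu_ps(out + i, z1);
    }

    /* Scalar tail for the last n % 8 elements. */
    for (; i < n; ++i)
        out[i] = c1 * r[i] + c2 * g[i] + c3 * b[i];
}
```

Whether the independent instructions actually execute in the same cycle depends on the specific processor and on what else is competing for its execution units, so any speedup should be measured rather than assumed.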
                
