I was wondering if Intel engineers would consider adding vector versions of certain BMI2 instructions to AVX. In particular, I think vector variants of PDEP and PEXT would be of great value for multimedia workloads, which often pack data into small bitfields.
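To make the request concrete, here is a rough sketch in C of what such instructions might compute. The two scalar helpers are portable reference versions of the PEXT/PDEP bit semantics, not the hardware instructions themselves. The `vec8x32` type and the `vpext32_sketch`/`vpdep32_sketch` functions are hypothetical names invented for illustration: they model a 256-bit register as eight 32-bit lanes and assume a per-lane mask operand, which is only one possible design.

```c
#include <stddef.h>
#include <stdint.h>

/* Reference PEXT: collect the bits of src selected by mask and pack them
   contiguously into the low bits of the result, preserving their order. */
static uint32_t pext32_ref(uint32_t src, uint32_t mask) {
    uint32_t result = 0;
    uint32_t out_bit = 1;
    while (mask != 0) {
        uint32_t lowest = mask & (0u - mask); /* lowest set bit of mask */
        if (src & lowest) result |= out_bit;
        out_bit <<= 1;
        mask &= mask - 1;                     /* clear that bit */
    }
    return result;
}

/* Reference PDEP: scatter the low bits of src to the bit positions
   selected by mask, preserving their order. */
static uint32_t pdep32_ref(uint32_t src, uint32_t mask) {
    uint32_t result = 0;
    uint32_t in_bit = 1;
    while (mask != 0) {
        uint32_t lowest = mask & (0u - mask);
        if (src & in_bit) result |= lowest;
        in_bit <<= 1;
        mask &= mask - 1;
    }
    return result;
}

/* Hypothetical stand-in for a 256-bit register holding eight 32-bit lanes. */
#define LANES 8
typedef struct { uint32_t lane[LANES]; } vec8x32;

/* Hypothetical vector PEXT: each lane is extracted with its own mask. */
static vec8x32 vpext32_sketch(vec8x32 src, vec8x32 mask) {
    vec8x32 out;
    for (size_t i = 0; i < LANES; ++i)
        out.lane[i] = pext32_ref(src.lane[i], mask.lane[i]);
    return out;
}

/* Hypothetical vector PDEP: each lane is deposited with its own mask. */
static vec8x32 vpdep32_sketch(vec8x32 src, vec8x32 mask) {
    vec8x32 out;
    for (size_t i = 0; i < LANES; ++i)
        out.lane[i] = pdep32_ref(src.lane[i], mask.lane[i]);
    return out;
}
```

With an RGB565 pixel, for example, extracting with the masks `0xF800`, `0x07E0` and `0x001F` yields the red, green and blue fields, and depositing with the same masks packs them back. On BMI2 hardware the scalar helpers correspond to the `_pext_u32` and `_pdep_u32` intrinsics from `<immintrin.h>`; the loops above are what a vector form would replace with a single instruction per operation.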
I'm personally interested in this in light of the ongoing CPU-GPU convergence. AVX2's gather and FMA support will help a great deal in catching up with the GPU, but there are still a handful of legacy graphics operations that require a relatively large number of instructions to implement on the CPU. The great thing about PDEP/PEXT is that developers could also implement custom data formats efficiently. Recently there has been a lot of research into custom rasterization and anti-aliasing algorithms, but I'm sure the uses go far beyond (rasterization) graphics.