I'm developing a transcoding unit with the IPP libraries, so I need multiple encoder and decoder instances. H264 in particular does not seem to scale in a multi-threaded environment. With one or two H264 encoder and decoder instances (30fps@VGA), there is no serious performance problem. But as I add more H264 encoder and decoder instances, CPU usage increases dramatically.
The only answer I found in this forum is "DO NOT USE OPENMP". The problem is that on a Xeon E5620 @ 2.4GHz CPU, I cannot decode and encode an H264 30fps@VGA stream in real time without OpenMP support.
Please tell me if I missed something.