Does anybody know of a 2D convolution function that is well optimized for the phi? We have 2-megapixel images and a 26x26 nonseparable kernel. Also, how much speedup should we expect compared to a 6-core i7 or Xeon? My baseline is ippiFilter_8u_C4R on a CPU or nppiFilter_8u_C4R on GPU.
Jue, 28/03/2013 - 15:13