Is there any benchmark for matrix multiplication on MIC? If so, please share a link with me.
Also, I am seeing a very weird phenomenon in my application.
I am trying to develop an O(n^3), matrix-multiplication-like application.
Suppose I have a function A, and I am only interested in the timing of function A. Function A takes its input from a routine that preprocesses the input for A. However, whatever values are supplied to function A, it always does O(n^3) operations.
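To make the setup concrete, here is a simplified sketch of the structure. It is not my actual code: the names preprocess, function_a, wall_seconds and time_function_a are placeholders, the triple loop is only a stand-in for the real kernel, and the fill values in preprocess are arbitrary. The point is that only the call to function A is inside the timed region, and the loop count depends only on n, not on the values in the matrices.

```c
#include <stddef.h>
#include <time.h>

/* Placeholder for the preprocessing routine: fills the two n x n input
   matrices. The values used here are arbitrary, for illustration only. */
void preprocess(double *a, double *b, size_t n)
{
    for (size_t i = 0; i < n * n; i++) {
        a[i] = (double)(i % 7) + 1.0;
        b[i] = (double)(i % 5) + 1.0;
    }
}

/* Placeholder for function A: a plain triple loop that performs n*n*n
   multiply-adds no matter what values a and b contain. */
void function_a(const double *a, const double *b, double *c, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            for (size_t k = 0; k < n; k++)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    }
}

/* Wall-clock time in seconds, using the C11 timespec_get function. */
double wall_seconds(void)
{
    struct timespec ts;
    timespec_get(&ts, TIME_UTC);
    return (double)ts.tv_sec + (double)ts.tv_nsec * 1e-9;
}

/* Times function A only; preprocessing must already have been done
   by the caller, so it is excluded from the measurement. */
double time_function_a(const double *a, const double *b, double *c, size_t n)
{
    double start = wall_seconds();
    function_a(a, b, c, n);
    return wall_seconds() - start;
}
```

In this sketch the caller would run preprocess(a, b, n) first and then call time_function_a(a, b, c, n), so the reported time covers function A alone.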
Initially, function A ran in 24 s for 16384x16384 matrices. But the result was incorrect, because I had made a mistake in the preprocessing routine that filled the input matrices with wrong values.
However, once I fixed the preprocessing step to produce correct input, function A started taking 45 s, even though it was doing the same amount of computation as before!
Why might that happen?
If anyone has experience with this kind of problem, please share that with me.
Thanks in advance.