This is a pretty elementary question. I am running SUSE Linux on a Pentium 4 (P4). All I want to do for now is collect L1 data cache reference and miss counts for a simple triple loop (say, a matrix-matrix multiplication), counting only the references and misses that occur inside the loop itself. Is there a simple way to do this?
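For concreteness, the loop I have in mind looks roughly like the sketch below. The function name `matmul`, the flat row-major layout, and the size parameter `n` are just illustrative choices. The two marker comments show where I would like counting to start and stop; they are placeholders, not calls to any actual tool.

```c
#include <stddef.h>

/* Naive triple-loop matrix multiply: c = a * b.
 * All three matrices are n x n, stored row-major in flat arrays.
 * (Hypothetical helper, written only to illustrate the question.) */
void matmul(size_t n, const double *a, const double *b, double *c)
{
    /* <-- want to START counting L1 data references/misses here */
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            for (size_t k = 0; k < n; k++) {
                /* a is walked along a row, b down a column */
                sum += a[i * n + k] * b[k * n + j];
            }
            c[i * n + j] = sum;
        }
    }
    /* <-- want to STOP counting and read both counters here */
}
```

Anything outside those two markers (allocating and initializing the matrices, printing results) should not show up in the counts.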
Thanks a million!!