And on page 2-5, it has a table listing the latency and bandwidth of each of Skylake's cache levels.
The table has a column called "Fastest Latency". Does that mean the smallest number of cycles required to write a cache line (64 bytes)? To read one? Both combined? Or something else?
And what's "Sustained Bandwidth"?
And in practice, how can one measure the latency incurred at each cache level? Say I have a C or C++ program.
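
One common way to approach this kind of measurement is a pointer-chasing loop: every load depends on the result of the previous one, so the time per step approximates load-to-use latency for whichever cache level the working set fits in. Below is a minimal sketch of that idea, not a calibrated benchmark. The names `make_cycle`, `chase_ns_per_load`, `kLineBytes` and `g_sink` are made up for illustration, and the code assumes the 64-byte cache line mentioned above. It reports nanoseconds, so converting to cycles needs the core clock frequency, and TLB misses, prefetching and frequency scaling can all skew the numbers.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <numeric>
#include <random>
#include <utility>
#include <vector>

// Assumption: 64-byte cache lines, as stated in the question.
constexpr std::size_t kLineBytes = 64;
constexpr std::size_t kStride = kLineBytes / sizeof(std::size_t);

// Volatile sink so the compiler cannot discard the chase loop's result.
volatile std::size_t g_sink = 0;

// Build a random permutation that forms one single cycle (Sattolo's
// algorithm), so following next[i] visits every slot before repeating.
// The random order is meant to make the access pattern hard to prefetch.
std::vector<std::size_t> make_cycle(std::size_t n, unsigned seed = 42) {
    std::vector<std::size_t> next(n);
    std::iota(next.begin(), next.end(), std::size_t{0});
    if (n < 2) return next;
    std::mt19937_64 rng(seed);
    for (std::size_t i = n - 1; i > 0; --i) {
        std::uniform_int_distribution<std::size_t> pick(0, i - 1);
        std::swap(next[i], next[pick(rng)]);
    }
    return next;
}

// Average time in nanoseconds per dependent load over a working set of
// roughly `bytes` bytes, using one chain node per 64-byte stride.
double chase_ns_per_load(std::size_t bytes, std::size_t loads) {
    assert(loads > 0);
    std::size_t lines = bytes / kLineBytes;
    if (lines == 0) lines = 1;

    const std::vector<std::size_t> order = make_cycle(lines);
    std::vector<std::size_t> buf(lines * kStride, 0);
    for (std::size_t i = 0; i < lines; ++i) {
        buf[i * kStride] = order[i] * kStride;  // store index of next node
    }

    std::size_t p = 0;
    // Warm-up pass: touch every node once so the data is resident.
    for (std::size_t i = 0; i < lines; ++i) p = buf[p];

    const auto t0 = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < loads; ++i) {
        p = buf[p];  // each load needs the previous load's result
    }
    const auto t1 = std::chrono::steady_clock::now();
    g_sink = p;

    const std::chrono::duration<double, std::nano> elapsed = t1 - t0;
    return elapsed.count() / static_cast<double>(loads);
}
```

Calling `chase_ns_per_load` with working-set sizes chosen to fit in L1, L2, L3 and main memory respectively (for example 16 KiB, 128 KiB, 4 MiB and 256 MiB, depending on the actual cache sizes of the machine) should, under these assumptions, show the time per load stepping up as each level is exceeded.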