I have used iaca for a few days , I mainly used it to analysis the performance of my assembly code.
I noticed that it can only anlyisis the first pass of a loop , though the code in the loop runs thousands of passes.
How I find this ?
I write two assembly codes, for convenient to state, I give them a name respectly, code A and code B. they differers a little.
Code A is faster than code B in speed.
Both Code A and Code B has a loop in side.
The first pass loop of Code A is slower than Code B,
but in later passes, faster than Code B.
But , the iaca give the result report that Code B is faster than Code A for 2 clock cycles both in througout and latency.
Is that IACA can only analysis the first pass of o loop ?