I am trying to measure the uops delivered from the Loop Stream Detector (LSD) on my Sandy Bridge processor. I don't see anything in the PMC documentation about how to do this. Is there a method I can use to determine the number of uops delivered to the UopQ from the LSD? Is the LSD part of the UopQ? If so, then it isn't really delivering uops to the UopQ, right?

PMC 79 allows me to measure the uops dispatched from the uop cache with umask=0x08, from the legacy decode unit (ILD) with umask=0x04, and from microcode (MS) with umask=0x30. But if you can't determine how many come from the LSD, you cannot account for all uops delivered to the UopQ.

I ask because in simple copy/read/write tests I'm seeing a large number of retired uops that these counters do not account for. I want to account for all the sources and identify the percentage of uops delivered to the UopQ from each one.

Thanks,
perfwise
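
P.S. To make the accounting concrete, here is a minimal Python sketch of the arithmetic I have in mind. The function name and the counter values are hypothetical placeholders, not measured data. It also assumes that the retired uop count is a fair stand-in for the total delivered to the UopQ, which may not hold exactly (for example under misspeculation), so the remainder is only a rough estimate of what other sources such as the LSD might contribute.

```python
def uop_source_shares(dsb, mite, ms, total):
    """Return each source's share of `total` uops, in percent.

    dsb, mite, ms: counts read with umask 0x08, 0x04 and 0x30 respectively.
    total: reference uop count (assumed here to be retired uops).
    The "unaccounted" entry is whatever the three counters do not cover.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    counts = {"dsb": dsb, "mite": mite, "ms": ms}
    counts["unaccounted"] = total - (dsb + mite + ms)
    return {name: 100.0 * n / total for name, n in counts.items()}


# Hypothetical counter readings, for illustration only.
shares = uop_source_shares(dsb=600_000, mite=150_000, ms=50_000, total=1_000_000)
for name, pct in shares.items():
    print(f"{name:12s} {pct:6.2f}%")
```

A negative "unaccounted" share would mean the three counters together exceed the reference total, which would itself point to a problem with the assumption above.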