I have some static code that I've been tuning as much as I can, trying all sorts of things. The fastest I could get was 9.5 seconds. When I ran the program multiple times with prof-gen, and then used prof-use, the code is now down to 8.2 seconds. Wow.Is there any way I can find out what it did? Any hints/reports? I've looked through the assembler but would like some hints at producing the source code in the first place.
For more complete information about compiler optimizations, see our Optimization Notice.