You optimized your code to apply a loop interchange mechanism that gave you 3 seconds of improvement in the application execution time. To understand whether you got rid of the hotspot and what kind of optimization you got per function, re-run the Basic Hotspots analysis on the optimized code and compare results:
Compare Results Before and After Optimization
VTune Amplifier collects data and opens the result in the Result tab. Make sure to close the results before comparison.
Identify the Performance Gain
Explore the Bottom-up pane to compare CPU time data for the first hotspot: CPU Time:r001hs - CPU Time:r002hs = CPU Time: Difference. 8.719s - 5.564s = 3.154s, which means that you got the optimization of ~3 seconds for the
If you switch to the Summary window, you see that the Elapsed time also shows 3.392 seconds of optimization for the whole application execution: