Intel® VTune™ Amplifier

Profiling Tensorflow* workloads with Intel® VTune™ Amplifier

Machine learning applications are very compute intensive by their nature. That is why optimization for performance is quite important for them. One of the most popular libraries, Tensorflow*, already has an embedded timeline feature that helps understand which parts of the computational graph are causing bottlenecks but it lacks some advanced features like an architectural analysis.

  • Linux*
  • Microsoft Windows* 10
  • Artificial Intelligence
  • C/C++
  • Python*
  • Beginner
  • Intermediate
  • Intel® Parallel Studio XE
  • Intel® VTune™ Amplifier
  • VTune
  • TensorFlow
  • timeline
  • JSON
  • Debugging
  • Development Tools
  • Machine Learning
  • What's New? - Intel® VTune™ Amplifier XE 2017 Update 4

    Intel® VTune™ Amplifier XE 2017 performance profiler

    A performance profiler for serial and parallel performance analysis. Overviewtrainingsupport.

    New for the 2017 Update 4! (Optional update unless you need...)

    As compared to 2017 Update 3:

  • C#
  • C/C++
  • Fortran
  • Java*
  • Intel® VTune™ Amplifier
  • Development Tools
  • TSX Hotspots Analysis Inside Transactions

    The TSX Hotspots analysis type uses event-based sampling collection and is targeted for the Intel® processors supporting Intel Transactional Synchronization Extensions (Intel TSX).

    This analysis type uses the the INST_RETIRED.PREC_DIST hardware event that emulates precise clockticks and helps identify performance-critical program units inside transactions.

    To use the TSX Hotspots analysis type, explore:

    Subscribe to Intel® VTune™ Amplifier