Intel Media SDK Tutorial - Workload analysis and benchmarking

To allow workload benchmarking, all of the Intel® Media SDK tutorial source code samples contain the macro “ENABLE_BENCHMARK”. If set, the macro activates measurement of the complete workload run time. 

For benchmarking accuracy, it’s important to understand the performance impact of uncompressed frame read/write file access. Without disabling such file access, the benchmarks become inaccurate and do not convey the real performance of the HW. To address this, the macros “ENABLE_INPUT” and “ENABLE_OUTPUT” can be disabled to prevent uncompressed frame file access. However, note that when disabling input of uncompressed data, the input surface will be initialized as a static “one color” image. This naturally does not represent a typical encoder or VPP input. But the purpose of the captured benchmark is not to convey real life performance but to illustrate relative performance improvements using various implementation techniques.

The table below summarizes a performance benchmark snapshot of all the workloads analyzed using Intel® Graphics Performance Analyzers (Intel® GPA). As can be seen, for each use case, the run-time performance and GPU utilization improves gradually as optimizations are applied. For details about each workload go here.

Workload Name

GPU load (%)

CPU load (%)

Run time (s)

 

Overall

EU

MFX

 

 

simple_2_decode

72

0

72

36

3.30

simple_2_decode - d3d

95

0

95

15

1.84

simple_3_encode

70

52

18

5

12.50

simple_3_encode - d3d

84

63

21

4

8.10

simple_3_encode - d3d – async

97

84

32

6

5.00

simple_4_vpp_resize_procamp

20

20

0

30

2.25

simple_4_vpp_resize_procamp - d3d

38

38

0

7

1.10

simple_5_transcode

70

43

26

11

22.50

simple_5_transcode_opaque

82

55

33

6

17.55

simple_5_transcode_opaque - async

97

82

54

9

7.70

Configuration: 3rd generation Intel® Core™ i5-3317U, 1.7 GHz., 4 GB RAM. 32-bit Microsoft Windows* 8. ~50s 1080 p content. Microsoft DirectX* 9 workloads. For more information go to http://www.intel.com/performance

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations, and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.

Click here to go back to Tutorial front page.

Pour de plus amples informations sur les optimisations de compilation, consultez notre Avertissement concernant les optimisations.