By Shawn Casey (Intel), Added

**by Shawn D. Casey**

### Introduction

Hyper-Threading Technology from Intel allows one physical processor package to be perceived as two separate logical processors within the operating system. Processor resources enabled for Hyper-Threading Technology duplicate, tag, or share the majority of resources. Sharing resources allows a more efficient use of the processor for a significant performance increase, at less than 5% die size and power consumption increase compared to a single processor package. However, Hyper-Threading Technology cannot have performance expectations equivalent to that of multi-processing where all the processor resources are replicated.

Measured performance on the Intel® Xeon® processor MP with Hyper-Threading Technology shows performance gains of up to 30% on common server application benchmarks for this technology¹. There are several dependencies reflected in the speedup demonstrated by any benchmark with Hyper-Threading Technology, and the amount of gain can vary widely. However, a gain of 30% or more within a parallel section with is considered acceptable.

This paper explains how to calculate *Hyper-Threading Technology* effectiveness, a derived quantity describing the effectiveness of Hyper-Threading Technology while taking the scalability of the workload into account.

^{1}*Intel Technology Journal*, Volume 6, Issue 1 p.11

### Why use Hyper-Threading Technology effectiveness?

There is a misconception that equal performance on two workloads means equal *Hyper-Threading Technology effectiveness*. This doesn't give the full picture, since the amount of performance achievable is unknown.

### Example 1

**Question:**

Which application is more effective with Hyper-Threading Technology? Application A has a 5% speedup with Hyper-Threading Technology while application B has a 7% speedup.**Answer:**

Indeterminate. Take the case where application A has a 90% speedup with dual-processors and application B only has a 10% increase with dual-processors. Application A had much more potential to scale with Hyper-Threading Technology but failed to take advantage of it. Application B, on the other hand, only had 10% potential and took advantage of most of that with Hyper-Threading Technology. Calculating *Hyper-Threading Technology effectiveness* solves this problem.

### Multi-processing Performance

The two dependencies on the performance speedup from any multi-processing system according to Amdahl's Law² are the amount of parallelism in the application/workload and the speedup of the parallel parts.

where:*Speedup*_{overall} = Overall speedup of application/workload

with enhanced sections of code.*Fraction*_{enhanced} = Portion of code that has be en enhanced or

made parallel. (0.0 <= *Fraction*_{enhanced} <= 1.0)

**Equation 1: Amdahl's Law**

Amdahl's Law basically states that the amount of performance gain is driven by the time spent in the parallel region of the application and how fast you can make the code run in those regions. In a dual processor system, this formula becomes the more specific:

where:*DP Scaling*_{Max} = Maximum possible dual-processor scaling (ignoring

unusual super-linear cases), giving the amount of parallelism.

Synchronization overhead is included in speedup value of 2.*Parallel*_{Observed} = Amount of parallel activity that occurs in the

application/workload pair.

**Equation 2: Amdahl's Law applied to DP systems**

^{1}*Computer Architecture: A Quantitative Approach*, 2^{nd} Edition, §1.6"

### Amdahl's Law Applied to Hyper-Threading Technology

Since Hyper-Threading Technology is a form of multi-processing, similar techniques that are used to determine the effectiveness of multi-processing can be used to determine the effectiveness of Hyper-Threading Technology.

If we assume that to be effective with Hyper-Threading Technology, a 30% gain within a parallel section is needed, we can modify Equation 1 and set *Speedup*_{Enhanced} = 1.3 to determine *Hyper-Threading Technology Scaling *_{Effective} if the amount of parallel activity is known (Equation 3).

where:*Hyper-Threading Technology Scaling *_{Effective} = An effective *Hyper-Threading Technology* scaling value assuming a 30% acceptable speedup within the parallel sections (which includes synchronization overhead).*Parallel*_{Observed} = Amount of parallel activity that occurs in the application/workload pair.

**Equation 3: Amdahl's Law applied to effective Hyper-Threading Technology systems**

Performance measurements usually do not reflect how parallel a particular application is. The derivation of *Hyper-Threading Technology effectiveness* eliminates the need of knowing how parallel an application is by empirically gathering that information from performance measurements. To determine an application/workload *Hyper-Threading Technology effectiveness*, the following measurements are needed:

- Single-processor (UP) performance
- Hyper-Threading Technology performance
- Dual-processor (DP) performance

Performance measurements denote a metric where higher values are more favorable. In cases where elapsed time i s the performance metric, the reciprocal of time is used. The most important consideration is that the application/workload pair performs the same amount of work on all configurations.

From the measurements, we can calculate the following:

**Equation 4: Definitions of DP and Hyper-Threading Technology scaling**

Hyper-Threading Technology effectiveness is defined as:

where:*HT Scaling*_{Observed} = Measured Hyper-Threading Technology Scaling (includes synchronization overhead).*HT Scaling*_{Effectiveness} = How effective an application/workload pair is with Hyper-Threading Technology.

**Equation 5: Hyper-Threading Technology effectiveness**

We know from our measurements what the *HT Scaling*_{Observed} is, but *HT Scaling*_{Effective} in Equation 3 is still in terms the degree of parallelism in the application/workload. Assuming the parallel sections of the applications scale at 2, we can solve for *Parallel *_{Observed} from Equation 2 in terms of *DP Scaling*_{Observed} as shown in Equation 6.

**Equation 6: Amount of parallelism in code in terms of DP scaling observed**

*Parallel *_{Observed} from Equation 6 can then be used in Equation 3 to calculate *HT Scaling*Effective in terms of DP Scaling measured as shown in Equation 7.

**Equation 7: Effective Hyper-Threading Technology scaling in terms of DP scaling observed**

Using Equation 5 and Equation 7 together, Hyper-Threading Technology effectiveness can now be represented completely in terms of Hyper-Threading Technology scaling and DP scaling.

**Figure 8: Hyper-Threading Technology Effectiveness in terms of measured values**

A Hyper-Threading Technology Effectiveness of 1.0 is desirable, meaning that the application is achieving what is typical based on measured performance from common benchmarks. Anything less than one represents an undesirable effectiveness that should be investigated from the application's perspective using performance analyzer tools such as the Intel VTune™ Performance Analyzer.

### Example 2

**Question:**

For an application/workload that takes 40.0s to complete on a single-processor system, 25.0s to complete on a dual-processor s ystem, and 35.0s on Hyper-Threading Technology enabled single-processor system, what is the Hyper-Threading Technology Effectiveness? Is it acceptable? What if the Hyper-Threading Technology system completed its task in 30s?**Answer:**

First, calculate the *HT Scaling *_{Observed},

Next, calculate the *DP Scaling *_{Observed},

Now, calculate Hyper-Threading Technology Effectiveness,

This application/workload combination is not very effective. However, if the Hyper-Threading Technology system took 30s to complete, then,

Which would result in the following Hyper-Threading Technology Effectiveness,

When the Hyper-Threading Technology system completes in 30s, the application/workload has a terrific Hyper-Threading Technology Effectiveness.

### Conclusion

So, understanding the effectiveness of your application running on a Hyper-Threading Technology enabled system takes a little more than just comparing performance numbers. As we've seen in the previous sections, however, there are some straightforward things you can do to really understand how much your application/workload can gain from using Hyper-Threading Technology. See the Resources below for additional information.

### Resources

For information on taking advantage of Hyper-Threading Technology in developing your applications, visit our community for Parallel Programming.

### References

*Computer Architecture: A Quantitative Approach*, 2^{nd} Edition, Patterson, David A. and Hennessy, John L.

### About the Author

Shawn Casey is a Senior Application Engineer with Intel's Software and Solutions Group.

## 添加评论

顶部（有关技术讨论的信息，请访问开发人员论坛。有关网站或软件产品的问题，请联系支持部门。）

请登录添加评论。还不是成员？立即加入