Parallel Computing
Intel® Xeon Phi™ Coprocessor February Developer Webinar Q&A Responses
The response to our February session of the Intel® Xeon® and Xeon® Phi™ Introduction to High Performance Application Development for Multicore and Manycore-Live webinar was gratifying and overwhelming, but we have finally worked through all of your questions. Some questions depended on context that was lost in the transcript, and others were only partially formed, of narrow interest, or duplicates. We gathered all the questions of general interest from the webinar and farmed them out to our experts for more complete answers. We've assembled that list and sorted it by
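Efficient Parallelization
Efficient Parallelization Documentation
Overview
This chapter covers parallelization. It provides links to various parallelization methods and resources, along with tips for getting the best parallel performance.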
The Intel® Xeon Phi™ coprocessor: What is it and why should I care? Part 0: Introduction
PART 0: “Introduction”
Calculating estimated call counts with Intel® VTune™ Amplifier XE 2013
When you profile your software with VTune™ Amplifier XE, you often start by looking at the list of top function hotspots. This shows which functions consume the most CPU resources, so you can focus your optimization efforts.
Function call counts can provide some additional information to assist in further optimization.
Intel® Trace Analyzer and Collector Guides
This is currently a placeholder for Intel® Trace Analyzer and Collector usage guides. Until articles are added, please visit the Intel® Trace Analyzer and Collector product page. You can also view the documentation.
Intel® Trace Collector Filtering
Filtering in the Intel® Trace Collector will apply specified filters to the trace collection process. This directly reduces the amount of data collected. The filter rules can be applied either via command line arguments or in a configuration file (specified by the environment variable VT_CONFIG). Filters are evaluated in the order they are listed.
Filter Specification
Filters consist of three components: type, pattern, and body. The basic filter is of type STATE. The other types, SYMBOL and ACTIVITY, apply pattern replacements, but are otherwise identical to STATE.
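To make the ordered evaluation of filters concrete, here is a minimal conceptual sketch in Python. It is not the Intel® Trace Collector configuration syntax and does not call any Intel API. The `Filter` class, the `is_traced` function, the `ON`/`OFF` bodies, the shell-style wildcard matching, and the rule that a later matching filter overrides an earlier one are all assumptions made for illustration.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass
class Filter:
    """Hypothetical model of a filter: type, pattern, and body."""
    type: str     # "STATE", "SYMBOL", or "ACTIVITY" (not used for matching here)
    pattern: str  # shell-style wildcard compared against a function name
    body: str     # assumed to be "ON" or "OFF" in this sketch


def is_traced(name, filters, default=True):
    """Walk the filters in listed order; the last match decides (assumption)."""
    traced = default
    for f in filters:
        # Case-insensitive comparison is an assumption of this sketch.
        if fnmatchcase(name.lower(), f.pattern.lower()):
            traced = (f.body == "ON")
    return traced


# Hypothetical rule list: switch off all MPI_* states, then re-enable one.
rules = [
    Filter("STATE", "MPI_*", "OFF"),
    Filter("STATE", "MPI_Barrier", "ON"),
]

print(is_traced("MPI_Send", rules))     # matched only by the first rule
print(is_traced("MPI_Barrier", rules))  # matched by both; the later rule decides
print(is_traced("compute", rules))      # matched by no rule; default applies
```

Under these assumptions, the order of the list matters: swapping the two rules would leave `MPI_Barrier` switched off, because the broader `MPI_*` rule would then be evaluated last.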
New Contributed Code for Cilk™ Plus: DotMix, a Deterministic Parallel Random-Number Generator
Windows* 8 Tutorial: Writing a Multithreaded Application for the Windows Store* Using the Intel® Threading Building Blocks Library
It is well known that the Windows Store application API does not provide some common functions for working with threads, such as CreateThread and those that work with TLS (thread-local storage) keys. This is one more good opportunity to move your application development from thread-based parallelism to task-based parallelism. This post gives step-by-step instructions for writing an example that uses parallelism and can pass Windows App Certification Kit (WACK) validation.
Optimization of a Parallel Application for Multi-Core Environments
(This work was done by Vivek Lingegowda during his internship at Intel.)
