| Link/Url | Tags |
|---|---|
| Implementing IIR Filters with the Intel® AVX Instruction Set for Complex Floating-Point Numbers [Knowledgebase] Introduction This article describes the implementation of complex filters with infinite impulse response ... Posted: 2008-10-28 17:23:57 by | simd, IPP, signal processing, SSE, БИХ, filter, Intel AVX |
| Levels of concurrency: an example using both instruction and thread level parallelism [Videos] Dr. Michael Wrinn, Senior Course Architect with Intel(R) Software College, opens up his series on topics in parallelism with a discussion on mapping between software strategies for concurrency and the ... Posted: 2007-03-02 19:25:54 by Michael Wrinn (Intel) | tools, wrinn, simd, technology, Multithreading, software, intel, multithread, threading |
| Using SSE3 Technology in Algorithms with Complex Arithmetic [Knowledgebase] Introduction This paper demonstrates the benefits of Streaming SIMD Extensions 3 and Hyper-Threading Technology when implementing complex arithmetic. Both of these features are available in Inte ... Posted: 2009-07-31 19:12:16 by | simd, SSE3, coding, Code, Hyper-Threading Technology, physics, visual computing |
| Matrix Vector Multiplication and Multi-threading Benefits [Knowledgebase] by Kiefer Kuah Abstract Dual-core and Quad-core processors are fast becoming the staple in desktop and mobile computing. In this article, performance data comparing single-threaded and multi-thr ... Posted: 2009-01-14 16:52:12 by Kiefer Kuah (Intel) | simd, C/C++, Multi-threading |
| How to Manipulate Data Structure to Optimize Memory Use on 32-Bit Intel® Architecture [Knowledgebase] Challenge Improve memory utilization by manipulating data-structure layout. For certain algorithms, like 3D transformations and lighting, there are two basic ways of arranging the vertex data. Th ... Posted: 2008-12-10 13:08:52 by | simd, optimization |
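| Optimizing the Rendering Pipeline of Animated Models Using the Intel Streaming SIMD Extensions [Knowledgebase] Introduction The following article describes the individual routines used in the rendering pipeline and how to optimize them with the Intel Streaming SIMD Extensions. By J.M.P. van Waveren, Id Software, Inc. Currently, ... Posted: 2008-03-28 21:59:39 by | simd, specular map, visual computing |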
| Vectorization with the Intel® Compilers (Part I) [Knowledgebase] by Aart J.C. Bik Introduction Many general-purpose microprocessors today feature multimedia extensions that support SIMD (single-instruction-multiple-data) parallelism on relatively short vector ... Posted: 2008-12-15 09:48:29 by Aart Bik | simd, Code |
| Flush to Zero Mode in Fortran on Intel® Architecture [Knowledgebase] Introduction This article briefly covers the background and use of the "flush-to-zero" (FTZ) or abrupt-underflow settings for Streaming SIMD Extensions (SSE/SSE2) instructions on IA-32 and floating-p ... Posted: 2008-10-17 12:54:45 by | simd |
| Creating a Particle System with Streaming SIMD Extensions [Knowledgebase] by William Damon Introduction Smoke, fire, water spray, dust, and more all have something in common. They all benefit from a good particle system. For the uninitiated, a particle system is esse ... Posted: 2010-01-28 08:48:26 by William Damon (Intel) | simd, Smoke, visual effects, physics, visual computing |
| How to Use Intrinsics [Knowledgebase] by Joseph D. Wieber, Jr and Gary M. Zoppetti Introduction In a previous article, we introduced the intrinsics provided by the Intel® C++ Compiler. Intrinsics allow a programmer to utilize si ... Posted: 2008-12-10 08:06:58 by | C++, simd |
| Fast SIMD Integer Move for the Intel® Pentium® 4 Processor [Knowledgebase] Introduction Several instructions are available on the Intel® Pentium® 4 Processor for moving integer data between SIMD registers. However, it may be more beneficial to use other instructions as a ... Posted: 2008-10-20 08:47:48 by | simd |
| x87 and SSE Floating Point Assists in IA-32: Flush-To-Zero (FTZ) and Denormals-Are-Zero (DAZ) [Knowledgebase] Introduction This document details the difference between how assists are handled with x87 and Single Instruction Multiple Data (SIMD) instructions, and gives information on how to change their behav ... Posted: 2008-10-17 12:32:19 by Shawn Casey (Intel) | simd, SSE2, SSE, Code |
| Fast Color Conversion Using Streaming SIMD Extensions and MMX™ Technology [Knowledgebase] Datatype color conversions are a common requirement in 3-D application pipelines. In a simple lighting scheme, these conversions happen at least once per color channel, red, green, blue (R, G, B) per ... Posted: 2009-06-18 18:04:59 by | simd, global illumination, visual computing |
| Avoiding the Cost of Branch Misprediction [Knowledgebase] Introduction by Rajiv Kapoor Today's modern processors derive much of their performance by executing instructions in parallel and before the time when their results are actually needed. When a seque ... Posted: 2009-02-20 21:02:38 by Rajiv Kapoor (Intel) | simd |
| Is there an IPP function to detect the processor type? [Knowledgebase] In Intel IPP v6.0, there is a new function named ippGetCpuFeatures() that can be used to detect your processor features. It is declared in ippcore.h. This function retrieves CPU features like those re ... Posted: 2010-01-26 14:47:44 by Ying Song (Intel) | simd, IPP, SSE, Detect Intel CPU, ippGetCpuFeatures, ippGetCpuType |
| High Clocks Per Instruction Retired when vectorizing the loop. [Knowledgebase] Introduction Sometimes when we vectorize a loop, we get a high Clocks Per Instruction Retired (CPI) value. This happens when there is high bus utilization and the bus gets saturated. The subtrac ... Posted: 2009-07-14 03:49:00 by | simd, SSE2, SSE3, SSE4, SSE, High CPI, Vectorizer, hardware prefetcher, SSE1, Memoray latency, BUS Saturation, Vtune |
| Understanding CPU Dispatching in the Intel® IPP Library [Knowledgebase] The Intel IPP library contains a collection of functionally identical processor-specific optimized libraries that are “dispatched” at run-time. The “dispatcher” chooses which of these processo ... Posted: 2010-02-04 17:33:27 by Ying Song (Intel), Paul Fischer (Intel) | simd, IPP, AES, SSE, CPU dispatch, CPU-specific code, dispatch |
| SIMD HK_CONFIG_SIMD [Forums] Defining HK_CONFIG_SIMD to 1 and adding const hkQuadReal hkQuadReal0001 = {0,0,0,1}; compiles and links; hkVector4 is 16-byte aligned; so even though libraries won't use SIMD, my client code will an ... Posted: 2008-12-04 18:26:12 by migdalskiy | simd |
| The Intel® AVX Realization of Lanczos interpolation in Intel® IPP 2D Resize Transform [Knowledgebase] Download PDF Download The Intel® AVX Realization of Lanczos interpolation in Intel® IPP 2D Resize Transform [PDF 174KB] Introduction This paper presents the interpolation algorithm base ... Posted: 2010-01-28 14:51:11 by | simd, SSE, ippiResize, AVX, image processing, ipp filter, AVX optimization in IPP |
| Identifying JVM SIMD and SSE Usage with the VTune™ Performance Analyzer [Knowledgebase] by Levent Akyil Leveraging SIMD and SSE (Streaming SIMD Extensions) support available on target processors is one of the key optimization techniques JVMs use (or should use). The question is how to i ... Posted: 2010-02-26 12:35:50 by Levent (Intel) | java, simd, JVM, SSE, Vtune |
| Making XML schema validation fast [Blogs] Last week I promised to talk more about what our team in PRC is doing. So in this post I have a little bit of a challenge for you XML pioneers out there. If you could define new CPU instructions ... Posted: 2008-04-30 22:29:53 by Kenneth Graf (Intel) | simd, xml, STTNI |
| Intel® Integrated Performance Primitives 6.1 – Code Samples [Knowledgebase] To help jumpstart your application development we provide free code samples that illustrate the use of the Intel® Integrated Performance Primitives (Intel® IPP) in real applications. The Intel IPP ... Posted: 2010-02-21 15:45:45 by Paul Fischer (Intel) | simd, IPP, SSE, h.264, jpg, samples |
| How Special Silicon Facilitates Parallel Arithmetic [Knowledgebase] One of the most effective forms of parallelization is found deep inside Intel® x86 processors: the ability to execute parallel calculations with a single instruction. You can do this manually or let ... Posted: 2009-06-18 18:31:49 by | simd, Multi-thread apps for Multi-Core, SSE, game development, physics, visual computing |
| Theoretical Maximum Throughput on Xeon 5500 Nehalem [Forums] Hi guys. Lots of great people on this forum, so I'm certain someone can help me answer this question. What I'm not sure of is if it has been discussed before, or if I'm even in the right forum. Obvious ... Posted: 2009-05-22 19:54:46 by gallomimia | simd, memory, vector, Nehalem, throughput |
| Instruction Level Lock-step Parallelism on desert islands [Blogs] I told myself that all my posts here would be crosstaggable to both academic and multi-core, but Henry Neeman has got me all riled up. Half this blog will NOT be about multi-core, but half will; it ... Posted: 2009-05-28 14:58:17 by wolfmurphy | simd, GPU, adventure, curricula, workshops, "Steven Colbert", Hamlet, Macbeth, nomenclature |
| fabs() and SIMD [Forums] Hello, I've been searching over the web and also Intel forums and I cannot find a version of fabs() function in SIMD.... I have a __m128 x variable and I want to get the absolute value of the 4 float v ... Posted: 2009-06-16 08:36:14 by threethazz | simd, fabs, __m128 |
| I Cuda written more, but unlike Polonius, I actually will be brief [Blogs] Thanks to Wen-Mei for a delightful chat. I found your site, http://courses.ece.illinois.edu/ece498/al/, with the curricula for your Programming Massively Parallel Processors course. This curricula fos ... Posted: 2009-06-16 10:58:57 by wolfmurphy | simd, "instruction level lock-step parallelism", grad students |
| 2. SIMD & MIMD [Forums] ATTENTION! Submissions of "Explanations" are closed. A huge thank-you to everyone for taking part in the contest. The judges have already gathered their ... Posted: 2009-07-02 07:20:56 by Dmitry Oganezov (Intel) | simd, конкурс, MIMD |
| SIMD Programming with Larrabee part II of II [Videos] Doug Binks, Intel Senior Applications Engineer, presents at SIGGRAPH on the SIMD Programming Instructions for Larrabee. This short, focused, technical session presents some of the key features of La ... Posted: 2009-08-06 10:09:36 by Elizabeth Vice (Intel), Amy Barton (Intel), Jerry Makare (Intel), venkatrajaraman | simd, siggraph, Larabee, SIGGRAPH09, SIMD Programming, Instruction sets, graphics animations, SIGGRAPH 2009 |
| GDC09: Why CPUs are important to game developers [Blogs] During his yesterday's tech session Leigh Davies shared several useful tips with the audience how game developers can benefit from current and future micro processors. As I have blogged already, es ... Posted: 2009-08-19 05:45:31 by Michael J Huelskoetter | Events, Game Developers, simd, conferences, cpu, leigh davies, Core i7, gdc, GDC 2009 |
| Using Intel® Streaming SIMD Extensions and Intel® Integrated Performance Primitives to Accelerate Algorithms [Knowledgebase] By Sam Siewert Algorithm Acceleration Using Single Instruction Multiple Data Computing architecture can be described at the highest level using Flynn's architecture classification scheme as single i ... Posted: 2009-12-21 10:09:12 by | simd, IPP, visual computing, Intel IPP, image processing, Intel SSE |
| Use Intel® IPP for Increased Performance on Intel or Compatible AMD Processors [Knowledgebase] Because the Intel® IPP library is optimized for Intel and compatible processors, in 32-bit and 64-bit versions, performance on Intel or AMD processors should be improved when using the Intel® IPP li ... Posted: 2010-01-28 15:25:55 by Ying H (Intel) | simd, SSE, amd support, IPP cpu optimization |
| IPP Dispatcher Control Functions - ipp*Init*() functions [Knowledgebase] Are you confused by the various ipp*Init*() functions? So was I. So I asked the Intel IPP engineering team for some clarification, and this is what I found. The Intel® IPP Dispatcher One of the mos ... Posted: 2010-02-04 11:18:59 by Paul Fischer (Intel) | simd, IPP, SSE, AVX, init, dispatch |
| Threading and the Intel® IPP Library - part 1 of 3 [Blogs] Introduction to Threading in IPP There is no universal threading solution that works for every application. Fortunately the Intel® Integrated Performance Primitives (the Intel IPP library) are desig ... Posted: 2009-12-18 14:43:46 by Paul Fischer (Intel) | simd, OpenMP, IPP, SSE, Multi-threading, Parallel Programming |
| Threading and the Intel® IPP Library – part 2 of 3 [Blogs] Threading Choices for Your Intel IPP Application Source code for some multi-threaded IPP application examples are included in the free sample downloads. Several of these examples implement threading ... Posted: 2009-12-18 15:13:26 by Paul Fischer (Intel) | simd, OpenMP, IPP, SSE, Parallel Programming |
| Threading and the Intel® IPP Library – part 3 of 3 [Blogs] OpenMP Threading and Intel IPP The low-level primitives within the IPP library generally represent basic atomic operations. This limits threading within the library to ~15-20% of the primitives. Inte ... Posted: 2009-12-18 15:46:57 by Paul Fischer (Intel) | simd, OpenMP, IPP, SSE, Multi-threading, Parallel Programming |
| JPEG Bug Fix Details [Knowledgebase] In reply to this forum thread http://software.intel.com/en-us/forums/intel-integrated-performance-primitives/topic/69755/DPD200130124 - JPEG color conversion function request Added support for RGBA as ... Posted: 2010-01-08 16:58:54 by Paul Fischer (Intel) | simd, bug, IPP, SSE |
| Threading and Intel® Integrated Performance Primitives [Knowledgebase] Abstract There is no universal threading solution that works for all applications. Likewise, there are multiple ways for applications built with Intel® Integrated Performance Primitives (Intel® IPP ... Posted: 2010-03-19 11:57:12 by | simd, OpenMP, MMX, threading, Intel IPP, Integrated Performance Primitives, DSP |
| 12,000 IPP Functions! Where to begin? (part 1) [Blogs] Your first encounter with the Intel® IPP library can be overwhelming, due to the number of functions contained within. At Intel we sometimes even "proudly" make statements like "over 12,000 functions ... Posted: 2010-02-01 22:47:10 by Paul Fischer (Intel) | simd, IPP, library, SSE, MMX, Integrate Performance Primitives |
| Intel® IPP 6.1 Library Getting Started [Knowledgebase] Please see the following links for the latest information regarding the Intel IPP library: Intel IPP Main Product Page Intel IPP 6.1 Library Release Notes Intel IPP 6.1 Library Installation Guide ... Posted: 2010-03-09 16:55:17 by Paul Fischer (Intel) | simd, IPP, SSE, release notes, readme |
| TickerTape Part 2 [Knowledgebase] Download PDF (PDF | XMB) Introduction Ticker Tape is a tech demo that encourages developers to implement more complex behavior in particle systems. The demo uses a number of techniques to improve ... Posted: 2010-03-09 13:21:11 by | simd, SSE, game development, visual computing, Ticker Tape, TickerTape, Ticker Tape Demo |