Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Last updated: 2019/07/31 - 14:30
Article

Fast Gathering-based SpMxV for Linear Feature Extraction

This algorithm can be used to improve sparse matrix-vector and matrix-matrix multiplication in any numerical computation. Many High Performance Computing applications involve semi-sparse matrix computation. Semi-sparse matrices are also very common in the low-level engines of popular perceptual computing workloads, especially speech and facial recognition....
Last updated: 2018/12/12 - 18:00
Blog

Three Pieces of Advice for Code Modernization Success

What three code modernization techniques would I suggest to help a programmer improve the execution performance of her code? With too many specific things to choose from, these are three recommendations for any programmer anywhere and anytime.
Author: Clay B. (Blackbelt) Last updated: 2018/12/12 - 18:08
Article

Putting Your Data and Code in Order: Optimization and Memory – Part 1

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Author: David M. Last updated: 2018/12/12 - 18:00
Article

Putting Your Data and Code in Order: Optimization and Memory – Part 1 (Simplified Chinese edition)

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Author: David M. Last updated: 2018/12/12 - 18:00
Blog

Can You Write a Vectorized Reduction Operation?

I can. And if you read this post, you will be able to write one, too. (Might be a cool party trick or a sucker bet to make a little cash.)
Author: Clay B. (Blackbelt) Last updated: 2018/12/12 - 18:08
Video

Best Practices in Vector Programming

Intel® AVX-512 is the new instruction set extension for SIMD. Robert shares how it offers unique benefits, especially for financial applications.

Last updated: 2018/12/12 - 18:00
Article

Fast Computation of Adler32 Checksums

Adler32 is a common checksum used for checking the integrity of data in applications such as zlib*, a popular compression library. In this paper we show how the vector processing capabilities of Intel® Architecture Processors can be exploited to efficiently compute the Adler32 checksum.
Author: James Guilford (Intel) Last updated: 2018/12/12 - 18:00
File Wrapper

Parallel Universe Magazine - Issue 19, September 2014

Author: Admin Last updated: 2018/12/12 - 18:08