博客

GCC x86 Performance Hints

      People say that GCC (GNU Compiler Collection) cannot generate effective code compared to other proprietary compilers. Is it a myth or reality?

作者: Evgeny Stupachenko (Intel) 最后更新时间: 2019/03/27 - 10:18
博客

Optimization of Data Read/Write in a Parallel Application

(This work was done by Vivek Lingegowda during his internship at Intel.)

作者: 最后更新时间: 2019/07/04 - 17:40
博客

“Why Should I Update GCC x86 Compiler?” or “GCC Compiler Performance on Intel® Atom™ from Version to Version”

      I’ll try to figure out what is new for Intel® Atom™ architecture in new versions of GCC and how this affects performance and code size on the well-known EEMBC CoreMark*  benchmark:

作者: Evgeny Stupachenko (Intel) 最后更新时间: 2019/03/27 - 10:21
博客

Introduction to OpenMP* on YouTube*

Tim Mattson (Intel) has authored an extensive series of excellent videos as in introduction to OpenMP*.

作者: Mike P. (Intel) 最后更新时间: 2019/07/04 - 19:51
Article

Analyzing Intel® SDE's TSX-related log data for capacity aborts

Starting with version 7.12.0, Intel® SDE has Intel® TSX-related instruction and memory access logging features which can be useful for debugging Intel® TSX's capacity aborts.

作者: 最后更新时间: 2019/07/06 - 10:52
博客

Dynamic Languages Take Over the Internet

The server world has really embraced Python in a big way. For example, the OpenStack project is a very popular Infrastructure as a Service offering, and most of it is written in Python. This makes Python a leader for Software Defined Infrastructure (SDI), Software Defined Storage (SDS) and Software Defined Networking (SDN).
作者: David S. (Blackbelt) 最后更新时间: 2019/07/04 - 19:43
博客

Core Challenge In Speeding Up Python, PHP, HHVM, Node.js...

A traditional compiler translates a high-level computer program into machine code for the CPU you want to run it on. An interpreted language translates a high-level language into the machine code for some imaginary CPU. For historical reasons, this imaginary CPU is called a "virtual machine" and its instructions are called "byte code." One advantage of this approach is development speed: creating...
作者: David S. (Blackbelt) 最后更新时间: 2019/07/04 - 20:00
博客

PGO: Let It Go (PHP)

We can hope that companies like Intel® will come along with a faster processor. (And this does tend to happen every year). Or we can improve our compilers to produce better machine code. Or we can analyze our own code and change it to run more optimally. For PHP, we do all three: We partner with the processor architects to improve the way they execute PHP; we look for changes we can make to the...
作者: David S. (Blackbelt) 最后更新时间: 2019/07/03 - 20:08
Article

Introducing DNN primitives in Intel® Math Kernel Library

Please notes: Deep Neural Network(DNN) component in MKL is deprecated since intel® MKL ​2019 and will be removed in the next intel® MKL Release.

作者: Vadim Pirogov (Intel) 最后更新时间: 2019/03/21 - 12:00
Article

The Inside Scoop on How We Accelerated NumPy Umath Functions

NumPy UMath Optimizations

作者: Andres G. (Intel) 最后更新时间: 2018/05/30 - 07:08