In interpreted languages, it simply takes longer to get work done. I earlier gave the example where the Python source code a = b + c results in a BINARY_ADD bytecode that takes 78 machine instructions to perform the add, whereas it is a single native ADD instruction when run in a compiled language like C or C++. How can we speed this up? Or, as the performance expert would say, how do I decrease...
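To see the bytecode behind that addition, the standard-library dis module can disassemble a small function. This is a minimal sketch: the function name add is only for illustration, and it assumes CPython, where versions up to 3.10 emit BINARY_ADD while 3.11 and later emit a generic BINARY_OP for the same source line.

```python
import dis


def add(b, c):
    # The statement from the example above, wrapped in a function
    # so that dis has a code object to inspect.
    a = b + c
    return a


# Collect the opcode names the interpreter executes for add().
opnames = [ins.opname for ins in dis.get_instructions(add)]

# Keep only the opcode that performs the addition. Which name appears
# depends on the CPython version (an assumption noted above).
add_ops = [name for name in opnames if name in ("BINARY_ADD", "BINARY_OP")]
print(add_ops)
```

Calling dis.dis(add) prints the full listing, including the loads and stores that surround the add.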
The Intel® Xeon Phi™ processor, code named Knights Landing, is part of the second generation of Intel Xeon Phi products. Knights Landing supports Intel® AVX-512 instructions, specifically AVX-512F (foundation), AVX-512CD (conflict detection), AVX-512ER (exponential and reciprocal) and AVX-512PF (prefetch).
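One way to check which of these four extensions a machine reports is to look at the CPU flags that Linux exposes. The sketch below assumes a Linux system where /proc/cpuinfo lists them under the flag names avx512f, avx512cd, avx512er and avx512pf; the helper names knl_avx512_support and read_cpu_flags are hypothetical, not part of any Intel tool.

```python
# Flag names as they appear in the "flags" line of /proc/cpuinfo on Linux.
KNL_AVX512_FLAGS = ("avx512f", "avx512cd", "avx512er", "avx512pf")


def knl_avx512_support(flags_line):
    """Map each Knights Landing AVX-512 flag to whether it is present."""
    present = set(flags_line.split())
    return {flag: flag in present for flag in KNL_AVX512_FLAGS}


def read_cpu_flags(path="/proc/cpuinfo"):
    """Return the flags of the first CPU listed, or "" if unavailable."""
    try:
        with open(path) as f:
            for line in f:
                if line.startswith("flags"):
                    return line.split(":", 1)[1]
    except OSError:
        pass  # Not Linux, or the file is not readable.
    return ""


if __name__ == "__main__":
    for flag, ok in knl_avx512_support(read_cpu_flags()).items():
        print(f"{flag}: {'yes' if ok else 'no'}")
```

On a processor without AVX-512, or on a non-Linux system, every entry is reported as absent.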