The Intel® Xeon Phi™ Coprocessor is designed for highly parallel applications that demand high performance.
The Simple Optimizations sample demonstrates simple ways to measure the performance of OpenCL™ kernels in an application. It covers the basics of profiling and important caveats, such as the need for a dedicated “warming” run. It also demonstrates several simple optimizations: some are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Introduction A Brief History of Quicksort
General Matrix Multiply
Introduction A Brief Course in the History of the Quicksort Algorithm