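Blog

Code Analysis of Hadoop-Based CF in Mahout 0.5

Mahout's Taste framework is an implementation of collaborative filtering algorithms. It supports several DataModel backends, such as files, databases, and NoSQL stores, and it also supports Hadoop MapReduce. This post mainly analyzes the MapReduce-based implementation in Mahout 0.5.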

Author: (not listed) Last updated: 2019/01/24 - 16:00
Blog

Notes on Installing Hadoop on Ubuntu

Hadoop version: hadoop-1.2.1-bin.tar

JDK version: jdk-6u30-linux-i586

Author: (not listed) Last updated: 2019/01/24 - 16:00
Blog

Optimizing Big Data processing with Haswell 256-bit Integer SIMD instructions

Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most of the 128-bit integer SIMD instructions to 256 bits.

Author: gaston-hillar (Blackbelt) Last updated: 2019/07/06 - 17:00
Blog

Experimenting with OpenStack* Sahara* on Docker* Containers

Docker* is an emerging technology that has recently become very popular. It provides a flexible architecture for deploying applications. OpenStack* is another hot technology. It has been available for several years, and recent releases have become more stable and added support for more features.
Author: WEITING C. (Intel) Last updated: 2019/07/06 - 17:10
Blog

Restudy SchemaRDD in SparkSQL

At the very beginning, SchemaRDD was designed simply as an attempt to make life easier for developers in their daily routines of code debugging and unit testing on the SparkSQL core module. The idea boils down to describing the data structures inside an RDD using a formal description similar to a relational database schema. On top of all the basic functions provided by the common RDD APIs, SchemaRDD also...
Author: (not listed) Last updated: 2017/06/14 - 16:50
Blog

The JITter Conundrum - Just in Time for Your Traffic Jam

In interpreted languages, it just takes longer to get stuff done. I earlier gave the example where the Python source code a = b + c results in a BINARY_ADD bytecode that takes 78 machine instructions to do the add, whereas it is a single native ADD instruction in a compiled language like C or C++. How can we speed this up? Or as the performance expert would say, how do I decrease...
Author: David S. (Blackbelt) Last updated: 2019/07/04 - 20:00
Blog

How Moscow Institute of Physics and Technology Rocketed the Development of Hypersonic Vehicles

The Moscow Institute of Physics and Technology (MIPT) Laboratory is focused on futuristic vehicles such as airplanes and spacecraft that travel at high speeds.

Author: Sally Sams (Intel) Last updated: 2019/03/21 - 12:00
Blog

Python Brings Us the LIGO Gravity Wave Sound


Author: David S. (Blackbelt) Last updated: 2019/07/04 - 19:22
Blog

Doubling the Performance of OpenStack Swift with No Code Changes

My current gig is mostly about performance. I manage a group of software engineers dedicated to the languages that are becoming really important to the cloud and the datacenter.

Author: David S. (Blackbelt) Last updated: 2019/07/06 - 17:10
Blog

Big Datasets from Small Experiments

Author: Andrey Vladimirov Last updated: 2019/07/04 - 18:46