博客

mahout 0.5 基于 hadoop 的 CF 代码分析

mahout的taste框架是协同过滤算法的实现。它支持DataModel,如文件、数据库、NoSQL存储等,也支持hadoop的MapReduce。这里主要分析mahout0.5中的基于MR的实现。

作者: 最后更新时间: 2019/01/24 - 16:00
博客

ubuntu 中安装 hadoop 记录

Hadoop 版本:hadoop-1.2.1-bin.tar

Jdk 版本:jdk-6u30-linux-i586

作者: 最后更新时间: 2019/01/24 - 16:00
博客

Benefits of Intel® Enterprise class SSD

In this blog, I want share with you the benefits of the Intel® Enterprise Class Solid-State Drive (SSD).  I have compiled a list of articles, white papers, solution briefs, and blogs and provided l

作者: Thai Le (Intel) 最后更新时间: 2019/07/04 - 10:36
博客

Intel's baremetal provisioning patch for DevStack

OpenStack employs DevStack for integration testing and development purposes.

作者: Zhongyue Nah (Intel) 最后更新时间: 2019/07/06 - 17:10
博客

Experimenting with OpenStack* Sahara* on Docker* Containers

Docker* is an emerging technology that has become very popular recently in the market. It provides a flexible architecture to deploy applications. OpenStack* is another hot technology on the market. It has been available for several years, became more stable and also added more features support in recent releases.
作者: WEITING C. (Intel) 最后更新时间: 2019/07/06 - 17:10
博客

Ceph Erasure Coding Introduction

Ceph introduction
作者: Yuan Zhou (Intel) 最后更新时间: 2017/06/14 - 15:45
博客

Sparking Real-Time Analytics, Igniting Real-Time Intelligence

At the Spark Summit in San Francisco, Michael Greene announced the release of 

作者: Mike P. (Intel) 最后更新时间: 2017/06/14 - 15:45
博客

The JITter Conundrum - Just in Time for Your Traffic Jam

In interpreted languages, it just takes longer to get stuff done - I earlier gave the example where the Python source code a = b + c would result in a BINARY_ADD byte code which takes 78 machine instructions to do the add, but it's a single native ADD instruction if run in compiled language like C or C++. How can we speed this up? Or as the performance expert would say, how do I decrease...
作者: David S. (Blackbelt) 最后更新时间: 2019/07/04 - 20:00
博客

Getting Started with Tachyon by Use Cases

In-memory computing has become an irreversible trend in big data technology, for which the wide popularity of Spark provides a good evidence. Meanwhile, memory storage and management for large data sets are still posing challenges. Out of numerous solutions, Tachyon, a memory-centric distributed storage, well solves the problems faced by many application scenarios. For example, it avoids severe...
作者: 最后更新时间: 2019/06/07 - 16:01