Filters

Blog post

mahout 0.5 基于 hadoop 的 CF 代码分析

mahout的taste框架是协同过滤算法的实现。它支持DataModel,如文件、数据库、NoSQL存储等,也支持hadoop的MapReduce。这里主要分析mahout0.5中的基于MR的实现。

Authored by Last updated on 01/24/2019 - 16:00
Article

开源 - OpenStack

OpenStack
Authored by Last updated on 07/13/2018 - 14:32
Article

Hadoop 0.22.0 及其 RAID 部署

        使用0.20.X系列版本的Hadoop快有一年时间了,主要集中在HDFS上。期间自己参与了部署Hadoop集群(1 Server + 20 PC),也参与了分析HDFS的源码。

Authored by Last updated on 01/24/2019 - 16:00
Blog post

ubuntu 中安装 hadoop 记录

Hadoop 版本:hadoop-1.2.1-bin.tar

Jdk 版本:jdk-6u30-linux-i586

Authored by Last updated on 01/24/2019 - 16:00
Blog post

Benefits of Intel® Enterprise class SSD

In this blog, I want share with you the benefits of the Intel® Enterprise Class Solid-State Drive (SSD).  I have compiled a list of articles, white papers, solution briefs, and blogs and provided l

Authored by Thai Le (Intel) Last updated on 07/04/2019 - 10:36
Blog post

Ceph Erasure Coding Introduction

Ceph introduction
Authored by Yuan Zhou (Intel) Last updated on 06/14/2017 - 15:45
Blog post

Sparking Real-Time Analytics, Igniting Real-Time Intelligence

At the Spark Summit in San Francisco, Michael Greene announced the release of 

Authored by Mike P. (Intel) Last updated on 06/14/2017 - 15:45
Article

Installing Apache Zeppelin* on Cloudera Distribution of Hadoop*

Apache Zeppelin* is a new web-based notebook that enables data-driven, interactive data analytics, and visualization with the added bonus of supporting multiple languages, including Python*, Scala*, Spark SQL, Hive*, Shell, and Markdown. Zeppelin also provides Apache Spark* integration by default, making use of Spark’s fast in-memory, distributed, data processing engine to accomplish data science...
Authored by Last updated on 06/07/2017 - 10:40