Blog post

Code Analysis of Hadoop-Based CF in Mahout 0.5

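Mahout's Taste framework is an implementation of collaborative filtering (CF) algorithms. It supports DataModel backends such as files, databases, and NoSQL stores, and it also supports Hadoop MapReduce. This post mainly analyzes the MapReduce-based implementation in Mahout 0.5.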

Authored by Last updated on 01/24/2019 - 16:00
Article

Hadoop 0.22.0 and Its RAID Deployment

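I have been using the 0.20.x series of Hadoop for almost a year, working mainly with HDFS. During that time I took part in deploying a Hadoop cluster (1 server + 20 PCs) and in analyzing the HDFS source code.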

Authored by Last updated on 01/24/2019 - 16:00
Blog post

Hadoop RPC Mechanism and Source Code Analysis

1. Basic Principles of RPC

Authored by Last updated on 07/03/2019 - 20:08
Blog post

Getting Started with Tachyon by Use Case

In-memory computing has become an irreversible trend in big data technology, and the wide popularity of Spark is good evidence of this. Meanwhile, memory storage and management for large data sets still pose challenges. Among the many available solutions, Tachyon, a memory-centric distributed storage system, addresses the problems faced in many application scenarios. For example, it avoids severe...
Authored by Last updated on 06/07/2019 - 16:00
Blog post

Intel® Data Analytics Acceleration Library

The Intel® Data Analytics Acceleration Library (Intel® DAAL) helps speed up big data analytics by providing highly optimized algorithmic building blocks for all data analysis stages (Pre-processing, Transformation, Analysis, Modeling, Validation, and Decision Making) for offline, streaming, and distributed analytics use. It’s designed for use with popular data platforms including Hadoop*, Spark*,...
Authored by James R. (Blackbelt) Last updated on 12/12/2018 - 18:00
Article

BigDL – Scale-Out Deep Learning on Apache Spark* Clusters

Summary
Authored by Sunny G. (Intel) Last updated on 03/11/2019 - 13:17
Video

Intel® Big Data and Analytics Software Overview

This video animation provides an overview of Intel® Software contributions to Big Data & Analytics.

Authored by IDZSupport K. Last updated on 01/16/2019 - 00:50
Article

Improving Big Data Performance and Flexibility in Containerized Environments

Authored by Michael G. (Intel) Last updated on 07/04/2019 - 11:02