Filters

Blog post

mahout 0.5 基于 hadoop 的 CF 代码分析

mahout的taste框架是协同过滤算法的实现。它支持DataModel,如文件、数据库、NoSQL存储等,也支持hadoop的MapReduce。这里主要分析mahout0.5中的基于MR的实现。

Authored by Last updated on 01/24/2019 - 16:00
Blog post

ubuntu 中安装 hadoop 记录

Hadoop 版本:hadoop-1.2.1-bin.tar

Jdk 版本:jdk-6u30-linux-i586

Authored by Last updated on 01/24/2019 - 16:00
Blog post

Looking at big data performance

My very first Intel blog entry!! Exciting!!
Authored by Eric Kaczmarek (Intel) Last updated on 06/14/2017 - 15:43
Blog post

Part #1 - Tuning Java Garbage Collection for HBase

Part #1 of a multi-parts post, we will take a look on how to tune Java garbage collection (GC) for HBase focusing on 100% YCSB reads. In part #2, we will look at 100% writes and finally in part #3, we will tune Java GC for a mix of 50/50 read/writes. As already mentioned, we are using YCSB which seems to be the de facto NoSQL workload. We wont go into much details on how to install, configure...
Authored by Eric Kaczmarek (Intel) Last updated on 06/14/2017 - 16:10
Blog post

Hands-on Hive-on-Spark in the AWS Cloud

by Brock Noland (Cloudera), Na Yang (MapR), and Rui Li (Intel)

 

Authored by Last updated on 06/14/2017 - 15:43
Blog post

Apache Spark* Innovation: Driving a Stronger Community Standard

This blog post was jointly written by Jiangang Duan, Jie Huang and Weihua Jiang (Intel), Alex Gutow (Cloudera), and Dale Kim (MapR)

 

Authored by Last updated on 03/11/2019 - 13:17
Blog post

Unlocking Big Data with Open Source Solutions – Intel® Chip Chat episode 368

Ziya Ma, Director of Big Data Technologies at Intel, stops by to talk about how open source solutions are enabli

Authored by Mike P. (Intel) Last updated on 06/14/2017 - 15:44
Blog post

Experience and Lessons Learned for Large-Scale Graph Analysis using GraphX

While GraphX provides nice abstractions and dataflow optimizations for parallel graph processing on top of Apache Spark*, there are still many challenges in app

Authored by Mike P. (Intel) Last updated on 06/14/2017 - 15:44
Blog post

Ceph Erasure Coding Introduction

Ceph introduction
Authored by Yuan Zhou (Intel) Last updated on 06/14/2017 - 15:45