Blog post

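Analysis of the Hadoop-based CF Code in Mahout 0.5

Mahout's Taste framework is an implementation of collaborative filtering (CF) algorithms. It supports DataModel backends such as files, databases, and NoSQL stores, and it also supports Hadoop MapReduce. This post mainly analyzes the MapReduce-based implementation in Mahout 0.5.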

Last updated on 01/24/2019 - 16:00
Blog post

Hadoop RPC Mechanism and Source Code Analysis

1. Basic Principles of RPC

Last updated on 07/03/2019 - 20:08
Article

Tutorial for Intel® DAAL : Using Simple Java* Examples

System Environment

Intel® DAAL version : 2016 Gold Initial Release (w_daal_2016.0.110.exe)

OS : Windows 8.1

Authored by JON J K. (Intel) Last updated on 07/06/2019 - 11:41
Blog post

The JITter Conundrum - Just in Time for Your Traffic Jam

In interpreted languages, it just takes longer to get stuff done - I earlier gave the example where the Python source code a = b + c would result in a BINARY_ADD byte code which takes 78 machine instructions to do the add, but it's a single native ADD instruction if run in a compiled language like C or C++. How can we speed this up? Or as the performance expert would say, how do I decrease...
Authored by David S. (Blackbelt) Last updated on 07/04/2019 - 20:00
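The byte code mentioned in this teaser can be inspected with Python's standard `dis` module. The following is a minimal sketch, assuming CPython; the function names `add` and `add_opnames` are hypothetical and chosen only for illustration. Older CPython releases emit `BINARY_ADD` for `b + c`, while CPython 3.11 and later emit a generic `BINARY_OP` instead, so the filter accepts both names.

```python
import dis


def add(b, c):
    # The statement from the teaser: a single source-level addition.
    a = b + c
    return a


def add_opnames(func):
    """Return the names of the byte code instructions that perform the addition."""
    return [
        ins.opname
        for ins in dis.get_instructions(func)
        # BINARY_ADD on older CPython, BINARY_OP on CPython 3.11 and later.
        if ins.opname in ("BINARY_ADD", "BINARY_OP")
    ]


if __name__ == "__main__":
    # Prints the opcode name your interpreter uses for the addition.
    print(add_opnames(add))
```

The sketch only shows which byte code instruction is generated; it does not measure how many machine instructions the interpreter executes for it.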
Blog post

Getting Started with Tachyon by Use Cases

In-memory computing has become an irreversible trend in big data technology, and the wide popularity of Spark is good evidence of it. Meanwhile, memory storage and management for large data sets still pose challenges. Among the many solutions, Tachyon, a memory-centric distributed storage system, addresses the problems faced by many application scenarios. For example, it avoids severe...
Last updated on 06/07/2019 - 16:01
Video

Building Faster Data Applications on Spark* Clusters Using Intel® Data Analytics Acceleration Library

Apache Spark* is an open-source cluster computing framework that’s widely popular for big data processing applications.

Authored by admin Last updated on 02/12/2018 - 15:30
Blog post

Getting Started with Tachyon by Use Cases (Chinese version)

In-memory computing has become an irreversible trend in big data technology, and the wide popularity of Spark is good evidence of it. Meanwhile, memory storage and management for large data sets still pose challenges. Among the many solutions, Tachyon, a memory-centric distributed storage system, addresses the problems faced by many application scenarios. For example, it avoids severe...
Last updated on 06/07/2019 - 16:00
Article

OpenStack App Developer Survey

As part of a long-term commitment to enhance ease-of-use, the OpenStack UX project, with support of the OpenStack Foundation and the Technical Committee, is now bu

Authored by Mike P. (Intel) Last updated on 06/07/2017 - 12:14
Article

BigDL – Scale-out Deep Learning on Apache Spark* Cluster

Learn how to install and use BigDL for training and testing some of the commonly used deep neural network models on Apache Spark.
Authored by Sunny G. (Intel) Last updated on 03/11/2019 - 13:17
Article

BigDL – Scale-out Deep Learning on Apache Spark* Cluster (Chinese version)

Summary of Key Points
Authored by Sunny G. (Intel) Last updated on 03/11/2019 - 13:17