Blog post

Ways to Speed up your Cloud Environment and Workload Performance on Intel® Architecture

Setting up a cloud environment is complicated, and it involves multiple elements such as database, network infrastructure, security, etc., (depending on the need).  How do you increase the p

Authored by Thai Le (Intel) Last updated on 07/04/2019 - 17:05
Blog post

mahout 0.5 基于 hadoop 的 CF 代码分析

mahout的taste框架是协同过滤算法的实现。它支持DataModel,如文件、数据库、NoSQL存储等,也支持hadoop的MapReduce。这里主要分析mahout0.5中的基于MR的实现。

Authored by Last updated on 01/24/2019 - 16:00
Blog post

Experimenting with OpenStack* Sahara* on Docker* Containers

Docker* is an emerging technology that has become very popular recently in the market. It provides a flexible architecture to deploy applications. OpenStack* is another hot technology on the market. It has been available for several years, became more stable and also added more features support in recent releases.
Authored by WEITING C. (Intel) Last updated on 07/06/2019 - 17:10
Blog post

Unlocking Big Data with Open Source Solutions – Intel® Chip Chat episode 368

Ziya Ma, Director of Big Data Technologies at Intel, stops by to talk about how open source solutions are enabli

Authored by Mike P. (Intel) Last updated on 06/14/2017 - 15:44
Blog post

Ceph Erasure Coding Introduction

Ceph introduction
Authored by Yuan Zhou (Intel) Last updated on 06/14/2017 - 15:45
Blog post

Restudy SchemaRDD in SparkSQL

At the very beginning, SchemaRDD was just designed as an attempt to make life easier for developers in their daily routines of code debugging and unit testing on SparkSQL core module. The idea can boil down to describing the data structures inside RDD using a formal description similar to the relational database schema. On top of all basic functions provided by common RDD APIs, SchemaRDD also...
Authored by Last updated on 06/14/2017 - 16:50
Blog post

Intel’s Big Data Solutions Team Goes Open Source

Starting today, the Big Data Solutions engineering team at Intel is excited to announce a move from developing almost exclusively internally to doing the majority of our development, documentation and issue tracking in public repositories on GitHub.com.
Authored by Last updated on 06/14/2017 - 16:44
Blog post

Python Brings Us the LIGO Gravity Wave Sound

 

Authored by David S. (Blackbelt) Last updated on 07/04/2019 - 19:22
Blog post

Unleash the Parallel Performance of Python* Programs

[updated 10/5/2018]

Authored by Anton Malakhov (Intel) Last updated on 10/05/2018 - 18:24
Blog post

My Experience from Spark Summit

Spark Summit is a professional conference which usually has in attendance thousands of developers, scientist, analysts, researchers and executives from all over the world. At the conference, attendees come together to understand how big data, machine learning and data science could deliver new insights. The 2016 mid-year event of Spark Summit concluded today in San Francisco, California. Now the...
Authored by Last updated on 03/05/2019 - 23:48