Filters

Blog post

Benefits of Intel® Enterprise class SSD

In this blog, I want share with you the benefits of the Intel® Enterprise Class Solid-State Drive (SSD).  I have compiled a list of articles, white papers, solution briefs, and blogs and provided l

Authored by Thai Le (Intel) Last updated on 07/04/2019 - 10:36
Blog post

Ceph Erasure Coding Introduction

Ceph introduction
Authored by Yuan Zhou (Intel) Last updated on 06/14/2017 - 15:45
Blog post

Sparking Real-Time Analytics, Igniting Real-Time Intelligence

At the Spark Summit in San Francisco, Michael Greene announced the release of 

Authored by Mike P. (Intel) Last updated on 06/14/2017 - 15:45
Article

Installing Apache Zeppelin* on Cloudera Distribution of Hadoop*

Apache Zeppelin* is a new web-based notebook that enables data-driven, interactive data analytics, and visualization with the added bonus of supporting multiple languages, including Python*, Scala*, Spark SQL, Hive*, Shell, and Markdown. Zeppelin also provides Apache Spark* integration by default, making use of Spark’s fast in-memory, distributed, data processing engine to accomplish data science...
Authored by Last updated on 06/07/2017 - 10:40
Article

Indexing DICOM* Images on Cloudera Hadoop* Distribution

This paper show how to replicate the proof point, to index DICOM images for storage, management, and retrieval on a Cloudera Hadoop* cluster, using open source software components.
Authored by Last updated on 02/22/2019 - 16:10
Blog post

Getting Started with Tachyon by Use Cases

In-memory computing has become an irreversible trend in big data technology, for which the wide popularity of Spark provides a good evidence. Meanwhile, memory storage and management for large data sets are still posing challenges. Out of numerous solutions, Tachyon, a memory-centric distributed storage, well solves the problems faced by many application scenarios. For example, it avoids severe...
Authored by Last updated on 06/07/2019 - 16:01
Article

Intel® MPI Library: Supporting the Hadoop* Ecosystem

For decades, MPI has dominated as the model to use in distributed calculations.

Authored by Mike P. (Intel) Last updated on 06/07/2017 - 10:37
Article

How to Install the Python* Version of Intel® Data Analytics Acceleration Library (Intel® DAAL) in Linux*

The Intel® Data Analytics Acceleration Library (Intel® DAAL) 1, 2 is a software solution for data analytics. It provides building blocks for data preprocessing, transformation, modeling, predicting, and so on.
Authored by Nguyen, Khang T (Intel) Last updated on 07/05/2019 - 19:05