Blog post

Ways to Speed up your Cloud Environment and Workload Performance on Intel® Architecture

Setting up a cloud environment is complicated and, depending on the need, involves multiple elements such as databases, network infrastructure, and security. How do you increase the p

Authored by Thai Le (Intel) Last updated on 07/04/2019 - 17:05
Blog post

Use HiBench as a representative proxy for benchmarking Hadoop applications

As any good engineer knows, “if you cannot measure it, you cannot improve it.” A representative benchmark suite is the key to measuring any computer system.

Authored by Jason Dai (Intel) Last updated on 07/03/2019 - 20:08
Blog post

Benefits of Intel® Enterprise class SSD

In this blog, I want to share with you the benefits of the Intel® Enterprise Class Solid-State Drive (SSD). I have compiled a list of articles, white papers, solution briefs, and blogs and provided l

Authored by Thai Le (Intel) Last updated on 07/04/2019 - 10:36
Video

Next Steps in Business Computing with Intel and SAP HANA

Technology consultant Rob Klopp brings us a look at the evolution of hardware capabilities from the age of physical mainframes through to the current age of virtual mainframes, where databases ha

Authored by admin Last updated on 07/04/2019 - 10:17
Blog post

Ceph Erasure Coding Introduction

Ceph introduction
Authored by Yuan Zhou (Intel) Last updated on 06/14/2017 - 15:45
Blog post

Restudy SchemaRDD in SparkSQL

At the very beginning, SchemaRDD was designed simply as an attempt to make life easier for developers in their daily routines of code debugging and unit testing on the SparkSQL core module. The idea boils down to describing the data structures inside an RDD using a formal description similar to a relational database schema. On top of all basic functions provided by common RDD APIs, SchemaRDD also...
Authored by Last updated on 06/14/2017 - 16:50
Blog post

Big Performance Gains for Big Data

Imagine two teams of data analysts working on the same goal: to extract usable business intelligence (BI) from massive, growing data sets.

Authored by Last updated on 01/28/2019 - 15:20
Article

Indexing DICOM* Images on Cloudera Hadoop* Distribution

This paper shows how to replicate the proof point: indexing DICOM images for storage, management, and retrieval on a Cloudera Hadoop* cluster, using open source software components.
Authored by Last updated on 02/22/2019 - 16:10
Blog post

Getting Started with Tachyon by Use Cases

In-memory computing has become an irreversible trend in big data technology, for which the wide popularity of Spark provides good evidence. Meanwhile, memory storage and management for large data sets still pose challenges. Among numerous solutions, Tachyon, a memory-centric distributed storage system, solves the problems faced in many application scenarios. For example, it avoids severe...
Authored by Last updated on 06/07/2019 - 16:01