Optimizing Spark Projects
Gearpump, the Real-Time Big Data Streaming Engine
Gearpump adds a key ingestion capability to TAP. It handles use cases that involve complicated workflows or require low-latency, fault-tolerant processing of many types of ingestion streams.
Large-Scale Graph Analysis using GraphX (PDF)
Read about the lessons learned while building real-world, large-scale graph analysis applications with GraphX for some of the largest organizations and websites in the world, including both algorithm-level and framework-level optimizations.
Innovation: Driving a Stronger Community Standard
Apache Spark complements the existing Hadoop ecosystem by adding easy-to-use APIs and data-pipelining capabilities to Hadoop data. Since its launch in 2009, Spark has seen over 400 contributors from more than 50 different companies.
StreamSQL on Spark (Video)
This presentation shows Intel's implementation of StreamSQL, built on the Spark Streaming and Catalyst modules, which lets SQL users work with stream processing easily. Find out what StreamSQL is and what benefits it offers.
Download the presentation (PDF)
Building Real-World Spark Applications (PDF)
Explore what we've learned about managing memory and networks, improving disk I/O, and optimizing computations in real-world Spark applications.