This blog post was jointly written by Jiangang Duan, Jie Huang, and Weihua Jiang (Intel), Alex Gutow (Cloudera), and Dale Kim (MapR).
Apache Spark* (http://spark.apache.org/) is a fast and general engine for large-scale data processing.
Radhika Rangarajan, Engineering Program Manager at Intel, discusses distributed machine learning on Apache Spark*.
This video shows how the Similarity API analyzes items in a data set and uses "memory voting" to find similar items.
Nervana has joined Intel