Article

Installing Apache Zeppelin* on Cloudera Distribution of Hadoop*

Apache Zeppelin* is a new web-based notebook that enables data-driven, interactive data analytics, and visualization with the added bonus of supporting multiple languages, including Python*, Scala*, Spark SQL, Hive*, Shell, and Markdown. Zeppelin also provides Apache Spark* integration by default, making use of Spark’s fast in-memory, distributed, data processing engine to accomplish data science...
Authored by Last updated on 06/07/2017 - 10:40
Article

Accelerating Secondary Genome Analysis Using Intel® Reference Architecture

The dramatic reduction in whole human genome sequencing costs, from USD 100 million per genome in 2001 to USD 4,500 per genome in 2014, combined with the increasing performance gains in computing t

Authored by Mike P. (Intel) Last updated on 07/06/2019 - 16:40
Article

Caffe* Training on Multi-node Distributed-memory Systems Based on Intel® Xeon® Processor E5 Family

Caffe is a deep learning framework developed by the Berkeley Vision and Learning Center (BVLC) and one of the most popular community frameworks for image recognition. Caffe is often used as a benchmark together with AlexNet*, a neural network topology for image recognition, and ImageNet*, a database of labeled images.
Authored by Gennady F. (Blackbelt) Last updated on 07/05/2019 - 14:54
Article

Indexing DICOM* Images on Cloudera Hadoop* Distribution

This paper show how to replicate the proof point, to index DICOM images for storage, management, and retrieval on a Cloudera Hadoop* cluster, using open source software components.
Authored by Last updated on 02/22/2019 - 16:10
Article

Live Webinar: Boost Python* Performance with Intel® Math Kernel Library

Python* is a popular open-source scripting language known for its easy-to-learn syntax and active developer community.
Authored by Mike P. (Intel) Last updated on 06/07/2017 - 10:28
Article

How to Install the Python* Version of Intel® Data Analytics Acceleration Library (Intel® DAAL) in Linux*

The Intel® Data Analytics Acceleration Library (Intel® DAAL) 1, 2 is a software solution for data analytics. It provides building blocks for data preprocessing, transformation, modeling, predicting, and so on.
Authored by Nguyen, Khang T (Intel) Last updated on 07/05/2019 - 19:05
Article

Free access to Intel® Compilers, Performance libraries, Analysis tools and more...

Intel® Parallel Studio XE is a very popular product from Intel that includes the Intel® Compilers, Intel® Performance Libraries, tools for analysis, debugging and tuning, tools for MPI and the Intel® MPI Library. Did you know that some of these are available for free? Here is a guide to “what is available free” from the Intel Parallel Studio XE suites.
Authored by admin Last updated on 03/21/2019 - 12:00
Article

Chain of Things Solar Case Study: Blockchain+IoT Security

With partners ElectriCChain, SolarCoin, Solcrypto, Bitseed, IOTA, and RWE, the first Chain of Things Case Study Event focused on the secure logging of solar energy production data to a distributed blockchain ledger. Through multiple case studies, Chain of Things will develop a clearer picture of what an optimal IoT+blockchain stack should include and will determine if blockchain technology...
Authored by Last updated on 07/10/2018 - 08:00
Article

Using Intel® Data Analytics Acceleration Library to Improve the Performance of Naïve Bayes Algorithm in Python*

This article discusses machine learning and describes a machine learning method/algorithm called Naïve Bayes (NB) [2]. It also describes how to use Intel® Data Analytics Acceleration Library (Intel® DAAL) [3] to improve the performance of an NB algorithm.
Authored by Nguyen, Khang T (Intel) Last updated on 07/06/2019 - 16:40
Article

Manage Deep Learning Networks with Caffe* Optimized for Intel® Architecture

How to optimize Caffe* for Intel® Architecture, train deep network models, and deploy networks.
Authored by Andres Rodriguez (Intel) Last updated on 03/11/2019 - 13:17