Deploy Tall Arrays to a Spark Enabled Hadoop Cluster

Create and execute MATLAB® applications with tall arrays against a Spark™ enabled Hadoop® cluster

Supported Platform: Linux® only.

Deploying a MATLAB application that contains tall arrays against a Spark enabled Hadoop cluster consists of two parts :

  • Creating and packaging a standalone application in the MATLAB desktop environment.

  • Executing the standalone application on a Spark enabled Hadoop cluster from a Linux shell.

For a complete example on deploying tall arrays to a Spark enabled Hadoop cluster, see Example on Deploying Tall Arrays to a Spark Enabled Hadoop Cluster. You can follow the same instructions to deploy tall array Spark applications to Cloudera® CDH.

Classes

matlab.mapreduce.DeploySparkMapReducerConfigure a MATLAB tall array application with Spark parameters as key-value pairs

Functions

mapreducerDefine execution environment for mapreduce or tall arrays

Topics

Apache Spark Basics

Learn basic Apache Spark™ concepts and see how these concepts relate to deploying MATLAB applications to Spark.

Examples

Example on Deploying Tall Arrays to a Spark Enabled Hadoop Cluster

Complete example showing how to deploy a tall array MATLAB application to a Spark enabled Hadoop cluster.