Apache Spark

Course Length: 3 days
  1. Apache Spark Introduction
    1. Why Adopt Spark
    2. Benefits of Spark
    3. Spark Architecture
    4. Spark Datasources
  2. Spark Components
    1. Driver
    2. Workers
    3. Stages
    4. Tasks
    5. Partitions
  3. Spark Processing
    1. Transformations
    2. Actions
  4. Cluster Managers
    1. In House Spark Cluster Manager
    2. Mesos
    3. YARN
    4. Kubernetes
  5. Spark Shell
  6. Spark UI
  7. Running a Spark Job
  8. Value Types
  9. Programming Spark
    1. Enough Scala to get you through the day
    2. DataFrame
    3. DataSet
    4. Spark SQL
    5. RDD
  10. Spark Streaming
    1. Structured
    2. Unstructured
  11. GraphX
  12. Spark MLLib