Big Data, Predictive Analytics and Deep Learning with Apache Spark

Chris Teplovs, Ph.D.

Day 3

Workshop overview

Day 1:
Focus on data
Introductions to each other, the workshop, Big Data, Spark and Databricks
Day 2:
Focus on techniques
Clustering, classification and analytic pipelines
Day 3:
Focus on the future
Deep Learning, Neural networks, and project presentations

Day 1

SegmentTopic
1.1Workshop overview and Introductions
1.2Introduction to Databricks
1.3Hands-On: Databricks
1.4Intro to Spark & DataFrames
1.5Hands-On: DataFrames
1.6Big Data Sets
1.7Hands-On: Exploring Data

Day 2 (yesterday)

SegmentTopic
2.1Clustering Overview
2.2k-Means and Bisecting k-Means
2.3Hands-On: Clustering
2.4Classification Overview
2.5Hands-On: Classification
2.6Model Evaluation and Tuning
2.7Hands-On: Evaluation and Tuning

Day 3 (today)

SegmentTopic
3.1Intro to Dimensionality Reduction
3.1Hands-on: PCA
3.3Intro to Neural Nets
3.4Hands-On: Neural Net Pipelines
3.5Hands-On time for projects
3.6Project Presentations

Neural Networks

http://neuralnetworksanddeeplearning.com/chap1.html

Workshop Overview

Day 1

Day 2

Day 3