Big Data, Predictive Analytics and Deep Learning with Apache Spark

Chris Teplovs, Ph.D.

Introductions

Chris Teplovs
Sonal Doomra
Ping Hou
Anna Lenhart

Workshop overview

Day 1:
Focus on data
Introductions to each other, the workshop, Big Data, Spark and Databricks
Day 2:
Focus on techniques
Clustering, classification and analytic pipelines
Day 3:
Focus on the future
Deep Learning, Neural networks, and project presentations

Day 1

SegmentTopic
1.1Workshop overview and Introductions
1.2Introduction to Databricks
1.3Hands-On: Databricks
1.4Intro to Spark & DataFrames
1.5Hands-On: DataFrames
1.6Big Data Sets
1.7Hands-On: Exploring Data

Day 2

SegmentTopic
2.1Clustering Overview
2.2k-Means and Bisecting k-Means
2.3Hands-On: Clustering
2.4Classification Overview
2.5Hands-On: Classification
2.6Model Evaluation and Tuning
2.7Hands-On: Evaluation and Tuning

Day 3

SegmentTopic
3.1Neural Nets and Deep Learning
3.2Keras and Tensorflow
3.3Hands-On: Deep Learning for image data
3.4Hands-On: Deep Learning on other data
3.5Hands-On: Data analysis
3.6Group Presentations
Day 1:
Focus on data
Introductions to each other, the workshop, Big Data, Spark and Databricks
Day 2:
Focus on techniques
Clustering, classification and analytic pipelines
Day 3:
Focus on the future
Deep Learning, Neural networks, and project presentations

(links above)