Introduction
Data Formats and Performance (Jupyter Notebooks)
Ranked Retrieval (Example)
SML Lecture (Jupyter Notebooks)
Distributed File Systems: HDFS, Architecture, NameNode, DataNode, Block Storage, Replication, Fault Tolerance
MapReduce Lecture
Apache Spark
Spark SQL
Dimensionality Reduction & Clustering (Python Example)
Graph Processing: Graph Models, PageRank Algorithm, GraphX Fundamentals
Stream Processing: Spark Streaming and Event Time Semantics
CPU and GPU: PyTorch Distributed