Introduction
Data Formats and Performance (Jupyter Notebooks)
Ranked Retrieval (Example)
SML Lecture (Jupyter Notebooks)
Distributed File Systems: HDFS, Architecture, NameNode, DataNode, Block Storage, Replication, Fault Tolerance
MapReduce Lecture
Apache Spark
Spark SQL