This series of Spark study notes takes a deep dive into the Apache Spark source code. The topics are advanced, so readers should already be familiar with the Spark APIs. Topics include:
- Execution Model and Job Scheduling
- Shuffle
- Spark SQL
- Dataset
- Catalyst Optimizer
- Spark Streaming
- and so on