In this Github repository, I'm documenting my journey to write a self-compiling compiler. I'm also writing out the details so that, if you want to follow along, there will be an explanation of what I did, why, and with some references back to the theory of compilers.
But not too much theory, I want this to be a practical journey.
Here are the steps I've taken so far:
- Part 0: Introduction to the Journey
- Part 1: Introduction to Lexical Scanning
- Part 2: Introduction to Parsing
- Part 3: Operator Precedence
- Part 4: An Actual Compiler
- Part 5: Statements
- Part 6: Variables
- Part 7: Comparison Operators
- Part 8: If Statements
- Part 9: While Loops
- Part 10: For Loops
- Part 11: Functions, part 1
- Part 12: Types, part 1
- Part 13: Functions, part 2
- Part 14: Generating ARM Assembly Code
- Part 15: Pointers, part 1
- Part 16: Declaring Global Variables Properly
- Part 17: Better Type Checking and Pointer Offsets
- Part 18: Lvalues and Rvalues Revisited
- Part 19: Arrays, part 1
- Part 20: Character and String Literals
- Part 21: More Operators
- Part 22: Design Ideas for Local Variables and Function Calls
- Part 23: Local Variables
- Part 24: Function Parameters
- Part 25: Function Calls and Arguments
- Part 26: Function Prototypes
- Part 27: Regression Testing and a Nice Surprise
There isn't a schedule or timeline for the future parts, so just keep checking back here to see if I've written any more.
I have borrowed some of the code, and lots of ideas, from the SubC compiler written by Nils M Holm. His code is in the public domain. I think that my code is substantially different enough that I can apply a different license to my code.
Unless otherwise noted,
- all source code and scripts are (c) Warren Toomey under the GPL3 license.
- all non-source code documents (e.g. English documents, image files) are (c) Warren Toomey under the Creative Commons BY-NC-SA 4.0 license.