Project 3: R2 / LIf -> x86-64
Project due date:
Download the starter files here
This project is inspired by Essentials of Compilation (EoC), but the code here is a from-scratch reimplementation with some simplifications to keep the workload reasonable and to make grading/debugging clearer. Where behavior differs from the book, this README is the source of truth for expectations, file names, and command lines.
You will compile a tiny expression language (R2 / LIf) with
integers, booleans, let
, arithmetic, conditionals, and comparisons
into x86-64 assembly, then produce a real binary and test it on actual
inputs. The power of this language is (roughly) decision diagrams (and
functions on) finite input streams.
This README file contains some specific tips which I hope you will read, including debugging tips and some project-specific instructions. Please read the whole README file and be prepared to discuss it in office hours, email, and class.
Input Language (R2 / LIf)
R2 extends the straight-line arithmetic core with booleans and simple
conditionals. It includes integers, booleans, variables, let
, unary
minus, addition, logical operators, comparisons, and a zero-argument
(read)
primitive that consumes one integer from stdin at runtime.
Before shrinking, the surface language allows and
, or
, not
,
if
, and the comparators eq?
, <
, <=
, >
, >=
. The shrink
pass reduces this set to eq?
and <
, removes binary minus, and
desugars and
/or
.
(define (R2-exp? e)
(match e
[#t #t]
[#f #t]
[(? fixnum? n) #t]
[`(read) #t]
[`(- ,(? R2-exp? e)) #t]
[`(- ,(? R2-exp? e0) ,(? R2-exp? e1)) #t]
[`(+ ,(? R2-exp? e0) ,(? R2-exp? e1)) #t]
[`(and ,(? R2-exp? e0) ,(? R2-exp? e1)) #t]
[`(or ,(? R2-exp? e0) ,(? R2-exp? e1)) #t]
[`(if ,(? R2-exp? e-g) ,(? R2-exp? e-t) ,(? R2-exp? e-f)) #t]
[`(not ,(? R2-exp? e1)) #t]
[`(,(? cmp? c) ,(? R2-exp? e0) ,(? R2-exp? e1)) #t]
[(? symbol? var) #t]
[`(let ([,(? symbol? x) ,(? R2-exp? e)]) ,(? R2-exp? e-body)) #t]
[_ #f]))
(define (R2? e)
(match e
[`(program ,(? R2-exp? exp)) #t]
[_ #f]))
Programs on disk are s-expressions, e.g.:
(program
(let ([a (+ 1 (read))])
(if (<= a 0)
(let ([x (if (< a 2) 3 (+ 1 (- 2)))])
(> x 1))
3)))
Repository Layout
compile.rkt
– Your pass implementations. You will edit functions provided here. -> This is the only file you will edit! The rest are read-onlyirs.rkt
– IR definitions and predicates likeanf-program?
,c1-program?
, etc. (see also typed/shrunk variants)interpreters.rkt
– Reference interpreters for several IRs (used by tests and for your own debugging).system.rkt
– System/ABI configuration, pass names, runtime filenames, output paths, etc.main.rkt
– Driver that runs all passes, can build a binary, and can launch a debug server.test.rkt
– Test harness. Runs isolation tests or end-to-end native tests depending on-m
mode.runtime.c
– Minimal runtime (read_int64
,print_int64
, etc.).test-programs/
– Example programs (.scm
).input-files/
– Input streams for programs (lines of integers).goldens/
– Instructor goldens (IR snapshots, interpreter outputs, and stdout baselines).
Please do not rename files or directories (grading infra depends on them).
The Compiler Pipeline (passes)
The compiler consists of a number of passes. Please do not get overwhelmed: each of these passes is going to be very small, and each has a specific, isolated behavior that will allow us to explain the specific stages of compilation. Additionally, as the code is compiled, it is translated into an increasingly-lower-level IR.
You will implement the following passes:
typecheck
→ assign R2 types (Int
,Bool
) or raise'type-error
Input:R2?
→ Output:R2?
→ Interp:interpret-R2
shrink
→ remove binary-
,and
/or
, and<=, >, >=
(desugar toeq?
/<
)
Input:R2?
→ Output:shrunk-R2?
→ Interp:interpret-R2
uniqueify
→ ensure each bound identifier is written exactly once
Input:shrunk-R2?
→ Output:unique-source-tree?
→ Interp:interpret-R2
anf-convert
→ ANF conversion (introduce temps; maintain booleans/comparisons)
Input:unique-source-tree?
→ Output:anf-program?
→ Interp:interpret-anf
explicate-control
→ ANF → C1 (blocks with(seq (assign …) …)
,if
/goto
)
Input:anf-program?
→ Output:c1-program?
→ Interp:interpret-c1
uncover-locals
→ collect locals for stack layout
Input:c1-program?
→ Output:locals-program?
→ Interp:interpret-c1
select-instructions
→ C1 → pseudo-x86 (vars,cmpq
,set e|l
,jmp-if e|l
)
Input:locals-program?
→ Output:instr-program?
→ Interp:interpret-instr
assign-homes
→ map(var x)
→(deref (reg rbp) n)
Input:instr-program?
→ Output:homes-assigned-program?
→ Interp:interpret-instr
patch-instructions
→ fix illegal memory↔memory moves and related cases
Input:homes-assigned-program?
→ Output:patched-program?
→ Interp:interpret-instr
prelude-and-conclusion
→ prologue, print result, return 0
Input:patched-program?
→ Output:x86-64?
→ Interp:interpret-instr
dump-x86-64
→ render assembly text
Input:x86-64?
→ Output:string?
→ Interp:dummy-interp-x86-64
What you implement: All passes in compile.rkt
are scaffolded.
Your job is to produce outputs that satisfy each predicate in
irs.rkt
and that remain semantically equivalent (the interpreters
check this).
NOTE: Right now (in this version of the project), I testing typechecking rather minimally. Unfortunately that is a known limitation of this project; please do implement typechecking, but know that it is far less important than the other parts (you can simply implement it as a no-op to start).
IR Predicates and Interpreters
-
Predicates (
irs.rkt
): Each pass has a precise predicate (e.g.,anf-program?
,c1-program?
, etc.). Tests check both input and output predicates at every step to catch malformed IR early. You should read and understand each predicate. -
Interpreters (
interpreters.rkt
): For many stages we run the interpreter over your IR to check semantic consistency—different IRs must compute the same result on the same input stream. The harness reports whether outputs match and whether stdout across passes is consistent. Please do not modify this file.
How to Compile and Run
To directly compile a single file using main.rkt
:
racket main.rkt test-programs/<prog>.scm
The compiler dumps a long summary of the work done at each pass, and
ultimately yields a program ./output
which it compiles using
system-specific infrastructure (works on Linux/Mac).
Example:
$ racket main.rkt test-programs/example.scm
Compiling IR (using *your* compile.rkt) …
'(pushq (reg rbp))
...
🏗️ Now building a binary... 🏗️
-> Host: macosx/x86_64 Target: x86_64-apple-darwin Entry: main
/usr/bin/clang -target x86_64-apple-darwin -Wall -O2 -c ./output.s -o ./output.o
/usr/bin/clang -target x86_64-apple-darwin -Wall -O2 -c ./runtime.c -o ./runtime.o
/usr/bin/clang -target x86_64-apple-darwin -Wall -O2 ./output.o ./runtime.o -o ./output
Success! Executable produced at: ./output
Done!
$ ./output
42
Programs that (read)
input will prompt or consume stdin. You can use
input redirection:
./output < input-files/1.in
Debugging Guide
This is a tricky project, and it is really important that you lean into a debugging methodology that works for you. Let me share with you some advice I used when I was writing the compiler myself.
- First, I started by writing a single manual test in the top-level of
compile.rkt
, so that I could easily just sit at the command line (or Dr. Racket) and runcompile.rkt
(with no command-line arguments) and see if the test would pass. At the early stages of debugging, this is an excellent strategy, since it means fully-pushbutton test. For example, I had (appended tocompile.rkt
) something like:
(displayln ;; or pretty-print
(dump-x86-64
(prelude-and-conclusion
(patch-instructions
(assign-homes
(select-instructions
(uncover-locals
(explicate-control
(anf-convert
(uniqueify
(shrink
(typecheck '(program (let ([a (read)])
(let ([b (read)]) (if (or (> a b) (eq? (+ b 1) 0))
(+ a b)
(- b a)))))))))))))))))
-
I started with a fairly large test in my case–since I was merely adding forms not present in my previous implementation. In this case, I would often be hitting match failures–this is a good thing, it allows me to trace down exactly where I need to add match cases and handle new behavior. Of course, the issue is that I also need to mix that with thinking holistically about the specs of each IR.
-
After I thought I had each pass of the compiler working, I started switching over to
main.rkt
, which will run all passes of the compiler and will report their outputs. I also needed to write the interpreters (you do not), and so I debugged some of those usingmain.rkt
. -
Last, once things are acutally working, I used
test.rkt
, which I adjusted to account for the possibility of expected type errors in malformed inputs. -
My advice to you is similar: start with something where you can press “Run” (or continually reinvoke
racket compile.rkt
). It will facilitate rapid testing, and it is really important to build some skill and intuition for how to accomplish that exercise. -
Remember, debugging is a key concept that you are practicing in this class. The worst possible thing you can do is “guess and check” debugging (running the tests and hoping they pass)–the issue is that doing this gives you very little observability into why the code is broken. To fix this, you need to have some way of interacting with the code. Experts debug their code using an iterative, hypothesis-driven methodology, where they (a) articulate a falsifiable hypothesis (“this match pattern never matches anything”), (b) change their code to observe the bug (“add a displayln at every match handler”), and (c)
Tricky Parts of this Project
BE CAREFUL of the following:
-
In the event of a type error, raise
(error (type-error-tag) "Error here...")
-
In the previous implementation, we had only a single block, and
prelude-and-conclusion
sandwiched the generated code between code to set up the function and code to print the value and exit the function. In this case, that will not work so well. Instead, I recommend the following approach: in theselect-instructions
pass, assume the existence of a special block namedconclusion
, which assumes the return value (to be printed) is in%rax
, and jump to that label. Then, inprelude-and-conclusion
, add this label to the program with the right code to print the value of%rax
. -
On Mac OSX, you need to prefix labels for functions, meaning
_main
instead ofmain
. To facilitate this, I provide a function(rt-sym s)
, which converts a symbol to a system-specific variant (based on the OS). However, you only need to worry about this for labels that correspond to exported function entrypoints, not other labels that are internal to the program (e.g., the target of a jump, or evenconclusion
). This is important, because you don’t want to call(rt-sym ...)
until the very last pass (it is ugly to make the previous passes OSX/Linux-specific). The issue is this: if you naively rename every label fromfoo
to_foo
, then you also need to ensure that you clean up jumps so that instead of jumping tofoo
, they go to_foo
. In my implementation, I handled this as follows: I simply used(rt-sym ...)
on the(entry-symbol)
(i.e.,main
), which is the only function in this project. -
Ensure 16-byte stack alignment before
callq
Testing Infrastructure
The file test.rkt
runs a set of formal tests, as in the last
project.
frontend
– runstypecheck
→anf-convert
, checks ANF via interpretermiddleend
– runsexplicate-control
→uncover-locals
, checks viainterpret-c1
backend
– runsselect-instructions
→patch-instructions
, checks viainterpret-instr
native
– full compile → build binary → run with stdin → compare stdout to golden
Usage:
racket test.rkt -m <mode> -i <comma-separated input files> -g <comma-separated goldens> <program.scm>
Example:
racket test.rkt -m native -i ./input-files/1.in -g goldens/example_1_uniqueify.stdout test-programs/example.scm
Goldens: What They Are
We use goldens to verify correctness:
- Isolation goldens – for non-native modes:
- Compare serialized ASTs and interpreter stdout
- Native goldens – for full pipeline:
- Run your compiled binary, compare stdout
Layout:
goldens/<prog>_<n>_<pass>.in-ast
goldens/<prog>_<n>_<pass>.out-ast
goldens/<prog>_<n>_<pass>.interp
goldens/<prog>_<n>_<pass>.stdout
Instructor-only gengoldens
mode regenerates these.
FAQ
-
Do I need register allocation?
No. Stack-based only. -
Can I write helpers?
Yes—keep pass signatures unchanged. -
Can I call arbitrary C functions?
No. Only useread_int64
/print_int64
. -
How are inputs fed?
Via stdin;.in
files are whitespace-separated integers.
Autograder Tests
The autograder invokes test.rkt
in JSON mode. Your job is to make
each pass satisfy its predicates and produce semantically equivalent
IRs until final native code passes all tests.