CMSC 22610
Implementation of Computer Languages - I
Winter 2015

General Information

Instructor:    John Reppy Ry 256
TA: Charisee Chiw   
Lecture: TR 10:30-11:50 Ry 251

Overview

A vast majority of computer programs must deal with textual input of some form or another. This input can range from simple configuration languages to data description languages (e.g., XML) to scripting languages to full-blown programming languages. In this course, we cover the tools and techniques used to process the full range of computer languages (i.e., languages that specify programs and data on computers). Topics include scanning and parsing, tree representations of structured input, simple typechecking, translation between intermediate forms, interpretation, simple code generation, and some run-time system issues. There will be homework assignments and programming projects. We will use Standard ML (SML) for the majority of the programming projects. Students should have taken CMSC 15400 (Introduction to Computer Systems).

This course is the first in a sequence of two courses that cover the implementation of computer languages. The second course (CMSC 22620) covers more advanced issues related specifically to the translation of general purpose programming languages, including optimization and code generation.

Text books

The main text for the course is

Title: Modern Compiler Implementation in ML
Author: Andrew Appel
Publisher:   Cambridge University Press, 1998
Errata: www.cs.princeton.edu/~appel/modern/ml/errata.html

The programming assignments will be written using the SML programming language. The following book is one of the better introductions to SML programming.

Title: ML for the Working Programmer (2nd Edition)
Authors: L.C. Paulson
Publisher:   Cambridge University Press, 1996

The following reference may also be useful:

Title: The Standard ML Library
Editors: Emden Gansner and John Reppy
Publisher:   Cambridge University Press, 2004

Grading

Grading for the course will be based on the following scheme:

Percentage Component
20% Homework assignments
30% Exam
40% Projects

Class participation will also be a factor in the assignment of grades.

Homework assignments

Written homework is due at the beginning of class and late homework will not be accepted for credit.
Date Assignment Due date
Jan. 6 Homework 1 Jan. 13
Jan. 20 Homework 2 Jan. 27
Feb. 3 Homework 3 Feb. 10
Feb. 12 Homework 4 Feb. 19
March 1 Homework 5 March 5

Course project

The course project is to implement a higher-order functional language called Flang. The project will be divided into four milestones with the following tentative due dates:
Jan. 23:Project 1 — Lexer
Feb. 6:Project 2 — Parser
Feb. 27:Project 3 — Typechecker
Mar. 16:Project 4 — Code generation and runtime

Project handouts

Project handouts are available online from the following links:
Project overview [Revised 2015-02-05]
Project 1 — Flang Lexer [Revised 2015-01-14]
Project 2 — Flang Parser [Revised 2015-02-05]
Project 3 — Flang Typechecker [Revised 2015-02-10]
Project 4 — Flang bytecode generator [Revised 2015-03-16]
The Virtual Machine Reference [Revised 2015-03-11]

Additional project resources

Sample programs with parse-tree and AST output (examples.tgz)
Virtual Machine source code (version 2.2) (vm.tgz)

Resources

You will find the following software and documentation useful for programming the projects:

Academic Honesty

[The following is owed to Stuart Kurtz]

The University of Chicago is a scholarly academic community. You need to both understand and internalize the ethics of our community. A good place to start is with the Cadet's Honor Code of the US Military Academy: "A Cadet will not lie, cheat, or steal, or tolerate those who do." It is important to understand that the notion of property that matters most to academics is ideas, and that to pass someone else's ideas off as your own is to lie, cheat, and steal.

The University has a formal policy on Academic Honesty, which is somewhat more verbose than West Point's. Even so, you should read and understand it.

We believe that student interactions are an important and useful means to mastery of the material. We recommend that you discuss the material in this class with other students, and that includes the homework assignments. So what is the boundary between acceptable collaboration and academic misconduct? First, while it is acceptable to discuss homework, it is not acceptable to turn in someone else's work as your own. When the time comes to write down your answer, you should write it down yourself from your own memory. Moreover, you should cite any material discussions, or written sources, e.g.,

Note: I discussed this exercise with Jane Smith.

The University's policy, for its relative length, says less than it should regarding the culpability of those who know of misconduct by others, but do not report it. An all too common case has been where one student has decided to "help" another student by giving them a copy of their assignment, only to have that other student copy it and turn it in. In such cases, we view both students as culpable and pursue disciplinary sanctions against both.

For the student collaborations, it can be a slippery slope that leads from sanctioned collaboration to outright misconduct. But for all the slipperyness, there is a clear line: present only your ideas as yours and attribute all others.

If you have any questions about what is or is not proper academic conduct, please ask your instructors.


Last revised: March 16, 2015