Using Types to Model Problems

In the last chapter we used types to annotate individual values: a parameter was a number, a function returned a string, and the compiler checked that we used them consistently. These primitive types are enough when a program passes around single, unrelated values, but real information rarely arrives one value at a time.

Consider a song. A song is not one value; it has its musical contents, as well as associated metadata.

For instance, a song has at least a title, an artist, and a duration. This data only means something when it is associated with the same song. With only primitive types we would carry these as three separate values and have to remember, everywhere, that they belong together. Nothing would stop us from pairing one song's title with another's duration, forgetting the duration entirely, or passing an artist where a title was expected (both are strings, so the compiler would stay silent). The information has a shape, and primitive annotations cannot capture it.

Other information cannot be expressed with primitives at all. A playlist is either empty or a song followed by another playlist. This spells out two distinct cases, and the playlist can be any length. No single number or string means "either nothing, or a song and then more songs."

This chapter introduces the tools to describe information like this: compound types that group related values into one, model alternatives as distinct cases, and capture self-referential structure. Writing such a description down as a data definition does two things at once: it gives the program a shape to follow, and it lets the compiler hold us to that shape, catching whole classes of mistakes before the program runs.

This is the data-definition design you practised in CPSC 110, now written directly in the language and checked by the compiler.

Assigning Values to Names

Before we build values of any type, we need a way to name them. In TypeScript we assign a value to a name with const: the name comes first, then its type, then =, then the value.

typescript

const courseName: string = "CPSC 210";
const credits: number = 4;

A name introduced with const cannot be reassigned to a different value later: courseName will always refer to that one string. Every value in this chapter is named with const; names whose values are meant to change come later, when we look at mutation.

Declaring Variables with const

The syntax

typescript

const x: T = e

declares a variable x of type T and initializes it to the value that expression e evaluates to. Variables declared with const cannot be reassigned to different values later. Also, you cannot use const to declare the same variable multiple times.

As with other one-line statements, we will put a semicolon ; after it when writing it in programs.
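As a small sketch of these rules, the example below declares three names with const and shows, in a comment, the reassignment the compiler would reject. The names sectionCount, seatsPerSection, and totalSeats are hypothetical and chosen only for illustration.

```typescript
// Hypothetical names for illustration; not part of the course code.
const sectionCount: number = 3;
const seatsPerSection: number = 40;

// The initializer can be any expression of the declared type.
const totalSeats: number = sectionCount * seatsPerSection;

// Uncommenting the next line is a static error, because a name
// declared with const cannot be reassigned:
// totalSeats = 150;
```

Here totalSeats is initialized to the value of the expression on the right, which is 120.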

Two values are worth knowing from the start because they stand for the absence of a value: null and undefined. null represents a deliberate "no value here", such as the result of a lookup that finds nothing. undefined is the value a name has when nothing has been assigned to it yet. Each is its own type, and both become useful in combination with other types, as we will see when a function may or may not find a result.

typescript

const noMatch: null = null;
const notSet: undefined = undefined;
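To hint at what "in combination with other types" can look like, here is a minimal sketch. It uses the | ("or") notation that this chapter introduces later with unions, and the names nowPlaying and upNext are hypothetical.

```typescript
// Hypothetical names for illustration.
// Read `string | null` as "either a string or null".
const nowPlaying: string | null = "Song A";

// A deliberate "no value here": nothing is queued after the current song.
const upNext: string | null = null;
```

Both names have the same declared type, but only one of them currently holds a string.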

const vs define

Where in ISL you wrote

racket

(define course-name "CPSC 210")

to bind a name to a value, in TypeScript we would write the same binding as

typescript

const courseName: string = "CPSC 210";

Note that in TypeScript, we add a type annotation that the compiler checks. Adding this extra information is marginally more work, but it allows the compiler to catch basic bugs for us. For instance:

typescript

// static error: Type 'string' is not assignable to type 'number'
const courseNum: number = "210";

Modelling Information as Data

A data definition is a precise description of which values a type can express. As you design and interact with more software systems, you may grow to have your own process to derive these.

To get you started in this course, we propose a systematic process to turn a natural-language description of a problem into a type. The main steps are:

1. Identify the main entities.
2. Identify any distinct cases.
3. Determine what information each case needs.
4. Translate into a TypeScript type.
5. Write concrete examples to check your model.
6. Look for generalisation.

The rest of this chapter works through this process on the examples below, from the simplest to the most involved. As we go we will meet the building blocks TypeScript provides for specifying types: primitive values for atomic facts, unions of literals for fixed choices, object types that group related values together, and self-reference for recursive structure.

Example: Traffic Lights

Consider the natural language description of traffic light data:

As a driver, I want the intersection's signal to be exactly one of red, yellow, or green, so that I always know whether to stop, slow down, or go.

Let's apply our systematic process. One design is as follows:

Entities: the signal at an intersection.
Cases: it shows one of three colours: red, yellow, or green.
Information per case: none; a colour is a bare label that carries nothing beyond itself.
Translate: a value that is one of a fixed set of labels is exactly a union of string literals.
Concrete examples: one valid colour, plus an invalid one to confirm the type is enforced.
Generalisation: none; a small enumeration stands on its own.

In this case, in step 4, we translate the data definition into the following TypeScript type:

typescript

type TrafficLight = "red" | "green" | "yellow";

For step 5, our concrete examples could be:

typescript

const light: TrafficLight = "red";   // ok
const broken: TrafficLight = "blue"; // error: "blue" is not a TrafficLight

Union of Literals

A union of literals expresses that a variable of that type can take on exactly the specified literal values. The following, where | is read as "or":

typescript

type TypeName = v_1 | v_2 | v_3;

expresses that values of type TypeName can take on exactly the values v_1, or v_2, or v_3. There can be as many literal values v_i as you want.

Above we used strings, but numbers work as literals too, so the same idea models any fixed set of values:

typescript

type HttpStatus = 200 | 301 | 404 | 500;

Example: Shuffle Modes

As a listener, I want to set playback to one of off, on, or repeat-one, so that I can control how my music is ordered.

Entities: the shuffle mode.
Cases: off, on, or repeat-one.
Information per case: none; each case is fully described by its label.
Translate: again, we can use a union of literals:

typescript

type ShuffleMode = "off" | "on" | "repeat-one";

Concrete examples: again, we will have one correct and one incorrect mode:

typescript

const mode: ShuffleMode = "on"; // ok
const mode2: ShuffleMode = "repeat-album"; // error: "repeat-album" is not a ShuffleMode

Generalisation: nothing to generalise; all possible cases are expressed.

Example: Songs

Let's move on to applying our systematic process to the song example we started with:

As a listener, I want each song to carry its title, artist, and length, so that I can see what is playing and how long it will last.

Entities: the only entity here is a song.
Cases: A song has just one case: every song has the same shape, so there are no alternatives to distinguish.
Information per Case: for the natural language description above, what is relevant is that a song carries three facts: a title, an artist, and a duration in seconds.
Translate: A song's facts belong together, so we describe their shape with a type, which lists named properties and their types. It helps to keep two words apart: a type describes a shape, but it is not itself a value. Song is the shape.

typescript

type Song = {
  title: string;
  artist: string;
  durationSeconds: number; // must be positive
};

The type cannot express that a duration must be positive, so we record that constraint in a comment and rely on tests to enforce it.

Grouping Values Together with Object Types

To express a type that groups multiple pieces of data together, we use object type syntax. In particular, the following:

typescript

type TypeName = {
  prop_1: Type1;
  prop_2: Type2;
  prop_3: Type3; 
};

declares a type TypeName that has three pieces of data. Each piece of data has a name (prop_x above) and a type (TypeX above).

Concrete Examples: An actual song is an object: a value that has that shape, an instance of the type. We create an object by writing an object literal. Below, song1 and song2 are two separate songs that share the Song type.

typescript

const song1: Song = {
  title: "Song A",
  artist: "Artist 1",
  durationSeconds: 200
};

const song2: Song = {
  title: "Song B",
  artist: "Artist 2",
  durationSeconds: 180
};

An object is an instance of its type, and each object is its own independent value.

Creating Object Values with Object Literals

The syntax

typescript

const v: TypeName = {
  prop_1: <expression-1>,
  prop_2: <expression-2>,
  prop_3: <expression-3>
};

defines a value v of type TypeName, assigning each prop_x the value obtained by evaluating <expression-x>. There can be any number of property-expression pairs, but they must match the type.

The TypeScript type checker will check that: (1) each prop_x is defined in TypeName's definition, (2) each <expression-x> is of the type that prop_x is declared to have in TypeName's definition, and (3) no property that TypeName declares is left out.

Note a syntax difference between object values and object types: property definitions in object types are separated with semicolons, while they are separated with commas in object values.

Generalisation: A song is a single fixed shape, so there is nothing to generalise.

Reading an Object's Properties

Creating an object stores its data; reading that data back out uses dot notation. To do this, write the object's name, followed by ., followed by a property name. This evaluates to the value held under that property:

typescript

song1.title;           // evaluates to "Song A"
song1.durationSeconds; // evaluates to 200

The property name is part of the program text, not a string or a variable, and it is checked against the object's type: song1.length does not compile, because Song declares no length. This is the guarantee the type gave us when building the object, now applied to taking it apart.

When a property holds another object, a second . reads a property of that result, so accesses chain from left to right. We rely on this in the next example, where a playlist's first property holds a Song and the song's title is read with playlist.first.title.
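Dot notation also gives a test a way to check the constraint that the Song type records only in a comment. The following is a minimal sketch, assuming a hypothetical helper named hasValidDuration that is not part of the course code; the two example songs are also made up for illustration.

```typescript
type Song = {
  title: string;
  artist: string;
  durationSeconds: number; // must be positive
};

// Hypothetical helper: reports whether a song satisfies the
// "must be positive" constraint that the type itself cannot express.
function hasValidDuration(song: Song): boolean {
  return song.durationSeconds > 0;
}

const okSong: Song = { title: "Song A", artist: "Artist 1", durationSeconds: 200 };

// This value type-checks even though it breaks the documented constraint.
const zeroLength: Song = { title: "Song Z", artist: "Artist 9", durationSeconds: 0 };
```

Here hasValidDuration(okSong) evaluates to true and hasValidDuration(zeroLength) evaluates to false, which is the kind of fact a test can assert where the compiler cannot.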

Reading Properties with Dot Notation

A property is read by naming it after a dot:

typescript

<object>.<propertyName>

The expression evaluates to the value stored under <propertyName>. If that value is itself an object, a further property is read from it in the same way, evaluated left to right:

typescript

<object>.<propertyName>.<propertyName>

The name after each dot is fixed in the source and checked against the type of the value on its left, so naming a property the type does not declare is a compile-time error rather than a value that is absent at run time.
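As an illustration of chained access, the sketch below nests a Song inside another object. The NowPlaying type and the names current, currentTitle, and remaining are assumptions made for this example only.

```typescript
type Song = { title: string; artist: string; durationSeconds: number };

// Hypothetical type: an object whose `song` property holds another object.
type NowPlaying = {
  song: Song;
  positionSeconds: number;
};

const current: NowPlaying = {
  song: { title: "Song A", artist: "Artist 1", durationSeconds: 200 },
  positionSeconds: 45
};

// Evaluated left to right: current.song is a Song, then .title reads from it.
const currentTitle: string = current.song.title;

// Dot expressions can appear anywhere a value of that type is expected.
const remaining: number = current.song.durationSeconds - current.positionSeconds;

// Uncommenting the next line is a static error: Song declares no `length`.
// current.song.length;
```

With these values, currentTitle is "Song A" and remaining is 155.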

Example: Playlists

This example builds on the Song type from above:

As a listener, I want to build an ordered list of songs of any length, so that I can queue up exactly the music I want to hear.

Entities: The nouns give us a playlist, built from the song we just modelled.
Cases: A playlist has two distinct cases: it is empty or non-empty.
Information per Case: The empty case needs no information; knowing that it is empty is the whole story. The non-empty case needs two things: its first song, and the rest of the playlist after that song. That last piece, the rest, is itself a playlist, so this definition is recursive.
Translate: A playlist has cases, so we model it as a tagged union: a union of one type per case, where each case carries a shared discriminator property (here kind) set to a different constant. Checking the discriminator tells both us and the compiler which case we are in, and therefore which properties are available.

typescript

type Playlist = EmptyPlaylist | NonEmptyPlaylist;

type EmptyPlaylist = {
  kind: "empty";
};

type NonEmptyPlaylist = {
  kind: "songs";
  first: Song;
  rest: Playlist;
};

EmptyPlaylist carries no song data; NonEmptyPlaylist carries the first Song and the rest of the playlist. The rest property has type Playlist again, and that self-reference is what lets one type describe a playlist of any length.

The self-reference makes a playlist a chain: each songs node holds one Song and points at the rest, until the chain ends in empty.

Figure 02.01: Visual representation of the Playlist data structure, containing Song A and Song B.

Tagged Unions

Previously we saw unions of literals. Tagged unions have similar syntax, but bind together various type names, rather than literal values:

typescript

type UnionType = Type1 | Type2 | Type3;

Each TypeX must have a definition that includes the property kind:

typescript

type Type1 = {
  kind: v_1;
  prop_1: T1;
  // ... as many properties as you like
};

kind should be set to a specific literal value v_1, while the other properties should be given types. The kind is the "tag" in tagged union.

To relate to a prior concept, you can understand the type of the kind property of any value of UnionType to be a union of literals. However, we know more than that: we know that kind is a specific one of those literals for each option in the tagged union.
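The relationship between kind and a union of literals can be sketched directly. The function tagOf and the example values below are hypothetical; the types repeat the Playlist definition so that the block stands on its own.

```typescript
type Song = { title: string; artist: string; durationSeconds: number };
type EmptyPlaylist = { kind: "empty" };
type NonEmptyPlaylist = { kind: "songs"; first: Song; rest: Playlist };
type Playlist = EmptyPlaylist | NonEmptyPlaylist;

// Hypothetical function: for an arbitrary Playlist, the type of `kind`
// is the union of the tags of all the cases.
function tagOf(p: Playlist): "empty" | "songs" {
  return p.kind;
}

// For one specific case, `kind` is exactly one literal.
const emptyCase: EmptyPlaylist = { kind: "empty" };
const emptyTag: "empty" = emptyCase.kind;

const oneSong: Playlist = {
  kind: "songs",
  first: { title: "Song A", artist: "Artist 1", durationSeconds: 200 },
  rest: emptyCase
};
```

Changing the return type of tagOf to just "empty" would be rejected, since a Playlist might be in the "songs" case.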

Concrete Examples: With the type written, we build concrete examples from the songs we already have. If they are easy to construct, the design fits; if they are awkward, the model is probably too complicated. These examples also become the data our tests run against later.

typescript

const empty: Playlist = { kind: "empty" };

const oneTrack: Playlist = {
  kind: "songs",
  first: song1,
  rest: empty
};

const twoTracks: Playlist = {
  kind: "songs",
  first: song1,
  rest: { kind: "songs", first: song2, rest: empty }
};

Because an object is a value like any other, oneTrack and twoTracks reuse the empty object and the songs we already named rather than building fresh ones; only the new inner node in twoTracks has to be written out.

Generalisation: A playlist is one instance of a more general shape: a list of any element type. If a program needed lists of several different things, we would write that shape once and let it take the element type as a type parameter, written in angle brackets. A type parameter lets one definition serve many content types:

typescript

type LinkedList<T> =
  | { kind: "empty" }
  | { kind: "node"; head: T; tail: LinkedList<T> };

A playlist would then be a LinkedList<Song> and a leaderboard a LinkedList<number>. We keep the concrete Playlist from above so its kind labels stay readable, but it describes values with exactly the same structure.
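To make the leaderboard case concrete, here is a minimal sketch that uses the same definition with two different element types. The scores and the names leaderboard and artistNames are invented for illustration.

```typescript
type LinkedList<T> =
  | { kind: "empty" }
  | { kind: "node"; head: T; tail: LinkedList<T> };

// Hypothetical leaderboard of scores, highest first.
const leaderboard: LinkedList<number> = {
  kind: "node",
  head: 9800,
  tail: { kind: "node", head: 7200, tail: { kind: "empty" } }
};

// The same definition, now with string elements.
const artistNames: LinkedList<string> = {
  kind: "node",
  head: "Artist 1",
  tail: { kind: "empty" }
};
```

Only the type argument in angle brackets changes between the two declarations; the list structure is written once.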

Generic Types

In a type definition, type TypeName<T,S,R> = ..., the names in angle brackets (i.e., T, S, R) are type variables. While regular program variables take on concrete values, type variables take on types. They can then be used in the definition of TypeName as stand-ins for particular types. A type definition can have any number of type variables (LinkedList above has only one).

We call TypeName<T,S,R> a generic type when it has any type variable in its definition.

Note that elsewhere we have been using angle brackets to mark places where code can be filled in with various syntactic constructs (for example, <expression> stands for any kind of expression, such as 3, 3 + 2, or foo(3)). In generics, the angle brackets are concrete, required syntax.

For the LinkedList example above, the compiler will ensure we are correctly populating the list based on its type:

typescript

// valid song list
const playlist: LinkedList<Song> = {
  kind: "node",
  head: song1,
  tail: { kind: "node", head: song2, tail: { kind: "empty" } }
};

// invalid song list; the second 'song' is only a song title
const badList: LinkedList<Song> = {
  kind: "node",
  head: song1,
  tail: { kind: "node", head: "song title", tail: { kind: "empty" } }
};

The compiler's error for badList points at the exact property that violates the type parameter:

Type 'string' is not assignable to type 'Song'.

Use generics only when you see real duplication in your code; until then they add abstraction without benefit.

Functions Follow Data Shapes

With the data defined, writing functions over it is far less open-ended than it first appears, because the structure of the code will mirror the structure of the data.

The data definition provides a template: if the data has distinct cases, the function branches on the case; if the data is recursive, the function is recursive. This is why the modelling work pays off, as a precise data definition has already done much of the design of the functions that consume it.

Branching on the Case

When data has multiple cases, a function analyses which case it has and responds to each. We do this with a compound if/else chain: one branch per case, testing the value itself for a union of literals, and the value of the discriminator for a tagged union.

An if/else chain over a union of literals has one branch per value:

typescript

function action(light: TrafficLight): string {
  if (light === "red") {
    return "stop";
  } else if (light === "yellow") {
    return "slow down";
  } else {
    return "go";
  }
}
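The same pattern applies to any union of literals. As a sketch, here it is for the ShuffleMode type from earlier; the function describeMode and its return strings are hypothetical.

```typescript
type ShuffleMode = "off" | "on" | "repeat-one";

// Hypothetical function for illustration: one branch per ShuffleMode value.
function describeMode(mode: ShuffleMode): string {
  if (mode === "off") {
    return "play in order";
  } else if (mode === "on") {
    return "play in random order";
  } else {
    // The first two values have been ruled out, so mode is "repeat-one" here.
    return "repeat the current song";
  }
}
```

As with action, the final else needs no test because only one value remains once the other two have been checked.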

The comparisons above use === to test a value against each literal. Because this is the first time we compare values, it is worth being precise about what === means.

Evaluating Equality with ===

There are several ways to evaluate equality with differing amounts of rigour in TypeScript. We will always use === (often called triple equals) in CPSC 210. Using this operator ensures that two values are strictly equal. Here are some examples.

typescript

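// Typed as unknown so the compiler permits the mixed-type comparisons below.
const one: unknown = 1;
const yes: unknown = true;

checkExpect(1 === 1, true);
checkExpect(true === true, true);
checkExpect("cpsc210" === "cpsc210", true);
checkExpect(one === "1", false);            // number 1 compared to string "1"
checkExpect(yes === "true", false);         // boolean true compared to string "true"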

We do this because when we want a 2, we almost always want the number 2 and not the string "2"; if we had wanted the string, we would have written "2".

Some examples of why non-strict equality (==) can be confusing are shown below. The compiler does not always flag these surprising results statically; they can surface only when you run the program. For this reason, always use === in this course.

typescript

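// Typed as unknown so the compiler permits the mixed-type comparisons below.
const looseOne: unknown = 1;
const looseTrue: unknown = true;
checkExpect(1 == 1, true);                  // as expected
checkExpect(looseOne == "1", true);         // number 1 is considered the same as string "1"
checkExpect(true == true, true);            // as expected
checkExpect(looseTrue == 1, true);          // true is considered the same as the number 1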

The same idea holds for a tagged union, but we branch on kind. After the check, the matching case's properties are available and read with dot notation, so p.first.title selects the first song, then its title:

typescript

function firstTitle(p: Playlist): string | null {
  if (p.kind === "empty") {
    return null;
  } else {
    return p.first.title; // p.first is known to exist here
  }
}

Checking the discriminator also unlocks the case's data. This is called type narrowing: once you have tested that p.kind === "songs", the compiler knows that p.first and p.rest exist and lets you use them, while preventing you from accessing properties the other case does not have. For instance, the following code would not pass the type checker:

typescript

function firstTitle(p: Playlist): string {
  if (p.kind === "empty") {
    // Error: Property 'first' does not exist on type 'EmptyPlaylist'
    return p.first.title; 
  } else {
    return p.first.title; 
  }
}
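Narrowing is not limited to the kind property. A caller of the correct firstTitle receives a string | null and has to rule out null before treating the result as a string. The sketch below assumes a hypothetical caller named upFirstLabel and made-up example playlists; the types and firstTitle are repeated so the block is self-contained.

```typescript
type Song = { title: string; artist: string; durationSeconds: number };
type EmptyPlaylist = { kind: "empty" };
type NonEmptyPlaylist = { kind: "songs"; first: Song; rest: Playlist };
type Playlist = EmptyPlaylist | NonEmptyPlaylist;

function firstTitle(p: Playlist): string | null {
  if (p.kind === "empty") {
    return null;
  } else {
    return p.first.title;
  }
}

// Hypothetical caller for illustration.
function upFirstLabel(p: Playlist): string {
  const result: string | null = firstTitle(p);
  if (result === null) {
    return "Nothing queued";
  } else {
    return "Up first: " + result; // result is known to be a string here
  }
}

const nothing: Playlist = { kind: "empty" };
const single: Playlist = {
  kind: "songs",
  first: { title: "Song A", artist: "Artist 1", durationSeconds: 200 },
  rest: nothing
};
```

With these values, upFirstLabel(nothing) evaluates to "Nothing queued" and upFirstLabel(single) evaluates to "Up first: Song A".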

Recurring over the Structure

A recursive data definition leads to a recursive function. The function handles the base case directly (an empty playlist) and the recursive case by combining the first element with the result of calling itself on the rest. Because every value ends in the empty case, the recursion is guaranteed to terminate.

The same template solves a whole family of problems: counting elements, accumulating a total, and building a new structure all share the shape "handle empty, otherwise combine first with the recursion on rest."

Here are two functions that count and accumulate over a playlist:

typescript

function countSongs(p: Playlist): number {
  if (p.kind === "empty") {
    return 0;                       // base case
  } else {
    return 1 + countSongs(p.rest);  // recursive case
  }
}

function totalDuration(p: Playlist): number {
  if (p.kind === "empty") {
    return 0;
  } else {
    return p.first.durationSeconds + totalDuration(p.rest);
  }
}

Building a new playlist from an old one, here keeping only the longer songs:

typescript

function keepLongSongs(p: Playlist, minSeconds: number): Playlist {
  if (p.kind === "empty") {
    return { kind: "empty" };
  } else if (p.first.durationSeconds >= minSeconds) {
    return { kind: "songs", first: p.first, rest: keepLongSongs(p.rest, minSeconds) };
  } else {
    return keepLongSongs(p.rest, minSeconds);
  }
}
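To see the three branches of keepLongSongs at work, here is a small worked sketch. The names songA, songB, done, queue, longOnly, and noneLeft are hypothetical; the types and the function are repeated from the text so the block runs on its own.

```typescript
type Song = { title: string; artist: string; durationSeconds: number };
type EmptyPlaylist = { kind: "empty" };
type NonEmptyPlaylist = { kind: "songs"; first: Song; rest: Playlist };
type Playlist = EmptyPlaylist | NonEmptyPlaylist;

function keepLongSongs(p: Playlist, minSeconds: number): Playlist {
  if (p.kind === "empty") {
    return { kind: "empty" };
  } else if (p.first.durationSeconds >= minSeconds) {
    return { kind: "songs", first: p.first, rest: keepLongSongs(p.rest, minSeconds) };
  } else {
    return keepLongSongs(p.rest, minSeconds);
  }
}

const songA: Song = { title: "Song A", artist: "Artist 1", durationSeconds: 200 };
const songB: Song = { title: "Song B", artist: "Artist 2", durationSeconds: 180 };
const done: Playlist = { kind: "empty" };
const queue: Playlist = {
  kind: "songs",
  first: songA,
  rest: { kind: "songs", first: songB, rest: done }
};

// Song A (200 s) meets the 190-second threshold and is kept;
// Song B (180 s) does not and is skipped; the empty case ends the result.
const longOnly: Playlist = keepLongSongs(queue, 190);

// A threshold above every duration skips every song.
const noneLeft: Playlist = keepLongSongs(queue, 500);
```

The original queue is not changed by either call; each call builds a new playlist out of the songs that pass the test.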

The shape is not unique to lists. A tree branches into two recursive calls instead of one:

typescript

type BinaryTree = Leaf | Branch;
type Leaf = { kind: "leaf"; value: number };
type Branch = { kind: "branch"; left: BinaryTree; right: BinaryTree };

function sum(tree: BinaryTree): number {
  if (tree.kind === "leaf") {
    return tree.value;
  } else {
    return sum(tree.left) + sum(tree.right);
  }
}
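As a sketch of how the tree template is reused, the block below builds one example tree and adds a second function with the same two-branch shape. The function countLeaves and the value exampleTree are hypothetical additions; the types and sum are repeated from the text.

```typescript
type BinaryTree = Leaf | Branch;
type Leaf = { kind: "leaf"; value: number };
type Branch = { kind: "branch"; left: BinaryTree; right: BinaryTree };

function sum(tree: BinaryTree): number {
  if (tree.kind === "leaf") {
    return tree.value;
  } else {
    return sum(tree.left) + sum(tree.right);
  }
}

// Hypothetical second function: same template, different combination.
function countLeaves(tree: BinaryTree): number {
  if (tree.kind === "leaf") {
    return 1;                                                // base case
  } else {
    return countLeaves(tree.left) + countLeaves(tree.right); // two recursive calls
  }
}

// A tree with leaves 1, 2, and 3.
const exampleTree: BinaryTree = {
  kind: "branch",
  left: { kind: "leaf", value: 1 },
  right: {
    kind: "branch",
    left: { kind: "leaf", value: 2 },
    right: { kind: "leaf", value: 3 }
  }
};
```

For exampleTree, sum evaluates to 6 and countLeaves evaluates to 3.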

What the Types Can Catch

Modelling the data this way is not just tidy; it changes what can go wrong. Because the types describe the exact shape of the information, the compiler rejects code that does not respect that shape, and it does so before the program ever runs.

Given the Song and Playlist types, each of these is rejected at compile time:

typescript

// a required field is missing
const bad1: Song = { title: "A", artist: "B" };
// error: property 'durationSeconds' is missing

// a field has the wrong type
const bad2: Song = { title: "A", artist: "B", durationSeconds: "200" };
// error: 'string' is not assignable to 'number'

// accessing data the case may not have
function firstSong(p: Playlist): Song {
  return p.first;
  // error: 'first' does not exist on an empty playlist
}

Without the types, none of these would be caught until the program ran, if they were caught at all.

The types rule out whole categories of mistakes statically, but they cannot check that a function computes the right answer. For that we still write tests. As in CPSC 110, we use checkExpect to state what a call should produce and have it verified when the program runs.

Using the example playlists from above:

typescript

checkExpect(countSongs(empty), 0);
checkExpect(countSongs(twoTracks), 2);
checkExpect(totalDuration(twoTracks), 380);

These run the functions and confirm they produce the expected values. The compiler guarantees the shapes line up; checkExpect confirms the answers are right for the cases we test.
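To give a feel for what such a check does at run time, here is a minimal stand-in. It is an assumption made for illustration and is not the course's checkExpect: the name checkExpectSketch, the Comparable type, and the error message are all hypothetical, and the sketch only handles values that === compares meaningfully.

```typescript
// Assumption: this is NOT the course's checkExpect, only a sketch of the idea.
// It is limited to values that === can compare meaningfully.
type Comparable = number | string | boolean | null;

function checkExpectSketch(actual: Comparable, expected: Comparable): void {
  if (actual === expected) {
    return; // the check passes silently
  } else {
    throw new Error("expected " + String(expected) + " but got " + String(actual));
  }
}

checkExpectSketch(1 + 1, 2);        // passes
checkExpectSketch("a" + "b", "ab"); // passes
```

A mismatch, such as checkExpectSketch(1 + 1, 3), stops the program with an error, which is how a wrong answer becomes visible even though the types line up.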

The Centrality of Abstraction

A precise data definition is the foundation everything else rests on. It catches mistakes early, it mirrors the structure of the problem, and it drives the structure of the code that consumes it: once the data is modelled, the functions largely follow its shape. In this chapter we followed one process across a sequence of examples, from a simple enumeration through a song to a recursive playlist, and then wrote functions whose shape follows the data's shape.

From here, Part 1 builds directly on this work: using generic types such as arrays and promises, deriving tests from the structure of data, and leaning further on the type checker. In Part 2, when we move to object-oriented programming, the tagged unions you wrote here become class hierarchies. The underlying ideas will carry over even as the syntax changes.
