Categories

Motivation

Serge Lang

In the forties and fifties (mostly in the works of Cartan, Eilenberg, MacLane, and Steenrod), it was realized that there was a systematic way of developing certain relations of linear algebra, depending only on fairly general constructions which were mostly arrow-theoretic, and were affectionately called abstract nonsense by Steenrod.

My source: Riehl

Peter Freyd

Perhaps the purpose of categorical algebra is to show that which is trivial is trivially trivial.

It is likely you were unwittingly exposed to mathematical categories long before you first heard the words "category theory." Real vector spaces and their linear transformations? That's a category. Groups and their group homomorphisms? A category. Rings, fields, or topological spaces (and the "appropriate" maps between them)? All categories.

At the most intuitive level, a mathematical category simply consists of "stuff" (usually mathematical objects with prescribed algebraic structures) and the "maps" between them (usually, but not always, set maps that respect those algebraic structures). Category theory, then, can be thought of as a mathematical language that is broadly applicable across algebra, topology, set theory, logic and beyond. This is part of what gives category theory its power, namely its ability to universally describe constructions and ideas across different mathematical disciplines. It brings under a single umbrella the study of sets (with their set maps), vector spaces (with their linear transformations), groups (with their homomorphisms) and topological spaces (with their continuous maps), just to name a few.

Category theory studies objects (e.g., groups) and the arrows between them (e.g., homomorphisms). Every general result of category theory is a result that can be interpreted and used in your favorite category. There are maps between categories, called functors, which allow us to connect categories to each other; and there are maps between functors, called natural transformations, which can provide deep insights into fundamental mathematical constructions (e.g., free groups) and operations (e.g., tensor products).

There is a second, less obvious benefit to studying category theory that I personally feel is even more profound. Thinking categorically can push us to embrace new, abstract ideas that we might initially find unintuitive, but which eventually provide incredible new insights. These insights and lessons are sprinkled throughout these notes. Keep your eyes peeled for them!

Formal definitions

Any formal definition of category is admittedly a bit clunky, so remember the general idea: you have objects and you have arrows between those objects.

Definition of category

A category consists of the following data:

A collection^[1] of objects
A collection of arrows
For each arrow, specified domain and codomain objects. The notation $f : a \to b$ signifies that $f$ is an arrow with domain $a$ and codomain $b$
For each object, a specified identity arrow. The notation $1_{a}$ denotes the identity arrow $1_{a} : a \to a$ for the object $a$
Any pair of arrows $f, g$ with the codomain of $f$ equal to the domain of $g$ is called a composable pair. For each composable pair of arrows, there is a specified composite arrow with domain the domain of $f$ and codomain the codomain of $g$ . We denote this composite arrow $g \circ f$ (or simply $g f$ , if there is no cause for confusion)

These data are subject to the following two axioms:
1. (Identity) For any arrow $f : a \to b$ , the composites $1_{b} \circ f$ and $f \circ 1_{a}$ are both equal to $f$ .
2. (Associativity) For any composable triple of arrows $f, g, h$ , the composites $h \circ (g \circ f)$ and $(h \circ g) \circ f$ are equal.

If one were forced to distill the above definition to a single idea, it might be that category theory is essentially the "theory of composition."
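To make the definition concrete, here is a minimal Python sketch that stores a finite category as explicit data and checks the two axioms by brute force. The class name `FiniteCategory`, its field names, and the arrow names `1_a`, `1_b`, `f` are illustrative assumptions, not standard notation or a library API.

```python
from itertools import product


class FiniteCategory:
    """A finite category given as explicit data (illustrative sketch only)."""

    def __init__(self, objects, arrows, identities, composites):
        self.objects = set(objects)
        self.arrows = dict(arrows)          # arrow name -> (domain, codomain)
        self.identities = dict(identities)  # object -> name of its identity arrow
        self.composites = dict(composites)  # (g, f) -> name of the composite g . f

    def composable(self, g, f):
        """True when codomain(f) == domain(g), so that g . f should exist."""
        return self.arrows[f][1] == self.arrows[g][0]

    def is_category(self):
        """Brute-force check of the data and of the two axioms."""
        arrows, comp = self.arrows, self.composites
        # Each object needs an identity arrow from that object to itself.
        if set(self.identities) != self.objects:
            return False
        if any(arrows.get(i) != (x, x) for x, i in self.identities.items()):
            return False
        # Each composable pair needs a composite with the right domain and codomain.
        for g, f in product(arrows, repeat=2):
            if self.composable(g, f):
                if (g, f) not in comp:
                    return False
                if arrows.get(comp[(g, f)]) != (arrows[f][0], arrows[g][1]):
                    return False
        # Identity axiom: 1_b . f == f == f . 1_a for every f : a -> b.
        for f, (a, b) in arrows.items():
            if comp[(self.identities[b], f)] != f or comp[(f, self.identities[a])] != f:
                return False
        # Associativity axiom: h . (g . f) == (h . g) . f for composable triples.
        for h, g, f in product(arrows, repeat=3):
            if self.composable(g, f) and self.composable(h, g):
                if comp[(h, comp[(g, f)])] != comp[(comp[(h, g)], f)]:
                    return False
        return True


# Two objects a, b and a single non-identity arrow f : a -> b.
example = FiniteCategory(
    objects={"a", "b"},
    arrows={"1_a": ("a", "a"), "1_b": ("b", "b"), "f": ("a", "b")},
    identities={"a": "1_a", "b": "1_b"},
    composites={
        ("1_a", "1_a"): "1_a",
        ("1_b", "1_b"): "1_b",
        ("f", "1_a"): "f",
        ("1_b", "f"): "f",
    },
)
print(example.is_category())  # True
```

Note how little of the check concerns the objects themselves: almost every line is about which composites exist and how they behave.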

Visualization

When we want to visualize categories it's useful to think of the objects as dots and the arrows as ... arrows. For example, we might visualize two objects $a$ and $b$ with some arrows between them as follows:

There are a few things to note. First, if this image is meant to represent an entire category, then it is implicitly assumed that all of the properties required to be a category hold. For example, there must exist an arrow corresponding to the composition $h \circ f$ (and similarly one for $h \circ g$ ), and since there is evidently only one arrow from $a$ to $a$ in the image above, namely $1_{a}$ , then we must have $h \circ f = 1_{a}$ (and also $h \circ g = 1_{a}$ ).

That being said, it is much more common to draw a picture like the above to represent a small "part" of a category, in which case we are not meant to assume that the only arrows between $a$ and $b$ are the ones shown. In that (common) case, we are simply visualizing some of the arrows in the category, leaving open the possibility of others. In that case, we would not assume that $h \circ f$ must equal $1_{a}$ . (Later on we will formalize the notion of a commutative diagram in a category.)

In all cases, it is common convention (mainly for our own sanity) to omit any arrows that must necessarily be in the category, per the definition of category. So we usually don't draw the identity arrows, nor do we draw the compositions of all composable arrows. We simply assume that those are all present. In our current example, we might then simply sketch the following diagram:

For one final simplification, it is common to simplify the visual presentation of the objects, either by dropping their labels, or removing the dots, such as below:

More examples can be found below.

Conventions

The language and notation of category theory is not completely standardized, but here are some common conventions.

Abstract categories are sometimes denoted with a single capital letter, such as $C$ . In this case, the set of objects of the category is often denoted $Ob (C)$ . However, it is also common to simply use $C$ to refer also to the set of objects, writing $a \in C$ for an object $a$ in the category $C$ . You could go even further, and use this notation for arrows as well (e.g., referring to an arrow $f \in C$ ), but that is less common. (It was also originally common to use script letters for categories, like $\mathcal{C}$ , probably because that made them seem fancy and exotic. Let's not mystify our categories here.)

It is common to use the word morphism in place of "arrow." I will personally use "morphism" when working with known algebraic objects (such as groups or modules), as it harkens back to the word "homomorphism" that was (and regrettably, in my opinion, still is) used in those contexts. However, for an abstract category I will stick to "arrow."

Given two objects $a$ and $b$ in a category $C$ , the collection of arrows between them is usually denoted with some variation of $Hom (a, b)$ or $Mor (a, b)$ . This derives from the alternative names of "homomorphisms" or "morphisms", noted above. The word "hom-sets" refers to such collections of arrows^[2]. If there are multiple categories under consideration, we might more carefully denote hom-sets as ${Hom}_{C} (a, b)$ or ${Mor}_{C} (a, b)$ .
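As a small illustration of the hom-set notation, the sketch below computes $Hom (a, b)$ from a table of arrows. The function name `hom` and the arrow table (a fragment of some hypothetical category, not a complete one) are assumptions made only for this example.

```python
def hom(arrows, a, b):
    """Return the names of all listed arrows with domain a and codomain b."""
    return {name for name, (dom, cod) in arrows.items() if (dom, cod) == (a, b)}


# Some arrows of a hypothetical category: arrow name -> (domain, codomain).
arrows = {
    "1_a": ("a", "a"),
    "1_b": ("b", "b"),
    "f": ("a", "b"),
    "g": ("a", "b"),
    "h": ("b", "a"),
}

print(sorted(hom(arrows, "a", "b")))  # ['f', 'g']
print(sorted(hom(arrows, "b", "a")))  # ['h']
```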

Most (but not all) categories are named after their objects. For example, the category with objects all groups and with arrows all group morphisms is called "the category of groups" and is usually denoted with some variation of $Grp$ . More examples can be found below.

Set-theoretic issues

Optional warning

At the most fundamental and rigorous level, there are some technical logical issues that need to be addressed. This section briefly addresses those concerns, but this is not something we will worry about elsewhere.

When attempting to study all objects of a certain type, it is easy to run into set-theoretic issues along the lines of Russell's paradox. A common convention is to assume there is a big enough set $U$ , called a universe, such that all sets one needs to discuss are members of the set $U$ . (There are some formal properties it needs to satisfy.) A set $X$ is then called small if it is a member of the universe. A set that is not in the universe is called large. (Some sources instead use the language of sets and classes.) The universe itself is necessarily large.

Definition of small category

A category is small if its collection of arrows is a small set.

Since each object is uniquely associated with an identity arrow, in a small category the collection of objects is also a small set. Unfortunately, many of the common categories we encounter are not small, i.e., are large.

Definition of locally small

A category is locally small if for any pair of objects the collection of arrows between those objects is a small set.

Will we worry about any of this? We will not. Instead, we will embrace the following quote:

Emily Riehl

The search for the most useful set-theoretical foundations for category theory is a fascinating topic that unfortunately would require too long of a digression to explore. Instead, we sweep these foundational issues under the rug, not because these issues are not serious or interesting, but because they distract from the task at hand.

Examples

Examples are abundant! We begin with some really basic categories before discussing the (more complicated) categories you've likely encountered before.

Some basic categories

The smallest possible category is the empty category, which has neither objects nor arrows. This category is usually denoted $0$ . It is the initial object in the (large) category of all (small) categories: for every (small) category $C$ there is a unique arrow (functor) from $0$ to $C$ .

The next smallest categories are the categories that have a single object. It is common to let $1$ denote the category with exactly one object and one arrow (the identity arrow on that object).

Since there is a unique object and unique arrow (the identity arrow on that object), there's no point to even label them. However, if you were to label the object you might^[3] label it as below:

With our convention of omitting identity arrows, we would visualize this category simply as

The category $1$ is the terminal object in the (large) category of all (small) categories: for every (small) category $C$ there is a unique functor from $C$ to $1$ .

Note

There are lots of categories that have a unique object but many arrows. Such categories are in (natural) bijection with monoids.

Continuing the pattern above, the category denoted $2$ is the category with two objects and exactly one non-identity arrow. Since identity arrows are not usually included when sketching visual representations of categories, a picture of the category $2$ is

As one last basic example in this specific sequence of categories, the category $3$ has three objects and the following non-identity arrows:

Note that the arrow $1 \to 3$ is equal to the composition of the arrows $1 \to 2$ and $2 \to 3$ .
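The category $3$ is small enough to write out completely. The sketch below assumes an encoding chosen only for illustration: since there is at most one arrow between any two objects, the arrow $i \to j$ is stored as the pair `(i, j)`, and the function name `compose` is likewise hypothetical.

```python
objects = [1, 2, 3]

# Exactly one arrow i -> j whenever i <= j, encoded as the pair (i, j).
arrows = [(i, j) for i in objects for j in objects if i <= j]


def compose(g, f):
    """Composite g . f of f : i -> j and g : j -> k, namely the arrow i -> k."""
    if f[1] != g[0]:
        raise ValueError("not a composable pair")
    return (f[0], g[1])


non_identity = [arrow for arrow in arrows if arrow[0] != arrow[1]]
print(non_identity)                  # [(1, 2), (1, 3), (2, 3)]
print(compose((2, 3), (1, 2)))       # (1, 3)
```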

Preorders

A preorder is a category $P$ in which there is at most a single arrow between any two objects. We can then define a relation $\leq$ on the objects of $P$ by saying $p \leq p'$ if and only if there is an arrow $p \to p'$ in $P$ . This relation is reflexive (identity arrows) and transitive (composition of arrows). Each of the basic categories above (i.e., $0, 1, 2, 3, \dots$ ) is a preorder.

If $X$ is a set, we can form a preorder $C$ whose objects are the subsets of $X$ , and where there is an arrow $U \to V$ exactly when $U \subseteq V$ . This category can be useful for understanding "internal constructions" in the set $X$ , such as intersections and unions of subsets, in the language of category theory. (We could do the same type of construction for a vector space, group, ring, etc.) There should be an established name (and notation) for this construction, but I don't know what it is (yet).
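The subset preorder can be explored directly for a small set. The following sketch, with hypothetical helper names `subsets`, `has_arrow`, and `is_greatest_lower_bound`, checks reflexivity and transitivity for $X = \{1, 2, 3\}$ and then verifies one "internal construction": the intersection $U \cap V$ has arrows to both $U$ and $V$ , and every other subset with that property has an arrow to it.

```python
from itertools import combinations


def subsets(X):
    """All subsets of X, as frozensets (the objects of the preorder)."""
    items = sorted(X)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]


def has_arrow(U, V):
    """There is an arrow U -> V exactly when U is a subset of V."""
    return U <= V


def is_greatest_lower_bound(M, U, V, objs):
    """M maps to U and V, and every W mapping to both also maps to M."""
    lower = has_arrow(M, U) and has_arrow(M, V)
    greatest = all(has_arrow(W, M) for W in objs if has_arrow(W, U) and has_arrow(W, V))
    return lower and greatest


objs = subsets({1, 2, 3})
reflexive = all(has_arrow(U, U) for U in objs)
transitive = all(
    has_arrow(U, W)
    for U in objs for V in objs for W in objs
    if has_arrow(U, V) and has_arrow(V, W)
)
U, V = frozenset({1, 2}), frozenset({2, 3})
print(len(objs), reflexive, transitive)            # 8 True True
print(is_greatest_lower_bound(U & V, U, V, objs))  # True
```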

Sets as categories

A category is discrete when every arrow is an identity arrow. In other words, it's basically just a set (of objects). For example, a discrete category with six objects might be visualized as below. As usual, the identity arrows are not shown.

To each set $X$ we can associate the discrete category $X$ , whose objects are labeled by the elements of $X$ and which only contains identity arrows. The assignment sending each set $X$ to the corresponding category $X$ is the object map of a functor from the category $Set$ to the category $Cat$ of (small) categories.
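A rough sketch of this construction, under an encoding assumed only for illustration (arrows stored as a table from names to domain and codomain, with identity arrows named `1_x`): build the discrete category on a set and test discreteness by checking that every arrow has equal domain and codomain and is one of the identities. The function names `discrete_category` and `is_discrete` are hypothetical.

```python
def discrete_category(X):
    """Objects are the elements of X; the only arrows are the identities 1_x."""
    objects = set(X)
    identities = {x: f"1_{x}" for x in objects}
    arrows = {identities[x]: (x, x) for x in objects}
    return objects, arrows, identities


def is_discrete(arrows, identities):
    """A category is discrete when every arrow is an identity arrow."""
    return set(arrows) == set(identities.values())


objects, arrows, identities = discrete_category({"p", "q", "r"})
print(is_discrete(arrows, identities))  # True

# Adding a single non-identity arrow p -> q destroys discreteness.
bigger = dict(arrows, f=("p", "q"))
print(is_discrete(bigger, identities))  # False
```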

Groups as categories

For a given group $G$ one can associate a category $B G$ , which has a single object $⋆$ and exactly one arrow $g : ⋆ \to ⋆$ for each element $g \in G$ . The composition of arrows in the category $B G$ corresponds to the group operation in $G$ . The category $B G$ is sometimes called the delooping category (or delooping groupoid) of the group $G$ .

For example, if $G$ is a group with four elements (say $G = Z_{4}$ ), then a picture of $B G$ might look like

Note here that I have chosen to include the identity arrow, since it corresponds to the identity element in $G$ (and it feels weird leaving it out).^[4]

The assignment sending each group $G$ to the corresponding category $B G$ is the object map of a functor $B : Grp \to Cat$ .
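For the four-element example, the category axioms for $B G$ reduce to familiar group axioms. The sketch below assumes $G$ is the cyclic group of integers modulo $4$ under addition, with elements encoded as `0, 1, 2, 3`; the names `STAR`, `compose`, and the three check variables are illustrative only.

```python
N = 4            # order of the cyclic group (assumed: integers mod 4 under addition)
STAR = "*"       # the unique object of BG
elements = range(N)

# One arrow * -> * for each group element.
arrows = {g: (STAR, STAR) for g in elements}


def compose(g, f):
    """Composite g . f, defined as the group operation (addition mod 4)."""
    return (g + f) % N


identity = 0  # the identity arrow corresponds to the identity element

identity_ok = all(compose(identity, g) == g == compose(g, identity) for g in elements)
assoc_ok = all(
    compose(h, compose(g, f)) == compose(compose(h, g), f)
    for f in elements for g in elements for h in elements
)
# Every arrow has an inverse, which is what the word "groupoid" refers to.
invertible = all(any(compose(g, f) == identity for f in elements) for g in elements)
print(identity_ok, assoc_ok, invertible)  # True True True
```

Every pair of arrows is composable here because there is only one object, so composition is a total operation on the arrows.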

Matrices over a fixed commutative ring

For each commutative ring $R$ , the set of all matrices with entries in $R$ is the arrow set of a category ${Mat}_{R}$ . The objects of this category are the positive integers, and each $m \times n$ matrix $A$ corresponds to an arrow $A : n \to m$ . Composition of arrows corresponds to matrix product.

For example, in the category ${Mat}_{\mathbb{R}}$ of real matrices, the arrows $2 \to 4$ are in bijection with the $4 \times 2$ matrices with real entries, while the arrows (loops) $2 \to 2$ are in bijection with the (square) $2 \times 2$ matrices with real entries. The identity arrow on an object $n$ corresponds to the $n \times n$ identity matrix. An illustration of how arrow composition matches with matrix product might be the following:

Note that composition is written algebraically right-to-left ("inside out"), so the composition of the two arrows above corresponds to the arrow labeled by the product of those matrices in the opposite (visual) order.
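The correspondence between composition and matrix product can be checked with a few lines of plain Python. This is only a sketch with integer entries; the helper names `domain`, `codomain`, `identity`, and `compose`, and the matrices `A` and `B`, are chosen for illustration.

```python
def domain(A):
    """An m x n matrix is an arrow n -> m, so its domain is the number of columns."""
    return len(A[0])


def codomain(A):
    """The codomain of an m x n matrix is the number of rows m."""
    return len(A)


def identity(n):
    """The identity arrow on n is the n x n identity matrix."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def compose(B, A):
    """Composite B . A of A : n -> m and B : m -> p, computed as the product BA."""
    if domain(B) != codomain(A):
        raise ValueError("not a composable pair")
    return [
        [sum(B[i][k] * A[k][j] for k in range(codomain(A))) for j in range(domain(A))]
        for i in range(codomain(B))
    ]


A = [[1, 2], [3, 4], [5, 6]]                        # 3 x 2, an arrow 2 -> 3
B = [[1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 1]]    # 4 x 3, an arrow 3 -> 4
BA = compose(B, A)                                  # 4 x 2, an arrow 2 -> 4

print(domain(BA), codomain(BA))  # 2 4
print(BA)                        # [[1, 2], [3, 4], [5, 6], [9, 12]]
```

Attempting `compose(A, B)` raises an error, since the codomain of `B` is $4$ while the domain of `A` is $2$ .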

A rare case

This is the rare case of a category named after its arrows!

Opposite categories

For each category $C$ , its opposite is the category $C^{op}$ with the same objects as $C$ and with all arrows "reversed." In other words, for each arrow $f : c \to d$ in $C$ there is a corresponding arrow $f^{op} : d \to c$ in $C^{op}$ (and conversely).
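For a finite category stored as tables, passing to the opposite category amounts to swapping domains with codomains and swapping the order of composition. The sketch below assumes that encoding (arrow name to domain and codomain, plus a table of composites) and, for brevity, reuses each arrow's name for its reversed counterpart rather than writing $f^{op}$ ; the function name `opposite` is hypothetical.

```python
def opposite(arrows, composites):
    """Reverse every arrow and the order of every composite."""
    op_arrows = {name: (cod, dom) for name, (dom, cod) in arrows.items()}
    # If g . f = k in C, then f . g = k in the opposite category.
    op_composites = {(f, g): k for (g, f), k in composites.items()}
    return op_arrows, op_composites


# Two objects a, b and one non-identity arrow f : a -> b.
arrows = {"1_a": ("a", "a"), "1_b": ("b", "b"), "f": ("a", "b")}
composites = {
    ("1_a", "1_a"): "1_a",
    ("1_b", "1_b"): "1_b",
    ("f", "1_a"): "f",
    ("1_b", "f"): "f",
}

op_arrows, op_composites = opposite(arrows, composites)
print(op_arrows["f"])                                        # ('b', 'a')
print(opposite(op_arrows, op_composites) == (arrows, composites))  # True
```

The last line reflects the fact that taking the opposite twice returns the original category.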

Why consider such a category? We'll see.

Large categories

Most of the objects we encounter in math are the objects of some (large) categories. Below is a quick roundup of some with which you might already be familiar.^[5]

Category	Objects	Arrows
$Set$	sets	set maps (i.e., functions)
${Set}_{*}$	sets with selected base point	base-point-preserving set maps
$Cat$	categories	functors
$Mon$	monoids	morphisms of monoids
$Grp$	groups	group homomorphisms
$Ab$	abelian groups	group homomorphisms
$Ring$	rings (with unity)	(unit-preserving) ring homomorphisms
$CRing$	commutative rings (with unity)	(unit-preserving) ring homomorphisms
$R\text{-}Mod$	left modules over the ring $R$	$R$ -module homomorphisms
$Mod\text{-}R$	right modules over the ring $R$	$R$ -module homomorphisms
$Top$	topological spaces	continuous maps
$Toph$	topological spaces	homotopy classes of maps
${Top}_{*}$	topological spaces with selected base point	base-point-preserving continuous maps

Suggested next note

Functors

Here we use the word "collection" (as opposed to "set") to allow for set theory technicalities, such as a "class" of objects. For our purposes you can assume the objects of our categories form a set. ↩︎
At least in the case these collections are sets. See below. ↩︎
You'll shortly see why I labeled the object with a number, as opposed to a letter. ↩︎
And I choose to denote the unique object with a star for flair. ↩︎
Note that, in those cases in which the objects correspond to some type of sets, those sets should be assumed to be small sets. ↩︎