Adjoints

...good general theory does not search for the maximum generality, but for the right generality.

Motivation

Have you ever wondered what's so "free" about free groups? Or why the free abelian group on a set X is not the same as the free group on the set X? Have you ever wondered what some constructions (such as these "free" constructions) commute with certain constructions (like forming direct sums) but not others (like forming direct products)?

Or perhaps you've wondered if objects in different categories can have maps to each other. Sadly, they cannot. But they can try the next best thing. Imagine for a moment, two categories C and D and objects cC and dD. Officially, the two objects are separated by an impossible gulf:

If there are no functors between C and D, then the objects c and d truly will never see each other. They can have no interaction whatsoever. The story ends.

But suppose there were functors connecting the two categories. We can think of these as transit lines, able to convey data (objects and arrows) from one category to the other.

Suddenly it is possible for the two objects to interact. We just need to send one of the objects into the other category. Of course, we have two options. We can either send c into D using F; or we can send d into C using G.

And now we have two different sets of maps we can consider. We can consider the maps between c and G(d) in C; or we can consider the maps between F(c) and d in D:

For a random pair of functors F and G, we have no reason to expect any nice relationship between these sets of arrows. As in the visual example above, there might be more arrows on one side of this picture than the other. So let's consider some special cases.

Special Case 1: Isomorphisms of categories

Let's first consider the case in which F and G are mutual inverses. In other words, suppose GF=IC and FG=ID. In this case we say both F and G are isomorphisms and the categories C and D are isomorphic.

Is this case interesting? Not really. An isomorphism of categories means a bijection on both objects and arrows. For all intents and purposes, the categories C and D are identical. Are question about arrows between c and d is then just a question about arrows between two objects in the same category.

Special Case 2: Equivalences of categories

With natural transformations at our disposal, it makes sense to relax the requirement that the compositions GF and FG exactly equal the respective identity functors, and replace it with the weaker condition that the compositions are naturally isomorphic to the respective identity functors. In other words, let's assume there are natural isomorphisms GFIC and FGID. This means that for every cC we have an natural isomorphism cGF(c) in C, and similarly for every dD a natural isomorphism FG(d)d in D. In this case, the functors F and G are said to be naturally isomorphic and the categories C and D are said to be equivalent.

When people talk about equivalent categories, this is what they mean. For our purposes, though, this is still too strong of a condition on our functors. Equivalent categories are "basically" the same, and we want to allow for functors between categories that are "very different."

Special Case 3: Adjunction of categories

Suppose we stick to our original idea, and ask for pairs of functors F,G for which arrows F(c)d in D are in natural bijection with arrows cG(d) in C:

Inverse functors and isomorphic functors certainly satisfy this condition, but potentially many other pairs of functors, too. Following this thread leads to the notion of an adjunction of categories.


Adjunction of categories

There a several equivalent definitions, but we'll give the one that most resembles the situations we've seen:

Definition of an adjunction

Suppose C and D are categories. An adjunction from C to D is a triple F,G,τ:CD where F and G are functors in opposite directions between categories C and D

F:CD:G

and τ is a function that assigns to every pair of objects cC, dD a natural bijection

τc,d:HomD(F(c),d)HomC(c,G(d)).

We call the pair of functors (F,G) and adjoint pair. The functor F is said to be a left adjoint for G, while G is called a right adjoint for F.

We should immediately note that the adjunction requires all three pieces of information: the functor F, the functor G, and the natural bijections in τ between the hom-sets (shown above). In particular, the natural bijections in τ are not implicit in the functors F and G.

The naturality condition

The statement that τc,d is natural has a formal meaning. Let's first consider "naturality in c." Suppose f:cc is an arrow in C. The functor F then yields an arrow F(f):F(c)F(c) in D. For each arrow g:F(c)d in D, pre-composition with F(f) yields and arrow gF(f):F(c)d in D. Similarly, for each arrow h:cG(d)), pre-composition with f yields an arrow hf:cG(d) in D. Naturality in c is then the requirement that the diagram below commutes:

Similarly, naturality in d is the requirement that for every arrow g:dd in D, the diagram below commutes:

In short, the bijections in τ should be compatible with arrow composition.

Aside

The description of an adjunction can be rephrased in an equivalent way in which τ becomes a bona fide natural isomorphism between two functors.


We will see a plethora of examples (below), but first a few additional notes are in order.

Units and counits

There is a lot of additional information that can be extracted from an adjunction, and some of that information is equivalent to the adjunction itself. Here we list some of that data.

If we set d=F(c) in the natural bijection of the adjunction, then the left-hand hom-set is the set HomD(F(c),F(c)). This set contains one identifiable arrow, namely the identity arrow 1F(c). Let ηc denote its image under τ. This arrow is a universal arrow from c to G, and the function cηc defines a natural transformation from the identity functor IC to GF:

η:ICGF

We call η the unit of the adjunction. This situation is visualized below:

This unit η:ICGF is the weakening of the natural isomorphism GFIC in Case 2, above.

A warning about the unit

The image above might suggest that ηc is simply G(1F(c)), but this is not the case. By functoriality we always have G(1F(c))=1GF(c), i.e., it's simply the identity arrow on GF(c). On the other hand, the unit is an arrow cGF(c), and as we'll see in our examples below, it is often very far from an identity arrow.

Similarly, if we set c=G(d) in the natural bijection above, then the right-hand hom-set contains the identity arrow 1G(d). Let εd denote its preimage under τ. This arrow is a universal arrow from F to d, and the function dεd defines a natural transformation from FG to the identity functor ID:

ε:FGID

We call ε the counit of the adjunction. This dual situation is visualized below:

Moreover, the unit and counits satisfy two "triangular" identities, namely the commutative triangles of natural transformations below:

What should you take away from these triangles? Not much. Only that these identities give specific natural isomorphisms of FFGF and GGFG. In other words, after each round trip, nothing new really happens. Doing F, then G, then F again is "essentially the same" as just doing F. So while the functors F and G are (probably) not mutual inverses, they're the next closest thing.

In any case, it turns out that an adjunction between C and D can be given equivalently by the data of the two functors F and G, together with the unit η:ICGF and the counit εFGID (satisfying those triangle identities). See page 82 in Mac Lane for details.


Suggested next note

Examples of adjoints