Direct sums of modules

As is always the case with category-theoretic constructions, there is a construction "dual" to that of the direct product of modules, called the direct sum of modules. It is characterized by the property "dual" to that of direct product. (It really should have been called the coproduct instead of the direct sum, but we'll save that discussion for later.)

The direct sum of two modules


First suppose M1 and M2 are two R-modules. We can construct an R-module called their direct sum, denoted M1⨁M2, whose elements consist of all formal sums[1] of the form m1+m2, with "component-wise" addition and scaling.[2] We also have two "inclusion" module morphisms ji:Miβ†’M1⨁M2, which send an element mi∈Mi to the element m1+0M2 (for i=1) or 0M1+m2 (for i=2).

These data satisfy the usual universal property for a coproduct, in that it is universal among all such modules equipped with morphisms from M1 and M2. More precisely, for each module P and pair of module morphisms f1:M1β†’P and f2:M2β†’P there is a unique module morphism h:M1⨁M2β†’P such that the diagram below commutates:

As with the direct product, we can actually deduce the unique map h from the commutativity conditions. First suppose we take any element m1+m2 of M1βŠ•M2. We must have

h(m1+m2)=h((m1+0M2)+(0M1+m2))=h(m1+0M2)+h(0M1+m2)=h(j1(m1))+h(j2(m2))=(h∘j1)(m1)+(h∘j2)(m2)=f1(m1)+f2(m2).

So, the only possible map h:M1βŠ•M2β†’P with the required factorization properties is defined by h(m1+m2)=f1(m1)+f2(m2). All that remains is to verify that this map is indeed an R-module morphism.

Coproducts in related categories

Observe that, as an abelian group, M1⨁M2 is the usual direct sum (i.e., coproduct) of abelian groups; however, as a set, M1⨁M2 is not (bijective to) the usual disjoint union (i.e., coproduct) of sets.

More precisely, let U1:R-Modβ†’Ab and U2:R-Modβ†’Set denote the usual forgetful functors. Then U1(M1⨁M2)≃U1(M1)⨁U2(M2) but U2(M1⨁M2)≄U2(M1)βŠ”U2(M2). We might reasonably say that the forgetful functor U1 preserves binary coproducts, but the forgetful functor U2 does not.

What is a "formal sum"?

In the construction above, we wrote that the elements of M1βŠ•M2 consist of "formal sums" of the form m1+m2 with mi∈Mi. But what is a "formal sum" and why is it not just called a "sum"? For an initial answer to that question, note that for elements m1∈M1 and m2∈M2 there is no ambient operation that allows us to add m1 to m2. That's because we are not given any module M in which we can simultaneously view M1 and M2 as submodules. In fact, you can view the construction of M1βŠ•M2 as precisely the construction of such a module.

The question still remains, however, as to what we're precisely doing here. In which previously existing set is the formal sum m1+m2 supposed to be contained? We'll give an answer to that question at the bottom of this note.

The direct sum of an arbitrary family of modules


As with product of modules, let's now jump to the general case. Suppose {Ms∣s∈S} is a family of R-modules indexed by some set S; i.e., a functor F:Sβ†’R-Mod. Following the pattern we've established, the direct sum of this family is an R-module, denoted ⨁s∈SMs, together with module morphisms jt:Mt→⨁s∈SMs for every t∈S, universal among all such data. As an abelian group, this is the usual direct sum of the corresponding abelian groups.

How is the module ⨁s∈SMs constructed? Recall that with the infinite direct product, we formally defined the elements of ∏s∈SMs as set maps f:Sβ†’Z such that f(s)∈Ms for every s∈S, where Z was any set containing all of the elements of every Ms. Here we do the same, with one small twist. The elements of ⨁s∈SMs are the set maps f:Sβ†’Z such that:

  1. f(s)∈Ms for every s∈s
  2. f(s)=0Ms for all but finitely many s∈S.

With this description of the direct sum, the "inclusion" morphism it:Mt→⨁s∈SMs is the morphism that sends each m∈Mt to the map fm:Sβ†’Z defined by

fm(s)={m,ifΒ s=t0Ms,else

You might feel that this formal description is clunky, but there is one immediate benefit. Written this way, it is clear that there is a "canonical" injective module morphism

⨁s∈SMsβ†’βˆs∈SMs,

which is an isomorphism when S is finite (but usually not an isomorphism when S is finite).

But why the requirement that f(s)=0Ms for all but finitely many s∈S? This new restriction turns out to be a consequence of the desired universal property, one that was not implied/required for the universal property of the direct product.

Indeed, suppose we did not require this. Now suppose we had a collection of R-module morphisms gs:Msβ†’P, one for every element . The desired universal property of ⨁s∈SMs would then require the existence of a unique module morphism h:⨁s∈SMsβ†’P through which every gs factored. This factorization property would force h to satisfy

h(f)=βˆ‘s∈Sg(f(s)).

However, that sum occurs in the module P, where arbitrary sums of elements are not necessarily defined. (As an R-module, P is endowed with a binary operation, which can be repeated to define an n-ary operation, but not necessarily anything beyond that.) In other words, it's not clear such a map h is well defined (or even possible).

The finiteness condition on our formal sums removes this issue, and indeed the desired universal property is now straightforward to verify.

Suggested next notes


Sums of submodules
Direct products vs. direct sums vs. sums


  1. See below if you don't like this "formal sum" business. β†©οΈŽ

  2. The choice of symbol is deliberate. The direct sum of modules is not the same as the (direct) sum of submodules of a given module, so the notation is use to distinguish the two constructions. However, both share the same universal property, only in different categories. Thus the desire to give both a "sum-like" notation. β†©οΈŽ