This note should be considered temporary, focusing only on what we need right now for the express purpose of understanding Fourier series. It will later be expanded and broken into many notes.
It turns out that much of the machinery we're building around Fourier series can be greatly streamlined if we take a minute to explore a way to define "orthogonality" for functions, very similar to the usual dot product for column vectors.
The dot product on
Let's first extend the notion of dot product to vectors with complex coordinates, although we need one little adjustment.
First suppose is a single complex number. Using the complex plane as our guide, the point corresponding to would be at coordinates , which is a distance from the origin. Because of this, it is reasonable to define the length of the complex number as
Notice that we then immediately have
Now, by comparison, notice that
and so we almost never have . This in contrast with real numbers, where for any real number we always have (where is the usual absolute value, which measures the distance of the point from the origin on the real line). However, observe that
Using this observation, it is hopefully reasonable to define the length of a complex vector in as
With this in mind, it should seem reasonable to extend the dot product on to vectors in $ as follows:
Definition of dot product on
Given vectors and in , we define their dot product to be the complex number denoted given by
Notice that if and happen to have real entries, then this new dot product is the same as our old dot product (since complex conjugation doesn't do anything to real numbers). Also, with this definition we always have
both for vectors in and .
Extending the dot product to functions
Using the dot product on defined above as inspiration, we define the following:
A definition of an inner product on functions
Suppose are nice[1] complex-valued functions on . We define their inner product to be the complex number denoted given by
We can use this inner product to define a norm (or length) for functions , by setting to be the number given by
We say a function is of unit length if .
We can also extend the notion of orthogonal to functions.
A definition of orthogonal for functions
We say two functions are orthogonal if .
Orthonormal bases
Orthonormal bases on
Recall that in we have the following collection of so-called standard basis vectors:
This collection of vectors has many nice properties.
The fact that they form a basis for means that every vector in can be written uniquely as a linear combination of these vectors.
Each of these vectors is a unit vector, i.e., for every .
These vectors are mutually orthogonal, i.e., whenever .
The second two properties above provide a super quick and easy way to write a given vector in as a linear combination of these basis vectors. Indeed, if we first suppose
then for each we can dot product both sides of the above equality with to obtain
since those dot products are all zero except in the one case when , for which .
In other words, we always have
Extending the above ideas to our world of periodic functions
None of the above argument is restricted to with the usual dot product and the standard basis. We can repeat the same logic in any vector space which has an inner product, and any basis that is orthonormal with respect to that inner product.
In particular, for each integer let's write , considered as a function from to . Using the inner product we defined above, observe first that each of these functions has unit length:
These functions are also mutually orthogonal, since for every pair of integers and with we have
So while it's not clear these set of these functions is a orthonormal basis, it's at least an orthonormal set. By the same logic used above, it follows that if a function can be written as a linear combination
then the coefficients must be given by
The space
In order to state a bunch of nice facts that are actually true, we need to nail down what it means for our functions to be "nice." We have been considering functions and we would like to define the norm (or length) of such functions by the formula
So at the very least we should restrict ourselves to functions where the integral above exists and is finite. We will refer to the space of such functions as (a) Lebesgue space and denote it .[2]