Orthogonality & Completeness

The vector idea, stretched to functions

Picture an ordinary vector in three dimensions. The reason you can write v = v_x i + v_y j + v_z k so painlessly is that the basis vectors i, j, k are orthogonal — mutually perpendicular — so each component decouples from the others. To find v_x you do not solve three equations at once; you simply take the dot product v · i, and the other two directions contribute nothing because they are perpendicular to i. Orthogonality is what turns 'find the coefficients' into 'take one inner product each'. This whole guide is the claim that the same convenience is available for functions.

To talk about perpendicular functions we need a notion of 'dot product' for functions. The standard one replaces the finite sum over components with an integral over the interval: the inner product of two functions f and g on [a, b] is integral from a to b of f(x) g(x) w(x) dx, where w(x) is a fixed positive weight function we'll explain shortly. (If you ignore w, just read it as 1 for now.) This is a definite integral you already know how to compute; the new mental move is treating its value as a single number that measures how aligned two functions are. Two functions are called orthogonal when this integral comes out to zero.

Why different eigenvalues force orthogonality

Here is the heart of the matter, and it is one of the most satisfying short arguments in this whole volume. Recall from the previous guide that a Sturm–Liouville problem writes the operator in its self-adjoint form (d/dx)(p y') + q y + lambda w y = 0. The eigenfunctions y_m and y_n satisfy this with their own eigenvalues lambda_m and lambda_n. The trick is to multiply the y_m equation by y_n, multiply the y_n equation by y_m, subtract, and integrate from a to b. Almost everything cancels — and what survives is an identity that practically hands you the result.

Two eigenpairs:  (p y_m')' + q y_m = -lambda_m w y_m
                 (p y_n')' + q y_n = -lambda_n w y_n

multiply first by y_n, second by y_m, subtract, integrate a..b:

  (lambda_m - lambda_n) INT_a^b  y_m y_n w dx
        =  INT_a^b [ y_n (p y_m')' - y_m (p y_n')' ] dx

the right side is an exact derivative (Lagrange identity):
        =  [ p ( y_n y_m' - y_m y_n' ) ]_a^b      (a boundary term)

self-adjoint boundary conditions  ==>  that boundary term = 0

  ==>  (lambda_m - lambda_n) INT_a^b y_m y_n w dx = 0

  lambda_m != lambda_n   ==>   INT_a^b y_m y_n w dx = 0   (orthogonal!)

The Lagrange identity turns the messy left-over integral into a pure boundary term; the boundary conditions kill it; what remains forces the weighted integral of y_m y_n to vanish whenever the two eigenvalues differ.

Read the last line slowly, because it is the punchline. We arrive at (lambda_m - lambda_n) times integral of y_m y_n w dx = 0. If the two eigenvalues are different — lambda_m not equal to lambda_n — then that leading factor is nonzero, so the only way the product can vanish is for the integral itself to be zero. That integral being zero is exactly the statement that y_m and y_n are orthogonal under the weight. Nobody chose the eigenfunctions to be perpendicular; the self-adjoint structure plus the boundary conditions forced it. This is the theorem of orthogonality of eigenfunctions, and it is automatic.

Two honest footnotes. First, the boundary term vanishes only for the right kind of boundary conditions — the Sturm–Liouville family (the function pinned to zero at an end, or its slope, or a mix, or periodic matching). With the wrong boundary conditions the surviving term need not cancel, and orthogonality can fail; the theorem genuinely earns its hypotheses. Second, the argument settles only the case of distinct eigenvalues. If one eigenvalue were shared by two independent eigenfunctions, the integral need not be zero on its own — but you can always make those finitely many companions orthogonal by hand with Gram–Schmidt orthogonalization, the same tidying you'd do for ordinary vectors.

The weight is not optional bookkeeping

Look back at the derivation and notice exactly where the weight w(x) entered: it rode in on the lambda w y term of the equation, so it is w that appears in the orthogonality integral, not a bare product y_m y_n. This is orthogonality with a weight, and the weight is dictated by the equation — you do not get to choose it. When you cast a differential operator into self-adjoint form, the factor that multiplies lambda is the weight, and that is the only inner product under which these eigenfunctions are perpendicular.

A concrete pair makes the point. The plain trig system sin(n pi x / L) on [0, L] comes from y'' + lambda y = 0, whose self-adjoint form has weight w = 1; so those sines are orthogonal in the ordinary unweighted sense — the familiar integral of sin(m...) sin(n...) over a period being zero. But the Legendre polynomials P_n(x) come from (1 - x^2) y'' - 2x y' + n(n+1) y = 0, which is already self-adjoint as ((1 - x^2) y')' + n(n+1) y = 0 with weight w = 1 as well — while the Bessel functions arrive with a weight w(x) = x, and the Hermite weight is e^{-x^2}. Try to check Hermite or Bessel orthogonality without their weight and the integral simply won't vanish; insert the correct w and it falls to zero exactly.

Completeness: orthogonal AND enough of them

Orthogonality alone is not enough, and it is worth being precise about why. The single vector i is orthogonal to everything in the y–z plane, but you cannot build a general 3-D vector from i alone — you also need j and k to span the rest. A basis must be both orthogonal AND complete: rich enough that nothing is left out. The deep companion theorem of Sturm–Liouville theory says the eigenfunctions are exactly that. The completeness of eigenfunctions guarantees that the family {y_n} spans the whole space of reasonable functions on [a, b] — there is no function lurking 'perpendicular to all of them' except the zero function.

Put orthogonality and completeness together and you have a recipe machine. Suppose you want to write some target function f as f(x) = sum c_n y_n(x). Completeness promises the expansion exists; orthogonality hands you each coefficient with no fuss. Take the inner product of both sides with one eigenfunction y_m: on the right, every term integrates to zero except the n = m term, because the others are orthogonal. The whole infinite sum collapses to a single survivor, and out drops c_m = (integral of f y_m w dx) / (integral of y_m^2 w dx). This is the exact analogue of v_x = v · i, just with integrals.

Solve the Sturm–Liouville problem on [a, b] to get the orthogonal eigenfunctions y_n and the weight w(x).
Compute each coefficient by one inner product: c_n = (integral of f y_n w dx) / (integral of y_n^2 w dx). The denominator is just the squared length of y_n.
Assemble the generalized Fourier series f(x) = sum c_n y_n(x); completeness guarantees it reconstructs f (in the mean-square sense).

What completeness actually buys

Completeness is the quiet hero behind every expansion method in this volume. Because the eigenfunctions are complete, the ordinary trigonometric Fourier series, the Legendre series, the Bessel–Fourier series, and the Hermite expansion are all the SAME idea wearing different clothes — each is a generalized Fourier series for the Sturm–Liouville problem that geometry happens to produce. Completeness is also exactly why separation of variables succeeds for the heat, wave, and Laplace equations (the technique that splits a PDE into one ordinary equation per variable): you expand the initial data in the eigenfunctions, evolve each simple mode on its own, and superpose. If the eigenfunctions did not span the space, some initial states could not be represented and the method would stall.

Completeness even comes with an energy ledger. For ordinary vectors, the Pythagorean theorem says |v|^2 = v_x^2 + v_y^2 + v_z^2; the length of the whole equals the lengths of its perpendicular parts added in quadrature. The function version is Parseval's identity: integral of f^2 w dx = sum c_n^2 (integral of y_n^2 w dx). The total 'size' of f equals the sum of the sizes of its components — and the punchline is that completeness is precisely the condition that turns this from an inequality into an exact equality. If the eigenfunctions missed part of the space, the right side would fall short; equality means nothing is lost.

Why this is the secret engine of the volume

Step back and the whole landscape clicks into place. Every time this volume 'expands a function in orthogonal pieces' — and it does so constantly — it is invoking the same two guarantees you have just seen earned: orthogonality, which decouples the coefficients so each is a single integral, and completeness, which promises the pieces leave nothing out. The named special functions of mathematical physics are not a zoo of unrelated curiosities; they are the eigenfunctions of the Sturm–Liouville problems that the standard geometries — interval, disk, sphere — hand you. Master this one mechanism and you have, in a real sense, understood Fourier, Legendre, and Bessel all at once.

There is one more reason to treasure this structure: it makes hard problems modular. Because the modes are orthogonal, you can analyze one at a time and add the answers up without interference — the same superposition you relied on for linear ODEs, now operating on a whole basis of functions. The next guides in this rung put the machine to work: turning a concrete boundary-value problem into its eigenfunctions, and watching an arbitrary heat or wave profile dissolve into modes that each evolve by a simple law. Everything downstream rests on the two facts proven here — that different eigenvalues are perpendicular, and that together the eigenfunctions miss nothing.