Fundamental solutions give explicit formulas but require constant coefficients and no boundary. The weak formulation replaces the PDE with an equation in a Sobolev dual space, reducing existence to the Riesz representation theorem (symmetric case) or Lax-Milgram (general case). The Gelfand triple identifies where the solution, the data, and the bilinear form live.
From classical to weak solutions¶
Fundamental solutions give explicit formulas, but only for constant-coefficient operators on all of . For bounded domains with boundary conditions, or for variable-coefficient operators, we need a different approach. Sobolev spaces provide it: instead of finding a formula, we prove existence abstractly using duality.
The idea is to rewrite the PDE as a problem about functionals on a Hilbert space. Consider on a bounded domain with on . The boundary condition is encoded by working in (the closure of in ). Multiply the PDE by a test function and integrate by parts:
The boundary terms vanish because . This is the weak formulation: find satisfying the identity above.
Existence by the Riesz representation theorem¶
The PDE is an equation in the dual space. The weak formulation is not an equation between functions in ; it is an equation between elements of . Read both sides of as functionals of the test direction :
The left-hand functional is continuous on by Cauchy-Schwarz (), so is a bounded operator. The right-hand functional is continuous on by Hölder plus Poincaré (for , or more generally for by definition). Solving the PDE means solving the linear equation in .
This is the shift that distributions made possible. Classically is an equation between functions at each point ; weakly, it is a single equation between two bounded linear functionals on . The equality is tested by pairing against every , not by comparing pointwise values.
The left-hand side is simultaneously an inner product on , (using the Poincaré inequality to show is a norm equivalent to on ). That is what makes Riesz applicable: the operator is the Riesz isomorphism associated with this inner product.
By the Riesz representation theorem (Theorem 1), there exists a unique such that
This is the weak solution. Existence and uniqueness are immediate corollaries of a theorem we already proved in the duality chapter.
The Riesz map , , is the inverse Laplacian. It gains two derivatives: the input lives in (a distribution), but the output lives in (a function with a derivative in ).
This two-derivative gain is not a coincidence. If , the pairing is defined abstractly. But the Riesz map produces , and now
The singular object has been “absorbed” into . The regularity of compensates exactly for the singularity of .
The Lax-Milgram theorem¶
The Riesz argument above works because the left-hand side is an inner product on . For more general operators (with lower-order terms, non-symmetric coefficients, or convection) the bilinear form may not be symmetric, so it is not an inner product. The Lax-Milgram theorem handles this case.
Theorem 1 (Lax-Milgram)
Let be a real Hilbert space. Suppose is a bilinear form satisfying:
Continuity: for all ,
Coercivity: for all ,
for constants . Then for every , there exists a unique such that
and .
Proof 1
For each fixed , the map is a continuous linear functional on (by the continuity bound). By the Riesz representation theorem, there exists a unique such that
This defines a linear operator . Continuity of gives . Coercivity gives
so . This means is injective and has closed range. To show surjectivity: if , then , so . Hence is bijective.
Now represent via Riesz: there exists with . Set . Then for all . The bound follows from coercivity.
Remark 2 (Riesz as a special case)
When is the inner product itself, Lax-Milgram reduces to the Riesz representation theorem with .
Remark 3 (Lax-Milgram and the direct method)
The two hypotheses of Lax-Milgram play exactly the same roles as the two ingredients of the direct method for variational problems. Recall the direct method (Example 2): to minimize , take a minimizing sequence , use coercivity to force to be bounded, use Banach-Alaoglu (Theorem 1) to extract a weakly convergent subsequence , and use weak lower semicontinuity to conclude . Two ingredients: coercivity to get a weak limit, weak l.s.c. to pass to it.
When is symmetric, Lax-Milgram is the direct method applied to the energy
whose Euler-Lagrange equation is . The two Lax-Milgram hypotheses translate into the two direct-method ingredients as follows.
Coercivity of coercivity of . From ,
So minimizing sequences are bounded in . This is the same mechanism that produces the a priori estimate in the Lax-Milgram conclusion: coercivity controls the size of the solution.
Continuity of + symmetry + coercivity weak l.s.c. of . This step is subtler than it looks. Continuity of makes continuous in the norm topology (since ), but norm continuity alone does not imply weak l.s.c. — is norm-continuous yet fails to be weakly l.s.c. The bridge is convexity, supplied by Mazur's theorem:
A convex, norm-continuous functional on a Banach space is weakly lower semicontinuous.
Symmetry together with coercivity makes a positive-definite quadratic form, hence convex. Continuity then promotes convexity to weak l.s.c. via Mazur. So the honest reading of the Lax-Milgram hypotheses under symmetry is
The shorthand “continuity of weak l.s.c.” is harmless in the symmetric setting because convexity is already on the table for free.
Non-symmetric case. When is not symmetric there is no energy to minimize, and the direct method does not literally apply. The proof of Lax-Milgram instead routes through Riesz representation and the operator with . But the roles of the two hypotheses are preserved:
Coercivity gives the lower bound , which forces to be injective with closed range — the “compactness / a priori bound” step.
Continuity makes a bounded operator in the first place — the “pass to the limit” step.
So the dichotomy coercivity controls size, continuity controls limits survives even when there is no energy functional in sight.
The Gelfand triple¶
The natural framework for weak formulations is a triple of spaces
where is a Hilbert space (the “energy space”), is a pivot space, and is the dual. The embeddings are continuous and dense.
The prototypical example is
An element acts on test functions in via the duality pairing. The PDE is an equation in : we seek such that for all . The Gelfand triple identifies which space each player lives in:
The solution lives in the energy space .
The data lives in the dual .
The bilinear form connects them.
The pivot sits in between, providing the inner product that mediates between function values and functionals.
Example 1 (General second-order elliptic operator)
Consider the operator on a bounded domain with Dirichlet boundary conditions. The weak formulation is: find such that
for all .
Continuity of follows from , , , and Cauchy-Schwarz.
Coercivity requires uniform ellipticity () and suitable bounds on and (or add a large enough multiple of to absorb the lower-order terms via Poincaré).
By Lax-Milgram, there exists a unique weak solution .
What Sobolev spaces provide¶
Compared to the fundamental solution approach, the weak formulation gains:
Boundary conditions. The space encodes on . No need to modify the fundamental solution.
Variable coefficients. Replace by for a matrix-valued coefficient . As long as is uniformly elliptic, the same argument works (via Lax-Milgram instead of Riesz).
Regularity from the Sobolev scale. The solution lives in . If is more regular ( rather than ), elliptic regularity gives . The Sobolev scale tracks exactly how regular the solution is.
Compactness. The embedding is compact (Rellich-Kondrachov), so the inverse Laplacian is a compact operator. This gives eigenvalues, eigenfunctions, and the spectral theory from the linear operators chapter.
Two approaches, one solution¶
On a bounded domain , both approaches produce the same answer. The Green’s function is the fundamental solution modified to satisfy boundary conditions: with on . The convolution formula becomes
and this is also the Riesz representative in . The fundamental solution gives the explicit kernel; the Sobolev approach gives existence without needing to find it.
Nonlinear PDE via Compactness¶
The Lax-Milgram theorem handles linear equations. For nonlinear problems, we need a different strategy: approximate, extract limits, and pass to the limit in the nonlinear terms. The tools are Banach-Alaoglu (weak compactness), Rellich-Kondrachov (compact embedding), and Sobolev embedding (controlling the nonlinearity). We illustrate with a model problem that ties together the entire theory.
The model problem¶
Consider the semilinear elliptic equation
where () is bounded with Lipschitz boundary and . The cubic nonlinearity models a restoring force that grows with amplitude.
The weak formulation is: find such that
The nonlinear term is well-defined when because the Sobolev embedding (for ) gives and .
Galerkin approximation¶
Choose finite-dimensional subspaces with dense in (e.g., the span of the first eigenfunctions of ). The Galerkin problem is: find such that
This is a finite-dimensional system, so Brouwer’s fixed point theorem guarantees the existence of a Galerkin solution .
A priori bounds¶
Test with itself:
By the Poincaré inequality (Theorem 1), , so
Dividing: . The sequence is bounded in , uniformly in .
Extracting the limit¶
The bounded sequence in the reflexive space has a weakly convergent subsequence by the Banach-Alaoglu theorem (Theorem 1):
Weak convergence alone is not enough to pass to the limit in the nonlinear term . The map is continuous (in the strong topology), but continuous is not the same as weakly continuous: continuity preserves strong limits, and weak convergence is in general only preserved by linear bounded operators. Nonlinear maps can destroy weak limits through oscillation. The textbook example is squaring: on satisfies in , but
so is not weakly continuous. Cubing fails the same way: has but (the cross term is what leaks out). So knowing only that in is not enough to conclude in any sense; we need strong convergence.
The fix is to upgrade weak convergence to strong convergence in a space where the nonlinearity is continuous, and this is exactly what Rellich-Kondrachov provides: applying Proposition 5 with the compact embedding (compact) for (taking , so the Sobolev critical exponent is ) upgrades the weak- convergence of the Galerkin sequence, along a further subsequence, to:
In particular, in . This strong convergence gives:
(since by the algebraic identity ).
Passing to the limit¶
Fix a test direction (not just ) and read each side of the Galerkin equation as a value of a functional on . For every , the maps
are continuous linear functionals on , i.e. elements of . The linear piece is continuous by Cauchy-Schwarz; the nonlinear piece is continuous because implies (Sobolev embedding dualized). So the Galerkin identity is the statement in , but restricted to test directions , i.e. tested only against a dense subset.
We want to upgrade this to in for a single limit object . Both pieces of converge, pointwise in , to the corresponding pieces built from :
Linear term. in is exactly the statement for every (weak convergence is pointwise convergence of the Riesz functionals).
Nonlinear term. Rellich upgraded weak- to strong- convergence, hence in . This upgrades to convergence in via the continuous embedding : for any , Hölder and Sobolev give
so defines a bounded linear functional on with . This is exactly the dual of Sobolev . Pairing with any fixed then gives .
So for every , where . For each (any ) the identity holds eventually, so passing yields on the dense set . Continuity of and on extends this to all . The Galerkin equation has upgraded to the identity in , which is the weak form we set out to solve.
Linear vs. nonlinear: why compactness was needed¶
Compare with the purely linear problem from the start of the section. There, existence followed from Riesz representation alone: weak convergence in was already enough to pass to the limit, because the only operation applied to was the continuous linear functional . Linear bounded operators preserve weak convergence by definition, so weak subsequential limits suffice and Rellich is never needed.
Nonlinearity changes this. The map is continuous but not weakly continuous, so weak convergence of gives no information about . The fix is to upgrade weak convergence to strong convergence in a space where the nonlinearity is continuous, and this upgrade is precisely what a compact embedding provides. This is the general shape of the direct method: for linear problems weak compactness (Banach-Alaoglu, Riesz) is enough; for nonlinear problems weak compactness must be combined with a compact embedding (Rellich-Kondrachov) to tame the nonlinear term.
Why multiplication is the hard operation. The obstruction is structural, and the Banach-algebra chapter already told us what it is: multiplication is a frequency-convolution operation (), so squaring or cubing a function sums its frequencies. Two modes at frequencies and produce a mode at frequency . High-frequency content in therefore breeds even higher frequencies in , so can be strictly worse behaved than , and may leave the Sobolev space that lived in. That is why Sobolev spaces fail to be Banach algebras below the threshold (Theorem 1): squaring moves you out of the space, because the squared spectrum escapes the regularity bound. The compactness trick sidesteps the problem by reducing the question to strong convergence in a concrete , where multiplication is continuous (Hölder), rather than working in the where multiplication might blow up the regularity budget.