Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Sobolev Spaces

Big Idea

LpL^p norms measure height and width but are blind to oscillation. A function can oscillate faster and faster while its LpL^p norm stays constant. Sobolev norms add a third dimension, frequency, by penalizing derivatives. The Sobolev space Wk,p(Ω)W^{k,p}(\Omega) consists of functions whose weak derivatives up to order kk are in LpL^p: it sees not just how big a function is, but how fast it oscillates.

From weak derivatives to function spaces

In the previous sections we defined the weak derivative of a locally integrable function: v=Dαuv = D^\alpha u weakly if

ΩuDαφdx=(1)αΩvφdxfor all φCc(Ω).\int_\Omega u \, D^\alpha \varphi \, dx = (-1)^{|\alpha|} \int_\Omega v \, \varphi \, dx \quad \text{for all } \varphi \in C_c^\infty(\Omega).

This makes sense for any uLloc1(Ω)u \in L^1_{\mathrm{loc}}(\Omega), but the derivative vv might not be in LpL^p, or might not exist as an LpL^p function at all. Sobolev spaces are defined by requiring that weak derivatives exist and have controlled size.

Definition 1 (Sobolev space Wk,p(Ω)W^{k,p}(\Omega))

Let ΩRd\Omega \subseteq \mathbb{R}^d be open, k0k \geq 0 an integer, and 1p1 \leq p \leq \infty. The Sobolev space Wk,p(Ω)W^{k,p}(\Omega) consists of all functions uLp(Ω)u \in L^p(\Omega) whose weak derivatives DαuD^\alpha u exist and belong to Lp(Ω)L^p(\Omega) for all multi-indices αk|\alpha| \leq k. The Sobolev norm is, for 1p<1 \leq p < \infty,

uWk,p=(αkDαuLp(Ω)p)1/p,\|u\|_{W^{k,p}} = \left( \sum_{|\alpha| \leq k} \|D^\alpha u\|_{L^p(\Omega)}^p \right)^{1/p},

and for p=p = \infty,

uWk,=maxαkDαuL(Ω).\|u\|_{W^{k,\infty}} = \max_{|\alpha| \leq k} \|D^\alpha u\|_{L^\infty(\Omega)}.

Proposition 1 (Wk,p(Ω)W^{k,p}(\Omega) is a Banach space)

For 1p1 \leq p \leq \infty and k0k \geq 0, the space (Wk,p(Ω),Wk,p)(W^{k,p}(\Omega), \|\cdot\|_{W^{k,p}}) is a Banach space.

Proof 1

Let (un)(u_n) be Cauchy in Wk,pW^{k,p}. Then for each αk|\alpha| \leq k, the sequence (Dαun)(D^\alpha u_n) is Cauchy in Lp(Ω)L^p(\Omega). Since LpL^p is complete, DαunvαD^\alpha u_n \to v_\alpha in LpL^p for some vαv_\alpha. In particular, unv0=:uu_n \to v_0 =: u in LpL^p.

We verify that vα=Dαuv_\alpha = D^\alpha u weakly: for any φCc(Ω)\varphi \in C_c^\infty(\Omega),

Ωvαφdx=limnΩDαunφdx=limn(1)αΩunDαφdx=(1)αΩuDαφdx.\int_\Omega v_\alpha \, \varphi \, dx = \lim_n \int_\Omega D^\alpha u_n \, \varphi \, dx = \lim_n (-1)^{|\alpha|} \int_\Omega u_n \, D^\alpha \varphi \, dx = (-1)^{|\alpha|} \int_\Omega u \, D^\alpha \varphi \, dx.

So uWk,pu \in W^{k,p} and unuu_n \to u in Wk,pW^{k,p}.

The completeness proof is simple because it reduces to the completeness of LpL^p: each derivative converges separately, and the weak derivative structure is preserved under LpL^p limits. This is a common pattern: Sobolev spaces inherit their analytical properties from LpL^p.

Definition 2 (The Hilbert-Sobolev spaces Hk(Ω)H^k(\Omega))

For p=2p = 2, the Sobolev space Wk,2(Ω)W^{k,2}(\Omega) is denoted Hk(Ω)H^k(\Omega). It is a Hilbert space with inner product

u,vHk=αkΩDαuDαvdx.\langle u, v \rangle_{H^k} = \sum_{|\alpha| \leq k} \int_\Omega D^\alpha u \cdot D^\alpha v \, dx.

The notation HkH^k reflects the Hilbert space structure. For PDE applications, H1(Ω)H^1(\Omega) is the most important space: it controls the function and its first derivatives in L2L^2.

What does the Sobolev norm measure?

In the introduction (Remark 1), we asked “what do LpL^p norms measure?” and found that they capture two features of a function:

But these two features miss something fundamental. Consider the sequence un(t)=sin(nπt)u_n(t) = \sin(n\pi t) on [0,1][0,1]:

unL22=01sin2(nπt)dt=12for all n.\|u_n\|_{L^2}^2 = \int_0^1 \sin^2(n\pi t) \, dt = \frac{1}{2} \quad \text{for all } n.

Every function in this sequence has the same height, the same support, and the same L2L^2 norm, yet they are clearly different: u100u_{100} oscillates 100 times faster than u1u_1. The L2L^2 norm is completely blind to this.

This is the same sequence that appeared in the weak convergence chapter (Example 2): sin(nπt)0\sin(n\pi t) \rightharpoonup 0 weakly in L2L^2, but sin(nπt)L2=1/2\|\sin(n\pi t)\|_{L^2} = 1/\sqrt{2} for all nn. The L2L^2 norm cannot distinguish rapid oscillation from no oscillation at all.

Source
<Figure size 1500x450 with 3 Axes>

Left: the functions sin(nπt)\sin(n\pi t) oscillate faster but have constant L2L^2 norm. Center: the L2L^2 norm (blue) stays flat while the H1H^1 norm (red) grows linearly in nn, showing that the derivative detects oscillation. Right: the Fourier weights (1+n2)s(1 + n^2)^s that define HsH^s; higher ss penalizes high frequencies more aggressively.

The third feature: frequency

The Sobolev norm adds the missing dimension. Computing the derivative:

un(t)=nπcos(nπt),unL22=n2π201cos2(nπt)dt=n2π22.u_n'(t) = n\pi \cos(n\pi t), \qquad \|u_n'\|_{L^2}^2 = n^2\pi^2 \int_0^1 \cos^2(n\pi t) \, dt = \frac{n^2\pi^2}{2}.

So the H1H^1 norm sees the oscillation:

unH12=unL22+unL22=12+n2π22.\|u_n\|_{H^1}^2 = \|u_n\|_{L^2}^2 + \|u_n'\|_{L^2}^2 = \frac{1}{2} + \frac{n^2\pi^2}{2} \to \infty.

The derivative term unL2\|u_n'\|_{L^2} blows up like nπn\pi: differentiation converts oscillation frequency into amplitude. The H1H^1 norm detects the frequency because derivatives penalize oscillation.

The bump function scaling heuristic

Following Tao, we can make this precise with a model function that independently controls all three features. Fix a bump function ϕCc(R)\phi \in C_c^\infty(\mathbb{R}) with supp(ϕ)[1,1]\operatorname{supp}(\phi) \subset [-1, 1], and consider

fA,R,N(x)=Aϕ(x/R)sin(Nx)f_{A, R, N}(x) = A \, \phi(x/R) \, \sin(Nx)

where:

The LpL^p norm scales as (by the change of variables y=x/Ry = x/R):

fA,R,NLpp=Aϕ(x/R)sin(Nx)pdxApRϕsinpp\|f_{A,R,N}\|_{L^p}^p = \int |A \, \phi(x/R) \, \sin(Nx)|^p \, dx \approx A^p \, R \, \|\phi \cdot \sin\|_p^p

so fA,R,NLpAR1/p\|f_{A,R,N}\|_{L^p} \approx A \, R^{1/p}. The frequency NN does not appear: the LpL^p norm is blind to oscillation.

The Ws,pW^{s,p} norm behaves differently. Each derivative of sin(Nx)\sin(Nx) pulls down a factor of NN:

Ds[Aϕ(x/R)sin(Nx)]ANsϕ(x/R)sin(Nx+sπ/2)+lower-order termsD^s [A\,\phi(x/R)\,\sin(Nx)] \approx A \, N^s \, \phi(x/R) \, \sin(Nx + s\pi/2) + \text{lower-order terms}

(the lower-order terms involve derivatives of ϕ\phi, which contribute factors of R1R^{-1} instead of NN, and are negligible when N1/RN \gg 1/R). Therefore:

fA,R,NWs,pAR1/pNs.\|f_{A,R,N}\|_{W^{s,p}} \approx A \, R^{1/p} \, N^s.

The three parameters are now visible:

NormScalingWhat it sees
fLp|f|_{L^p}AR1/pA \, R^{1/p}Height ×\times Width
fWs,p|f|_{W^{s,p}}AR1/pNsA \, R^{1/p} \, N^sHeight ×\times Width ×\times Frequencys^s

The extra factor NsN^s is the whole point: the Sobolev norm penalizes high-frequency oscillation, with the penalty growing as NsN^s. Higher Sobolev index ss means stronger frequency penalty.

Remark 1 (The three dimensions of function behavior)

A function’s “size” in the broad sense has three independent components:

  1. Amplitude AA: how tall is the function? (Controlled by LL^\infty.)

  2. Spatial extent RR: how wide is the support? (Controlled by LpL^p for p<p < \infty.)

  3. Frequency scale NN: how fast does it oscillate? (Controlled by Ws,pW^{s,p}.)

LpL^p norms see dimensions 1 and 2. Sobolev norms see all three. This is why Sobolev spaces are the natural setting for PDE: differential equations involve derivatives, and the Sobolev norm is the norm that controls derivatives.

Regularity as frequency decay

The bump function heuristic shows that the Sobolev norm penalizes high frequency. But how many degrees of regularity does a function have? To make this precise, consider the one-parameter family

Gs,N(x)=Nsϕ(x)sin(Nx),s>0,N1,G_{s,N}(x) = N^{-s}\,\phi(x)\,\sin(Nx), \qquad s > 0, \quad N \gg 1,

where ϕCc(R)\phi \in C_c^\infty(\mathbb{R}) is a fixed bump. The prefactor NsN^{-s} is chosen so that Gs,NWs,p\|G_{s,N}\|_{W^{s,p}} remains bounded as NN \to \infty (from the scaling fA,R,NWs,pAR1/pNs\|f_{A,R,N}\|_{W^{s,p}} \approx A R^{1/p} N^s with A=NsA = N^{-s} and R=1R = 1).

Computing derivatives (keeping only the dominant term when N1N \gg 1):

Gs,N(k)(x)Nksϕ(x)sin(Nx+kπ/2)+lower-order terms.G_{s,N}^{(k)}(x) \approx N^{k-s}\,\phi(x)\,\sin(Nx + k\pi/2) + \text{lower-order terms}.

The behavior splits sharply at k=sk = s:

Derivative order kkGs,N(k)Lp|G_{s,N}^{(k)}|_{L^p} as NN \to \inftyInterpretation
k<sk < s0\to 0 (decays as NksN^{k-s})Well within regularity budget
k=sk = sO(1)O(1) (bounded)Exactly at the regularity threshold
k>sk > s\to \infty (grows as NksN^{k-s})Beyond the regularity budget

The following plot shows Gs,NG_{s,N} for s=2s = 2 and increasing NN. The function itself shrinks (because of the NsN^{-s} prefactor), but its second derivative stays O(1)O(1) and its third derivative grows.

Source
<Figure size 1200x350 with 3 Axes>

Remark 2 (Sobolev regularity is sharp)

The family Gs,NG_{s,N} demonstrates that ss is a sharp regularity threshold: the function has exactly ss derivatives in LpL^p and no more. This is the core idea of the Sobolev scale: the index ss counts exactly how many derivatives a function can afford.

The mechanism is frequency: each derivative pulls down a factor of NN, and the prefactor NsN^{-s} budgets exactly ss such factors before the LpL^p norm diverges.

The Fourier picture

For Hs=Ws,2H^s = W^{s,2} on the torus [0,2π][0, 2\pi] (or Rd\mathbb{R}^d via the Fourier transform), the connection to frequency is exact. If u(x)=nu^neinxu(x) = \sum_n \hat{u}_n \, e^{inx}, then DsuD^s u has Fourier coefficients (in)su^n(in)^s \hat{u}_n, and Parseval gives:

uHs2=n(1+n2)su^n2.\|u\|_{H^s}^2 = \sum_n (1 + |n|^2)^s |\hat{u}_n|^2.

The weight (1+n2)s(1 + |n|^2)^s penalizes high frequencies n|n| by a factor of n2s|n|^{2s}. This makes the frequency interpretation precise:

A function is in HsH^s if and only if its Fourier coefficients decay fast enough to compensate for the weight (1+n2)s(1 + |n|^2)^s. Smoother functions have faster-decaying Fourier coefficients, so they live in higher Sobolev spaces.

Examples: what is and isn’t in W1,2W^{1,2}

Example 1 (Functions in and out of H1H^1)

In H1(0,1)H^1(0,1):

  • Any C1C^1 function on [0,1][0,1]: classical derivatives are in L2L^2.

  • f(x)=x1/2f(x) = |x - 1/2|: the weak derivative is f(x)=sgn(x1/2)f'(x) = \operatorname{sgn}(x - 1/2), which is in L2L^2 (in fact in LL^\infty). The kink at x=1/2x = 1/2 is invisible to the H1H^1 norm.

Not in H1(0,1)H^1(0,1):

  • The Heaviside function H(x)=1[1/2,1](x)H(x) = \mathbf{1}_{[1/2, 1]}(x): its weak derivative is δ1/2\delta_{1/2}, a measure and not an L2L^2 function. So HL2H \in L^2 but HH1H \notin H^1. The jump discontinuity is detected by the Sobolev norm.

  • f(x)=x1/4f(x) = x^{-1/4} on (0,1)(0,1): fL2(0,1)f \in L^2(0,1) (barely), but f(x)=14x5/4L2(0,1)f'(x) = -\frac{1}{4}x^{-5/4} \notin L^2(0,1). The singularity at 0 is too strong for one derivative in L2L^2.

The pattern: H1H^1 tolerates kinks (Lipschitz singularities) but not jumps or blowup. This is because the weak derivative of a Lipschitz function is bounded (hence in every LpL^p), while the derivative of a discontinuous function is a measure.