Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Dual Spaces: From Self-Dual to Non-Reflexive

The behavior of the dual space XX^* varies dramatically across different Banach spaces. There are three main classes: self-dual (Hilbert), reflexive, and non-reflexive. The geometry of the unit ball is what drives the distinction.

Hilbert Spaces: HHH^* \cong H (Self-Dual)

In a Hilbert space, every continuous linear functional comes from the inner product.

Theorem 1 (Riesz Representation Theorem)

Let HH be a Hilbert space. For every fHf \in H^*, there exists a unique xfHx_f \in H such that

f(y)=xf,yfor all yHf(y) = \langle x_f, y \rangle \quad \text{for all } y \in H

and f=xf\|f\| = \|x_f\|. The map fxff \mapsto x_f is an isometric (conjugate-linear) isomorphism HHH^* \cong H.

Proof 1

Existence. If f=0f = 0, take xf=0x_f = 0. Otherwise ker(f)\ker(f) is a proper closed subspace of HH. By the orthogonal decomposition, H=ker(f)ker(f)H = \ker(f) \oplus \ker(f)^\perp, and since ker(f)H\ker(f) \neq H, the orthogonal complement ker(f)\ker(f)^\perp is one-dimensional (because ker(f)\ker(f) has codimension 1). Let zz be the unit vector in ker(f)\ker(f)^\perp, so z=1\|z\| = 1.

Now decompose any yHy \in H using this splitting: write y=y0+αzy = y_0 + \alpha z where y0ker(f)y_0 \in \ker(f) and αR\alpha \in \mathbb{R}. Apply ff to both sides: f(y)=f(y0)+αf(z)=αf(z)f(y) = f(y_0) + \alpha f(z) = \alpha f(z), so α=f(y)/f(z)\alpha = f(y) / f(z).

Set xf=f(z)zx_f = \overline{f(z)} \, z. Then since xfker(f)x_f \in \ker(f)^\perp, we have xf,y0=0\langle x_f, y_0 \rangle = 0, and:

xf,y=xf,y0+αz=αxf,z=f(y)f(z)f(z)=f(y).\langle x_f, y \rangle = \langle x_f, y_0 + \alpha z \rangle = \alpha \langle x_f, z \rangle = \frac{f(y)}{f(z)} \cdot \overline{f(z)} = f(y).

Uniqueness. If x1,y=x2,y\langle x_1, y \rangle = \langle x_2, y \rangle for all yy, then x1x2,y=0\langle x_1 - x_2, y \rangle = 0 for all yy. Taking y=x1x2y = x_1 - x_2 gives x1=x2x_1 = x_2.

Isometry. f=supy1xf,y=xf\|f\| = \sup_{\|y\| \leq 1} |\langle x_f, y \rangle| = \|x_f\| by Cauchy-Schwarz, with equality at y=xf/xfy = x_f / \|x_f\|.

The R2\mathbb{R}^2 picture

In R2\mathbb{R}^2 with the Euclidean norm, every functional f(x,y)=ax+byf(x, y) = ax + by is represented by the vector (a,b)(a, b). The kernel ker(f)\ker(f) is the line perpendicular to (a,b)(a, b), and the level sets are parallel lines spaced 1/(a,b)1/\|(a,b)\| apart. The Riesz representative is the normal vector to the foliation.

Source
<Figure size 1100x450 with 2 Axes>

Left: in (R2,2)(\mathbb{R}^2, \|\cdot\|_2), the functional f(x,y)=35x+45yf(x,y) = \tfrac{3}{5}x + \tfrac{4}{5}y is represented by the vector xf=(3/5,4/5)x_f = (3/5, 4/5), which is perpendicular to ker(f)\ker(f). Right: given ff on M=span(1,1)M = \mathrm{span}(1,1) with f(1,1)=1f(1,1) = 1, the unique norm-preserving extension is F(x,y)=(x+y)/2F(x,y) = (x+y)/2. The round ball forces a single choice of kernel, unlike the \ell^\infty worked example in the previous section, where the square ball allowed a whole family of extensions.

Example 1 (Rn\mathbb{R}^n and 2\ell^2)

In Rn\mathbb{R}^n with the standard inner product, every functional is f(y)=ayf(y) = a \cdot y for some aRna \in \mathbb{R}^n. Thus (Rn)Rn(\mathbb{R}^n)^* \cong \mathbb{R}^n.

More generally, in 2={(xn):xn2<}\ell^2 = \{(x_n) : \sum |x_n|^2 < \infty\}, every functional is f(x)=anxnf(x) = \sum a_n x_n for a unique (an)2(a_n) \in \ell^2, and f=(an)2\|f\| = \|(a_n)\|_{\ell^2}.

Example 2 (L2(Ω)L^2(\Omega))

In L2(Ω)L^2(\Omega) every functional is given by

f[x]=Ωx(t)g(t)dtfor a unique gL2(Ω).f[x] = \int_\Omega x(t) \, g(t) \, \mathrm{d}t \quad \text{for a unique } g \in L^2(\Omega).

This is the Riesz Representation Theorem applied to L2L^2. The representing function gg plays the role of the “normal vector” to ker(f)\ker(f).

The key property of Hilbert spaces: the inner product provides a canonical identification between HH and HH^*. Duality is not an abstract theorem but a concrete computation. Every closed hyperplane has a unique normal vector, and extending a functional from a subspace amounts to orthogonal projection.

Reflexive Spaces: XXX^{**} \cong X

Definition 1 (Reflexive space)

A Banach space XX is reflexive if the canonical embedding J:XXJ : X \to X^{**} defined by

J[x](f)=f(x)for all fXJ[x](f) = f(x) \quad \text{for all } f \in X^*

is surjective (and hence an isometric isomorphism onto XX^{**}).

Lemma 1 (The canonical embedding is an isometry)

For any normed space XX, the canonical embedding J:XXJ : X \to X^{**} is an isometry: J[x]X=xX\|J[x]\|_{X^{**}} = \|x\|_X for all xXx \in X.

Proof 2

By definition, J(x)X=supf1J[x](f)=supf1f(x)\|J(x)\|_{X^{**}} = \sup_{\|f\| \leq 1} |J[x](f)| = \sup_{\|f\| \leq 1} |f(x)|. The sup formula gives supf1f(x)=x\sup_{\|f\| \leq 1} |f(x)| = \|x\|.

So XX always embeds isometrically into XX^{**}. Reflexivity asks: does every element of XX^{**} come from a point in XX? That is, does JJ miss anything?

All Hilbert spaces are reflexive (since HHH^* \cong H implies HHHH^{**} \cong H^* \cong H). But reflexivity is strictly weaker than self-duality.

The LpL^p family

Example 3 (LpL^p spaces for 1<p<1 < p < \infty)

The spaces Lp(Ω)L^p(\Omega) for 1<p<1 < p < \infty are reflexive. The dual is (Lp)Lq(L^p)^* \cong L^q where 1p+1q=1\frac{1}{p} + \frac{1}{q} = 1.

For gLq(Ω)g \in L^q(\Omega), the functional

Lg(f)=Ωf(x)g(x)dxL_g(f) = \int_\Omega f(x)\,g(x)\,dx

satisfies Lg=gq\|L_g\| = \|g\|_q by Hölder’s inequality. The map gLgg \mapsto L_g is an isometric isomorphism.

Proof 3

Step 1: LgL_g is bounded and Lggq\|L_g\| \leq \|g\|_q. For any fLpf \in L^p, Hölder’s inequality gives

Lg(f)=Ωfgfpgq,|L_g(f)| = \left|\int_\Omega f\,g\right| \leq \|f\|_p \, \|g\|_q,

so Lg(Lp)L_g \in (L^p)^* with Lggq\|L_g\| \leq \|g\|_q.

Step 2: Lg=gq\|L_g\| = \|g\|_q (the bound is sharp). Assume g0g \neq 0 and define the test function

f0(x)=g(x)q1sgn(g(x))gqq/p.f_0(x) = \frac{|g(x)|^{q-1} \operatorname{sgn}(g(x))}{\|g\|_q^{q/p}}.

Then f0p=g(q1)p/gqq=gq/gqq|f_0|^p = |g|^{(q-1)p} / \|g\|_q^q = |g|^q / \|g\|_q^q, so f0p=1\|f_0\|_p = 1. Evaluating:

Lg(f0)=Ωf0g=1gqq/pΩgq=gqqgqq/p=gq,L_g(f_0) = \int_\Omega f_0 \, g = \frac{1}{\|g\|_q^{q/p}} \int_\Omega |g|^q = \frac{\|g\|_q^q}{\|g\|_q^{q/p}} = \|g\|_q,

where we used (q1)+1=q(q-1) + 1 = q and qq/p=q(11/p)=1q - q/p = q(1 - 1/p) = 1. So Lggq\|L_g\| \geq \|g\|_q, and combined with Step 1, Lg=gq\|L_g\| = \|g\|_q. In particular gLgg \mapsto L_g is an isometry.

Step 3: Surjectivity. Let ϕ(Lp)\phi \in (L^p)^*. Define a set function ν\nu on measurable sets by ν(E)=ϕ(1E)\nu(E) = \phi(\mathbf{1}_E). Since ν(E)=ϕ(1E)ϕ1Ep=ϕμ(E)1/p|\nu(E)| = |\phi(\mathbf{1}_E)| \leq \|\phi\| \, \|\mathbf{1}_E\|_p = \|\phi\| \, \mu(E)^{1/p}, we have ν(E)0\nu(E) \to 0 whenever μ(E)0\mu(E) \to 0, so ν\nu is absolutely continuous with respect to μ\mu. By the Radon-Nikodym theorem, there exists gL1(Ω)g \in L^1(\Omega) with ν(E)=Egdμ\nu(E) = \int_E g \, d\mu for all measurable EE. This gives ϕ(s)=Ωsg\phi(s) = \int_\Omega s \, g for all simple functions ss. Since simple functions are dense in LpL^p and ϕ\phi is continuous, ϕ(f)=Ωfg\phi(f) = \int_\Omega f \, g for all fLpf \in L^p, i.e., ϕ=Lg\phi = L_g. The isometry from Step 2 forces gLqg \in L^q with gq=ϕ\|g\|_q = \|\phi\|.

The duality pairing (Lp)=Lq(L^p)^* = L^q and (Lq)=Lp(L^q)^* = L^p closes the loop: X=(Lq)=Lp=XX^{**} = (L^q)^* = L^p = X.

The geometry of reflexive spaces

What makes reflexive spaces special geometrically? The answer involves the shape of the unit ball.

Definition 2 (Uniform convexity)

A Banach space XX is uniformly convex if for every ε>0\varepsilon > 0 there exists δ>0\delta > 0 such that

x=y=1,xyε    x+y21δ.\|x\| = \|y\| = 1, \quad \|x - y\| \geq \varepsilon \implies \left\|\frac{x+y}{2}\right\| \leq 1 - \delta.

In words: if two points on the unit sphere are separated, their midpoint is strictly inside the ball. The boundary has no flat spots.

Source
<Figure size 1400x450 with 3 Axes>

p\ell^p unit balls for three values of p>1p > 1. In each case, the midpoint of two boundary points (red dot) lies strictly inside the ball. This is uniform convexity: the boundary curves inward, with no flat edges. Contrast with the 1\ell^1 diamond and \ell^\infty square, which have flat sides where the midpoint stays on the boundary.

From geometry to reflexivity

Recall what reflexivity asks: is every ξX\xi \in X^{**} of the form ξ=J(x)\xi = J(x) for some xXx \in X? An element ξX\xi \in X^{**} is a linear functional on XX^*, so it assigns a number ξ(f)\xi(f) to every fXf \in X^*. If ξ=J(x)\xi = J(x), then ξ(f)=f(x)\xi(f) = f(x), meaning the readings come from evaluating at a point. Non-reflexivity means there exist “phantom” evaluation functionals in XX^{**} that do not come from any point in XX.

The shape of the unit ball controls this through the uniqueness of norm-attaining directions:

A concrete example: in c0c_0 (sequences converging to zero, sup norm), the canonical embedding J(c0)=c0J(c_0) \subset \ell^\infty = c_0^{**}. The constant sequence (1,1,1,)(1,1,1,\ldots) \in \ell^\infty defines a valid element of c0c_0^{**}, but it is not in c0c_0 since it does not converge to zero. The flat geometry of the \ell^\infty ball accommodates this extra element.

Theorem 2 (Milman-Pettis)

Every uniformly convex Banach space is reflexive.

Proof 4

Idea. We must show every ξX\xi \in X^{**} equals J(x)J(x) for some xXx \in X. The key step is: if two points x,yBXx, y \in B_X produce nearly the same readings on all functionals (i.e., f(x)f(y)|f(x) - f(y)| is small for all fBXf \in B_{X^*}), then xx and yy must be close in norm.

Suppose not: xyε\|x - y\| \geq \varepsilon but the readings are close. By uniform convexity, (x+y)/21δ\|(x+y)/2\| \leq 1 - \delta. But the readings of the midpoint are the averages of the readings of xx and yy, which are both close to ξ(f)\xi(f). Choosing ff that nearly norms (x+y)/2(x+y)/2 gives a contradiction.

This means: if a sequence of “approximate representatives” xnBXx_n \in B_X satisfies f(xn)ξ(f)f(x_n) \to \xi(f) for all ff, then (xn)(x_n) is Cauchy in norm. Since XX is complete, xnxx_n \to x for some xXx \in X, and ξ=J(x)\xi = J(x).

The LpL^p spaces for 1<p<1 < p < \infty are uniformly convex (Clarkson’s inequalities), so Milman-Pettis gives reflexivity as a consequence of their round unit balls.

Reflexivity is really about the duality structure (does JJ hit all of XX^{**}?), not about any single geometric property. Uniform convexity is one sufficient condition, but spaces can be reflexive for other reasons, for instance because their dual pairing closes the loop as with (Lp)=Lq(L^p)^* = L^q and (Lq)=Lp(L^q)^* = L^p.

Non-Reflexive Spaces: XXX \subsetneq X^{**}

When XX is not reflexive, the canonical embedding J:XXJ : X \hookrightarrow X^{**} is a strict inclusion. The bidual XX^{**} contains “phantom” elements that do not correspond to any point in XX. Geometrically, the unit ball has flat spots or corners that create room for extra functionals.

L1L^1 and LL^\infty

Example 4 (L1L^1 and LL^\infty)

(L1(Ω))L(Ω)(L^1(\Omega))^* \cong L^\infty(\Omega). The pairing is

Lg(f)=Ωf(x)g(x)dx,gL.L_g(f) = \int_\Omega f(x)\,g(x)\,dx, \quad g \in L^\infty.

But the duality chain does not close: (L)L1(L^\infty)^* \supsetneq L^1. The dual of LL^\infty consists of finitely additive measures, which include singular objects that cannot be represented by L1L^1 functions.

A finitely additive measure is a set function μ:AR\mu : \mathcal{A} \to \mathbb{R} satisfying μ(AB)=μ(A)+μ(B)\mu(A \cup B) = \mu(A) + \mu(B) for disjoint A,BA, B, but not necessarily for countable unions. Every countably additive (i.e., ordinary) measure is finitely additive, but not conversely. The extra elements of (L)(L^\infty)^* come from finitely additive measures that are not countably additive, such as Banach limits (which assign a “limit” to every bounded sequence, extending the usual limit).

The L1L^1 unit ball is not uniformly convex. In R2\mathbb{R}^2, the 1\ell^1 ball (diamond) has flat edges connecting (1,0)(1,0) to (0,1)(0,1), where midpoints of boundary points remain on the boundary. This flat geometry is what allows non-reflexivity.

Sequence spaces: c0c_0 and 1\ell^1

Example 5 (The c01c_0 \to \ell^1 \to \ell^\infty chain)

c0={(xn):xn0}c_0 = \{(x_n) : x_n \to 0\} with the sup norm. The duality chain is:

c01,(1),()1.c_0^* \cong \ell^1, \qquad (\ell^1)^* \cong \ell^\infty, \qquad (\ell^\infty)^* \supsetneq \ell^1.

Neither c0c_0 nor 1\ell^1 is reflexive. The embedding J:c0=c0J : c_0 \hookrightarrow \ell^\infty = c_0^{**} misses sequences that do not converge to zero: for instance, the constant sequence (1,1,1,)(1, 1, 1, \ldots) \in \ell^\infty is in c0c_0^{**} but not in J(c0)J(c_0).

Source
<Figure size 1400x450 with 3 Axes>

Reflexive vs. non-reflexive geometry. In 2\ell^2 (left), the midpoint of two boundary points is strictly inside the ball: uniform convexity. In 1\ell^1 (center) and \ell^\infty (right), flat edges allow midpoints to remain on the boundary. This failure of uniform convexity is the geometric signature of non-reflexivity.

C(K)C(K) and Radon measures

This is one of the most important dualities in analysis.

Definition 3 (Radon measure)

Let KK be a compact Hausdorff space. A Radon measure on KK is a Borel measure μ\mu that is:

  • Finite: μ(K)<|\mu|(K) < \infty.

  • Inner regular: μ(E)=sup{μ(C):CE,  C compact}|\mu|(E) = \sup\{|\mu|(C) : C \subseteq E, \; C \text{ compact}\} for every Borel set EE.

  • Outer regular: μ(E)=inf{μ(U):UE,  U open}|\mu|(E) = \inf\{|\mu|(U) : U \supseteq E, \; U \text{ open}\} for every Borel set EE.

A signed Radon measure is a difference μ=μ+μ\mu = \mu^+ - \mu^- of two (positive) Radon measures with μ(K)=μ+(K)+μ(K)<|\mu|(K) = \mu^+(K) + \mu^-(K) < \infty. The space M(K)\mathcal{M}(K) of all finite signed Radon measures on KK is a Banach space under the total variation norm μ=μ(K)\|\mu\| = |\mu|(K).

Examples include absolutely continuous measures (with L1L^1 densities), point masses δx\delta_x (where δx(f)=f(x)\delta_x(f) = f(x)), and singular measures such as the Cantor measure.

Theorem 3 (Riesz-Markov-Kakutani Representation Theorem)

Let KK be a compact Hausdorff space. Then every bounded linear functional φC(K)\varphi \in C(K)^* is represented by a unique finite signed Radon measure μ\mu on KK:

φ(f)=Kfdμfor all fC(K)\varphi(f) = \int_K f \, d\mu \quad \text{for all } f \in C(K)

and φ=μ(K)\|\varphi\| = |\mu|(K) (total variation). That is, C(K)M(K)C(K)^* \cong \mathcal{M}(K), the space of finite signed Radon measures on KK.

Proof 5

The proof constructs the measure μ\mu from the functional φ\varphi in several steps.

Positive case. If φ0\varphi \geq 0 (i.e., f0φ(f)0f \geq 0 \Rightarrow \varphi(f) \geq 0), define for each open UKU \subseteq K:

μ(U)=sup{φ(f):0f1,  supp(f)U}\mu(U) = \sup\{\varphi(f) : 0 \leq f \leq 1, \; \mathrm{supp}(f) \subseteq U\}

This defines an outer-regular Borel measure. Urysohn’s lemma (which requires KK Hausdorff and compact) provides the continuous functions needed to approximate indicator functions, and the Riesz representation follows from monotone convergence arguments.

General case. Decompose φ=φ+φ\varphi = \varphi^+ - \varphi^- into positive and negative parts (the Jordan decomposition of functionals), apply the positive case to each, and set μ=μ+μ\mu = \mu^+ - \mu^-.

Example 6 (Key instances)

  • C([0,1])=M([0,1])C([0,1])^* = \mathcal{M}([0,1]): functionals on continuous functions are signed measures. This includes absolutely continuous measures (represented by L1L^1 densities), singular measures, and point masses δx\delta_x.

  • C(Rn)C(\mathbb{R}^n) (with compact support or vanishing at infinity): the dual captures probability distributions, empirical measures, and distributional objects.

The space C(K)C(K) is generally not reflexive. The dual M(K)\mathcal{M}(K) is much larger than C(K)C(K): it contains point masses δx\delta_x (which are not continuous functions) and singular measures. The bidual C(K)=M(K)C(K)^{**} = \mathcal{M}(K)^* is larger still.

This duality is central to probability theory (measures as linear functionals on observables), PDE (weak solutions via integration against test functions), and optimization (dual problems over measures). It also provides the setting for the weak* convergence of measures discussed in the next section.

Spaces with Trivial Dual

Example 7 (LpL^p for 0<p<10 < p < 1)

The spaces Lp(Ω)L^p(\Omega) for 0<p<10 < p < 1 have trivial dual: (Lp)={0}(L^p)^* = \{0\}.

These are complete metric spaces (F-spaces) but not Banach spaces. The “norm” fp=fp\|f\|_p = \int |f|^p fails the triangle inequality. The proof (due to Day, 1940) shows that the only convex open subsets of LpL^p for 0<p<10 < p < 1 are \emptyset and LpL^p itself. Since any nonzero continuous linear functional would have f1((1,1))f^{-1}((-1,1)) as a proper convex open set, no such functional exists.

This is the extreme case where the dual sees nothing. It is precisely what the Hahn-Banach theorem prevents in Banach spaces: the convexity of the unit ball (guaranteed by the triangle inequality) is what keeps the dual nontrivial.