Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

The Adjoint (Transpose) Operator

The finite-dimensional picture

In Rn\mathbb{R}^n, a linear map T:RnRmT : \mathbb{R}^n \to \mathbb{R}^m is represented by an m×nm \times n matrix AA. The transpose ATA^T is the n×mn \times m matrix satisfying

Ax,y=x,ATyfor all xRn,  yRm.\langle Ax, y \rangle = \langle x, A^T y \rangle \quad \text{for all } x \in \mathbb{R}^n, \; y \in \mathbb{R}^m.

The fundamental theorem of linear algebra decomposes Rn\mathbb{R}^n and Rm\mathbb{R}^m into four mutually orthogonal subspaces:

Rn=ker(A)Range(AT),Rm=ker(AT)Range(A).\mathbb{R}^n = \ker(A) \oplus \text{Range}(A^T), \qquad \mathbb{R}^m = \ker(A^T) \oplus \text{Range}(A).
Source
<Figure size 1000x550 with 1 Axes>

The four fundamental subspaces of an m×nm \times n matrix AA of rank rr. The domain Rn\mathbb{R}^n decomposes into the row space R(AT)\mathrm{R}(A^T) (dimension rr) and the nullspace N(A)\mathrm{N}(A) (dimension nrn - r), which are orthogonal complements. The codomain Rm\mathbb{R}^m decomposes into the column space R(A)\mathrm{R}(A) (dimension rr) and the left nullspace N(AT)\mathrm{N}(A^T) (dimension mrm - r). The map AA sends R(AT)\mathrm{R}(A^T) isomorphically onto R(A)\mathrm{R}(A), and kills N(A)\mathrm{N}(A).

The four subspace picture says: AA and ATA^T are perfectly coupled. The kernel of one is the orthogonal complement of the range of the other. This is the cleanest version of the story, and it relies on two features of Rn\mathbb{R}^n:

  1. Inner product: orthogonal complements make sense.

  2. Finite dimensions: all subspaces are closed, and ranges are automatically closed.

In infinite dimensions, both features can fail. The adjoint theory generalizes this picture, but the four subspace relations require progressively more assumptions as we move away from the finite-dimensional setting.

The Banach adjoint (transpose)

Definition 1 (Banach adjoint)

Let X,YX, Y be normed spaces and T:XYT : X \to Y a bounded linear operator. The adjoint (or transpose) T:YXT^* : Y^* \to X^* is defined by

(Tg)(x)=g(Tx)for all gY,  xX.(T^*g)(x) = g(Tx) \quad \text{for all } g \in Y^*, \; x \in X.

The definition is simple: TT^* takes a functional on YY and pulls it back to a functional on XX by precomposing with TT. If gg defines a foliation of YY, then TgT^*g defines a foliation of XX whose level sets are the preimages under TT of the level sets of gg.

Proposition 1 (Basic properties)

Let T:XYT : X \to Y be bounded. Then:

  1. T:YXT^* : Y^* \to X^* is bounded with T=T\|T^*\| = \|T\|.

  2. (S+T)=S+T(S + T)^* = S^* + T^* and (αT)=αT(\alpha T)^* = \alpha T^*.

  3. (ST)=TS(ST)^* = T^* S^* (reversal of order).

  4. TT^* is injective if and only if Range(T)=Y\overline{\text{Range}(T)} = Y.

  5. TT^* is surjective only if TT is bounded below (i.e., Txcx\|Tx\| \geq c\|x\| for some c>0c > 0).

Proof 1

(1) For gYg \in Y^*: Tg=supx1g(Tx)gT\|T^*g\| = \sup_{\|x\| \leq 1} |g(Tx)| \leq \|g\| \cdot \|T\|, so TT\|T^*\| \leq \|T\|. For the reverse, the sup formula <hb-sup-formula> gives Tx=supg1g(Tx)=supg1(Tg)(x)Tx\|Tx\| = \sup_{\|g\| \leq 1} |g(Tx)| = \sup_{\|g\| \leq 1} |(T^*g)(x)| \leq \|T^*\| \cdot \|x\|, so TT\|T\| \leq \|T^*\|.

(3) ((ST)g)(x)=g(STx)=(Sg)(Tx)=(TSg)(x)((ST)^*g)(x) = g(STx) = (S^*g)(Tx) = (T^*S^*g)(x).

(4) Tg=0T^*g = 0 means g(Tx)=0g(Tx) = 0 for all xx, i.e., gg vanishes on Range(T)\text{Range}(T). By Hahn-Banach (classification of closures <hb-closures>), gg vanishes on Range(T)\overline{\text{Range}(T)}. So ker(T)={0}\ker(T^*) = \{0\} iff the only functional vanishing on Range(T)\overline{\text{Range}(T)} is zero, iff Range(T)=Y\overline{\text{Range}(T)} = Y.

The four subspace relations: general Banach spaces

To state the four subspace relations in Banach spaces, we need a substitute for orthogonal complements. Since we have no inner product, we use annihilators.

Definition 2 (Annihilator)

Let XX be a normed space.

  • For MXM \subseteq X, the annihilator of MM in XX^* is

    M={fX:f(x)=0 for all xM}.M^\perp = \{f \in X^* : f(x) = 0 \text{ for all } x \in M\}.
  • For NXN \subseteq X^*, the pre-annihilator of NN in XX is

    N={xX:f(x)=0 for all fN}.{}^\perp N = \{x \in X : f(x) = 0 \text{ for all } f \in N\}.

Annihilators are the Banach-space replacement for orthogonal complements. In a Hilbert space with Riesz identification, MM^\perp reduces to the usual orthogonal complement. But in general, MM^\perp lives in the dual space, not in XX itself.

Theorem 1 (Four subspace relations (general Banach spaces))

Let X,YX, Y be Banach spaces and T:XYT : X \to Y a bounded linear operator. Then:

  1. ker(T)=Range(T)\ker(T^*) = \text{Range}(T)^\perp

  2. ker(T)=Range(T)\ker(T) = {}^\perp\text{Range}(T^*)

  3. Range(T)=ker(T)\overline{\text{Range}(T)} = {}^\perp\ker(T^*)

  4. Range(T)ker(T)\overline{\text{Range}(T^*)} \subseteq \ker(T)^\perp

Proof 2

(1) gker(T)g \in \ker(T^*) iff Tg=0T^*g = 0 iff g(Tx)=0g(Tx) = 0 for all xx iff gRange(T)g \in \text{Range}(T)^\perp.

(2) xRange(T)x \in {}^\perp\text{Range}(T^*) iff (Tg)(x)=0(T^*g)(x) = 0 for all gYg \in Y^* iff g(Tx)=0g(Tx) = 0 for all gYg \in Y^*. By the distinguishing property <hb-separation>, this holds iff Tx=0Tx = 0 iff xker(T)x \in \ker(T).

(3) From (1), ker(T)=(Range(T)){}^\perp\ker(T^*) = {}^\perp(\text{Range}(T)^\perp). For any subspace MM, the classification of closures gives (M)=M{}^\perp(M^\perp) = \overline{M}. Applying this to M=Range(T)M = \text{Range}(T) yields (3).

(4) If fRange(T)f \in \overline{\text{Range}(T^*)}, then f=limTgnf = \lim T^*g_n for some sequence in YY^*. For xker(T)x \in \ker(T): f(x)=lim(Tgn)(x)=limgn(Tx)=0f(x) = \lim (T^*g_n)(x) = \lim g_n(Tx) = 0. So fker(T)f \in \ker(T)^\perp.

Remark 1 (The asymmetry in relation (4))

Relation (4) is an inclusion, not an equality. This is the key difference from the finite-dimensional case, and it arises because:

  • In (3), we used (M)=M{}^\perp(M^\perp) = \overline{M}, which is a theorem about XX (using Hahn-Banach for XX).

  • To upgrade (4) to equality, we would need (N)=N({}^\perp N)^\perp = \overline{N} for subspaces NXN \subseteq X^*. This requires Hahn-Banach applied to XX^* with functionals from XX^{**}, and it gives (N)=Nw({}^\perp N)^\perp = \overline{N}^{w^*} (weak* closure), which equals N\overline{N} (norm closure) only when XX is reflexive.

So in a non-reflexive space, Range(T)\text{Range}(T^*) can be a proper subset of ker(T)\ker(T)^\perp even after taking its closure. The “missing” elements of ker(T)\ker(T)^\perp are functionals that annihilate ker(T)\ker(T) but cannot be reached as limits of TgnT^*g_n.

The closed range theorem

The most important application of the four subspace picture is the closed range theorem, which restores full symmetry when the range is closed.

Theorem 2 (Closed Range Theorem (Banach))

Let X,YX, Y be Banach spaces and T:XYT : X \to Y bounded. Then

Range(T) is closed    Range(T) is closed.\text{Range}(T) \text{ is closed} \iff \text{Range}(T^*) \text{ is closed.}

When this holds, relation (4) becomes an equality:

Range(T)=ker(T).\text{Range}(T^*) = \ker(T)^\perp.

Proof 3

Sketch. (\Rightarrow) If Range(T)\text{Range}(T) is closed, the open mapping theorem applied to T:X/ker(T)Range(T)T : X/\ker(T) \to \text{Range}(T) gives a bounded inverse. One then shows that TgnfT^*g_n \to f implies gng_n (restricted to Range(T)\text{Range}(T)) converge, producing gg with Tg=fT^*g = f.

(\Leftarrow) Apply the same argument to TT^*.

Equality in (4). If Range(T)\text{Range}(T^*) is closed, then Range(T)=Range(T)ker(T)\text{Range}(T^*) = \overline{\text{Range}(T^*)} \subseteq \ker(T)^\perp. For the reverse inclusion: if fker(T)f \in \ker(T)^\perp, then ff defines a functional on X/ker(T)X/\ker(T). Since TT induces an isomorphism X/ker(T)Range(T)X/\ker(T) \cong \text{Range}(T) (by the closed range), we can pull ff back to a functional gg on Range(T)\text{Range}(T), extend by Hahn-Banach to gYg \in Y^*, and verify Tg=fT^*g = f.

The closed range condition is a property of the operator TT, not the ambient space. It is essential for solvability of equations: Tx=yTx = y has a solution iff yRange(T)y \in \text{Range}(T), and the closed range theorem lets us check this via the adjoint: yRange(T)y \in \overline{\text{Range}(T)} iff g(y)=0g(y) = 0 for all gker(T)g \in \ker(T^*). An alternative route to full equality in relation (4) is to work in a reflexive space, where it holds for every bounded operator automatically. An important class of operators with closed range are the Fredholm operators, discussed in Application: Fredholm Operators.

Reflexive spaces: full symmetry restored

In a reflexive space (XXX \cong X^{**} via the canonical embedding), the asymmetry in relation (4) disappears.

Theorem 3 (Four subspace relations (reflexive spaces))

Let X,YX, Y be reflexive Banach spaces and T:XYT : X \to Y bounded. Then all four relations hold with equality:

  1. ker(T)=Range(T)\ker(T^*) = \text{Range}(T)^\perp

  2. ker(T)=Range(T)\ker(T) = {}^\perp\text{Range}(T^*)

  3. Range(T)=ker(T)\overline{\text{Range}(T)} = {}^\perp\ker(T^*)

  4. Range(T)=ker(T)\overline{\text{Range}(T^*)} = \ker(T)^\perp

Proof 4

Relations (1)--(3) hold in any Banach space. For (4): in a reflexive space, every functional on XX^* comes from an element of XX, so the weak* closure and the norm closure of any convex subset of XX^* coincide. Therefore (N)=N({}^\perp N)^\perp = \overline{N} for subspaces NXN \subseteq X^*, and applying this to N=Range(T)N = \text{Range}(T^*) gives ker(T)=(Range(T))=Range(T)\ker(T)^\perp = ({}^\perp\text{Range}(T^*))^\perp = \overline{\text{Range}(T^*)}.

Reflexivity restores the full symmetry of the finite-dimensional picture: each kernel is the annihilator of the other’s range, and vice versa. No closed range assumption is needed for the four relations to hold (though the ranges themselves may still not be closed).

Hilbert spaces: orthogonal complements

In a Hilbert space, the Riesz representation theorem identifies HH^* with HH, and annihilators become orthogonal complements. The Banach adjoint TT^* becomes the Hilbert adjoint, defined by the familiar relation:

Definition 3 (Hilbert adjoint)

Let H,KH, K be Hilbert spaces and T:HKT : H \to K a bounded linear operator. The Hilbert adjoint T:KHT^* : K \to H is the unique operator satisfying

Tx,yK=x,TyHfor all xH,  yK.\langle Tx, y \rangle_K = \langle x, T^*y \rangle_H \quad \text{for all } x \in H, \; y \in K.

The Hilbert adjoint maps back into the same type of space (not the dual), which is why we can write the four subspace decompositions as orthogonal direct sums within HH and KK themselves.

Corollary 1 (Four subspaces in Hilbert space)

Let H,KH, K be Hilbert spaces and T:HKT : H \to K bounded. Then:

H=ker(T)Range(T),K=ker(T)Range(T).H = \ker(T) \oplus \overline{\text{Range}(T^*)}, \qquad K = \ker(T^*) \oplus \overline{\text{Range}(T)}.

The four subspaces satisfy:

  1. ker(T)=Range(T)\ker(T^*) = \text{Range}(T)^\perp

  2. ker(T)=Range(T)\ker(T) = \text{Range}(T^*)^\perp

  3. Range(T)=ker(T)\overline{\text{Range}(T)} = \ker(T^*)^\perp

  4. Range(T)=ker(T)\overline{\text{Range}(T^*)} = \ker(T)^\perp

All orthogonal complements are taken in the ambient Hilbert space (not the dual).

This is exactly the finite-dimensional picture, with closures added to account for the possibility that ranges are not closed. Every vector in HH decomposes uniquely into a component killed by TT and a component in the closure of what TT^* produces.

See the four subspace picture above, which now carries over with ATA \mapsto T, RnH\mathbb{R}^n \mapsto H, and RmK\mathbb{R}^m \mapsto K, and closures added to account for possibly non-closed ranges.