Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Compact Operators

Almost every existence argument in analysis follows the same three-step template:

  1. Construct an approximate sequence of “almost-solutions” (minimizing sequences, Galerkin approximations, Euler polygonal approximations, maximizers of a Rayleigh quotient, etc.).

  2. Extract a convergent subsequence. This is where compactness enters.

  3. Pass to the limit. Show the limit actually solves the original problem.

In finite dimensions step 2 is free: Bolzano-Weierstrass guarantees that every bounded sequence in Rn\mathbb{R}^n has a convergent subsequence. This is why eigenvalue decompositions, the SVD, the direct method of the calculus of variations, and Peano’s ODE existence theorem all “just work” in Rn\mathbb{R}^n. Each reduces to building an approximate sequence on a compact set and extracting a convergent subsequence.

In infinite dimensions, closed bounded sets are no longer compact (Example 2), and step 2 fails for general bounded operators. Compact operators are precisely the class of operators for which it does not break down: they restore step 2 as a property of the operator rather than the space.

Definition 1 (Compact, precompact, and totally bounded sets)

Let (M,d)(M, d) be a metric space and AMA \subset M.

  1. AA is compact if every open cover of AA has a finite subcover, or equivalently, if every sequence in AA has a subsequence converging to a point in AA.

  2. AA is precompact (or relatively compact) if its closure A\overline{A} is compact, or equivalently, if every sequence in AA has a subsequence that is Cauchy.

  3. AA is totally bounded if for every ε>0\varepsilon > 0, AA can be covered by finitely many balls of radius ε\varepsilon.

The key equivalence is:

AA is compact     \iff AA is complete and totally bounded.

In a complete metric space (such as a Banach space), precompactness and total boundedness coincide: AA is precompact if and only if it is totally bounded.

Why both conditions are needed.

Total boundedness is the condition that makes a set approximately finite-dimensional: at every resolution ε\varepsilon, the set is indistinguishable from a finite set. Completeness then ensures that Cauchy subsequences (built by the pigeonhole principle at finer and finer scales) actually converge.

Remark 1 (Compact sets in infinite dimensions)

Boundedness alone is not enough. The unit ball of 2\ell^2 is closed and bounded, yet the orthonormal sequence (en)(e_n) has every pair distance 2\sqrt{2} apart, so no subsequence can be Cauchy. You need a constraint that suppresses this “too many directions” problem, i.e. you need total boundedness (Definition 1).

A concrete and important example: the unit ball of H1(Ω)H^1(\Omega), viewed inside L2(Ω)L^2(\Omega), is precompact (Rellich-Kondrachov). The reason is visible in Fourier space. An H1H^1 bound forces u^(k)2C/(1+k2)|\hat{u}(k)|^2 \leq C/(1 + |k|^2), so high-frequency modes are uniformly suppressed. For any ε>0\varepsilon > 0, all the L2L^2 energy beyond a sufficiently large frequency NN is less than ε\varepsilon, uniformly over the entire H1H^1 ball. The remaining low-frequency part lives in a finite-dimensional space. This is total boundedness: the derivative bound kills oscillations that would otherwise prevent clustering, and the bounded domain prevents mass from escaping to spatial infinity.

Definition and Basic Properties

Definition 2 (Compact operator)

An operator K:XYK : X \to Y between normed spaces is compact if for each bounded set UXU \subset X, the image K(U)K(U) is relatively compact in YY (i.e., K(U)\overline{K(U)} is compact).

Equivalently, KK is compact if and only if every bounded sequence (xn)(x_n) in XX has a subsequence (xnk)(x_{n_k}) such that (Kxnk)(Kx_{n_k}) converges in YY.

The sequential characterization is the one we use most often in practice: bounded sequence in, convergent subsequence out. Compare this with a general bounded operator, which only guarantees bounded sequence in, bounded sequence out.

Proposition 1 (Compact operators are bounded)

Every compact operator is bounded.

Proof 1

The unit ball B1(0)XB_1(0) \subset X is bounded, so by compactness of the operator, K(B1(0))\overline{K(B_1(0))} is compact and hence bounded in YY. Thus

supx1KxYM\sup_{\|x\| \leq 1} \|Kx\|_Y \leq M

for some M>0M > 0, which gives KopM\|K\|_{\mathrm{op}} \leq M.

The converse is false in infinite dimensions: the identity operator I:22I : \ell^2 \to \ell^2 is bounded but not compact, since the orthonormal sequence (en)(e_n) is bounded but has no convergent subsequence.

The Space of Compact Operators

Proposition 2 (Compact operators form a closed ideal)

Let X,Y,ZX, Y, Z be normed spaces with YY a Banach space.

  1. If K:XYK : X \to Y is compact and T:YZT : Y \to Z is bounded, then TKTK is compact.

  2. If T:ZXT : Z \to X is bounded and K:XYK : X \to Y is compact, then KTKT is compact.

  3. The set K(X,Y)\mathcal{K}(X, Y) of compact operators is a closed subspace of L(X,Y)\mathcal{L}(X, Y) in the operator norm.

In algebraic language, K(X,Y)\mathcal{K}(X, Y) is a closed two-sided ideal in L(X)\mathcal{L}(X) when X=YX = Y. Composing a compact operator with any bounded operator (on either side) produces another compact operator.

The closedness statement is the most important: it says that the operator-norm limit of compact operators is again compact. This is the key tool for proving compactness of specific operators.

Theorem 1 (Limits of compact operators are compact)

Let XX be a normed space and YY a Banach space. If (Kn)(K_n) is a sequence of compact operators in L(X,Y)\mathcal{L}(X, Y) with KnKK_n \to K in the operator norm, then KK is compact.

Proof 2

Let (xj)(x_j) be a bounded sequence in XX. We use a diagonal argument.

Since K1K_1 is compact, (K1xj)(K_1 x_j) has a convergent subsequence K1xj(1)K_1 x_{j}^{(1)}.

Since K2K_2 is compact, (K2xj(1))(K_2 x_j^{(1)}) has a convergent subsequence K2xj(2)K_2 x_j^{(2)}.

Continuing, at stage ll the sequence (Klxj(l))(K_l x_j^{(l)}) converges. The diagonal sequence yj:=xj(j)y_j := x_j^{(j)} satisfies: (Klyj)j(K_l y_j)_j converges for every ll.

Now estimate:

KyiKyjKyiKnyiKKnC+KnyiKnyj0 as i,j+KnyjKyjKKnC\|K y_i - K y_j\| \leq \underbrace{\|K y_i - K_n y_i\|}_{\leq \|K - K_n\| \cdot C} + \underbrace{\|K_n y_i - K_n y_j\|}_{\to 0 \text{ as } i,j \to \infty} + \underbrace{\|K_n y_j - K y_j\|}_{\leq \|K - K_n\| \cdot C}

where C=supjyjC = \sup_j \|y_j\|. For any ε>0\varepsilon > 0, first choose nn so that KKn<ε/3C\|K - K_n\| < \varepsilon / 3C, then choose i,ji, j large enough so that KnyiKnyj<ε/3\|K_n y_i - K_n y_j\| < \varepsilon / 3. Thus (Kyj)(K y_j) is Cauchy in YY and converges since YY is a Banach space.

Corollary 1 (K(X,Y) is a Banach space)

The space K(X,Y)\mathcal{K}(X, Y) of compact operators from a normed space XX into a Banach space YY is itself a Banach space.

Finite-Rank Operators: Compact Operators as “Infinite-Dimensional Matrices”

The connection between compact operators and matrices runs through finite-rank operators.

Definition 3 (Finite-rank operator)

A bounded linear operator A:XYA : X \to Y has finite rank if its range R(A)R(A) is finite-dimensional. We write rank(A)=dimR(A)\mathrm{rank}(A) = \dim R(A).

Proposition 3 (Finite-rank operators are compact)

Every bounded operator with finite-dimensional range is compact.

Proof 3

Let (xn)(x_n) be a bounded sequence in XX. Then (Axn)(Ax_n) is a bounded sequence in the finite-dimensional space R(A)R(A). By Bolzano-Weierstrass, it has a convergent subsequence.

Finite-rank operators are literally matrices: a matrix ARm×nA \in \mathbb{R}^{m \times n} has rank at most min(m,n)\min(m, n). On Hilbert spaces, compact operators turn out to be exactly the norm limits of such operators:

Proposition 4 (Compact operators on Hilbert spaces are limits of finite-rank operators)

Let HH be a Hilbert space and K:HHK : H \to H a compact operator. Then there exists a sequence of finite-rank operators (Kn)(K_n) such that KnK0\|K_n - K\| \to 0.

Proof 4

Let {ej}j=1\{e_j\}_{j=1}^{\infty} be an orthonormal basis for HH, and let Pn:HHP_n : H \to H denote the orthogonal projection onto Vn:=span{e1,,en}V_n := \operatorname{span}\{e_1, \ldots, e_n\}, so that

Pnx=j=1nx,ejej.P_n x = \sum_{j=1}^{n} \langle x, e_j \rangle \, e_j.

Define Kn:=PnKK_n := P_n K. Since R(Kn)VnR(K_n) \subseteq V_n, each KnK_n has rank at most nn and is therefore compact by Proposition 3.

We claim KnK0\|K_n - K\| \to 0. Suppose for contradiction that this fails. Then there exists ε>0\varepsilon > 0 and a subsequence (still denoted nn) such that KnK>ε\|K_n - K\| > \varepsilon for all nn. By definition of the operator norm, for each nn there exists xnHx_n \in H with xn1\|x_n\| \leq 1 and

(KPnK)xn=(IPn)Kxn>ε.\|(K - P_n K) x_n\| = \|(I - P_n) K x_n\| > \varepsilon.

Since KK is compact and (xn)(x_n) is bounded, there exists a subsequence (xnk)(x_{n_k}) such that KxnkyK x_{n_k} \to y for some yHy \in H. Now estimate:

(IPnk)Kxnk(IPnk)(Kxnky)+(IPnk)y.\|(I - P_{n_k}) K x_{n_k}\| \leq \|(I - P_{n_k})(K x_{n_k} - y)\| + \|(I - P_{n_k}) y\|.

The first term satisfies (IPnk)(Kxnky)Kxnky0\|(I - P_{n_k})(K x_{n_k} - y)\| \leq \|K x_{n_k} - y\| \to 0 since IPn1\|I - P_n\| \leq 1. The second term satisfies (IPnk)y0\|(I - P_{n_k}) y\| \to 0 since partial sums of the Fourier expansion converge: PnyyP_n y \to y for every yHy \in H. This gives (IPnk)Kxnk0\|(I - P_{n_k}) K x_{n_k}\| \to 0, contradicting the assumption that it stays above ε\varepsilon.

Combined with Theorem 1, this tells us that on Hilbert spaces, compact operators are precisely the norm limits of finite-rank operators. They are the closest thing to matrices in infinite dimensions.

The result above extends beyond Hilbert spaces. A Banach space YY is said to have the approximation property if for every compact set CYC \subset Y and every ε>0\varepsilon > 0, there exists a finite-rank operator F:YYF : Y \to Y with Fxx<ε\|Fx - x\| < \varepsilon for all xCx \in C. When YY has the approximation property, every compact operator K:XYK : X \to Y is the norm limit of finite-rank operators. All Hilbert spaces, LpL^p spaces, and C(K)C(K) spaces have the approximation property. However, Enflo (1973) constructed a Banach space that fails the approximation property, so the result does not hold in full generality.

On a Hilbert space, the analogy between compact operators and matrices is precise:

Finite dimensions (RnRm\mathbb{R}^n \to \mathbb{R}^m)Compact operators (HHH \to H)
Every operator has finite rankNorm limit of finite-rank operators
Spectrum = eigenvaluesSpectrum = eigenvalues  {0}\cup\ \{0\}
Each eigenvalue has finite multiplicityEach nonzero eigenvalue has finite multiplicity
SVD: A=i=1rσiuiviTA = \sum_{i=1}^r \sigma_i u_i v_i^TSVD: K=i=1σi,viuiK = \sum_{i=1}^\infty \sigma_i \langle \cdot, v_i \rangle \, u_i
Symmetric: A=QΛQTA = Q\Lambda Q^TSelf-adjoint: K=i=1λi,ψiψiK = \sum_{i=1}^\infty \lambda_i \langle \cdot, \psi_i \rangle \, \psi_i
Fredholm alternative holdsFredholm alternative holds

The SVD holds for every compact operator between Hilbert spaces. The eigenvalue decomposition into an orthonormal eigenbasis requires the additional assumption that the operator is normal (KK=KKK^*K = KK^*), which includes self-adjoint operators as a special case (just as A=QΛQTA = Q\Lambda Q^T requires symmetry in finite dimensions). Without normality, a compact operator’s eigenvectors need not form a basis.

The Canonical Example: Integral Operators

Our main source of compact operators is integral operators with square-integrable kernels.

Theorem 2 (Hilbert-Schmidt integral operators are compact)

Let ΩRd\Omega \subset \mathbb{R}^d be bounded and let kL2(Ω×Ω)k \in L^2(\Omega \times \Omega). Then the integral operator

Kf(x):=Ωk(x,y)f(y)dyKf(x) := \int_\Omega k(x, y) f(y) \, dy

defines a compact operator K:L2(Ω)L2(Ω)K : L^2(\Omega) \to L^2(\Omega).

Proof 5

Choose an orthonormal basis {φj}\{\varphi_j\} of L2(Ω)L^2(\Omega). Then {φi(x)φj(y)}\{\varphi_i(x) \varphi_j(y)\} is an ONB of L2(Ω×Ω)L^2(\Omega \times \Omega), so

k(x,y)=i,j=1kijφi(x)φj(y),kL2(Ω×Ω)2=i,j=1kij2.k(x, y) = \sum_{i,j=1}^{\infty} k_{ij} \, \varphi_i(x) \varphi_j(y), \qquad \|k\|_{L^2(\Omega \times \Omega)}^2 = \sum_{i,j=1}^{\infty} |k_{ij}|^2.

Define truncations kn(x,y):=i,j=1nkijφi(x)φj(y)k_n(x,y) := \sum_{i,j=1}^{n} k_{ij} \, \varphi_i(x) \varphi_j(y) and corresponding operators KnK_n. For f=l=1clφlf = \sum_{l=1}^{\infty} c_l \varphi_l:

Knf=i,j=1nkijcjφi(x).K_n f = \sum_{i,j=1}^{n} k_{ij} c_j \, \varphi_i(x).

Each KnK_n has finite-dimensional range (rank n\leq n), hence is compact by Proposition 3. The approximation error satisfies

KKn2ΩΩk(x,y)kn(x,y)2dxdy=i,j=n+1kij20.\|K - K_n\|^2 \leq \int_\Omega \int_\Omega |k(x,y) - k_n(x,y)|^2 \, dx \, dy = \sum_{i,j=n+1}^{\infty} |k_{ij}|^2 \to 0.

By Theorem 1, KK is compact.

The proof above is really a finite-rank approximation argument. The kernel k(x,y)k(x,y) is expanded in an ONB for L2(Ω×Ω)L^2(\Omega \times \Omega), and truncating to nn terms gives a rank-nn operator that converges to KK in the operator norm. This is analogous to truncating the SVD of a matrix to rr singular values to obtain a best rank-rr approximation (the Eckart-Young theorem). However, the expansion here uses an arbitrary ONB, not the singular functions of KK. The true SVD of the integral operator would use the eigenbases of KKK^*K and KKKK^*.

Example 1 (The solution operator for Poisson’s equation is compact)

Consider the Poisson equation on [0,L][0, L] with Neumann boundary conditions:

u=f,u(0)=u(L)=0,0Lu(x)dx=M.-u'' = f, \quad u'(0) = u'(L) = 0, \quad \int_0^L u(x) \, dx = M.

Integrating twice and applying Fubini gives the solution

u(x)=0Lg(x,s)f(s)ds+cu(x) = \int_0^L g(x, s) f(s) \, ds + c

where gg is the Green’s function (an L2L^2 kernel). The solution operator A1:fuA^{-1} : f \mapsto u is an integral operator with an L2L^2 kernel, hence compact by Theorem 2.

This is a fundamental pattern: differential operators are unbounded, but their inverses (solution operators) are often compact. The compactness of A1A^{-1} is what gives the Laplacian a discrete spectrum of eigenvalues.

Example 2 (Non-example: unbounded operators are not compact)

The second derivative operator A=d2dx2:D(A)L2(0,1)A = -\frac{d^2}{dx^2} : D(A) \to L^2(0,1) is unbounded, and therefore not compact. Compact operators are always bounded (Proposition 1).

Spectral Theory of Compact Operators

The spectral theory of compact operators is the payoff: it tells us that compact operators have a spectrum that looks just like the spectrum of a matrix, up to a possible accumulation point at zero.

The Fredholm Alternative

Theorem 3 (Spectral Fredholm Alternative)

Let A:HHA : H \to H be a compact linear operator on a Hilbert space HH. Then:

  1. The spectrum σ(A)\sigma(A) is a compact subset of C\mathbb{C} whose only possible accumulation point is λ=0\lambda = 0.

  2. For each λC{0}\lambda \in \mathbb{C} \setminus \{0\}, exactly one of the following holds:

    • (Alternative 1): λρ(A)\lambda \in \rho(A), i.e., (AλI)1(A - \lambda I)^{-1} exists and is bounded. The equation Axλx=yAx - \lambda x = y has a unique solution for every yy.

    • (Alternative 2): λσp(A)\lambda \in \sigma_p(A) is an eigenvalue of finite multiplicity. The equation Axλx=0Ax - \lambda x = 0 has a nontrivial finite-dimensional solution space.

Compare this with finite dimensions: for a matrix ARn×nA \in \mathbb{R}^{n \times n} and λ0\lambda \neq 0, either det(AλI)0\det(A - \lambda I) \neq 0 (invertible) or det(AλI)=0\det(A - \lambda I) = 0 (eigenvalue with finite-dimensional eigenspace). The Fredholm alternative is exactly this dichotomy, extended to infinite dimensions via compactness. Without compactness, the spectrum can contain continuous and residual parts that have no finite-dimensional analogue.

The Hilbert-Schmidt Theorem

When the compact operator is additionally self-adjoint, we get a complete spectral decomposition—the infinite-dimensional analogue of the eigenvalue decomposition of a symmetric matrix.

Theorem 4 (Hilbert-Schmidt Spectral Theorem)

Let A:HHA : H \to H be a linear, compact, and self-adjoint operator on a Hilbert space HH. Then:

  1. All eigenvalues λi\lambda_i of AA are real.

  2. Eigenvalues can accumulate only at 0.

  3. There exists an orthonormal set of eigenfunctions {ψi}\{\psi_i\} such that AA has the spectral representation

Aφ=i=1λiφ,ψiψi.A\varphi = \sum_{i=1}^{\infty} \lambda_i \langle \varphi, \psi_i \rangle \, \psi_i.

This is the infinite-dimensional eigendecomposition for self-adjoint operators: the operator AA is completely determined by its eigenvalues and eigenfunctions, just as a symmetric matrix is determined by its eigenvalues and eigenvectors. The spectral representation A=λi,ψiψiA = \sum \lambda_i \langle \cdot, \psi_i \rangle \psi_i is the direct analogue of the matrix diagonalization A=QΛQTA = Q \Lambda Q^T. Note that this is not the SVD—it is the eigenvalue decomposition, which requires self-adjointness. The SVD K=σi,viuiK = \sum \sigma_i \langle \cdot, v_i \rangle u_i is a separate factorization that works for all compact operators (see Remark 3).

Remark 6 (Why eigenvalues accumulate at zero)

On a Hilbert space, compact operators are norm limits of finite-rank operators (Proposition 4). A finite-rank operator is a matrix: it has finitely many nonzero eigenvalues. As the finite-rank approximations KnKK_n \to K, the eigenvalues of KnK_n converge to those of KK, but each KnK_n can only contribute finitely many. The only place infinitely many eigenvalues can pile up is at the shared limit 0. Compactness forces the spectrum to be “almost finite” at every scale, and 0 is the residue of that approximation.

Contrast this with a general bounded operator that is not compact: multiplication by xx on L2([0,1])L^2([0,1]), defined by (Mf)(x)=xf(x)(Mf)(x) = xf(x), has M=1\|M\| = 1 but no eigenvalues at all. If Mf=λfMf = \lambda f, then (xλ)f(x)=0(x - \lambda)f(x) = 0 a.e., forcing f=0f = 0 a.e. The spectrum is the purely continuous set [0,1][0,1]. There is no eigenfunction expansion, no discrete decomposition. This operator preserves the full infinite-dimensional structure of L2L^2, which is exactly what compact operators suppress.

The spectral representation also provides a natural way to define fractional powers of operators. If AA is a positive operator with Aφ=λiφ,ψiψiA\varphi = \sum \lambda_i \langle \varphi, \psi_i \rangle \psi_i, then we define

Aαφ:=i=1λiαφ,ψiψiA^\alpha \varphi := \sum_{i=1}^{\infty} \lambda_i^\alpha \langle \varphi, \psi_i \rangle \, \psi_i

for α0\alpha \geq 0. This is used, for example, to define Δ\sqrt{-\Delta}, which appears in the study of fractional diffusion and Levy flights.

Application: Spectral Theory of the Laplacian

Theorem 5 (Spectral theorem for differential operators with compact inverse)

Let A:D(A)HA : D(A) \to H be a symmetric, linear, unbounded operator with R(A)=HR(A) = H, and suppose A1A^{-1} exists and is compact. Then:

  1. There exists an infinite sequence of real eigenvalues {λn}\{\lambda_n\} with limnλn=+\lim_{n \to \infty} |\lambda_n| = +\infty.

  2. The eigenvectors {wj}\{w_j\} can be chosen to form an orthonormal basis, and

Au=j=1λju,wjwj.Au = \sum_{j=1}^{\infty} \lambda_j \langle u, w_j \rangle \, w_j.

Remark 8 (Connection to the Hilbert-Schmidt theorem)

If A1A^{-1} is compact and self-adjoint, then A1A^{-1} satisfies the Hilbert-Schmidt theorem with eigenvalues μj0\mu_j \to 0. The eigenvalues of AA are λj=1/μj\lambda_j = 1/\mu_j \to \infty.

This is the typical situation for Laplacian-type operators: A=ΔA = -\Delta with suitable boundary conditions is unbounded, but the solution operator A1A^{-1} (given by a Green’s function) is compact (Example 1). Hence Δ-\Delta has a discrete spectrum of eigenvalues tending to infinity, with eigenfunctions forming an ONB. This is why Fourier series work: the eigenfunctions of the Laplacian on [0,2π][0, 2\pi] are precisely {einx}\{e^{inx}\}.

The bounded domain is essential. On Rn\mathbb{R}^n the “cage” preventing translation is lost: a fixed bump φ(xn)\varphi(x - n) slides off to infinity, so the solution operator is no longer compact. The spectrum of Δ-\Delta on Rn\mathbb{R}^n is the continuous set [0,)[0, \infty), and the clean eigenfunction expansion breaks down.

Looking Ahead

Compact operators are the bridge between the abstract operator theory of this chapter and several later topics in the course: