Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Compact Operators

In finite dimensions, every linear operator maps bounded sets to bounded sets, and since closed bounded sets are compact (Heine-Borel), bounded sequences always have convergent subsequences. This is the engine behind most of finite-dimensional linear algebra: eigenvalue decompositions, the SVD, and the Fredholm alternative all rely on extracting convergent subsequences.

In infinite dimensions, closed bounded sets are no longer compact (Example 2), and this machinery breaks down for general bounded operators. Compact operators are precisely the class of operators for which it does not break down—they are the infinite-dimensional operators that still behave like matrices.

Why is extracting convergent subsequences the engine behind finite-dimensional linear algebra?

The eigenvalue decomposition and the SVD both reduce to finding a vector that achieves an extremum. The method is always:

  1. Take a sequence approaching the supremum/infimum.

  2. Extract a convergent subsequence (compactness of the unit ball).

  3. The limit achieves the optimum (continuity).

Eigenvalues of a symmetric matrix are found by maximizing the Rayleigh quotient R(x)=Ax,x/x2R(x) = \langle Ax, x \rangle / \|x\|^2 over the unit sphere. In Rn\mathbb{R}^n the unit sphere is compact, so a maximizing sequence has a convergent subsequence whose limit is an eigenvector. Restrict to its orthogonal complement and repeat—the full eigenvalue decomposition follows by induction. Each eigenvalue λi\lambda_i yields a projection Pi=,ψiψiP_i = \langle \cdot, \psi_i \rangle \psi_i onto the corresponding eigenspace, and the spectral decomposition A=λiPiA = \sum \lambda_i P_i says the operator is a sum of scaled orthogonal projections.

The SVD of a general (non-symmetric) matrix follows the same pattern by reducing to the symmetric case: form KKK^*K (which is self-adjoint and positive), apply the Rayleigh quotient argument to find its eigenvalues σi2\sigma_i^2 and eigenvectors viv_i, then set ui=Kvi/σiu_i = Kv_i / \sigma_i. The result is K=σi,viuiK = \sum \sigma_i \langle \cdot, v_i \rangle \, u_i. Each step requires extracting a convergent subsequence from a maximizing sequence on the unit sphere.

A compact operator maps bounded sequences to sequences with convergent subsequences—by definition. This is step 2, transplanted from a property of the space to a property of the operator. For the Hilbert-Schmidt spectral theorem, you maximize the Rayleigh quotient on the unit sphere. You cannot extract a convergent subsequence from the maximizing sequence (xn)(x_n) directly, but since AA is compact, (Axn)(Ax_n) has a convergent subsequence, which suffices to show (xn)(x_n) converges to an eigenvector. Compactness of the operator substitutes for compactness of the ball.

Definition and Basic Properties

The sequential characterization is the one we use most often in practice: bounded sequence in, convergent subsequence out. Compare this with a general bounded operator, which only guarantees bounded sequence in, bounded sequence out.

Proof:

The unit ball B1(0)XB_1(0) \subset X is bounded, so by compactness of the operator, K(B1(0))\overline{K(B_1(0))} is compact and hence bounded in YY. Thus

supx1KxYM\sup_{\|x\| \leq 1} \|Kx\|_Y \leq M

for some M>0M > 0, which gives KopM\|K\|_{\mathrm{op}} \leq M.

The converse is false in infinite dimensions: the identity operator I:22I : \ell^2 \to \ell^2 is bounded but not compact, since the orthonormal sequence (en)(e_n) is bounded but has no convergent subsequence.

The Space of Compact Operators

In algebraic language, K(X,Y)\mathcal{K}(X, Y) is a closed two-sided ideal in L(X)\mathcal{L}(X) when X=YX = Y. Composing a compact operator with any bounded operator (on either side) produces another compact operator.

The closedness statement is the most important: it says that the operator-norm limit of compact operators is again compact. This is the key tool for proving compactness of specific operators.

Proof:

Let (xj)(x_j) be a bounded sequence in XX. We use a diagonal argument.

Since K1K_1 is compact, (K1xj)(K_1 x_j) has a convergent subsequence K1xj(1)K_1 x_{j}^{(1)}.

Since K2K_2 is compact, (K2xj(1))(K_2 x_j^{(1)}) has a convergent subsequence K2xj(2)K_2 x_j^{(2)}.

Continuing, at stage ll the sequence (Klxj(l))(K_l x_j^{(l)}) converges. The diagonal sequence yj:=xj(j)y_j := x_j^{(j)} satisfies: (Klyj)j(K_l y_j)_j converges for every ll.

Now estimate:

KyiKyjKyiKnyiKKnC+KnyiKnyj0 as i,j+KnyjKyjKKnC\|K y_i - K y_j\| \leq \underbrace{\|K y_i - K_n y_i\|}_{\leq \|K - K_n\| \cdot C} + \underbrace{\|K_n y_i - K_n y_j\|}_{\to 0 \text{ as } i,j \to \infty} + \underbrace{\|K_n y_j - K y_j\|}_{\leq \|K - K_n\| \cdot C}

where C=supjyjC = \sup_j \|y_j\|. For any ε>0\varepsilon > 0, first choose nn so that KKn<ε/3C\|K - K_n\| < \varepsilon / 3C, then choose i,ji, j large enough so that KnyiKnyj<ε/3\|K_n y_i - K_n y_j\| < \varepsilon / 3. Thus (Kyj)(K y_j) is Cauchy in YY and converges since YY is a Banach space.

Finite-Rank Operators: Compact Operators as “Infinite-Dimensional Matrices”

The connection between compact operators and matrices runs through finite-rank operators.

Proof:

Let (xn)(x_n) be a bounded sequence in XX. Then (Axn)(Ax_n) is a bounded sequence in the finite-dimensional space R(A)R(A). By Bolzano-Weierstrass, it has a convergent subsequence.

This is the precise sense in which compact operators generalize matrices. A matrix ARm×nA \in \mathbb{R}^{m \times n} defines an operator of rank at most min(m,n)\min(m, n)—always finite. The key theorem (Theorem 1) now tells us:

Compact operators are precisely the operators that can be approximated by “matrices” (finite-rank operators) in the operator norm.

In Hilbert spaces this is exact: every compact operator is the operator-norm limit of finite-rank operators. In general Banach spaces, this is the approximation property (which most natural spaces satisfy, though Enflo showed it can fail).

The Key Example: Integral Operators

Our main source of compact operators is integral operators with square-integrable kernels.

Proof:

Choose an orthonormal basis {φj}\{\varphi_j\} of L2(Ω)L^2(\Omega). Then {φi(x)φj(y)}\{\varphi_i(x) \varphi_j(y)\} is an ONB of L2(Ω×Ω)L^2(\Omega \times \Omega), so

k(x,y)=i,j=1kijφi(x)φj(y),kL2(Ω×Ω)2=i,j=1kij2.k(x, y) = \sum_{i,j=1}^{\infty} k_{ij} \, \varphi_i(x) \varphi_j(y), \qquad \|k\|_{L^2(\Omega \times \Omega)}^2 = \sum_{i,j=1}^{\infty} |k_{ij}|^2.

Define truncations kn(x,y):=i,j=1nkijφi(x)φj(y)k_n(x,y) := \sum_{i,j=1}^{n} k_{ij} \, \varphi_i(x) \varphi_j(y) and corresponding operators KnK_n. For f=l=1clφlf = \sum_{l=1}^{\infty} c_l \varphi_l:

Knf=i,j=1nkijcjφi(x).K_n f = \sum_{i,j=1}^{n} k_{ij} c_j \, \varphi_i(x).

Each KnK_n has finite-dimensional range (rank n\leq n), hence is compact by Proposition 3. The approximation error satisfies

KKn2ΩΩk(x,y)kn(x,y)2dxdy=i,j=n+1kij20.\|K - K_n\|^2 \leq \int_\Omega \int_\Omega |k(x,y) - k_n(x,y)|^2 \, dx \, dy = \sum_{i,j=n+1}^{\infty} |k_{ij}|^2 \to 0.

By Theorem 1, KK is compact.

Spectral Theory of Compact Operators

The spectral theory of compact operators is the payoff: it tells us that compact operators have a spectrum that looks just like the spectrum of a matrix, up to a possible accumulation point at zero.

The Fredholm Alternative

The Hilbert-Schmidt Theorem

When the compact operator is additionally self-adjoint, we get a complete spectral decomposition—the infinite-dimensional analogue of the eigenvalue decomposition of a symmetric matrix.

This is the infinite-dimensional eigendecomposition for self-adjoint operators: the operator AA is completely determined by its eigenvalues and eigenfunctions, just as a symmetric matrix is determined by its eigenvalues and eigenvectors. The spectral representation A=λi,ψiψiA = \sum \lambda_i \langle \cdot, \psi_i \rangle \psi_i is the direct analogue of the matrix diagonalization A=QΛQTA = Q \Lambda Q^T. Note that this is not the SVD—it is the eigenvalue decomposition, which requires self-adjointness. The SVD K=σi,viuiK = \sum \sigma_i \langle \cdot, v_i \rangle u_i is a separate factorization that works for all compact operators (see Remark 1).

Application: Spectral Theory of the Laplacian

Connection to the Hilbert-Schmidt theorem:

If A1A^{-1} is compact and self-adjoint, then A1A^{-1} satisfies the Hilbert-Schmidt theorem with eigenvalues μj0\mu_j \to 0. The eigenvalues of AA are λj=1/μj\lambda_j = 1/\mu_j \to \infty.

This is the typical situation for Laplacian-type operators: A=ΔA = -\Delta with suitable boundary conditions is unbounded, but the solution operator A1A^{-1} (given by a Green’s function) is compact (Example 1). Hence Δ-\Delta has a discrete spectrum of eigenvalues tending to infinity, with eigenfunctions forming an ONB. This is why Fourier series work: the eigenfunctions of the Laplacian on [0,2π][0, 2\pi] are precisely {einx}\{e^{inx}\}.

Looking Ahead

Compact operators are the bridge between the abstract operator theory of this chapter and several later topics in the course: