
Big Idea

Polynomial interpolation is a change of basis between two equivalent representations of the same polynomial: its values $\{f_j\}$ at $n+1$ nodes and its coefficients $\{c_j\}$ in some basis of $\mathbb{P}_n$. The Vandermonde matrix is that change-of-basis map. In the monomial basis this map is catastrophically ill-conditioned. In the Lagrange basis it is the identity, but naive evaluation is unstable. The naive Lagrange evaluation and the Vandermonde linear solve are numerically unstable for moderate $n$ and are not the algorithms used in practice. The barycentric formula is.

The Interpolation Problem

Given $n+1$ distinct nodes $x_0 < x_1 < \cdots < x_n$ and values $f_0, \ldots, f_n$, find a polynomial $p_n \in \mathbb{P}_n$ such that

$$p_n(x_j) = f_j, \qquad j = 0, 1, \ldots, n.$$

Theorem 1 (Existence and Uniqueness)

For any $n+1$ distinct nodes there exists a unique $p_n \in \mathbb{P}_n$ interpolating the data.

Proof 1

Uniqueness. If $p, q \in \mathbb{P}_n$ both interpolate the data then $d = p - q \in \mathbb{P}_n$ has $n+1$ roots, so $d \equiv 0$.

Existence. The Lagrange formula below constructs one.

Three Bases for $\mathbb{P}_n$

Polynomials of degree $\le n$ form an $(n+1)$-dimensional vector space. Picking a basis $\{\phi_0, \ldots, \phi_n\}$ writes every polynomial as $p_n(x) = \sum_j c_j\, \phi_j(x)$. The interpolation conditions $p_n(x_i) = f_i$ then become a linear system $\Phi\, \mathbf{c} = \mathbf{f}$ where $\Phi_{ij} = \phi_j(x_i)$. The cost of interpolation, its conditioning, and how easily we can extend it later all depend on which basis we pick. We discuss three.

Monomial basis

The basis you already know, $\{1, x, x^2, \ldots, x^n\}$. The motivation is familiarity: every polynomial is written in this basis when you first meet it. The coefficients $c_j$ are then the usual coefficients of $x^j$, and the values↔coefficients system $\Phi\, \mathbf{c} = \mathbf{f}$ takes the explicit form

$$\underbrace{\begin{pmatrix} 1 & x_0 & x_0^2 & \cdots & x_0^n \\ 1 & x_1 & x_1^2 & \cdots & x_1^n \\ \vdots & & & & \vdots \\ 1 & x_n & x_n^2 & \cdots & x_n^n \end{pmatrix}}_{V} \begin{pmatrix} c_0 \\ c_1 \\ \vdots \\ c_n \end{pmatrix} = \begin{pmatrix} f_0 \\ f_1 \\ \vdots \\ f_n \end{pmatrix}.$$

$V$ is the Vandermonde matrix. Each row records the powers of one node; each column records one monomial sampled at all nodes. The system is solvable whenever the nodes are distinct, but solving it is numerically a disaster: the condition number $\kappa(V)$ grows exponentially in $n$.

*Figure: condition number $\kappa(V)$ of the Vandermonde matrix versus $n$, for equispaced and Chebyshev nodes.*

For equispaced nodes $\kappa(V)$ blows past $1/\varepsilon_{\text{mach}}$ already by $n \approx 25$. Solving $V\mathbf{c} = \mathbf{f}$ via a general linear solver in double precision then loses essentially all significant digits. Chebyshev nodes (anticipated in Where to Place the Nodes) help, but even then $\kappa(V)$ grows.
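The growth is easy to observe directly. Here is a minimal sketch, assuming NumPy (`vandermonde_cond` is an illustrative name, not from the chapter):

```python
import numpy as np

def vandermonde_cond(nodes):
    """2-norm condition number of the Vandermonde matrix at these nodes."""
    # increasing=True orders the columns 1, x, x^2, ..., x^n
    V = np.vander(np.asarray(nodes, dtype=float), increasing=True)
    return np.linalg.cond(V)

# Equispaced vs. Chebyshev nodes on [-1, 1]
for n in (5, 10, 20):
    equi = np.linspace(-1.0, 1.0, n + 1)
    cheb = np.cos(np.arange(n + 1) * np.pi / n)
    print(f"n={n:2d}  equispaced: {vandermonde_cond(equi):.2e}  "
          f"Chebyshev: {vandermonde_cond(cheb):.2e}")
```

Even at these modest degrees the equispaced condition number has grown by many orders of magnitude.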

This blow-up reflects the choice of algorithm applied to the values↔coefficients map, not the conditioning of polynomial interpolation itself. Specialized solvers that exploit the Vandermonde structure, such as the Björck–Pereyra algorithm (Björck & Pereyra, 1970), recover the coefficients to far higher accuracy than $\kappa(V)$ alone would suggest.

A more robust strategy is to choose a basis in which the values↔coefficients matrix $\Phi$ has trivial structure: triangular, identity, or orthogonal. The conditioning then ceases to be a problem, because $\kappa(\Phi) = 1$ for the identity and for any orthogonal matrix, and triangular systems are solved by direct substitution. The Newton basis (below) makes $\Phi$ lower triangular. The Lagrange basis after that makes it the identity. The Chebyshev basis at Chebyshev nodes, introduced in Where to Place the Nodes, makes $\Phi$ (a scaling of) the Discrete Cosine Transform matrix, which is orthogonal up to that scaling. From the linear algebra you have already developed in QR Factorization we know that orthogonal factors do not amplify error. That is exactly why these alternative bases sidestep the Vandermonde blow-up: they replace the ill-conditioned dense solve by a multiplication against a perfectly conditioned matrix, which the FFT moreover computes in $O(n \log n)$ operations.

Newton basis

The motivation for Newton is incrementality: we want a basis where the values↔coefficients matrix $\Phi$ is lower triangular, so that adding a new data point appends one equation and one unknown without disturbing any earlier coefficient. That forces $\phi_k$ to vanish at $x_0, \ldots, x_{k-1}$, giving the basis

$$\{1,\; (x-x_0),\; (x-x_0)(x-x_1),\; \ldots,\; (x-x_0)\cdots(x-x_{n-1})\}.$$

The coefficients in this basis are the divided differences of $f$, and the interpolant of degree $n+1$ is obtained from the interpolant of degree $n$ by adding one new term. Newton is therefore the basis of choice when the number of nodes is not known in advance, or when nodes are added adaptively. We will not need this construction explicitly in what follows. Its modern replacement is the adaptive evaluation discussed in Adaptive QR: Choosing N on the Fly, which gets the same incremental property in a different way.
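To make the incrementality concrete, here is a hedged sketch of the standard divided-difference table and nested-multiplication evaluation of the Newton form, assuming NumPy (the function names `divided_differences` and `newton_eval` are illustrative, not from the chapter):

```python
import numpy as np

def divided_differences(x, f):
    """Newton coefficients c[k] = f[x_0, ..., x_k], built column by column."""
    x = np.asarray(x, dtype=float)
    c = np.array(f, dtype=float)
    for k in range(1, len(x)):
        # NumPy evaluates the right-hand side before assigning, so the
        # previous column's values are used throughout this update
        c[k:] = (c[k:] - c[k-1:-1]) / (x[k:] - x[:-k])
    return c  # c[k] multiplies (x - x_0) ... (x - x_{k-1})

def newton_eval(x, c, t):
    """Evaluate the Newton form at t by nested multiplication (Horner-style)."""
    p = c[-1]
    for k in range(len(c) - 2, -1, -1):
        p = p * (t - x[k]) + c[k]
    return p
```

Appending a node appends one coefficient: because $c_k$ depends only on $x_0, \ldots, x_k$, the earlier entries of the returned array are unchanged, which is exactly the triangular structure at work.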

Lagrange basis

The motivation here is hands-on: build the interpolant as an explicit product of linear factors that automatically passes through the data. For each node $x_j$, the polynomial

$$\ell_j(x) = \prod_{i \ne j} \frac{x - x_i}{x_j - x_i}$$

is, by construction, zero at every other node $x_i$ (one of the factors in the product vanishes there) and equal to 1 at $x_j$ (the numerator matches the denominator). So $\ell_j(x_i) = \delta_{ij}$, and the linear combination

$$p_n(x) = \sum_{j=0}^n f_j\, \ell_j(x)$$

automatically satisfies $p_n(x_i) = f_i$ at every node. No system to solve, no inversion: just products of linear factors weighted by the data. As a bonus, the values↔coefficients matrix $\Phi$ is now the identity, so the coefficients of $p_n$ in this basis are literally the data values.

The cost is paid in evaluation: the formula above is $O(n^2)$ per evaluation point and is prone to overflow as $n$ grows. The barycentric reformulation below fixes both issues without leaving the basis.
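A direct transcription of the formula (a sketch in plain Python; `lagrange_naive` is an illustrative name) makes the $O(n^2)$-per-point cost visible in the nested loops:

```python
def lagrange_naive(x, f, t):
    """Evaluate p_n(t) = sum_j f_j * l_j(t) directly: O(n^2) per point t."""
    p = 0.0
    for j in range(len(x)):
        lj = 1.0  # build l_j(t) as the product of (t - x_i) / (x_j - x_i)
        for i in range(len(x)):
            if i != j:
                lj *= (t - x[i]) / (x[j] - x[i])
        p += f[j] * lj
    return p
```

Each of the $n+1$ basis functions costs $n$ multiplications per point, and the long products can overflow for large $n$; the barycentric formula below avoids both problems.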

*Figure: the Lagrange basis polynomials $\ell_j$ on a set of nodes.*

Each $\ell_j$ is the unique degree-$n$ polynomial that is 1 at $x_j$ and 0 at every other node. The interpolant $\sum f_j \ell_j$ inherits the required values automatically.

The Barycentric Formula

We can reorganize the Lagrange formula algebraically into a form that evaluates in $O(n)$ and is numerically stable. Define the node polynomial and barycentric weights

$$\ell(x) = \prod_{k=0}^n (x - x_k), \qquad \lambda_j = \frac{1}{\prod_{k \ne j}(x_j - x_k)}.$$

Then $\ell_j(x) = \ell(x)\, \lambda_j / (x - x_j)$, which expresses each basis function through the common polynomial $\ell(x)$ and its weight $\lambda_j$. Combining this factorisation with the following basic property of the Lagrange basis gives the second barycentric formula.

Lemma 1 (Partition of unity)

For any distinct nodes $x_0, \ldots, x_n$, the Lagrange basis satisfies

$$\sum_{j=0}^n \ell_j(x) \;\equiv\; 1 \qquad \text{for all } x \in \mathbb{R}.$$

Proof 2

Consider the data $f_j = 1$ at each node $x_j$. There are two polynomials in $\mathbb{P}_n$ that match this data:

  • The constant polynomial $p(x) \equiv 1$, which obviously satisfies $p(x_j) = 1$ at every $x_j$ and has degree $0 \le n$.

  • The Lagrange interpolant $q(x) = \sum_{j=0}^n 1 \cdot \ell_j(x) = \sum_{j=0}^n \ell_j(x)$, which by construction satisfies $q(x_j) = \ell_j(x_j) = 1$ and is a sum of polynomials of degree $\le n$.

By the uniqueness half of Theorem 1, there is only one polynomial in $\mathbb{P}_n$ matching the data, so $p \equiv q$ as polynomials. Equality of polynomials means equality at every $x$, not just at the nodes:

$$1 \;=\; \sum_{j=0}^n \ell_j(x) \qquad \text{for all } x \in \mathbb{R}.$$

Proposition 1 (Barycentric Interpolation Formula)

$$p_n(x) = \frac{\displaystyle\sum_{j=0}^n \frac{\lambda_j}{x - x_j}\, f_j}{\displaystyle\sum_{j=0}^n \frac{\lambda_j}{x - x_j}}, \qquad p_n(x_j) = f_j.$$

Proof 3

Start from the Lagrange formula and substitute $\ell_j(x) = \ell(x)\, \lambda_j / (x - x_j)$:

$$p_n(x) \;=\; \sum_{j=0}^n f_j\, \ell_j(x) \;=\; \ell(x) \sum_{j=0}^n \frac{\lambda_j}{x - x_j}\, f_j.$$

The partition of unity identity $\sum_j \ell_j(x) = 1$, with the same substitution applied, gives

$$1 \;=\; \sum_{j=0}^n \ell_j(x) \;=\; \ell(x) \sum_{j=0}^n \frac{\lambda_j}{x - x_j}, \qquad \text{so} \qquad \ell(x) \;=\; \frac{1}{\displaystyle \sum_{j=0}^n \frac{\lambda_j}{x - x_j}}.$$

Substituting this expression for $\ell(x)$ into the formula for $p_n$ eliminates the prefactor and produces the ratio in the proposition. At a node $x = x_j$ both sums share a $1/(x - x_j)$ singularity that cancels in the ratio, leaving $p_n(x_j) = f_j$.

Once the weights $\lambda_j$ have been computed, at a one-time cost of $O(n^2)$, each evaluation costs only $O(n)$.
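A minimal sketch of this two-stage scheme, assuming NumPy (the function names are illustrative, not from the chapter):

```python
import numpy as np

def barycentric_weights(x):
    """lambda_j = 1 / prod_{k != j} (x_j - x_k), computed once in O(n^2)."""
    x = np.asarray(x, dtype=float)
    return np.array([1.0 / np.prod(x[j] - np.delete(x, j))
                     for j in range(len(x))])

def barycentric_eval(x, f, w, t):
    """Second barycentric formula: O(n) per point t (t off the nodes)."""
    terms = w / (t - np.asarray(x, dtype=float))
    return np.dot(terms, f) / np.sum(terms)
```

At a node the formula is a $0/0$ limit, so a full implementation should detect $t = x_j$ and return $f_j$ directly; the sketch above assumes $t$ is off the nodes.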

First vs. Second Barycentric Formula

The same derivation produces a first barycentric formula that we mostly skip but is worth naming:

$$p_n(x) = \ell(x) \sum_{j=0}^n \frac{\lambda_j}{x - x_j}\, f_j, \qquad \ell(x) = \prod_{k=0}^n (x - x_k).$$

Both formulas evaluate the same polynomial; they differ only in numerical behaviour, and the difference matters at the boundary of where they apply.

In this chapter we work exclusively with Chebyshev nodes and evaluate on $[-1,1]$, so the second formula is the right default.

A Concrete Example

Here is the Lagrange interpolant of $f(x) = e^{\sin 5x}$ through $n+1$ nodes on $[-1,1]$, evaluated via the barycentric formula from Proposition 1. The left panel uses equispaced nodes; the right panel uses the Chebyshev nodes $x_j = \cos(j\pi/n)$, properly motivated in Where to Place the Nodes. For this smooth $f$ both work fine; the visual difference becomes important once $f$ is harder.

*Figure: the interpolant of $f(x) = e^{\sin 5x}$ at equispaced (left) and Chebyshev (right) nodes.*

The interpolant passes through every data point by construction, and as $n$ grows it tracks $f$ more and more closely across the interval. Whether this convergence continues as $n \to \infty$, and how fast, depends on both $f$ and the choice of nodes. We take that up in the next two sections.
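The computation behind such an experiment can be sketched as follows, assuming NumPy; `chebyshev_interp_error` is an illustrative helper that measures the maximum error of the barycentric interpolant at the Chebyshev nodes $x_j = \cos(j\pi/n)$ on a fine grid:

```python
import numpy as np

def f(x):
    return np.exp(np.sin(5 * x))

def chebyshev_interp_error(n):
    """Max error on a fine grid of the degree-n Chebyshev interpolant of f."""
    x = np.cos(np.arange(n + 1) * np.pi / n)          # Chebyshev nodes
    lam = np.array([1.0 / np.prod(x[j] - np.delete(x, j))
                    for j in range(n + 1)])           # barycentric weights
    t = np.linspace(-0.999, 0.999, 2000)              # grid avoiding the nodes
    terms = lam / (t[:, None] - x)                    # shape (2000, n+1)
    p = terms @ f(x) / terms.sum(axis=1)              # second barycentric formula
    return np.max(np.abs(p - f(t)))

for n in (10, 20, 40):
    print(n, chebyshev_interp_error(n))
```

The printed errors shrink rapidly as $n$ grows, consistent with the convergence behaviour taken up in the next sections.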

References
  1. Björck, Å., & Pereyra, V. (1970). Solution of Vandermonde systems of equations. Mathematics of Computation, 24(112), 893–903.