Numerical Integration - Introduction to Scientific Computing

The Trapezoidal Rule¶

The simplest approach: approximate $f(x)$ by a straight line and integrate that.

Single Interval¶

Local Error Analysis¶

What is the error of this approximation? Let $h = b - a$ and use Taylor’s theorem.

Proof 1

Expand $f(x)$ around the midpoint $c = (a+b)/2$ using Taylor’s theorem:

f(x) = f(c) + f'(c)(x - c) + \frac{f''(\eta(x))}{2}(x - c)^2

(3)

Integrating from $a$ to $b$ :

\int_a^b f(x)\,dx = hf(c) + \frac{f''(\eta)}{2}\int_a^b (x-c)^2\,dx = hf(c) + \frac{h^3}{24}f''(\eta)

(4)

where we used that $\int_a^b (x-c)\,dx = 0$ by symmetry, and $\int_a^b (x-c)^2\,dx = h^3/12$ .

For the trapezoidal approximation, expand $f(a)$ and $f(b)$ around $c$ :

f(a) = f(c) - \frac{h}{2}f'(c) + \frac{h^2}{8}f''(\xi_1)

(5)

f(b) = f(c) + \frac{h}{2}f'(c) + \frac{h^2}{8}f''(\xi_2)

(6)

Adding:

\frac{h}{2}(f(a) + f(b)) = hf(c) + \frac{h^3}{16}\cdot\frac{f''(\xi_1) + f''(\xi_2)}{2}

(7)

Taking the difference and using the intermediate value theorem to combine the $f''$ terms:

\int_a^b f(x)\,dx - \frac{h}{2}(f(a) + f(b)) = -\frac{h^3}{12}f''(\xi)

(8)

for some $\xi \in (a, b)$ .

Key observation: The local error is $O(h^3)$ —cubic in the interval width.

Composite Trapezoidal Rule¶

For better accuracy, divide $[a, b]$ into $n$ subintervals of equal width $h = (b-a)/n$ , with nodes $x_k = a + kh$ for $k = 0, 1, \ldots, n$ .

From Local to Global Error¶

The global error is the total error when approximating the integral over $[a, b]$ .

Proof 2

On each subinterval $[x_{k-1}, x_k]$ , the local error is:

\int_{x_{k-1}}^{x_k} f(x)\,dx - \frac{h}{2}(f(x_{k-1}) + f(x_k)) = -\frac{h^3}{12}f''(\xi_k)

(11)

for some $\xi_k \in (x_{k-1}, x_k)$ .

Summing over all $n$ subintervals:

\int_a^b f(x)\,dx - T_n(f) = -\frac{h^3}{12}\sum_{k=1}^{n} f''(\xi_k)

(12)

Since $f'' \in C[a,b]$ , the sum $\frac{1}{n}\sum_{k=1}^{n} f''(\xi_k)$ lies between $\min f''$ and $\max f''$ . By the intermediate value theorem, there exists $\xi \in (a, b)$ such that:

\frac{1}{n}\sum_{k=1}^{n} f''(\xi_k) = f''(\xi)

(13)

Therefore:

\int_a^b f(x)\,dx - T_n(f) = -\frac{h^3}{12} \cdot n \cdot f''(\xi) = -\frac{h^3 n}{12}f''(\xi)

(14)

Since $n = (b-a)/h$ :

\int_a^b f(x)\,dx - T_n(f) = -\frac{(b-a)h^2}{12}f''(\xi)

(15)

Understanding Local vs Global Error¶

Error Type	Definition	Trapezoidal Rule
Local	Error on one subinterval of width $h$	$O(h^3)$
Global	Total error over $[a, b]$	$O(h^2)$

Why does the order drop from 3 to 2?

The global error accumulates local errors from $n \sim 1/h$ subintervals:

\text{Global error} \sim n \times \text{Local error} \sim \frac{1}{h} \times h^3 = h^2

(16)

This is the typical pattern: global order = local order − 1.

Remark 1 (Python Implementation)

def trapezoidal(f, a, b, n):
    """Composite trapezoidal rule with n subintervals."""
    h = (b - a) / n
    x = np.linspace(a, b, n + 1)
    y = f(x)
    return h * (0.5 * y[0] + np.sum(y[1:-1]) + 0.5 * y[-1])

Higher-Order Methods¶

By using higher-degree polynomial approximations, we can achieve better accuracy.

Simpson’s rule uses a quadratic polynomial through three points $(a, f(a))$ , $(m, f(m))$ , $(b, f(b))$ where $m = (a+b)/2$ :

\int_a^b f(x)\,dx \approx \frac{h}{6}\left(f(a) + 4f(m) + f(b)\right)

(17)

where $h = b - a$ . This has local error $O(h^5)$ and global error $O(h^4)$ —two orders better than trapezoidal.

Even higher-order Newton-Cotes formulas exist (using more equally-spaced points), though they become unstable for high orders. The optimal approach—Gaussian quadrature—chooses both the nodes and weights optimally and achieves remarkable efficiency. We will explore this in the chapter on interpolation.

Why Integration is Easier Than Differentiation¶

Remark 2 (Smoothing vs. Roughening)

Operation	Error behavior
Differentiation	Errors amplify (dividing by small $h$ )
Integration	Errors average out (summing many terms)

This is why numerical integration is generally more stable than numerical differentiation. Integration “smooths,” differentiation “roughens.”

From the perspective of conditioning:

Differentiation amplifies high-frequency noise
Integration damps high-frequency components

This is why we can often integrate noisy data reliably, but differentiating noisy data is notoriously difficult.

Summary¶

Rule	Local Error	Global Error
Trapezoidal	$O(h^3)$	$O(h^2)$
Simpson’s	$O(h^5)$	$O(h^4)$

Key principle: Global order = local order − 1, because we sum $O(1/h)$ local errors.