We know what linear functions are. A function f is linear iff it satisfies additivity f(x + y) = f(x) + f(y) and homogeneity f(ax) = af(x).
We know what continuity is. A function f is continuous iff for all ε there exists a δ such that if |x − x0| < δ, then |f(x) − f(x0)| < ε.
An equivalent way to think about continuity is the sequence criterion: f is continuous iff a sequence (xk) converging to x implies that (f(xk)) converges to f(x). That is to say, if for all ε there exists an N such that if k ≥ N, then |xk − x| < ε, then for all ε, there also exists an M such that if k ≥ M, then |f(xk) − f(x)| < ε.
Sometimes people talk about discontinuous linear functions. You might think: that’s crazy. I’ve seen many linear functions in my time, and they were definitely all continuous. f(x): ℝ → ℝ := ax is continuous for any a ∈ ℝ. T(x⃗): ℝ² → ℝ² := is continuous no matter what the entries in the matrix are. Stop being crazy!!
Actually, it’s not crazy. It’s just that all the discontinuous linear functions live in infinite-dimensional spaces.
Take, say, the space C1(ℝ) of continuously differentiable functions ℝ → ℝ with the uniform norm. (The uniform norm means that the “size” of a function for the purposes of continuity is the least upper bound of its absolute value.) If you think of a vector in the n-dimensional ℝn as a function from {1...n} to ℝ, then you can see why a function from a continuous (not even countable) domain would be infinite-dimensional.
Consider the sequence of functions (fk) = in C1(ℝ). The sequence converges to the zero function: for any ε, we can take
and then
.
Now consider that the sequence of derivatives is , which doesn’t converge. But the function D: C1(ℝ) → C0(ℝ) that maps a function to its derivative is linear. (We have additivity because the derivative of a sum is the sum of the derivatives, and we have homogeneity because you can “pull out” a constant factor from the derivative.)
By exhibiting a function D and a sequence (fk) for which (fk) converges but (D(fk)) doesn’t, we have shown that the derivative mapping D is a discontinuous linear function, because the sequence criterion for continuity is not satisfied. If you know the definitions and can work with the definitions, it’s not crazy to believe in such a thing!
The infinite-dimensionality is key to grasping the ultimate sanity of what would initially have appeared crazy. One way to think about continuity is that a function not having any “jumps” implies that it can’t have an “infinitely steep” slope: a small change in the input can’t correspond to an arbitrarily large change in the output.
Consider a linear transformation T on a finite-dimensional vector space; for simplicity of illustration, suppose it’s diagonalizable with eigenbasis {u⃗j} and eigenvalues {λj}. Then for input x⃗ = Σj cju⃗j, we have T(x⃗) = Σj cjλju⃗j: the eigencoördinates of the input get multiplied by the eigenvalues, so the amount that the transformation “stretches” the input is bounded by maxj |λj|. The linearity buys us the “no arbitrarily large change in the output” property which is continuity.
In infinite dimensions, linearity doesn’t buy that. Consider the function T(x1, x2, x3, …) = (x1, 2x2, 3x3, ...) on sequences with finitely many nonzero elements, under the uniform norm. The effect of the transformation on any given dimension is linear and bounded, but there’s always another dimension that’s getting stretched more. A small change in the input can result in an arbitrarily large change in the output, by making the change sufficiently far in the sequence (where the input is getting stretched more and more).
Leave a Reply