Vector and Matrix Norms
A norm measures the size of a vector or matrix. Different norms encode different notions of “size” (Euclidean length, max coordinate, sum of magnitudes, …) and which one to use is usually dictated by the geometry of the problem.
Vector norms
Section titled “Vector norms”A function that satisfies (1) and (2) but not (3) is sometimes called a quasi-norm. The third condition is what makes a norm geometrically well-behaved: it forces the unit ball to be convex.
The family
Section titled “The ℓp\ell^pℓp family”For and , define
Three values of get their own names:
- : the Manhattan or taxicab norm, .
- : the Euclidean norm, .
- : the sup or Chebyshev norm, (this is the limit of as ).
The condition is essential. For the formula is still defined, but the triangle inequality fails and the unit ball becomes non-convex.
Unit balls
Section titled “Unit balls”The shape of the unit ball is the cleanest visual handle on how the norm depends on . In :
- : diamond (square rotated 45°), vertices at .
- : circle of radius 1.
- : axis-aligned square, vertices at .
- : a convex shape interpolating between the diamond and the square.
- : a concave four-pointed star.
The diagonal extent (the distance from the origin to the ball boundary along the line ) is , plotted live above. It interpolates from at to at to at , and shrinks rapidly toward for . The faint dashed circle is the ball for reference.
Equivalence of norms
Section titled “Equivalence of norms”A central fact in finite-dimensional analysis: all norms on are equivalent.
Concretely, the norms satisfy
so convergence in any one norm is convergence in every other. The constants degrade with dimension, which is exactly why “switching norms” is innocuous in low dimensions but matters in high-dimensional statistics.
Matrix norms
Section titled “Matrix norms”A matrix norm is just a norm on the vector space of matrices. There are two flavors that dominate practice.
Frobenius norm
Section titled “Frobenius norm”Treat the matrix as a long vector and apply the Euclidean norm:
where are the singular values of . The last form is the most useful in proofs: it shows that depends only on the singular value spectrum, not on the bases in the SVD. In particular, the Frobenius norm is invariant under orthogonal transformations on either side: when are orthogonal.
Operator (induced) norms
Section titled “Operator (induced) norms”Given a vector norm , the induced operator norm of a matrix is
This measures the worst-case stretching of the unit ball under . Three cases have clean closed forms:
- (maximum column sum).
- (maximum row sum).
- (largest singular value). Also called the spectral norm.
The spectral norm and the Frobenius norm coincide for rank-1 matrices but differ in general. They satisfy
where is the rank of . The upper bound is tight when all nonzero singular values are equal; the lower bound is tight when the spectrum is dominated by a single mode.
Submultiplicativity
Section titled “Submultiplicativity”All induced operator norms (and the Frobenius norm) satisfy
which is exactly the property needed for bounding products and powers. For the spectral norm this follows from .
Submultiplicativity is the reason matrix norms are useful in numerical analysis: error bounds compound as products of operator norms, so a small per-step bound on a transformation gives a small bound on the composition.