Eigenvalues and Eigenvectors
Naturally, eigenvalues are defined only for square matrices because for non-square matrices, the transformation changes the dimensions of the resulting vector, hence for any pair . An matrix has at most linearly independent eigenvectors, but it need not have that many. A matrix with linearly independent eigenvectors is called diagonalizable; matrices that fall short (e.g., a Jordan block) are called defective.
Geometric interpretation
Section titled “Geometric interpretation”One intuitive way to understand eigenvectors is to look at any matrix as a transformation being applied to a vector . A transformation can rotate and/or scale a vector. There’s a very nice video about this by Grant Sanderson where he explains this idea with beautiful visuals. In essence, eigenvectors of a matrix (transformation) are those vectors that are not rotated, but only scaled without changing the direction. The magnitude of that scale is what we call the eigenvalue.
Interactive: The directions that don’t rotate
Section titled “Interactive: The directions that don’t rotate”For a matrix
the faint gray arrows are 12 unit vectors evenly spaced around the unit circle. The accent-colored arrows show their images under multiplication by . Most input directions get rotated and scaled; the eigenvectors are the (at most two) directions for which the output points the same way as the input. Drag the sliders to find them.
When the discriminant goes negative, the eigenvalues become a complex conjugate pair and no real direction is preserved (the transformation has a genuine rotational component).
Properties
Section titled “Properties”The key property of eigenvectors can be seen by looking at for some positive integer . Let’s say is an eigenvector of with eigenvalue . Then we have
Thus, if is an eigenvector of with eigenvalue then is also an eigenvector of with eigenvalue . This also tells us another key fact about matrices:
If is diagonalizable, its linearly independent eigenvectors form a basis for (or when the eigenvectors can be chosen real), so any vector can be expressed as a linear combination of the eigenvectors :
Similarity
Section titled “Similarity”A matrix is said to be similar to if for some invertible matrix .
This simple result is very useful in computation. For example, a software like MATLAB will use a sequence of matrices and reduce to where and for . This sequence can be chosen carefully so as to make a diagonal matrix, hence, the eigenvalues show up on the diagonal. This helps us calculate the eigenvalues of a matrix much faster.
For a matrix , the eigenspace associated with a set of eigenvalues is defined to be the subspace spanned by the eigenvectors associated with those eigenvalues.
Some obvious but important facts are as follows:
- The sum of the eigenvalues of is , which is the sum of the elements on the diagonal.
- The product of the eigenvalues of is .
- In general, eigenvalues of or cannot be inferred directly from eigenvalues of and .
The first two facts can be proved easily using the characteristic polynomial
We know that the sum of the roots of such a polynomial is determined by the coefficient of , being the degree of the polynomial. Also, the constant term in the polynomial determines the product of the roots.
Symmetric matrices
Section titled “Symmetric matrices”If is a symmetric matrix, then
- Eigenvalues of are real if is real.
- Eigenvectors of corresponding to distinct eigenvalues are orthogonal, and an orthonormal basis of eigenvectors can always be chosen (even when eigenvalues are repeated, by picking an orthonormal basis within each eigenspace).
- We always have a full set of eigenvectors, even when some eigenvalues are repeated. For example, consider the identity matrix. It has only one eigenvalue, , but every vector is an eigenvector.
Let be an symmetric matrix with eigenvalues . If we consider the matrix
we see that and are similar matrices. This means that there must be an invertible such that (equivalently, ). It is not hard to see that is the eigenvector matrix (columns of are eigenvectors of ), and we have the spectral decomposition , where we use for when the eigenvectors are chosen orthonormal so that .
General matrices
Section titled “General matrices”For a diagonalizable square matrix (which may not be symmetric), we can factorize it as
where is the eigenvector matrix and is the diagonal eigenvalue matrix. This factorization is another way to look at the fact that the eigenvectors of powers of are the same as those of , with correspondingly exponentiated eigenvalues. It is only when is symmetric (more generally, normal) that we can use , since the eigenvectors are orthogonal in that case. A non-diagonalizable (defective) matrix does not admit this factorization, and one has to use the more general Jordan normal form instead.