Skip to content

Positive Definite Matrices

A symmetric matrix S\Sv is positive definite (PD), if it has the following equivalent properties. Positive definiteness is a notion of thinking about symmetric matrices as positive or negative in the same sense as one would think about a real number.

  • All eigenvalues of S\Sv are positive, i.e., λi(S)>0\lambda_i(\Sv) > 0.
  • For any vector x0\xv \neq 0, the energy contained in x\xv for S\Sv is positive, i.e.,
xTSx>0.\xv^{\rm T}\Sv\xv > 0.
  • All leading determinants of S\Sv are >0>0. Leading determinants are determinants of all square sub-matrices of S\Sv with the top-left corner fixed.
  • All pivots in the Gaussian elimination are >0>0.
  • S\Sv can be factored as S=ATA\Sv = \Av^{\rm T}\Av, where A\Av has independent columns.

For a matrix S\Sv, the energy in a vector x\xv is defined to be xTSx\xv^{\rm T}\Sv\xv. For example, consider

x=(ab),S=(3446).\begin{aligned} \xv=\begin{pmatrix}a\\b\end{pmatrix}, & & \Sv=\begin{pmatrix} 3 & 4\\ 4 & 6 \end{pmatrix} \end{aligned}.

The corresponding energy is f(a,b)=xTSx=3a2+6b2+8abf(a,b)=\xv^{\rm T}\Sv\xv=3a^2 + 6b^2 + 8ab. Below is the plot for f(a,b)f(a,b) showing that energy >0>0 for all aa and bb. Thus, the matrix S\Sv in this example is positive definite.

Energy in a vector for S.

On a side note, this can be seen like a loss function that one may try to minimize using something simple like gradient descent.

If two matrices A\Av and B\Bv are PD, then so are the matrices A+B\Av+\Bv and A1\Av^{-1}. This can be easily seen by the positive energy property and the fact that if λ\lambda is an eigenvalue of A\Av then 1/λ1/\lambda is an eigenvalue of A1\Av^{-1}.

Positive semi-definiteness (PSD) is analogous to being non-negative. A matrix is PSD if it satisfies the above properties with a minor change: replace >0>0 with 0\ge 0 in the first four properties, and in the last property, we may have dependent columns in the factor A\Av.