This is a tutorial made solely for the purpose of education and it was designed
for students taking Applied Mathematics 0340. It is primarily for students who
have some experience using Mathematica. If you have never used
Mathematica before and would like to learn more of the basics of this
computer algebra system, it is strongly recommended that you look at the APMA
0330 tutorial. As a friendly reminder, don't forget to clear variables in use and/or the kernel.
Finally, the commands in this tutorial are all written in bold black font,
while Mathematica output is in regular font. This means that you can
copy and paste all commands into Mathematica, change the parameters, and
run them. You, as the user, are free to use the scripts
for your needs in learning how to use the Mathematica program, and have the
right to distribute and refer to this tutorial as long as
it is credited appropriately. The tutorial accompanies the
textbook Applied Differential Equations.
The Primary Course by Vladimir Dobrushkin, CRC Press, 2015; http://www.crcpress.com/product/isbn/9781439851043
SVD factorization
One of the most fruitful ideas in the theory of matrices is that of a matrix decomposition or canonical form. The theoretical
utility of matrix decompositions has long been appreciated. Previously, we have discussed LU-decomposition
and QR-factorization for rectangular m-by-n matrices. For a square
\( n \times n \) matrix, we know even more canonical forms:
- An eigenvalue decomposition (abbreviated EVD) of a symmetric (self-adjoint) matrix
\[
{\bf A} = {\bf P}\,{\bf \Lambda}\,{\bf P}^{\mathrm T} \qquad \mbox{or} \qquad {\bf A} = {\bf P}\,{\bf \Lambda}\,{\bf P}^{\ast} ,
\]
where P is an n-by-n orthogonal (unitary) matrix whose columns are eigenvectors of A,
and Λ is the diagonal matrix whose diagonal entries are the eigenvalues corresponding to the column vectors of P.
- A Hessenberg decomposition of a square matrix with real entries, due to the German electrical engineer Karl Hessenberg (1904--1959),
\[
{\bf A} = {\bf P}\,{\bf H}\,{\bf P}^{\mathrm T} ,
\]
in which P is an orthogonal matrix and H is in upper Hessenberg form:
\( {\bf H} = \begin{bmatrix} x&x & \cdots &x&x&x \\ x&x& \cdots &x&x&x \\ 0&x& \cdots&x&x&x \\ \vdots&\vdots & \ddots &\vdots& \vdots& \vdots \\
0&0& \cdots &x&x&x \\ 0&0& \cdots &0&x&x \end{bmatrix} . \)
- Schur's decomposition for an n-by-n matrix A with real entries and
real eigenvalues: \( {\bf A} = {\bf P}\,{\bf S}\,{\bf P}^{\mathrm T} , \) where P is
an orthogonal matrix such that \( {\bf P}^{\mathrm T} {\bf A} \, {\bf P} = \begin{bmatrix}
\lambda_1 & x & x& \cdots & x \\ 0&\lambda_2 & x & \cdots & x \\ 0&0&\lambda_3& \cdots & x \\ \vdots&\vdots& \vdots & \ddots & \vdots \\
0&0&0& \cdots & \lambda_n \end{bmatrix} , \) in which \( \lambda_1 , \lambda_2 , \ldots , \lambda_n \)
are the eigenvalues of A repeated according to multiplicity.
- Cholesky decomposition of a positive definite matrix: \( {\bf A} = {\bf L}\,{\bf L}^{\ast} , \)
where L is a lower triangular matrix. (Mathematica's built-in commands for these factorizations are illustrated below.)
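For reference, Mathematica has built-in commands for the canonical forms listed above. A minimal sketch, assuming a small symmetric positive definite sample matrix chosen only for illustration:
M = {{4., 1., 0.}, {1., 3., 1.}, {0., 1., 2.}};  (* sample symmetric positive definite matrix *)
{evals, evecs} = Eigensystem[M]   (* EVD: M == Transpose[evecs].DiagonalMatrix[evals].evecs up to rounding *)
{p, h} = HessenbergDecomposition[M]   (* M == p.h.ConjugateTranspose[p] with h upper Hessenberg *)
{q, t} = SchurDecomposition[M]   (* M == q.t.ConjugateTranspose[q] with t (quasi-)upper triangular *)
low = ConjugateTranspose[CholeskyDecomposition[M]]   (* Mathematica returns the upper factor, so low is lower triangular and low.ConjugateTranspose[low] == M *)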
Of many useful decompositions, the singular value decomposition---that is,
the factorization of an m-by-n matrix A into the product \( {\bf U}\, {\bf \Sigma} \,{\bf V}^{\ast} \) of a unitary (or orthogonal) \( m \times m \) matrix U, an \( m \times n \) 'diagonal' matrix \( {\bf \Sigma} , \) and another unitary \( n \times n \) matrix \( {\bf V}^{\ast} \) ---has assumed a special role. There are several reasons. In the first place,
the fact that the decomposition is achieved by unitary matrices makes it an ideal vehicle for discussing the geometry
of n-space. Second, it is stable; small perturbations in A correspond to small perturbations in \( {\bf \Sigma} .\) Third, the diagonality of \( {\bf \Sigma} \) makes it easy to manipulate and to solve equations.
It is an intriguing observation that most of the classical matrix decompositions predated the widespread use of
matrices. In 1873, the Italian mathematician Eugenio Beltrami (1835--1900) published the first paper on the SVD, which was
followed by the work of Camille Jordan in 1874, whom we may consider a codiscoverer. Later James Joseph Sylvester (1814--1897),
Erhard Schmidt (1876--1959), and Hermann Weyl (1885--1955) were responsible for establishing the existence of the
singular value decomposition (SVD) and developing its theory. The term ``singular value'' seems to have come from the literature on integral equations.
The British-born American mathematician Harry Bateman (1882--1946) used it in a research paper published in 1908.
Let A be an arbitrary real- or complex-valued \( m \times n \) matrix. Then
- \( {\bf A}^{\ast} {\bf A} \) and \( {\bf A} \, {\bf A}^{\ast} \) are
both self-adjoint (symmetric in the real case) positive semidefinite matrices of dimensions \( n \times n \) and \( m \times m ,\) respectively.
- The equality \( {\bf A} = {\bf U}\,{\bf \Sigma}\, {\bf V}^{\ast} = {\bf U}\,{\bf \Sigma}\, {\bf V}^{-1} \) is called the singular value decomposition (or SVD) of
A, where V is a unitary (orthogonal in the real-valued case) \( n \times n \) matrix, U
is a unitary/orthogonal \( m \times m \) matrix, and \( {\bf \Sigma} = {\bf U}^{\ast} {\bf A}\, {\bf V} \)
is a diagonal \( m \times n \) matrix, with \( {\bf \Sigma} = \mbox{diag}( \sigma_1 , \sigma_2 , \ldots , \sigma_p ), \) where
\( p = \min (m,n) \) and the nonnegative numbers \( \sigma_1 \ge \sigma_2 \ge \cdots \ge\sigma_p \ge 0 \) are
the singular values of A.
- If r is the rank of A, then A has exactly r positive singular values
\( \sigma_1 \ge \sigma_2 \ge \cdots \ge\sigma_r > 0 \) with \( \sigma_{r+1} = \sigma_{r+2} = \cdots = \sigma_p =0 . \)
- Any real \( m \times n \) matrix A can be expressed as a finite sum of
rank 1 matrices in normalized form, that is, \( {\bf A} = \sigma_1 {\bf R}_1 + \sigma_2 {\bf R}_2
+ \cdots + \sigma_r {\bf R}_r , \) where \( {\bf R}_i = {\bf u}_i {\bf v}_i^{\ast} = {\bf u}_i \otimes {\bf v}_i , \)
\( {\bf u}_i \) is the i-th column of U and a unit eigenvector of \( {\bf A}\,{\bf A}^{\ast} , \) and
\( {\bf v}_i \) is the i-th column of V and a unit eigenvector of
\( {\bf A}^{\ast} {\bf A} . \)
- The pseudoinverse \( {\bf A}^{\dagger} \) of a matrix with SVD \( {\bf A} = {\bf U}\,{\bf \Sigma}\, {\bf V}^{\ast} \) is the matrix
\( {\bf A}^{\dagger} = {\bf V} {\bf \Sigma}^{\dagger} {\bf U}^{\ast} \) with
\[
{\bf \Sigma}^{\dagger} = \mbox{diag} \left( \frac{1}{\sigma_1} , \cdots , \frac{1}{\sigma_r} , 0, \cdots , 0\right) \in \mathbb{R}^{n\times m} .
\]
(These properties can be checked in Mathematica, as sketched below.)
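The commands behind these properties are all built in. A minimal sketch, using a hypothetical 2-by-3 sample matrix chosen only for illustration:
a = N[{{2, 0, 1}, {0, 1, 0}}];   (* sample 2-by-3 matrix *)
{u, s, v} = SingularValueDecomposition[a];   (* a == u.s.ConjugateTranspose[v] *)
SingularValueList[a]   (* the nonzero singular values, in decreasing order *)
MatrixRank[a]   (* the number of positive singular values *)
Chop[PseudoInverse[a] - v.PseudoInverse[s].ConjugateTranspose[u]]   (* zero matrix: the Moore-Penrose inverse equals V Σ† U* *)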
Theorem: If A is an \( m \times n \) matrix with real entries, then
- A and \( {\bf A}^{\mathrm T} {\bf A} \) have the same null space (kernel).
- A and \( {\bf A}^{\mathrm T} {\bf A} \) have the same row space.
- \( {\bf A}^{\mathrm T} \) and \( {\bf A}^{\mathrm T} {\bf A} \) have the same column space.
- A and \( {\bf A}^{\mathrm T} {\bf A} \) have the same rank.
Theorem: If A is an \( m \times n \) real matrix, then
- \( {\bf A}^{\mathrm T} {\bf A} \) is orthogonally diagonalizable: \( {\bf V}^{\mathrm T} \left(
{\bf A}^{\mathrm T} {\bf A} \right) {\bf V} = \mbox{diag} \left( \sigma_1^2 , \ldots , \sigma_n^2 \right) \) for some orthogonal matrix V.
- The eigenvalues \( \{ \sigma_1^2 , \ldots , \sigma_n^2 \} \) of \( {\bf A}^{\mathrm T} {\bf A} \) are nonnegative.
- The column vectors of U are the eigenvectors of the symmetric matrix \( {\bf A}\,{\bf A}^{\mathrm T} . \) ■
If A is a square matrix, then there is no simple relation between the eigenvalues of A and
those of \( {\bf A}^{\mathrm T} {\bf A} , \) as the following example shows.
Example. Consider the matrix
\[
{\bf A} = \begin{bmatrix} 4&2&5 \\ 0&16&0 \\ 0&14&9 \end{bmatrix}
\]
that we met previously. This matrix A has three distinct positive (integer) eigenvalues \(
\lambda = 16, 9, 4 . \) Two symmetric matrices
\[
{\bf B}_1 = {\bf A}^T {\bf A} = \begin{bmatrix} 16&8&20 \\ 8&456&136 \\ 20&136&106 \end{bmatrix} \quad \mbox{and} \quad
{\bf B}_2 = {\bf A} \, {\bf A}^T = \begin{bmatrix} 45&32&73 \\ 32&256&224 \\ 73&224&277 \end{bmatrix}
\]
share the same positive irrational eigenvalues (but have different eigenvectors):
\[
\lambda_1 \approx 503.039 , \qquad \lambda_2 \approx 64.7801, \qquad \lambda_3 \approx 10.1813 .
\]
We can use a standard Mathematica command to obtain the SVD of the given matrix:
A = {{4, 2, 5}, {0, 16, 0}, {0, 14, 9}};
{U, S, V} = SingularValueDecomposition[N[A]];
Diagonal[S]   (* the singular values: the square roots of the eigenvalues above, approximately 22.43, 8.05, and 3.19 *)
Definition: If A is an \( m \times n \) matrix,
and if \( \lambda_1 , \lambda_2 , \ldots , \lambda_n \) are the eigenvalues of
\( {\bf A}^{\mathrm T} {\bf A} , \) then the numbers
\[
\sigma_1 = \sqrt{\lambda_1} , \quad \sigma_2 = \sqrt{\lambda_2} , \quad \ldots \quad , \quad \sigma_n = \sqrt{\lambda_n}
\]
are called the singular values of A. In other words, a nonnegative real
number σ is a singular value for an m-by-n matrix A if and only if there exist
unit-length vectors \( {\bf u} \in \mathbb{R}^m \quad\mbox{and} \quad {\bf v} \in \mathbb{R}^n \)
such that \( {\bf A}{\bf v} = \sigma {\bf u} \) and \( {\bf A}^{\mathrm T}{\bf u} = \sigma {\bf v} . \) The vectors u and
v are called left singular and right singular vectors for σ, respectively. ■
Theorem: Let A be an m-by-n matrix with
complex or real entries. Then its largest singular value equals the Euclidean 2-norm: \( \max
\{ \sigma_i \} = \| {\bf A} \|_2 = \max_{\| {\bf x} \| =1} \| {\bf A} {\bf x} \| , \)
and the square root of the sum of squares of the singular values equals the
Frobenius norm: \( \sum_i \sigma_i^2 = \| {\bf A} \|_F^2 =
\sum_{i,j} |a_{i,j} |^2 = \mbox{trace} \left( {\bf A}^{\ast} {\bf A} \right) . \) ■
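Both norms are available directly in Mathematica, so the theorem is easy to check numerically; a minimal sketch on the same hypothetical sample matrix:
a = N[{{2, 0, 1}, {0, 1, 0}}];   (* sample matrix *)
Norm[a]   (* the 2-norm, i.e. the largest singular value *)
Max[SingularValueList[a]]   (* the same value *)
Norm[a, "Frobenius"]   (* the Frobenius norm *)
Sqrt[Total[SingularValueList[a]^2]]   (* equals the Frobenius norm; zero singular values contribute nothing *)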
There are two known forms of the singular value decomposition---the brief (or reduced) version and the expanded form:
Theorem: (Singular Value Decomposition in expanded form)
If A is an \( m \times n \) matrix of rank r, then
A can be factored as
\[
{\bf A} = \left[ {\bf u}_1 \ {\bf u}_2 \ \cdots \ {\bf u}_r \vert {\bf u}_{r+1} \ \cdots \ {\bf u}_m \right]
\left[ \begin{array}{cccc|c} \sigma_1 & 0 & \cdots & 0 & \\ 0& \sigma_2 & \cdots & 0 & \\
\vdots & \vdots & \ddots & \vdots & {\bf 0}_{r\times (n-r)} \\ 0&0& \cdots & \sigma_r & \\ \hline \\
&& {\bf 0}_{(m-r)\times r} & &{\bf 0}_{(m-r)\times (n-r)} \end{array} \right] \begin{bmatrix} {\bf v}_1^{\mathrm T} \\ \vdots \\ {\bf v}_r^{\mathrm T} \\ \hline {\bf v}_{r+1}^{\mathrm T}
\\ \vdots \\ {\bf v}_n^{\mathrm T} \end{bmatrix}
\]
in which U, Σ, and V have sizes \( m \times m , \)
\( m \times n , \) and \( n \times n ,\) respectively, and in which:
- \( {\bf V} = \left[ {\bf v}_1 \ {\bf v}_2 \ \cdots \ {\bf v}_n \right] \) orthogonally diagonalizes
\( {\bf A}^{\mathrm T} {\bf A} = {\bf V} \mbox{diag} (\sigma_i^2 ) \,{\bf V}^{\mathrm T} .\) Column vectors in V are usually called right singular vectors of A.
- The nonzero diagonal entries of Σ are \( \sigma_1 = \sqrt{\lambda_1} ,
\ \sigma_2 = \sqrt{\lambda_2} , \ldots , \sigma_r = \sqrt{\lambda_r} , \) where \( \lambda_1 ,
\lambda_2 , \ldots , \lambda_r \) are the nonzero eigenvalues of ATA
corresponding to the column vectors of V.
- The column vectors of V are ordered so that \( \sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_r >0. \)
- \( {\bf u}_i = \frac{{\bf A} {\bf v}_i}{\| {\bf A} {\bf v}_i \|} = \frac{1}{\sigma_i} \, {\bf A}\,{\bf v}_i
\quad (i=1,2,\ldots , r) . \)
- The columns \( {\bf u}_1 , {\bf u}_2 , \ldots , {\bf u}_r \)
form an orthonormal basis for the column space of A. Column vectors in U are usually called left singular vectors of A.
- \( \left[ {\bf u}_1 , {\bf u}_2 , \ldots , {\bf u}_r , {\bf u}_{r+1} , \ldots , {\bf u}_m \right] \)
is an extension of \( \left[ {\bf u}_1 , {\bf u}_2 , \ldots , {\bf u}_r \right] \) to
an orthonormal basis for \( {\mathbb R}^m . \) ■
When the rank r of an \( m \times n \) matrix A is less than
\( \min \{ m, n \} , \) most entries of the 'diagonal'
\( m \times n \) matrix Σ are zeroes, except the singular values
\( \sigma_1 , \ldots , \sigma_r . \) Therefore, when this matrix is multiplied from
the left by U and from the right by \( {\bf V}^{\mathrm T} , \) the columns \(
{\bf u}_{r+1} , \ldots , {\bf u}_m \) and \( {\bf v}_{r+1} , \ldots , {\bf v}_n \)
do not contribute to the final matrix. So we can drop them and represent A in the reduced form, as the
following theorem states.
Let r be the rank of an m-by-n matrix A. Then \( \sigma_{r+1} =
\sigma_{r+2} = \cdots = \sigma_p = 0 , \) where \( p = \min (m,n) . \) Partition \( {\bf U} = \left[ {\bf U}_1 \ {\bf U}_2 \right] \)
and \( {\bf V} = \left[ {\bf V}_1 \ {\bf V}_2 \right] , \) where
\( {\bf U}_1 = \left[ {\bf u}_1 \ , \ldots , \ {\bf u}_r \right] \) and
\( {\bf V}_1 = \left[ {\bf v}_1 \ , \ldots , \ {\bf v}_r \right] \) have r columns.
Then with \( {\bf \Sigma}_r = \mbox{diag}(\sigma_1 , \ldots , \sigma_r ) : \)
\begin{align*}
{\bf A} &= \left[ {\bf U}_1 \ {\bf U}_2 \right] \left( \begin{array}{cc} {\bf \Sigma}_r & {\bf 0}_{r\times (n-r)} \\
{\bf 0}_{(m-r)\times r} & {\bf 0}_{(m-r) \times (n-r)} \end{array} \right) \left[ {\bf V}_1 \ {\bf V}_2 \right]^{\mathrm T} \qquad\mbox{full SVD}
\\
&= {\bf U}_1 {\bf \Sigma}_r {\bf V}_1^{\mathrm T} = \sum_{i=1}^r \sigma_i {\bf u}_i \otimes {\bf v}_i \qquad \mbox{brief SVD} .
\end{align*}
Theorem: (Singular Value Decomposition in brief/reduced form)
If A is an \( m \times n \) matrix
of rank r, then A can be expressed in the form
\( {\bf A}_{m\times n} = {\bf U}_{m\times r} {\bf \Sigma}_r {\bf V}^{\mathrm T}_{r\times n} , \)
where Σr has size r-by-r and
\[
{\bf \Sigma}_r = \left[ \begin{array}{cccc} \sigma_1 & 0 & \cdots & 0 \\ 0 & \sigma_2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\ 0&0& \cdots & \sigma_r \end{array} \right] , \quad {\bf U}_{m\times r} = \left[
{\bf u}_1 \ \cdots \ {\bf u}_r \right] , \quad {\bf V}_{n\times r} = \left[ {\bf v}_1 \ \cdots \ {\bf v}_r \right] .
\]
It is customary to place the successive diagonal entries of the r-by-r matrix Σr
in nonincreasing order; here U is an m-by-r matrix with orthonormal columns, and \( {\bf V}_{n\times r} \)
is an n-by-r matrix with orthonormal columns. ■
The matrices U and V are not uniquely determined by A,
but the diagonal entries of Σ are necessarily the singular values of A.
The set \( \{ {\bf u}_1 , \ldots , {\bf u}_r \} \) is an orthonormal basis for the column
space of A. The set \( \{ {\bf u}_{r+1} , \ldots , {\bf u}_m \} \) is an
orthonormal basis for the kernel (null space) of AT. The set
\( \{ {\bf v}_1 , \ldots , {\bf v}_r \} \) is an orthonormal basis for the row space of A.
The set \( \{ {\bf v}_{r+1} , \ldots , {\bf v}_n \} \) is an orthonormal basis for the
kernel of A.
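The reduced form and these orthonormal bases are easy to extract from Mathematica's full decomposition. A minimal sketch, using the 2-by-3 matrix that is analyzed by hand in an example below:
a = N[{{1, 2, 6}, {2, -3, -2}}];   (* rank-2 matrix from the example below *)
{u, s, v} = SingularValueDecomposition[a];   (* full form: u is 2x2, s is 2x3, v is 3x3 *)
r = MatrixRank[a];
u1 = u[[All, 1 ;; r]]; sr = s[[1 ;; r, 1 ;; r]]; v1 = v[[All, 1 ;; r]];
Chop[a - u1.sr.Transpose[v1]]   (* zero matrix: the reduced SVD reproduces a *)
Chop[a.v[[All, r + 1 ;;]]]   (* zero: the trailing columns of v span the kernel of a *)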
One way to understand how a matrix A deforms space is to consider its action on the unit sphere in \( {\mathbb R}^n . \)
An arbitrary element x of this unit sphere can be represented by \( {\bf x} = x_1 v_1 + x_2 v_2 + \cdots + x_n v_n \)
with \( \sum_{i=1}^n x_i^2 =1 . \) The image is \( {\bf A}{\bf x} = \sigma_1 x_1 u_1 + \cdots + \sigma_p x_p u_p . \)
Letting \( y_i = \sigma_i x_i , \) we see that the image of the unit sphere consists of the vectors
\( y_1 u_1 + y_2 u_2 + \cdots + y_p u_p , \) where
\[
\frac{y_1^2}{\sigma_1^2} + \frac{y_2^2}{\sigma_2^2} + \cdots + \frac{y_p^2}{\sigma_p^2} = \sum_1^p x_i^2 \le 1.
\]
If A has full column rank (linearly independent columns), so \( p=n , \) the inequality is actually an equality.
Otherwise, some of the \( x_i \) are missing on the right, and the sum can be any number from 0 to 1. This
shows that A maps the unit sphere of \( {\mathbb R}^n \) to a p-dimensional
ellipsoid with semi-axes in the directions ui and with magnitudes \( \sigma_i . \)
If \( p=n , \) the image is just the surface of the ellipsoid; otherwise it is the solid ellipsoid.
Example. If \( {\bf A} = \left[ \begin{array}{ccc} 1\ & 2& \ 6 \\ 2\ & -3 \ & -2 \end{array} \right] , \)
then the linear transformation \( {\bf x} \,\mapsto \,{\bf A}{\bf x} \) maps the unit sphere
\( \left\{ {\bf x} : \, \| {\bf x} \| =1 \right\} \) in \( {\bf \mathbb R}^3 \)
onto an ellipse (together with its interior) in \( {\bf \mathbb R}^2 , \) shown in the figure below:
point1 = Graphics[{Red, {PointSize[0.02], Point[{2, -1}*7/Sqrt[5]]}}]
point2 = Graphics[{Red, {PointSize[0.02], Point[{1, 2}*3/Sqrt[5]]}}]
ellipse =
ParametricPlot[{1*Sin[u]*Cos[v] + 2*Sin[u]*Sin[v] + 6*Cos[u],
2*Sin[u]*Cos[v] - 3*Sin[u]*Sin[v] - 2*Cos[u]}, {u, 0, 1*Pi}, {v, 0,
2*Pi}]
arrow1 = Graphics[{{Thick, Black},
Arrow[{{0, 0}, {2, -1}*7/Sqrt[5]}]}]
arrow2 = Graphics[{{Thick, Black}, Arrow[{{0, 0}, {1, 2}*3/Sqrt[5]}]}]
axes1 = Graphics[{{Dashed, Black}, Arrow[{{-6, 0}, {6, 0}}]}]
axes2 = Graphics[{{Dashed, Black}, Arrow[{{0, -4.5}, {0, 5}}]}]
text1 = Graphics[
Text[Style["(2,-1)*7/Sqrt[5]", FontSize -> 14, Red], {9.3, -3.2}]]
text2 = Graphics[
Text[Style["(1,2)*3/Sqrt[5]", FontSize -> 14, Red], {3.0, 3.7}]]
Show[point1, point2, arrow1, arrow2, ellipse, axes1, axes2, text1,
text2]
The quantity \( \| {\bf A}{\bf x} \|^2 \) is maximized at the same x that maximizes
\( \| {\bf A}{\bf x} \| , \) and \( \| {\bf A}{\bf x} \|^2 \) is
easier to study:
\[
\| {\bf A}{\bf x} \|^2 = \left( {\bf A}{\bf x} \right)^{\mathrm T} \left( {\bf A}{\bf x} \right) = {\bf x}^{\mathrm T}
{\bf A}^{\mathrm T} {\bf A} {\bf x} = {\bf x}^{\mathrm T} \left( {\bf A}^{\mathrm T} {\bf A} \right) {\bf x} .
\]
Also, both \( {\bf A}^{\mathrm T} {\bf A} \) and \( {\bf A} {\bf A}^{\mathrm T} \)
are symmetric matrices:
\[
{\bf B}_1 = {\bf A}^{\mathrm T} {\bf A} = \begin{bmatrix} 5&-4&2 \\ -4&13&18 \\ 2&18&40 \end{bmatrix} \quad \mbox{and} \quad
{\bf B}_2 = {\bf A} {\bf A}^{\mathrm T} = \begin{bmatrix} 41&-16 \\ -16 &17 \end{bmatrix}
\]
that have three eigenvalues \( \lambda_1 =49, \quad \lambda_2 =9, \quad \lambda_3 =0 \)
for B1 and two eigenvalues \( \lambda_1 =49, \quad \lambda_2 =9 \)
for B2.
Corresponding unit eigenvectors for B1 (right singular vectors for A) are, respectively,
\[
{\bf v}_1 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 0 \\ 1 \\ 2 \end{bmatrix} , \quad {\bf v}_2 = \frac{1}{3\sqrt{5}} \,
\begin{bmatrix} 5 \\ -4 \\ 2 \end{bmatrix} , \quad {\bf v}_3 = \frac{1}{3} \, \begin{bmatrix} 2 \\ 2 \\ -1 \end{bmatrix} .
\]
Evaluating products, we get
\[
{\bf A} {\bf v}_1 = \frac{7}{\sqrt{5}} \,\begin{bmatrix} 2 \\ -1 \end{bmatrix} , \quad {\bf A} {\bf v}_2 = \frac{3}{\sqrt{5}} \,
\begin{bmatrix} 1 \\ 2 \end{bmatrix} , \quad {\bf A}{\bf v}_3 = {\bf 0} .
\]
Therefore, the maximum value of \( \| {\bf A}{\bf x} \|^2 \) is 49, attained when x is
the unit vector \( {\bf v}_1 . \) The vector \( {\bf A}{\bf v}_1 \) is the point
on the ellipse in the figure farthest from the origin, with \( \| {\bf A}{\bf v}_1 \| = 7 . \)
The singular values of A are the square roots of the eigenvalues of \( {\bf B}_1 = {\bf A}^{\mathrm T} {\bf A} : \)
\[
\sigma_1 = \sqrt{49} = 7, \quad \sigma_2 = \sqrt{9} =3, \quad \sigma_3 =0.
\]
The first singular value of A is the maximum of \( \| {\bf A}{\bf x} \| \) over all
unit vectors, and the maximum is attained at the unit vector \( {\bf v}_1 . \) This singular value 7 is
the 2-norm of matrix A:
\[
\sigma_1 = 7 = \| {\bf A} \|_2 < \left( \sum_{i=1}^2 \sum_{j=1}^3 \left\vert a_{i,j} \right\vert^2 \right)^{1/2} = \sqrt{58} = \| {\bf A} \|_F \approx 7.61577 ,
\]
which is the Frobenius norm. Normalizing \( {\bf A}{\bf v}_{1} \) and \( {\bf A}{\bf v}_{2} , \) we obtain the left singular vectors of A:
\[
{\bf u}_1 = \frac{1}{\sigma_1} \,{\bf A} {\bf v}_1 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 2 \\ -1 \end{bmatrix} , \qquad
{\bf u}_2 = \frac{1}{\sigma_2} \,{\bf A} {\bf v}_2 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 1 \\ 2 \end{bmatrix} .
\]
However, Mathematica returns normalized eigenvectors when the matrix is given numerically:
Eigenvectors[N[yourMatrix]]   (* replace yourMatrix with the matrix of interest *)
Now we construct the orthogonal matrices
\[
{\bf U} = \left[ {\bf u}_1 \ {\bf u}_2 \right] = \frac{1}{\sqrt{5}}\, \begin{bmatrix} 2&1 \\ -1&2 \end{bmatrix} \quad\mbox{and} \quad
{\bf V} = \left[ {\bf v}_1 \ {\bf v}_2 \ {\bf v}_3 \right] = \frac{1}{3\sqrt{5}}\, \begin{bmatrix} 0&5&2\sqrt{5} \\ 3&-4& 2\sqrt{5} \\
6&2&-\sqrt{5} \end{bmatrix} .
\]
This allows us to obtain the expanded form of the SVD of matrix A
\[
{\bf A} = {\bf U} {\bf \Sigma} {\bf V}^{\mathrm T} = \frac{1}{15} \begin{bmatrix} 2&1 \\ -1&2 \end{bmatrix} \,\begin{bmatrix} 7&0&0 \\ 0&3&0 \end{bmatrix}
\, \begin{bmatrix} 0&3&6 \\ 5&-4&2 \\ 2\sqrt{5} & 2\sqrt{5} & -\sqrt{5} \end{bmatrix}
\]
and the reduced form
\[
{\bf A} = {\bf U} {\bf \Sigma}_{2\times 2} {\bf V}^{\mathrm T}_{2\times 3} = \frac{1}{15} \begin{bmatrix} 2&1 \\ -1&2 \end{bmatrix} \,\begin{bmatrix} 7&0 \\ 0&3 \end{bmatrix}
\, \begin{bmatrix} 0&3&6 \\ 5&-4&2 \end{bmatrix} ,
\]
where
\[
{\bf \Sigma} = \begin{bmatrix} 7&0&0 \\ 0&3&0 \end{bmatrix} , \qquad {\bf \Sigma}_{2\times 2} = \begin{bmatrix} 7&0 \\ 0&3 \end{bmatrix} , \quad {\bf V}^{\mathrm T}_{2\times 3} = \begin{bmatrix} 0&3&6 \\ 5&-4&2 \end{bmatrix} .
\]
Finally, we represent matrix A as a sum of rank 1 matrices:
\[
{\bf A} = \sigma_1 {\bf R}_1 + \sigma_2 {\bf R}_2 = 7 {\bf R}_1 + 3 {\bf R}_2 ,
\]
where
\[
{\bf R}_1 = {\bf u}_1 {\bf v}_1^{\mathrm T} = \frac{1}{5} \,\begin{bmatrix} 0&2&4 \\ 0&-1&-2 \end{bmatrix} , \quad
{\bf R}_2 = {\bf u}_2 {\bf v}_2^{\mathrm T} = \frac{1}{15} \,\begin{bmatrix} 5&-4&2 \\ 10&-8&4 \end{bmatrix} .
\]
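The hand computation above can be verified with Mathematica's built-in command (the columns it returns may differ from ours by sign):
A = {{1, 2, 6}, {2, -3, -2}};
{U, S, V} = SingularValueDecomposition[N[A]];
Diagonal[S]   (* approximately {7., 3.}, the singular values found above *)
Chop[U.S.Transpose[V] - A]   (* zero matrix: the factorization reproduces A *)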
Example. Consider the full rank matrix
\[
{\bf A} = \begin{bmatrix} 1&2 \\ 0&2 \\ 2&0 \end{bmatrix} .
\]
Two symmetric matrices
\[
{\bf B}_1 = {\bf A}^T {\bf A} = \begin{bmatrix} 5&2 \\ 2&8 \end{bmatrix} \quad \mbox{and} \quad
{\bf B}_2 = {\bf A} \, {\bf A}^T = \begin{bmatrix} 5&4&2 \\ 4&4&0 \\ 2&0&4 \end{bmatrix} ,
\]
(the latter being singular) share the same nonzero eigenvalues \( \lambda_{1} = 9 \) and \( \lambda_2 = 4 . \)
The corresponding eigenvectors of B1 are
\[
{\bf v}_1 = \begin{bmatrix} 1 \\ 2 \end{bmatrix} , \quad {\bf v}_2 = \begin{bmatrix} 2 \\ -1 \end{bmatrix} ,
\]
and the eigenvectors of B2, including one for \( \lambda_{3} = 0 , \) are
\[
{\bf u}_1 = \begin{bmatrix} 5 \\ 4 \\ 2 \end{bmatrix} , \quad {\bf u}_2 = \begin{bmatrix} 0 \\ -1 \\ 2 \end{bmatrix} ,
\quad {\bf u}_3 = \begin{bmatrix} -2 \\ 2 \\ 1 \end{bmatrix} .
\]
Since these two matrices are symmetric, eigenvectors corresponding to distinct eigenvalues are orthogonal. We can normalize these vectors
by dividing by \( \| {\bf v}_1 \|_2 = \| {\bf v}_2 \|_2 = \sqrt{5} \approx 2.23607 \)
and \( \| {\bf u}_1 \|_2 = 3\sqrt{5} \approx 6.7082, \)
\( \| {\bf u}_2 \|_2 = \sqrt{5} \approx 2.23607 , \)
\( \| {\bf u}_3 \|_2 = 3 . \) This yields right singular vectors
\[
{\bf v}_1 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 1 \\ 2 \end{bmatrix} , \quad {\bf v}_2 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 2 \\ -1 \end{bmatrix} ,
\]
and left singular vectors
\[
{\bf u}_1 = \frac{1}{3\sqrt{5}} \,\begin{bmatrix} 5 \\ 4 \\ 2 \end{bmatrix} , \quad
{\bf u}_2 = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 0 \\ -1 \\ 2 \end{bmatrix} , \quad
{\bf u}_3 = \frac{1}{3} \,\begin{bmatrix} -2 \\ 2 \\ 1 \end{bmatrix} .
\]
Taking square roots of the eigenvalues of B1 and
B2, we obtain the singular values of matrix A:
\[
\sigma_1 = \sqrt{9} =3 , \quad \sigma_2 = \sqrt{4} =2 , \quad\mbox{and} \quad \sigma_3 =0.
\]
Then we build two orthogonal matrices:
\[
{\bf V} = \left[ {\bf v}_1 , \ {\bf v}_2 \right] = \frac{1}{\sqrt{5}} \,\begin{bmatrix} 1 & 2 \\
2 & -1 \end{bmatrix}
\]
and
\[
{\bf U} = \left[ {\bf u}_1 , \ {\bf u}_2 , \ {\bf u}_3 \right] = \frac{1}{3\sqrt{5}} \,\begin{bmatrix} 5 & 0 & -2\sqrt{5} \\
4 & -3 & 2\sqrt{5} \\
2 & 6 & \sqrt{5} \end{bmatrix} ,
\]
and the 'diagonal' matrix (which has the same dimensions as A)
\[
{\bf \Sigma} = \begin{bmatrix} \sigma_1 &0 \\ 0&\sigma_2 \\ 0&0 \end{bmatrix} = \begin{bmatrix} 3 &0 \\ 0&2 \\ 0&0 \end{bmatrix} .
\]
This allows us to determine the SVD in expanded form
\[
{\bf A} = {\bf U}\, {\bf \Sigma} \,{\bf V}^{\mathrm T} = \frac{1}{15}\,\begin{bmatrix} 5&0& -2\sqrt{5} \\ 4&-3& 2\sqrt{5} \\ 2&6&\sqrt{5} \end{bmatrix}
\begin{bmatrix} 3&0 \\0&2 \\ 0&0 \end{bmatrix}
\begin{bmatrix} 1&2 \\ 2&-1 \end{bmatrix} ,
\]
reduced form
\[
{\bf A} = {\bf U}_{3\times 2} {\bf \Sigma}_{2 \times 2} {\bf V}^{\mathrm T} = \frac{1}{15}\,\begin{bmatrix} 5&0 \\ 4&-3 \\ 2&6 \end{bmatrix} \begin{bmatrix} 3&0 \\0&2 \end{bmatrix}
\begin{bmatrix} 1&2 \\ 2&-1 \end{bmatrix} ,
\]
and represent A as a sum of rank 1 matrices:
\[
{\bf A} = \sigma_1 {\bf R}_1 + \sigma_2 {\bf R}_2 = \frac{1}{5} \, \begin{bmatrix} 5&10 \\ 4&8 \\ 2&4 \end{bmatrix} +
\frac{2}{5} \,\begin{bmatrix} 0&0 \\ -2&1 \\ 4&-2 \end{bmatrix} ,
\]
where
\[
{\bf R}_1 = {\bf u}_1 {\bf v}_1^{\mathrm T} = \frac{1}{15} \, \begin{bmatrix} 5&10 \\ 4&8 \\ 2&4 \end{bmatrix} , \quad
{\bf R}_2 = {\bf u}_2 {\bf v}_2^{\mathrm T} = \frac{1}{5} \, \begin{bmatrix} 0&0 \\ -2&1 \\ 4&-2 \end{bmatrix} .
\]
The pseudoinverse (also called the Moore--Penrose inverse) can be found using the reduced SVD:
\[
{\bf A}^{\dagger} = {\bf V} \,{\bf \Sigma}_{2\times 2}^{-1} {\bf U}_{3\times 2}^{\mathrm T} = \frac{1}{15} \,
\begin{bmatrix} 1&2 \\ 2&-1 \end{bmatrix} \begin{bmatrix} 1/3&0 \\ 0&1/2 \end{bmatrix} \begin{bmatrix} 5&4&2 \\ 0&-3&6 \end{bmatrix}
= \frac{1}{18} \, \begin{bmatrix} 2&-2&8 \\ 4&5&-2 \end{bmatrix} .
\]
Indeed,
\[
{\bf A}^{\dagger} {\bf A} = \begin{bmatrix} 1&0 \\ 0&1 \end{bmatrix} .
\]
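Mathematica's PseudoInverse command confirms this result when applied to the exact matrix:
A = {{1, 2}, {0, 2}, {2, 0}};
PseudoInverse[A]   (* {{1/9, -1/9, 4/9}, {2/9, 5/18, -1/9}}, the matrix found above *)
PseudoInverse[A].A   (* the 2-by-2 identity matrix, since A has full column rank *)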
Example.
Consider the 3-by-2 matrix A whose transpose is \( {\bf A}^{\mathrm T} = \begin{bmatrix} 4& -1&1 \\ -4 & 1& -1 \end{bmatrix} . \)
First, we compute two symmetric singular matrices:
\[
{\bf B}_1 = {\bf A}^T {\bf A} = \begin{bmatrix} 18 &-18 \\ -18 &18 \end{bmatrix} \quad \mbox{and} \quad
{\bf B}_2 = {\bf A} \, {\bf A}^T = \begin{bmatrix} 32 &-8&8 \\ -8 &2&-2 \\ 8 &-2&2 \end{bmatrix}
. \]
The eigenvalues of B1 are 36 and 0, with corresponding unit eigenvectors (which are the right singular vectors of A)
\[
{\bf v}_1 = \frac{1}{\sqrt{2}} \, \begin{bmatrix} -1 \\ 1 \end{bmatrix} \quad \mbox{and} \quad
{\bf v}_2 = \frac{1}{\sqrt{2}} \, \begin{bmatrix} 1 \\ 1 \end{bmatrix} .
\]
These unit vectors form the columns of V:
\[
{\bf V} = \left[ {\bf v}_1 \ {\bf v}_2 \right] = \frac{1}{\sqrt{2}} \, \begin{bmatrix} -1&1 \\ 1&1 \end{bmatrix} .
\]
The singular values are \( \sigma_1 = \sqrt{36} =6 \quad\mbox{and} \quad \sigma_2 =0 . \)
Since there is only one nonzero singular value, the matrix Λ may be written as a single number; that is,
\( {\bf \Lambda} = \Lambda =6. \) The matrix Σ has the same size as A, with
Λ in its upper left corner:
\[
{\bf \Sigma} = \begin{bmatrix} \Lambda &0 \\ 0&0 \\ 0&0 \end{bmatrix} = \begin{bmatrix} 6&0 \\ 0&0 \\ 0&0 \end{bmatrix} .
\]
To determine U, first calculate
\[
{\bf A} {\bf v}_1 = \frac{1}{\sqrt{2}} \, \begin{bmatrix} -8 \\ 2 \\ -2 \end{bmatrix} = \sqrt{2} \, \begin{bmatrix} -4 \\ 1 \\ -1 \end{bmatrix} , \quad
{\bf A} {\bf v}_2 = \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix} .
\]
As a check on the calculations, we verify that \( \| {\bf A} {\bf v}_1 \| = \sigma_1 =6 . \)
The only column found for U so far is
\[
{\bf u}_1 = \frac{1}{\sigma_1} \, {\bf A} {\bf v}_1 = \frac{\sqrt{2}}{6} \, \begin{bmatrix} -4 \\ 1 \\ -1 \end{bmatrix} .
\]
The other columns of U are found by extending the set \( \{ {\bf u}_1 \} \)
to an orthonormal basis for \( \mathbb{R}^3 . \) In this case, we need two orthogonal unit vectors
u2 and u3 that are orthogonal to u1.
Each vector must satisfy \( {\bf u}_1^{\mathrm T} {\bf x} =0, \) which is equivalent to the
equation \( 4x_1 - x_2 + x_3 =0 . \) A basis for the solution set of this equation is
\[
{\bf w}_1 = \begin{bmatrix} 0 \\ 1 \\ 1 \end{bmatrix} , \qquad {\bf w}_2 = \begin{bmatrix} 1 \\ 4 \\ 0 \end{bmatrix} .
\]
Upon application of the Gram--Schmidt process to \( \{ {\bf w}_1 , \ {\bf w}_2 \} , \) we obtain
\[
{\bf u}_2 = \frac{1}{\sqrt{2}} \,\begin{bmatrix} 0 \\ 1 \\ 1 \end{bmatrix} , \qquad {\bf u}_3 = \frac{1}{3} \,
\begin{bmatrix} 1 \\ 2 \\ -2 \end{bmatrix} .
\]
Finally, set \( {\bf U} = \left[ {\bf u}_1 \ {\bf u}_2 \ {\bf u}_3 \right] , \) take
Σ and \( {\bf V}^{\mathrm T} \) from above, and write the SVD of A:
\[
{\bf A} = \frac{1}{3\sqrt{2}} \,\begin{bmatrix} -4&0&\sqrt{2} \\ 1&3&2\sqrt{2} \\ -1&3&-2\sqrt{2} \end{bmatrix}
\begin{bmatrix} 6&0 \\ 0&0 \\ 0&0 \end{bmatrix} \frac{1}{\sqrt{2}} \,\begin{bmatrix} -1&1 \\ 1 & 1 \end{bmatrix} .
\]
However, its reduced form is very simple:
\[
{\bf A} = {\bf U}_{3\times 1} {\bf \Sigma}_{1 \times 1} {\bf V}^{\mathrm T}_{1\times 2} = \begin{bmatrix} -4 \\ 1 \\ -1 \end{bmatrix} \cdot \left[ -1 \ 1 \right] .
\]
because we can keep the number Λ instead of the matrix Σ:
\[
{\bf U}_{3\times 1} = \frac{\sqrt{2}}{6} \, \begin{bmatrix} -4 \\ 1 \\ -1 \end{bmatrix} , \quad
{\bf \Sigma}_{1 \times 1} = \Lambda = 6, \quad
{\bf V}^{\mathrm T}_{1\times 2} = \frac{1}{\sqrt{2}} \left[ -1 \ 1 \right] .
\]
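A quick Mathematica check of this example (again, the signs of the computed singular vectors may differ from the hand computation):
A = {{4, -4}, {-1, 1}, {1, -1}};   (* the matrix A, the transpose of the displayed A^T *)
SingularValueList[N[A]]   (* {6.}, the single nonzero singular value *)
{U, S, V} = SingularValueDecomposition[N[A]];
Chop[U.S.Transpose[V] - A]   (* zero matrix *)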
Example.
However,