
APPENDIX A
✦
Matrix Algebra
989
or, assuming b is not 0, the set of equations
X
y = X
Xb.
The means of solving such a set of equations is the subject of Section A.5.
In Figure A.3, the linear combination Xb is called the projection of y into the column space
of X. The figure is drawn so that, although y and y
∗
are different, they are similar in that the
projection of y lies on top of that of y
∗
. The question we wish to pursue here is, Which vector, y
or y
∗
, is closer to its projection in the column space of X? Superficially, it would appear that y is
closer, because e is shorter than e
∗
.Yety
∗
is much more nearly parallel to its projection than y,so
the only reason that its residual vector is longer is that y
∗
is longer compared with y. A measure
of comparison that would be unaffected by the length of the vectors is the angle between the
vector and its projection (assuming that angle is not zero). By this measure, θ
∗
is smaller than θ,
which would reverse the earlier conclusion.
THEOREM A.2
The Cosine Law
The angle θ between two vectors a and b satisfies
cos θ =
a
b
a·b
.
The two vectors in the calculation would be y or y
∗
and Xb or (Xb)
∗
. A zero cosine implies
that the vectors are orthogonal. If the cosine is one, then the angle is zero, which means that the
vectors are the same. (They would be if y were in the column space of X.) By dividing by the
lengths, we automatically compensate for the length of y. By this measure, we find in Figure A.3
that y
∗
is closer to its projection, (Xb)
∗
than y is to its projection, Xb.
A.4 SOLUTION OF A SYSTEM OF LINEAR
EQUATIONS
Consider the set of n linear equations
Ax = b, (A-56)
in which the K elements of x constitute the unknowns. A is a known matrix of coefficients, and b
is a specified vector of values. We are interested in knowing whether a solution exists; if so, then
how to obtain it; and finally, if it does exist, then whether it is unique.
A.4.1 SYSTEMS OF LINEAR EQUATIONS
For most of our applications, we shall consider only square systems of equations, that is, those in
which A is a square matrix. In what follows, therefore, we take n to equal K. Because the number
of rows in A is the number of equations, whereas the number of columns in A is the number of
variables, this case is the familiar one of “n equations in n unknowns.”
There are two types of systems of equations.