2.
Suppose
A €
R
nxn
is
banded
with half-bandwidth
p.
Determine
the
exact
number
of
arithmetic operations required
to
factor
A
into
LU.
Exercises
1.
Suppose
A 6
R
nxn
.
Determine
the
exact number
of
arithmetic operations
required
for the
computation
of A = LU via
Gaussian elimination.
Further
count
the
number
of
operations required
to
compute
L
-1
b
and
U~
1
L~
1
b.
Verify
the
results given
in the
text.
The
following
formulas
will
be
useful:
486
Chapter
10.
More about
finite
element methods
methods when
n
is
very large, making Gaussian elimination
too
expensive.
In
such
a
case,
an
iterative method must give
a
reasonable approximation
in
much
less
than
n
iterations,
or it
also
is too
expensive.
The CG
algorithm
is
useful
precisely because
it can
give very good results
in a
relatively small
number
of
iterations.
The
rate
of
convergence
of CG is
related
to the
condition number
of A,
which
is
denned
as the
ratio
of the
largest eigenvalue
of A to the
smallest. When
the
condition number
is
relatively small
(that
is,
when
A is
well-conditioned),
CG
will
converge
rapidly.
The
algorithm also works
well
when
the
eigenvalues
of A are
clustered into
a few
groups.
In
this case, even
if the
largest eigenvalue
is
much
larger
than
the
smallest,
CG
will
perform
well.
The
worst case
for CG is a
matrix
A
whose eigenvalues
are
spread
out
over
a
wide range.
10.2.7
Preconditioned
CG
It is
often
possible
to
replace
a
matrix
A
with
a
related matrix whose eigenvalues
are
clustered,
and for
which
CG
will
converge quickly. This technique
is
called
preconditioning,
and it
requires
that
one find a
matrix
M
(the
preconditioned
that
is
somehow
similar
to A (in
terms
of its
eigenvalues)
but is
much simpler
to
invert.
At
each step
of the
preconditioned conjugate gradient (PCG) algorithm,
it is
necessary
to
solve
an
equation
of the
form
Mq = r.
Preconditioners
can be
found
in
many
different
ways,
but
most
require
an
intimate knowledge
of the
matrix
A. For
this reason, there
are few
general-purpose
methods.
One
method
that
is
often
used
is to
define
a
preconditioner
from
an
incomplete
factorization
of A. An
incomplete factorization
is a
factorization (like
Cholesky)
in
which
fill-in is
limited
by fiat.
Another method
for
constructing
pre-
conditioners
is to
replace
A by a
simpler matrix (perhaps arising
from
a
simpler
PDE)
that
can be
inverted
by FFT
methods.
486
Chapter 10. More
about
finite element methods
methods when n
is
very large, making Gaussian elimination too expensive.
In such a case, an iterative method must give a reasonable approximation in
much less
than
n iterations, or
it
also
is
too
expensive. The CG algorithm
is
useful precisely because it can give very good results in a relatively small
number of iterations.
The
rate
of convergence of CG
is
related to
the
condition number of A, which
is defined as the ratio of the largest eigenvalue of
A
to
the
smallest. When the
condition number
is
relatively small
(that
is, when A
is
well-conditioned), CG will
converge rapidly. The algorithm also works
well
when
the
eigenvalues of A are
clustered into a
few
groups. In this case, even if
the
largest eigenvalue
is
much
larger
than
the
smallest, CG will perform well. The worst case for CG
is
a matrix
A whose eigenvalues are spread out over a wide range.
10.2.7 Preconditioned
CG
It
is often possible
to
replace a matrix A with a related matrix whose eigenvalues
are clustered, and for which CG will converge quickly. This technique
is
called
preconditioning, and it requires
that
one find a matrix M (the preconditioner)
that
is
somehow similar
to
A (in terms of its eigenvalues)
but
is
much simpler to invert. At
each step of
the
preconditioned conjugate gradient (PCG) algorithm, it is necessary
to solve an equation of the form
Mq
=
r.
Preconditioners can be found in many different ways,
but
most require an
intimate knowledge of the matrix
A.
For this reason, there are
few
general-purpose
methods. One method
that
is often used
is
to define a preconditioner from an
incomplete factorization of
A.
An incomplete factorization
is
a factorization (like
Cholesky) in which fill-in is limited by fiat. Another method for constructing pre-
conditioners
is
to
replace A by a simpler matrix (perhaps arising from a simpler
PDE)
that
can be inverted by
FFT
methods.
Exercises
1. Suppose A E
Rnxn.
Determine the exact number of arithmetic operations
required for
the
computation of A =
LU
via Gaussian elimination. Further
count
the
number of operations required
to
compute
L-1b
and
U-1L-1b.
Verify the results given in the text. The following formulas will be useful:
ti =
n(n
+ 1),
i=l
2
ti
2
=
n(n+1)(2n+1).
i=l
6
2. Suppose A E
Rnxn
is banded with half-bandwidth p. Determine
the
exact
number of arithmetic operations required
to
factor A into L
U.