8.2. The Method of Least Squares 325
Consider the three points (0, 0), (1, C), and (2, 0), where C > 0,
and the problem of finding the best horizontal line y = b to fit these
points.
a. Explain why any horizontal line below all three points cannot be
the best fit, by drawing a plot and imagining what happens to TD
as the line is moved upward.
b. Explain similarly why any horizontal line above all three points
cannot be the best fit.
c. Explain why, if a horizontal line is below the middle point and
above the others, then TDcan be decreased by lowering the line to
go through the bottom two points.
d. Conclude y = 0 is the best-fit horizontal line when TDis used as
a measure of total error. Because this result does not depend on C,
the value of C has no effect on the line.
e. For a challenge, explain why y = 0 is the best-fit line (horizontal
or not) for the three data points.
8.1.8. Using TD to measure total error does not always produce a single
best-fit curve; there can be many curves that are all equally good.
To see how this can happen, consider the four points (0, 0), (1, 1),
(2, 1), and (3, 0), and the problem of finding the best horizontal line
y = b to fit these points.
a. As in the previous problem, explain why the best-fit horizontal line
cannot lie above all the points or below all the points.
b. Explain why any horizontal line above the two bottom points and
below the two top points will have TD= 2.
c. Conclude from parts (a) and (b) that there may not be a unique
solution to the problem of fitting a curve to data, if total error is
measured using TD. (If total error is measured by SSE, there is a
unique best-fit line.)
8.2. The Method of Least Squares
While exploring the idea of fitting curves to data in the last section, we
discovered that even fitting an exponential curve to data could be reformulated,
through the use of semilog graphs, as a problem of fitting a straight line.
In fact, the most common curve-fitting problems experimentalists face are
usually those of straight line fits. Data are collected, a plot is made (using
a transformation if necessary), and the data points often appear to cluster in
a roughly linear manner. Then, the best-fit line to describe the data must be
chosen.