
270 10: Correlation and linear regression ⏐ Part E Inter-relationships between variables
5.1 Example: The scattergraph method
Suppose we have the following pairs of data about output and costs.
Month
Output Costs
'000 units $'000
1 20 82
2 16 70
3 24 90
4 22 85
5 18 73
(a) These pairs of data can be plotted on a scattergraph (the horizontal axis representing the independent
variable and the vertical axis the dependent) and a line of best fit might be judged as the one shown below.
It is drawn to pass through the middle of the data points, thereby having as many data points below the line
as above it.
(b) A
formula for the line of best fit can be found. In our example, suppose that we read the following data
from the graph.
(i) When X = 0, Y = 22,000. This must be the value of
a in the formula Y = a + bX.
(ii) When X = 20,000, Y = 82,000. Since Y = a + bX, and a = 22,000, 82,000 = 22,000 + (b × 20,000)
b × 20,000 = 60,000
b = 3
(c) In this example the estimated equation from the scattergraph is Y = 22,000 + 3X.
5.2 Forecasting and scattergraphs
If the company to which the data in Paragraph 5.1 relates wanted to predict costs at a certain level of output (say
13,000 units), the value of 13,000 could be substituted into the equation Y = 22,000 + 3X and an estimate of costs
made.
If X = 13, Y = 22,000 + (3 × 13,000)
∴ Y = $61,000
Predictions can be made directly from the scattergraph, but this will usually be less accurate.