8 8 -
6.9. UNDERLYING ASSUMPTIONS
It is important to be as critical of our own procedures as we have been of Ball's. There is the
question of how reasonable our model is of a fair distribution. We think that it is an improvement on
Ball's model, for two reasons. First, it takes into consideration the claims of those with high test scores to
be placed in the top band and, secondly, the fact that the number of places in the top band is likely to be
restricted. Our application of the model, however, does give rise to some problems. We adopted the
model on the assumption that the placement of any one pupil is likely to result from all the decisions
about all the pupils. Unfortunately, we only know about the test scores of the pupils in Ball's sample and,
as we know, Ball's sample is unrepresentative. Our application of the model to Ball's data relies on the
assumption that in the cohort as a whole the ratio of high to middle to low test scores is 11:40:35. In other
words, it is similar to that in the sample. It also relies on the assumption that test scores and social class in
the cohort covary in the same way as in the sample. If, in fact, there were just as many working-class
pupils as middle-class pupils with test scores of 115 and above then our procedures would be severely
awry. All we can say about this is that when NFER tests are administered to large groups of pupils,
middle-class pupils do usually score more highly on average.
It is also worth remembering at this point an issue that we raised earlier about Ball's
operationalization of ability. We can ask: 'Why should we regard NFER achievement test scores as a
better measure of a pupil's ability than the estimates of junior-school teachers?'. After all, the bands group
pupils for all subjects, whereas the tests measure achievement in specific areas. While there are very few
ways of getting a high score on a test, there is a large number of ways of getting a low score, many of
which would not be indicative of underlying inability. We should rightly complain if junior-school
teachers did not take this possibility into consideration in making recommendations about the allocation
of pupils to bands in secondary school. For what it is worth, Ball's data do show more pupils being 'over-
allocated' than 'under-allocated'. Ball's argument was constructed on the basis that test scores represented
'true ability', so decisions on band allocation that departed from what might be predicted by a test score
could be seen as social-class bias. Our argument has been that Ball has demonstrated no social-class bias,
but in order to pursue it we have had to assume with him that test scores provide a sufficient indication of
the way in which pupils should be banded by ability. This is not an assumption we should like to defend.
6.10. REGERESSION ANALYSIS
Up to now we have concentrated on the underlying logic of quantitative data analysis, introducing
a small number of techniques as and when they became appropriate for the analysis of the data from
Ball's study. You will find in the literature quite a lot of research that relies primarily on the techniques
we have discussed so far. However, they represent only a very small range of the statistical techniques
that are available and that have been used by educational researchers. We cannot hope to cover all the
others, but what we shall do in the remainder of this section is to introduce one of the most frequently
used of the more advanced techniques, 'regression analysis'. As you will see, regression analysis is a
development of the techniques to which you have already been introduced. Before we discuss it fully,
however, we need to cover one or two other issues.
TYPES OF DATA
The kinds of statistical tests that can be employed in educational research depend on the kinds of
data that are used. So far we have been using data as if they were nominal or categorical in character. This
is the form of data that is least amenable to statistical use. For regression analysis, higher levels of data
are required. Below we outline the standard classification of different types of quantitative data.
•
Nominal-level data
Examples: classifications of gender and ethnicity. These are just categories. You
cannot add together the number of males and the number of females, divide by the
total and come up with an 'average gender'. You cannot multiply ethnic groups
together and come up with something different. Within the bounds of common
sense, however, you can collapse nominal categories together. For example, Ball