84 CHAPTER 7
If you need one-digit numbers, you can treat each individual column of digits as
a separate column in the table. If you need four-digit numbers, you can treat pairs
of columns together as four-digit numbers. If you need three-digit numbers, you can
simply ignore the first or last digit in a four-digit number. The spaces dividing the
numbers into columns of two-digit numbers, in short, are entirely arbitrary, as are
the wider spaces grouping the columns by threes. Likewise, the extra space setting
off groups of five rows each is included only to make the table easier to read.
Suppose the population from which you wish to select a sample contains 536
elements, numbered from 001 to 536. You need a list of three-digit random numbers
between 001 and 536. You can select a list of numbers in exactly the manner just
described, except that you ignore any number less than 001 (that is, 000) or any
number greater than 536 (that is, 537–999). You simply skip past these inapplicable
numbers in the list and continue to select those in the relevant range until you have
as many as needed.
Sometimes the same number will appear more than once in a list of random
numbers. If this happens, you can follow either of two courses. The first is to ignore
multiple appearances of the same number and continue reading the random num-
ber table until you have as many numbers as you need without repetitions. This
is called sampling without replacement. (The name makes sense if you imagine
that you were actually drawing numbered slips from a hat without replacing the
slips in the hat for potential re-selection on subsequent drawings.) Sampling without
replacement is the course of action that seems to make intuitive good sense to many
people.
Sampling with replacement, however, turns out to be a little simpler mathemat-
ically, and the equations in this book are those for sampling with replacement.
In sampling with replacement, each time you draw a numbered slip from the hat
(speaking metaphorically), you write down the number and replace the slip in the
hat so that it could be drawn again in the future. The analogous procedure, when
sampling with a random number table, is to include repeated numbers in the sample
as many times as they appear in the list from the random number table. The data
for the corresponding elements, then, are included among the sample data as if each
occurrence in the sample were an entirely different element.
Suppose, for example, that we were sampling with replacement from a pop-
ulation of scrapers, in an effort to estimate the mean length of scrapers in the
population. The random numbers chosen from the table might be 23, 42, 13, 23, and
06. We would select the scrapers with the numbers 06, 13, 23, and 42 and measure
the length of each. We would, however, write down five length measurements, not
four, so as to include the length measurement for scraper number 23 twice. The
number of elements in the sample would also be five, not four. To re-emphasize,
it is this procedure, sampling with replacement, that the equations in this book are
appropriate for. Slightly different equations are technically necessary for sampling
without replacement, although in almost every practical instance it makes very little
meaningful difference in the results. It is, however, quite easy to adhere strictly to the
assumptions on which the formulas given in this book are based simply by including
the data from an element in the sample as many times as that element is selected.