Sampling is the key to survey research. No matter how well a study
is done in other ways, if the sample has not been properly found, the
results cannot be regarded as correct. Though this chapter may be more
difficult than the others, but it is perhaps the most important
chapter in this book. It applies mainly to surveys, but is also
important for planning other types of research.
1. Populations
The first concept
you need to understand is the difference between a population and a
sample.
To make a sample, you first need a population. In non-technical
language, population means "the number of people living in
an area." This meaning of population is also used in survey
research, but this is only one of many possible definitions of
population. The word universe is sometimes used in survey
research, and means exactly the same in this context as
population.
The unit of population is whatever you are counting: there
can be a population of people, a population of households, a
population of events, institutions, transactions, and so forth.
Anything you can count can be a population unit. But if you cant
get information from it, and you cant measure it in some way,
its not a unit of population that is suitable for survey
research.
For a survey, various limits (geographical and otherwise) can be
placed on a population. Some populations that could be covered by
surveys are...
- All people living in Cambodia.
- All people aged 18 and over.
- All households in Hanoi.
- All schools in Australia.
- All instances of tuning in to a radio station in the last seven
days
...and so on. If you can express it in a phrase beginning
"All," and you can count it, its a population of some
kind. The commonest kind of population used in survey research uses
the formula:
- All people aged X years and over, who live in area Y.
The "X years and over" criterion usually rules out
children below a certain age, both because of the difficulties
involved in interviewing them and the lack of relevance of their
answers to many research questions.
Even though some populations cant be questioned directly,
theyre still populations. For example, schools cant fill
in questionnaires, but somebody can do so on behalf of each school.
The distinction is important when finding the answers to questions
like "What proportion of Samoan schools have libraries?" You
need only one questionnaire from each school - not one from each
teacher, or one from each student.
Often, the population you end up surveying is not the population
you really wanted, because some part of the population cannot be
surveyed. For example, if you want to survey opinions among the whole
population of an area, and choose to do the survey by telephoning
people at home, the population you actually survey will be people with
a telephone in their home. If the people with no telephone have
different opinions, you will not discover this.
As long as the surveyed population is a high proportion of the
wanted population, the results obtained should also be true for the
larger population. For example, if 90% of homes have a telephone, the
10% without a phone would have to be very different, for the survey's
results not to be true for the whole population.
2. Sampling frames
A sampling frame
can be one of two things: either a list of all members of a
population, or a method of selecting any member of the population. The
term general population refers to everybody in a particular
geographical area. Common sampling frames for the general population
are electoral rolls, street directories, telephone directories, and
customer lists from utilities which are used by almost all households:
water, electricity, sewerage, and so on.
It is best to use the list that is most accurate, most complete,
and most up to date, but this differs from country to country. Some of
these lists are of households, others are of people. For most surveys,
a list of households (specially if it is in street order) is more
useful than a list of people. Another commonly used sampling frame
(which I do not recommend) is a map.
3. Samples
A sample is a
part of the population from which it was drawn. Survey research is
based on sampling, which involves getting information from only some
members of the population.
If information is obtained from the whole population, its not
a sample, but a census. Some surveys, based on very small populations
(such as all members of an organization) in fact are censuses and not
sample surveys. When you do a census, the techniques given in this
book still apply, but there is no sampling error - as long as the
whole group participates in the census.
Samples can be drawn in several different ways, such as probability
samples, quota samples, purposive samples, and volunteer samples.
Probability samples
Sometimes known as random samples, probability samples are the
most accurate of all. It is only with a probability sample that
its possible to accurately estimate how different the sample is
from the whole population. With a probability sample, every member of
the population has an equal (or known) chance of being included in the
sample. In most professional surveys, each member of the population
has the same chance of being included in the sample, but sometimes
certain types of people are deliberately over-represented in the
sample. Results are calculated compensate for the sample
imbalance.
With a probability sample, the first step is usually to try to find
a sampling frame: a list of all members of the population. Using this
list, individuals or households are numbered, and some numbers are
chosen at random to determine who is surveyed. If no population list
is available, other methods are used to ensure that every population
member has an equal (or other known) chance of inclusion in the
survey.
Quota samples
In the early days of survey research, quota sampling was very
common. No population list is used, but a quota, usually based on
Census data, is drawn up.
For example, suppose the general population is being surveyed, and
50% of them are known to be male, and half of each sex is aged over
40. If each interviewer had to obtain 20 interviews, she or he would
be told to interview 10 men and 10 women, 5 of each aged under 40, and
5 of each aged 40-plus. It is usually the interviewers who decide how
where they find the respondents. In this case, age and sex are
referred to as control variables.
A problem with quota samples is that some respondents are easier to
find than others. The interviewer in the previous example may have
quickly found 10 women, and 5 men over 40, but may then have taken a
lot of time finding men under 40. If too many control variables are
used, interviewers will waste a lot of time trying to find respondents
to fit particular categories. For example (if interviews had been
specified in terms of occupation and household size, as well as age
and sex) "2 male butchers aged 40 to 44, living in households of
8 or more people".
Its important with quota sampling to use appropriate control
variables. If some people in a category are more likely to take part
in the survey than others, and also likely to give different answers
from those in another category, then that category should be a control
variable.
For example, if women are more willing than men to be surveyed
(which is often true) and if the two sexes patterns of answers
are expected to be quite different, then the quota design should
obtain balanced numbers from each sex. In fact, sex and age group are
the two commonest control variables in quota surveys, but occasionally
a different variable can be the most relevant. If youre planning
a quota sample, dont assume that by getting the right proportion
in each age group for each sex, everything else will be OK.
Pure quota samples are little used today, except for surveys done
in public places, but sometimes partial quota sampling can be useful.
A common example is when choosing one respondent from a household. The
probability method begins by finding out how many people live in the
household, then selecting an interviewee purely at random. There are
practical problems with this approach (explained later in this
chapter), so inside randomly selected households, quota sampling is
often used.
Volunteer samples
Samples of volunteers should generally be treated with
suspicion. However, as all survey research involves some element of
volunteering, there is no fixed line between a volunteer sample and a
probability sample. The main difference between a pure volunteer
sample and a probability sample of volunteers is that in the former
case, volunteers make all the effort; no sampling frame is used.
The main source of problems with volunteer samples is the
proportion who volunteer. If too few of the population volunteer for
the survey, you must wonder what was so special about them. There is
usually no way of finding out how those who volunteered are different
from those who didnt. But if the whole population volunteer to
take part in the survey, theres no problem.
In some circumstances, volunteer samples can be useful. For
example, the Australian Broadcasting Corporation used to survey panels
of listeners to its two serious radio networks, Radio National and
Classic FM. To recruit people for the panels, the networks broadcast
advertisements, asking those interested to contact the ABC.
To check whether these volunteers were representative of all
listeners to the networks, we carried out of random surveys of
listeners to these networks, and compared the answers of panel members
and randomly selected listeners. We found that the volunteers were
representative in most ways, but listened to the station much more
often.
As only about 5% of the population listen regularly to these
networks, random surveys are very expensive to conduct (20 households
must be contacted to find each listener), so using volunteer samples
saves a lot of money. Not all people who volunteer for panels need be
accepted. A quota system can be used, to ensure that various parts of
the population are accurately represented.
When people who know nothing about sampling organize surveys, they
often have a large number of questionnaires printed, and offer one to
everybody whos interested. Amateur researchers often seem to
feel that if the number of questionnaires returned is large enough,
the lack of a sample design isnt important. Certainly, you will
get some results, but you will have no way of knowing how
representative the respondents are of the population. You may not even
know what the population is, with this method. The less effort
that goes into distributing questionnaires to particular individuals
and convincing them that participation is worthwhile, the more likely
it is that those who complete and return questionnaires will be a very
small (and probably not typical) section of the population.
About the only way in which a volunteer sample can produce accurate
results (without being checked against a probability sample), is if a
high proportion of the population voluntarily returns questionnaires.
Ive known this to work a few times, usually in country areas
with a small population, where up to about 50% of all households have
returned questionnaires. Even so, if all the effort is left to the
respondents, theres no certainty that somebody who wants to
distort the results has not filled in hundreds of questionnaires.
The same problems apply to drawing conclusions from unsolicited
mail and phone calls. For example, politicians sometimes make claims
like "My mail is running five to one in favour of the stand I
made last week." There are all sorts of reasons why the letters
sent in may not be representative of the population. The same applies
to letters sent to broadcasting organizations: all these tell you is
the opinions of the letter-writers. It is only when the majority of
listeners write letters that the opinions expressed in these letters
might be representative.
Purposive samples
A purposive sample is one in which a surveyor tries to create a
representative sample without sampling at random.
One of the commonest uses of purposive sampling is in selecting a
group of geographical areas to represent a larger area. For example,
door-to-door interviewing can become extremely expensive in rural
areas with a low population density. In a country such as Cambodia, it
is not feasible to do a door-to-door survey covering the whole
country. Though areas could be picked purely at random, if the budget
was small and only a small number of towns and cities could be
included, you might choose these in a purposive way, perhaps ensuring
that different types of town were included. However, there are better
ways to do this. Read on
Maximum-diversity samples
A maximum-diversity sample is a special kind of purposive
sample. Normally, a purposive sample is not representative, and does
not claim to be. A maximum-diversity sample aims to be more
representative than a random sample (which, despite what many people
think, is not always the most representative, specially when the
sample size is small).
Instead of seeking representativeness through equal probability,
its sought by including a wide range of extremes. This is an
extension of the statistical principle of regression towards the
mean - in other words, if a group of people that is (on average)
extreme in some way, it will contain some people who themselves are
average. So if you sought a "minimum diversity" sample by
only trying to cover the types of people who you thought were average,
youd be likely to miss out on a number of different groups which
might make up quite a high proportion of the population. But by
seeking maximum diversity, average people are automatically
included.
When you are selecting a multi-stage sample (explained in more
detail below) the first stage might be to draw a sample of districts
in the whole country. If this number is less than about 30, its
likely that the sample will be unrepresentative in some ways. Two
solutions to this are stratification (also explained below) and
maximum-diversity sampling. For both of these, some local knowledge is
needed.
With maximum-diversity sampling, you try to include all the
extremes in the population. This method is normally used to choose no
more than about 30 units. For example, in a small village, you might
decide to interview 10 people. If this was a radio audience survey,
you could ask to interview
- the oldest person in the village who listens to radio
- the oldest who does not listen to radio
- the youngest who listens to radio
- a man who listens to radio all day
- a woman who listens to radio all day
- somebody who has never listened to radio in his or her life
- the person with the most radios (a repairman, perhaps)
- the person with the biggest aerial
- a person who is thought to be completely average in all
ways
...and so on. The principle is that if you deliberately try to
interview a very different selection of people, their aggregate
answers will be close to the average. The method sounds odd, but works
well in places where a random sample cannot be drawn. And of course it
only works when information about the different kinds of sample unit
(e.g. a person) is widely known.
Map-based sampling
When you are planning a door to door survey, it is tempting to
use a map as the basis for sampling. To get 100 starting points for
clusters, all you need to do is throw 100 darts at the map.
This method, if properly done, gives every unit of area on the map
an equal chance of being surveyed. This would be valid only if your
unit of measurement was a unit of land area for example, if you
were estimating the distribution of a plant species. If you are
surveying people or households, this equal-area method will
over-represent farmers and people living on large properties. People
living in high-density urban areas will be greatly under-represented.
Even within a small urban area, large differences in density can
exist.
Slightly better, but still badly flawed, is a method used in the
1980s by a Sydney research company. This was based on a street
directory, and gave every street an equal chance of being surveyed.
The trouble was that streets ranged in size from the Pacific Highway
(with thousands of addresses on it) to cul-de-sacs with only one or
two dwellings. There would have been no problem, however, if there
were no consistent difference between long streets and short streets.
However, in Sydney, long streets tend to have many blocks of flats,
while short streets tend to have single-unit houses. The people living
on long streets tend to be poorer and more transient than others, and
include fewer families with children.
Found samples
Perhaps you have a list of names and addresses of some of your
audience, collected for a marketing purpose. This is known as a
found sample or convenience sample. Its tempting
to survey these people, because it seems so easy. But dont do
it! You have no way of knowing how representative such a sample is.
You can certainly get a result, but you wont know to what extent
that result is true of people who were not included in the sample.
Snowball samples
If youre researching a rare population, sometimes the
only feasible way to find its members is by asking others. First of
all, you somehow have to find a few members of the population - by any
method you can. That is the first round.
You now ask each of these first-round members if they know of any
others. These names form the second round.
Then you go to each of those second-round people, and ask them for
more names.
Keep repeating the process, for several more rounds. The important
thing is knowing when to stop. For each round, keep a count of the
number of names you get, and also the number of new names - people you
havent heard about before. Calculate the number of new names as
a percentage of the total number of names. For example, if one round
gives you 50 names, but 20 are for people who were mentioned in
earlier rounds, the percentage of new names for that round is 60%.
Youll probably find that the percentage of new names rises at
first, then drops sharply. When you start hearing about the same
people over and over again, it's time to stop - perhaps when the
percentage of new names drops to around 10%. This is often at the
fourth or fifth round.
You now have something close to a list of the whole population (and
many of them will know that youre planning some research). Using
that list, you can now draw a systematic sample, as detailed in
section 9 of this chapter.
Snowball sampling requires a lot of work, if the population is
large, because you need to draw up an almost-complete list of the
population. So this method works best when a population is very small.
But if the population is small enough to list every member without a
huge amount of work, you could do a census, rather than a sample: in
other words, contact all of them.
Snowball sampling works well when members of a population know the
other members. For example, if you are studying people who speak a
minority language, or who share some disability, theres a good
chance that most of them know of each other. The biggest problem with
snowball sampling is that isolated people, who are not known to other
members of the population, will not be included in your study, because
youll never find out about them. In the case of minorities,
sometimes the more successful members will blend into the ruling
culture, feeling no need to communicate with other members of that
minority. So if you survey only the ones who know each other, you may
get a false impression. A partial solution to this is to begin with a
telephone directory or other population list. If people in that
population have some distinctive family names, you can find them in
the directory, and take those people for the first round.
Stratification
The simplest type of sampling involves drawing one sample from
the whole survey area. If the coverage area of a radio station is a
large town and its surrounding countryside, there may be a population
list that covers the whole area - an electoral roll, perhaps. If you
want to select (say) 40 random addresses as starting points for a
door-to-door cluster survey, you could simply pick 40 addresses from
the population list.
This is simple, but theres a slight danger that all 40
addresses may be in the same part of the coverage area. This happened
to me once, when I planned a survey in Timaru in New Zealand. We drew
20 addresses as starting points, then plotted them on a map.
Unfortunately, they were all in one quarter of the town. I considered
throwing out the sample, and selecting another 20 addresses. But what
if the same imbalance occurred again?
The solution was to stratify the sample. Using census data from
small areas, Timaru was divided into four quarters, with almost
exactly equal populations. We then selected 5 addresses in each
quarter. This way, we were certain that the clusters would be spread
evenly across the town.
Stratification is easy to do, and you should use it whenever
possible. But for it to be possible, you need to have (a) census data
about smaller parts of the whole survey area, and (b) some way of
selecting the sample within each small area. For example, if you were
using a telephone directory as a sampling frame, each residential
listing might show the suburb where that number was. (It doesnt
matter if the person mentioned in the listing still lives there - you
use a telephone directory as a list of addresses, not people.) In this
case, youd need census data on the number of households in each
suburb, to be able to use stratification effectively.
The principle of stratification is simply that, if an area has X%
of the population, it should also have X% of the interviews.
Heres an example of a stratified sample design for the Amhara
Region of Ethiopia, from a survey I helped to organize there. The
Amhara Region is divided into 11 zones. For each zone, you find out
the population, work out what percentage it is of the total, then make
sure that the number of clusters is as close as possible to those
proportions. Produce a table laid out like this. You begin with the
numbers shown in bold type, and calculate the rest.
|
Zone
|
Population (1994) 000s |
% of total population |
Clusters |
% of clusters |
|
exact |
rounded |
| |
A |
B |
C |
D |
E |
|
North Gondar |
2
089 |
15.1 |
6.80 |
7 |
15.6 |
|
South Gondar |
1769 |
12.8 |
5.76 |
6 |
13.3 |
|
North Wello |
1260 |
9.1 |
4.10 |
4 |
8.9 |
|
South Wello |
2124 |
15.4 |
6.93 |
7 |
15.6 |
|
North Shewa |
1561 |
11.3 |
5.08 |
5 |
11.1 |
|
East Gojam |
1700 |
12.3 |
5.54 |
6 |
13.3 |
|
West Gojam |
1779 |
12.9 |
5.80 |
6 |
13.3 |
|
Wag Himera |
276 |
2.0 |
0.90 |
1 |
2.2 |
|
Agew Awi |
717 |
5.2 |
2.34 |
2 |
4.4 |
|
Oromiya |
463 |
3.3 |
1.48 |
1 |
2.2 |
|
Bahir Dar |
96 |
0.7 |
0.32 |
0 |
0 |
|
Total |
13 835 |
100.0 |
45 |
45 |
100.0 |
Calculate the table as follows. For each zone:
Figure in column B = Figure in
column A / the total of A x 100.
E.g. Column B for North Gondar = 2089 / 13835 x 100 = 15.1
Column C = B x total of C
Column D = same as C, but rounded to nearest whole number
Column E = D / total of D
As North Gondar has 15.1% of the population, it should also have
15.1% of the clusters.
But 15.1% of 45 clusters is 6.8, and you can't have 0.8 of a
cluster. So the number of clusters in North Gondar is rounded up to 7.
This process is repeated for each other zone. Sometimes, because of
the rounding, the total number of clusters is 1 more or less than the
total you planned for. To fix this, you can change the final number of
clusters, adding or subtracting 1. Another solution is to cheat a
little, by rounding an exact number of clusters in the wrong
direction: in the above table you could round 1.48 up to 2, or 5.54
down to 5, with very little effect on the accuracy of the proportions.
When you round a figure in the wrong direction, choose the number
ending in the closest figure to .5
You could also add a column F: the difference between B and E. The
maximum difference depends on the number of clusters, but should
usually be less than 2%. If the difference is too large, you may need
to have more clusters, with fewer interviews in each.
In the above table, there's a problem with the Bahir Dar zone. The
population there was only 96,000, so this zone needed 0.32 of a
cluster. This is rounded down to 0, so there are no clusters in that
zone, and therefore no interviews in that zone.
However, this makes nonsense of stratification: what do you do
about it?
There are three solutions:
(1) If an area would have no interviews, combine it with an
adjoining area. As Bahir Dar is inside West Gojam, these two zones
could be combined. The exact number of clusters would be 6.12 (5.80 +
0.32), which in this case still rounds down to 6. This is normally the
best solution.
(2) Round the 0.32 upwards instead of downwards, and include one
cluster in Bahir Dar. The total number of clusters would then be 46 -
but people living in Bahir Dar would be over-represented in the
survey: 2.2% of the clusters (1 in 46), but 0.7% of the
population.
(3) Change the cluster size in Bahir Dar. Instead of using clusters
of 8 households (as elsewhere), you could do a single cluster of 4
households. So now there would be 45.5 clusters, and Bahir Dar, with
0.7% of the population, would have 1.1% of the interviews.
That's close enough. However, having different cluster sizes often
confuses the interviewers, so you'd need two slightly different sets
of interviewer instructions.
Multi-stage sampling
With door-to-door surveys, sampling is done in several steps.
Often, the first step is stratification. For example, census data can
be used to select which districts in the survey area will be included.
In the second step, random sampling could be used, but each district
might need to be treated separately, depending on the information
available there. This would decide which households would be surveyed.
The third step would involve sampling individuals within households,
perhaps using quota sampling.
4. Random sampling
The concept of randomness
Before we discuss random sampling, you need to be clear about
the exact meaning of "random." In common speech, it means
"anything will do", but the meaning used in statistics is
much more precise: a person is chosen at random from a population when
every member of that population has the same chance of being sampled.
If some people have a higher chance than others, the selection is not
random. To maximize accuracy, surveys conducted on scientific
principles always use random samples.
Imagine a complete list of the population, with an entry for every
member: for example, a list of 1500 members of an organization,
numbered from 1 up to 1500. Suppose you want to survey 100 of them. To
draw a simple random sample, choose 100 different random numbers,
between 1 and 1500. Any member whose number is chosen will be
surveyed. If the same number comes up twice, the second occurrence is
ignored, as nobody will be surveyed more than once. So if the method
for selecting random numbers can produce the same number twice, about
110 selections will need to be made to get 100 people.
Another type of random sampling, called systematic sampling,
is more commonly used. This ensures that no number will come up twice.
No matter how many thousands of people you will interview, you need
only one random number for systematic sampling.
In the above example, you are surveying 1 member in 15. Think of
the population as divided into 100 groups, each with 15 people. You
need to choose one person from each group, so you choose a random
number between 1 and 15. Lets say this number is 7. You then
choose the 7th person in each group. If the members were numbered 1-15
in the first group, 16-30 in the second, 31-45 in the third, and so
on, you'd interview people with numbers 7, 22, and 37 - adding 15 each
time. Exactly 100 members would be chosen for the survey, and their
numbers would be evenly spread through the membership list.
Sources of random numbers
The commonest source of random numbers in most countries is the
serial numbers on banknotes. There can be no bias in using the last
few digits of the first banknote you happen to pull out of your
pocket, because there should be an equal chance of drawing each
possible combination of numbers. Other source of unpredictable large
numbers (from which you can use the last few digits) include lottery
results, public transport tickets, even stock market indexes.
You can also cheat. With systematic sampling, only one random
number is needed. Just think of a number, between 1 and the upper
limit. Though statisticians would frown, it will probably make no
difference to the results.
Principles of random sampling
The essential principle in survey research is that everybody in
the population to be surveyed should have an equal chance of being
questioned. If you do a survey, and everybody had an equal chance of
inclusion, youre in a position to estimate the accuracy of your
results.
Every survey has sampling variation. If you survey 100 people, and
get a certain result, this result will be slightly different than if
you had surveyed another group of 100 people. This is like tossing
coins: if you toss a coin 100 times, you know that there should be 50
heads and 50 tails. But the chances are quite strong (92 in 100, to be
exact) that you wont get exactly 50 heads and 50 tails. However,
the chances of getting 0 heads and 100 tails are practically
nonexistent.
Using statistical techniques, its possible to work out the
exact chances of every possible combination of heads and tails. For
example, there are 680 chances in 1000 that youll get between 45
and 55 heads in 100 throws. (If you doubt this, find 100 coins, throw
them 1000 times, and see the result for yourself!)
In the same way, even though you know the results from a survey
are not exactly accurate, they are probably pretty close but
only if every member of the surveyed population had an equal chance of
being included in the survey.
To estimate how much sampling error there is likely to be in a
survey result, use the following table. "Standard error"
means (roughly) the average difference between the true figure and
each case.
Table of standard errors
|
% of sample giving this answer |
Sample size (no. of interviews) |
|
100 |
200 |
400 |
800 |
|
5 or 95% |
2.2% |
1.6% |
1.1% |
0.8% |
|
10 or 90 |
3.0% |
2.1% |
1.5% |
1.1% |
|
15 or 85 |
3.6% |
2.5% |
1.8% |
1.3% |
|
20 or 80 |
4.0% |
2.8% |
2.0% |
1.4% |
|
30 or 70 |
4.6% |
3.3% |
2.3% |
1.6% |
|
40 or 60 |
4.9% |
3.5% |
2.4% |
1.7% |
|
50% |
5.0% |
3.5% |
2.5% |
1.8% |
When using the above table, think of each question as having two
possible answers. Although a question may have more than two answers
(e.g. age groups of under 25, 25 to 44, and 45 or over), the number
can always be reduced to two, conceptually.
For example, suppose 20% of a sample is in the 25 to 44 group.
Therefore, the other 80% is in the "not 25 to 44" age group.
The margin of error on this 20/80 split is 4%, so the true population
figure is likely to be anywhere between 16% and 24%. There is one
chance in three that it will be outside this range, and 1 chance in 20
that it be outside twice this range: i.e. less than 12 or more than
28%.
If all that sounds too difficult, just assume that the margin of
error is 5%, on any result. For example, if a survey finds that 25% of
the population listen to your station, it's likely that the true
figure will be somewhere between 20% and 30%. (Likely - but not
certain - because there's a small chance that the true figure could be
less than 20% or more than 30%. A well-known saying among
statisticians is "statistics means never having to say youre
certain.")
Always remember that the above table shows only sampling error,
which is fairly predictable. There could also be other, unpredictable,
sources of error.
Note in the above table that the margin of error for 400 interviews
is always half that for 100. This means that to halve the error in a
survey, you must quadruple the sample size. So unless you have a huge
budget, you must learn to tolerate sampling error.
5. Choosing a sample size
There are several
ways to choose a sample size: you can either calculate it from a
formula, or use a rough "rule of thumb."
The formula for calculating the sampling error to a survey question
is:
n = p x q / SE2
where:
n is the sample size: the number of people interviewed.
p is the percentage answering Yes to the question.
q is the percentage not answering Yes to the question.
SE is the standard error as shown in the table above.
An example
You guess that maybe a quarter of all people listen to your
station, so p is 25%, and q is 75%. You want the figure
to be correct within 3%. If you do find a figure of 25% who listen,
you want to make sure the true figure is between 22% and 28%. So to
calculate the required sample size:
n = 25 x 75 / (3 x 3)
= 208
This formula (which I have over-simplified slightly), is useful in
working out how big a sample size you need for a given survey. But to
calculate the sample size you first have to know roughly how many
people will answer Yes to the question, and also decide how large a
standard error you can tolerate. For beginners, this is not simple.
Another problem is that samples calculated in this way can be
horrifyingly large. For example, if you changed the tolerance from 3%
to 1% in the above example, youd have to interview 1875 people.
Yet another problem is that every question in a survey may require a
different sample size.
In an ideal world, youd calculate the sample size for a
survey as shown above, and cost would never be a problem. However, as
most surveys are done to a budget, your starting point in practice may
not be how much error you can tolerate, but rather how little error
you can get for a given cost.
To do this, you need to divide the cost of the survey into two
parts:
- a fixed part, whose cost is not proportional to sample size, and
- a variable part, for which the cost is so much per member of the
sample.
Once you have allocated a proportion of the total budget to the
fixed cost, and estimated the cost of getting back each completed
questionnaire, you can calculate the affordable sample size.
But what if you dont know the survey cost, and have to
recommend a sample size? This is where the rule-of-thumb is
useful.
For the majority of surveys, the sample size is between 200 and
2000. A sample below 200 is useful only if you have a very low budget,
and little or no information on what proportion of the population
engages in the activity of most interest to you or if the
entire population is not much larger than that. A sample size over
2000 is probably a waste of time and money, unless there are subgroups
of the population, which must be studied in detail.
If you dont absolutely need such large numbers, and have more
funds than you need, dont spend it on increasing the sample size
beyond the normal level. Instead, spend it on improving the quality of
the work: more interviewer training, more detailed supervision, more
verification, and more pre-testing. Better still, do two surveys: a
small one first, to get some idea of the data, then a larger one. With
the experience you gain on the first survey, the second one will be of
higher quality.
The sample size also depends on how much you know about the subject
in question. If you have no information at all on a subject, a sample
of only 100 can be quite useful, though its standard error is large.
Rule of thumb
Are you confused about which sample size to choose? Try my rule
of thumb:
|
Condition |
Recommended sample |
|
No previous experience at doing
surveys.
No existing survey data. |
100 to 200 |
|
Some previous experience, or some
previous data. Want to divide sample into sets of 2 groups (e.g.
young/old, male/female) |
200 to 400 |
|
Have previous experience and previous
data.
Want to divide sample into sets of up to 4 groups.
Want to compare with previous survey data. |
400 to 600 |
A common misconception
Consider this question: if a survey in a town with 10,000
people needs a sample of 400 for a given level of accuracy, what
sample size would you need for the same level of accuracy in the whole
country, with a population of 10,000,000? (That's 1000 times the
population of the town.)
Did you guess 400,000? Many people do. The correct answer is 400.4
- you might as well call it 400.
The formula I gave above isn't quite complete. The full version has
what's called the finite population correction (or FPC) added
to the end, so the full formula is:
n = p x q / SE2 x (N-n)/N
where N is the
population. Unless the sample size is more than about 5% of the
population, the (N-n)/N bit (the FPC) makes almost no
difference to the required sample size.
Is that too technical? Think of it another way. Imagine that you
have a bowl of soup. You dont know what flavour it is. So you
stir the soup in the bowl, take a spoonful, and sip it. The bowl of
soup is the population, and the spoonful is the sample. As long as the
bowl is well-stirred (so that each spoonful is a random sample), the
size of the bowl is irrelevant. If the bowl was twice the size, you
wouldnt need to take two spoonfuls to assess the flavour: one
spoonful would still be fine. This is equally true for human
populations.
6. Nonrandom sampling
Though random sampling is
the ideal, sometimes its not possible. In some countries, census
information is either not available, or so far out of date that
its useless. Even when good census data exists, there may be no
maps showing the boundaries of the areas to which the data applies.
And even when there exist both good census data and related maps,
there may be no sampling frames.
The good news (from a sampling point of view) is that these
conditions usually apply in very poor and undeveloped countries with
large rural populations. In my experience, theres not a wide
range of variation in these populations. This is a difficult thing to
prove, but I suspect that the more developed a country, the more
differences there are between its citizens. All this is a way of
saying that where random sampling is not possible, perhaps its
not so necessary.
The best solution I can think of is to use maximum diversity
sampling, described briefly in section 3 of this chapter.
Maximum-diversity samples are normally drawn in several stages, so
they are multi-stage samples. The first stage is to decide which parts
of the population area will be surveyed. For example, if a survey is
to represent a whole province, and its not feasible to survey
every part of the province, you must decide which parts of the
province will be included. Lets assume that these parts are
called counties, and you will need to select some of these.
Maximum-diversity
sampling works like this:
Stage 1
1. Think of all the ways in which the counties differ from the
province as a whole -specially ways relevant to the subject of the
survey. If the survey is about FM radio, and some areas are hilly,
reception may be poorer there. If the survey is about malaria, and
some counties have large swamps with a lot of mosquitoes, that will be
a factor. If the survey will be related to wealth or education levels
(as many surveys are), try to find out which counties have the richest
and best-educated people, and which have the poorest and
least-educated. Try to think of about 10 factors, which are relevant
to the survey.
2. Try to find objective
data about these factors. Failing that, try to find experts on the
topics, or people who have travelled around the whole province. Using
this information, for each factor make a list of the counties which
have a high level of the factor (e.g. lots of mountains, lots of
swamps, wealthy) and counties which have a low level (e.g. all flat,
no swamps, poor).
3. The counties mentioned
most often in these lists of extremes should be included in the
survey. Mark these counties on a map of the province. Has any large
area been omitted? If so, add another county, which is as far as
possible from all the others mentioned.
Stage 2
When the counties (or whatever they are called) have been chosen,
the next stage is to work out where in each county the cluster should
be chosen. Continue the maximum-diversity principle by using the same
principle in each country as in stage 1. If a county was chosen for
its swampiness and flatness, choose the flattest and swampiest area in
the country. If it was chosen for its mountains and wealth, choose a
wealthy mountainous area.
To find out where these areas are, you will probably need to travel
to that county and speak to local officials. Sometimes you then find
that there are local population lists - e.g. lists of all houses in
the area. In that case, you might be able to use random sampling for
the final stage. If there are no population lists you can use, the
surveyed households will have to be chosen by block listing, aerial
photographs, or radial sampling - see section ii for details of these
methods.
Maximum diversity sampling can produce samples that are as
representative as random samples. The problem is that you can never be
sure of this.
7. Choosing the sampling unit
Now you need to
choose your sampling unit: what will you sample? It seems obvious at
first: your sample will be people, because only people can be
interviewed.
In fact, its not that simple, specially with door to door
surveys. Most door to door surveys begin by sampling dwellings. (A
dwelling is the place where the household lives: households are
people, dwellings are homes.) Dwellings are easier to find than
people: they dont move around. Even if you make your initial
sample from a list of people, such as an electoral roll, youll
find that some people have moved since the list was compiled.
Its much easier to sample dwellings, and then, as a second
stage, interview the people who live in those selected dwellings.
Sometimes its more appropriate to sample households than
people. For example, a few years ago I organized an Australia survey
about media usage. Part of this survey asked about the types of media
equipment that were available in households. In each household, the
interviewers asked for the person who knew most about technology. This
person was then asked questions such as "How many radios in this
household can receive FM programs?" The average numbers reported
in the survey were then applied to the whole population of Australian
households. We were able to make statements such as "there are
between 29 and 31 million FM radios in Australia."
When the sampling unit is people, some parts of the population are
usually excluded. Usually, children below some minimum age are
excluded - because they dont do the activity the survey deals
with (e.g. reading newspapers), and also because interviews with
children must be done differently. Normal questionnaires are usually
too difficult for them. Depending on the subject of the survey, the
minimum age is usually between 10 and 18 - most commonly, 15. Children
under 10 seldom listen to radio, or read newspapers, so theres
no problem excluding them if this is the subject of your survey. But
children as young as 2 watch TV, so any TV survey that does not
involve young viewers will be incomplete. The best solution is usually
to survey only people aged 10 or over, acknowledge the lack of data
from younger viewers, and to do a separate study among children aged
under 10, using observation instead of questionnaires.
Door-to-door surveys usually exclude people who dont live in
private households: visitors in hotels, troops in barracks, homeless
people, and so on. These people are usually only a few percent of the
population, so excluding them makes very little difference to the
survey results. For any proposed door-to-door survey, you should try
to find out how many people you will not be able to reach, and whether
these people are likely to give different answers from the others.
In the 1980s, an Australian government department did a telephone
survey with teenagers, and found a surprisingly low rate of
unemployment - because it mainly reached teenagers who were living
with their parents, in households rich enough to have telephones. At
the time, only 10% of households had no telephones - but these were
the poorest households.
8. Selecting samples from lists
If you have a
complete list of all people in the population, with addresses, you can
use this to draw a sample.
When population lists are available, they are often for specific
populations, such as all people who work in a particular organization.
Even when a list is supposed to contain the entire population, it
usually doesnt: perhaps because its out of date, or
because certain types of people are excluded. Here are some lists that
usually claim to apply to an entire population. Some of these
populations are people, some are households, and some can be used as
both.
Electoral rolls
Though an electoral roll is designed as a sample of people, it
can be used as a sample of households. People may come and go, but
addresses stay the same.
In most countries, electoral rolls are not very complete. I made a
study in South Australia, around 1990, and found that approximately
20% of people were not on the electoral rolls at their current
address. Some were not citizens, some had not bothered to enrol, and
some (e.g. police and judges) were excluded from published rolls.
Also, many people had moved in the several years since the rolls were
last updated. And of course, electoral rolls always exclude people
below the minimum voting age - though children over 10 can usually
answer survey questions well.
In some countries, such as Britain, the situation is much better,
because electoral rolls are updated every year, and printed in a very
convenient street order (unlike the alphabetical order of surname used
in Australia).
Before you use an electoral roll to represent the entire public, I
suggest you take a small geographical area - perhaps a few street
blocks, or a few hundred dwellings - and check how many people in that
area are on the rolls. If the figure is less than 90%, look for a
better population list.
A good compromise with electoral rolls is to use them with
multi-stage sampling: i.e. carry out a cluster survey, and use
electoral rolls only to choose the starting point for each cluster. If
the sample is stratified, and the number of clusters in each small
area is proportional to the population of that area, this helps to
ensure that people living in areas where many are not on the electoral
roll will still be included in the survey.
Street directories
These are lists of addresses, in street order. Typically, you
first look up a locality, and find all the streets in that area,
listed in alphabetical order. For each street, addresses are listed in
numerical order. Where up-to-date street directories exist, they are
an excellent source of addresses for door-to-door surveys. But they
are often incomplete, omitting many addresses - particularly where
several dwellings share one street address.
Telephone directories
A few years ago in rich countries, telephone directories were
an excellent population list with one (and only one) entry for every
household. In Australia in the late 1980s, over 90% of households had
a telephone, few households were unlisted with "silent
numbers", business numbers were clearly separated from
residential numbers, few households had more than one number or
answering machines, and few people had mobile phones.
But now, its all a mess. A telephone directory is no longer a
good population list. Many households have more than one phone number,
and these are usually the wealthier households. Other households have
only mobile phones, which are not usually listed in printed
directories. Because mobile phones are carried around, any directory
of mobile phone numbers will be a sampling frame of people, not of
households.
See the chapter on telephone surveys for full instructions on how
to draw samples from telephone directories.
Utility subscribers
The advantage of a telephone directory, in an area where nearly
everybody has a phone at home, is that it is a readily available list
which includes most dwellings. But other such lists often exist. In
areas where nearly every household has electricity from the mains, you
can sometimes get access to a list of electrical subscribers. Unlike
telephone directories, which are becoming messier by the day - with
households having multiple numbers, unlisted numbers, and
home-business numbers, utility lists mention each household once and
only once.
Other utilities which can be used are those for services which
almost all households use - such as local government, water supply,
sewerage, rubbish collection, and so on. These lists are usually up
to date and accurate. However, they are not published, so the main
problem with using these for survey sampling is getting access to them
from the authorities that own them.
Other population lists
Many organizations have mailing lists of their subscribers or
users. These can easily be used to draw samples for surveys, using
systematic sampling. They are usually samples of people.
A lot of these lists are not representative of a population. For
example, a radio network I once worked with held a competition with a
very valuable prize (an overseas holiday). All competitors had their
names and addresses entered into a database, possibly to be surveyed
later. Though competitors werent meant to put in more than one
entry, we found that some addresses had 20 entrants. Judging from the
names, it seemed that some people had entered their pets!
Before using any population list, find out more about it. Analyse
it closely, and consider:
1. Is it a sample of people, of households, or what? Is it suitable
for your purpose?
2. How complete is it?
3. How up to date is it?
4. How accurate is it?
5. Does it contain duplication?
A simple way to check a list is to find 20 people who should be on
it, and look them up on it. On a perfect list, all of them will be
there once (and only once), and all information will be up to date.
This is rare!
Cleaning lists
Before using any list to generate a sample, I suggest you bring
it into a computer program with which you can view the data in a
spreadsheet-like format (rows and columns, with one row for each
person, and a column for each piece of information). Then sort it on
every field in turn. Examine the first and last few entries in each
field. If there are any problems, you are likely to find them at the
top or bottom of a column.
Read carefully through the list, searching for duplicates.
Its often easier to see mistakes in print than on a computer
screen. If possible, print it out, in several different sequences, and
have a different person check each printout. Youll probably be
amazed how many obvious errors you find. Youll find some people
on it several times, maybe with different addresses or slightly
different versions of their name.
The biggest problem with an incomplete list is that its
likely to be biased in some way: in other words, it may not represent
a typical cross-section of your audience. If this problem exists,
random sampling cannot fix it: all youll get is a representative
sample of an unrepresentative list.
It can be tempting to use these "found" lists. It can be
cheap and easy to do a mail survey with such a list, but if you
dont know how representative it is, the results it produces can
be extremely misleading. Never rely on the results from a single
survey, unless you know it's random.
Finding and creating maps
If you only have a map, and no information about where the
population is spread across the map area, its difficult to
achieve an accurate sample. But its more difficult still when
you dont even have a map. Unfortunately, this is common in
developing countries. Even though census data at a district level may
be available, if there are no detailed maps it can be difficult to
relate the census place-names to areas on the ground.
So its vital to find or create a local map. Sometimes these
are painted on walls at local government offices, and you can copy
this by hand from the wall (or photograph it).
If you cant find a map, hold a meeting some well-informed
local people and create a hand-drawn map. This does not need to be
exactly to scale. It should show all locality names and main roads in
the district.
9. How to draw a sample from a population list
The simplest
method of sampling from population lists is to use systematic
sampling. This means that you divide the list into a number of equal
groups, select one random number, and sample the same location in each
group. Heres how.
1. Find the sampling interval
Divide the size of the list by the number of sampling points
wanted. (If you are using a stratified sample, as described above,
its more complicated: you need to use a separate list for each
stratum.)
For example, you may have a list of households, taking up 411
pages. Lets say youre doing a cluster survey, and you need
40 starting points. Divide 411 by 40. The answer is 10.275. That means
dividing the list into 40 groups of 10.275 pages. This can be done,
but it is not easy - youd be counting a lot of lines. So I
suggest taking 40 groups of 10 pages, skipping one page after every 4
groups, so that the unused pages are evenly spread through the
list.
2. Draw a random number.
Banknotes are a good source of random numbers. The last few
digits of a banknotes serial number are effectively random. Find
a banknote, and take down its last 3 digits. I just did this: the
serial number was VG 95872658. The last 3 digits are 658. Interpret
this as meaning that in each group of 10 pages (or whatever you have
divided your list into), you will take the entry that is 658
thousandths (65.8%) of the way through the list.
3. Work out which entries this random number corresponds to
65.8% of 10 pages is 6.58 pages. How much is 0.58 of a page?
An easy way to do this (if every household takes up the same number of
lines) is to use a ruler, and measure the height of the address list
on the printed page. If there are two columns, each 235 mm high,
thats 470 mm of addresses. 65.8% of that is 309 mm: this means
74 mm down the second column.
So for each group of 10 pages, find the address 74 mm down the
second column of the 7th page: thats your random address. Repeat
this 40 times, and theres your list of random starting points.
To save counting lines, you can make a card to show you which lines
to choose. Measure the distance from the foot of the page to the line
you need, and cut a piece of card exactly that high. If you hold the
bottom of the card level with the bottom of the page with your thumbs,
the first line visible above the card will be the one with the
selected number.
So thats systematic sampling: the advantage is that you draw
only one random number, and use it over and over again. You should
look for two problems:
1. There is a tiny chance that the population list is arranged so
that theres a regular sequence in the entries. Maybe in a list
of people, every alternate one is a man and every other one a woman.
If you use systematic sampling, you would select either only men or
only women. (Its extremely unlikely that any list would be
ordered like this, but it would badly spoil your sample.)
2. Its easier to round off the sampling interval so each
group comes from a whole number of pages. This means that there will
be some unused pages. Dont leave all of these at the end -
scatter them throughout the list. If the list is in geographical
order, this will ensure you dont exclude a large area. Later,
you may be able to use some addresses on these unused pages, to
replace sampled addresses that turn out to no longer exist.
Stratification of lists
If you are designing a stratified sample (dividing the survey
area into a number of smaller areas, and taking a separate sample from
each smaller area) you should check any population list you want to
use, to see if is already stratified in a suitable form.
As stratification is based on Census data, the population list you
use must be divided into Census areas. If it is not already divided in
this way (e.g. a telephone directory covering more than the whole
survey area) many hours work will be needed to draw a properly
stratified sample.
10. Choosing the place of interview
People can be
interviewed in three main places: at their homes, at their workplaces,
and in public places. In most surveys, people are interviewed at home.
As almost everybody has one home, home-based sampling provides a
better coverage of the population than samples based on workplace
(because not everybody has a job) and public places (because some
people spend very little time there).
With a probability sample, it's usual to interview people at home,
because it's usually the homes that are sampled, rather than the
people who live in them.
If you are using a quota sample, people can be interviewed anywhere
you find them: at home, at work, or in a street. Though this seems
easier, it's not as valid - see section 3 above.
With some types of sample, it's better to find people at some place
other than their home. If you are surveying the workforce of an
organization, it may be more convenient to interview them at work (as
long as they'll tell the truth there). If you are surveying people
about shopping - common with market research, but rare with audience
research - it can be better to survey them in a shopping area (see
section 15 of this chapter). And if you are surveying the audience to
some kind of event, the obvious place to interview them is at the
event: see the chapter on event surveys for more details of this.
11. Selecting starting points for door-to-door surveys
Door-to-door
surveys nearly always use cluster sampling, because it is so
much cheaper than choosing individual households at random. When you
are using clusters, the sampling is done in at least three stages:
1. Choose the starting address (at random, from a list)
2. Choose a random route to take after finding the starting
address: a route that gives every household in the cluster an equal
chance of being selected for the survey.
3. Choose one or more persons in each selected household.
When people are surveyed in their homes, usually one person is
selected at each address.
With 500 respondents, 500 separate addresses would be used. Unless
the survey was confined to a small town, the chances are that these
addresses would be widely scattered. This would provide wide
geographical coverage, but much time would be wasted going from one
dwelling to another. In a large area, more of the interviewers
time would be taken up by travelling than by doing interviews.
To increase productivity, surveys normally use cluster samples.
Instead of selecting 500 individual addresses at random, only 50 might
be chosen, and a cluster of 10 neighbouring households surveyed at
each point. So there would be 50 clusters each of 10 households.
You can see intuitively that taking only 50 separate parts of the
city is not going to be as representative as taking 500, because
neighbours tend to be similar in their habits and characteristics. To
equal the accuracy of a simple random sample of 500, a cluster sample
would need about 750 people. However, it is cheaper to interview 750
people in clusters than 500 individually.
Clustering saves most money when interviews are brief, and travel
cost (from home or office to cluster) is high, and few or no extra
trips to the cluster need be made, to interview the last few
respondents. If your survey is in a large city, and you have few
interviewers, and questionnaires are left for respondents to fill in
and be collected on a return trip, clustering can save a tremendous
amount of money, and clusters can be quite large.
In practical terms, cluster sizes usually range between 3 and 20
households. If clusters are too small, travel costs rise, but (for a
fixed sample size) there will be more clusters, and the effective
sample size will be larger. If clusters are too large, travel costs
will be less, but the effective sample size will also be smaller.
Another problem with large clusters is that interviewers can run out
of households.
A good compromise in most situations is to have about 10 households
per cluster.
Three ways of selecting cluster starting points
(1) Using a local population list
In many countries, local authority offices have a list showing
all households. If the authorities co-operate, this can be used to
draw a random number, to select the starting point for a cluster.
Other alternatives, as discussed above, include electoral rolls,
street directories, and telephone directories.
Use systematic sampling, as explained in more detail above. For
example, if you want to select 5 clusters in a village with 600
households:
1. Find a list of all households.
2. Divide this list into 5 equal sections, each with 120
households.
3. Choose a random number between 1 and 120 (e.g. using the last
digits of a serial number on a banknote). Say it's 57.
4. In each of the 5 sections of the household list, choose the 57th
household.
(2) Block listing
If you cant find a population list, what can you do? The
answer: create one. This is called block listing. It is time
consuming, and therefore expensive. But when accuracy is important,
and labour is cheap, block listing is the ideal choice. It will also
be more up to date than any official population list.
To start with, youll need a large map, because it will have
to show every street or road in the district. If such a map
doesnt already exist, youll build it up as you go. It
doesnt need to be exactly to scale. For a large district,
its best to have a number of partial maps, and assign one
interviewer to work on each.
Interviewers are now sent out to walk the whole length of every
road in their assigned area. Nobody is interviewed at this stage, but
the interviewers note each dwelling, and write a brief description
(enough to distinguish it from its neighbours). If a street is not
already on the map, it must be marked there. Dwellings that are
clearly unoccupied are noted as such.
One interviewer can list several hundred dwellings in a days
work - but this depends on the distance between dwellings, the
difficulty of counting separate dwellings, and the interviewers
ability to fend off interruptions from curious residents.
When every house on every road is listed, you have created a street
directory for the survey area. When the block listing is completed,
count the total number of dwellings listed - ignoring unoccupied
dwellings. Number each dwelling, from 1 upwards, giving adjacent
dwellings adjacent numbers (where possible). You can take a systematic
sample from this list to work out the starting points for clusters.
(3) Area-based sampling
When no population list is available, and block listing is too
expensive, the only other method of finding cluster starting points is
to use area-based sampling -which is similar to sampling from maps.
This type of sampling is not ideal, because it gives each area of land
an equal chance of being surveyed, not each person. Therefore, people
who live in thinly populated areas have a greater chance of being
included in the survey. As towns are densely populated, people living
in towns will have a lower chance of being included.
There are several solutions to this problem, but the best solution
may be different in each cluster.
Separating areas of
equal population density
One solution is to survey towns separately from rural areas. In
some rural areas -for example, fertile plains - the rural population
is distributed quite evenly, with most people living on small farms.
As long as the area of each cluster has a consistent population
density, area-based sampling (e.g. from a map) will be reasonably
accurate.
Aerial photographs
The second solution uses aerial photos. If you can get an
aerial photograph of the area where the cluster is, you can count the
roofs, number them, and form a sampling frame that way. If the scale
is no greater than 1:10,000, and the roofs are clearly visible on the
photo, this works quite well. However, aerial photos are sometimes
many years out of date. Professional aerial photography is very
expensive, because special aircraft are used. But if you hire a small
plane for an hour or so, and take photos from that, this is much
cheaper than block listing. Choose a time of day when roofs are most
visible - this varies with the roof materials, roof shape, and the
weather. In some places, the middle of the day is best; in others,
early morning or late afternoon. The best height is about 5,000 feet
(1,500 metres): below this, the area of each photo is too small, and
above it, individual roofs are too difficult to make out. If possible,
use a high-wing plane, so that the wings dont get in the way. If
you are taking photos though the windows, avoid using an auto-focus
camera; these often try to focus on the glass. To be safe, have two
photographers (one on each side of the plane) and two different
cameras. Though official aerial photos are usually in black-and-white,
colour photos are easier to interpret.
When the photos are developed, there will be a lot of overlapping.
Its best to number the photos, and draw lines on each to show
where other photos overlap -otherwise its too easy to count some
roofs twice.
Radial sampling
The third solution, which I call radial sampling, works
well in countries where most people live along roads, and there are
not a lot of roads. This often applies in south-east Asia.
For example, in 1997 I was training people in Laos in survey
methods. Our class, with 12 students, decided to do a survey in the
town of Phonhong, about two hours drive north of Vientiane (the
capital). We had no information about Phonhong except its total
population. Our first stop was at the local authority office, where we
found the only map of the town: it was painted on a wall. We found
that the town was Y-shaped; it had grown around the intersection of
three roads.
By driving though the town, we found that early everybody lived on
these three roads. The number of houses on each road was approximately
equal. There and then, we decided to divide the town into six strips:
three roads, each with two sides. The 12 trainees were divided into 6
teams, with 2 people in each. Three teams started from the central
junction and worked outwards. The other three started at the edge of
town and worked inwards, from the opposite side of each road
- as shown on the above map.
The result was probably a good sample of the Phonhong population. I
say "probably" because we had no way of being certain. If
this had been a real survey, not just a training exercise, Id
have done a block-listing first, because the town had only about 600
households and we surveyed about 120 of these.
This method, radial sampling, works in any town or district
where a number of roads meet in a central place. Heres a more
systematic set of rules for radial sampling.
1. Draw a very
rough map of the area for the cluster/s, showing only the roads that
meet, and approximate distances.
2. For each cluster, choose 3 random numbers (e.g. the last 3
digits of a banknote serial number).
3. The first random number selects the direction from the centre
point. 0 = north, s = south, and so on. (This is like dividing a clock
face into 10 parts instead of 12.)
4. Use the second random number to select the distance from the
centre point. o = the centre, and 9 = the outer boundary.
5. If the second random number was o, the interviewing must work
outwards from the centre. If it was 9, the interviewing must go back
towards the centre. If it was neither 0 nor 9, look at the 3rd random
number. If this is odd, work outwards. If it is even, work
inwards.
When most people live
along the radial roads, this method will produce a representative
sample. The exception is when densely populated areas (e.g. squatter
settlements) are in areas between the main roads. Radial sampling will
often miss these areas - and block listing or recent aerial
photographs would be better.
12. Sampling inside clusters
When the starting
point of a cluster has been chosen, how is the rest of the cluster
then decided? The interviewer finds at the randomly selected address,
then follows a set of rules to work out which addresses will be chosen
for interviews. The important thing is to have some rules. Don't let
the interviewer choose - because the houses where interviewers prefer
to go are not typical. Interviewers always prefer to visit rich homes
rather than poor ones, homes where somebody is there, and homes that
are easy to reach.
Interviewers also prefer to talk to people who are similar to
them. In developing countries, interviewers are usually well educated,
and don't like speaking to people they see as ignorant. This causes a
bias in many surveys, making it appear that populations are better
educated and wealthier than in fact they are. Although cluster
sequence rules are arbitrary, they must be followed.
In each cluster, the address selected at random is usually not
surveyed. This may seem strange, but the chances are that the
population list from which that address was drawn is incomplete.
Excluding the address actually selected partially compensates for the
under-representation of other addresses on the population list.
A common set of rules for making a cluster is:
(1) Find the
address selected at random.
(2) Going around the street block anti-clockwise, ignore the
address next door to the one selected at random. Make your first call
at the next address to that: two to the right of it, when looking from
across the street - or two higher in number, if there are a number of
dwellings at one address.
(3) Continue to call at every second address, going anti-clockwise
around the block. (Turn left at every street corner.)
(4) If you get right around the block without having located enough
addresses to make up the cluster, cross the road outside the address
originally selected, and start to go around the neighbouring block,
again anti-clockwise, again taking every second dwelling.
(5) If you run out of houses, and theres a section of road
where nobody lives (for at least 1 kilometre), cross the road and come
back along the other side.
(6) What do you do if you run out of households? This seldom
happens, but it needs to be anticipated. A simple solution is to
extend the next cluster by the number of households missing. For
example, if the plan is for 12 households in a cluster, but one
cluster only has 10, the closest unfinished cluster should have another
2 added.
In this map, the starting point (marked Start) is just before the dwelling
marked 1. Every second dwelling is ignored. Those marked x were
selected, but did not result in interviews (due to refusals, etc.) and had
to be replaced. Note the route taken when the interviewers went around
the block and reached the starting point again: they crossed the road,
turned around, and kept going in the opposite direction - still turning
left whenever necessary.
Why every second household, and not every household? (This is
called a skip interval of 2 households.) Mainly because
neighbouring households tend to be more similar to each other. So
using a skip interval brings more variety into the cluster, while
still keeping it reasonably compact. The fewer households in a
cluster, the more addresses should be skipped. But when a cluster
includes more than about 50 dwellings, including skipped ones, it
becomes too large (specially in rural areas), and some of the cost
savings disappear.
Why go around the block, and not continue in a straight line?
Because this would favour households living on main roads - which are
often richer than those living in side streets. Where everybody lives
on a long road (as they do in some parts of the world) there are no
street blocks: observing the above rules, the route will simply follow
the road.
Why keep turning left, and not right? This is completely arbitrary;
it's just a convention. Change it, if you like - but don't give
interviewers a choice in each cluster.
Instructions to interviewers should make it clear exactly what you
mean by cluster size. What happens when a household is visited and
nobody is home, or the occupants refuse to take part in the survey?
Are these households counted in the cluster, or not?
The simplest solution is to keep going, adding more households to
the end of the route, until interviews have been done at the required
number of households. However, substitutes are not usually taken until
a dwelling has been visited at least three times, in an attempt to
find somebody home.
13. How many respondents in each household?
Another factor to
take into account when designing a sample for a door-to-door survey is
the number of respondents in each participating household.
For a personal-interview survey, when each respondent is questioned
directly by the interviewer, its easiest to interview only one
person per household. If, as is common, others are present during an
interview, those who have already heard all the questions may give
different answers from the initial respondent. If most of the
questions relate to facts which would be known by anybody in the
household (e.g. "how many television sets are at this
address?") having extra people present may produce more accurate
results. But for questions asking about personal attitudes, it is best
not to have anybody else present, so that the selected respondent will
feel free to give his or her true opinion.
An exception to interviewing only one person occurs when the focus
of the survey is on something that is not particularly common. A
survey of computer users, for example, may begin with the interviewer
asking "Does anybody in this household use a computer?" and
interviewing all computer users, if the household had more than one.
A estimation problem which occurs when only one person in a
household is interviewed: people in small households will be
over-represented. Among all households contacted for a survey, people
living alone will have a 100% probability of being interviewed. But in
a household with four eligible persons, each of these people will have
only a 25% chance.
In Australia, about 10% of adults live alone, but these make up 20%
of all households, and their media use habits are quite different from
those of larger households. In developing countries, which generally
have more people per household, single-person households are rarer, so
it will not distort results so much to interview one person in each
household. The easiest way to compensate for an excess of small
households in the survey is for the interviewer to find out how many
adults live in each household visited. Then multiple interviews can be
made at larger households.
By "larger" households, I mean 3 or more people in
developed countries, and 4 or more in developing countries (where
households tend to be larger).
If you interview more people in larger households, this can
slightly increase the accuracy of the survey, but you will be unable
to determine the exact sample size in advance. The simplest solution
is to base your calculations on one person per household. Not many
households will have two interviews, so the final sample size will be
perhaps 5% to 10% larger than you planned.
If you survey all people in the household (except perhaps
children), this solves one problem, but creates several others:
- Sometimes one person can affect the answers given by others. This
applies specially to questions that measure knowledge, such as
"Please name all the radio stations you can think of."
- The effective sample size is lowered, for some types of question.
This is equivalent to increasing the sampling error. If you ask 100
people in 100 households how many radios their household has, this
result will be based on more radios than if you ask 100 people in 50
households the same question.
- It is awkward to interview more than one person in the same
household. Often, after hearing two interviews, prospective
respondents will simply say "my answers are the same", and
refuse to be fully interviewed.
Another approach is to interview all household members at the same
time, using a single questionnaire. We used this in a survey of
Aboriginal people in central Australia. In the evenings, they usually
sat outdoors in small groups, listening to portable radios. The
interviewers would approach one of these groups, play brief taped
extracts from radio programs, and ask the respondents opinions
of each program. But the questionnaire was different from a normal
one: instead of ticking boxes for "like it", "dislike
it", and "not sure", the interviewer would write in the
number of people giving each possible answer.
In a survey where respondents fill in their own questionnaires, and
these are collected later by the interviewer, its normal to give
a questionnaire to each person in the household. This boosts the
sample size at little extra cost, but also helps prevent people
filling in questionnaires intended for another member of the
household.
14. Choosing respondents within households
A common mistake
in survey research is to interview the first person met in each
household. This will produce a badly skewed sample, nullifying any
care that has been taken in producing a representative sample of
households. This is important for any survey, but particularly for
surveys measuring radio and TV audiences.
What is the problem with interviewing the first person the
interviewer meets? It's because the more time somebody spends at home,
the more chance they have of being interviewed, with this method of
choosing respondents. People who spend a lot of time at home have
different habits from people who are out a lot. For example, most
radio and TV viewing is done at home, so if the first person found in
each household is interviewed, the survey will overestimate the amount
of listening and viewing.
For the same reason, surveys carried out in streets and public
places will usually underestimate radio and TV audiences.
In Australia, some types of people (e.g. women and younger people)
are much more likely than others to answer the door (or the telephone)
when an interviewer visits. In other countries, such as Western Samoa,
it is normal for the oldest man in the household to greet any
strangers.
The best approach is for the interviewer to speak to the first
person met, work out who should be interviewed, then to interview the
appropriate person. There are three main methods for choosing a
respondent: the birthday method, the Kish Grid, and quotas.
The birthday method
Most market research books recommend asking for the person who
last had a birthday (or who next will have one). In theory, everybody
in a household has an equal chance of being selected by this last
birthday or next birthday method, but my research has found
this does not produce the correct balance of sexes and age groups.
Also, it only works in households where everybody knows everybody
else's birthday. In countries where birthdays are not celebrated, many
people don't know their family's birthdays.
The Kish Grid
This is a table of numbers, named after the statistician who
invented it. The number of people in the household is discovered, and
a random number is chosen to select a particular person.
My research in Australia found that the Kish method can cause a
high refusal rate: elderly women, in particular, are often suspicious
when the first question in a survey is "How many people live in
your household?" particularly if they live alone. In
developing countries, where few old people live alone, this may not be
a problem. Heres an example of a Kish grid, with instructions.
This is based on 8 households per cluster, interviewing 1 person per
household.
Instructions
for using Kish Grid
1. Find out how many people living in the household are
eligible to be interviewed. Include people who sleep there, but are
not there when you visit. Ignore children aged under 15.
2. The youngest (excluding children under 15) is number 1, the
second youngest is number 2, and so on.
3. The first household where you do an interview is household 1,
the second is household 2, and so on, up to household 8 - the last in
the cluster.
4. Look up the column for the household number, and the row for the
number of eligible people. The number in the cell where the column and
row meet is the person to interview. For example, if household 2 has 3
adults, interview the 2nd youngest (shown in bold type). If that
person is not there when you call, arrange to come back later.
Eligible
people |
Household |
|
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
|
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
|
2 |
1 |
2 |
1 |
2 |
1 |
2 |
1 |
2 |
|
3 |
1 |
2 |
3 |
1 |
2 |
3 |
1 |
2 |
|
4 |
1 |
2 |
3 |
4 |
1 |
2 |
3 |
4 |
|
5 |
1 |
2 |
3 |
4 |
5 |
3 |
4 |
5 |
|
6 |
1 |
2 |
3 |
4 |
5 |
6 |
3 |
6 |
|
7 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
4 |
|
8 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
|
9 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
|
10 or more |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
The reason for numbering the household members from the youngest
upwards (instead of the seemingly more obvious oldest downwards) is
that younger people are more difficult to find at home, so the above
grid gives young people a slightly higher chance of being
interviewed.
Quota selection within a household
When selecting a respondent within a household, the most
practical method is often a type of quota sampling. Though quota
sampling was criticized earlier in this chapter, most of its problems
do not apply when selecting a member of a household.
A common approach is to interview a woman in half of all
households, and a man in the other half in most parts of the
world, where the sex balance is close to 50/50.
To ensure a good balance of old and young people, age-based quotas
can also be applied. One of the simplest quota systems is to ask to
interview the oldest person in the household (in half the households
visited) and the youngest eligible person (in the other half of the
households).
Household quotas can be based on other factors apart from sex or
age group. It can be useful in radio and TV surveys to have separate
quotas for people who stay home most of the time (housewives, retired
and unemployed people), and those who spend less time at home: workers
and students. Of course, such quotas must be based on known figures,
usually on Census data. If 60% of the eligible population are workers
and students, and 40% stay home, and the quotas reflect these
percentages, 60% of respondents will be workers and students.
15. Sampling people in public places
If survey results
are to be projected to the general population, a bad way of selecting
a sample is to interview people in the street or at a shopping centre,
particularly on a weekday. Workers and students are under-represented
in such surveys, as are people who are too busy to be interviewed, and
those who seldom walk around streets or shopping centres. About the
only valid use of shopping-centre surveys is when the population of
interest is shoppers. Market research companies do many surveys in
shopping centres, usually about products bought in shops. Because
their target audience is shoppers, these surveys are reasonably
representative.
Another problem with surveys in public places is that they often
greatly under-estimate broadcast audiences - because people who spend
a lot of time in public places (and therefore less at home, where they
might watch TV or listen to radio) are more likely to be
interviewed.
A partial solution to this problem is to control for how much time
a respondent is likely to spend away from home, by setting quotas
based on employment status and age, as shown in Census data. For
example, if 15% of the whole population are students aged under 25,
then 15% of respondents should be in the same category. This method is
far from perfect, but it produces more accurate results than not using
quotas.
Occasionally, theres no alternative to doing surveys in
public places. In cities in Papua New Guinea, for example, the crime
rate is horrendous. Houses are surrounded by high fences with locked
gates, and guarded by fierce dogs. Interviewers cannot get access
during the day, and it is dangerous to go to unknown places at night.
Its not possible to do a survey by telephone, as less than 1% of
households have a phone. Nor is it possible to do a mail survey,
because the literacy rate is less than 50%. So surveys in public
places are the only feasible alternative - despite their problems.
16. Checklist of sampling decisions to be made
This checklist
applies to a door-to-door survey, which uses the most complex
sampling. For other types of survey, which do not use clusters, items
5, 6, and 7 do not apply.
1. Decide on the exact area to be surveyed. If possible, get a
map of this area. Also, try to get census data for the area.
2. Will there be one questionnaire per person, or one per
household? If one per person, what will be the minimum age? (Usually
between 10 and 18.)
3. Decide on the sample size - always a compromise between the
funding available and the need for accuracy. If youre doing the
survey yourself, and its your first one, I suggest 100. If this
later turns out to be too small, youll now be able to do a
second survey, with a larger sample - with your newly gained
experience, youll do it better than the first one. Otherwise, I
recommend a sample size of about 300. This is on the small side, but
will usually provide detailed enough information.
4. Decide how the sampling will be done. If a population list is
available, use it. Otherwise, find the method which best gives
everybody in the surveyed population the same chance of being
interviewed.
5. Decide on the cluster size. Suggestion: between 4 and 20. A
size of 8 to 10 usually works well. At the same time, decide the
number of clusters. If you are interviewing one person per household,
the sample size is the cluster size times the number of clusters.
6. Can you sample respondents directly, or will you have to use
another sampling method within each district? If the latter, each
district will have to be visited before interviewing, to draw a local
sample.
7. Decide on the route interviewers will take from the starting
address - e.g "always turn left, and skip two households after
each interview".
8. Decide how many people per household to interview: 1 per
household, or 2 in larger households, or every adult.
9. Decide which method you will use to choose the respondents
within households: last-birthday, Kish grid, quota, or everybody.
10. Decide on your substitution policy: if some people refuse to
be surveyed, will they be replaced? By somebody in the same household,
or by adding another household to the end of the cluster, or what?
Conclusion: is sample design really necessary?
"Is it
really worthwhile to go to all this trouble, just to get a
sample?" you may wonder. "Why not just interview
anybody?" Occasionally, an informal method of sampling will give
reasonably accurate answers. The problem is that if you do a survey
that takes such shortcuts, you will never know how inaccurate your
findings are.
Market research companies, by repeated testing and comparison of
results from various surveys, may have found they can get away with
statistically imperfect sampling, but its harder for
inexperienced researchers to justify such shortcuts.
If you are doing a survey whose results are likely to encounter
some opposition, people who do not like the results may challenge the
surveys validity. If you can demonstrate that the sample was
drawn by correct probability methods, the surveys results are
more likely to withstand scrutiny.
Even if you intend to use the results only for your own purposes,
there is little point in doing a survey unless the results are as
accurate as possible.
|