Skip to main content
EnglandStatisticsSyllabus dot point

How do you choose a fair sample that represents a population?

Populations, sampling frames, census versus sample, random, systematic, stratified, quota and cluster sampling.

A focused answer to AQA GCSE Statistics on sampling methods, covering populations and sampling frames, census versus sample, and how random, systematic, stratified, quota and cluster sampling work, with the stratified sample calculation.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Population, sampling frame, census and sample
  3. Random sampling
  4. Systematic sampling
  5. Stratified sampling
  6. Quota and cluster sampling

What this dot point is asking

AQA wants you to define a population and a sampling frame, contrast a census with a sample, and describe and apply the main sampling methods: random, systematic, stratified, quota and cluster. You must also be able to calculate a stratified sample, which is one of the most common calculation questions in this module.

Population, sampling frame, census and sample

A census is the most accurate way to learn about a population, but it is expensive, slow and sometimes impossible (you cannot test every light bulb to destruction). A sample is faster and cheaper, and if chosen well it represents the population closely, but it introduces sampling error and the risk of bias. The quality of any sample depends on having a good sampling frame: if the frame misses part of the population, no sampling method can fix the resulting bias.

Random sampling

Random sampling is fair and free of selection bias, which makes it the benchmark other methods are judged against. Its drawbacks are that it needs a complete sampling frame and that, by chance, a random sample can still under-represent a group (it might pick mostly boys from a mixed school).

Systematic sampling

In systematic sampling you choose a starting point at random and then take every nnth member of the list. If a population of 200200 needs a sample of 2020, the interval is 20020=10\frac{200}{20} = 10, so you pick a random start between 11 and 1010 and take every 1010th person after it. It is quick and spreads the sample evenly through the list, but it can go wrong if the list has a repeating pattern that matches the interval.

Stratified sampling

Stratified sampling guarantees that each group is represented in proportion to its size, which is why examiners favour it for populations split into obvious groups (year groups, age bands, departments). Within each group the members are then chosen at random.

Quota and cluster sampling

Quota sampling sets a target number to survey in each group (for example 1010 men and 1010 women) and the interviewer fills the quotas; it needs no sampling frame but can be biased by who the interviewer chooses to approach. Cluster sampling divides the population into clusters (for example schools or towns), selects some clusters at random, and surveys everyone in the chosen clusters; it is convenient and cheap when the population is spread out, but the chosen clusters may not represent the whole population.

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20195 marksA school has 540540 students in Years 77 to 1111 as follows: Year 77 (120120), Year 88 (110110), Year 99 (120120), Year 1010 (100100), Year 1111 (9090). A stratified sample of 9090 students is taken. Calculate how many students should be sampled from each year group.
Show worked answer →

Sampling fraction =90540=16= \frac{90}{540} = \frac{1}{6}, so take one sixth of each year.

Year 77: 16×120=20\frac{1}{6} \times 120 = 20. Year 88: 16×11018.318\frac{1}{6} \times 110 \approx 18.3 \to 18. Year 99: 16×120=20\frac{1}{6} \times 120 = 20. Year 1010: 16×10016.717\frac{1}{6} \times 100 \approx 16.7 \to 17. Year 1111: 16×90=15\frac{1}{6} \times 90 = 15.

Check: 20+18+20+17+15=9020 + 18 + 20 + 17 + 15 = 90.

Markers reward the sampling fraction, each group calculation, sensible rounding, and a total that matches the required sample size of 9090.

AQA 20223 marksExplain the difference between quota sampling and stratified sampling, and give one disadvantage of quota sampling.
Show worked answer →

Stratified sampling splits the population into groups and then selects randomly within each group in proportion to its size, so it needs a sampling frame. Quota sampling just sets target numbers for each group and an interviewer fills the quotas, with no random selection and no frame needed.

Disadvantage of quota sampling: it can be biased, because the interviewer chooses who to approach (for example only confident-looking people).

Markers reward the proportional/random nature of stratified versus the non-random quota filling, plus one valid disadvantage.

Related dot points

Sources & how we know this