How do you choose a sampling method that represents the population without bias?
Population, sampling frame and sample; simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling; selecting random members; calculating strata sizes.
A focused answer to Edexcel GCSE Statistics on sampling, covering population, sampling frame and sample, simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling, selecting random members electronically, and calculating stratified sample sizes.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Edexcel codes 1c.01 to 1c.06 require you to define a population, sampling frame and sample; to know when and why judgement (purposive) and opportunity (convenience) sampling are used and the bias risks they carry; to understand random, systematic, quota and cluster sampling with their advantages and disadvantages; to describe how random members are selected (including handling repeats and out-of-range numbers); and to use stratification, including calculating strata sizes. The stratified sample calculation is one of the most common calculation questions in the qualification.
Population, sampling frame and sample
Edexcel notes that "population" depends on context: it might mean all employees in an office, all females in the UK, or all items produced in a factory. A census surveys the whole population (accurate but slow and costly); a sample surveys a part (faster and cheaper, but introduces sampling error and the risk of bias). The quality of a sample depends on a good sampling frame: if the frame misses part of the population, no method can remove the resulting bias.
Simple random sampling
You select by numbering the frame and using a random device: dice, cards, random number lists, or calculator and spreadsheet functions. Edexcel expects you to deal with practical issues: ignore repeated random numbers and out-of-range numbers, and keep selecting until you have enough distinct members. Random sampling is free of selection bias, but it needs a complete frame and can, by chance, under-represent a group.
Systematic sampling
In systematic sampling you choose a random starting point and then take every th member of the frame. For a population of and a sample of , the interval is , so you pick a random start between and and take every th member after it. It is quick and spreads the sample through the list, but Edexcel warns that it is generally non-random and can fail if the list has a repeating pattern (period) that coincides with the interval.
Stratified sampling
Stratified sampling guarantees each group is represented in proportion to its size, then selects at random within each group. Edexcel favours it for populations split into obvious strata (year groups, age bands, gender). At Higher tier you may stratify by more than one category.
Quota, cluster, judgement and opportunity sampling
- Quota sampling sets target numbers per group (for example men and women); the interviewer fills the quotas. No frame is needed, but it is non-random and can be biased by who the interviewer approaches.
- Cluster sampling divides the population into clusters (schools, towns), picks some clusters at random, and surveys everyone in them. It is cheap when the population is spread out, but chosen clusters may not represent the whole.
- Judgement (purposive) sampling chooses members the investigator believes are representative. It is quick but relies on opinion and risks bias.
- Opportunity (convenience) sampling takes whoever is easiest to reach (the first 20 people who pass). It is the quickest but the most biased, because it ignores the rest of the population.
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 1ST0 20195 marksA college has students: in Year 12 and in Year 13. Mr Khan wants a stratified sample of students by year group. (a) Calculate how many students he should sample from each year group. (b) Describe how he could then select the Year 12 students for the sample.Show worked answer →
(a) Sampling fraction .
Year 12: . Year 13: .
Check: , matching the required sample size.
(b) Number the Year 12 students from to , then use a random number generator (or random number table) to pick different numbers in that range, ignoring repeats and numbers out of range.
Markers reward the sampling fraction, each correct stratum size, the check, and a valid random selection method.
Edexcel 1ST0 20214 marksExplain the difference between quota sampling and stratified sampling. State one advantage of each method.Show worked answer →
Stratified sampling divides the population into groups and selects from each group at random in proportion to its size, so it needs a sampling frame. Quota sampling sets a target number to interview in each group and an interviewer fills the quotas, with no random selection and no frame needed.
Advantage of stratified: it is random and represents each group proportionally, reducing bias. Advantage of quota: it is quick and cheap, and needs no sampling frame.
Markers reward the proportional/random nature of stratified versus the non-random quota filling, plus one valid advantage of each.
Related dot points
- The statistical enquiry cycle: planning a hypothesis, recognising constraints, collecting, processing, interpreting and evaluating, with proactive strategies to manage problems.
A focused answer to Edexcel GCSE Statistics on the statistical enquiry cycle, covering the five stages, writing a testable hypothesis, recognising constraints such as time, cost and ethics, and planning proactive strategies to handle problems like non-response.
- Types of data: raw, quantitative, qualitative, categorical, ordinal, discrete, continuous, ungrouped, grouped, bivariate and multivariate; primary versus secondary; explanatory and response variables; grouping into class intervals.
A focused answer to Edexcel GCSE Statistics on types of data, covering quantitative versus qualitative, categorical and ordinal, discrete and continuous, grouped and bivariate data, primary versus secondary sources, explanatory and response variables, and the effect of grouping into class intervals.
- Sources of data, reliability and validity, designing questionnaires and data collection sheets, open and closed questions, leading questions, pilots, and cleaning data before processing.
A focused answer to Edexcel GCSE Statistics on collecting data and designing questionnaires, covering data sources, reliability and validity, open and closed questions, designing non-overlapping response boxes, spotting leading questions, pilots, and cleaning data before processing.
- Identifying and controlling extraneous variables, control groups and matched pairs, sources of bias, sensitivity of content, and the random response technique for sensitive questions.
A focused answer to Edexcel GCSE Statistics on controlling variables and bias, covering explanatory and extraneous variables, control groups and matched pairs, sources of bias, sensitivity of content, and the random response technique for sensitive questions at Higher tier.
Sources & how we know this
- Pearson Edexcel GCSE (9-1) Statistics (1ST0) specification — Pearson Edexcel (2017)