Skip to main content
EnglandMathsSyllabus dot point

How do you select a sample that fairly represents a population, and what are the trade-offs of each method?

Populations and samples, the advantages and limitations of sampling, simple random sampling, systematic, stratified, quota and opportunity sampling, and the importance of the large data set.

A focused answer to the AQA A-Level Mathematics sampling content, covering populations and samples, the trade-offs of sampling, simple random, systematic, stratified, quota and opportunity sampling, and the role of the large data set.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Populations and samples
  3. Random methods
  4. Non-random methods
  5. The large data set
  6. Bias and how methods control it

What this dot point is asking

AQA wants you to understand the difference between a population and a sample, judge the advantages and limitations of sampling, describe and apply the main sampling methods, and appreciate why the qualification uses a large data set to ground statistics in real contexts. Sampling questions reward precise vocabulary and clear, replicable descriptions of method.

Populations and samples

A census is fully accurate but expensive, time-consuming and sometimes destructive (you cannot test every match to destruction and still sell them). A sample is cheaper and faster but introduces sampling error. A good sample is representative, large enough to be reliable, and selected by a method that avoids systematic bias.

Random methods

In simple random sampling every member, and indeed every possible sample of the chosen size, has an equal chance of selection. You typically number the population and use random numbers (ignoring repeats). It avoids selection bias but requires a complete, accurate sampling frame, which is not always available.

Systematic sampling lists the population and selects every kkth member after a random start, where kk is the population size divided by the sample size. It is simple to administer but can interact badly with hidden periodic patterns in the list, producing bias if the period matches kk.

Non-random methods

Quota sampling fills set numbers from each group but lets the interviewer choose who fills the quota, which is fast but open to interviewer bias. Opportunity (convenience) sampling uses whoever or whatever is available, which is the easiest method but typically the least representative.

The large data set

AQA provides a large real data set that you should explore during the course, so you can clean data, spot patterns, compute summary statistics, and answer questions set in its realistic context in the exam. Familiarity with its variables, units and quirks (such as missing values) is examined directly.

Bias and how methods control it

Bias is any systematic tendency for a sample to misrepresent the population. It is not the same as the random sampling error that shrinks as the sample grows; bias does not go away with a larger sample if the method itself is flawed. Selection bias arises when some members are more likely to be chosen (an opportunity sample of shoppers in one street), non-response bias when those who reply differ from those who do not, and the periodicity problem in systematic sampling is another form of bias.

Random methods (simple random and, within strata, stratified) control bias because selection does not depend on any characteristic of the member. Non-random methods (quota, opportunity) trade that protection for speed and low cost. When an exam asks you to "comment on" or "criticise" a sampling method, name the specific source of bias and say which group is likely over- or under-represented, rather than writing "it might be biased".

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20196 marksPaper 3, Section A. A college has 12001200 students: 540540 in Year 12 and 660660 in Year 13. A researcher wants a stratified sample of 8080 students. (a) Calculate the number of students to sample from each year group. (b) Describe how the researcher could then select the required number of Year 12 students using simple random sampling. (c) Give one advantage of stratified sampling over simple random sampling in this context.
Show worked answer →

For (a), the sampling fraction is 801200=115\frac{80}{1200} = \frac{1}{15}. Year 12: 5401200×80=36\frac{540}{1200} \times 80 = 36; Year 13: 6601200×80=44\frac{660}{1200} \times 80 = 44. Check: 36+44=8036 + 44 = 80. For (b), number all 540540 Year 12 students 11 to 540540, generate 3636 distinct random numbers in that range (ignoring repeats), and select the matching students. For (c), stratified sampling guarantees both year groups are represented in proportion, reducing the risk that simple random sampling under-represents one year. Markers reward exact proportional calculations that total 8080, a clear random-selection method, and a valid advantage.

AQA 20225 marksPaper 3, Section A. (a) Explain the difference between a census and a sample. (b) A factory manager inspects every 5050th item leaving a production line. State the name of this sampling method and one situation in which it could give a biased sample. (c) State one advantage and one disadvantage of opportunity sampling.
Show worked answer →

For (a), a census collects data from every member of the population, whereas a sample collects data from a subset and uses it to make inferences. For (b), inspecting every 5050th item is systematic sampling; it can be biased if there is a periodic pattern in the production with the same period (for instance if every 5050th item comes from the same faulty mould). For (c), opportunity sampling is quick and cheap to carry out, but it is often unrepresentative and prone to bias because the sample depends on who or what is available. Markers reward precise definitions and a correct named method with a sensible biasing scenario.

Related dot points

Sources & how we know this