How does the statistical enquiry cycle turn a question into a sound conclusion?
The statistical enquiry cycle: planning a hypothesis, recognising constraints, collecting, processing, interpreting and evaluating, with proactive strategies to manage problems.
A focused answer to Edexcel GCSE Statistics on the statistical enquiry cycle, covering the five stages, writing a testable hypothesis, recognising constraints such as time, cost and ethics, and planning proactive strategies to handle problems like non-response.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Edexcel frames the whole qualification around the statistical enquiry cycle. For this opening content (codes 1a.01 to 1a.03) you must know the five stages of the cycle, write a clear testable hypothesis, recognise the constraints involved in designing an investigation, and plan proactive strategies to handle problems that arise. Both papers integrate the cycle, so questions can ask you to plan an enquiry from scratch or to slot a single task into the right stage.
The five stages of the cycle
The cycle is drawn as a loop because the final stage feeds back into the first. If evaluation shows the sample was biased or the questionnaire was unclear, you refine the hypothesis or method and run a better investigation. Edexcel expects you to see statistics as this iterative process of testing and refinement, not as a single isolated calculation. A question may give you a task (for example "draw a box plot" or "suggest an improvement") and ask which stage of the cycle it belongs to.
Writing a hypothesis
A hypothesis is a clear, testable statement that predicts a relationship and can be supported or rejected by data. Edexcel does not require the formal language of a null hypothesis at GCSE; a plain statement is enough. Good and poor examples:
- Good: "Houses with more bedrooms sell for higher prices."
- Good: "As motorcycles get older, their value tends to fall."
- Poor: "I will look at house prices." (an aim, not a testable statement)
- Poor: "Bigger is better." (too vague to test with data)
The hypothesis controls everything that follows: it tells you which variables to measure and which diagrams and calculations will test it. Identifying the relevant variables (and which is explanatory and which is response) is part of good planning.
Recognising constraints
When you design an investigation you must anticipate the constraints that limit it. Edexcel lists the main ones (code 1a.02):
- Time. A full survey may take too long, so you sample or use existing data.
- Cost. Collecting or buying data can be expensive.
- Ethical issues. Some questions intrude on people or could cause harm, so they need care or must be avoided.
- Confidentiality. Personal data (salaries, medical records) may not be available or shareable.
- Convenience. Some populations are hard to reach, so a complete frame may be impossible.
A strong answer names a constraint and links it to the specific context in the question, rather than listing them generically.
Proactive strategies for problems
Code 1a.03 asks you to plan proactive strategies to mitigate issues before they derail the enquiry. Typical problems and responses:
- Hard to identify the population. Define it carefully and choose a sampling frame that matches it as closely as possible.
- Non-response. Plan to collect extra data, send reminders, or replace missing units, so the achieved sample is still large enough.
- Unexpected outcomes or anomalies. Decide in advance how you will treat outliers and how you will clean the data.
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 1ST0 20194 marksAisha thinks that newer cars are worth more money. She plans to investigate this using data on second-hand cars. (a) Write a suitable hypothesis for Aisha's investigation. (b) State two constraints Aisha might face when collecting her data, and for each give one way she could deal with it.Show worked answer →
(a) A suitable hypothesis is testable and states a relationship, for example: "Newer cars have a higher value than older cars." A vague aim such as "find out about cars" earns no mark.
(b) Any two constraints with a sensible mitigation, for example:
Time: collecting a large sample takes a long time, so she could use secondary data from a car-sales website.
Cost: buying data may be expensive, so she could use a free published source and acknowledge it.
Convenience or non-response: some listings omit the age or price, so she could plan to collect extra cars to replace incomplete records.
Markers reward a testable hypothesis and two distinct constraint plus mitigation pairs.
Edexcel 1ST0 20213 marksWrite the five stages of the statistical enquiry cycle in the correct order, and explain why the cycle is drawn as a loop.Show worked answer →
The five stages in order are: plan (including writing a hypothesis), collect data, process and represent the data, interpret and discuss the results, then evaluate.
It is drawn as a loop because evaluation feeds back into planning: weaknesses found at the end (a biased sample, an unclear question) lead to a refined hypothesis and a better second investigation.
Markers reward the five stages in the right order (2 marks) and the idea that evaluation feeds back into planning (1 mark).
Related dot points
- Types of data: raw, quantitative, qualitative, categorical, ordinal, discrete, continuous, ungrouped, grouped, bivariate and multivariate; primary versus secondary; explanatory and response variables; grouping into class intervals.
A focused answer to Edexcel GCSE Statistics on types of data, covering quantitative versus qualitative, categorical and ordinal, discrete and continuous, grouped and bivariate data, primary versus secondary sources, explanatory and response variables, and the effect of grouping into class intervals.
- Population, sampling frame and sample; simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling; selecting random members; calculating strata sizes.
A focused answer to Edexcel GCSE Statistics on sampling, covering population, sampling frame and sample, simple random, systematic, stratified, quota, cluster, judgement and opportunity sampling, selecting random members electronically, and calculating stratified sample sizes.
- Sources of data, reliability and validity, designing questionnaires and data collection sheets, open and closed questions, leading questions, pilots, and cleaning data before processing.
A focused answer to Edexcel GCSE Statistics on collecting data and designing questionnaires, covering data sources, reliability and validity, open and closed questions, designing non-overlapping response boxes, spotting leading questions, pilots, and cleaning data before processing.
- Identifying and controlling extraneous variables, control groups and matched pairs, sources of bias, sensitivity of content, and the random response technique for sensitive questions.
A focused answer to Edexcel GCSE Statistics on controlling variables and bias, covering explanatory and extraneous variables, control groups and matched pairs, sources of bias, sensitivity of content, and the random response technique for sensitive questions at Higher tier.
Sources & how we know this
- Pearson Edexcel GCSE (9-1) Statistics (1ST0) specification — Pearson Edexcel (2017)