Edexcel GCSE Statistics The collection of data: the enquiry cycle, data types, sampling, questionnaires and bias
A deep-dive Edexcel GCSE Statistics guide to The collection of data (Topic 1 of 1ST0). Covers the statistical enquiry cycle, types of data, sampling methods, questionnaire design and controlling variables and bias, with the calculations and exam patterns Edexcel repeats.
Reviewed by: AI editorial process; not yet individually human-reviewed
Jump to a section
What this topic demands
The collection of data is where every investigation begins. Edexcel tests whether you can plan an enquiry, classify data, choose a fair sample, design clear questions and recognise bias. Get these right and the rest of the course builds on solid foundations; get them wrong and every later calculation is suspect. Because both papers integrate the statistical enquiry cycle, this topic is examined throughout, not just in its own questions.
This guide walks through the five areas in specification order, then sets out the exam patterns Edexcel repeats. Each area has a matching dot-point page with practice questions; this overview ties them together.
The statistical enquiry cycle
The topic opens with the statistical enquiry cycle: plan and write a hypothesis, collect data, process and represent it, interpret and discuss, then evaluate and refine. It is a loop because evaluation feeds back into planning. A clear, testable hypothesis, recognising constraints (time, cost, ethics, confidentiality, convenience), and planning proactive strategies for problems such as non-response are central exam ideas.
Types of data
Types of data covers quantitative versus qualitative, discrete versus continuous, categorical versus ordinal, primary versus secondary, and grouped and bivariate (and, at Higher, multivariate) data. The data type controls which diagrams and averages are valid, so classification is a recurring first step. You also need to use the terms explanatory and response variable, and to explain the trade-off in grouping data into class intervals (easier display, loss of accuracy).
Sampling methods
Sampling methods covers populations and sampling frames, census versus sample, and the simple random, systematic, stratified, quota, cluster, judgement and opportunity methods. The stratified sample calculation, , is examined almost every series, and you must also describe how random members are selected, handling repeats and out-of-range numbers.
Collecting data and questionnaires
Collecting data and questionnaires covers data sources, reliability and validity, designing tally charts and data collection sheets, open and closed questions, designing non-overlapping response boxes, spotting leading or biased questions, pilots, and cleaning data before processing. Rewriting a faulty question or set of boxes, and identifying a value to clean, are classic exam tasks.
Controlling variables and bias
Controlling variables and bias covers explanatory, response and extraneous variables, control groups and matched pairs, the sources of bias, the sensitivity of content, and the random response technique. A fair test controls everything except the variable being investigated.
How this topic is examined
A typical Edexcel profile for this topic:
- The enquiry cycle. Writing a hypothesis, recognising constraints, and slotting tasks into the right stage.
- Data types. Classifying data and justifying the choice in context.
- Sampling. Describing methods and calculating a stratified sample.
- Questionnaires. Improving leading questions and faulty response boxes.
- Bias. Identifying sources of bias and how to reduce them, including for sensitive questions.
Check your knowledge
A mix of recall and calculation questions covering this topic. Attempt them under timed conditions, then check against the solutions.
- State the five stages of the statistical enquiry cycle. (2 marks)
- Is the number of pets a person owns discrete or continuous? (1 mark)
- A college of has in Year 12. In a stratified sample of , how many Year 12 students should be chosen? (2 marks)
- Rewrite the response boxes " to " and " to " so they do not overlap. (2 marks)
- Give one source of bias in a survey. (1 mark)
- State one advantage of primary data over secondary data. (1 mark)
- Name the sampling method that takes every th item from a list after a random start. (1 mark)
- Name the technique used to get honest answers to a sensitive question. (1 mark)
Sources & how we know this
- Pearson Edexcel GCSE (9-1) Statistics (1ST0) specification — Pearson Edexcel (2017)