
Edexcel GCSE Statistics: The collection of data (the enquiry cycle, data types, sampling, questionnaires and bias)

A deep-dive Edexcel GCSE Statistics guide to the collection of data (Topic 1 of 1ST0). It covers the statistical enquiry cycle, types of data, sampling methods, questionnaire design, and controlling variables and bias, along with the calculations and exam patterns Edexcel repeats.

Generated by Claude Opus 4.814 min read1ST0 Topic 1

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section
  1. What this topic demands
  2. The statistical enquiry cycle
  3. Types of data
  4. Sampling methods
  5. Collecting data and questionnaires
  6. Controlling variables and bias
  7. How this topic is examined
  8. Check your knowledge

What this topic demands

The collection of data is where every investigation begins. Edexcel tests whether you can plan an enquiry, classify data, choose a fair sample, design clear questions and recognise bias. Get these right and the rest of the course builds on solid foundations; get them wrong and every later calculation is suspect. Because both papers integrate the statistical enquiry cycle, this topic is examined throughout, not just in its own questions.

This guide walks through the five areas in specification order, then sets out the exam patterns Edexcel repeats. Each area has a matching dot-point page with practice questions; this overview ties them together.

The statistical enquiry cycle

The topic opens with the statistical enquiry cycle: plan and write a hypothesis, collect data, process and represent it, interpret and discuss, then evaluate and refine. It is a loop because evaluation feeds back into planning. The central exam ideas are writing a clear, testable hypothesis, recognising constraints (time, cost, ethics, confidentiality, convenience), and planning in advance how to deal with problems such as non-response.

Types of data

Types of data covers quantitative versus qualitative, discrete versus continuous, categorical versus ordinal, primary versus secondary, and grouped and bivariate (and, at Higher, multivariate) data. The data type controls which diagrams and averages are valid, so classification is a recurring first step. You also need to use the terms explanatory and response variable, and to explain the trade-off in grouping data into class intervals (easier display, loss of accuracy).

Sampling methods

Sampling methods covers populations and sampling frames, census versus sample, and the simple random, systematic, stratified, quota, cluster, judgement and opportunity methods. The stratified sample calculation, $\frac{\text{group size}}{\text{population}} \times \text{sample size}$, is examined almost every series, and you must also describe how random members are selected, handling repeats and out-of-range numbers.
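The stratified calculation and the random selection step can be sketched in a few lines of Python. This is an illustration only, not an exam method: the school, its year-group sizes and the list of random numbers are all made up, and the function names are hypothetical.

```python
def stratified_allocation(strata_sizes, sample_size):
    """Number to sample from each stratum: group size / population * sample size."""
    population = sum(strata_sizes.values())
    # Rounded to whole people; in general the rounded values may need
    # adjusting by hand so that they still add up to the sample size.
    return {
        name: round(size / population * sample_size)
        for name, size in strata_sizes.items()
    }


def select_from_random_numbers(random_numbers, frame_size, how_many):
    """Choose members numbered 1..frame_size from a list of random numbers."""
    chosen = []
    for number in random_numbers:
        if number < 1 or number > frame_size:
            continue  # out of range: ignore and move to the next number
        if number in chosen:
            continue  # repeat: ignore and move to the next number
        chosen.append(number)
        if len(chosen) == how_many:
            break
    return chosen


# Hypothetical school of 900 students, stratified sample of 90.
year_groups = {"Year 9": 300, "Year 10": 250, "Year 11": 350}
allocation = stratified_allocation(year_groups, 90)
print(allocation)  # {'Year 9': 30, 'Year 10': 25, 'Year 11': 35}

# Hypothetical random numbers used to pick 3 of the 300 Year 9 students.
picked = select_from_random_numbers([17, 304, 17, 252, 0, 88], 300, 3)
print(picked)  # [17, 252, 88]
```

In the second call, 304 and 0 are skipped because no student has that number, and the second 17 is skipped because that student is already chosen. A written exam answer needs the same three ideas: number the sampling frame, generate random numbers, and ignore repeats and numbers outside the range.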

Collecting data and questionnaires

Collecting data and questionnaires covers data sources, reliability and validity, designing tally charts and data collection sheets, open and closed questions, designing non-overlapping response boxes, spotting leading or biased questions, pilots, and cleaning data before processing. Rewriting a faulty question or set of boxes, and identifying a value to clean, are classic exam tasks.
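One way to see what "non-overlapping" means is to check a set of response boxes mechanically. The sketch below assumes whole-number answers (for example, a count of visits) and boxes written as inclusive ranges; the function name and the example boxes are hypothetical, and open-ended boxes such as "more than 8" are not modelled.

```python
def check_response_boxes(boxes):
    """Return a list of problems for (low, high) inclusive whole-number boxes."""
    problems = []
    ordered = sorted(boxes)
    # Compare each box with the next one up.
    for (low_a, high_a), (low_b, high_b) in zip(ordered, ordered[1:]):
        if low_b <= high_a:
            # The start of the second box also falls inside the first box.
            problems.append(
                f"overlap: '{low_a} to {high_a}' and '{low_b} to {high_b}' "
                f"both include {low_b}"
            )
        elif low_b > high_a + 1:
            # At least one whole number is not covered by any box.
            problems.append(f"gap: nothing covers {high_a + 1} to {low_b - 1}")
    return problems


# Hypothetical faulty boxes: 2 appears twice and 5 appears nowhere.
for problem in check_response_boxes([(0, 2), (2, 4), (6, 8)]):
    print(problem)

# Hypothetical corrected boxes: every value fits exactly one box.
print(check_response_boxes([(0, 2), (3, 5), (6, 8)]))  # []
```

The same two checks apply when you rewrite boxes by hand: no value should fit two boxes, and no possible answer should be left without a box.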

Controlling variables and bias

Controlling variables and bias covers explanatory, response and extraneous variables, control groups and matched pairs, the sources of bias, the sensitivity of content, and the random response technique. A fair test controls everything except the variable being investigated.
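The arithmetic behind the random response technique can be sketched as follows. This assumes one common coin-flip version, in which each respondent secretly flips a fair coin, answers "yes" if it shows heads, and answers the sensitive question truthfully if it shows tails; other versions of the technique exist. The function name and the survey figures are hypothetical.

```python
def estimate_true_yes_proportion(total_respondents, yes_answers):
    """Estimate the proportion who would truthfully answer yes.

    Assumes a fair coin: heads means a forced "yes",
    tails means a truthful answer.
    """
    forced_yes = total_respondents / 2            # expected number of heads
    truthful_respondents = total_respondents / 2  # expected number of tails
    truthful_yes = yes_answers - forced_yes       # "yes" answers left over
    return truthful_yes / truthful_respondents


# Hypothetical survey: 200 people respond and 130 say "yes".
# About 100 are forced yeses, leaving 30 truthful yeses out of about 100.
print(estimate_true_yes_proportion(200, 130))  # 0.3
```

Because nobody can tell whether an individual "yes" was forced or truthful, respondents have less reason to lie, yet the overall proportion can still be estimated.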

How this topic is examined

A typical Edexcel profile for this topic:

  • The enquiry cycle. Writing a hypothesis, recognising constraints, and slotting tasks into the right stage.
  • Data types. Classifying data and justifying the choice in context.
  • Sampling. Describing methods and calculating a stratified sample.
  • Questionnaires. Improving leading questions and faulty response boxes.
  • Bias. Identifying sources of bias and how to reduce them, including for sensitive questions.

Check your knowledge

A mix of recall and calculation questions covering this topic. Attempt them under timed conditions, then check against the solutions.

  1. State the five stages of the statistical enquiry cycle. (2 marks)
  2. Is the number of pets a person owns discrete or continuous? (1 mark)
  3. A college of 1200 has 480 in Year 12. In a stratified sample of 50, how many Year 12 students should be chosen? (2 marks)
  4. Rewrite the response boxes "0 to 5" and "5 to 10" so they do not overlap. (2 marks)
  5. Give one source of bias in a survey. (1 mark)
  6. State one advantage of primary data over secondary data. (1 mark)
  7. Name the sampling method that takes every $n$th item from a list after a random start. (1 mark)
  8. Name the technique used to get honest answers to a sensitive question. (1 mark)
