How do you choose a sample, design data collection and classify data so that conclusions are reliable?
Understand populations and samples, use random and other sampling methods and recognise bias, design data collection including questionnaires, and classify data as qualitative or quantitative and discrete or continuous.
A focused answer to the WJEC GCSE Mathematics statistics content on sampling and data, covering populations and samples, random and other sampling methods, sources of bias, designing questionnaires and classifying data as qualitative or quantitative and discrete or continuous.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
This is the data-handling foundation of WJEC statistics. You are asked to understand the difference between a population and a sample, to choose a sample using random and other methods while avoiding bias, to design data collection (especially questionnaires), and to classify data correctly. Statistics begins with how data is gathered, because a poorly chosen sample or a leading question makes every later calculation worthless. The exam rewards clear descriptions of sampling methods and sharp criticism of flawed survey questions, both worded-answer skills.
Populations and samples
Studying a whole population is often impractical, so we sample.
So to find the average height of students in a school of , you might measure a sample of rather than all , provided those fairly represent the whole school.
Sampling methods and bias
How the sample is chosen decides whether it is fair.
The exam often gives a flawed sampling plan (surveying shoppers on a weekday morning, say) and asks why it is biased. The answer names the group that is over- or under-represented.
Designing data collection
Questionnaires must collect data fairly and clearly.
A good survey question is clear (no ambiguity), neutral (not leading the respondent to an answer), and offers response options that do not overlap and cover all cases. For example, age groups " to ", " to " overlap at ; better are " to ", " to ". A data collection sheet or tally chart records responses efficiently. Recognising and fixing a leading or loaded question is a recurring two-mark task.
Classifying data
Data falls into types that decide which charts and averages suit it.
Why this matters
Sampling and data is the entry point to the whole statistics strand and is examined as worded reasoning rather than calculation, so it tests AO2 and AO3 directly. The skills (choosing a fair sample, spotting bias, criticising and improving survey questions, classifying data) appear in short, reliable mark-earning questions, and the data type determines whether you later use a bar chart or a histogram, a discrete or a grouped frequency table. Getting the foundation right makes the calculation topics that follow meaningful.
Exam-style practice questions
Practice questions written in the style of WJEC exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
WJEC 20192 marksA school has students. A teacher wants a sample of students that fairly represents the school. Describe one suitable method of choosing the sample. (Foundation and Higher, Unit 2, calculator.)Show worked answer →
A suitable method is simple random sampling: give every student a number from to , then use a random number generator (or random number tables) to pick different numbers, and select those students.
The key idea is that every student has an equal chance of being chosen, which avoids bias.
Markers award a mark for describing a method that gives every student an equal chance, and a mark for the practical detail (numbering the students and selecting at random). Vague answers such as "pick people at random" without a method may not score the second mark.
WJEC 20212 marksA survey question reads: 'Do you agree that our excellent local park should get more funding?' Give one reason why this is a poor question and write an improved version. (Foundation and Higher, Unit 2, calculator.)Show worked answer →
The question is leading (biased): the word "excellent" pushes the respondent towards agreeing, so the results would not be fair.
An improved version is neutral, for example: "Should the local park receive more funding?" with response options "Yes / No / Don't know".
Markers give a mark for identifying the bias (a leading or loaded question) and a mark for a neutral rewrite, ideally with clear, non-overlapping response options. Just saying "it is biased" without explaining why, or without an improvement, loses a mark.
Related dot points
- Calculate and interpret the mean, median, mode and range for lists and frequency tables, estimate the mean and identify the modal class from grouped data, and compare distributions using an average and a measure of spread.
A focused answer to the WJEC GCSE Mathematics statistics content on averages and spread, covering the mean, median, mode and range for lists and frequency tables, estimating the mean and modal class from grouped data, and comparing distributions.
- Construct and interpret bar charts, pictograms, vertical line graphs, pie charts and frequency diagrams, and at Higher tier draw and interpret histograms using frequency density for unequal class widths.
A focused answer to the WJEC GCSE Mathematics statistics content on charts and graphs, covering bar charts, pictograms, vertical line graphs, pie charts and frequency diagrams, and histograms with frequency density for unequal class widths at Higher tier.
- Plot and interpret scatter graphs, describe the type and strength of correlation, draw a line of best fit and use it to estimate values, and understand that correlation does not imply causation.
A focused answer to the WJEC GCSE Mathematics statistics content on scatter graphs, covering plotting bivariate data, describing positive, negative and no correlation, drawing and using a line of best fit for predictions, and the limits of interpolation and extrapolation.
- Construct and interpret a cumulative frequency curve to estimate the median, quartiles and interquartile range, and draw and compare box plots from five-number summaries (Higher tier).
A focused answer to the WJEC GCSE Mathematics statistics content on cumulative frequency and box plots, covering constructing and reading cumulative frequency curves, estimating the median and quartiles, finding the interquartile range, and drawing and comparing box plots at Higher tier.
Sources & how we know this
- WJEC GCSE Mathematics specification (3300) — WJEC (2015)
- WJEC GCSE Mathematics specification PDF (3300) — WJEC (2015)