How do you identify types of data, design a sample, and recognise bias in data collection?
Identify types of data (qualitative and quantitative, discrete and continuous); understand populations and samples; use random and stratified sampling; and recognise sources of bias.
A focused answer to the Eduqas GCSE Mathematics statistics content on sampling and data, covering qualitative and quantitative data, discrete and continuous data, populations and samples, random and stratified sampling, and sources of bias.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
The Eduqas statistics content begins with data and sampling: identifying the type of data (qualitative or quantitative, discrete or continuous), understanding the difference between a population and a sample, using random and stratified sampling, and recognising sources of bias. This is the foundation of the statistical handling cycle, and Eduqas tests it through both calculation (stratified sample sizes) and reasoning (why a sample is biased, how to improve it). It appears at both tiers, with the stratified-sample calculation and the bias-explanation being reliable questions.
Types of data
Data is classified first as qualitative or quantitative, and quantitative data is then discrete or continuous.
So shoe size is discrete, height is continuous, and team supported is qualitative. The distinction matters because continuous data is grouped into class intervals and shown with histograms, while discrete data can be shown with bar charts, so identifying the type points to the right chart.
Populations and samples
It is rarely practical to collect data from a whole population, so a sample stands in for it.
Sampling saves time and cost, but only works if the sample reflects the population. A census surveys the whole population, which is accurate but expensive, so sampling is the usual compromise.
Random and stratified sampling
Two named sampling methods appear in the specification.
Stratified sampling is the one most often calculated: find the sampling fraction , then multiply each group's size by it.
Recognising bias
Bias occurs when a sampling method systematically over- or under-represents part of the population.
A survey about reading habits conducted in a library will be biased because library users read more than the general population, so the result is unrepresentative. Common causes are sampling from the wrong place (gym-goers for exercise), self-selection (only motivated people respond), and leading questions. To improve a biased sample, choose a method that gives a fair cross-section of the whole population. Because Eduqas weights reasoning heavily, a strong answer explains precisely why the group is unrepresentative and what the effect on the results would be.
Exam-style practice questions
Practice questions written in the style of WJEC Eduqas exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Eduqas 20183 marksA school has 600 students. A sample of 50 is to be taken, stratified by year group. There are 120 students in Year 7. How many Year 7 students should be in the sample? (Foundation, Component 2, calculator.)Show worked answer →
Stratified sampling takes the same fraction from each group.
The sampling fraction is .
Year 7 contribution: students.
Markers award a mark for the sampling fraction, a mark for the method, and a mark for the answer 10. Taking an equal number from each year (ignoring the different group sizes) is the standard error.
Eduqas 20223 marksA researcher surveys people leaving a gym about how much they exercise. Explain why this sample is likely to be biased, and suggest one improvement. (Higher, Component 2, calculator.)Show worked answer →
People leaving a gym already exercise more than average, so the sample is not representative of the whole population. This is sampling bias.
The result would overestimate how much the population exercises.
An improvement would be to sample from the general public, for example a random selection from the local population, rather than only gym-goers.
Markers give marks for identifying that gym-goers are unrepresentative, for explaining the resulting bias, and for a sensible improvement. A vague answer that does not explain why the group is unrepresentative loses marks.
Related dot points
- Calculate the mean, median, mode and range; find the mean from a frequency table and an estimated mean from grouped data; and compare distributions using an average and the range (and quartiles at Higher tier).
A focused answer to the Eduqas GCSE Mathematics statistics content on averages and spread, covering the mean median mode and range, the mean from a frequency table, the estimated mean from grouped data, and comparing distributions with quartiles at Higher tier.
- Draw and interpret statistical charts including bar charts, pie charts, frequency polygons, stem-and-leaf diagrams, box plots and histograms with unequal class widths (histograms at Higher tier).
A focused answer to the Eduqas GCSE Mathematics statistics content on charts and graphs, covering bar charts, pie charts, frequency polygons, stem-and-leaf diagrams, box plots, and histograms with unequal class widths and frequency density at Higher tier.
- Plot and interpret scatter graphs; describe correlation; draw a line of best fit; use it to estimate values; and understand interpolation, extrapolation and the difference between correlation and causation.
A focused answer to the Eduqas GCSE Mathematics statistics content on scatter graphs and correlation, covering plotting and interpreting scatter graphs, describing correlation, the line of best fit, interpolation and extrapolation, and correlation versus causation.
- Use Venn diagrams and set notation (union, intersection and complement) to represent and count outcomes and to calculate probabilities, including conditional probability (Higher tier).
A focused answer to the Eduqas GCSE Mathematics probability content on Venn diagrams and set notation, covering union, intersection and complement, counting outcomes, calculating probabilities, and conditional probability at Higher tier.
- Carry out the four operations with fractions; convert between fractions, decimals and percentages (including recurring decimals to fractions at Higher tier); and find a percentage of an amount and one quantity as a percentage of another.
A focused answer to the Eduqas GCSE Mathematics number content on fractions, decimals and percentages, covering the four operations with fractions, converting between the three forms, recurring decimals to fractions, and percentage calculations.
Sources & how we know this
- WJEC Eduqas GCSE (9-1) Mathematics specification (C300) — WJEC Eduqas (2015)