What kinds of data are there and why does the type matter?
Qualitative and quantitative data, discrete and continuous data, primary and secondary data, and categorical and ranked data.
A focused answer to AQA GCSE Statistics on types of data, covering qualitative and quantitative, discrete and continuous, primary and secondary, categorical and ranked data, and why the type controls which diagrams and calculations you can use.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
AQA wants you to classify data correctly: qualitative or quantitative, discrete or continuous, primary or secondary, and categorical or ranked. The data type decides which averages, diagrams and techniques are valid later, so getting it right matters: you cannot draw a histogram of car colours, and the mean is meaningless for ranked finishing positions.
Qualitative and quantitative data
This is the first split you make, because it limits everything afterwards. Qualitative data can only use the mode as an average and bar charts or pie charts as diagrams. Quantitative data can use the mean, median and mode, and a wider range of diagrams.
Discrete and continuous data
A useful test: if data comes from counting it is usually discrete; if it comes from measuring it is usually continuous. Continuous data can always be measured more precisely (a time of s could be s with a better stopwatch), which is why it is grouped into classes and drawn as histograms rather than bar charts. The distinction matters for choosing diagrams later in the course: discrete data uses bar charts with gaps, continuous data uses histograms with no gaps.
Primary and secondary data
Primary data fits your question exactly and you control its quality, but it is slow and expensive to collect. Secondary data is fast, cheap and often very large scale, but it may not match your question precisely, and it could be out of date or collected with bias you cannot see. Exam questions often ask you to weigh these trade-offs in context.
Categorical and ranked data
Categorical data sorts items into named groups with no natural order, such as nationality or eye colour. Ranked (ordinal) data places items in order of position, such as finishing places in a race or a satisfaction scale from "poor" to "excellent", where the order matters but the gaps between positions may not be equal. Because the gaps are uneven, you can find the median of ranked data but not a meaningful mean, and ranked data is exactly what Spearman's rank correlation coefficient works on later in the course.
Exam-style practice questions
Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AQA 20184 marksFor each of the following, state whether the data is qualitative or quantitative, and if quantitative whether it is discrete or continuous: (a) the colours of cars in a car park; (b) the number of passengers on a bus; (c) the time taken to run m; (d) shoe sizes sold in a shop.Show worked answer →
(a) Qualitative (a described quality, no number).
(b) Quantitative and discrete (counted, whole numbers only).
(c) Quantitative and continuous (measured, any value in a range).
(d) Quantitative and discrete (shoe sizes are separate listed values).
Markers reward the qualitative/quantitative split first, then discrete (counted) versus continuous (measured) for the numerical items. The classic trap is calling shoe size continuous: it is discrete because only set sizes exist.
AQA 20213 marksA student uses figures from a published government report instead of collecting their own. (a) State whether this is primary or secondary data. (b) Give one advantage and one disadvantage of using it.Show worked answer →
(a) Secondary data, because it was collected by someone else (the government), not first-hand by the student.
(b) Advantage: it is quick, cheap and large scale. Disadvantage: it may not exactly fit the student's question, and could be out of date or biased.
Markers reward identifying secondary data and one valid advantage plus one valid disadvantage, in context.
Related dot points
- The statistical enquiry cycle, hypotheses, the stages of an investigation, and types of statistical problem.
A focused answer to AQA GCSE Statistics on the statistical enquiry cycle, covering the stages of an investigation, writing a hypothesis, the role of pilot studies, and how the cycle structures a real statistical problem.
- Populations, sampling frames, census versus sample, random, systematic, stratified, quota and cluster sampling.
A focused answer to AQA GCSE Statistics on sampling methods, covering populations and sampling frames, census versus sample, and how random, systematic, stratified, quota and cluster sampling work, with the stratified sample calculation.
- Data collection sheets, tally charts, questionnaires, open and closed questions, response boxes and avoiding leading or biased questions.
A focused answer to AQA GCSE Statistics on collecting data, covering data collection sheets and tally charts, open and closed questions, designing non-overlapping response boxes, and how to spot and fix leading or biased questions.
- Frequency tables, grouped frequency tables, two-way tables, pictograms, bar charts and pie charts.
A focused answer to AQA GCSE Statistics on organising data, covering frequency and grouped frequency tables, two-way tables, pictograms, bar charts and pie charts, including how to calculate pie chart angles.
- Histograms with equal and unequal class widths, frequency density, frequency polygons and population pyramids.
A focused answer to AQA GCSE Statistics on histograms, covering equal and unequal class widths, frequency density, reading frequencies as areas, frequency polygons and population pyramids.
Sources & how we know this
- AQA GCSE Statistics (8382) specification — AQA (2017)