Skip to main content
EnglandStatisticsSyllabus dot point

What kinds of data are there and why does the type matter?

Qualitative and quantitative data, discrete and continuous data, primary and secondary data, and categorical and ranked data.

A focused answer to AQA GCSE Statistics on types of data, covering qualitative and quantitative, discrete and continuous, primary and secondary, categorical and ranked data, and why the type controls which diagrams and calculations you can use.

Generated by Claude Opus 4.88 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Qualitative and quantitative data
  3. Discrete and continuous data
  4. Primary and secondary data
  5. Categorical and ranked data

What this dot point is asking

AQA wants you to classify data correctly: qualitative or quantitative, discrete or continuous, primary or secondary, and categorical or ranked. The data type decides which averages, diagrams and techniques are valid later, so getting it right matters: you cannot draw a histogram of car colours, and the mean is meaningless for ranked finishing positions.

Qualitative and quantitative data

This is the first split you make, because it limits everything afterwards. Qualitative data can only use the mode as an average and bar charts or pie charts as diagrams. Quantitative data can use the mean, median and mode, and a wider range of diagrams.

Discrete and continuous data

A useful test: if data comes from counting it is usually discrete; if it comes from measuring it is usually continuous. Continuous data can always be measured more precisely (a time of 12.412.4 s could be 12.4312.43 s with a better stopwatch), which is why it is grouped into classes and drawn as histograms rather than bar charts. The distinction matters for choosing diagrams later in the course: discrete data uses bar charts with gaps, continuous data uses histograms with no gaps.

Primary and secondary data

Primary data fits your question exactly and you control its quality, but it is slow and expensive to collect. Secondary data is fast, cheap and often very large scale, but it may not match your question precisely, and it could be out of date or collected with bias you cannot see. Exam questions often ask you to weigh these trade-offs in context.

Categorical and ranked data

Categorical data sorts items into named groups with no natural order, such as nationality or eye colour. Ranked (ordinal) data places items in order of position, such as finishing places in a race or a satisfaction scale from "poor" to "excellent", where the order matters but the gaps between positions may not be equal. Because the gaps are uneven, you can find the median of ranked data but not a meaningful mean, and ranked data is exactly what Spearman's rank correlation coefficient works on later in the course.

Exam-style practice questions

Practice questions written in the style of AQA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AQA 20184 marksFor each of the following, state whether the data is qualitative or quantitative, and if quantitative whether it is discrete or continuous: (a) the colours of cars in a car park; (b) the number of passengers on a bus; (c) the time taken to run 100100 m; (d) shoe sizes sold in a shop.
Show worked answer →

(a) Qualitative (a described quality, no number).

(b) Quantitative and discrete (counted, whole numbers only).

(c) Quantitative and continuous (measured, any value in a range).

(d) Quantitative and discrete (shoe sizes are separate listed values).

Markers reward the qualitative/quantitative split first, then discrete (counted) versus continuous (measured) for the numerical items. The classic trap is calling shoe size continuous: it is discrete because only set sizes exist.

AQA 20213 marksA student uses figures from a published government report instead of collecting their own. (a) State whether this is primary or secondary data. (b) Give one advantage and one disadvantage of using it.
Show worked answer →

(a) Secondary data, because it was collected by someone else (the government), not first-hand by the student.

(b) Advantage: it is quick, cheap and large scale. Disadvantage: it may not exactly fit the student's question, and could be out of date or biased.

Markers reward identifying secondary data and one valid advantage plus one valid disadvantage, in context.

Related dot points

Sources & how we know this