How do psychologists summarise data and decide whether a result is statistically significant?
Descriptive statistics (central tendency, dispersion, graphs) and inferential statistics: choosing and interpreting the sign test, Mann-Whitney U, Wilcoxon, Spearman's rho and chi-square.
An OCR A-Level Psychology answer to statistics, covering mean, median, mode, range and standard deviation, choosing the correct inferential test (sign test, Mann-Whitney, Wilcoxon, Spearman, chi-square) from level of measurement and design, significance, critical values and Type 1 and Type 2 errors for Component 1.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
OCR Component 1 includes the maths of psychology: summarising data with descriptive statistics, then testing whether a result is unlikely to be due to chance with an inferential test chosen from the OCR list. At least 10 per cent of the marks across the course assess these skills, so the test-choice decision and the significance rule must be automatic.
The answer
Descriptive statistics
Graphs display data: bar charts for categories, histograms for continuous data, scattergrams for correlations, and a normal distribution curve when data are symmetrically spread around the mean (mean, median and mode coincide).
Choosing an inferential test
Significance and errors
Examples in context
Example 1. Reading a chi-square () result. A study categorises participants by whether they were primed (yes or no) and whether they recalled a target (yes or no). These are nominal frequency counts in an unrelated design, so chi-square is correct. With a contingency table the degrees of freedom are , and the critical value at is . If the calculated , then because the result is significant: there is a significant association between priming and recall, with less than a 5 per cent probability it is due to chance.
Example 2. Interpreting Spearman's rho. A researcher measures revision hours and exam mark for 20 students and finds Spearman's (). Spearman is correct because the data are ordinal and the aim is a relationship, not a difference. The coefficient is a strong positive correlation (more revision is associated with higher marks), and significance means the relationship is unlikely to be chance. It does not prove revision causes higher marks, because a third variable such as motivation could drive both, so causation cannot be claimed from a correlation.
Try this
Q1. Name and justify the inferential test for ordinal data from a repeated measures design testing a difference. [3 marks]
- Cue. Wilcoxon, because the data are ordinal, the design is related (repeated measures) and the aim is a difference.
Q2. Explain what is meant by a Type 1 error. [2 marks]
- Cue. A false positive: rejecting the null hypothesis and accepting a difference or relationship that is not actually real.
Q3. A researcher uses rather than . Explain one effect of this choice. [3 marks]
- Cue. It makes a Type 1 error less likely but a Type 2 error more likely, because the stricter level demands stronger evidence before a result is called significant.
Exam-style practice questions
Practice questions written in the style of OCR exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
OCR 20196 marksA researcher compares ratings (treated as ordinal) from two separate groups to test for a difference. Identify the appropriate inferential test, justify the choice, and explain how to decide whether the result is significant. [6 marks]Show worked answer →
A test-choice and significance item (AO2 and AO3).
Test: Mann-Whitney U. Justification: the design is unrelated (two separate, independent groups), the data are ordinal, and the aim is to test a difference. Mann-Whitney is the OCR test of difference for an unrelated design with ordinal data (Wilcoxon would be wrong because it is for related designs).
Deciding significance: compare the calculated U value with the critical value from the statistical table for the relevant values and significance level. For Mann-Whitney the result is significant if the calculated U is equal to or less than the critical value at . If so, reject the null hypothesis; if not, retain it.
Markers reward the correct named test, a justification referring to ordinal data and an unrelated design, and the correct rule (calculated equal to or less than critical for Mann-Whitney).
OCR 20226 marksEight participants' recall scores were . Calculate the mean and the range, and explain one reason a researcher might report the standard deviation rather than the range. [6 marks]Show worked answer →
A full worked calculation (AO2) plus a reasoned point (AO3).
Mean: .
Range: highest minus lowest (some specifications add for the inclusive range, giving ; either is creditworthy if stated).
Reason to prefer standard deviation: the range uses only the two most extreme scores, so a single outlier distorts it, whereas the standard deviation uses every score and so gives a more sensitive, representative measure of how spread out the data are around the mean.
Markers reward the correct mean (), a correct range ( or with method shown), and a valid reason that the standard deviation uses all the data and is less distorted by extremes.
Related dot points
- Research methods and techniques: experiments, self-report, observation and correlation; variables and operationalisation; experimental designs; hypotheses; and the strengths and weaknesses of each method.
An OCR A-Level Psychology answer to research methods and design, covering laboratory, field and quasi-experiments, self-report, observation and correlation, independent and dependent variables, operationalisation, experimental designs, directional and non-directional hypotheses, and the strengths and weaknesses of each technique for Component 1.
- Sampling methods, ethical considerations, reliability and validity, levels of measurement, and recording, analysing and presenting data.
An OCR A-Level Psychology answer to sampling and data handling, covering random, stratified, systematic, opportunity and self-selected sampling, BPS ethics, reliability and validity, levels of measurement, and how to record, analyse and present qualitative and quantitative data for Component 1.
- Planning and conducting research, report writing and sections of a report, peer review, the features of science, and evaluating research for reliability, validity and ethics.
An OCR A-Level Psychology answer to conducting and reporting research, covering the sections of a psychological report, the use of peer review and replication, the features of science (objectivity, falsifiability, paradigms), pilot studies, and evaluating studies for reliability, validity and ethics for Component 1.
- Classic study: Loftus and Palmer (1974), Reconstruction of automobile destruction. Aim, method, results and conclusions, evaluation, and links to the cognitive area and eyewitness memory.
An OCR A-Level Psychology answer to the classic cognitive study, Loftus and Palmer (1974) on the reconstruction of automobile destruction. Covers the aim, two laboratory experiments, the leading-question and broken-glass findings, reconstructive memory, evaluation, and links to the cognitive area and eyewitness testimony.
- Contemporary study: Casey et al. (2011), Behavioral and neural correlates of delay of gratification. Aim, method, results and conclusions, evaluation, and links to the biological area and Sperry.
An OCR A-Level Psychology answer to the contemporary biological study, Casey et al. (2011) on the neural correlates of delay of gratification. Covers the aim, the longitudinal follow-up of Mischel's marshmallow participants, the go/no-go task and fMRI, the prefrontal cortex and ventral striatum findings, evaluation, and links to Sperry and the biological area.