Reliability (internal and external; test-retest, inter-observer; how to assess and improve it) and validity (internal and external; face, concurrent, ecological, temporal and population validity; demand characteristics and investigator effects; how to assess and improve it).

An Eduqas A-Level Psychology answer to reliability and validity in Component 2. Covers internal and external reliability, test-retest and inter-observer reliability, internal and external validity, face, concurrent, ecological, temporal and population validity, demand characteristics and investigator effects, and how to assess and improve each.

Generated by Claude Opus 4.814 min answerUpdated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Quick answer

Reliability is consistency (the same results each time) and validity is accuracy (measuring what is claimed). Internal reliability is within a measure, external reliability is over time; both are assessed by test-retest and inter-observer reliability (about $+0.8$ is good) and improved by standardisation, clear operationalised categories and observer training. Internal validity is whether the IV caused the DV change; external validity (ecological, temporal, population) is generalisability, with face and concurrent validity as checks. Threats are demand characteristics and investigator effects, reduced by single- and double-blind procedures, covert or naturalistic observation, and realistic tasks. A measure can be reliable without being valid, but not valid without being reliable.

Jump to a section

What this dot point is asking
The answer
Examples in context
Try this

What this dot point is asking

Component 2 requires you to understand reliability and validity: what each means, the types of each, the threats (demand characteristics, investigator effects), and how to assess and improve them.

The answer

Reliability

Validity

Improving validity

Control confounds (standardisation, single-blind and double-blind procedures), use covert or naturalistic observation to reduce demand characteristics, use realistic tasks and settings for ecological validity, and check the measure against an established one (concurrent validity).

Examples in context

Example 1. Inter-observer reliability in practice. In an observation of aggression, two observers independently tally the same behavioural categories; if their tallies correlate at about $+0.8$ or above, the observation is reliable. This shows how reliability is quantified and improved by clear categories and training.

Example 2. Ecological validity and lab tasks. Memorising word lists in a lab (as in Bartlett's contemporaries) may not reflect everyday memory, lowering ecological validity. Using realistic material and settings raises it, illustrating the trade-off between control and real-world relevance.

Try this

Q1. Define reliability and validity in one sentence each. [2 marks]

Cue. Reliability is the consistency of a measure (the same results each time); validity is the accuracy of a measure (it measures what it claims to).

Q2. Explain how inter-observer reliability is assessed. [2 marks]

Cue. Two or more observers independently record the same behaviour using the same categories, and their results are correlated; a coefficient of about $+0.8$ or above indicates good reliability.

Q3. Explain what demand characteristics are and how to reduce them. [2 marks]

Cue. Demand characteristics are cues that let participants guess the aim and change their behaviour; they can be reduced by single-blind procedures, cover stories or covert observation.

Exam-style practice questions

Practice questions written in the style of WJEC Eduqas exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Eduqas 20196 marksExplain the difference between reliability and validity, using an example. [6 marks]

Show worked answer →

A knowledge item (AO1) with an example.

Reliability is consistency: whether a measure produces the same results each time it is used under the same conditions. Validity is accuracy: whether a measure actually measures what it claims to measure.

Example: a set of scales that always reads 2 kg too heavy is reliable (consistent every time) but not valid (it does not give the true weight). A psychological test could give the same score on two occasions (reliable) but still fail to measure the trait it claims to (low validity). A measure can be reliable without being valid, but to be valid it must first be reliable.

Markers reward clear definitions of consistency versus accuracy and an example showing reliability without validity.

Eduqas 20218 marksExplain how the reliability and the validity of an observation could be improved. [8 marks]

Show worked answer →

An application item (AO2/AO3).

Reliability: use clear, operationalised behavioural categories so observers record the same events; train observers; use two or more observers and check inter-observer reliability (a correlation of about $+0.8$ or above indicates good agreement); standardise the procedure.

Validity: use covert observation so participants behave naturally (reducing demand characteristics); observe in a naturalistic setting to raise ecological validity; ensure the behavioural categories genuinely capture the target behaviour (face/content validity); and check against another measure (concurrent validity).

Markers reward practical, named techniques for each (inter-observer reliability and operationalised categories for reliability; covert/naturalistic observation and valid categories for validity).

Related dot points

Sources & how we know this

Eduqas GCE A Level in Psychology (A290) specification — Eduqas (2015)