How do you analyse spoken and multimodal texts (transcripts, speeches, broadcast, digital) using discourse, pragmatics, prosody and graphology, and read mode as part of the meaning?
Analysing spoken and multimodal texts: reading transcripts and speech-like texts through discourse (turn-taking, adjacency pairs), pragmatics, prosody and the features of spontaneous speech, and multimodal texts through graphology, with mode read into the analysis (AO1, AO2, AO3).
How to analyse spoken and multimodal texts (transcripts, speeches, broadcast, digital) for OCR A-Level English Language and Literature Component 01: reading discourse, pragmatics, prosody and the features of spontaneous speech, plus graphology in multimodal texts, with mode read into the analysis (AO1, AO2, AO3).
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Component 01 is titled "non-fiction and spoken texts" because spoken and multimodal texts sit alongside written non-fiction in the anthology and the unseen material: transcripts of conversation, speeches, broadcast extracts, and digital or multimodal texts. Analysing them well needs the tools of spoken-language and multimodal study, discourse, pragmatics, prosody, the features of spontaneous speech, and graphology, with mode read into the meaning. This dot point covers those tools and the discipline of treating mode as part of the effect.
The answer
The key move with spoken and multimodal texts is to recognise what mode lets a text do and to analyse the resources specific to that mode, rather than reading everything as if it were crafted prose. A transcript negotiates meaning live; a speech projects a scripted persona to a listening audience; a digital text blends modes. Three clusters of tools cover the territory.
Interaction: discourse and pragmatics
A transcript of conversation is structured interaction, and discourse analysis reads its architecture. Turn-taking and who controls the floor reveal power; interruptions and overlaps show competition or collaboration; adjacency pairs (question-answer, greeting-greeting) show how talk is organised; topic management shows who steers. Alongside, pragmatics reads the relationship: politeness and face-work, the implicatures behind indirectness, the speech acts performed. Together these analyse how speakers build a relationship and negotiate power in real time.
Spontaneous speech features
Spontaneous speech carries features that written texts edit out, and they mean something. Fillers ("um", "er", "you know") and false starts mark planning in real time; pauses (often timed in a transcript) signal hesitation, emphasis or turn-management; repairs (self-correction) show the speaker monitoring their talk; non-fluency can mark spontaneity, nervousness or thought. Read these not as errors but as evidence of the spoken mode at work, and connect them to the speaker's stance or the relationship.
Planned speech and the scripted-spoken hybrid
A political speech or sermon is a hybrid: written to be spoken. It uses rhetoric designed for the ear (anaphora, tricolon, parallelism that build aurally), direct address, and prosodic effects (stress, pause, rhythm) that a transcript or stage direction may mark. Analyse it as crafted for delivery: the patterning that would look heavy on the page lands as rhythm aloud, and the persona is projected to a listening, often public, audience.
Multimodal and digital texts
Multimodal texts combine modes: writing with image, layout and typography (graphology), and often a spoken-written blend (a social post's informal, speech-like register in written form). Read graphological features (layout, font, colour, image) for purpose, and read the mode blend: how a digital text imports spoken immediacy and interactivity into the written mode. Always connect the visual or multimodal feature to meaning, not just description.
Examples in context
The texts vary by series, so the moves below are illustrative.
Interaction read as power. "The transcript stages a struggle for the floor: speaker A's repeated overlaps and latched turns seize topics before speaker B can finish, while B's lengthening pauses and rising fillers mark a speaker losing ground. The discourse itself, who holds and who yields the floor, dramatises a power imbalance that no single word states, and because this is spontaneous talk the imbalance is negotiated live, turn by turn." Discourse read as live power.
A scripted-spoken speech. "The address is built for the ear: the tricolon mounts through three parallel clauses to a stressed final beat, and the inclusive 'we' gathers the listening crowd into a single body. On the page the repetition might seem laboured, but delivered aloud it is rhythm and build, the spoken mode's way of carrying a public audience by sound as much as sense." Scripted-spoken rhetoric read for delivery.
Try this
Q1. Why must a transcript be analysed as interaction, not prose? [2 marks]
- Cue. A transcript is structured live talk; its meaning is in the turns, overlaps, repairs and face-work, not only in the words, so reading it as prose misses the spoken mode.
Q2. What do spontaneous speech features (fillers, pauses, repairs) signal? [2 marks]
- Cue. Real-time planning, hesitation, emphasis, turn-management or thought; they are meaningful evidence of the spoken mode, not errors.
Q3. Analyse how a spoken text builds a relationship between speakers, and compare with a written text. [32 marks]
- What the marker wants. Discourse and pragmatic analysis of the interaction named precisely (AO1) and read for effect (AO2), with mode read as context (AO3), woven into comparison with the written text (AO4).
A note on spoken texts
This guide is AI-written and not individually human-reviewed. The spoken and multimodal texts vary by series and transcription conventions differ, so confirm them against the current OCR H474/01 materials and the EMC anthology. The tools of discourse, pragmatics, prosody and graphology, with mode read into the meaning, transfer across spoken and multimodal texts.
Exam-style practice questions
Practice questions written in the style of OCR exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
OCR H474/01 (style of)16 marksAnalyse how the spoken text creates a relationship between the speakers, and compare this with the way the anthology text addresses its audience. Explore relevant contexts. [marked out of 32]Show worked answer →
A Component 01 comparison (OCR marks the paper out of 32) where one text is spoken (a transcript or speech) and the analysis must use the tools of spoken-language study.
For a transcript of interaction, read discourse and pragmatics: turn-taking and who controls the floor, interruptions and overlaps, adjacency pairs, topic management, and the politeness or face-work between speakers. For a planned speech, read its scripted-spoken hybridity: rhetorical patterning designed for the ear, direct address, prosodic emphasis. Name the features precisely (AO1), read how they build the relationship (AO2), and read mode into the effect (AO3): a spontaneous conversation negotiates the relationship live, where a speech projects it outward.
Reward use of spoken-language frameworks and mode-aware analysis. Weaker answers treat a transcript like a piece of writing, ignore interaction features, or label disfluencies without reading their effect.
OCR H474/01 (style of)16 marksCompare how the two texts use the features of their modes to engage their audiences. Explore connections and contexts. [marked out of 32]Show worked answer →
A comparison foregrounding mode (marked out of 32), often pairing a spoken or multimodal text with a written one.
Read each text's mode-specific resources: a spoken text's prosody, spontaneity features (fillers, false starts, pauses) and interaction; a multimodal or digital text's graphology, layout, images and the blend of written permanence with spoken-style informality; a written text's crafted structure. Compare how each exploits its mode to engage its audience, naming features precisely (AO1), reading effect (AO2) and treating mode as context (AO3).
Reward mode read as part of the meaning and genuine comparison (AO4). Weaker answers describe layout or transcription conventions without effect, or treat all texts as if they shared one mode.
Related dot points
- The Component 01 comparative question (H474/01): one timed comparison (1 hour, 32 marks) of a printed anthology text and an unseen non-fiction or spoken text, assessing AO1, AO2, AO3 and AO4, with idea-led comparison the key to the marks.
How to answer the OCR A-Level English Language and Literature Component 01 question (H474/01): a 1 hour, 32 mark comparison of one anthology text and one unseen non-fiction or spoken text, assessing AO1, AO2, AO3 and AO4, and why idea-led comparison with both texts live is the key to the marks.
- Analysing non-fiction language: reading rhetoric (ethos, pathos, logos, rhetorical patterning), voice and persona, register and lexis, and grammatical positioning across non-fiction genres, integrated with literary method and context (AO1, AO2, AO3).
How to analyse the language of non-fiction texts (speeches, journalism, letters, diaries, memoir) for OCR A-Level English Language and Literature Component 01: reading rhetoric, voice and persona, register and grammatical positioning with the integrated method, against audience, purpose and context (AO1, AO2, AO3).
- Approaching the unseen text (H474/01): a fast, systematic method for an unseen non-fiction or spoken text under time pressure, establishing mode, audience, purpose and genre, then finding the patterned features that bear on the question for the comparison (AO1, AO2, AO3).
How to approach the unseen non-fiction or spoken text in OCR A-Level English Language and Literature Component 01: a fast, systematic method under time pressure to establish mode, audience, purpose and genre, then find the patterned features that bear on the question for the comparison (AO1, AO2, AO3).
- Mode, context and representation: mode as a spoken-written continuum, context as production and reception (AO3), and representation as the constructed version a text builds of people, events and ideas, read into the language rather than written as separate background.
How mode, context and representation work in OCR A-Level English Language and Literature (H474): mode as a spoken-written continuum, context as production and reception (AO3), and representation as the constructed version a text builds, all read into the language rather than written as detachable background.
- The language levels toolkit (lexis, grammar, phonology and prosody, pragmatics, discourse, graphology) applied to literary texts: using linguistic precision to sharpen analysis of poetry, drama and prose, not only non-fiction (AO1 feeding AO2).
How to apply the language levels (lexis, grammar, phonology and prosody, pragmatics, discourse, graphology) to literary texts as well as non-fiction in OCR A-Level English Language and Literature (H474): using linguistic precision to sharpen literary analysis of poems, plays and prose, the AO1 toolkit that feeds AO2.