How do you plan, run and write up an independent language investigation that reaches genuine, evidence-led conclusions?
The language investigation: framing a focused research question, collecting and handling data ethically, applying analytical methods, and writing up findings.
An Edexcel A-Level English Language (9EN0) answer on the coursework language investigation: framing a narrow research question, collecting data ethically, applying the language levels and named theory, presenting quantitative and qualitative findings, and structuring the write-up for the non-exam assessment.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
Edexcel wants you to plan and write the coursework language investigation: a focused, independent study of real language data in which you frame a narrow research question, gather data ethically, apply analytical frameworks rigorously, and write up findings that reach an evidence-led conclusion. It is the research half of the non-exam assessment, and it is the place where the analytical toolkit from the rest of the course is put to independent work. The question being tested is not "do you know about language and gender" but "can you turn a curiosity into a rigorous, answerable investigation".
The answer
A successful investigation is built as a small piece of empirical research. You choose a narrow research question in an area of language study (variation, change, gender and power, occupation, child language, or the language of a genre), collect authentic data ethically, and apply the language levels plus relevant theory as the analytical frameworks. The write-up follows a clear research structure (introduction and aim, methodology, analysis, conclusion), and the marks reward a tight focus, a systematic method, and analysis that reaches conclusions rather than narrating the data.
Framing the question
The most common cause of a weak investigation is a question that is too broad. "Language and power" or "how children learn to talk" are fields, not questions; they cannot be answered with a small data set. The fix is to narrow on three axes at once: a single feature (tag questions, interruptions, lexical density, modal verbs), a defined context (one radio interview, one set of estate-agent listings, one parent-child play session), and a clear comparison or pattern to test (does the feature differ between speakers, registers, or time periods).
A well-formed question is a hypothesis in disguise. "How do tag questions function in this radio interview?" can be sharpened to "Do the host's tag questions serve facilitative or epistemic functions, and does this differ from the guest's?" Now the data has something to confirm or disconfirm.
Collecting and handling data ethically
Data must be authentic (real language, not invented) and gathered ethically. The non-negotiables are informed consent (participants know they are being recorded and how the data will be used), anonymisation (no names or identifying details in the transcript), and special care with vulnerable participants, above all children, where consent comes from a parent or guardian.
The analytical pipeline: from data to conclusion
A rigorous investigation moves through a fixed pipeline. First, quantify: count the target feature and tabulate it, giving an objective evidence base (for example, hedges per 100 words by speaker). Second, analyse qualitatively: use the language levels to explain the pattern, quoting representative examples and naming the features precisely. Third, interpret through theory: ask whether the data supports, complicates or contradicts the theory you set out to test. Fourth, conclude and evaluate: answer the question directly, and state honestly what the data cannot show.
Writing it up
The write-up mirrors a research report: an introduction stating the aim and question and reviewing the relevant theory; a methodology justifying the data and method and addressing ethics; an analysis working systematically through the evidence with quantitative support and language-level commentary; and a conclusion that answers the question and evaluates limitations. Throughout, the prose should analyse and argue, not describe.
Examples in context
An occupational-register investigation. A student investigates "how does estate-agent property-listing language construct desirability?" using twenty online listings as data. The quantitative stage counts evaluative pre-modifiers ("stunning", "deceptively spacious", "characterful") and euphemisms ("compact", "in need of modernisation"); the qualitative stage analyses these at the lexis and pragmatics levels, showing how euphemism manages negative information through implicature. The investigation interprets the pattern through Fairclough's concept of synthetic personalisation (the "your dream home" address) and concludes that the register persuades by combining hyperbole with deniable euphemism, while noting the limit that twenty listings from one website cannot represent the whole genre.
A child-language investigation. A student records a three-year-old over two play sessions to investigate overregularisation. The data shows forms like "goed" and "foots". The analysis applies the grammar and morphology level and interprets the errors through Chomsky's notion of an innate rule-forming capacity and the U-shaped learning curve, arguing the errors are evidence of rule application, not failure. The conclusion answers the question (the child is overgeneralising the regular past-tense and plural rules) and evaluates the limits honestly: one child, two sessions, and a setting that may not be fully naturalistic.
Try this
Q1. Why must a language-investigation research question be narrow? [2 marks]
- What the marker wants. That a narrow, single-feature, single-context focus allows depth and a genuinely answerable hypothesis within the word limit, where a broad field cannot be analysed with a small data set.
Q2. State two ethical requirements and one methodological limitation typical of an investigation. [3 marks]
- What the marker wants. Ethics: informed consent and anonymisation (especially with children); limitation: small sample size, single context, or the observer's paradox.
Q3. Explain how an investigation should move from data to an evaluated conclusion. [16 marks]
- What the marker wants. The pipeline: quantify and tabulate, analyse qualitatively with the language levels, interpret through named theory as hypothesis-testing, then conclude by answering the question and evaluating limitations honestly.
A note on sources
This guide is AI-written and not individually human-reviewed. It reflects the Pearson Edexcel A-Level English Language (9EN0) non-exam assessment requirements and published examiner guidance. Verify current coursework rules, word limits and ethical requirements against the official Pearson specification before relying on it.
Exam-style practice questions
Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Edexcel 201920 marksA student plans to investigate language and gender by recording and analysing mixed-sex conversation. Evaluate the strengths and limitations of this design and explain how the student should refine the research question and methodology to produce a rigorous investigation.Show worked answer →
This coursework-planning question models the thinking the non-exam assessment rewards, testing AO1 (terminology and expression), AO2 (concepts and methods) and AO3 (context and ethics).
- Diagnose the breadth problem
- "Language and gender" is a field, not a question. A rigorous design narrows to one feature in one context, for example "how do interruptions and minimal responses pattern by sex in this 20-minute friendship-group conversation".
- Address methodology and ethics
- Informed consent, anonymisation, and the observer's paradox (recording changes behaviour) should all be acknowledged; a small naturalistic sample limits generalisability and that limit must be stated, not hidden.
- Tie to theory
- The design should test, not merely illustrate, theory (Zimmerman and West on interruption, Fishman on conversational labour), treating it as hypothesis. Top band evaluates rather than describes, reaching a conclusion the data can support.
Edexcel 202216 marksExplain how a language investigation should move from data to conclusion. Refer to the role of quantitative method, language-level analysis and evaluation of limitations.Show worked answer →
A method-focused question testing AO1 and AO2. The examiner wants the analytical pipeline, not a description of one topic.
Quantify then interpret. Counting and tabulating a feature (frequency of hedges per speaker, ratio of declaratives to interrogatives) gives an evidence base, but numbers alone are description; the analysis is in explaining the pattern using the language levels and theory.
Reach a genuine conclusion. The conclusion answers the research question, states what the data shows and, crucially, what it cannot show (sample size, observer's paradox, single context).
Top band treats the investigation as hypothesis-testing: frameworks chosen in advance, applied consistently, and evaluated honestly. Scripts that narrate the data without reaching an evaluated conclusion stay mid-band.
Related dot points
- Methods of language analysis: the language levels of phonology, lexis and semantics, grammar, pragmatics, discourse and graphology, and moving from feature to effect.
An Edexcel A-Level English Language (9EN0) answer on methods of language analysis: the language levels (phonology, lexis and semantics, grammar and morphology, pragmatics, discourse and graphology), the GRAPE and discourse frameworks, and how to move systematically from naming a feature to proving its effect on audience and purpose.
- Original writing and commentary: writing for a chosen genre, audience and purpose using a style model, and reflecting analytically on linguistic choices in a commentary.
An Edexcel A-Level English Language (9EN0) answer on the coursework original writing and commentary: using a style model, crafting for a precise genre, audience and purpose, and writing a reflective commentary that analyses your own linguistic choices with the language levels and metalanguage.
- Exam text analysis: analysing and comparing unseen texts using the discourse framework, building a comparative argument, and writing to time.
An Edexcel A-Level English Language (9EN0) answer on exam text analysis, covering the discourse (mode, field, tenor) framework, comparing unseen texts, building a comparative thesis, integrating context and theory, and writing analytically under timed conditions.
- Social and regional variation: regional dialects, sociolinguistic studies of class, social networks and the named research of Labov, Trudgill and Milroy.
An Edexcel A-Level English Language (9EN0) answer on social and regional variation: regional dialect, class-based variation, overt and covert prestige, and the sociolinguistic studies of Labov (Martha's Vineyard, New York), Trudgill (Norwich) and the Milroys (Belfast social networks), with their methods evaluated.
- Language and gender, power and occupation: deficit, dominance and difference models, instrumental and influential power, and occupational register, with Lakoff, Tannen, Zimmerman and West, Fairclough and Drew and Heritage.
An Edexcel A-Level English Language (9EN0) answer on language, gender, power and occupation: the deficit, dominance and difference models, instrumental and influential power, occupational register and discourse communities, with Lakoff, Tannen, Zimmerman and West, Fishman, Fairclough, Drew and Heritage and Swales, and how to evaluate them.
Sources & how we know this
- Pearson Edexcel A-Level English Language (9EN0) specification — Pearson Edexcel (2015)