Skip to main content
EnglandMathsSyllabus dot point

How do you use Venn diagrams and set notation to organise data and calculate probabilities?

Venn diagrams for two or three sets, set notation (union, intersection and complement), and using a completed Venn diagram to find probabilities including conditional probability (Higher tier).

A focused answer to the Edexcel GCSE Mathematics probability content on Venn diagrams and set notation, covering two and three set diagrams, union, intersection and complement notation, and finding probabilities from a Venn diagram including conditional probability.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Set notation
  3. Filling a Venn diagram
  4. Three-set Venn diagrams
  5. Probability and conditional probability
  6. Reading probabilities of unions and intersections
  7. Try this

What this dot point is asking

Edexcel expects you to use Venn diagrams to organise overlapping groups, to read and use set notation (union, intersection and complement), and to find probabilities from a completed Venn diagram, including conditional probability at Higher tier. Venn diagrams turn wordy "how many do both" problems into a clear picture, and set notation gives a precise language for the regions.

Set notation

Set notation names the regions of a Venn diagram precisely, and Edexcel uses these symbols in questions.

So P(AB)P(A \cap B) is the probability of being in the overlap, and P(AB)P(A \cup B) is the probability of being in either circle. Reading the symbols correctly is half the task.

Filling a Venn diagram

The reliable order is overlap first, then the parts of each set outside the overlap, then anything outside all sets.

For two sets, if 2020 like apples, 1515 like bananas, 88 like both, then the overlap is 88, apples-only is 208=1220 - 8 = 12, bananas-only is 158=715 - 8 = 7. Starting with the overlap prevents double-counting, which is the single most common mistake.

Three-set Venn diagrams

With three sets there are more regions, but the same principle applies: work from the centre (in all three) outwards.

Probability and conditional probability

To find a probability from a Venn diagram, count the items in the region you want and divide by the total in the diagram. Conditional probability, written P(AB)P(A \mid B) ("A given B"), narrows the focus to set BB only: the denominator becomes the total in BB, not the whole universal set. This is why, in the exam question, "coffee given tea" divides by the number of tea drinkers, not by everyone.

Reading probabilities of unions and intersections

Once a Venn diagram is filled, set-notation probabilities follow directly. P(AB)P(A \cap B) counts only the overlap; P(AB)P(A \cup B) counts every region inside either circle; and P(A)P(A') counts everything outside AA. A useful identity is P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B), because adding the two circles double-counts the overlap, so you subtract it once. For example, if P(A)=0.5P(A) = 0.5, P(B)=0.4P(B) = 0.4 and P(AB)=0.2P(A \cap B) = 0.2, then P(AB)=0.5+0.40.2=0.7P(A \cup B) = 0.5 + 0.4 - 0.2 = 0.7. Translating the symbols into regions before counting prevents most errors.

Try this

Q1. In a group of 2525 people, 1414 play tennis, 1111 play golf and 55 play both. How many play neither? [3 marks]

  • Cue. Tennis only 99, golf only 66, both 55, so 2020 play at least one; 2520=525 - 20 = 5 play neither.

Q2. From the same group, find P(plays golfplays tennis)P(\text{plays golf} \mid \text{plays tennis}). [2 marks]

  • Cue. Tennis players total 1414; of these 55 also play golf, so 514\dfrac{5}{14}.

Exam-style practice questions

Practice questions written in the style of Pearson Edexcel exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Edexcel 20193 marksIn a class of 3030 students, 1818 study French, 1414 study Spanish and 77 study both. Draw a Venn diagram and work out how many students study neither language. (Paper 2, calculator.)
Show worked answer →

Start with the overlap (both), then fill the rest of each set.

Both: 77. French only: 187=1118 - 7 = 11. Spanish only: 147=714 - 7 = 7.

Students studying at least one: 11+7+7=2511 + 7 + 7 = 25.

Neither: 3025=530 - 25 = 5.

Markers award a mark for the overlap, a mark for the "only" regions, and a mark for the answer 55. Forgetting to subtract the overlap from each set (so the total exceeds 3030) is the usual error.

Edexcel 20213 marksUsing the Venn diagram from a survey, where 1212 people like tea only, 88 like coffee only, 55 like both and 44 like neither, find P(likes coffeelikes tea)P(\text{likes coffee} \mid \text{likes tea}). (Higher tier, Paper 2, calculator.)
Show worked answer →

Conditional probability "coffee given tea" restricts attention to the tea drinkers only.

Tea drinkers in total: tea only plus both =12+5=17= 12 + 5 = 17.

Of these, those who also like coffee are the "both" group: 55.

P(coffeetea)=517P(\text{coffee} \mid \text{tea}) = \dfrac{5}{17}.

Markers award a mark for the tea total of 1717, a mark for the 55 in the overlap, and a mark for the fraction. Using the whole sample of 2929 as the denominator is the common error.

Related dot points

Sources & how we know this