Skip to main content
EnglandComputer ScienceSyllabus dot point

How is text stored in binary, and why was Unicode needed when ASCII already existed?

How text is represented using character sets, the ASCII character set, the need for and nature of Unicode, and how a character maps to a binary code.

An Eduqas GCSE Computer Science answer on how text is stored in binary using character sets, the 7-bit ASCII set, why Unicode was needed to support many languages, and how a character maps to a binary code.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Character sets
  3. ASCII
  4. Unicode
  5. Try this

What this dot point is asking

Eduqas wants you to explain how text is stored in binary using a character set, describe ASCII, and explain why Unicode was needed when ASCII already existed. The marks come from the idea that each character has a unique binary code and from the contrast between ASCII's small range and Unicode's huge one.

Character sets

ASCII

Storing the alphabet in order has a useful side effect: you can find a letter's code from a known one. Since A=65A = 65, the letter EE (four letters after AA) is 65+4=6965 + 4 = 69. The gap between a capital and its lowercase version is always 3232 (for example A=65A = 65 and a=97a = 97).

Unicode

Try this

Q1. State how many bits standard ASCII uses per character and how many characters that allows. [2 marks]

  • Cue. 77 bits, allowing 27=1282^7 = 128 characters.

Q2. The ASCII code for AA is 6565. State the ASCII code for CC. [1 mark]

  • Cue. 6767 (two after AA).

Q3. Give one reason Unicode was created. [1 mark]

  • Cue. ASCII could not represent the characters of other languages, so Unicode provides far more codes.

Exam-style practice questions

Practice questions written in the style of WJEC Eduqas exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Eduqas Component 1, 20224 marksExplain what is meant by a character set, and explain why Unicode is used in addition to ASCII.
Show worked answer →

Character set (up to 2 marks): a character set is an agreed mapping that gives each character (letter, digit, symbol or control code) a unique binary code, so the same text is stored and read the same way on different systems.

Why Unicode (up to 2 marks): ASCII uses only 7 bits, giving 128 characters, which is enough for English letters, digits and basic symbols but not for the many thousands of characters in other languages and scripts. Unicode uses more bits, so it can represent characters from almost every writing system, plus emoji, while keeping the first 128 codes the same as ASCII.

Markers reward the "unique binary code per character" idea and the "ASCII has too few codes for other languages" point.

Eduqas Component 1, 20233 marksThe ASCII code for the capital letter A is 65. State the ASCII code for the capital letter D, the code for a lowercase a, and explain how you worked out the code for D.
Show worked answer →

Capital D: the capital letters are stored in order, so D is three after A: 65 + 3 = 68 (1 mark plus 1 for the reasoning that letters are sequential).

Lowercase a: lowercase letters start at 97, so a is 97 (1 mark).

Explanation: because ASCII stores the alphabet in consecutive codes, you can add the letter's position difference to a known code, which is also why sorting by ASCII code puts letters in alphabetical order.

Related dot points

Sources & how we know this