What does the Data representation content in WJEC GCSE Computer Science cover?

Data representation covers why computers use binary, the units of data capacity, converting between binary, denary and hexadecimal, adding binary numbers and overflow, arithmetic shifts, representing signed numbers using sign and magnitude and two's complement, character sets such as ASCII and Unicode, representing bitmap images and sampled sound, calculating file sizes, and lossy and lossless compression including run-length encoding. It is the foundation of Unit 1 and underpins how every kind of data is stored.

How do you calculate the file size of an image or a sound?

For a bitmap image, file size in bits is width in pixels times height in pixels times the colour depth (bits per pixel), then divide by eight for bytes. For sound, file size in bits is the sample rate (samples per second) times the duration in seconds times the sample resolution (bit depth), then divide by eight for bytes. In both cases more detail, whether more pixels, higher colour depth, a higher sample rate or a higher bit depth, increases the file size.

What is the difference between lossy and lossless compression in WJEC computer science?

Lossy compression makes a file smaller by permanently removing some data, so the original cannot be recovered exactly; it is used for photos, music and video where smaller files matter more than perfect quality. Lossless compression makes a file smaller with no loss of data, so the original can be reconstructed exactly; it is used for text, spreadsheets and program code. Run-length encoding is a lossless method that stores runs of repeated values as a value and a count.

How do you represent a negative number in two's complement?

Write the positive value in binary, invert (flip) every bit so each zero becomes a one and each one becomes a zero, then add one. The leftmost bit then indicates the sign. Two's complement is preferred over sign and magnitude because subtraction can be carried out as a single binary addition, the hardware is simpler, and there is only one representation of zero.

Why is hexadecimal used in computing?

Hexadecimal is base sixteen, using the digits zero to nine and the letters A to F for ten to fifteen. It is used as a shorthand for binary because each hexadecimal digit represents exactly four bits, so an eight-bit byte becomes just two hexadecimal digits. That makes long binary numbers much shorter and easier for people to read and less error-prone, which is why hexadecimal appears in colour codes, memory addresses and error codes.

WalesComputer Science

WJEC GCSE Computer Science Data representation: a complete overview of binary, hexadecimal, negative numbers, characters, images, sound and compression

A deep-dive WJEC GCSE Computer Science guide to the Data representation content in Unit 1. Covers binary and denary, hexadecimal, binary arithmetic and overflow, arithmetic shifts, signed numbers in sign and magnitude and two's complement, character sets, bitmap images, sampled sound, file-size calculations and compression, with the conversions and exam patterns WJEC repeats.

Generated by Claude Opus 4.814 min read3500 Unit 1 Data representation and data typesUpdated 2026-06-15

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section

What the Data representation content demands
Binary and the denary system
Hexadecimal
Binary arithmetic and overflow
Representing negative numbers
Characters, ASCII and Unicode
Representing images
Representing sound
Compression
Check your knowledge

What the Data representation content demands

Data representation is where WJEC checks that you understand how a machine built from two-state switches can store numbers, text, pictures and sound. Every other part of the course, from hardware to networks to programming, sits on top of the idea that everything is ultimately a pattern of bits. This area is also the most calculation-heavy part of Unit 1, so fluent, accurate conversions and file-size sums earn marks reliably under exam pressure.

This guide walks through the Data representation content and ties together the matching dot-point pages, each of which has its own worked examples and practice questions.

Binary and the denary system

Computers use binary because their electronic components have two stable states, on and off, that map onto $1$ and $0$ . A bit is one binary digit, a byte is $8$ bits, and capacity rises through kilobytes, megabytes, gigabytes and terabytes. Binary columns are powers of two ( $128, 64, 32, 16, 8, 4, 2, 1$ for a byte). To convert binary to denary, add the place values where a $1$ appears; to convert denary to binary, subtract the largest power of two that fits at each step.

Hexadecimal

Hexadecimal is base $16$ , using $0$ to $9$ then $\text{A}$ to $\text{F}$ for $10$ to $15$ . It is a shorthand for binary because each hex digit is exactly four bits, so a byte is two hex digits. Convert hex to binary one digit at a time, binary to hex by grouping bits into fours from the right, and hex to denary using the column values $16$ and $1$ . Hexadecimal appears in colour codes, memory addresses and error messages.

Binary arithmetic and overflow

Add binary from the right using $1 + 1 = 10$ (carry one) and $1 + 1 + 1 = 11$ (carry one). Overflow happens when a result is too large for the available bits, so the final carry is lost and the answer is wrong. An arithmetic shift moves all the bits: a left shift multiplies by two and a right shift divides by two, with a shift of $n$ places multiplying or dividing by $2^n$ . Shifts are fast, so compilers use them for multiplying and dividing by powers of two.

Representing negative numbers

Plain binary stores only positive numbers, so signed methods are needed. In sign and magnitude the leftmost bit is the sign and the rest is the size, but it has two zeros. In two's complement you write the positive value, flip every bit and add one; subtraction then becomes a single addition, there is one zero, and an $8$ -bit range runs from $-128$ to $+127$ . Two's complement is what real processors use.

Characters, ASCII and Unicode

Text is stored using a character set that gives each character a unique binary code. ASCII uses $7$ bits ( $128$ characters, or $256$ in extended ASCII) and orders the letters in sequence, with upper and lower case $32$ apart. Unicode uses more bits to cover every language plus symbols and emoji, so the same text takes more storage than in ASCII. A string's size is the number of characters times the bits per character.

Representing images

A bitmap is a grid of pixels, each storing a colour as a binary number. Resolution is the number of pixels; colour depth is the bits per pixel, and the number of colours is $2^{\text{colour depth}}$ . The file size in bits is width times height times colour depth, divided by eight for bytes. Metadata such as the dimensions and colour depth is stored with the image so it can be displayed correctly. More pixels or more colours mean a larger file.

Representing sound

Sound is an analogue wave, digitised by sampling: the amplitude is measured at regular intervals and stored as binary. The sample rate is samples per second and the sample resolution (bit depth) is bits per sample; both raise quality and file size. The file size in bits is sample rate times duration times bit depth, divided by eight for bytes. Sound is often compressed (for example to MP3) to make files small enough to stream.

Compression

Compression makes files smaller so they take less storage and less bandwidth. Lossy compression permanently removes data and cannot be reversed exactly; it suits photos, music and video. Lossless compression keeps every bit so the original is recovered exactly; it suits text, spreadsheets and code. Run-length encoding is lossless and stores each run of repeated values as the value and a count, which helps on data with long runs but can enlarge varied data.

Check your knowledge

A mix of conversion, arithmetic, file-size and compression questions covering the Data representation content. Attempt them under timed conditions, then check against the solutions.

Convert the binary number $10101100$ to denary. (2 marks)
Convert the denary number $53$ to $8$ -bit binary. (2 marks)
Convert the hexadecimal number $\text{3D}$ to denary. (2 marks)
Add the binary numbers $00101101$ and $00010110$ . (2 marks)
Write $-9$ in $8$ -bit two's complement. (2 marks)
An image is $40$ by $30$ pixels with a colour depth of $8$ bits. Calculate its size in bytes. (3 marks)
A $4$ -second sound is sampled at $2000\,\text{Hz}$ with a bit depth of $8$ bits. Calculate its size in bytes. (3 marks)
Compress WWWWWWWWBBWWWW using run-length encoding. (2 marks)

Solutions

Step 1: Q1. Convert binary to denary

Add the place values where a $1$ appears. The columns from left to right in $10101100$ are $128, 64, 32, 16, 8, 4, 2, 1$ :

128 + 32 + 8 + 4 = 172

Step 2: Q2. Convert denary to 8-bit binary

Subtract the largest power of two that fits at each step. $53 = 32 + 16 + 4 + 1$ , placing a $1$ in those columns and $0$ elsewhere:

00110101

Step 3: Q3. Convert hexadecimal to denary

Each hex digit has a place value that is a power of $16$ . $\text{3D}$ means $3 \times 16 + 13$ , where $\text{D} = 13$ :

3 \times 16 + 13 = 48 + 13 = 61

Step 4: Q4. Add two binary numbers

Convert each to denary to confirm, then add in binary, carrying $1$ when a column sum reaches $2$ :

00101101\ (45) + 00010110\ (22) = 01000011\ (67)

Step 5: Q5. Write a negative number in two's complement

Write the positive value, flip every bit, then add $1$ . The leftmost bit becomes $1$ , indicating a negative:

+9 = 00001001 \xrightarrow{\text{flip}} 11110110 \xrightarrow{+1} 11110111

Step 6: Q6. Calculate an image file size in bytes

Multiply pixels by colour depth for bits, then divide by $8$ for bytes. The three-step chain is: pixel count, then bits, then bytes:

40 \times 30 = 1200 \text{ pixels}; \quad 1200 \times 8 = 9600 \text{ bits}; \quad 9600 \div 8 = 1200 \text{ bytes}

Step 7: Q7. Calculate a sound file size in bytes

Multiply sample rate by duration to get the total number of samples, multiply by bit depth for bits, then divide by $8$ for bytes:

2000 \times 4 = 8000 \text{ samples}; \quad 8000 \times 8 = 64000 \text{ bits}; \quad 64000 \div 8 = 8000 \text{ bytes}

Step 8: Q8. Apply run-length encoding

Count consecutive identical values and replace each run with its count and value. The string $\text{WWWWWWWWBBWWWW}$ has a run of $8$ Ws, then $2$ Bs, then $4$ Ws:

8\text{W}\ 2\text{B}\ 4\text{W}

Sources & how we know this

WJEC GCSE Computer Science specification (3500) from 2017 — WJEC (2017)