Resources For Teachers For Tutors For Students & Parents Pricing
Year 11 General Univariate Data Analysis

Displaying And Describing Categorical Data Distributions

20 practice questions 1 video lesson Theory + worked examples

Theory

Categorical data is summarised with frequency tables and bar charts. Convert raw counts to percentages using percentage=frequencytotalร—100%. The mode is the most common category. Bar charts have gaps between bars to mark distinct categories; histograms (for numerical data) do not.

For categorical data, the goal is to summarise how often each category occurs. The two main displays are frequency tables and bar charts.

A frequency table lists each category and how many times it occurs. A percentage column is often added:

percentage=frequencytotalร—100%

A bar chart shows categories on the horizontal axis and frequency (or percentage) on the vertical axis. The bars are separated by gaps to indicate the categories are distinct, not continuous.

The mode is the most common category โ€” the one with the tallest bar (or the largest frequency in the table).

A segmented (stacked) bar chart is a single bar split into coloured sections, with each section representing a category's share. It's useful for percentage breakdowns or for comparing two groups side-by-side.

The first diagram is the favourite snacks bar chart from the worked example, with the mode (Chips) highlighted. The second compares a bar chart (categorical, with gaps) to a histogram (numerical, no gaps).

Bar chart of favourite snacks A bar chart showing favourite snacks among 120 students with chips the most popular at 36, biscuits at 30, chocolate at 24, fruit at 18, and sandwich at 12. Bars are separated by gaps. Favourite snacks (n = 120) Frequency Snack 10 20 30 40 36 Chips 18 Fruit 24 Choc 30 Bisc 12 Sand Mode Gaps between bars โ€” categories are distinct
Five separated bars for five categories. Chips is the mode (tallest bar at 36).
Bar chart versus histogram A side-by-side comparison: a bar chart with gaps between bars for categorical data on the left, and a histogram with touching bars for numerical data on the right. Bar chart vs histogram Bar chart categorical data A B C gap โœ“ gaps between bars Histogram numerical data 0-4 5-9 10-14 15-19 20-24 โœ“ bars touch Gap = categorical. No gap = numerical. A quick way to identify the chart type
Bar charts have gaps (distinct categories). Histograms don't (continuous numerical values).

The only real formula is the percentage conversion.

Percentage from frequency

percentage=frequencytotalร—100%

%=frequencytotalร—100%

Frequency from percentage

frequency=percentage100ร—total

Quick reference

QuantityHow to find it
Modecategory with the largest frequency
Missing frequencytotal minus the sum of the others
Combined %add the relevant percentages, or add their frequencies then รท total ร— 100%
Watch the totals. A "45%" in one survey isn't directly comparable to "45%" in another unless the totals are the same. Always compute percentages within each group before comparing.

Building a frequency table

  1. List every category in the data โ€” including any with zero count.
  2. Count (tally) how many times each category appears.
  3. Add a percentage column using the formula above.
  4. Check that all frequencies add to the total and all percentages add to 100% (allowing small rounding error).

Drawing a bar chart

  1. Put categories on the horizontal axis, frequency (or percentage) on the vertical.
  2. Make all bars the same width, separated by equal-sized gaps.
  3. Start the vertical axis at 0 (no truncation).
  4. Label both axes and give the chart a title.

Describing a categorical distribution

  1. State the mode โ€” the most common category.
  2. State the least popular category.
  3. Describe the general pattern: "one category dominates", "frequencies are roughly even", or similar.
EXAMPLE 1 โ€” FREQUENCY FROM PERCENTAGE
32% of 250 Year 11 students chose blue as their favourite colour. How many students chose blue?
SOLUTION

Convert the percentage to a decimal and multiply by the total.

Number=0.32ร—250
=80

Answer: 80 students chose blue.

number=80
EXAMPLE 2 โ€” PERCENTAGE FROM FREQUENCY
A survey of 180 pizza orders had 76 Medium pizzas. What percentage were Medium?
SOLUTION

Use percentage=frequencytotalร—100%.

%=76180ร—100%
โ‰ˆ42.2%

Answer: about 42.2% of pizzas were Medium.

%โ‰ˆ42.2%
EXAMPLE 3 โ€” FIND A MISSING FREQUENCY
A pet survey of 100 Brisbane families: Dog 45, Cat ?, Bird 12, Fish 8. Find the missing Cat count.
SOLUTION

The total of all categories equals 100. Subtract the known categories.

Cat=100โˆ’45โˆ’12โˆ’8
=35

Answer: 35 families chose Cat.

Cat=35
EXAMPLE 4 โ€” COMBINED PERCENTAGE
For the snack data above (Chips 36, Choc 24, total 120), what percentage of students chose Chips or Chocolate?
SOLUTION

Either add the individual percentages, or add the frequencies then divide by the total.

Combined=30%+20%
=50%

Check directly: 36+24120ร—100%=60120ร—100%=50%. โœ“

Answer: 50% of students chose Chips or Chocolate.

combined %=50%

Common pitfalls

Bar charts need gaps; histograms don't. The bars in a bar chart (categorical data) should be separated by gaps. Histograms (numerical, continuous data) have bars touching. Drawing them the wrong way around is a classic mark-losing mistake.
Truncated axes mislead. If the vertical axis doesn't start at 0, small differences between bars look much bigger than they are. Always check the axis โ€” and always start your own bar chart at 0.
Percentages need their totals. "45% of one survey" isn't directly comparable with "45% of another" unless the totals are the same. For comparing groups of different sizes, compute percentages within each group.
The mode is the category, not the frequency. If Chips has frequency 36 and is the most common, the mode is "Chips" โ€” not "36". A common slip.
Don't drop categories with zero frequency. A frequency of 0 is still a valid data point โ€” it tells you nobody chose that category. Keep the row in the table.

Frequently asked questions

What is a frequency table?

A frequency table lists each category in one column and how many times it occurs in another. A percentage column is often added: percentage equals frequency divided by total, times 100.

What is the difference between a bar chart and a histogram?

Bar charts are for categorical data and have GAPS between the bars to show categories are distinct. Histograms are for numerical data and have bars TOUCHING because values run continuously into each other.

What is the mode of a categorical dataset?

The mode is the most common category โ€” the one with the tallest bar or largest frequency. Note that the mode is the CATEGORY itself (like 'Chips') and not the frequency number (like 36).

What is a segmented bar chart?

A segmented (stacked) bar chart is a single bar divided into coloured sections, with each section's length showing one category's share. It's useful for visualising a percentage breakdown or comparing two groups side-by-side.

What is a truncated axis and why is it misleading?

A truncated axis is one that doesn't start at zero. This makes small differences between bars look much larger than they really are. It's a classic technique used in misleading graphs โ€” always check whether the vertical axis starts at zero.

How do I compare two groups of different sizes?

Convert raw frequencies to percentages within each group first. If 30 out of 60 Year 7s prefer soccer and 10 out of 60 Year 11s do, the percentages are 50 percent versus about 17 percent. Comparing raw numbers when totals differ is misleading.

Video Lesson

  • Stats Honors: 1.2 Displaying Categorical Data Watch

Practice Questions

20 questions available.

Practice Questions