Displaying And Describing Numerical Data
Theory
Choose the display by dataset type: dot plots for small discrete data, stem-and-leaf for medium datasets where you want to keep every value, and histograms (with grouped frequency tables) for continuous or large datasets. The modal class is the interval with the highest frequency. Class width, midpoint, and outliers are key quantities to recognise.
For numerical data, the type of display depends on the data values. Small discrete datasets use dot plots; medium-sized datasets often use stem-and-leaf plots; large or continuous datasets are displayed with histograms.
A dot plot shows each data value as a dot above its position on a number line. Identical values stack vertically. The mode is the tallest stack.
A stem-and-leaf plot splits each value into a stem (leading digit(s)) and a leaf (last digit). All values are preserved โ nothing is lost. The row with the most leaves is the modal stem class.
A histogram displays grouped numerical data: values are bundled into class intervals, and each bar's height is the frequency for that interval. Bars touch because the data is continuous.
For grouped data:
- Class width = the span of one interval (e.g. interval
โ has width in whole cm). - Midpoint = average of the two endpoints (
+ ) รท 2 = . - Modal class = the interval with the highest frequency.
An outlier is a value far from the rest โ visible as an isolated dot or an unusually low-frequency far-off bar.
The first diagram is a dot plot of "pets per household". The second is a histogram of "sales calls per day", with the modal class highlighted in orange.
The only formulas are for grouped data.
Class width
For the interval
For continuous intervals like "
Class midpoint
When to use which display
| Dataset | Best display |
|---|---|
| Small, discrete (whole numbers) | Dot plot |
| Medium, want to keep all values | Stem-and-leaf plot |
| Large or continuous | Grouped frequency table + histogram |
Drawing a dot plot
- Draw a horizontal number line spanning the data's range.
- Above each value, stack one dot per occurrence.
- Label the axis and identify the mode (the tallest stack).
Building a stem-and-leaf plot
- Decide what's stem and what's leaf โ usually the last digit is the leaf, everything before it is the stem.
- List each unique stem in a column. For each data value, write its leaf in the row for its stem, in increasing order.
- Identify the smallest value, largest value, modal stem class, and count the total leaves.
Building a grouped frequency table and histogram
- Choose class intervals of equal width that cover the data.
- Tally how many values fall in each class.
- Draw the histogram with bars whose heights equal the frequencies โ bars touch.
- State the modal class (interval with the highest frequency), class width, and midpoints as needed.
The tallest stack is at
| Total | ||
Answer: mode =
The smallest value has the smallest stem and the smallest leaf in that row. The largest has the largest stem and largest leaf. The count is the total number of leaves.
| Min | ||
| Max | ||
| Count |
Answer: min
The class with the highest frequency (
| Modal class |
Answer: the modal class is
For whole-unit intervals, width = top โ bottom + 1. The midpoint is the average of the endpoints.
| Width | ||
| Midpoint |
Answer: class width
Common pitfalls
Frequently asked questions
What is a dot plot?
A dot plot shows each data value as a dot above its position on a number line. Identical values stack vertically. Dot plots are best for small datasets of discrete numerical values.
What is a stem-and-leaf plot?
A stem-and-leaf plot splits each data value into a stem (the leading digit or digits) and a leaf (the last digit). All the values are preserved โ nothing is lost. It works well for medium-sized datasets where you still want to see each value.
When should I use a histogram?
Use a histogram for continuous data or for large datasets where listing every value would be cluttered. Group the values into class intervals and draw bars whose heights match the frequencies. Histogram bars TOUCH because the data is continuous.
What is a modal class?
For grouped data, the modal class is the class interval with the highest frequency. It's an interval like '10 to 14', not a single number. The true mode can't be pinpointed exactly from grouped data.
How do I find the class width and midpoint?
The class width is the span of one interval. For the interval 150 to 154 (in whole cm), the width is 5. The midpoint is the average of the two endpoints: (150 + 154) divided by 2 equals 152.
What is an outlier?
An outlier is a value far from the rest of the data. In a dot plot it shows up as a single dot well separated from the main cluster. In a histogram it sits in a far-off interval with very low frequency.
Video Lessons
Practice Questions
20 questions available.
Practice Questions