Box-and-Whisker Plots
Constructing and interpreting box plots from cumulative frequency data.
About This Topic
Box-and-whisker plots summarize data distributions by showing the median, quartiles, range, and outliers in a compact visual form. Secondary 3 students construct these plots from cumulative frequency curves: they locate the median at the 50th percentile, lower quartile at 25th, and upper quartile at 75th on the curve, then plot the five-number summary. Interpretation focuses on the box length for interquartile range, whisker lengths for spread, and asymmetry for skewness in the data set.
This topic anchors the Data Analysis and Probability unit, linking to prior skills in central tendency and dispersion measures. Students compare box plots to histograms or dot plots, noting how box plots excel at highlighting outliers and spread without detailing shape frequencies. Real-world applications, like comparing test scores across classes, build statistical inference skills essential for further probability studies.
Active learning benefits this topic greatly, as students gather class data on heights or reaction times, construct plots in groups, and debate interpretations. Hands-on plotting from cumulative curves makes quartiles tangible, group comparisons sharpen critical analysis, and peer discussions correct visual misreads, turning passive graphing into dynamic data sense-making.
Key Questions
- Explain what the shape of a box plot tells us about the skewness of a data set.
- Compare box plots to other graphical representations for summarizing data distribution.
- Construct a box plot and use it to compare two different datasets.
Learning Objectives
- Construct a box plot accurately from cumulative frequency data, identifying the median, quartiles, and range.
- Analyze the shape of a box plot to explain the skewness of a dataset, distinguishing between symmetric, right-skewed, and left-skewed distributions.
- Compare and contrast box plots with histograms and dot plots, articulating the strengths and weaknesses of each for data summarization.
- Interpret and compare two or more box plots to draw conclusions about differences in central tendency and spread between datasets.
Before You Start
Why: Students must be able to read and interpret cumulative frequency curves to locate the percentiles needed for constructing a box plot.
Why: Understanding concepts like median, quartiles, and range is fundamental to interpreting the components of a box plot.
Key Vocabulary
| Median | The middle value in a sorted dataset, representing the 50th percentile. |
| Quartiles | Values that divide a dataset into four equal parts: the lower quartile (25th percentile), the median (50th percentile), and the upper quartile (75th percentile). |
| Interquartile Range (IQR) | The difference between the upper quartile and the lower quartile, representing the spread of the middle 50% of the data. |
| Skewness | A measure of the asymmetry of a probability distribution of a real-valued random variable about its mean. A box plot can visually indicate skewness. |
| Five-Number Summary | A set of five key statistics that describe a dataset: minimum, lower quartile, median, upper quartile, and maximum. |
Watch Out for These Misconceptions
Common MisconceptionWhiskers always extend to the absolute minimum and maximum data points.
What to Teach Instead
Whiskers reach the smallest and largest values within 1.5 times the IQR from quartiles; points beyond are outliers. Group activities plotting real data help students spot and plot outliers correctly, as they measure IQR themselves and see extremes flagged visually.
Common MisconceptionThe median in a box plot is the average of the data.
What to Teach Instead
The median is the middle value at the 50th percentile from cumulative data, not the mean. Hands-on construction from curves lets students trace percentiles precisely, and pair checks prevent averaging confusion through shared verification.
Common MisconceptionA longer box always means more skewed data.
What to Teach Instead
Box length shows IQR spread, while skewness comes from unequal whiskers. Comparing multiple plots in small groups clarifies this, as students measure and debate asymmetry directly from their constructions.
Active Learning Ideas
See all activitiesPairs Task: Cumulative Curve to Box Plot
Pairs receive a cumulative frequency curve for student quiz scores. They identify median, Q1, Q3, min, and max values, then sketch the box plot. Partners check each other's plots against the curve and note any skewness.
Small Groups: Dataset Comparison Challenge
Provide two cumulative frequency curves for heights in different classes. Groups construct box plots for both, measure IQR and whisker lengths, then discuss which group has greater variability or skewness. Present findings on chart paper.
Whole Class: Real-Time Data Plotting
Collect class data on travel times to school via quick survey. Plot cumulative frequency as a class on the board, then have volunteers draw the box plot. Discuss outliers like late buses and what they reveal.
Individual: Skewness Interpretation
Give printed box plots of exam scores. Students label skewness direction, justify with whisker lengths, and predict mean position relative to median. Share one insight with a partner.
Real-World Connections
- Financial analysts use box plots to visualize the distribution of stock prices or company earnings over a period, quickly identifying median returns, the spread of performance, and potential outliers.
- Sports statisticians employ box plots to compare player performance across different seasons or teams, for example, comparing the distribution of points scored per game by two basketball players.
- Medical researchers might use box plots to compare the effectiveness of different treatments by visualizing the distribution of patient recovery times or symptom severity scores.
Assessment Ideas
Provide students with a cumulative frequency table and ask them to calculate the median and quartiles. Then, have them plot these values to begin constructing a box plot, checking their calculations and plotting accuracy.
Present two box plots side-by-side, one representing test scores from Class A and another from Class B. Ask students: 'Which class performed better overall? How do you know? What does the length of the boxes tell you about the consistency of performance in each class?'
Give students a completed box plot. Ask them to write down the five-number summary it represents and to describe in one sentence whether the data appears to be skewed and in which direction.
Frequently Asked Questions
How do you construct a box plot from cumulative frequency data?
What does the shape of a box plot reveal about skewness?
How can active learning improve understanding of box-and-whisker plots?
How do box plots compare to histograms for data summaries?
Planning templates for Mathematics
5E Model
The 5E Model structures lessons through five phases (Engage, Explore, Explain, Elaborate, and Evaluate), guiding students from curiosity to deep understanding through inquiry-based learning.
Unit PlannerMath Unit
Plan a multi-week math unit with conceptual coherence: from building number sense and procedural fluency to applying skills in context and developing mathematical reasoning across a connected sequence of lessons.
RubricMath Rubric
Build a math rubric that assesses problem-solving, mathematical reasoning, and communication alongside procedural accuracy, giving students feedback on how they think, not just whether they got the right answer.
More in Data Analysis and Probability
Review of Data Representation
Revisiting histograms, bar charts, pie charts, and stem-and-leaf plots for data visualization.
2 methodologies
Measures of Central Tendency
Calculating and interpreting mean, median, and mode for grouped and ungrouped data.
2 methodologies
Measures of Spread (Range and IQR)
Calculating and interpreting range and interquartile range for grouped and ungrouped data.
2 methodologies
Cumulative Frequency Curves
Constructing and interpreting cumulative frequency curves.
2 methodologies
Introduction to Probability
Defining probability, outcomes, events, and calculating simple probabilities.
2 methodologies
Probability of Combined Events (Independent)
Using tree diagrams and possibility diagrams to calculate probabilities for multiple independent events.
2 methodologies