Activity 01
Gallery Walk: The Messy Dataset Museum
Print five different messy datasets and post them around the room, each with a different type of data quality problem (duplicates, missing values, format mismatches, outliers, impossible values). Groups rotate through stations with sticky notes to identify the problem type and propose a cleaning strategy before moving on.
Explain the common types of data inconsistencies and errors.
Facilitation Tip: During the Gallery Walk, position students as curators who must explain their cleaning decisions to peers using the provided rubric.
What to look for: Provide students with a small, messy dataset (e.g., a CSV snippet with errors). Ask them to identify two specific data quality issues present and suggest one cleaning step for each. Collect these as they leave class.
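For teachers preparing the messy CSV snippet, the following is a minimal sketch of what such a snippet and an answer key might look like. Everything in it is hypothetical and made up for illustration: the column names, the sample rows, the function name `find_issues`, and the thresholds (ages outside 0 to 120 count as impossible; totals above ten times the median count as outliers). It uses only the Python standard library and flags one example of each of the five problem types named in the activity.

```python
import csv
import io
from datetime import datetime
from statistics import median

# Hypothetical classroom snippet: each problem type appears at least once.
SAMPLE_CSV = """id,name,signup_date,age,purchase_total
1,Ana,2023-01-15,34,120.50
2,Ben,15/01/2023,29,80.00
2,Ben,15/01/2023,29,80.00
3,Cy,2023-02-03,,95.25
4,Dee,2023-02-10,-5,110.00
5,Eli,2023-03-01,41,99999.00
"""


def is_iso_date(text):
    """Return True if text parses as a YYYY-MM-DD date."""
    try:
        datetime.strptime(text, "%Y-%m-%d")
        return True
    except ValueError:
        return False


def find_issues(csv_text):
    """Map each problem type to the data-row numbers (1-based) that show it."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    issues = {
        "duplicates": [],
        "missing values": [],
        "format mismatches": [],
        "outliers": [],
        "impossible values": [],
    }

    # Assumed outlier rule for this sketch: more than 10x the median total.
    totals = [float(r["purchase_total"]) for r in rows if r["purchase_total"]]
    cutoff = 10 * median(totals)

    seen = set()
    for number, row in enumerate(rows, start=1):
        key = tuple(row.values())
        if key in seen:  # exact repeat of an earlier row
            issues["duplicates"].append(number)
        seen.add(key)

        if any(value == "" for value in row.values()):
            issues["missing values"].append(number)

        if row["signup_date"] and not is_iso_date(row["signup_date"]):
            issues["format mismatches"].append(number)

        # Assumed plausibility range for age: 0 to 120 inclusive.
        if row["age"] and not 0 <= int(row["age"]) <= 120:
            issues["impossible values"].append(number)

        if row["purchase_total"] and float(row["purchase_total"]) > cutoff:
            issues["outliers"].append(number)

    return issues


if __name__ == "__main__":
    for problem, row_numbers in find_issues(SAMPLE_CSV).items():
        print(f"{problem}: rows {row_numbers}")
```

Running the script prints the row numbers flagged for each problem type, which can serve as a quick answer key when reviewing students' responses. The thresholds are deliberately simple and are worth discussing with students, since deciding what counts as an outlier or an impossible value is itself a cleaning decision.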