22-25 April 2026
Video unavailable
SQLBits 2022

Lessons from spreadsheet data horror stories

Lessons from published stories of mismanagement of data using Excel leading to data loss and corruption.
There are many pitfalls in using Microsoft Excel to processing important data. I present published reports of horror stories and draw lessons from them. Firstly, how unskilled Excel users corrupted gene name records; researchers decided to rename genes to suit Excel! The lesson is to learn how to import CSVs into Excel correctly. Then I describe the Public Health England debacle, where they lost Covid lab results. That illustrates the risks of automated data imports without safety controls and the lesson is a basic reconciliation technique.