Linear Digressions
Linear Digressions
Ben Jaffe and Katie Malone
When Private Data Isn't Private Anymore
26 minutes Posted Mar 4, 2018 at 7:35 pm.
0:00
26:20
Download MP3
Show notes
After all the back-patting around making data science datasets and code more openly available, we figured it was time to also dump a bucket of cold water on everyone's heads and talk about the things that can go wrong when data and code is a little too open.
In this episode, we'll talk about two interesting recent examples: a de-identified medical dataset in Australia that was re-identified so specific celebrities and athletes could be matched to their medical records, and a series of military bases that were spotted in a public fitness tracker dataset.