Episode

What is de-identified data?

Podcast
Behind the Genes
Published
Mar 18, 2026
Duration seconds
399
Processing state
not_requested
Canonical source
https://genomicsengland.podbean.com/e/what-is-de-identified-data/
Audio
https://mcdn.podbean.com/mf/web/uuqkawsqsx8vpe6f/Genomics_101_What_is_de-identified_data.mp3
JSON
/v1/public/podcasts/behind-the-genes-3509739/episodes/what-is-de-identified-data
Markdown
/podcast/behind-the-genes-3509739/what-is-de-identified-data.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/behind-the-genes-3509739/episodes/what-is-de-identified-data/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/behind-the-genes-3509739/what-is-de-identified-data.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

In this explainer episode, we’ve asked Georgia Chan, Senior Data Wrangler at Genomics England, to explain what de-identified data is. You can also find a series of short videos explaining some of the common terms you might encounter about genomics on our YouTube channel. If you’ve got any questions, or have any other topics you’d like us to explain, let us know on [email protected]. You can download the transcript or read it below. Florence: What do we mean by de-identified data? My name is Florence Cornish, and today I'm here with Georgia Chan. Georgia is Senior Data Wrangler here at Genomics England, which just means that she cleans up and adds structure to complicated data so that it becomes usable, and she is going to be telling us much more about the topic of de-identified data. Georgia, I think it would be a good place to start by talking about the National Genomic Research Library, which is the library that we at Genomics England store data in. So maybe you could explain more about that and what kind of data is in there. Georgia: Sure. Thanks Florence. So, we have genomic data. Genomic data is information that comes from a person's DNA. It helps us understand how the body works and why disease happens. This can include whole genome sequencing data, variants found in genes, small differences that make each of us unique, and information about how genes function or how they differ between people. Genomic data does not include a person's name or who they are. It's biological information, not identity, and it's used to understand health and disease. It's really important to note that by nature, it's nature, genomic information is incredibly rich. We all have millions of common genetic variants, but your whole genome is unique to you. So although genomic d…