Challenges with Data Quality, Sharing, and Versioning in Next-Generation Sequencing
An fine talk by David Dooling highlighting some of the false impressions about second generation sequencing. A partial list:
- Why sequencing quality trump base pair output
- Why genomes are really probabilities rather than strings
- Why centralized repositories break down when it comes to second generation sequencing data.
- Collaborative Software development and versioning has been moving from centralized repositories like SVN to distributed repositories like git. What can the designers of genomic databases learn from that?
Anybody interested in second generation sequencing, and especially in the bioinformatics involved should clear 27 minutes of their time and listen. Very much worth it. Thanks to Anil Thomas for pointing this out.
Challenges with Data Quality, Sharing, and Versioning in Next-Generation Sequencing
View more presentations from David Dooling.
Comments are closed.