Challenges with Data Quality, Sharing, and Versioning in Next-Generation Sequencing

By Iddo on March 27th, 2009

An fine talk by David Dooling highlighting some of the false impressions about second generation sequencing. A partial list:

Why sequencing quality trump base pair output
Why genomes are really probabilities rather than strings
Why centralized repositories break down when it comes to second generation sequencing data.
Collaborative Software development and versioning has been moving from centralized repositories like SVN to distributed repositories like git. What can the designers of genomic databases learn from that?

Anybody interested in second generation sequencing, and especially in the bioinformatics involved should clear 27 minutes of their time and listen. Very much worth it. Thanks to Anil Thomas for pointing this out.

View more presentations from David Dooling.

Share and Enjoy: