Displaying posts tagged with


The Bio* projects: a history in graphs

Yesterday I received an email from Kristjan Liiva, a student at  RWTH Aachen University Germany. Kristjan has developed a really cool dashboard to analyze and visualize the development of collaborative OSS projects by mining their mailing lists and software repositories.  (If the link doesn’t work, try again later; the project is heavily under development). The […]

Short note on getting students busy

I recently read this post about lacunae in  Bioinformatics.  One complaint was: I know that documentation is a thankless task. But some parts of the Bio[Java|Perl|Python] libraries are described only as an API? This became apparent to me when I had to teach the libraries to students. What does this module do and why does it do […]

And I should go because?

Found this in my inbox: Dear Dr.Iddo Friedberg,     Greeting from OMICS Group! I came across your contribution entitled “Biopython: freely available Python tools for computational molecular biology and bioinformatics” published in the Journal of Bioinformatics and thought your expertise would be an excellent fit for Toxicology-2012 Conference that OMICS Group is hosting.   I’m just wondering how many legitimate calls for […]

Short bioinformatics hacks: reading mate-pairs from a fastq file

If you have a merged file of paired-end reads, here is a quick way to read them using Biopython: from Bio import SeqIO from itertools import izip_longest # Loop over pairs of reads readiter = SeqIO.parse(open(inpath), “fastq”) for rec1, rec2 in izip_longest(readiter, readiter): print rec1.id # do something with rec1 print rec2.id # do something […]

The Friedberg Lab is Recruiting Graduate Students

  The Friedberg Lab is recruiting graduate students, for both Master’s and Ph.D. WE ARE:  A dynamic young lab  interested in gene, gene cluster and genome evolution, understanding microbial communities and microbe-host interactions by metagenomic analyses, developing algorithms for understanding gene cluster evolution, and prediction of protein function from protein sequence and structure. YOU ARE: […]

Bioinformatics Open Source Conference 2010 (and a poll)

The 11th Annual Bioinformatics Open Source Conference (BOSC) 2010 is coming up in Boston, July 9-10 2010. The BOSC meetings are a great get-together of a community of programmers who are like-minded in their advocacy of open source code for science, and specifically for bioinformatics. The whole thing is run by volunteers who take a […]

Closing gaps

Geek alert: this post for coders. So you sequenced your genome, reached an optimally small number of contigs, they look sane, and now you would like to see what you need for the finishing stage. Namely, how many gaps you have and what are their sizes. UPDATE: “might just be worth clarifying this is for […]

Short bioinformatics hacks, ch. 2: chunk it.

First, a non-bioinformatic one liner, which is very relevant to most of us working on 3 different machines simultaneously, not including the 80 in our cluster. ssh-ing and giving your password each time is painful, and makes it almost impossible to do scripted file transfers, like backups. A good solution is shared key ssh in […]

Ten years of coding with the snake

Biopython is entering its 10th year; the unofficial birthday is on September, since that is when the mailing list started: September 1999. I stumbled onto that list mid-September, 1999. I believe the Python version was 1.5, I was still working on SGI Irix, and I was an 0.3 PhD candidate. Today I am coding with […]