Displaying posts categorized under

Bioinformatics

A FLORA of Protein Structure to Protein Function

Proteins are the machinery of life, and they facilitate most of life’s functions. Traffic into and out of the cell? Protein pumps, pores and channels. Respiration? Proteins. Metabolism and catabolism? Proteins. Immune system, signaling, development…  all complex networks of interacting proteins. Understanding a protein’s  structure can tell us a lot about how it performs its […]

Short bioinformatics hacks pt. 3: more FASTA counting

A few one-liners to kick off the workweek: To order a set of fasta files by the number of sequences each one contains. If anyone knows how to put a tab as the output delimiter, please let us know: grep -c “>” fasta-files/*.fna | cut –fields=1,2 -d “:” –output-delimiter=”  ” | sort -k 2 -nr […]

Absolut standards: report from the M3-2009 meeting, part 2: signature genes and big science

Some more presentations from the metagenomics, metadata, and metaanalysis (M3) meeting, Stockholm June 27, 2009 Pathway Signature Genes Lucas A. Brouwers, Martijn A. Huynen and Bas E. Dutilh CMBI / NCMLS, Radboud University Nijmegen Medical Centre, The Netherlands If we take a sample of soil, how can we know whether it is adequate for growing […]

A Flurry of Red and Green

UPDATE: I submitted this post to the National Evolutionary Synthesis Center’s sponsored contest for a travel award to ScienceOnline2010. Let’s see how it goes… #scio10 In a previous post about Hatena we saw what might very well be the beginning of a (beautiful?) [:ttip=”symbiosis where one partner lives inside the cell of the other” id=”10″]endosymbiotic[:/ttip] […]

(Not Only) Microblogging ISMB 2009

Intelligent Systems in Molecular Biology (ISMB) is a large international gathering of  computational biologists, mostly from the bioinformatics side: genomics, structural bioinformatics, computational genomics, etc. This year there is a friendfeed room for microblogging ISMB 2009. So if you are not in Stockholm, or also if you are, look it up. Most of the microbloggers […]

Short bioinformatics hacks, ch. 2: chunk it.

First, a non-bioinformatic one liner, which is very relevant to most of us working on 3 different machines simultaneously, not including the 80 in our cluster. ssh-ing and giving your password each time is painful, and makes it almost impossible to do scripted file transfers, like backups. A good solution is shared key ssh in […]

Skin flick

Interesting report in Science today about the human skin metagenome. The skin is a fairly large organ, and it is home to an estimated 1012 bacteria. It is the first barrier our body poses against pathogens, toxins, and sarcastic comments.  An adult’s skin area is about 2m2, virtually all of it exposed to the outside […]

Short Bioinformatics Hacks, ch. 1

In any programming gig, and that includes bioinformatics, a lot of repeat scriptology comes cropping up. I decided to share some of that, pro publico bono, and also because I hope to start some sort of ongoing cookbook  for short bioinformatics hacks. If you have any cool short scripts you like to share, please email […]

Test driving the Wolfram Alpha

There has been a lot of buzz recently about Wolfram’s new product, the Wolfram Alpha (WA). After attending a webinar on WA, I was given a preview account, and started messing around with it.  In case you were wondering, that is the extent of my involvement with Wolfram Research, LLC, I don’t even have a […]

The Seven Deadly Sins and Seven Heavenly Virtues of Scientific Websites

When I say here “scientific websites”, I am not referring to education sites, science blogs, or scientific journal web sites. I am talking about sites scientists use for their day to day research. Sites like Entrez, EBI, FlyBase, ExPasy, PDB etc. The sites I just mentioned I deem quite virtuous, but there are many sinful […]

The Human Genome Variome Project and Google News Reader

Apparently sequencing two white males of European extraction does not make for a very good sample of mankind, and that if we really want to get a good view of what we are really like, we need to sequence a couple more. Maybe even, you know, a woman, or someone from India or China or […]

Challenges with Data Quality, Sharing, and Versioning in Next-Generation Sequencing

An fine talk by David Dooling highlighting  some of the false impressions about second generation sequencing. A partial list: Why sequencing quality trump base pair output Why genomes are really probabilities rather than strings Why centralized repositories break down when it comes to second generation sequencing data. Collaborative Software development and versioning has been moving […]

NIH Stimulus money: what is in it for Bioinformatics?

Following  Shirley Wu’s excellent post on the stimulus money at the NIH, I decided to do my bit, and post some bioinformatically relevant programs from the  Challenge Grants. I am defining bioinformatics rather narrowly here, and excluding most biomedical informatics, imaging technologies, clinical data management, etc. Also, many other topics would be supported to some […]