<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Byte Size Biology &#187; Science</title>
	<atom:link href="http://bytesizebio.net/index.php/category/science/feed/" rel="self" type="application/rss+xml" />
	<link>http://bytesizebio.net</link>
	<description>The musings and ravings of a computational biologist about science, computers, music and, you know, stuff</description>
	<lastBuildDate>Fri, 18 May 2012 18:10:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Job opening: Scientific Curator at the Jackson Laboratory</title>
		<link>http://bytesizebio.net/index.php/2012/05/18/job-opening-scientific-curator-at-the-jackson-laboratory/</link>
		<comments>http://bytesizebio.net/index.php/2012/05/18/job-opening-scientific-curator-at-the-jackson-laboratory/#comments</comments>
		<pubDate>Fri, 18 May 2012 18:09:13 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[curator]]></category>
		<category><![CDATA[gene annotation]]></category>
		<category><![CDATA[genome annotation]]></category>
		<category><![CDATA[Jackson Lab]]></category>
		<category><![CDATA[jobs]]></category>
		<category><![CDATA[mouse]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=6130</guid>
		<description><![CDATA[Scientific Curator – Bioinformatics Interested individuals should apply on-line at www.jax.org/careers, referring to job posting #3256.  Contact Jeannine Ross at ext. 6045 with questions. The incumbent in this position plays a critical role in data annotation and curation for the Gene Ontology (GO) and Protein Ontology (PRO) programs at The Jackson Laboratory in Bar Harbor [...]]]></description>
			<content:encoded><![CDATA[<blockquote>
<div><strong>Scientific Curator – Bioinformatics</strong></div>
<div>
<p>Interested individuals should apply on-line at <a href="http://www.jax.org/careers" target="_blank">www.jax.org/careers</a>, referring to job posting #3256.  Contact Jeannine Ross at ext. 6045 with questions.</p>
</div>
<div>
<p>The incumbent in this position plays a critical role in data annotation and curation for the Gene Ontology (GO) and Protein Ontology (PRO) programs at The Jackson Laboratory in Bar Harbor, Maine, through diverse activities to gather, analyze, evaluate and integrate information and analysis results using biomedical ontologies.  Activities include, but are not limited to, obtaining data via literature or electronic-based means, determining data object identity/uniqueness, judging information or analyses for appropriateness of incorporation into GO and PRO resources, and evaluating and applying biomedical ontologies.  This individual must keep abreast of new scientific developments that are relevant to functional genomics, and should attend group meetings and seminars, as well as present posters/platform sessions at conferences.  Team participation in project development and software testing is expected, as well as collaborations with outside research groups and international bioinformatics communities.  Assisting with training new curation staff, authoring project proposals, and writing/maintaining curation documentation are some of the additional roles that may be played by scientific curators.</p>
<p>Required:</p>
<p>·       advanced knowledge in mouse as an experimental organism</p>
<p>·       expert knowledge in specific data areas of biochemistry as well as functional and comparative genomics</p>
<p>·       broad understanding of database principles, biomedical ontologies, and skills with computational analysis techniques and data interpretation</p>
<p>·       exceptional communication and organizational skills</p>
<p>Experience/Education:</p>
<p>·       requires a Doctoral degree in the Life Sciences, and</p>
<p>·       a minimum of 1 – 3 years of experience</p>
</div>
</blockquote>
<p>&nbsp;</p>
<div id="attachment_6131" class="wp-caption alignnone" style="width: 394px"><a href="http://bytesizebio.net/wp-content/uploads/2012/05/mouse-annotations.jpg"><img class=" wp-image-6131" title="mouse-annotations" src="http://bytesizebio.net/wp-content/uploads/2012/05/mouse-annotations.jpg" alt="" width="384" height="288" /></a><p class="wp-caption-text">Credit: Mr.Thomas, Flickr</p></div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/05/18/job-opening-scientific-curator-at-the-jackson-laboratory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crowdsourcing Genomics II: Unveiling HINdeR and Phrux</title>
		<link>http://bytesizebio.net/index.php/2012/05/11/crowdsourcing-genomics-ii-unveiling-hinder-and-phrux/</link>
		<comments>http://bytesizebio.net/index.php/2012/05/11/crowdsourcing-genomics-ii-unveiling-hinder-and-phrux/#comments</comments>
		<pubDate>Fri, 11 May 2012 18:59:34 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Molecular biology]]></category>
		<category><![CDATA[bacteriophages]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[genomics]]></category>
		<category><![CDATA[viruses]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=6118</guid>
		<description><![CDATA[About this time last year, I posted about a new course I was going to teach, Phage Genomics. Briefly: Phage isolation, electron microscopy, DNA sequencing in the first semester, annotation and comparative genomics in the second. And I get to teach the bioinformatics bit: annotation and comparative genomics. Woo-hoo! The great thing about this course, [...]]]></description>
			<content:encoded><![CDATA[<p>About this time last year, I <a href="http://bytesizebio.net/index.php/2011/05/19/crowdsourcing-genomics/">posted</a> about a new course I was going to teach, Phage Genomics. Briefly:</p>
<blockquote><p>Phage isolation, electron microscopy, DNA sequencing in the first semester, annotation and comparative genomics in the second. And <em>I </em>get to teach the bioinformatics bit: annotation and comparative genomics. Woo-hoo! The great thing about this course, is that unlike most lab courses, the students (and faculty) will be setting up experiments intended not only to teach, but also to discover something new.  Also, the results of the research are meaningful. Genomics data generated by student participants will be used by other researchers to answer medical, ecological, and evolutionary scientific questions</p></blockquote>
<p>The students isolated, sequenced and annotated two previously unknown mycobacteriophages, <a href="http://phagesdb.org/phages/HINdeR/" target="_blank">HINdeR</a> and <a href="http://phagesdb.org/phages/Phrux/" target="_blank">Phrux</a>. The links are to the Mycobacteriophage Database <a href="http://phagesdb.org/" target="_blank">phagesdb.org</a> where the sequences and associated metadata (where and when HINdeR and Phrux were found and isolated) can be found. The annotations will be there shortly.</p>
<p>I had a great time teaching this course, together with <a href="http://microbiology.muohio.edu/people/balish.html" target="_blank">Mitch Balish</a> from my department, who is not only a great teacher, but shares my vice for keeping the students guessing when we are being serious and when we are kidding.  Mitch is the guy with the goatee in the short sleeved shirt; I&#8217;m the one in the black sweatshirt. Here&#8217;s what the students had to say about the course (<a href="http://www.cas.muohio.edu/phages.html" target="_blank">original site at Miami University</a>). Mitch starts talking at 2:57, I&#8217;m at 4:08, Gary Janssen (who taught the first semester) is at 5:08:<br />
<iframe width="420" height="315" src="http://www.youtube.com/embed/Su5HqVzi8f0?rel=0" frameborder="0" allowfullscreen></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/05/11/crowdsourcing-genomics-ii-unveiling-hinder-and-phrux/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Repost: the Scope(s) of Substance</title>
		<link>http://bytesizebio.net/index.php/2012/05/05/repost-the-scopes-of-substance/</link>
		<comments>http://bytesizebio.net/index.php/2012/05/05/repost-the-scopes-of-substance/#comments</comments>
		<pubDate>Sat, 05 May 2012 23:13:28 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[blogging]]></category>
		<category><![CDATA[Evolution]]></category>
		<category><![CDATA[creationism]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[repost]]></category>
		<category><![CDATA[teaching]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=6101</guid>
		<description><![CDATA[This tweet from Neil deGrasse Tyson jolted me from a pleasant rest before tomorrow&#8217;s race: &#160; &#8230;which led to the (in)famous Scopes Trial. On May 5, 1925 John Scopes was charged and subsequently tried, found guilty, and fined $100 for teaching Evolution, a violation of Tennessee&#8217;s Butler Act. The trial became a battleground for science [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://bit.ly/IMbZuy" target="_blank">This tweet</a> from Neil deGrasse Tyson jolted me from a pleasant rest <a href="http://www.flyingpigmarathon.com/race_information/schedule/half.shtml" target="_blank">before tomorrow&#8217;s race</a>:</p>
<p><a href="http://bytesizebio.net/wp-content/uploads/2012/05/evo-neil.png"><img class="alignnone  wp-image-6104" title="evo-neil" src="http://bytesizebio.net/wp-content/uploads/2012/05/evo-neil.png" alt="" width="466" height="146" /></a></p>
<p>&nbsp;</p>
<p>&#8230;which led to the (in)famous <a href="http://en.wikipedia.org/wiki/Scopes_Trial" target="_blank">Scopes Trial</a>. On May 5, 1925 John Scopes was charged and subsequently tried, found guilty, and fined $100 for teaching Evolution, a violation of Tennessee&#8217;s <a href="http://en.wikipedia.org/wiki/Butler_Act" target="_blank">Butler Act</a>. The trial became a battleground for science vs. religion, evolution vs. creationism, and the interpretation of the <a href="http://en.wikipedia.org/wiki/Establishment_Clause" target="_blank">Establishment Clause</a> and <a href="http://en.wikipedia.org/wiki/Freedom_of_speech_in_the_United_States" target="_blank">Freedom of Speech</a> in the US constitution.</p>
<p>I published a blog post two years ago, in July 2010, on the 85th anniversary of the trial. Today marks the 87th anniversary of the arrest, so it seems like a good occasion to repost. Especially since there is still some work needed in the area of teaching evolution:</p>
<div id="attachment_6106" class="wp-caption alignnone" style="width: 610px"><a href="http://bytesizebio.net/wp-content/uploads/2012/05/1000px-Views_on_Evolution.svg_.png"><img class=" wp-image-6106" title="1000px-Views_on_Evolution.svg" src="http://bytesizebio.net/wp-content/uploads/2012/05/1000px-Views_on_Evolution.svg_.png" alt="" width="600" height="450" /></a><p class="wp-caption-text">Source Wikimedia Commons. Credit: John D. Croft. Based on: New Scientist Magazine 2006 191:2565 p11</p></div>
<p>&nbsp;</p>
<p>What follows is the original post, &#8220;The Scope(s) of Substance&#8221;, from July 29, 2010. Still relevant, I believe:</p>
<hr />
<p>&nbsp;</p>
<p><a href="http://blog.coturnix.org/">Bora Zivkovic</a>, the BUCA (Best Universal Common Ancestor) of science bloggers, has <a href="http://blog.coturnix.org/2010/07/23/blogging-with-substance/" target="_blank">tagged</a> this blog with a Blog of Substance award. As a grateful recipient of this award I am obligated to do two things:<br />
<em>1. Sum up my blogging motivation, philosophy and experience in exactly 10 words.<br />
2. Pass this award on to 10 other blogs.</em></p>
<p>Of course, I never do anything without researching it first, because I am such an awesome scientist, or detail-oriented !@#*^, depending on whether you ask me or my students. So I looked up &#8220;substance&#8221; in the Merriam-Webster dictionary. Here is what I found:</p>
<blockquote><p>Main Entry: sub·stance<br />
Pronunciation: \ˈsəb-stən(t)s\<br />
Function: noun<br />
Etymology: Middle English, from Anglo-French, from Latin substantia, from substant-, substans, present participle of substare to stand under, from sub- + stare to stand — more at stand<br />
Date: 14th century</p>
<p>1 a : essential nature : essence b : a fundamental or characteristic part or quality c Christian Science : god 1b<br />
2 a : ultimate reality that underlies all outward manifestations and change b : practical importance : meaning, usefulness<br />
3 a : physical material from which something is made or which has discrete existence b : matter of particular or definite chemical constitution c : something (as drugs or alcoholic beverages) deemed harmful and usually subject to legal restriction</p>
<p>4 : material possessions : property</p></blockquote>
<p>Hmmm&#8230; 2a and 2b seem to be relevant. Perhaps 3c should be too, as my blogging could be construed as harmful to other more productive activities, which I am obviously not engaged in at this moment. Actually you, gentle reader, are not engaged in more productive activities either right now. Be that as it may, the word <em>substance</em> does seem to have an air of permanence about it, which is contrary to the perceived ephemeral nature of blogging. Bora is actually one of the people who are doing something about making blogs less ephemeral by publishing <a href="http://www.amazon.com/s/qid=1280419877/ref=a9_sc_1?ie=UTF8&amp;search-alias=us-stripbooks-tree&amp;field-keywords=the open laboratory 2009" target="_blank">The Open Laboratory</a> collection (full disclosure: I&#8217;m published in the 2009 book) and by supporting science bloggers, blogging and activities wherever they may be. This makes me so happy to be among Bora&#8217;s chosen 10 (OK, 11, he cheated a bit) among the hundreds of blogs he must be reading. Thanks Bora!</p>
<p>I do wonder though, eighty-five years from now, how many of us science bloggers will be remembered for our blogging? Well, maybe not as individuals, but what kind of impact are we having now, and how much of it will remain 85 years from now? Hopefully as a collective, science bloggers are impacting the understanding of science, which is one of the reasons I am blogging. Hopefully, we do have substance, as a group if not as individuals.</p>
<p>Why eighty-five years? Well, the answer to that brings me to the main topic (substance?) of this post, which is the anniversary of the <a href="http://en.wikipedia.org/wiki/Scopes_Trial">Scopes trial</a>. This month, 85 years ago, a schoolteacher in Tennessee was convicted of a high misdemeanor for violating the State of Tennessee&#8217;s Butler Act, which prohibited the teaching of evolution in any of the state&#8217;s public schools and universities. He was fined $100.</p>
<blockquote>
<p style="text-align: center;"><strong><span style="font-size: xx-small;">PUBLIC ACTS</span></strong></p>
<p style="text-align: center;"><span style="font-size: xx-small;">OF THE</span></p>
<p style="text-align: center;"><strong><span style="font-size: xx-small;">STATE OF TENNESSEE</span></strong></p>
<p style="text-align: center;"><span style="font-size: xx-small;">PASSED BY THE</span></p>
<p style="text-align: center;"><strong><span style="font-size: xx-small;">SIXTY &#8211; FOURTH GENERAL ASSEMBLY</span></strong></p>
<div style="text-align: center;"><strong><span style="font-size: xx-small;">1925</span></strong></div>
<p>________</p>
<p><span style="font-size: xx-small;">CHAPTER NO. 27</span></p>
<p><span style="font-size: xx-small;">House Bill No. 185</span></p>
<p>(By Mr. Butler)</p>
<p>AN ACT prohibiting the teaching of the Evolution Theory in all the Universities, Normals and all other public schools of Tennessee, which are supported in whole or in part by the public school funds of the State, and to provide penalties for the violations thereof.</p>
<p>Section 1. <em>Be it enacted by the General Assembly of the</em> <em>State of Tennessee</em>, That it shall be unlawful for any teacher in any of the Universities, Normals and all other public schools of the State which are supported in whole or in part by the public school funds of the State, to teach any theory that denies the story of the Divine Creation of man as taught in the Bible, and to teach instead that man has descended from a lower order of animals.</p>
<p>Section 2. <em>Be it further enacted</em>, That any teacher found guilty of the violation of this Act, Shall be guilty of a misdemeanor and upon conviction, shall be fined not less than One Hundred $ (100.00) Dollars nor more than Five Hundred ($ 500.00) Dollars for each offense.</p>
<p>Section 3. <em>Be it further enacted</em>, That this Act take effect from and after its passage, the public welfare requiring it.</p>
<p>Passed March 13, 1925</p>
<p>W. F. Barry,</p>
<p><em>Speaker of the House of Representatives</em></p>
<p>L. D. Hill,</p>
<p><em>Speaker of the Senate</em></p>
<p>Approved March 21, 1925.</p>
<p>Austin Peay,</p>
<p><em>Governor.</em></p></blockquote>
<p>Seems incredible in this day and age&#8230; or maybe not so incredible given <a href="http://ncse.com/news/2010/07/creationist-rumblings-louisiana-005799" target="_blank">recent events in Louisiana</a>.</p>
<div id="attachment_3894" class="wp-caption alignnone" style="width: 307px"><a href="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE19.jpg"><img class="size-full wp-image-3894" title="SCOPE19" src="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE19.jpg" alt="" width="297" height="355" /></a><p class="wp-caption-text">William Jennings Bryan, counsel for the prosecution, attacking evolution</p></div>
<p><a href="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE14.jpg"><img class="alignnone size-full wp-image-3895" title="SCOPE14" src="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE14.jpg" alt="" width="298" height="423" /></a></p>
<div id="attachment_3896" class="wp-caption alignnone" style="width: 299px"><a href="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE18.jpg"><img class="size-full wp-image-3896" title="SCOPE18" src="http://bytesizebio.net/wp-content/uploads/2010/07/SCOPE18.jpg" alt="" width="289" height="389" /></a><p class="wp-caption-text">The city of Dayton as the organ grinder profiting from the Scopes trial</p></div>
<p>The trial, which originated as something of a publicity affair for the town of <a href="http://en.wikipedia.org/wiki/Dayton,_Tennessee" target="_blank">Dayton, Tennessee</a>, quickly became a battleground for evolution vs. creation. In the short term, the trial actually increased the number of anti-evolution bills proposed in different state legislatures in the US. In the long term, however, <em>Tennessee vs. Scopes</em> is seen as a watershed moment in the teaching and public acceptance of evolution, and has had long-term ramifications in the US and internationally. Scopes himself spoke only once at the trial, was not called to testify, and only had this to say when granted a statement after sentence was passed:</p>
<blockquote><p>Your honor, I feel that I have been convicted of violating an unjust statute. I will continue in the future, as I have in the past, to oppose this law in any way I can. Any other action would be in violation of my ideal of academic freedom — that is, to teach the truth as guaranteed in our constitution, of personal and religious freedom. I think the fine is unjust.</p></blockquote>
<p>Now <span style="text-decoration: underline;">that</span> is substance.</p>
<p>Back to the award; I still have some conditions to fulfill:</p>
<p><em>1. Sum up your blogging motivation, philosophy and experience in exactly 10 words.</em></p>
<p><sup>1</sup>Blogging <sup>2</sup>motivation, <sup>3</sup>philosophy <sup>4</sup>and <sup>5</sup>experience <sup>6</sup>cannot <sup>7</sup>be <sup>8</sup>summed <sup>9</sup>in <sup>10</sup>ten <span style="text-decoration: line-through;"><sup>11</sup>words</span>.</p>
<p>2. <em>Pass this award on to 10 other blogs</em></p>
<p>Given the 10<sup>n</sup> growth rate of tagged blogs, chain-letter fashion, I wonder how this Blogging with Substance award originated. Search engines were no help, as so many blogs are now tagged with the Blogging with Substance award. If someone has an answer, let me know. Anyhow, here are my 10 tags, based on what I am reading nowadays, ephemerality of blogging substance, and all that jazz. Tough choices though, so many good blogs out there:</p>
<p>1. <a href="http://bcbio.wordpress.com/">Blue Collar Bioinformatics</a></p>
<p>2. <a href="http://sandwalk.blogspot.com/">Sandwalk</a></p>
<p>3. <a href="http://www.lucasbrouwers.nl/blog/">Thoughtomics</a></p>
<p>4. <a href="http://blogs.discovermagazine.com/loom/">The Loom</a></p>
<p>5. <a href="http://scienceblogs.com/mikethemadbiologist/">Mike the Mad Biologist</a></p>
<p>6. <a href="http://genome.fieldofscience.com/">Genomics, Evolution and Pseudoscience</a></p>
<p>7. <a href="http://www.pawelszczesny.org/">Circle of Complexity</a></p>
<p>8. <a href="http://larsjuhljensen.wordpress.com/">Buried Treasure</a></p>
<p>9. <a href="http://phylogenomics.blogspot.com">The Tree of Life</a></p>
<p>10. <a href="http://www.iayork.com/MysteryRays/">Mystery Rays from Outer Space</a></p>
<p>Final word: if this post seems a bit confused, and you are not sure that you are &#8220;getting it&#8221;, well, that&#8217;s this post&#8217;s substance.</p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/05/05/repost-the-scopes-of-substance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Inside Poop</title>
		<link>http://bytesizebio.net/index.php/2012/05/04/the-inside-poop/</link>
		<comments>http://bytesizebio.net/index.php/2012/05/04/the-inside-poop/#comments</comments>
		<pubDate>Fri, 04 May 2012 17:20:38 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Health]]></category>
		<category><![CDATA[Metagenomics]]></category>
		<category><![CDATA[Microbiology]]></category>
		<category><![CDATA[baby health]]></category>
		<category><![CDATA[breastfeeding]]></category>
		<category><![CDATA[gut microbiome]]></category>
		<category><![CDATA[infant health]]></category>
		<category><![CDATA[infants]]></category>
		<category><![CDATA[metagenomics]]></category>
		<category><![CDATA[microbiology]]></category>
		<category><![CDATA[microbiome]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=6061</guid>
		<description><![CDATA[It&#8217;s pretty much common knowledge that mother&#8217;s milk is the healthiest food for infants, and that it bestows health benefits upon mother and baby that formula feeding cannot match. The unique combination of lipids, sugars, proteins and antibodies is not even close to being rivaled by baby formula manufacturers. With few exceptions, such as when [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s pretty much common knowledge that mother&#8217;s milk is the healthiest food for infants, and that it bestows health benefits upon mother and baby that formula feeding cannot match. The unique combination of lipids, sugars, proteins and antibodies is not even close to being rivaled by baby formula manufacturers. With few exceptions, such as when there is a concern that the mother is contagious and may infect the baby, breastmilk is the recommended diet for infants.</p>
<p>As I am interested in things microbiological, I have been especially interested in the effect of breastmilk on the baby gut and gut microbiota. There have actually been quite a few studies on that, but most of these studies were about the gut microbiota only. However,  we can&#8217;t really separate our gut from the microbes that reside in it. The bacteria in the human gut affect the gut (and, in turn, the entire body) and are affected by it. The gut is really a superorgan, composed of a minority of human cells, and 10<sup>14</sup> bacterial cells. Most of the gut is actually bacteria, not human, but the part that is human is important, since, well, it&#8217;s &#8220;us&#8221;. (Well, kinda hard to tell now which &#8220;us&#8221; is &#8220;us&#8221; and which &#8220;us&#8221; is &#8220;the bacteria that live in us&#8221;.) To understand what goes on there we need to study both bacterial and human cells. While adult microbiota+gut systems have been studied, mostly for the effect of probiotics, there have not been studies of baby guts because you cannot perform consented invasive procedures on babies. In other words, you cannot scrape their colons for gut lining, or epithelial, cells. So there has not been much of an opportunity to study the gut epithelium+microbiome in human infants.</p>
<p>The opportunity came with Robert (&#8220;Robb&#8221;) Chapkin from Texas A&amp;M University, and Sharon Donovan from the University of Illinois at Urbana-Champaign. Robb has developed a system to isolate gut epithelial cells from the feces. We shed millions of cells from our gut when we defecate, and Robb&#8217;s lab has a way to fish those gut lining cells out of the stool. Thus, we can sequence the mRNA, and find out which genes are transcribed in the baby gut. At the same time, we can analyze the baby&#8217;s microbiome. Enter Sharon Donovan&#8217;s lab, which studied 12 babies: six breastfed and six formula-fed.</p>
<p>This is where Robb contacted me, and generously invited me to College Station, Texas about a year and a half ago. Aside from enjoying Texan hospitality (big steaks) and meeting people, Robb brought me into this fascinating study. They needed a bioinformatician to help analyze the gut transcriptome and gut metagenome data. I am very glad they contacted me, since this started a very enjoyable collaboration and a scientific journey whose results are published this week in Genome Biology. I was put in touch with two great statisticians, Ivan Ivanov and Scott Schwartz, also at Texas A&amp;M. We put our heads together, and came up with a strategy.</p>
<div id="attachment_6078" class="wp-caption alignnone" style="width: 478px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/flowchart.png"><img class=" wp-image-6078 " title="flowchart" src="http://bytesizebio.net/wp-content/uploads/2012/04/flowchart.png" alt="" width="468" height="376" /></a><p class="wp-caption-text">Analysis flowchart. Reproduced from Genome Biology 2012, 13:R32 doi:10.1186/gb-2012-13-4-r32 under BMC CC2.0 license. Click to enlarge.</p></div>
<p>&nbsp;</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span></p>
<p>First, we analyzed the microbiome data using several standard pipelines: <a href="http://metagenomics.anl.gov/" target="_blank">MG-RAST</a> for function analysis (thanks to the folks at Argonne National Lab for making MG-RAST happen and for all their support), and <a href="http://www.cbcb.umd.edu/software/phymm/" target="_blank">PhymmBL</a> and <a href="http://greengenes.lbl.gov/cgi-bin/nph-index.cgi" target="_blank">GreenGenes</a> for taxonomic analysis. The gut transcriptome data were already available, as part of a previous study. Our next step was to look at the distribution of bacterial phyla in the babies, and to ask whether the types of bacteria they had in their guts had anything to do with their diet.</p>
<p>So here is what we found. First, most breastfed babies had a greater variety of bacterial phyla between them than formula-fed babies. Probably because the formula babies were all fed the same diet, whereas breastmilk composition varies between women. Second, the breastfed babies were richer in gram negative bacteria. Those are bacteria with a thin cell wall and a double cell membrane, and they have certain features that the gram positives (thick cell wall, single membrane) do not have. Also, almost all breastfed babies had a richer gut ecosystem.</p>
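<p>For readers who want something concrete: one common way to put a number on how &#8220;rich&#8221; a community is, is the Shannon diversity index. The sketch below is only an illustration, not the analysis from the paper; the function name and the read counts are made up for the example.</p>
<pre>
```python
import math

def shannon_diversity(counts):
    """Shannon index H = -sum(p * ln(p)) over phyla with nonzero counts."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads to compute diversity from")
    h = 0.0
    for n in counts.values():
        if n != 0:
            p = n / total          # relative abundance of this phylum
            h -= p * math.log(p)   # each phylum adds -p * ln(p)
    return h

# Hypothetical read counts per phylum (NOT data from the study).
even_gut = {"Firmicutes": 25, "Actinobacteria": 25,
            "Proteobacteria": 25, "Bacteroidetes": 25}
skewed_gut = {"Firmicutes": 97, "Actinobacteria": 1,
              "Proteobacteria": 1, "Bacteroidetes": 1}

print("even community:  ", round(shannon_diversity(even_gut), 3))
print("skewed community:", round(shannon_diversity(skewed_gut), 3))
```
</pre>
<p>A community spread evenly over several phyla scores higher than one dominated by a single phylum, which is the intuition behind calling one gut ecosystem richer than another.</p>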
<p>&nbsp;</p>
<div id="attachment_6088" class="wp-caption alignnone" style="width: 541px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/phylo.png"><img class=" wp-image-6088 " title="phylo" src="http://bytesizebio.net/wp-content/uploads/2012/04/phylo.png" alt="" width="531" height="249" /></a><p class="wp-caption-text">Firmicutes and Actinobacteria are gram+; Proteobacteria and Bacteroidetes are gram-. FF-formula fed babies, BF-breastfed babies. Genome Biology 2012, 13:R32 doi:10.1186/gb-2012-13-4-r32</p></div>
<p>We then moved on to look at the genetic potential of the gut microbiome: how do the microbial communities differ between the breastfed and bottle-fed babies in terms of what they can do? The strongest difference between breastfed babies and bottle-fed babies was in the presence of virulence genes, and mostly those typical of gram-negatives: Type III &amp; IV secretion systems. There were other differences, such as in carbohydrate processing enzymes. But the kicker was that the differences in the frequency of virulence genes in the microbiome also correlated well with the expression of immunity-related genes in the infant gut epithelial cells.</p>
<div id="attachment_6079" class="wp-caption alignnone" style="width: 606px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/SEED-func1.png"><img class=" wp-image-6079 " title="SEED-func1" src="http://bytesizebio.net/wp-content/uploads/2012/04/SEED-func1.png" alt="" width="596" height="458" /></a><p class="wp-caption-text">Reproduced from Genome Biology 2012, 13:R32 doi:10.1186/gb-2012-13-4-r32</p></div>
<div id="attachment_6085" class="wp-caption alignnone" style="width: 605px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/SEED-func2.png"><img class=" wp-image-6085" title="SEED-func2" src="http://bytesizebio.net/wp-content/uploads/2012/04/SEED-func2.png" alt="" width="595" height="437" /></a><p class="wp-caption-text">Reproduced from Genome Biology 2012, 13:R32 doi:10.1186/gb-2012-13-4-r32</p></div>
<p>&nbsp;</p>
<p>We observed the following: 1. Certain gram negative bacteria are dominant in the breastfed babies. 2. Bacterial genes having to do with virulence were more abundant in the bacterial communities of breastfed babies. 3. When looking closely at those genes, we saw that most of them were the virulence factors typical of gram negative bacteria (OK, not surprising given point 2 above, but a good verification). 4. At the same time, the breastfed babies expressed genes that had to do with immunity in their gut lining (epithelial) cells. The presence of virulence genes, and the expression of immunity genes in the gut epithelium correlated quite strongly (see <strong>B</strong>, below).</p>
<p>&nbsp;</p>
<div id="attachment_6092" class="wp-caption alignnone" style="width: 504px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/intestinal-immunity-genes.png"><img class=" wp-image-6092" title="intestinal-immunity-genes" src="http://bytesizebio.net/wp-content/uploads/2012/04/intestinal-immunity-genes.png" alt="" width="494" height="523" /></a><p class="wp-caption-text">Reproduced from Genome Biology 2012, 13:R32 doi:10.1186/gb-2012-13-4-r32</p></div>
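<p>To give a feel for what &#8220;correlated&#8221; means here, this is a minimal sketch of a plain Pearson correlation between two per-baby measurements. It is a stand-in for illustration only: the multivariate analysis actually used in the paper is not reproduced, and the function name and all numbers are hypothetical.</p>
<pre>
```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("need two non-empty samples of equal length")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / (spread_x * spread_y)

# Hypothetical per-baby values (NOT data from the study):
# relative abundance of virulence genes in the microbiome, and
# expression level of one immunity gene in the gut epithelium.
virulence_abundance = [0.8, 1.1, 1.9, 2.4, 3.0, 3.3]
immunity_expression = [1.0, 1.3, 2.2, 2.1, 3.4, 3.5]

print("r =", round(pearson_r(virulence_abundance, immunity_expression), 3))
```
</pre>
<p>A value near 1 means the two measurements rise together across babies, a value near -1 means one falls as the other rises, and a value near 0 means no linear relationship.</p>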
<p>&nbsp;</p>
<p>Taken together, this tells us that the following scenario may apply: mother&#8217;s milk tends to enrich certain types of gram negative bacteria, and those, in turn, stimulate the babies&#8217; immune system. It&#8217;s as if the mother&#8217;s milk is setting up an immunity boot camp for the breastfed babies.</p>
<p>We got all sorts of feedback and even a bit of <a href="http://www.foxnews.com/health/2012/04/30/breast-feeding-may-help-babies-develop-healthy-mix-gut-bacteria/" target="_blank">media coverage</a> on this study. I was really happy when this study hit Reddit. Reddit is an aggregation site where anyone can submit any kind of story, and the &#8220;redditors&#8221; vote it up or down. Highly voted submissions are more visible, and get discussed more on the site. Generally, a submission that receives many &#8220;upvotes&#8221;, in Reddit parlance, has attracted real interest. (Well, the highest upvotes tend to go to pictures of funny kittens, but still.) The story <a href="http://www.reddit.com/r/science/comments/szt0r/breastfeeding_linked_to_healthy_infant_gut/" target="_blank">made it to the top of the r/science category</a> (also known as a &#8220;subreddit&#8221;) with over 1300 upvotes. I logged in using my real name, and <a href="http://www.reddit.com/r/science/comments/szt0r/breastfeeding_linked_to_healthy_infant_gut/c4ieoza" target="_blank">referred people to another subreddit</a>, called <a href="http://www.reddit.com/r/IAmA/" target="_blank">IAmA</a> (&#8220;I am a&#8230;&#8221;). In this case &#8220;I am a <a href="http://www.reddit.com/r/IAmA/comments/t03l8/iama_scientist_who_worked_on_the_breastfeeding/" target="_blank">scientist who worked on this study, ask me anything</a>&#8221;. There were quite a few <a href="http://www.reddit.com/r/IAmA/comments/t03l8/iama_scientist_who_worked_on_the_breastfeeding/" target="_blank">questions</a>, and it was a very interesting engagement with people about this work. Hopefully, good PR <img src='http://bytesizebio.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  and science communication.</p>
<p>&nbsp;</p>
<hr />
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Genome+Biology&amp;rft_id=info%3Adoi%2F10.1186%2Fgb-2012-13-4-r32&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=A+metagenomic+study+of+diet-dependent+interaction+between+gut+microbiota+and+host+in+infants+reveals+differences+in+immune+response&amp;rft.issn=1465-6906&amp;rft.date=2012&amp;rft.volume=13&amp;rft.issue=4&amp;rft.spage=0&amp;rft.epage=&amp;rft.artnum=http%3A%2F%2Fgenomebiology.com%2F2012%2F13%2F4%2FR32&amp;rft.au=Schwartz%2C+S.&amp;rft.au=Friedberg%2C+I.&amp;rft.au=Ivanov%2C+I.&amp;rft.au=Davidson%2C+L.&amp;rft.au=Goldsby%2C+J.&amp;rft.au=Dahl%2C+D.&amp;rft.au=Herman%2C+D.&amp;rft.au=Wang%2C+M.&amp;rft.au=Donovan%2C+S.&amp;rft.au=Chapkin%2C+R.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology%2CHealth%2CBioinformatics%2C+Computational+Biology%2C+Microbiology+%2C+Nutrition">Schwartz, S., Friedberg, I., Ivanov, I., Davidson, L., Goldsby, J., Dahl, D., Herman, D., Wang, M., Donovan, S., &amp; Chapkin, R. (2012). A metagenomic study of diet-dependent interaction between gut microbiota and host in infants reveals differences in immune response <span style="font-style: italic;">Genome Biology, 13</span> (4) DOI: <a href="http://dx.doi.org/10.1186/gb-2012-13-4-r32" rev="review">10.1186/gb-2012-13-4-r32</a></span></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/05/04/the-inside-poop/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>It&#8217;s a smORF world, after all?</title>
		<link>http://bytesizebio.net/index.php/2012/04/27/its-a-smorf-world-after-all/</link>
		<comments>http://bytesizebio.net/index.php/2012/04/27/its-a-smorf-world-after-all/#comments</comments>
		<pubDate>Fri, 27 Apr 2012 19:25:05 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Biology]]></category>
		<category><![CDATA[Evolution]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[drosophila]]></category>
		<category><![CDATA[fly]]></category>
		<category><![CDATA[genomics]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5774</guid>
		<description><![CDATA[Here is a study that looked for a type of gene that the authors felt was neglected by classic genomic annotation. The research shows how to employ concepts in molecular evolution to validate the existence of these genes. Some background: the first question we ask after assembling a genome is: &#8220;where are the genes&#8221;? Not [...]]]></description>
			<content:encoded><![CDATA[<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span></p>
<p>Here is a study that looked for a type of gene that the authors felt was neglected by classic genomic annotation. The research shows how to employ concepts in molecular evolution to validate the existence of these genes.</p>
<p>Some background: the first question we ask after assembling a genome is: &#8220;where are the genes&#8221;? Not an easy question to answer, since a gene is classically defined as a <em>unit of heredity</em>. It may code for RNA, protein, or sometimes, nothing at all. The actual implementation of the &#8220;unit of heredity&#8221; can take several physical forms, each one of them different. Therefore, the algorithms for finding genes depend on exactly which type of gene one is looking for.</p>
<p>A somewhat more tractable question is: &#8220;where are the open reading frames&#8221;? Open reading frames or ORFs are those stretches of DNA that code for proteins. Indeed, most gene calling software actually identifies ORFs. There are many attributes that go into an ORF calling algorithm: the frequency of the bases (or <em>k-</em>mers of bases) in the suspected coding regions, the signals for the beginning and ends of introns, the existence of non-coding regions that aid transcription such as promoters and enhancers, the location on the chromosome with relation to other ORFs, and the length of the final product. The latter criterion is actually quite important, as many ORF-calling algorithms will discount anything coding for a protein that is shorter than 100 amino acids as being &#8220;too short&#8221;. The reason for employing this length cutoff is that the number of false positives increases dramatically when ORFs coding for proteins shorter than 100aa (or 300 nucleotides) are called. Therefore, most gene-callers just tend to discard any short peptides.</p>
<p>But throwing away the baby with the bathwater is not a good solution, since short peptides are known to be responsible for many of life&#8217;s activities: mating pheromones, small compound transporters, hormones, neurotransmitters and regulation of other proteins&#8217; activities, to name a few. Many of these short peptides are the result of the cleavage of larger proteins, which means that the ORFs encoding them are originally longer than 300bp. But some may actually have their own ORFs, coding only for them. How can we find those small ORFs, or <strong>smORFs</strong>? How many of them are there? Is the number of smORFs large enough to make it worth re-annotating genomes?</p>
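<p>To get a rough feel for why short ORFs produce so many false positives, here is a back-of-the-envelope sketch in Python. It assumes a toy model that does not come from any particular gene caller: bases are independent and equally frequent, so each codon has a 3/64 chance of being a stop codon. The function name <code>prob_open_run</code> is made up for illustration.</p>

```python
def prob_open_run(n_codons, p_stop=3 / 64):
    """Chance that n_codons consecutive random codons contain no stop codon.

    Toy model (an assumption): codons are independent and each one is a
    stop codon with probability p_stop (3 of the 64 codons are stops).
    """
    return (1 - p_stop) ** n_codons


if __name__ == "__main__":
    # Compare short and long stretches of codons.
    for n in (10, 30, 100, 300):
        print(f"{n:>4} codons: {prob_open_run(n):.4g}")
```

<p>Under this toy model, a stop-free run of 10 codons turns up by chance more often than not, while a stop-free run of 100 codons turns up under 1% of the time. Real genomes have skewed base composition, so treat the numbers as an illustration of the trend only.</p>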
<div class="wp-caption alignnone" style="width: 310px"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/1209px-Gene2-plain.svg_.png"><img class="size-medium wp-image-6032" title="1209px-Gene2-plain.svg" src="http://bytesizebio.net/wp-content/uploads/2012/04/1209px-Gene2-plain.svg_-300x254.png" alt="" width="300" height="254" /></a>
<p class="wp-caption-dd">Click to enlarge. Gene Structure. Source: Wikimedia Commons. Credit: Forluvoft</p>
</div>
<p>Emmanuel Ladoukakis from the University of Crete and colleagues from the university of Essex, UK have set up a bioinformatic pipeline to look for smORFs in the <em>Drosophila melanogaster</em> genome. Bear with me, there are a few steps in this pipeline. But there&#8217;s a lot to learn about genomics just from looking at what they did, and why they took those steps.</p>
<p>Here&#8217;s what they did: <strong>1) Find smORF candidates:</strong> they looked for all potential smORFs (starting with a start codon and ending with an in-frame stop codon, 30-300bp long) in those parts of <em>D. melanogaster</em>&#8217;s genome that were annotated as non-coding. To keep things simple, they looked only for intron-less smORFs: smORFs that are encoded consecutively in the DNA. They found 593,586 potential sequences. <strong>2) Remove transposons:</strong> they then removed all those that had a similarity to transposons. Transposons are DNA elements that multiply in the chromosome: something like an internal virus, only usually benign. They may carry bits of other genes they &#8220;grab&#8221; on the way, but they are not functional. They were left with 556,554 sequences. <strong>3) Big step: look for homologs in another fly species:</strong> they then looked for smORFs with similar translated amino-acid sequences in <em>D. pseudoobscura</em>, which diverged from <em>D. melanogaster</em> 25 to 55 million years ago. The reason they looked for similar amino-acid sequences was that if there is selection to conserve a smORF, it would act at the protein level, and not at the DNA level. This step reduced the number of smORF candidates by 93%: from 556,554 down to 43,210. <strong>4) Another big step: keep only global alignments:</strong> by requiring alignments of whole smORF sequences, not only partial local similarities, they were left with 4,561 smORF candidates. This reduced the number of candidates by 72% from step (3). We are now down to 0.8% of the original 593,586 smORF candidates.</p>
<p>Quite a filtering process. Note the huge elimination: 99.2% of all initial smORF candidates are gone. I believe that they decided to sacrifice sensitivity in favor of specificity.</p>
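<p>For the programmers in the audience, step 1 (finding intron-less smORF candidates) can be sketched in a few lines of Python. This is only my minimal illustration, not the authors&#8217; code: the function names are hypothetical, I assume both strands are scanned and that the 30-300bp length includes the start and stop codons, and I make no attempt to restrict the search to regions annotated as non-coding.</p>

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_smorfs(seq, min_len=30, max_len=300):
    """Return (strand, start, end, orf) tuples for candidate smORFs.

    A candidate runs from the first ATG after the previous stop codon to the
    next in-frame stop codon. Coordinates are 0-based, end-exclusive, and are
    given on the strand that was scanned. The length bounds include the start
    and stop codons (an assumption made for this sketch).
    """
    seq = seq.upper()
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # open a new reading frame
                elif start is not None and codon in STOP_CODONS:
                    length = i + 3 - start
                    if max_len >= length >= min_len:
                        hits.append((strand, start, i + 3, s[start:i + 3]))
                    start = None  # close the frame, look for the next ATG
    return hits


if __name__ == "__main__":
    toy = "CC" + "ATG" + "GCT" * 9 + "TAA" + "GG"
    for hit in find_smorfs(toy):
        print(hit)
```

<p>Run over whole chromosomes, a scan like this returns a huge number of candidates, most of them spurious, which is why all the downstream filters are needed.</p>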
<p>So they had 4,561 smORF candidates conserved between two flies. Still, how many ORFs got in by chance? Hard to know, but they continued to rely on evolutionary conservation as a guideline. There may be smORFs that appeared independently in <em>melanogaster</em> and <em>pseudoobscura</em> after they separated 25 to 55 million years ago, but the main evidence for true smORFs would be their evolutionary conservation between the two fly species.</p>
<p>To get even more specific, they now <strong>5) looked for <a href="http://en.wikipedia.org/wiki/Synteny#Shared_synteny">shared synteny</a>:</strong> conservation not only of sequence, but also of the genomic context, that is, the sequences surrounding it. That brought the number down to 3,314.</p>
<p>OK, so they looked for conservation based on homology and based on synteny. Anything more? Well, yes. The next step would be to <strong>6) look for evolutionarily selected smORFs</strong>. The two evolutionary criteria they used until now were homology and synteny. Now comes a third: selection. If smORF candidates are actually coding, they will be subject to purifying selection, that is, to selection that eliminates deleterious mutations. This is evident in a low rate of non-synonymous <em>vs</em>. synonymous substitutions, or a <a href="http://en.wikipedia.org/wiki/Ka/Ks_ratio" target="_blank">Ka/Ks ratio</a> of &lt;&lt; 1. (Read about Ka/Ks ratios also <a href="http://www.sciencedirect.com/science/article/pii/S0168952502027221" target="_blank">here</a>.) <strong>7) Looking at what actually gets transcribed in Drosophila</strong> (using the transcriptome), they whittled this number down to a final <span style="text-decoration: underline;">401</span>.</p>
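<p>To make the synonymous <em>vs</em>. non-synonymous distinction concrete, here is a small Python sketch that classifies codon differences between two aligned coding sequences using the standard genetic code. It is deliberately simplified and is my own illustration: it returns raw counts of differing codons, not a Ka/Ks estimate, which would also require normalizing by the number of synonymous and non-synonymous sites and correcting for multiple substitutions. The function name is hypothetical.</p>

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def count_codon_differences(cds_a, cds_b):
    """Return (synonymous, non_synonymous) counts of differing codons.

    Both inputs must be aligned, gap-free, in-frame coding sequences of the
    same length. A differing codon pair counts as synonymous when both codons
    translate to the same amino acid, and as non-synonymous otherwise.
    """
    if len(cds_a) != len(cds_b) or len(cds_a) % 3:
        raise ValueError("sequences must be equal-length multiples of 3")
    syn = nonsyn = 0
    for i in range(0, len(cds_a), 3):
        a = cds_a[i:i + 3].upper()
        b = cds_b[i:i + 3].upper()
        if a == b:
            continue
        if CODON_TABLE[a] == CODON_TABLE[b]:
            syn += 1
        else:
            nonsyn += 1
    return syn, nonsyn


if __name__ == "__main__":
    # GCT and GCC both code for alanine; AAA is lysine, AGA is arginine.
    print(count_codon_differences("ATGGCTAAA", "ATGGCCAGA"))
```

<p>A sequence under purifying selection tends to accumulate mostly the first kind of change, since the second kind alters the protein.</p>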
<div class="mceTemp">
<dl id="attachment_6039" class="wp-caption alignnone" style="width: 203px;">
<dt class="wp-caption-dt"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/smorf-pipeline.jpg"><img class="size-medium wp-image-6039" title="smorf-pipeline" src="http://bytesizebio.net/wp-content/uploads/2012/04/smorf-pipeline-193x300.jpg" alt="" width="193" height="300" /></a><p class="wp-caption-text">Click to enlarge. Search pipeline for Drosophila smORFs. Diagram of the smORF search pipeline followed in this study. The percentages of smORFs passing each filter are indicated. For full details, see Results and Materials and methods. CDS, coding DNA sequence; Dm, Drosophila melanogaster; Dp, Drosophila pseudoobscura; Ka/Ks, ratio of non-synonymous (Ka) to synonymous (Ks) nucleotide substitution. Ladoukakis et al. Genome Biology 2011 12:R118 doi:10.1186/gb-2011-12-11-r118</p></dt></dl></div>
<p>So the chosen 401 smORFs are evolutionarily conserved, both in sequence and in synteny, subject to purifying selection (by Ka/Ks ratio) and produce a transcript. The authors obviously went for specificity over sensitivity: they looked for &#8220;good bet&#8221; smORFs rather than a large number of candidates. What I like about this study is the way that the authors used a large number of evolutionary traits as attributes for identifying smORFs. They also were careful to rule out, as much as possible, that these smORFs may be a result of a larger transcript. This is a really nice molecular evolution work. There is no experimental evidence yet of the functionality of these smORFs: that is left to future proteomics researchers and fly geneticists. But the idea of a small(er) world of genes, hiding in plain sight among the more familiar large ones, does have its appeal, and may yield some surprises about how our genomes are structured.</p>
<p>Finally, for the evolutionary biologists: read the <a href="http://genomebiology.com/2011/12/11/R118" target="_blank">paper</a>; there is quite a lot more to it than what I wrote. I just gave the highlights.</p>
<p>&nbsp;</p>
<hr />
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Genome+Biology&amp;rft_id=info%3Adoi%2F10.1186%2Fgb-2011-12-11-r118&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Hundreds+of+putatively+functional+small+open+reading+frames+in+Drosophila&amp;rft.issn=1465-6906&amp;rft.date=2011&amp;rft.volume=12&amp;rft.issue=11&amp;rft.spage=0&amp;rft.epage=&amp;rft.artnum=http%3A%2F%2Fgenomebiology.com%2F2011%2F12%2F11%2FR118&amp;rft.au=Ladoukakis%2C+E.&amp;rft.au=Pereira%2C+V.&amp;rft.au=Magny%2C+E.&amp;rft.au=Eyre-Walker%2C+A.&amp;rft.au=Couso%2C+J.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology%2CBioinformatics%2C+%2C+Genetics+%2C+Evolutionary+Biology%2C+Genomics">Ladoukakis, E., Pereira, V., Magny, E., Eyre-Walker, A., &amp; Couso, J. (2011). Hundreds of putatively functional small open reading frames in Drosophila <span style="font-style: italic;">Genome Biology, 12</span> (11) DOI: <a href="http://dx.doi.org/10.1186/gb-2011-12-11-r118" rev="review">10.1186/gb-2011-12-11-r118</a></span></p>
<p>&nbsp;</p>
<p><a href="http://genomebiology.com/2011/12/11/R118/abstract">http://genomebiology.com/2011/12/11/R118/abstract</a></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/04/27/its-a-smorf-world-after-all/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Biocuration 2012</title>
		<link>http://bytesizebio.net/index.php/2012/04/06/biocuration-2012/</link>
		<comments>http://bytesizebio.net/index.php/2012/04/06/biocuration-2012/#comments</comments>
		<pubDate>Fri, 06 Apr 2012 15:03:25 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Biology]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[annotation]]></category>
		<category><![CDATA[biocuration]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[DC]]></category>
		<category><![CDATA[protein function prediction]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5977</guid>
		<description><![CDATA[&#160; Great meeting:  Biocuration 2012, Georgetown University, DC.  When I leave a meeting with my head exploding with new ideas and a need to try them all out at once, I know I got my money&#8217;s worth, and then some. Even a three hour flight delay followed by discovering my car with a dead battery [...]]]></description>
			<content:encoded><![CDATA[<p>&nbsp;</p>
<p>Great meeting:  <a href="http://pir.georgetown.edu/biocuration2012/">Biocuration 2012</a>, Georgetown University, DC.  When I leave a meeting with my head exploding with new ideas and a need to try them all out at once, I know I got my money&#8217;s worth, and then some. Even a three hour flight delay followed by discovering my car with a dead battery at 1am at the deserted Dayton Airport parking lot did not dampen my enthusiasm upon return. I will make sure my dome light is off before I leave my car  the next time though. To follow are bits and pieces from the meeting I enjoyed. I&#8217;m doing this mostly from memory, two days later, so I may have an addendum once I get my notes together.</p>
<p>What is biocuration? Well, anything that has to do with annotating, labeling, indexing, identifying biological entities. Almost exclusively genes in this conference. Genome databases, especially those of model organisms, employ curators to annotate, check and re-annotate the genomic data. Here&#8217;s a more elaborate explanation, <a href="http://biocurator.org/what.shtml" target="_blank">taken</a> from the website of the <a href="http://biocurator.org/home.shtml" target="_blank">International Society for Biocuration</a>:</p>
<blockquote><p>Biocuration involves the translation and integration of information relevant to biology into a database or resource that enables integration of the scientific literature as well as large data sets. Accurate and comprehensive representation of biological knowledge, as well as easy access to this data for working scientists and a basis for computational analysis, are primary goals of biocuration.</p>
<p>The goals of biocuration are achieved thanks to the convergent endeavors of biocurators, software developers and researchers in bioinformatics. Biocurators provide essential resources to the biological community such that databases have become an integral part of the tools researchers use on a daily basis for their work.</p></blockquote>
<p><a href="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses.jpg"><img class="aligncenter" src="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses-296x300.jpg" alt="" width="178" height="180" /></a></p>
<p>&nbsp;</p>
<p><strong>Day 1</strong> started off with many community annotation tools. I thought that the Wikipedia model for annotation was dead, but maybe I&#8217;m wrong. Many community efforts use a large number of experts, as opposed to a huge number of non-experts, which is what the speakers at the first session were discussing. <a href="http://www.pombase.org/" target="_blank">Pombase</a> (whose title drew some chuckles from the French speakers at my table), the <a href="http://ciliate.org/index.php/home/welcome" target="_blank">Tetrahymena Genome Database</a> Wiki and the <a href="http://en.wikipedia.org/wiki/Gene_Wiki" target="_blank">Gene Wiki</a> were presented. The Gene Wiki, presented by <a href="http://sulab.org/" target="_blank">Andrew Su</a> from TSRI is a <em>bona-fide</em> crowdsourcing approach, not just Wikipedia-like but actually comprised of a set of 10,000 gene definition stubs folded into Wikipedia. Jennifer Harrow from Sanger presented a poster with an accession model of annotations: the &#8220;blessed annotator&#8221; who has been trained for 3 months and has the run of the wiki, and the &#8220;gatekeeper&#8221;, who has been trained in a 2-day workshop, and whose contributions need to be monitored. Lots of talks about trusted annotators, etc. Perhaps we should look to cryptography&#8217;s &#8220;circles of trust&#8221; to enable trusted annotations yet increase the number of curators. (I use &#8220;curation&#8221; and &#8220;annotation&#8221; interchangeably throughout.)</p>
<p>An afternoon workshop discussed <a href="http://database.oxfordjournals.org/content/2012/bar059.abstract" target="_blank">who biocurators are</a>. If you are a biocurator, there&#8217;s a good probability you are 31-50 years young (80%), female (60%), with a PhD (76%), and have been through the academic mill and found it to be a bad fit for one reason or another. You like your work, you rarely burn out, it is challenging and stimulating, you are not in it for the money. (Few people in non-industry science are.) Actually, since non-profit science is run on soft money, funding is a serious concern, and your job may have a shorter half-life than you would care for it to have, as you are probably employed on a 3-5 year contract. Your boss is rarely a biocurator her/himself, which may mean that your job description is sometimes ill-defined.</p>
<p>After  that, there was a  whole session devoted to curation workflows and tools. If  you are setting up your own genomic database, check these out: <a href="http://gmod.org/wiki/WebApollo" target="_blank">WebApollo</a>,  <a href="http://database.oxfordjournals.org/content/2012/bas001.short" target="_blank">CvManGO</a> and the <a href="http://www.reactome.org/" target="_blank">Reactome</a>. <a href="http://pimm.wordpress.com/about/" target="_blank">Attila Csordas</a> from EBI presented <a href="http://www.ebi.ac.uk/pride/" target="_blank">PRIDE</a>, a tool for curating proteomic data. While proteomic data are growing, there are few choices of software tools to annotate them. So PRIDE is a welcome player in the field.</p>
<p style="text-align: center;"><a href="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses.jpg"><img class="wp-image-5996 aligncenter" src="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses-296x300.jpg" alt="" width="178" height="180" /></a></p>
<p><strong> Day 2</strong> had a &#8220;Genomics, metagenomics comparative genomics&#8221; session, only without the metagenomics. <img src='http://bytesizebio.net/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' />   What I really liked was the <a href="http://viralzone.expasy.org/" target="_blank">ViralZone</a> resource for viral genomes, out of SIB. High time someone did this for the most abundant biological particle on Earth, and the one responsible for most diversity in life.</p>
<p>The breakout sessions were my favorite, getting a chance to interact with like-minded people interested in similar questions. (That is, those that share my prejudices.) I went to the one organized by <a href="http://www.unil.ch/dee/page22707_en.html" target="_blank">Marc Robinson-Rechavi</a> and <a href="http://www.unil.ch/dee/page48559_en.html">Frederic Bastian</a> which dealt with the question of quality in gene annotation. Here is the problem: when we annotate a gene with a function (or functions), we also need to say what evidence brought us to think that this gene does what it does. The most popular vocabulary for annotating genes is the <a href="http://www.geneontology.org/" target="_blank">Gene Ontology</a> or GO. GO provides us with <a href="http://www.geneontology.org/GO.evidence.shtml" target="_blank">evidence codes</a> which allow the curator to say what the evidence is for the function they assign to a gene. Those range from experimental evidence codes such as &#8220;inferred from mutant phenotype&#8221;, which are always entered by a human curator, to &#8220;Inferred from Electronic Annotation&#8221;, which have no human oversight. These evidence codes are used as a proxy for quality: people generally tend to accept that evidence from an experiment is stronger evidence that a gene does what it does than electronic evidence. That may not necessarily be true. For example, high-throughput experiments can result in many genes being assigned annotations wholesale. Even with an (uncharacteristically low) 5% error rate, a single paper used as a source from which 5,000 genes are annotated would result in 250 wrongly annotated genes. In addition, these types of experiments supply annotations that are not very specific, such as &#8220;protein binding&#8221; or &#8220;embryonic development&#8221;, terms that in many cases are too general to be useful. 
On the other hand, Nives Škunca of ETH Zurich has shown a beautiful study about how fully automated annotations may not be as inferior to human-curated ones as most people think, with some caveats. (Note: Nives also showed her work in a poster that won the best poster award at the meeting, and this work has just been accepted to <em>PLoS Computational Biology</em>. I will try to blog more about it once it&#8217;s published, it&#8217;s really brilliant.) The discussion revolved around how we should ascertain the quality of annotations, what would be considered a useful annotation, and how we can establish trustworthiness. Seems like there is quite a bit of work to be done, as people are only beginning to realize that this is a more complex problem than we thought. A major player in this will be the Evidence Ontology or <a href="http://www.ebi.ac.uk/ontology-lookup/browse.do?ontName=ECO">ECO</a>, an elaborate ontology in the making describing lines of evidence for gene annotation.</p>
<p><a href="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses.jpg"><img class="aligncenter" src="http://bytesizebio.net/wp-content/uploads/2012/04/Solar-and-Lunar-eclipses-296x300.jpg" alt="" width="178" height="180" /></a></p>
<p><strong>Day 3</strong>: Attila Csordas, whom I mentioned earlier, organized an <a href="http://en.wikipedia.org/wiki/Unconference" target="_blank">unconference</a> session early in the morning. A few of us gave brief talks there. Ben Good from Andrew Su&#8217;s lab talked about biocuration through games. The idea is to do for biocuration what <a href="http://fold.it" target="_blank">fold.it</a> has done for protein folding. The <a href="http://sulab.org/2011/11/learning-from-the-dizeez-game/" target="_blank">Dizeez</a> game quizzes you about diseases related to genes, and scores you according to how well you link genes to diseases. But as Andrew says on his <a href="http://sulab.org/2011/11/learning-from-the-dizeez-game/" target="_blank">blog</a>:</p>
<blockquote><p> Generally, the gene-disease links in structured databases will be reasonably correct (though likely not at all complete). When we analyze the game logs in aggregate, we expect that players’ answers will generally reinforce what’s already known. But given enough game player data, also expect that we’ll see multiple instances of gene-disease links that <em>aren’t</em> reflected in current annotation databases. And these are candidate novel annotations.</p></blockquote>
<p>So there may be something there, although it is not the &#8220;wisdom of the crowds&#8221; that is being exploited, since I imagine that only people with advanced degrees in their field can contribute to Dizeez. You can see games from the Su lab on <a href="http://genegames.org/" target="_blank">genegames.org</a>. Sean Mooney from Buck talked about the <a href="http://www.mooneygroup.org/stop/input" target="_blank">Statistical Tracking of Ontological Phrases</a> (STOP) project. The idea here is to automatically enrich GO annotation of genes with other ontologies, to get a more comprehensive description of their function, especially when it comes to disease.  I talked about the <a href="http://bytesizebio.net/index.php/2011/07/02/cafa-update/" target="_blank">Critical Assessment of Function Annotations</a> (we finally submitted the paper, yay!).  Attila talked about annotating proteomic data.</p>
<p>Great meeting. A big thank you to the <a href="http://pir.georgetown.edu/biocuration2012/organizers.html" target="_blank">organizers</a>, it went without a hitch.  Logistics, food, coffee were all fantastic. Looking forward to Cambridge next year! <strong>EDIT</strong>: a <a href="http://www.oxfordjournals.org/our_journals/databa/biocuration_virtual_issue.html" target="_blank">virtual special issue of <em>Database</em></a> has been published for this meeting. Some of the talks are there as papers. Open Access, of course.</p>
<p>Finally, my favorite promotional item from the meeting:</p>
<p><a href="http://bytesizebio.net/wp-content/uploads/2012/04/2012-04-03-19.09.47.jpg"><img class="alignnone size-medium wp-image-6000" title="2012-04-03 19.09.47" src="http://bytesizebio.net/wp-content/uploads/2012/04/2012-04-03-19.09.47-225x300.jpg" alt="" width="225" height="300" /></a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/04/06/biocuration-2012/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>You. Want. This. Job.</title>
		<link>http://bytesizebio.net/index.php/2012/03/27/you-want-this-job/</link>
		<comments>http://bytesizebio.net/index.php/2012/03/27/you-want-this-job/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 15:47:51 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[genomics]]></category>
		<category><![CDATA[jobs]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5961</guid>
		<description><![CDATA[NSF grant funded, woohoo! Now I am hiring a programmer. So if you want to be part of a dynamic, growing lab, do lots of interesting stuff and upgrade yourself from just a great bioinformatician to a super-bioinformatician, this job&#8217;s for you.  You&#8217;ll be working primarily on microbial genome evolution, including setting up a kick-butt [...]]]></description>
			<content:encoded><![CDATA[<p>NSF grant funded, woohoo! Now I am hiring a programmer. So if you want to be part of a dynamic, growing lab, do lots of interesting stuff and upgrade yourself from just a great bioinformatician to a super-bioinformatician, this job&#8217;s for you.  You&#8217;ll be working primarily on microbial genome evolution, including setting up a kick-butt multi-genome database, and all sorts of interesting distractions.  See below for the nitty-gritty. Original ad here: <a href="https://www.miamiujobs.com" target="_blank">https://www.miamiujobs.com</a>, job posting number: <strong>0001377</strong> . Pass on to interested parties. Three year position, renewable annually.</p>
<blockquote><p><strong>Microbiology</strong>: Scientific Programmer/Specialist to implement and maintain a genomic database web site; implement data management tools including relational database management applications for efficient storage and retrieval of genomic data; perform other duties as related to the position such as data and project management to ensure data are being processed in an efficient and timely manner; contribute to writing scientific manuscripts.</p>
<p><strong>Required qualifications</strong>: BS or BA in Computer Science, bioinformatics, or a related discipline; demonstrated programming experience, particularly in Python and SQL databases; demonstrated web programming experience; knowledge of Linux/Unix; excellent spoken and written communication and documentation skills.</p>
<p><strong>Preferred qualifications</strong>: Advanced degree (M.Sc. or Ph.D) or equivalent in Computer Science, Bioinformatics, Molecular Biology or a related discipline; experience in development of bioinformatic algorithms; knowledge of R programming; experience in development of or contribution to open source projects; experience in collaborative software development such as the use of version control software, writing and following software specifications, participation in code review; knowledge of basic molecular biology; experience with genomic browser programming, such as GMOD or equivalent.</p>
<p>Candidates should send a CV or resume and have three letters of reference sent separately to Dr. Iddo Friedberg at <a href="http://is.gd/40N6zn">Friedberg.lab.jobs &#8216;at&#8217; gmail &#8216;dot&#8217; com</a>. Screening of applications begins April 14, 2012 and will continue until the position is filled.</p>
<p>Miami University is an affirmative action/equal opportunity employer with smoke-free campuses. Consumer Information http://www.miami.muohio.edu/about-miami/publications-and-policies/student-consumer-info/. Hard copy upon request.</p></blockquote>
<p><span style="font-family: Arial,sans-serif;"><a href="http://bytesizebio.net/wp-content/uploads/2012/03/job-ad-programmer.pdf" target="_blank">Ad in PDF</a>.<br style="font-family: Arial,sans-serif;" /></span></p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/03/27/you-want-this-job/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Dirty Genomics</title>
		<link>http://bytesizebio.net/index.php/2012/03/21/dirty-genomics/</link>
		<comments>http://bytesizebio.net/index.php/2012/03/21/dirty-genomics/#comments</comments>
		<pubDate>Wed, 21 Mar 2012 15:02:30 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Biology]]></category>
		<category><![CDATA[Film]]></category>
		<category><![CDATA[Funny]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Clint Eastwood]]></category>
		<category><![CDATA[genomics]]></category>
		<category><![CDATA[movies]]></category>
		<category><![CDATA[next generation sequencing]]></category>
		<category><![CDATA[sequencing]]></category>
		<category><![CDATA[short read sequencing]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5957</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" title="Dirty Genomics" src="http://i.imgur.com/lNWOy.png" alt="" width="599" height="450" /></p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/03/21/dirty-genomics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Repost: a very loose and circular association to Pi Day</title>
		<link>http://bytesizebio.net/index.php/2012/03/14/repost-a-very-loose-and-circular-association-to-pi-day/</link>
		<comments>http://bytesizebio.net/index.php/2012/03/14/repost-a-very-loose-and-circular-association-to-pi-day/#comments</comments>
		<pubDate>Wed, 14 Mar 2012 16:26:33 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Biochemistry]]></category>
		<category><![CDATA[Mathematics]]></category>
		<category><![CDATA[Microbiology]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Structural biology]]></category>
		<category><![CDATA[cyclotides]]></category>
		<category><![CDATA[pi]]></category>
		<category><![CDATA[pi day]]></category>
		<category><![CDATA[proteins]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5954</guid>
		<description><![CDATA[(Originally published March 14, 2009) Happy Pi (π) Day! Americans write dates in the MM/DD/YYYY format instead of the DD/MM/YYYY format used by the rest of the world.  Usually a rather painful and confusing format if you did not grow up with it, causing checks to bounce and leases to expire for those who recently [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.researchblogging.org"><img src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></p>
<p>(Originally published March 14, 2009)</p>
<p>Happy <a href="http://en.wikipedia.org/wiki/Pi_day" target="_blank">Pi (</a><a title="Pi" href="http://en.wikipedia.org/wiki/Pi">π</a>) <a href="http://en.wikipedia.org/wiki/Pi_day" target="_blank">Day</a>! Americans write dates in the MM/DD/YYYY format instead of the DD/MM/YYYY format used by the rest of the world.  Usually a rather painful and confusing format if you did not grow up with it, causing checks to bounce and leases to expire for those who recently moved to the US, but it has a few benefits: you can take the numeric representation of March 14, and you have the first three digits of Pi. This coincidence is good enough to celebrate a day around the uber-celebrity of numbers. (Heh, I said  &#8220;around&#8221;). Everybody&#8217;s welcome.</p>
<p>This is the day all geeky bloggers come out and try to: (1) show how smart they are; (2) connect Pi, usually in some improbable and tenuous fashion, to whatever theme they have in their blogs and (3) try to make an original observation of pi no one else has made before. So that is exactly what I am going to do today.</p>
<p>Sort of.</p>
<p>Well,  probably not.</p>
<h5><strong>Smarts</strong></h5>
<p>Well, I remembered Pi day, didn&#8217;t I? OK, that does not show I&#8217;m smart, it just shows my brain is a repository of useless trivia. Look at the time of publication of this post: March 14, 1:59am, which is 3.14159. Hey, a five-digit time stamp, that&#8217;s smart! (Not very original though; also, I&#8217;m actually up at this time finishing a grant proposal.)</p>
<div>
<dl id="attachment_671">
<dt><a href="http://bytesizebio.net/wp-content/uploads/2009/03/1aym_bio_r_500.jpg"><img title="1aym_bio_r_500" src="http://bytesizebio.net/wp-content/uploads/2009/03/1aym_bio_r_500-300x300.jpg" alt="1aym_bio_r_500" width="240" height="240" /></a></dt>
<dd>Human Rhinovirus capsid. Not a perfect sphere, but close connection to blog theme</dd>
</dl>
</div>
<h5>A post with a less than tenuous connection to Pi</h5>
<p>Some virus capsids are icosahedral. Not really spherical but sort-of. Bacteria have flagellar motors that are circular. Micelles are usually spherical.  Microvesicles are spherical. All these are a good start for pi-topics.</p>
<p>Well, too bad. I actually want to write about circular proteins. Only &#8220;circular&#8221; in this case does not mean &#8220;circle shaped&#8221;:  hence, we are chucking Pi out the window right now. Stick around though, these proteins are really cool.</p>
<div>
<dl id="attachment_680">
<dt><a href="http://bytesizebio.net/wp-content/uploads/2009/03/peptbond.gif"><img title="peptbond" src="http://bytesizebio.net/wp-content/uploads/2009/03/peptbond.gif" alt="Formation of a peptide bond" width="228" height="211" /></a></dt>
<dd>Formation of a peptide bond</dd>
</dl>
</div>
<p>You were probably taught that proteins are linear chains of amino acids that fold into a shape that produces their function. The links connecting the amino acids in the chain are peptide bonds. But there is no real reason why the carboxy terminus (right side) and amino terminus (left side) could not bond to each other. For a long time, it just had not been observed, or looked for. Well, they do. And some proteins are circular, like a snake biting its own tail.</p>
<div>
<dl id="attachment_681">
<dt><a href="http://bytesizebio.net/wp-content/uploads/2009/03/cyclotide_structure.jpg"><img title="cyclotide_structure" src="http://bytesizebio.net/wp-content/uploads/2009/03/cyclotide_structure-300x137.jpg" alt="Structure and sequence of the cyclotide kalata B1" width="300" height="137" /></a></dt>
<dd>Structure and sequence of the cyclotide kalata B1</dd>
</dl>
</div>
<p>These <em>cyclotides</em> are very robust. For one, they are almost immune to proteases: enzymes that break up proteins. Many proteases attack the ends of the protein (exoproteases, because they start from the &#8220;outside&#8221;), but there are no ends to attack here. Their disulfide bonds and short length make them resistant to <em>endoproteases</em> as well as to heat, pH, etc.</p>
<h5>What do cyclotides do?</h5>
<p>They protect the organism that produces them.  All kingdoms of life produce cyclotides, everything from bacteria to Rhesus monkeys. (Actually, I am not sure about Archaea). Cyclotides seem to act through different mechanisms: some form holes in the membrane of the attacking microbe; plant cyclotides stunt the growth of feeding caterpillars. Interestingly, the same plant peptide, kalata B1, induces uterine contractions in mammals. This is how it was discovered: a physician working in the Democratic Republic of Congo noticed that laboring women were drinking tea made from <em>Oldenlandia affinis</em> to induce childbirth. The active ingredient was the first cyclotide to be discovered. Since then, cyclotides have been shown to be antibiotic, antiviral and insecticidal.</p>
<h5>Do humans produce cyclotides?</h5>
<p>I could not find anything about that in the literature. So I took the amino acid sequence of a recently discovered monkey cyclotide, rhesus theta defensin 1 (RTD1), and BLASTed it (TBLASTN: protein vs. nucleotide) against the human genome. No results. Of course, this 5-minute trial proves very little. TBLASTNing short sequences (RTD1 is only 18 aa long) is a bit sticky. If you are a beginning bioinformatics student looking for a course or rotation project, finding candidate cyclotides in humans (or in other genomes) might be a good idea.  There are about 100 known sequences, so quite a bit for a training set to start from.  You can build a profile or an HMM, and do some more sensitive searches.</p>
<h5>But what about Pi?</h5>
<p>Sigh.. well, here is an XKCD oldie but goldie nerd litmus test&#8230; enjoy&#8230;</p>
<p><a href="http://imgs.xkcd.com/comics/pi.jpg"><img src="http://imgs.xkcd.com/comics/pi.jpg" alt="" width="375" height="198" /></a></p>
<hr />
<p>Trabi, M. (2002). Circular proteins — no end in sight Trends in Biochemical Sciences, 27 (3), 132-138 DOI: <a href="http://dx.doi.org/10.1016/S0968-0004(02)02057-1" rev="review">10.1016/S0968-0004(02)02057-1</a></p>
<p>Pelegrini, P., Quirino, B., &amp; Franco, O. (2007). Plant cyclotides: An unusual class of defense compounds Peptides, 28 (7), 1475-1481 DOI: <a href="http://dx.doi.org/10.1016/j.peptides.2007.04.025" rev="review">10.1016/j.peptides.2007.04.025</a></p>
<p>Wang, C., Hu, S., Martin, J., Sjogren, T., Hajdu, J., Bohlin, L., Claeson, P., Goransson, U., Rosengren, K., Tang, J., Tan, N., &amp; Craik, D. (2009). Combined X-ray and NMR analysis of the stability of the cyclotide cystine knot fold that underpins its insecticidal activity and potential use as drug scaffold Journal of Biological Chemistry DOI: <a href="http://dx.doi.org/10.1074/jbc.M900021200" rev="review">10.1074/jbc.M900021200</a></p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/03/14/repost-a-very-loose-and-circular-association-to-pi-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Origin of Gender Symbols in Biology</title>
		<link>http://bytesizebio.net/index.php/2012/03/08/the-origin-of-gender-symbols-in-biology/</link>
		<comments>http://bytesizebio.net/index.php/2012/03/08/the-origin-of-gender-symbols-in-biology/#comments</comments>
		<pubDate>Thu, 08 Mar 2012 22:57:34 +0000</pubDate>
		<dc:creator>Iddo</dc:creator>
				<category><![CDATA[Biology]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[science culture]]></category>
		<category><![CDATA[taxonomy]]></category>

		<guid isPermaLink="false">http://bytesizebio.net/?p=5930</guid>
		<description><![CDATA[A quick post for International Women&#8217;s Day: how did the gender symbols originate in biology? What do ♀ and ♂ actually stand for? The answer starts in antiquity, when planets and gods were almost synonymous. Religious rites (at least in Europe) were also associated with the working of metals. Thus, each heavenly body was associated with a [...]]]></description>
			<content:encoded><![CDATA[<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span></p>
<p>A quick post for <a href="http://en.wikipedia.org/wiki/International_Women's_Day" target="_blank">International Women&#8217;s Day</a>: how did the gender symbols originate in biology? What do ♀ and ♂ actually stand for?</p>
<p>The answer starts in antiquity, when planets and gods were almost synonymous. Religious rites (at least in Europe) were also associated with the working of metals. Each heavenly body was therefore associated with a metal and a god, and given its own symbol:</p>
<div id="attachment_5933" class="wp-caption alignnone" style="width: 593px"><a href="http://bytesizebio.net/wp-content/uploads/2012/03/planets-metals.png"><img class="size-full wp-image-5933" title="planets-metals" src="http://bytesizebio.net/wp-content/uploads/2012/03/planets-metals.png" alt="" width="583" height="151" /></a><p class="wp-caption-text">1. Sun (gold) 2. Moon (silver) 3. Saturn (lead) 4. Jupiter (tin) 5. Mars (iron) 6. Mercury (mercury, duh) 7. Venus (copper). After woodcuts by Fritz Kredel, published in Stearn 1962.</p></div>
<p>&nbsp;</p>
<p>But how did the symbols of Mars (iron) and Venus (copper) migrate to describe sex in biology? It seems obvious to us that, of all symbols, that of the god of war be assigned to male, and that of the goddess of love to female (stereotypes notwithstanding), but who was the first to do that?</p>
<p>The answer can be traced to one of the greatest biologists of all time: <a href="http://en.wikipedia.org/wiki/Linneaus" target="_blank">Carl Linnaeus</a>. He is better known for being the father of modern taxonomy: Linnaeus is the reason that we uniquely identify organisms using genus and species names in Latin grammatical form, a system known as Linnaean <a href="http://en.wikipedia.org/wiki/Binomial_nomenclature" target="_blank">binomial nomenclature</a>. From <em>Homo sapiens</em> to <em>Escherichia coli</em>, we all owe our scientific names to Linnaeus.</p>
<p>But Linnaeus was also the one to appropriate the planet symbols for biology. In his notes, he used the Venus symbol as shorthand for female and the Mars symbol as shorthand for male. He also used Saturn to denote woody plants, the Sun for annual plants and Jupiter for perennials. As for gender, the Mercury symbol was used by Linnaeus for hermaphrodite plants. However, that symbol&#8217;s meaning has changed over the years, at least in scientific shorthand, and it is now used to denote a virgin female (e.g. in genetic analysis).  Mars was also used by Linnaeus, somewhat confusingly, for biennial plants.</p>
<p>But how did the symbols actually originate? The accepted thought now is that they were derived by the Romans from the Greek initial letters for the planets / deities. So Phosphoros  Φωσφόρος (Greek: &#8220;Morning Star&#8221; or later the planet Venus) was abbreviated to Φκ and Thouros (Mars) to θρ, and these were further contracted over the years by metal workers, astrologers and alchemists into the modern symbols.</p>
<div id="attachment_5934" class="wp-caption alignnone" style="width: 353px"><a href="http://bytesizebio.net/wp-content/uploads/2012/03/greek-abbrev.png"><img class="size-full wp-image-5934" title="greek-abbrev" src="http://bytesizebio.net/wp-content/uploads/2012/03/greek-abbrev.png" alt="" width="343" height="147" /></a><p class="wp-caption-text">Kronos (Saturn); Zeus (Jupiter); Thouros (Mars); Phosphoros (Venus); Stilbon (Mercury). After Stearn 1962</p></div>
<p>&nbsp;</p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Taxon&amp;rft_id=info%3Aother%2F&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=The+Origin+of+the+Male+and+Female+Symbols+of+Biology&amp;rft.issn=&amp;rft.date=1962&amp;rft.volume=11&amp;rft.issue=4&amp;rft.spage=109&amp;rft.epage=113&amp;rft.artnum=+http%3A%2F%2Fwww.jstor.org%2Fstable%2F1217734&amp;rft.au=William+T.+Stearn&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology%2CBioinformatics%2C+Biophysics%2C+Structural+Biology%2C+Molecular+Biology%2C+Microbiology%2C+Structural+Biology%2C+Computational+Biology%2C+Evolutionary+Biology"><a href="http://www.jstor.org/stable/1217734">William T. Stearn (1962). The Origin of the Male and Female Symbols of Biology</a> <span style="font-style: italic;">Taxon, 11</span> (4), 109-113</span></p>
]]></content:encoded>
			<wfw:commentRss>http://bytesizebio.net/index.php/2012/03/08/the-origin-of-gender-symbols-in-biology/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

