<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https:/// /w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=RobEdwards</id>
	<title>TheSeed - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https:/// /w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=RobEdwards"/>
	<link rel="alternate" type="text/html" href=" /wiki/Special:Contributions/RobEdwards"/>
	<updated>2026-05-21T03:55:06Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.6</generator>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1814</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1814"/>
		<updated>2008-05-14T21:59:21Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. &lt;br /&gt;
&lt;br /&gt;
In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
   &amp;gt;sequenceid\rgatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''This is a sequence that has been edited in a mac.'' &lt;br /&gt;
    ''We try to fix them, because we're mac users too, but can't always.''&lt;br /&gt;
    ''Please make sure you save using UNIX format if you are using a mac&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1813</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1813"/>
		<updated>2008-05-14T21:59:04Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. &lt;br /&gt;
&lt;br /&gt;
In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
   &amp;gt;sequenceid\rgatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''this is a sequence that has been edited in a mac.'' &lt;br /&gt;
    ''We try to fix them, because we're mac users too, but can't always.''&lt;br /&gt;
    ''Please make sure you save using UNIX format if you are using a mac&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1812</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1812"/>
		<updated>2008-05-14T21:58:35Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. &lt;br /&gt;
&lt;br /&gt;
In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
   &amp;gt;sequenceid\rgatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''this is a sequence that has been edited in a mac. &lt;br /&gt;
    We try to fix them, because we're mac users too, but can't always. &lt;br /&gt;
    Please make sure you save using UNIX format if you are using a mac&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1811</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1811"/>
		<updated>2008-05-14T21:58:01Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. &lt;br /&gt;
&lt;br /&gt;
In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
   &amp;gt;sequenceid\rgatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''this is a sequence that has been edited in a mac. We try to fix them, because we're mac users too, but can't always. Please make sure you save using newlines (sometimes called UNIX format) if you are using a mac&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1810</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1810"/>
		<updated>2008-05-14T20:59:07Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* 2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
# DeJongh, M., Formsma, K., Boillot, P.,  Gould, J.,  Rycenga M.,   and Best, A. (2007) Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 2007 [http://www.ncbi.nlm.nih.gov/pubmed/17462086 Pubmed] [http://www.biomedcentral.com/1471-2105/8/139 BMC Bioinformatics]&lt;br /&gt;
&lt;br /&gt;
=2008=&lt;br /&gt;
# Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. (2008) , The RAST Server: rapid annotations using subsystems technology. BMC Genomics [http://www.ncbi.nlm.nih.gov/pubmed/18261238 Pubmed] [http://www.biomedcentral.com/1471-2164/9/75 BMC Genomics]&lt;br /&gt;
# Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., Brulc, J., Chau, B., Furlan, M., Desnues, C.G., Haynes, M., Li, L., McDaniel, L., Moran, M.A., Nelson, K.E., Nilsson, C., Olson, R., Paul, J., Rodriguez-Brito, B., Swan, B., Stevens, R., Valentine, D., Vega-Thurber, R., Wegley, L., White, B., and F. Rohwer. 2008. Functional Metagenomic Profiling of Nine Biomes. Nature 452:629-632. [http://www.ncbi.nlm.nih.gov/pubmed/18337718 Pubmed] [http://www.nature.com/nature/journal/vaop/ncurrent/full/nature06810.html Nature]&lt;br /&gt;
# Krause, L., Diaz, N., Edwards, R.A., Rohwer, F., Kelley, S., and J. Stoye. 2008 Taxonomic Classification of Short Environmental DNA Fragments. 2008 NAR Online Feb 19th.  [http://www.ncbi.nlm.nih.gov/pubmed/18285365 Pubmed]    [http://nar.oxfordjournals.org/cgi/content/full/36/7/2230 NAR]&lt;br /&gt;
# Dinsdale, E. A., O. Pantos, S. Smriga, R. A. Edwards, F. Angly, L. Wegley, M. Hatay, D. Hall, E. Brown, M. Haynes, L. Krause, E. Sala, S. A. Sandin, R. V. Thurber, B. L. Willis, F. Azam, N. Knowlton, and F. Rohwer. 2008. Microbial ecology of four coral atolls in the northern line islands. PLoS ONE 3:e1584. [http://www.plosone.org/doi/pone.0001584 PLoS One]&lt;br /&gt;
# Desnues, C.G., Rodriguez-Brito, B., Rayhawk, S., Kelley, S., Tran, T., Haynes, M., Liu, H., Hall, D., Angly, F.E., Edwards, R.A., Vega, R., Breitbart, M., Siefert, J., Souza, V., Reid, P., Rohwer, F. 2008. Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature 20;452(7185):340-343. [http://www.ncbi.nlm.nih.gov/pubmed/18311127 PubMed] [http://www.nature.com/nature/journal/v452/n7185/abs/nature06735.html Nature]&lt;br /&gt;
# Mou, X., Sun, S., Edwards, R., Hodson, R., and Moran, M.A. 2008. Bacterial Carbon Processing by Generalist Species in the Coastal Ocean. Nature. 451:709-711. [http://www.ncbi.nlm.nih.gov/pubmed/18223640 Pubmed] [http://www.nature.com/nature/journal/v451/n7179/abs/nature06513.html Nature]&lt;br /&gt;
# Field, D., Garrity, G., Gray, T., Morrison, N., Selengut, J., Sterk, P., Tatusova, T., Thomson, N., Allen, M.J., Ashburner, M., Baldauf, S., Ballard, S., Boore, J., Cochrane, G., Cole, J., dePamphilis, C., Edwards, R., Faruque, N., Feldman, R., Glockner, F-O., Haft, D., Hancock, D., Hermjakob, H., Hertz-Fowler, C., Hugenholtz, P., Joint, I., Kane, M., Kennedy, J., Kowalchuk, G., Kottmann, R., Kolker, E., Kyrpides, N., Leebens-Mack, J., Lewis, S.E., Liste, A., Lord, P., Maltsev, N., Markowitz, V., Martiny, J., Methe, B., Moxon, R., Nelson, K., Parkhill, J., Sansone, S-A., Spiers, A., Stevens, R., Swift, P., Taylor, C., Tateno, Y., Tett, A., Turner, S., Ussery, D., Vaughan, B., Ward, N., Whetzel, T., Wilson, G., and Wipat, A., 2008. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnology, 26:541-547. [http://www.ncbi.nlm.nih.gov/pubmed/18464787 Pubmed] [http://www.nature.com/nbt/journal/v26/n5/abs/nbt1360.html Nature Biotech]&lt;br /&gt;
# Frank JA, Reich CI, Sharma S, Weisbaum JS, Wilson BA, Olsen GJ. Critical evaluation of two primers commonly used for amplification of bacterial 16S rRNA genes. Appl Environ Microbiol. 2008 Apr;74(8):2461-70. [http://www.ncbi.nlm.nih.gov/pubmed/18296538 PubMed] [http://aem.asm.org/cgi/content/full/74/8/2461?view=long&amp;amp;pmid=18296538 AEM]&lt;br /&gt;
&lt;br /&gt;
=In press=&lt;br /&gt;
&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1809</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1809"/>
		<updated>2008-05-14T15:09:45Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* 2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
# DeJongh, M., Formsma, K., Boillot, P.,  Gould, J.,  Rycenga M.,   and Best, A. (2007) Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 2007 [http://www.ncbi.nlm.nih.gov/pubmed/17462086 Pubmed] [http://www.biomedcentral.com/1471-2105/8/139 BMC Bioinformatics]&lt;br /&gt;
&lt;br /&gt;
=2008=&lt;br /&gt;
# Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. (2008) , The RAST Server: rapid annotations using subsystems technology. BMC Genomics [http://www.ncbi.nlm.nih.gov/pubmed/18261238 Pubmed] [http://www.biomedcentral.com/1471-2164/9/75 BMC Genomics]&lt;br /&gt;
# Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., Brulc, J., Chau, B., Furlan, M., Desnues, C.G., Haynes, M., Li, L., McDaniel, L., Moran, M.A., Nelson, K.E., Nilsson, C., Olson, R., Paul, J., Rodriguez-Brito, B., Swan, B., Stevens, R., Valentine, D., Vega-Thurber, R., Wegley, L., White, B., and F. Rohwer. 2008. Functional Metagenomic Profiling of Nine Biomes. Nature 452:629-632. [http://www.ncbi.nlm.nih.gov/pubmed/18337718 Pubmed] [http://www.nature.com/nature/journal/vaop/ncurrent/full/nature06810.html Nature]&lt;br /&gt;
# Krause, L., Diaz, N., Edwards, R.A., Rohwer, F., Kelley, S., and J. Stoye. 2008 Taxonomic Classification of Short Environmental DNA Fragments. 2008 NAR Online Feb 19th.  [http://www.ncbi.nlm.nih.gov/pubmed/18285365 Pubmed]    [http://nar.oxfordjournals.org/cgi/content/full/36/7/2230 NAR]&lt;br /&gt;
# Dinsdale, E. A., O. Pantos, S. Smriga, R. A. Edwards, F. Angly, L. Wegley, M. Hatay, D. Hall, E. Brown, M. Haynes, L. Krause, E. Sala, S. A. Sandin, R. V. Thurber, B. L. Willis, F. Azam, N. Knowlton, and F. Rohwer. 2008. Microbial ecology of four coral atolls in the northern line islands. PLoS ONE 3:e1584. [http://www.plosone.org/doi/pone.0001584 PLoS One]&lt;br /&gt;
# Desnues, C.G., Rodriguez-Brito, B., Rayhawk, S., Kelley, S., Tran, T., Haynes, M., Liu, H., Hall, D., Angly, F.E., Edwards, R.A., Vega, R., Breitbart, M., Siefert, J., Souza, V., Reid, P., Rohwer, F. 2008. Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature 20;452(7185):340-343. [http://www.ncbi.nlm.nih.gov/pubmed/18311127 PubMed] [http://www.nature.com/nature/journal/v452/n7185/abs/nature06735.html Nature]&lt;br /&gt;
# Mou, X., Sun, S., Edwards, R., Hodson, R., and Moran, M.A. 2008. Bacterial Carbon Processing by Generalist Species in the Coastal Ocean. Nature. 451:709-711. [http://www.ncbi.nlm.nih.gov/pubmed/18223640 Pubmed] [http://www.nature.com/nature/journal/v451/n7179/abs/nature06513.html Nature]&lt;br /&gt;
# Field, D., Garrity, G., Gray, T., Morrison, N., Selengut, J., Sterk, P., Tatusova, T., Thomson, N., Allen, M.J., Ashburner, M., Baldauf, S., Ballard, S., Boore, J., Cochrane, G., Cole, J., dePamphilis, C., Edwards, R., Faruque, N., Feldman, R., Glockner, F-O., Haft, D., Hancock, D., Hermjakob, H., Hertz-Fowler, C., Hugenholtz, P., Joint, I., Kane, M., Kennedy, J., Kowalchuk, G., Kottmann, R., Kolker, E., Kyrpides, N., Leebens-Mack, J., Lewis, S.E., Liste, A., Lord, P., Maltsev, N., Markowitz, V., Martiny, J., Methe, B., Moxon, R., Nelson, K., Parkhill, J., Sansone, S-A., Spiers, A., Stevens, R., Swift, P., Taylor, C., Tateno, Y., Tett, A., Turner, S., Ussery, D., Vaughan, B., Ward, N., Whetzel, T., Wilson, G., and Wipat, A., 2008. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnology, 26:541-547. [http://www.ncbi.nlm.nih.gov/pubmed/18464787 Pubmed] [http://www.nature.com/nbt/journal/v26/n5/abs/nbt1360.html Nature Biotech]&lt;br /&gt;
&lt;br /&gt;
=In press=&lt;br /&gt;
&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1808</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1808"/>
		<updated>2008-05-14T15:07:29Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* 2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
# DeJongh, M., Formsma, K., Boillot, P.,  Gould, J.,  Rycenga M.,   and Best, A. (2007) Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 2007 [http://www.ncbi.nlm.nih.gov/pubmed/17462086 Pubmed] [http://www.biomedcentral.com/1471-2105/8/139 BMC Bioinformatics]&lt;br /&gt;
&lt;br /&gt;
=2008=&lt;br /&gt;
# Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. (2008) , The RAST Server: rapid annotations using subsystems technology. BMC Genomics [http://www.ncbi.nlm.nih.gov/pubmed/18261238 Pubmed] [http://www.biomedcentral.com/1471-2164/9/75 BMC Genomics]&lt;br /&gt;
# Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., Brulc, J., Chau, B., Furlan, M., Desnues, C.G., Haynes, M., Li, L., McDaniel, L., Moran, M.A., Nelson, K.E., Nilsson, C., Olson, R., Paul, J., Rodriguez-Brito, B., Swan, B., Stevens, R., Valentine, D., Vega-Thurber, R., Wegley, L., White, B., and F. Rohwer. 2008. Functional Metagenomic Profiling of Nine Biomes. Nature 452:629-632. [http://www.ncbi.nlm.nih.gov/pubmed/18337718 Pubmed] [http://www.nature.com/nature/journal/vaop/ncurrent/full/nature06810.html Nature]&lt;br /&gt;
# Krause, L., Diaz, N., Edwards, R.A., Rohwer, F., Kelley, S., and J. Stoye. 2008 Taxonomic Classification of Short Environmental DNA Fragments. 2008 NAR Online Feb 19th.  [http://www.ncbi.nlm.nih.gov/pubmed/18285365 Pubmed]    [http://nar.oxfordjournals.org/cgi/content/full/36/7/2230 NAR]&lt;br /&gt;
# Dinsdale, E. A., O. Pantos, S. Smriga, R. A. Edwards, F. Angly, L. Wegley, M. Hatay, D. Hall, E. Brown, M. Haynes, L. Krause, E. Sala, S. A. Sandin, R. V. Thurber, B. L. Willis, F. Azam, N. Knowlton, and F. Rohwer. 2008. Microbial ecology of four coral atolls in the northern line islands. PLoS ONE 3:e1584. [http://www.plosone.org/doi/pone.0001584 PLoS One]&lt;br /&gt;
# Desnues, C.G., Rodriguez-Brito, B., Rayhawk, S., Kelley, S., Tran, T., Haynes, M., Liu, H., Hall, D., Angly, F.E., Edwards, R.A., Vega, R., Breitbart, M., Siefert, J., Souza, V., Reid, P., Rohwer, F. 2008. Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature 20;452(7185):340-343.&lt;br /&gt;
# Mou, X., Sun, S., Edwards, R., Hodson, R., and Moran, M.A. 2008. Bacterial Carbon Processing by Generalist Species in the Coastal Ocean. Nature. 451:709-711. [http://www.ncbi.nlm.nih.gov/pubmed/18223640 Pubmed] [http://www.nature.com/nature/journal/v451/n7179/abs/nature06513.html Nature]&lt;br /&gt;
# Field, D., Garrity, G., Gray, T., Morrison, N., Selengut, J., Sterk, P., Tatusova, T., Thomson, N., Allen, M.J., Ashburner, M., Baldauf, S., Ballard, S., Boore, J., Cochrane, G., Cole, J., dePamphilis, C., Edwards, R., Faruque, N., Feldman, R., Glockner, F-O., Haft, D., Hancock, D., Hermjakob, H., Hertz-Fowler, C., Hugenholtz, P., Joint, I., Kane, M., Kennedy, J., Kowalchuk, G., Kottmann, R., Kolker, E., Kyrpides, N., Leebens-Mack, J., Lewis, S.E., Liste, A., Lord, P., Maltsev, N., Markowitz, V., Martiny, J., Methe, B., Moxon, R., Nelson, K., Parkhill, J., Sansone, S-A., Spiers, A., Stevens, R., Swift, P., Taylor, C., Tateno, Y., Tett, A., Turner, S., Ussery, D., Vaughan, B., Ward, N., Whetzel, T., Wilson, G., and Wipat, A., 2008. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnology, 26:541-547.&lt;br /&gt;
&lt;br /&gt;
=In press=&lt;br /&gt;
&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1807</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1807"/>
		<updated>2008-05-14T15:03:02Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* 2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
# DeJongh, M., Formsma, K., Boillot, P.,  Gould, J.,  Rycenga M.,   and Best, A. (2007) Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 2007 [http://www.ncbi.nlm.nih.gov/pubmed/17462086 Pubmed] [http://www.biomedcentral.com/1471-2105/8/139 BMC Bioinformatics]&lt;br /&gt;
&lt;br /&gt;
=2008=&lt;br /&gt;
# Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. (2008) , The RAST Server: rapid annotations using subsystems technology. BMC Genomics [http://www.ncbi.nlm.nih.gov/pubmed/18261238 Pubmed] [http://www.biomedcentral.com/1471-2164/9/75 BMC Genomics]&lt;br /&gt;
# Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., Brulc, J., Chau, B., Furlan, M., Desnues, C.G., Haynes, M., Li, L., McDaniel, L., Moran, M.A., Nelson, K.E., Nilsson, C., Olson, R., Paul, J., Rodriguez-Brito, B., Swan, B., Stevens, R., Valentine, D., Vega-Thurber, R., Wegley, L., White, B., and F. Rohwer. 2008. Functional Metagenomic Profiling of Nine Biomes. Nature 452:629-632. [http://www.ncbi.nlm.nih.gov/pubmed/18337718 Pubmed] [http://www.nature.com/nature/journal/vaop/ncurrent/full/nature06810.html Nature]&lt;br /&gt;
# Krause, L., Diaz, N., Edwards, R.A., Rohwer, F., Kelley, S., and J. Stoye. 2008 Taxonomic Classification of Short Environmental DNA Fragments. 2008 NAR Online Feb 19th. &lt;br /&gt;
# Dinsdale, E. A., O. Pantos, S. Smriga, R. A. Edwards, F. Angly, L. Wegley, M. Hatay, D. Hall, E. Brown, M. Haynes, L. Krause, E. Sala, S. A. Sandin, R. V. Thurber, B. L. Willis, F. Azam, N. Knowlton, and F. Rohwer. 2008. Microbial ecology of four coral atolls in the northern line islands. PLoS ONE 3:e1584.&lt;br /&gt;
# Desnues, C.G., Rodriguez-Brito, B., Rayhawk, S., Kelley, S., Tran, T., Haynes, M., Liu, H., Hall, D., Angly, F.E., Edwards, R.A., Vega, R., Breitbart, M., Siefert, J., Souza, V., Reid, P., Rohwer, F. 2008. Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature 20;452(7185):340-343.&lt;br /&gt;
# Mou, X., Sun, S., Edwards, R., Hodson, R., and Moran, M.A. 2008. Bacterial Carbon Processing by Generalist Species in the Coastal Ocean. Nature. 451:709-711.&lt;br /&gt;
# Field, D., Garrity, G., Gray, T., Morrison, N., Selengut, J., Sterk, P., Tatusova, T., Thomson, N., Allen, M.J., Ashburner, M., Baldauf, S., Ballard, S., Boore, J., Cochrane, G., Cole, J., dePamphilis, C., Edwards, R., Faruque, N., Feldman, R., Glockner, F-O., Haft, D., Hancock, D., Hermjakob, H., Hertz-Fowler, C., Hugenholtz, P., Joint, I., Kane, M., Kennedy, J., Kowalchuk, G., Kottmann, R., Kolker, E., Kyrpides, N., Leebens-Mack, J., Lewis, S.E., Liste, A., Lord, P., Maltsev, N., Markowitz, V., Martiny, J., Methe, B., Moxon, R., Nelson, K., Parkhill, J., Sansone, S-A., Spiers, A., Stevens, R., Swift, P., Taylor, C., Tateno, Y., Tett, A., Turner, S., Ussery, D., Vaughan, B., Ward, N., Whetzel, T., Wilson, G., and Wipat, A., 2008. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnology, 26:541-547.&lt;br /&gt;
&lt;br /&gt;
=In press=&lt;br /&gt;
&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1806</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1806"/>
		<updated>2008-05-14T15:02:36Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
# DeJongh, M., Formsma, K., Boillot, P.,  Gould, J.,  Rycenga M.,   and Best, A. (2007) Toward the automated generation of genome-scale metabolic networks in the SEED. BMC Bioinformatics 2007 [http://www.ncbi.nlm.nih.gov/pubmed/17462086 Pubmed] [http://www.biomedcentral.com/1471-2105/8/139 BMC Bioinformatics]&lt;br /&gt;
&lt;br /&gt;
=2008=&lt;br /&gt;
# Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. (2008) , The RAST Server: rapid annotations using subsystems technology. BMC Genomics [http://www.ncbi.nlm.nih.gov/pubmed/18261238 Pubmed] [http://www.biomedcentral.com/1471-2164/9/75 BMC Genomics]&lt;br /&gt;
# Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., Brulc, J., Chau, B., Furlan, M., Desnues, C.G., Haynes, M., Li, L., McDaniel, L., Moran, M.A., Nelson, K.E., Nilsson, C., Olson, R., Paul, J., Rodriguez-Brito, B., Swan, B., Stevens, R., Valentine, D., Vega-Thurber, R., Wegley, L., White, B., and F. Rohwer. 2008. Functional Metagenomic Profiling of Nine Biomes. Nature 452:629-632. [http://www.ncbi.nlm.nih.gov/pubmed/18337718 Pubmed] [http://www.nature.com/nature/journal/vaop/ncurrent/full/nature06810.html Nature]&lt;br /&gt;
# Krause, L., Diaz, N., Edwards, R.A., Rohwer, F., Kelley, S., and J. Stoye. 2008 Taxonomic Classification of Short Environmental DNA Fragments. 2008 NAR Online Feb 19th. &lt;br /&gt;
# Dinsdale, E. A., O. Pantos, S. Smriga, R. A. Edwards, F. Angly, L. Wegley, M. Hatay, D. Hall, E. Brown, M. Haynes, L. Krause, E. Sala, S. A. Sandin, R. V. Thurber, B. L. Willis, F. Azam, N. Knowlton, and F. Rohwer. 2008. Microbial ecology of four coral atolls in the northern line islands. PLoS ONE 3:e1584.&lt;br /&gt;
# Desnues, C.G., Rodriguez-Brito, B., Rayhawk, S., Kelley, S., Tran, T., Haynes, M., Liu, H., Hall, D., Angly, F.E., Edwards, R.A., Vega, R., Breitbart, M., Siefert, J., Souza, V., Reid, P., Rohwer, F. 2008. Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature 20;452(7185):340-343.&lt;br /&gt;
Mou, X., Sun, S., Edwards, R., Hodson, R., and Moran, M.A. 2008. Bacterial Carbon Processing by Generalist Species in the Coastal Ocean. Nature. 451:709-711.&lt;br /&gt;
# Field, D., Garrity, G., Gray, T., Morrison, N., Selengut, J., Sterk, P., Tatusova, T., Thomson, N., Allen, M.J., Ashburner, M., Baldauf, S., Ballard, S., Boore, J., Cochrane, G., Cole, J., dePamphilis, C., Edwards, R., Faruque, N., Feldman, R., Glockner, F-O., Haft, D., Hancock, D., Hermjakob, H., Hertz-Fowler, C., Hugenholtz, P., Joint, I., Kane, M., Kennedy, J., Kowalchuk, G., Kottmann, R., Kolker, E., Kyrpides, N., Leebens-Mack, J., Lewis, S.E., Liste, A., Lord, P., Maltsev, N., Markowitz, V., Martiny, J., Methe, B., Moxon, R., Nelson, K., Parkhill, J., Sansone, S-A., Spiers, A., Stevens, R., Swift, P., Taylor, C., Tateno, Y., Tett, A., Turner, S., Ussery, D., Vaughan, B., Ward, N., Whetzel, T., Wilson, G., and Wipat, A., 2008. Towards a richer description of our complete collection of genomes and metagenomes: the “Minimum Information about a Genome Sequence” (MIGS) specification. Nature Biotechnology, 26:541-547.&lt;br /&gt;
&lt;br /&gt;
=In press=&lt;br /&gt;
&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1799</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1799"/>
		<updated>2008-03-24T02:17:21Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]. For the metagenomics service please also read this explanation of [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: [http://greengenes.lbl.gov/cgi-bin/nph-index.cgi greengenes], [http://rdp.cme.msu.edu/ RDP-II] and [http://bioinformatics.psb.ugent.be/webtools/rRNA/ European ribosomal RNA database].&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[Image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found (&amp;quot;Browse annotated genome in SEED Viewer&amp;quot;). Users can also download the results in compressed GenBank format.&lt;br /&gt;
&lt;br /&gt;
===MetaGenome Overview===&lt;br /&gt;
&lt;br /&gt;
The MetaGenome Overview provides the user with various statistics regarding their metagenome and details on how each of these numbers are calculated can be found [[MG-RAST_Numbers|here]].&lt;br /&gt;
&lt;br /&gt;
[[Image:MG-RAST-sample-overview.png]]&lt;br /&gt;
&lt;br /&gt;
Users can search for a given function, subsystem or process in the table, or browse the Subsystem Overview. At the top right hand side of the page is a set of tabs that offer a wide set of information to browse, explore, compare and download. Browse allows users to look through the features of this metagenome either graphically or through a table. Both allow quick navigation and filtering for features of your interest. Each feature is linked to its own detail page. Explore allows users to view scenarios. Scenarios are isolated metabolic divisions that in aggregate represent the metabolic functionality of the metagenome. Each scenario is tested for reaction availability against the annotated functions. They provide the foundation for generating a metabolic reconstruction. Comparison of two metagenomes is also possible via the compare tab. You can also export all information about this metagenome (e.g. annotations, scenarios, subsystems) into a variety of formats (e.g. EMBL, Excel) for further analysis on your own system.&lt;br /&gt;
&lt;br /&gt;
==16S Sequences==&lt;br /&gt;
&lt;br /&gt;
The metagenomics-RAST is primarily designed to handle random community genomes. At the moment, we only provide rudimentary support for 16S DNA sequence analysis, although this is near the very top of our to-do list.&lt;br /&gt;
&lt;br /&gt;
Our colleagues at San Diego State University have developed two different tools for handling 16S rDNA sequences. FastGroup, a stand-alone java application ([http://www.ncbi.nlm.nih.gov/pubmed/11707150?ordinalpos=2&amp;amp;itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum Seguritan V and Rohwer F. (2001) FastGroup: a program to dereplicate libraries of 16S rDNA sequences]. BMC Bioinformatics. 2:9. Epub 2001 Oct 16.) is the original program, and it was updated to FastGroupII ([http://www.ncbi.nlm.nih.gov/pubmed/16464253?ordinalpos=1&amp;amp;itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum Yu Y, Breitbart M, McNairnie P, and Rohwer F. (2006) FastGroupII: a web-based bioinformatics platform for analyses of large 16S rDNA libraries]. BMC Bioinformatics. Feb 7;7:57.). We have provided some [[instructions for using FastGroupII with large data sets]]. We recommend FastGroupII for clustering and primary analysis of 16S libraries, and then the data from that can be fed into RDP Classifier and other programs.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Instructions_for_using_FastGroupII_with_large_data_sets&amp;diff=1798</id>
		<title>Instructions for using FastGroupII with large data sets</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Instructions_for_using_FastGroupII_with_large_data_sets&amp;diff=1798"/>
		<updated>2008-03-24T02:15:55Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= URL =&lt;br /&gt;
&lt;br /&gt;
[http://biome.sdsu.edu/fastgroup/ http://biome.sdsu.edu/fastgroup/]&lt;br /&gt;
&lt;br /&gt;
= Overview =&lt;br /&gt;
&lt;br /&gt;
The FastGroup and FastGroupII programs were written by [http://phage.sdsu.edu Forest Rohwer]'s group at [http://www.sdsu.edu/ San Diego State University]. These programs were really written and tested on small 16S data sets typically from Sanger Sequencing. However, as is often the case, they work well for larger datasets like a single 454 run focussing on 16S genes. &lt;br /&gt;
&lt;br /&gt;
FastGroup uses different clustering algorithms to generate both Richness (Chao1) and Diversity (Shannon-Wiener index) measures, as well as returning information about the groups and the statistics.&lt;br /&gt;
&lt;br /&gt;
= Tips =&lt;br /&gt;
&lt;br /&gt;
* I recommend removing the trimming options (by unchecking the boxes) unless you have a specific sequence you want to trim to. &lt;br /&gt;
* '''Please do not request the rarefaction curves'''  These are computationally very expensive, and will take a long time to compute on large data sets. &lt;br /&gt;
* For Mac users, the calculation may take some time to compute and return. I recommend using firefox rather than safari. The latter times out after a few minutes whereas firefox keeps waiting.&lt;br /&gt;
* If you have problems with this site, please email the mg-rast mailing list. Although we do not control FastGroupII, we know the developers and will work with them. We are particularly concerned not to direct too much load on their server while we figure out a solution compatible with the mg-rast.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Instructions_for_using_FastGroupII_with_large_data_sets&amp;diff=1797</id>
		<title>Instructions for using FastGroupII with large data sets</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Instructions_for_using_FastGroupII_with_large_data_sets&amp;diff=1797"/>
		<updated>2008-03-24T02:15:39Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= URL =&lt;br /&gt;
&lt;br /&gt;
[http://biome.sdsu.edu/fastgroup/]&lt;br /&gt;
&lt;br /&gt;
= Overview =&lt;br /&gt;
&lt;br /&gt;
The FastGroup and FastGroupII programs were written by [http://phage.sdsu.edu Forest Rohwer]'s group at [http://www.sdsu.edu/ San Diego State University]. These programs were really written and tested on small 16S data sets typically from Sanger Sequencing. However, as is often the case, they work well for larger datasets like a single 454 run focussing on 16S genes. &lt;br /&gt;
&lt;br /&gt;
FastGroup uses different clustering algorithms to generate both Richness (Chao1) and Diversity (Shannon-Wiener index) measures, as well as returning information about the groups and the statistics.&lt;br /&gt;
&lt;br /&gt;
= Tips =&lt;br /&gt;
&lt;br /&gt;
* I recommend removing the trimming options (by unchecking the boxes) unless you have a specific sequence you want to trim to. &lt;br /&gt;
* '''Please do not request the rarefaction curves'''  These are computationally very expensive, and will take a long time to compute on large data sets. &lt;br /&gt;
* For Mac users, the calculation may take some time to compute and return. I recommend using firefox rather than safari. The latter times out after a few minutes whereas firefox keeps waiting.&lt;br /&gt;
* If you have problems with this site, please email the mg-rast mailing list. Although we do not control FastGroupII, we know the developers and will work with them. We are particularly concerned not to direct too much load on their server while we figure out a solution compatible with the mg-rast.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1796</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1796"/>
		<updated>2008-03-24T02:05:15Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]. For the metagenomics service please also read this explanation of [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: [http://greengenes.lbl.gov/cgi-bin/nph-index.cgi greengenes], [http://rdp.cme.msu.edu/ RDP-II] and [http://bioinformatics.psb.ugent.be/webtools/rRNA/ European ribosomal RNA database].&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[Image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found (&amp;quot;Browse annotated genome in SEED Viewer&amp;quot;). Users can also download the results in compressed GenBank format.&lt;br /&gt;
&lt;br /&gt;
===MetaGenome Overview===&lt;br /&gt;
&lt;br /&gt;
The MetaGenome Overview provides the user with various statistics regarding their metagenome and details on how each of these numbers are calculated can be found [[MG-RAST_Numbers|here]].&lt;br /&gt;
&lt;br /&gt;
[[Image:MG-RAST-sample-overview.png]]&lt;br /&gt;
&lt;br /&gt;
Users can search for a given function, subsystem or process in the table, or browse the Subsystem Overview. At the top right hand side of the page is a set of tabs that offer a wide set of information to browse, explore, compare and download. Browse allows users to look through the features of this metagenome either graphically or through a table. Both allow quick navigation and filtering for features of your interest. Each feature is linked to its own detail page. Explore allows users to view scenarios. Scenarios are isolated metabolic divisions that in aggregate represent the metabolic functionality of the metagenome. Each scenario is tested for reaction availability against the annotated functions. They provide the foundation for generating a metabolic reconstruction. Comparison of two metagenomes is also possible via the compare tab. You can also export all information about this metagenome (e.g. annotations, scenarios, subsystems) into a variety of formats (e.g. EMBL, Excel) for further analysis on your own system.&lt;br /&gt;
&lt;br /&gt;
==16S Sequences==&lt;br /&gt;
&lt;br /&gt;
The metagenomics-RAST is primarily designed to handle random community genomes. At the moment, we only provide rudimentary support for 16S DNA sequence analysis, although this is near the very top of our to-do list.&lt;br /&gt;
&lt;br /&gt;
Our colleagues at San Diego State University have developed two different tools for handling 16S rDNA sequences. FastGroup, a stand-alone java application ([http://www.ncbi.nlm.nih.gov/pubmed/11707150?ordinalpos=2&amp;amp;itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum Seguritan V and Rohwer F. (2001) FastGroup: a program to dereplicate libraries of 16S rDNA sequences]. BMC Bioinformatics. 2:9. Epub 2001 Oct 16.) is the original program, and it was updated to FastGroupII ([http://www.ncbi.nlm.nih.gov/pubmed/16464253?ordinalpos=1&amp;amp;itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum Yu Y, Breitbart M, McNairnie P, and Rohwer F. (2006) FastGroupII: a web-based bioinformatics platform for analyses of large 16S rDNA libraries]. BMC Bioinformatics. Feb 7;7:57.). We have provided some [[instructions for using FastGroupII with large data sets]].&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1782</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1782"/>
		<updated>2008-01-08T06:18:44Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* 2006 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Gerdes, S., Kurnasov, O., Shatalin, K., Polanuyer, B., Sloutsky, R., Vonstein, V., Overbeek, R., and A.L. Osterman (2006). Comparative Genomics of NAD Biosynthesis in Cyanobacteria. J. Bacteriol. 188: 3012-3023. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16585762&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/content/abstract/188/8/3012 J. Bact]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
=In press=&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1781</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1781"/>
		<updated>2007-12-31T21:55:28Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=2005=&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
=2006=&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
=2007=&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
=In press=&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1780</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1780"/>
		<updated>2007-12-31T21:54:51Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;*2005&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
*2006&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
*2007&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
*In press&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1779</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1779"/>
		<updated>2007-12-31T21:54:17Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;*2005&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
*2006&lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855 PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6VRV-4KWTFBB-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=3067cb821f4b494047d5270860f5557c Science Direct]&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394 PubMed] [http://jb.asm.org/cgi/content/full/188/23/8259?view=long&amp;amp;pmid=17012394 JBact]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666 PubMed] [http://www.jbc.org/cgi/content/full/281/40/29872 JBC]&lt;br /&gt;
*2007&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474 PubMed]&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515 PubMed] [http://www.pnas.org/cgi/content/full/104/11/4279 PNAS]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1778</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1778"/>
		<updated>2007-12-31T21:46:40Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16857666&lt;br /&gt;
&lt;br /&gt;
# Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17012394&lt;br /&gt;
&lt;br /&gt;
# Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16978855.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17195474&lt;br /&gt;
&lt;br /&gt;
# Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17360515&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17690205 PubMed] [http://www.genome.org/cgi/content/full/17/9/1362 Genome Research]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1777</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1777"/>
		<updated>2007-12-31T21:43:19Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
1.	Ye, Y., Osterman, A., Overbeek, R., Godzik, A. (2005) Automatic detection of subsystem/pathway variants in genome analysis. BIOINFORMATICS 21,  Suppl. 1, i1–i9. PMID: 15961494&lt;br /&gt;
2.	Osterman, A. (2006). A hidden metabolic pathway exposed. Proc Natl Acad Sci U S A 103:5637-8. PMID: 16595627&lt;br /&gt;
3.	Gerdes, S. Y., O. V. Kurnasov, K. Shatalin, B. Polanuyer, R. Sloutsky, V. Vonstein, R. Overbeek, and A. L. Osterman (2006). Comparative genomics of NAD biosynthesis in cyanobacteria. J Bacteriol 188:3012-23. PMID: 16585762&lt;br /&gt;
4.	Yang, C., D. A. Rodionov, X. Li, O. N. Laikova, M. S. Gelfand, O. P. Zagnitko, M. F. Romine, A. Y. Obraztsova, K. H. Nealson, and A. L. Osterman. (2006). Comparative genomics and experimental characterization of N-acetylglucosamine utilization pathway of Shewanella oneidensis. J Biol Chem 281:29872-85. PMID: 16857666&lt;br /&gt;
5.	Joyce AR, Reed JL, White A, Edwards R, Osterman A, Baba T, Mori H, Lesely SA, Palsson BO, Agarwalla S., (2006). Experimental and computational assessment of conditionally essential genes in E. coli. J Bacteriol. 188(23):8259-71. PMID: 17012394&lt;br /&gt;
6.	Gerdes, S., R. Edwards, M. Kubal, M. Fonstein, R. Stevens, and A. Osterman. (2006). Essential genes on metabolic maps. Curr Opin Biotechnol 17:448-56. PMID: 16978855.&lt;br /&gt;
7.	Osterman, A., and T. Begley. (2007). A Subsystems-based approach to the identification of drug targets in bacterial pathogens, p. 132-170. In H.Boshoff and C.Barry (ed.), Progress in Drug Research, vol. 64. Birkhauser Verlag, Basel. PMID: 17195474&lt;br /&gt;
8.	Dmitry A. Rodionov, Oleg V. Kurnasov, Boguslaw Stec, Yan Wang, Mary F. Roberts, and Andrei L. Osterman (2007) Genomic identification and in vitro reconstitution of a complete biosynthetic pathway for the osmolyte di-myo-inositol-phosphate. . Proc Natl Acad Sci U S A. Mar 13;104(11):4279-84. PMID: 17360515&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Gupta N, Tanner S, Jaitly N, Adkins JN, Lipton M, Edwards R, Romine M, Osterman A, Bafna V, Smith RD, Pevzner PA. (2007) Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotation.Genome Res. Sep;17(9):1362-77. Epub 2007 Aug 9. PMID: 17690205&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# JR. Yates III and AL. Osterman (2007) Introduction: advances in genomics and proteomics. Chem Rev. Aug;107(8):3363-6. Epub 2007 Jul 21. [http://pubs3.acs.org/acs/journals/doilookup?in_doi=10.1021/cr068201u ACS]&lt;br /&gt;
&lt;br /&gt;
# Yang C, Rodionov DA, Rodionova IA, Li X, Osterman AL. (2007) Glycerate 2-kinase of Thermotoga maritima and genomic reconstruction of related metabolic pathways., J Bacteriol. 2007 Dec 21; [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=18156253&amp;amp;cmd_current= PubMed] [http://jb.asm.org/cgi/reprint/JB.01469-07v1?view=long&amp;amp;pmid=18156253 JBact]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1776</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1776"/>
		<updated>2007-12-31T21:40:23Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# El Yacoubi B, Bonnett S, Anderson JN, Swairjo MA, Iwata-Reuyl D, de Crécy-Lagard V. (2006) Discovery of a new prokaryotic type I GTP cyclohydrolase family. J Biol Chem. Dec 8;281(49):37586-93. Epub 2006 Oct 10. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17032654&amp;amp;cmd_current= PubMed] [http://www.jbc.org/cgi/content/full/281/49/37586 JBC]# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# de Crecy-Lagard V, Hanson AD. (2007) Finding novel metabolic genes through plant-prokaryote phylogenomics. Trends Microbiol. Dec;15(12):563-70. Epub 2007 Nov 9. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17997099&amp;amp;cmd_current= PubMed] [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6TD0-4R3BW70-1&amp;amp;_user=10&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_acct=C000050221&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=10&amp;amp;md5=45c7be528111535816858125516a93ad Trends]&lt;br /&gt;
# de Crecy-Lagard V, El Yacoubi B, de la Garza RD, Noiriel A, Hanson AD. (2007) Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations. BMC Genomics. Jul 23;8:245.[http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17645794&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/8/245 Biomed Central]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1775</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1775"/>
		<updated>2007-12-31T19:30:29Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplemental Online Material] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1774</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1774"/>
		<updated>2007-12-31T19:28:05Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html SOM] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1773</id>
		<title>Papers that use the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Papers_that_use_the_SEED&amp;diff=1773"/>
		<updated>2007-12-31T19:27:38Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Papers related to the SEED =&lt;br /&gt;
&lt;br /&gt;
# Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edwards R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V. (2005) Oct 7;33(17):5691-702. [http://www.ncbi.nlm.nih.gov/pubmed/16214803 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/33/17/5691 NAR] [http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html SOM] &lt;br /&gt;
# Angly, F.E., Felts, B., Breitbart, M., Salamon, P., Edwards, R.A., Carlson, C., Chan, A.M., Haynes, M., Kelley, S., Liu, H. et al. (2006) The marine viromes of four oceanic regions. PLoS Biol, 4. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=The%20marine%20viromes%20of%20four%20oceanic%20regions&amp;amp;cmd_current= PubMed] [http://biology.plosjournals.org/perlserv/?request=get-document&amp;amp;doi=10.1371/journal.pbio.0040368&amp;amp;ct=1 PLoS Biology]&lt;br /&gt;
# DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N.U., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R. et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science, 311, 496-503. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=Community%20genomics%20among%20stratified%20microbial%20assemblages%20in%20the%20ocean%27s%20interior&amp;amp;cmd_current= PubMed]&lt;br /&gt;
# Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., Peterson, D.M., Saar, M.O., Alexander, S., Alexander, E.C., Jr. and Rohwer, F. (2006) Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics, 7, 57. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16549033&amp;amp;cmd_current= PubMed] [http://www.biomedcentral.com/1471-2164/7/57 BioMed Central]&lt;br /&gt;
# Krause, L., Diaz, N.N., Bartels, D., Edwards, R.A., Puhler, A., Rohwer, F., Meyer, F. and Stoye, J. (2006) Finding novel genes in bacterial communities isolated from the environment. Bioinformatics, 22, e281. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=16873483&amp;amp;cmd_current= PubMed] [http://bioinformatics.oxfordjournals.org/cgi/reprint/22/14/e281 NAR]&lt;br /&gt;
# Rodriguez-Brito, B., Rohwer, F. and Edwards, R. (2006) An application of statistics to comparative metagenomics. BMC Bioinformatics, 7, 162.&lt;br /&gt;
# Fierer, N., Breitbart, M., Nulton, J., Salamon, P., Lozupone, P., Jones, R., Robeson, M., Edwards, R., Felts, B., Rayhawk, S. et al. (2007) Metagenomic and small-subunit RNA surveys reveal the high genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol, 73, 7059-7066. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17827313&amp;amp;cmd_current= PubMed] [http://aem.asm.org/cgi/content/full/73/21/7059?view=long&amp;amp;pmid=17827313 AEM]&lt;br /&gt;
# McNeil LK, Reich C, Aziz RK, Bartels D, Cohoon M, Disz T, Edwards RA, Gerdes SY, Hwang K, Kubal M, Margaryan GR, Meyer F, Mihalo W, Olsen GJ, Olson R, Osterman AL, Paarmann D, Paczian T, Parrello B, Pusch GD, Rodionov DA, Shi X, Vassieva O, Vonstein V, Zagnitko OP, Xia F, Zinner J, Overbeek R, Stevens R.(2007) The National Microbial Pathogen Database Resource (NMPDR): A genomics platform based on subsystem annotation. Nucleic Acids Res. 2007 Jan;35(Database issue):D347-53 [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17145713 PubMed] [http://nar.oxfordjournals.org/cgi/content/full/35/suppl_1/D347 NAR]&lt;br /&gt;
# Wegley, L., Breitbart, M., Edwards, R. and Rohwer, F. (2007) Functional and taxonimic analysis of coral-associated microbes using metagenomic analysis. Environ Microbiol, 9, 2707-2719. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&amp;amp;orig_db=PubMed&amp;amp;term=17922755&amp;amp;cmd_current= PubMed] [http://www.blackwell-synergy.com/doi/abs/10.1111/j.1462-2920.2007.01383.x Environ Microbiol]&lt;br /&gt;
&lt;br /&gt;
==In press==&lt;br /&gt;
&lt;br /&gt;
# Desnues CG, B Rodriguez-Brito, S Rayhawk, S Kelley, T Tran, M Haynes, H Liu, D Hall, FE Angly, RA Edwards, RB Thurber, P Reid, J Siefert, V Souza, DL Valentine, BK Swan, M Breitbart, F Rohwer (2008; accepted) Biodiversity and biogeography of phages in modern stromatolites and thrombolites. Nature In press.&lt;br /&gt;
# Mou, X.S., S., Edwards, R.A., Hodson, R.E. and Moran, M.A. (2007) Generalist Species Dominate Bacterial Carbon Processing in the Coastal Ocean. Nature, In Press.&lt;br /&gt;
# Breitbart M, M Haynes, S Kelley, F Angly, R Edwards, B Felts, JM Mahaffy, J Mueller, J Nulton, S Rayhawk, B Rodriguez-Brito, P Salamon, F Rohwer (2008) Viral diversity and dynamics in an infant's gut.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
~~&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Home_of_the_SEED&amp;diff=1772</id>
		<title>Home of the SEED</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Home_of_the_SEED&amp;diff=1772"/>
		<updated>2007-12-31T18:39:13Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;With the growing number of available genomes, the need for an environment to support effective comparative analysis increases. The original SEED Project was started in 2003 by the [http://thefig.info Fellowship for Interpretation of Genomes (FIG)] as a largely unfunded open source effort.  Argonne National Laboratory and the University of Chicago joined the project, and now much of the activity occurs at those two institutions (as well as the University of Illinois at Urbana-Champaign, Hope college, San Diego State University, the Burnham Institute and a number of other institutions).  The cooperative effort focuses on the development of the comparative genomics environment called the SEED and, more importantly, on the development of curated genomic data. &lt;br /&gt;
&lt;br /&gt;
We provide a [http://seed-viewer.theseed.org/ SEED-Viewer] that allows read-only access to the latest curated data sets. For users interested in editing and learning how to use the system, we provide the [http://theseed.uchicago.edu/FIG/index.cgi Trial-SEED]. As described in our [[Annotating_1000_genomes|manifesto]] the [[Glossary#Annotation|annotation]] is not performed on a gene by gene basis per genome, but rather by [[Glossary#Subsystem|subsystem]] by an expert curator across many genomes at a time.  &lt;br /&gt;
&lt;br /&gt;
Answering numerous requests for automatic Seed-Quality annotations for more or less complete bacterial and archaeal genomes, we have established the free [http://rast.nmpdr.org RAST-Server] (RAST=Rapid Annotation using Subsytems Technology). Using similar technology, we make the [http://metagenomics.nmpdr.org Metagenomics-RAST-Server] freely available.&lt;br /&gt;
&lt;br /&gt;
We make all our software and data available for download and use on our [[DownloadPage]] page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* When using the SEED, please cite: Overbeek et al., [http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;amp;db=pubmed&amp;amp;dopt=Abstract&amp;amp;list_uids=16214803&amp;amp;query_hl=2&amp;amp;itool=pubmed_docsum|Nucleic Acids Res 33(17)], 2005  ([http://www.theseed.org/SubsystemPaperSupplementalMaterial/index.html Supplementary material])&lt;br /&gt;
* Our approaches to annotation, gene calling etc are outlined in a series of [[SOPs|Standard Operating Procedures]].&lt;br /&gt;
* Other [[papers that use the SEED]]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1751</id>
		<title>Metagenomics sequence formats</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1751"/>
		<updated>2007-11-04T19:36:11Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Common Errors ==&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not the right [[Metagenomics sequence formats|file formats]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
# See also [[Which Sequences Should I Upload, and Where]]&lt;br /&gt;
&lt;br /&gt;
== File formats ==&lt;br /&gt;
&lt;br /&gt;
To upload sequence data to the metagenomics RAST server, we accept several file formats. &lt;br /&gt;
&lt;br /&gt;
* You can upload a fasta file containing '''just the nucleotide sequences'''. This is the simplest format, just have a regular [[Valid fasta format]] nucleotide sequence file, and upload it. However, there may be some limitation on the file size. In this case the file name should end .fa, .fasta, or .fna.&lt;br /&gt;
&lt;br /&gt;
* You can compress the sequence file containing '''just the nucleotide sequences''' with tar and gzip a popular compression tool. This will significantly reduce the size of the file to upload, and hence speed things up. In this case the file name should end .tgz and the fasta file should end .fa, .fasta, or .fna.&lt;br /&gt;
&lt;br /&gt;
* You can also include a '''separate''' quality file in this same compressed file. To do this, compress both files into a single archive and then upload the archive.tgz file (don't worry, we'll take care of the name, you can call it whatever you want!):&lt;br /&gt;
&lt;br /&gt;
    tar zcf archive.tgz sequence.fa sequence.qual&lt;br /&gt;
&lt;br /&gt;
If you do this, we will renumber the sequences and their corresponding quality scores at the same time. At the moment we don't use the quality scores, although we are experimenting with assembly tools that may take advantage of them. Therefore, the inclusion of quality scores is completely optional.&lt;br /&gt;
&lt;br /&gt;
* Please note that at the moment we only accept tar/gzipped compressed formats, and if you upload other formats the upload will fail. Sorry.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1750</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1750"/>
		<updated>2007-11-02T04:39:46Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]. For the metagenomics service please also read this explanation of [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: [http://greengenes.lbl.gov/cgi-bin/nph-index.cgi greengenes], [http://rdp.cme.msu.edu/ RDP-II] and [http://bioinformatics.psb.ugent.be/webtools/rRNA/ European ribosomal RNA database].&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1741</id>
		<title>Metagenomics sequence formats</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1741"/>
		<updated>2007-10-05T07:48:38Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* File formats */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Common Errors ==&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not the right [[Metagenomics sequence formats|file formats]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
# See also [[Which Sequences Should I Upload, and Where]]&lt;br /&gt;
&lt;br /&gt;
== File formats ==&lt;br /&gt;
&lt;br /&gt;
To upload sequence data to the metagenomics RAST server, we accept several file formats. &lt;br /&gt;
&lt;br /&gt;
* You can upload a fasta file containing '''just the nucleotide sequences'''. This is the simplest format, just have a regular [[Valid fasta format]] nucleotide sequence file, and upload it. However, there may be some limitation on the file size.&lt;br /&gt;
&lt;br /&gt;
* You can compress the sequence file containing '''just the nucleotide sequences''' with [http://www.gzip.org/ gzip], a popular compression tool. This will significantly reduce the size of the file to upload, and hence speed things up.&lt;br /&gt;
&lt;br /&gt;
* You can also include a '''separate''' quality file in this same compressed file. To do this, compress both files into a single archive and then upload the archive.gz file (don't worry, we'll take care of the name, you can call it whatever you want!):&lt;br /&gt;
&lt;br /&gt;
    gzip archive.gz sequence.fa sequence.qual&lt;br /&gt;
&lt;br /&gt;
If you do this, we will renumber the sequences and their corresponding quality scores at the same time. At the moment we don't use the quality scores, although we are experimenting with assembly tools that may take advantage of them. Therefore, the inclusion of quality scores is completely optional.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1740</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1740"/>
		<updated>2007-10-05T07:47:43Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Which Seqeunces Should I Upload? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not the right [[Metagenomics sequence formats|file formats]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system. Please also read this explanation of appropriate [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[[Valid fasta format]]'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1739</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1739"/>
		<updated>2007-10-05T07:47:22Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Which Seqeunces Should I Upload? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not in the right [[Metagenomics sequence formats|file formats]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system. Please also read this explanation of appropriate [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[[Valid fasta format]]'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1738</id>
		<title>Metagenomics sequence formats</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1738"/>
		<updated>2007-10-05T07:46:21Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* File formats */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== File formats ==&lt;br /&gt;
&lt;br /&gt;
To upload sequence data to the metagenomics RAST server, we accept several file formats. &lt;br /&gt;
&lt;br /&gt;
* You can upload a fasta file containing '''just the nucleotide sequences'''. This is the simplest format, just have a regular [[Valid fasta format]] nucleotide sequence file, and upload it. However, there may be some limitation on the file size.&lt;br /&gt;
&lt;br /&gt;
* You can compress the sequence file containing '''just the nucleotide sequences''' with [http://www.gzip.org/ gzip], a popular compression tool. This will significantly reduce the size of the file to upload, and hence speed things up.&lt;br /&gt;
&lt;br /&gt;
* You can also include a '''separate''' quality file in this same compressed file. To do this, compress both files into a single archive and then upload the archive.gz file (don't worry, we'll take care of the name, you can call it whatever you want!):&lt;br /&gt;
&lt;br /&gt;
    gzip archive.gz sequence.fa sequence.qual&lt;br /&gt;
&lt;br /&gt;
If you do this, we will renumber the sequences and their corresponding quality scores at the same time. At the moment we don't use the quality scores, although we are experimenting with assembly tools that may take advantage of them. Therefore, the inclusion of quality scores is completely optional.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1737</id>
		<title>Metagenomics sequence formats</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1737"/>
		<updated>2007-10-05T07:45:37Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* File formats */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== File formats ==&lt;br /&gt;
&lt;br /&gt;
To upload sequence data to the metagenomics RAST server, we accept several file formats. &lt;br /&gt;
&lt;br /&gt;
* You can upload a fasta file containing ''just the nucleotide sequences''. This is the simplest format, just have a regular [[Valid fasta format]] nucleotide sequence file, and upload it. However, there may be some limitation on the file size.&lt;br /&gt;
&lt;br /&gt;
* You can compress the sequence file containing ''just the nucleotide sequences'' with [http://www.gzip.org/ gzip], a popular compression tool. This will significantly reduce the size of the file to upload, and hence speed things up.&lt;br /&gt;
&lt;br /&gt;
* You can also include a ''separate'' quality file in this same compressed file. To do this, compress both files into a single archive and then upload the archive.gz file (don't worry, we'll take care of the name, you can call it whatever you want!):&lt;br /&gt;
&lt;br /&gt;
    gzip archive.gz sequence.fa sequence.qual&lt;br /&gt;
&lt;br /&gt;
If you do this, we will renumber the sequences and their corresponding quality scores at the same time. At the moment we don't use the quality scores, although we are experimenting with assembly tools that may take advantage of them. Therefore, the inclusion of quality scores is completely optional.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1736</id>
		<title>Metagenomics sequence formats</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Metagenomics_sequence_formats&amp;diff=1736"/>
		<updated>2007-10-05T07:45:00Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: What files to upload&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== File formats ==&lt;br /&gt;
&lt;br /&gt;
To upload sequence data to the metagenomics RAST server, we accept several file formats. &lt;br /&gt;
&lt;br /&gt;
* You can upload a fasta file containing ''just the nucleotide sequences''. This is the simplest format, just have a regular [[Valid fasta format]] nucleotide sequence file, and upload it. However, there may be some limitation on the file size.&lt;br /&gt;
&lt;br /&gt;
* You can compress the sequence file containing ''just the nucleotide sequences'' with [http://www.gzip.org/ gzip], a popular compression tool. This will significantly reduce the size of the file to upload, and hence speed things up.&lt;br /&gt;
&lt;br /&gt;
* You can also include a ''separate'' quality file in this same compressed file. To do this, compress both files into a single archive:&lt;br /&gt;
&lt;br /&gt;
    gzip archive.gz sequence.fa sequence.qual&lt;br /&gt;
   &lt;br /&gt;
    and then upload the archive.gz file (don't worry, we'll take care of the name!)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
If you do this, we will renumber the sequences and their corresponding quality scores at the same time. At the moment we don't use the quality scores, although we are experimenting with assembly tools that may take advantage of them. Therefore, the inclusion of quality scores is completely optional.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1735</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1735"/>
		<updated>2007-10-05T07:27:51Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]. For the metagenomics service please also read this explanation of [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: GREENGENES, RDP-II and European ribosomal RNA database.&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1734</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1734"/>
		<updated>2007-10-05T07:27:18Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Metagenomics Rast Server */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system. Please also read this explanation of appropriate [[metagenomics sequence formats]].&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[[Valid fasta format]]'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1733</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1733"/>
		<updated>2007-10-05T07:26:20Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]. For the metagenomics service please also read this explanation of [[sequence formats]].&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: GREENGENES, RDP-II and European ribosomal RNA database.&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1732</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1732"/>
		<updated>2007-10-04T19:18:25Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Which Seqeunces Should I Upload? */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequences anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system:&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[[Valid fasta format]]'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1731</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1731"/>
		<updated>2007-10-04T19:16:48Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Remember! */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequnces anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system:&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[[Valid fasta format]]'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1730</id>
		<title>Which Sequences Should I Upload, and Where</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Which_Sequences_Should_I_Upload,_and_Where&amp;diff=1730"/>
		<updated>2007-10-04T19:16:05Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: Which sequences should be uploaded to the RAST or MG-RAST services&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Which Seqeunces Should I Upload? ==&lt;br /&gt;
&lt;br /&gt;
We have different options for publicly available sequence analysis, and some can take different types of sequence data:&lt;br /&gt;
&lt;br /&gt;
# The [[#Metagenomics Rast Server |Metagenomics RAST Server]] for metagenome sequences&lt;br /&gt;
# The [[#RAST Server|RAST Server]] for complete genomes&lt;br /&gt;
&lt;br /&gt;
Please note, that most of our users problems are because the sequences are&lt;br /&gt;
# Not in [[Valid fasta format]]&lt;br /&gt;
# Not nucleotide sequences. At the moment we don't have any service for annotating just protein sequences. If there is a large demand, we can add this, but you probably don't want to use genome annotation tools to annotate protein sequnces anyway.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Metagenomics Rast Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] is designed to annotate '''nucleotide''' sequences from metagenome projects. You can supply either assembled or unassembled data, and reads can be as short as 100 bp and as long as you would like. There are some caveats to the system:&lt;br /&gt;
&lt;br /&gt;
==== Unassembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to do statistical comparisons between metagenomes, you most likely need unassembled sequences. The frequency that any gene is found is an approximation of the abundance of that gene in the environment. Thus if you two different samples you can compare gene frequencies between them to figure out which are the important environments. In this case, just upload the unassembled '''nucleotide''' sequences in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
==== Assembled Data ====&lt;br /&gt;
&lt;br /&gt;
If you want to look for complete genes or pieces of a genome, then you can use assembled sequences. These are typically longer, and the ORF caller we use on the short fragments and sequences may have problems with longer sequences. On the ''to do list'' is to add specific ORF callers for different sequence sets. &lt;br /&gt;
&lt;br /&gt;
For sequences over about 1,000,000 bp (1 Mbp) you should consider pulling out those sequences individually and running them through the [[#RAST Server|RAST Server]]  server for complete genomes. This server uses far superior gene identification and analysis algorithms that are only applicable once you have longer sequences. However, the algorithm will not work very well with sequences under about 1 Mbp.  If you assemble sequences you will loose the frequency information, and cannot easily do statistical comparisons between metagenomes.&lt;br /&gt;
&lt;br /&gt;
=== RAST Server ===&lt;br /&gt;
&lt;br /&gt;
The [http://www.nmpdr.org/anno-server RAST Server] is designed for complete, or nearly complete, microbial genomes. This uses a novel form of gene calling based on protein families, that is described in our upcoming paper. One of the basics of this technique is understanding where your organism lies using phylogenomics to ensure that we get accurate ORF calling. It doesn't make sense to use this technique for metagenomics. Furthermore, the RAST Server also leverages the SEED's work on functional coupling, assigning functions based on nearby genes.&lt;br /&gt;
&lt;br /&gt;
The RAST Server is currently the best annotation platform for complete genomes.&lt;br /&gt;
&lt;br /&gt;
== Remember! == &lt;br /&gt;
&lt;br /&gt;
Remember:&lt;br /&gt;
* '''nucleotides only'''&lt;br /&gt;
* '''[Valid fasta format'''&lt;br /&gt;
* Metagenomes, assembled or unassembled, use the [http://metagenomics.nmpdr.org/ Metagenomics RAST Server] &lt;br /&gt;
* Complete genomes, or nearly complete (to the order of tens of contigs), use the [http://www.nmpdr.org/anno-server RAST Server]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1729</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1729"/>
		<updated>2007-10-04T18:56:57Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]. For more information, please see [[Which Sequences Should I Upload, and Where]]&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: GREENGENES, RDP-II and European ribosomal RNA database.&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1728</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1728"/>
		<updated>2007-10-04T18:54:28Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. &lt;br /&gt;
&lt;br /&gt;
In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Valid_fasta_format&amp;diff=1727</id>
		<title>Valid fasta format</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Valid_fasta_format&amp;diff=1727"/>
		<updated>2007-10-04T18:53:58Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the most frequent errors with uploading the data is incorrect file format. We recommend fasta format for all the sequence data to be uploaded. In particular, please check the following things:&lt;br /&gt;
&lt;br /&gt;
# There should be no spaces or tabs at the start or ends of the lines&lt;br /&gt;
# The identifier line should begin with a greater than sign &amp;quot;&amp;gt;&amp;quot;, and only one line is allowed&lt;br /&gt;
# Typically most bioinformatics applications use the first word after the &amp;gt; as the identifier for the sequence. Its nice (but not essential) if this is unique&lt;br /&gt;
# In the sequence lines (not header lines), spaces and numbers are removed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Examples of valid fasta&lt;br /&gt;
&lt;br /&gt;
    &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    &amp;gt;1 this is a sequence that i know something about&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
&lt;br /&gt;
Examples of invalid fasta&lt;br /&gt;
   &lt;br /&gt;
   &amp;gt;sequenceid&lt;br /&gt;
   This is a comment about the sequence&lt;br /&gt;
   gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
   ''Pleae don't include comments in the sequence data''&lt;br /&gt;
&lt;br /&gt;
          &amp;gt;sequenceid&lt;br /&gt;
    gatgcagcatgcagctagcagcgacggactac...&lt;br /&gt;
    ''please don't have spaces before the &amp;gt; in the identifier''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
fasta is probably the most common sequence format because it is relatively compact, and very easy to parse.&lt;br /&gt;
&lt;br /&gt;
There is more information about the fasta format at:&lt;br /&gt;
# [http://en.wikipedia.org/wiki/Fasta_format Wikipedia]&lt;br /&gt;
# [http://www.ncbi.nlm.nih.gov/blast/fasta.shtml NCBI]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=MG_RAST_Tutorial&amp;diff=1726</id>
		<title>MG RAST Tutorial</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=MG_RAST_Tutorial&amp;diff=1726"/>
		<updated>2007-10-04T18:44:36Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Overview===&lt;br /&gt;
&lt;br /&gt;
The metagenomics RAST server (http://metagenomics.nmpdr.org) is a SEED-based environment that allows users to upload metagenomes for automated analyses. The server is built as a modified version of the  RAST server. The  RAST (Rapid Annotation using Subsystem Technology) technology was originally implemented to allow automated high-quality annotation of complete or draft microbial genomes using SEED data, and has been adapted for metagenome analysis.&lt;br /&gt;
&lt;br /&gt;
Our freely available server provides the annotation of sequence fragments, their phylogenetic classification, functional classification of samples, and comparison between multiple metagenomes. The server also computes an initial metabolic reconstruction for the metagenome and allows comparison of metabolic reconstructions of metagenomes and genomes.&lt;br /&gt;
&lt;br /&gt;
User submission and analysis are confidential. Although we do not guarantee a maximum turnover time, the current average processing time is about 24 hours. Currently the server handles 454 and Sanger sequence data. Data sets supplied by 454 can be uploaded directly. In either case, the data needs to be in [[Valid fasta format]]&lt;br /&gt;
&lt;br /&gt;
The server relies on the technology and data established by FIG and the NMPDR team at Argonne National Laboratory and the University of Chicago.&lt;br /&gt;
&lt;br /&gt;
In addition to SEED data we use the following ribosomal RNA databases for our analyses: GREENGENES, RDP-II and European ribosomal RNA database.&lt;br /&gt;
&lt;br /&gt;
===Registration===&lt;br /&gt;
Registration is required for metagenome submission and viewing of results. This enables us to contact users once the computation is finished and in case the users intervention is required.&lt;br /&gt;
&lt;br /&gt;
At the bottom of the main page is a like for registration (see Figure 1).&lt;br /&gt;
&lt;br /&gt;
[[image:mg-rast-main-page.jpeg]]&lt;br /&gt;
&lt;br /&gt;
Required fields for registration include first and last name and your valid email address. Login information and other communication regarding the status of your metagenome analysis job(s) will be sent to the email address you provide. Optional information includes your organization and any notes you would like to send the rast server support team.&lt;br /&gt;
&lt;br /&gt;
Please note that your login and password are valid for use in both the MG-RAST and RAST servers.&lt;br /&gt;
&lt;br /&gt;
===Submitting a Job===&lt;br /&gt;
&lt;br /&gt;
Once you have registered and logged into the server, you will be directed to your Jobs Overview. At the top of this page will be a link labeled &amp;quot;Upload Genome&amp;quot; which will allow you to start a new job.&lt;br /&gt;
&lt;br /&gt;
Your metagenome file(s) should be uploaded as either a single plain text file containing all the sequences in FASTA format, or a gzip compressed tar archive (tar.gz) that has your FASTA sequences.&lt;br /&gt;
&lt;br /&gt;
Please do not upload uncompressed files larger than 30 MB. If your data set is larger, use the compressed format or contact us for other options. If you would like, you can also include the quality files in your archive. The fasta file names should end either *.fna, *.fa, or *.fasta, and the quality files should be named *.qual. The quality files are not currently used in the analysis, but the sequences will be renamed and renumbered along with the fasta sequences. If you have trouble with the upload format please email mg-rast@mcs.anl.gov and we'll be happy to help.&lt;br /&gt;
&lt;br /&gt;
''Data entered into the server will not be used for any purposes or integrated into the main SEED environment, it will remain on this server for 120 days or until deleted by the submitting user.''&lt;br /&gt;
&lt;br /&gt;
An email will be sent once the automatic annotation has finished or in case user intervention is required.&lt;br /&gt;
&lt;br /&gt;
===Viewing Results===&lt;br /&gt;
The overall status of your metagenome analyses can be viewed from the main portion of the Jobs Overview page. This contains information regarding a user’s personal jobs and when applicable, jobs of the user’s organization. Information includes each job/metagenome and its status and contains information including job number, name of the user who started the job, metagenome name, and annotation progress. &lt;br /&gt;
&lt;br /&gt;
The table of jobs can be sorted on any column containing textual information. When the user has numerous metagenomes to select from, they can use the text boxes in the table header to search and refine the list of jobs.&lt;br /&gt;
&lt;br /&gt;
Clicking on the bars for a given job in the annotation progress column directs the user to the “Job Details” page where the detailed job status and access to the metagenome analysis can be found.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Release_to_BRC&amp;diff=1589</id>
		<title>Release to BRC</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Release_to_BRC&amp;diff=1589"/>
		<updated>2006-11-14T14:43:59Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* How to Create and Release GFF3 files to the BRCs */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== How to Create and Release GFF3 files to the BRCs ==&lt;br /&gt;
&lt;br /&gt;
We regularly release our data to BRC-central via GFF3 files. This page describes the steps to release the data.&lt;br /&gt;
&lt;br /&gt;
===Creating the files===&lt;br /&gt;
&lt;br /&gt;
Choose a machine that is upto date, and create an empty directory. For this example, we'll use the directory NMPDR&lt;br /&gt;
&lt;br /&gt;
Run the command &lt;br /&gt;
&lt;br /&gt;
 nmpdr2gff NMPDR&lt;br /&gt;
&lt;br /&gt;
This looks through all genomes for the NMPDR flag, and if it is found then a GFF3 file is created using the seed2gff command. If you suspect that files are not created for some genomes that should have them, then the NMPDR flag has not been set. If you would like to create a gff3 file of a single organism, you can use seed2gff with just that organism.&lt;br /&gt;
&lt;br /&gt;
The creation takes about 30-40 seconds per genome, so you can expect it to run for some time.&lt;br /&gt;
&lt;br /&gt;
Once complete you will have a directory structure that looks something like this (only the first two genomes are shown for each organism):&lt;br /&gt;
&lt;br /&gt;
*NMPDR&lt;br /&gt;
**Campylobacter&lt;br /&gt;
***Campylobacter.coli.RM2228.gff3&lt;br /&gt;
***Campylobacter.jejuni.subsp.jejuni.84-25.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Listeria&lt;br /&gt;
***Listeria.innocua.Clip11262.gff3&lt;br /&gt;
***Listeria.monocytogenes.EGD-e.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Staphylococcus&lt;br /&gt;
***Staphylococcus.aureus.RF122.gff3&lt;br /&gt;
***Staphylococcus.aureus.subsp.aureus.MRSA252.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Streptococcus&lt;br /&gt;
***Streptococcus.pneumoniae.R6.gff3&lt;br /&gt;
***Streptococcus.pyogenes.MGAS10270.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Vibrio&lt;br /&gt;
***Vibrio.cholerae.MO10.gff3&lt;br /&gt;
***Vibrio.cholerae.O395.gff3&lt;br /&gt;
***...&lt;br /&gt;
&lt;br /&gt;
===Uploading the files===&lt;br /&gt;
&lt;br /&gt;
One the creation of the GFF3 files is complete, you need to use the [http://iowg.brcdevel.org/gff3validator/ brc-central validator] to validate and upload the data to the site. The one tricky thing about this was that it requires the GO::Parser PERL module. This should be part of the standard install everywhere now, but you may run into problems if it is missing. Please contact Bob for help.&lt;br /&gt;
&lt;br /&gt;
Use this command to validate and upload our data:&lt;br /&gt;
&lt;br /&gt;
 gff3_validator.pl -b NMPDR -d /path/to/directory/NMPDR -p CDS&lt;br /&gt;
&lt;br /&gt;
One this has completed you should ftp to [ftp://ftp.brc-central.org] and check that the files are correct. If there are problems with the validator or upload you should email Todd Creasy at TIGR for help.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Release_to_BRC&amp;diff=1588</id>
		<title>Release to BRC</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Release_to_BRC&amp;diff=1588"/>
		<updated>2006-11-14T14:18:36Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: /* How to Create and Release GFF3 files to the BRCs */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== How to Create and Release GFF3 files to the BRCs ==&lt;br /&gt;
&lt;br /&gt;
We regularly release our data to BRC-central via GFF3 files. This page describes the steps to release the data.&lt;br /&gt;
&lt;br /&gt;
===Creating the files===&lt;br /&gt;
&lt;br /&gt;
Choose a machine that is upto date, and create an empty directory. For this example, we'll use the directory NMPDR&lt;br /&gt;
&lt;br /&gt;
Run the command &lt;br /&gt;
&lt;br /&gt;
 nmpdr2gff NMPDR&lt;br /&gt;
&lt;br /&gt;
This looks through all genomes for the NMPDR flag, and if it is found then a GFF3 file is created. If you suspect that files are not created for some genomes that should have them, then the NMPDR flag has not been set.&lt;br /&gt;
&lt;br /&gt;
The creation takes about 30-40 seconds per genome, so you can expect it to run for some time.&lt;br /&gt;
&lt;br /&gt;
Once complete you will have a directory structure that looks something like this (only the first two genomes are shown for each organism):&lt;br /&gt;
&lt;br /&gt;
*NMPDR&lt;br /&gt;
**Campylobacter&lt;br /&gt;
***Campylobacter.coli.RM2228.gff3&lt;br /&gt;
***Campylobacter.jejuni.subsp.jejuni.84-25.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Listeria&lt;br /&gt;
***Listeria.innocua.Clip11262.gff3&lt;br /&gt;
***Listeria.monocytogenes.EGD-e.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Staphylococcus&lt;br /&gt;
***Staphylococcus.aureus.RF122.gff3&lt;br /&gt;
***Staphylococcus.aureus.subsp.aureus.MRSA252.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Streptococcus&lt;br /&gt;
***Streptococcus.pneumoniae.R6.gff3&lt;br /&gt;
***Streptococcus.pyogenes.MGAS10270.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Vibrio&lt;br /&gt;
***Vibrio.cholerae.MO10.gff3&lt;br /&gt;
***Vibrio.cholerae.O395.gff3&lt;br /&gt;
***...&lt;br /&gt;
&lt;br /&gt;
===Uploading the files===&lt;br /&gt;
&lt;br /&gt;
One the creation of the GFF3 files is complete, you need to use the [http://iowg.brcdevel.org/gff3validator/ brc-central validator] to validate and upload the data to the site. The one tricky thing about this was that it requires the GO::Parser PERL module. This should be part of the standard install everywhere now, but you may run into problems if it is missing. Please contact Bob for help.&lt;br /&gt;
&lt;br /&gt;
Use this command to validate and upload our data:&lt;br /&gt;
&lt;br /&gt;
 gff3_validator.pl -b NMPDR -d /path/to/directory/NMPDR -p CDS&lt;br /&gt;
&lt;br /&gt;
One this has completed you should ftp to [ftp://ftp.brc-central.org] and check that the files are correct. If there are problems with the validator or upload you should email Todd Creasy at TIGR for help.&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Release_to_BRC&amp;diff=1587</id>
		<title>Release to BRC</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Release_to_BRC&amp;diff=1587"/>
		<updated>2006-11-14T14:03:45Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: How to create and upload GFF3 files to the BRC&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== How to Create and Release GFF3 files to the BRCs ===&lt;br /&gt;
&lt;br /&gt;
We regularly release our data to BRC-central via GFF3 files. This page describes the steps to release the data.&lt;br /&gt;
&lt;br /&gt;
* Creating the files&lt;br /&gt;
&lt;br /&gt;
Choose a machine that is upto date, and create an empty directory. &lt;br /&gt;
Run the command &lt;br /&gt;
&lt;br /&gt;
nmpdr2gff NMPDR&lt;br /&gt;
&lt;br /&gt;
This looks through all genomes for the NMPDR flag, and if it is found then a GFF3 file is created. If you suspect that files are not created for some genomes that should have them, then the NMPDR flag has not been set.&lt;br /&gt;
&lt;br /&gt;
Once complete you will have a directory structure that looks something like this (only the first two genomes are shown for each organism):&lt;br /&gt;
&lt;br /&gt;
*NMPDR&lt;br /&gt;
**Campylobacter&lt;br /&gt;
***Campylobacter.coli.RM2228.gff3&lt;br /&gt;
***Campylobacter.jejuni.subsp.jejuni.84-25.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Listeria&lt;br /&gt;
***Listeria.innocua.Clip11262.gff3&lt;br /&gt;
***Listeria.monocytogenes.EGD-e.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Staphylococcus&lt;br /&gt;
***Staphylococcus.aureus.RF122.gff3&lt;br /&gt;
***Staphylococcus.aureus.subsp.aureus.MRSA252.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Streptococcus&lt;br /&gt;
***Streptococcus.pneumoniae.R6.gff3&lt;br /&gt;
***Streptococcus.pyogenes.MGAS10270.gff3&lt;br /&gt;
***...&lt;br /&gt;
**Vibrio&lt;br /&gt;
***Vibrio.cholerae.MO10.gff3&lt;br /&gt;
***Vibrio.cholerae.O395.gff3&lt;br /&gt;
***...&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=SOPs&amp;diff=1586</id>
		<title>SOPs</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=SOPs&amp;diff=1586"/>
		<updated>2006-11-14T13:52:12Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== SEED standard operating procedures ==&lt;br /&gt;
&lt;br /&gt;
To generate data that is useful to the various communities involved in the process of annotation and use of annotations, we make available out standard operating procedures.&lt;br /&gt;
&lt;br /&gt;
* [[GeneCalling|Gene calling]]&lt;br /&gt;
* Annotation&lt;br /&gt;
** [[Annotation_of_close_strain_sets|Annotation of close strain sets]]&lt;br /&gt;
** Annotation of diverse genomes.&lt;br /&gt;
*BRC&lt;br /&gt;
**[[Release_to_BRC|Releasing GFF3 files]] -- Creating and uploaing GFF3 files&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Glossary&amp;diff=1484</id>
		<title>Glossary</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Glossary&amp;diff=1484"/>
		<updated>2006-08-30T17:29:12Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== Annotators SEED ===&lt;br /&gt;
The master copy for all data in the SEED environment. Users can not access this password protected site.&lt;br /&gt;
All annotations are made available via the [[#SEED-Viewer| SEED-Viewer]] and the [[#Trial-SEED|Trial-SEED]].&lt;br /&gt;
&lt;br /&gt;
=== Annotation ===&lt;br /&gt;
please see [[#Assigning_a_gene_function_and_annotation| Assigning a gene function and annotation]]&lt;br /&gt;
&lt;br /&gt;
=== Assigning a gene function and annotation ===&lt;br /&gt;
Annotators assign gene functions to genes, and we call this process annotation.  In most contexts, people use the term annotation to refer to assignments of function to the genes within a single organism.  We certainly use the term in this sense, but we also use it to describe the process of assigning functions to corresponding genes from numerous genomes.  Our basic approach to annotation is to ask our annotators to annotate the genes included in a [[#subsystem|Subsystem]] (e.g., glycolysis) across all genomes.  This process of annotation of the genes within a subsystem across a set of genomes, rather than annotation of genes within a single genome, allows our annotators to focus on a constrained set of functional roles and attempt to accurately identify exactly what variant, if any, of a subsystem exists in each of the genomes. &lt;br /&gt;
&lt;br /&gt;
We use the term annotation to refer to assigning functions to genes (either within a single organism or to a constrained set of gene/protein families across a set of organisms).  This activity certainly is closely related to the construction of subsystems and protein families (which we call FIGfams), but we will describe those activities elsewhere. &lt;br /&gt;
&lt;br /&gt;
=== Assignment ===&lt;br /&gt;
please see [[#Assigning_a_gene_function_and_annotation| Assigning a gene function and annotation]]&lt;br /&gt;
&lt;br /&gt;
=== Bidirectional Best Hit (BBH) ===&lt;br /&gt;
&lt;br /&gt;
The paper [http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=EBI&amp;amp;pubmedid=10077608 The use of gene clusters to infer functional coupling]&lt;br /&gt;
defines a Bidirectional Best Hit or BBH as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&lt;br /&gt;
Given two genes Xa and Xb from two genomes Ga and Gb, Xa and Xb are called a “bidirectional best hit (BBH)” if and only if recognizable similarity exists between them (in our case, we required fasta3 scores lower than 1.0 × 10−5), there is no gene Zb in Gb that is more similar than Xb is to Xa, and there is no gene Za in Ga that is more similar than Xa is to Xb. Genes (Xa, Ya) from Ga and (Xb, Yb) from Gb form a “pair of close bidirectional best hits (PCBBH)” if and only if Xa and Ya are close, Xb and Yb are close, Xa and Xb are a BBH, and Ya and Yb are a BBH.&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Clearinghouse ===&lt;br /&gt;
please see [[#SubsystemClearinghouse|Subsystem Clearinghouse]]&lt;br /&gt;
&lt;br /&gt;
=== FIGfam ===&lt;br /&gt;
FIGfams are protein families generated by the Fellowship for Interpretation of Genomes (FIG).  These families are based on the collection of subsystems, as well as correspondences between genes in closely related strains (we describe the construction of FIGfams in a separate SOP).  The important properties of these families are as follows:&lt;br /&gt;
&lt;br /&gt;
#	Two PEGs which both occur within a single FIGfam are believed to have the same function.&lt;br /&gt;
&lt;br /&gt;
#	There is a decision procedure associated with the family which can be invoked to determine whether or not a gene can be “safely” assigned the function associated with the family.&lt;br /&gt;
&lt;br /&gt;
=== Functional role ===&lt;br /&gt;
The concept of functional role is both basic and primitive in the sense that we will not pretend to offer a precise definition.  It corresponds roughly to a single logical role that a gene or gene product may play in the operation of a cell.&lt;br /&gt;
&lt;br /&gt;
=== Gene function ===&lt;br /&gt;
The function of a protein-encoding gene (PEG) is the functional role played by the product of the gene or an expression describing a set of roles played by the encoded protein.  The operators used to construct expressions and the meanings associated with the operators are  described in &lt;br /&gt;
&lt;br /&gt;
 http://www.nmpdr.org/FIG/Html/SEED_functions.html&lt;br /&gt;
&lt;br /&gt;
Genes other than PEGs can also be assigned functions (e.g., SSU rRNA).  However, in most cases the functions assigned to genes other than PEGs tend not to be problematic. &lt;br /&gt;
&lt;br /&gt;
=== Metabolic Reconstruction ===&lt;br /&gt;
When we use the term metabolic reconstruction of a given genome we will simply mean the set of populated subsystems that contain the genome, the PEGs (and their assigned functions) that are connected to functional roles in these populated subsystems, and the specific variant code associated with the genome in each of the populated subsystems.&lt;br /&gt;
&lt;br /&gt;
=== NMPDR pathogen genome === &lt;br /&gt;
&lt;br /&gt;
The NMPDR is responsible for five classes of genomes:&lt;br /&gt;
&lt;br /&gt;
#	Campylobacter jejuni&lt;br /&gt;
#	Listeria monocytogenes&lt;br /&gt;
#	Staphylococcus aureus&lt;br /&gt;
#	Streptococcus pneumoniae and Streptococcus pyogenes&lt;br /&gt;
#	Pathogenic Vibrio&lt;br /&gt;
&lt;br /&gt;
=== Pair of Close Homologs (PCH) ===&lt;br /&gt;
&lt;br /&gt;
The paper [http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=EBI&amp;amp;pubmedid=10077608 The use of gene clusters to infer functional coupling]&lt;br /&gt;
defines a Pair of Close Homologs as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&lt;br /&gt;
We can also define the concept of “pairs of close homologs” (PCHs) as follows: genes (X′a, Y′a) from Ga and (X′b, Y′b) from Gb form a PCH if and only if X′a and Y′a are close, X′b and Y′b are close, X′a and X′b are recognizably similar, and Y′a and Y′b are recognizably similar. Here, we will consider two genes to be recognizably similar if their gene products produce fasta3 scores lower than 1.0 × 10−5. We use a scoring scheme analogous to the one described for PCBBHs to evaluate the connections between PCHs, except that if Ga and Gb are the same genome, we assign an arbitrary “same-genome score” (“same-genome” pairs cannot occur for PCBBHs by definition, but for PCHs they are possible). Unlike PCBBHs from two very close genomes for which contiguity is completely uninformative in the vast majority of cases, PCHs allow recognition of gene clusters that play similar (but usually not identical) roles (such as two transport cassettes containing pairs of homologs) in the same or similar organisms. The arbitrary “same-genome score” should, we believe, have a value that is high enough to rank such instances as significant. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== PEG ===&lt;br /&gt;
A Protein Encoding Gene (PEG) is equivalent to a CDS (Coding Sequence).&lt;br /&gt;
&lt;br /&gt;
=== Populated Subsystem ===&lt;br /&gt;
please see [[#Subsystem|Subsystem]]&lt;br /&gt;
&lt;br /&gt;
=== SEED-Viewer ===&lt;br /&gt;
A read-only version of the SEED that presents the latest data.&lt;br /&gt;
&lt;br /&gt;
 http://seed-viewer.theseed.org&lt;br /&gt;
&lt;br /&gt;
'''Please note''': The data is updated automatically every 24 hours. When citing or linking to the SEED please use this version.&lt;br /&gt;
&lt;br /&gt;
=== Subsystem ===&lt;br /&gt;
A subsystem is a set of functional roles that an annotator has decided should be thought of as related.  Frequently, subsystems represent the collection of &lt;br /&gt;
[[#Functional_role|functional roles]] that make up a metabolic pathway, a complex (e.g., the ribosome), or a class of proteins (e.g., two-component signal-transduction proteins within Staphylococcus aureus).  A '''populated subsystem''' is a subsystem with an attached spreadsheet.   The rows of the spreadsheet represent genomes and the columns represent the functional roles of the spreadsheet.  Each cell contains the identifiers of genes from the corresponding genome the implement the specific functional role.  That is, a populated subsystem specifies which genes implement instances of the subsystem in each of the genomes.  The rows of a populated genome are assigned '''variant codes''' which describe which of a set of possible variants of the subsystem exist within each genome (special codes expressing a total absence of the subsystem or remaining uncertainty exist).  Construction of a large set of curated populated subsystems is at the center of the NMPDR and SEED annotation efforts.&lt;br /&gt;
&lt;br /&gt;
=== Subsystem Clearing House ===&lt;br /&gt;
Since annotators can work on any machine (including the public SEED) the way to propagate subsystems is via &lt;br /&gt;
 http://clearinghouse.theseed.org/clearinghouse_browser.cgi? &lt;br /&gt;
&lt;br /&gt;
=== Trial-SEED ===&lt;br /&gt;
A public, read-write copy of the SEED is made available on &lt;br /&gt;
&lt;br /&gt;
 http://theseed.uchicago.edu/FIG/index.cgi&lt;br /&gt;
&lt;br /&gt;
'''Please note''': The data on this server is updated in irregular intervals. Users should not assume that annotations made on this system will persist. Please publish your annotations to the [[#Subsystem_clearing_house|Subsystem Clearing house]].&lt;br /&gt;
&lt;br /&gt;
=== Variant Code===&lt;br /&gt;
please see [[#Subsystem|Subsystem]]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
	<entry>
		<id> /w/index.php?title=Glossary&amp;diff=1483</id>
		<title>Glossary</title>
		<link rel="alternate" type="text/html" href=" /w/index.php?title=Glossary&amp;diff=1483"/>
		<updated>2006-08-30T16:48:53Z</updated>

		<summary type="html">&lt;p&gt;RobEdwards: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== Annotators SEED ===&lt;br /&gt;
The master copy for all data in the SEED environment. Users can not access this password protected site.&lt;br /&gt;
All annotations are made available via the [[#SEED-Viewer| SEED-Viewer]] and the [[#Trial-SEED|Trial-SEED]].&lt;br /&gt;
&lt;br /&gt;
=== Annotation ===&lt;br /&gt;
please see [[#Assigning_a_gene_function_and_annotation| Assigning a gene function and annotation]]&lt;br /&gt;
&lt;br /&gt;
=== Assigning a gene function and annotation ===&lt;br /&gt;
Annotators assign gene functions to genes, and we call this process annotation.  In most contexts, people use the term annotation to refer to assignments of function to the genes within a single organism.  We certainly use the term in this sense, but we also use it to describe the process of assigning functions to corresponding genes from numerous genomes.  Our basic approach to annotation is to ask our annotators to annotate the genes included in a [[#subsystem|Subsystem]] (e.g., glycolysis) across all genomes.  This process of annotation of the genes within a subsystem across a set of genomes, rather than annotation of genes within a single genome, allows our annotators to focus on a constrained set of functional roles and attempt to accurately identify exactly what variant, if any, of a subsystem exists in each of the genomes. &lt;br /&gt;
&lt;br /&gt;
We use the term annotation to refer to assigning functions to genes (either within a single organism or to a constrained set of gene/protein families across a set of organisms).  This activity certainly is closely related to the construction of subsystems and protein families (which we call FIGfams), but we will describe those activities elsewhere. &lt;br /&gt;
&lt;br /&gt;
=== Assignment ===&lt;br /&gt;
please see [[#Assigning_a_gene_function_and_annotation| Assigning a gene function and annotation]]&lt;br /&gt;
&lt;br /&gt;
=== Bidirectional Best Hit ===&lt;br /&gt;
&lt;br /&gt;
The paper [http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=EBI&amp;amp;pubmedid=10077608 The use of gene clusters to infer functional coupling]&lt;br /&gt;
defines a Bidirectional Best Hit or BBH as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&lt;br /&gt;
Given two genes Xa and Xb from two genomes Ga and Gb, Xa and Xb are called a “bidirectional best hit (BBH)” if and only if recognizable similarity exists between them (in our case, we required fasta3 scores lower than 1.0 × 10−5), there is no gene Zb in Gb that is more similar than Xb is to Xa, and there is no gene Za in Ga that is more similar than Xa is to Xb. Genes (Xa, Ya) from Ga and (Xb, Yb) from Gb form a “pair of close bidirectional best hits (PCBBH)” if and only if Xa and Ya are close, Xb and Yb are close, Xa and Xb are a BBH, and Ya and Yb are a BBH.&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Clearinghouse ===&lt;br /&gt;
please see [[#SubsystemClearinghouse|Subsystem Clearinghouse]]&lt;br /&gt;
&lt;br /&gt;
=== FIGfam ===&lt;br /&gt;
FIGfams are protein families generated by the Fellowship for Interpretation of Genomes (FIG).  These families are based on the collection of subsystems, as well as correspondences between genes in closely related strains (we describe the construction of FIGfams in a separate SOP).  The important properties of these families are as follows:&lt;br /&gt;
&lt;br /&gt;
#	Two PEGs which both occur within a single FIGfam are believed to have the same function.&lt;br /&gt;
&lt;br /&gt;
#	There is a decision procedure associated with the family which can be invoked to determine whether or not a gene can be “safely” assigned the function associated with the family.&lt;br /&gt;
&lt;br /&gt;
=== Functional role ===&lt;br /&gt;
The concept of functional role is both basic and primitive in the sense that we will not pretend to offer a precise definition.  It corresponds roughly to a single logical role that a gene or gene product may play in the operation of a cell.&lt;br /&gt;
&lt;br /&gt;
=== Gene function ===&lt;br /&gt;
The function of a protein-encoding gene (PEG) is the functional role played by the product of the gene or an expression describing a set of roles played by the encoded protein.  The operators used to construct expressions and the meanings associated with the operators are  described in &lt;br /&gt;
&lt;br /&gt;
 http://www.nmpdr.org/FIG/Html/SEED_functions.html&lt;br /&gt;
&lt;br /&gt;
Genes other than PEGs can also be assigned functions (e.g., SSU rRNA).  However, in most cases the functions assigned to genes other than PEGs tend not to be problematic. &lt;br /&gt;
&lt;br /&gt;
=== Metabolic Reconstruction ===&lt;br /&gt;
When we use the term metabolic reconstruction of a given genome we will simply mean the set of populated subsystems that contain the genome, the PEGs (and their assigned functions) that are connected to functional roles in these populated subsystems, and the specific variant code associated with the genome in each of the populated subsystems.&lt;br /&gt;
&lt;br /&gt;
=== NMPDR pathogen genome === &lt;br /&gt;
&lt;br /&gt;
The NMPDR is responsible for five classes of genomes:&lt;br /&gt;
&lt;br /&gt;
#	Campylobacter jejuni&lt;br /&gt;
#	Listeria monocytogenes&lt;br /&gt;
#	Staphylococcus aureus&lt;br /&gt;
#	Streptococcus pneumoniae and Streptococcus pyogenes&lt;br /&gt;
#	Pathogenic Vibrio&lt;br /&gt;
&lt;br /&gt;
=== PEG ===&lt;br /&gt;
A Protein Encoding Gene (PEG) is equivalent to a CDS (Coding Sequence).&lt;br /&gt;
&lt;br /&gt;
=== Populated Subsystem ===&lt;br /&gt;
please see [[#Subsystem|Subsystem]]&lt;br /&gt;
&lt;br /&gt;
=== SEED-Viewer ===&lt;br /&gt;
A read-only version of the SEED that presents the latest data.&lt;br /&gt;
&lt;br /&gt;
 http://seed-viewer.theseed.org&lt;br /&gt;
&lt;br /&gt;
'''Please note''': The data is updated automatically every 24 hours. When citing or linking to the SEED please use this version.&lt;br /&gt;
&lt;br /&gt;
=== Subsystem ===&lt;br /&gt;
A subsystem is a set of functional roles that an annotator has decided should be thought of as related.  Frequently, subsystems represent the collection of &lt;br /&gt;
[[#Functional_role|functional roles]] that make up a metabolic pathway, a complex (e.g., the ribosome), or a class of proteins (e.g., two-component signal-transduction proteins within Staphylococcus aureus).  A '''populated subsystem''' is a subsystem with an attached spreadsheet.   The rows of the spreadsheet represent genomes and the columns represent the functional roles of the spreadsheet.  Each cell contains the identifiers of genes from the corresponding genome the implement the specific functional role.  That is, a populated subsystem specifies which genes implement instances of the subsystem in each of the genomes.  The rows of a populated genome are assigned '''variant codes''' which describe which of a set of possible variants of the subsystem exist within each genome (special codes expressing a total absence of the subsystem or remaining uncertainty exist).  Construction of a large set of curated populated subsystems is at the center of the NMPDR and SEED annotation efforts.&lt;br /&gt;
&lt;br /&gt;
=== Subsystem Clearing House ===&lt;br /&gt;
Since annotators can work on any machine (including the public SEED) the way to propagate subsystems is via &lt;br /&gt;
 http://clearinghouse.theseed.org/clearinghouse_browser.cgi? &lt;br /&gt;
&lt;br /&gt;
=== Trial-SEED ===&lt;br /&gt;
A public, read-write copy of the SEED is made available on &lt;br /&gt;
&lt;br /&gt;
 http://theseed.uchicago.edu/FIG/index.cgi&lt;br /&gt;
&lt;br /&gt;
'''Please note''': The data on this server is updated in irregular intervals. Users should not assume that annotations made on this system will persist. Please publish your annotations to the [[#Subsystem_clearing_house|Subsystem Clearing house]].&lt;br /&gt;
&lt;br /&gt;
=== Variant Code===&lt;br /&gt;
please see [[#Subsystem|Subsystem]]&lt;/div&gt;</summary>
		<author><name>RobEdwards</name></author>
	</entry>
</feed>