Basic Information | |
---|---|
IMG/M Taxon OID | 3300008260 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053144 | Ga0114292 |
Sample Name | Human stool microbial communities from NIH, USA - visit 1, subject 159551223 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 180331707 |
Sequencing Scaffolds | 12 |
Novel Protein Genes | 13 |
Associated Families | 13 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria | 3 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1 |
Not Available | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: National Institute of Health | |||||||
Coordinates | Lat. (°) | 39.0042816 | Long. (°) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F042910 | Metagenome | 157 | N |
F043945 | Metagenome | 155 | N |
F047126 | Metagenome | 150 | N |
F050793 | Metagenome | 145 | N |
F055775 | Metagenome | 138 | N |
F058154 | Metagenome | 135 | N |
F059982 | Metagenome | 133 | N |
F067720 | Metagenome | 125 | Y |
F068855 | Metagenome | 124 | N |
F074899 | Metagenome / Metatranscriptome | 119 | N |
F088914 | Metagenome | 109 | N |
F099269 | Metagenome | 103 | N |
F101356 | Metagenome | 102 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0114292_100531 | All Organisms → cellular organisms → Bacteria | 33675 | Open in IMG/M |
Ga0114292_101004 | All Organisms → cellular organisms → Bacteria | 22075 | Open in IMG/M |
Ga0114292_102114 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 13055 | Open in IMG/M |
Ga0114292_102146 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella | 12889 | Open in IMG/M |
Ga0114292_103223 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 9294 | Open in IMG/M |
Ga0114292_105361 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 5919 | Open in IMG/M |
Ga0114292_106206 | All Organisms → cellular organisms → Bacteria | 5171 | Open in IMG/M |
Ga0114292_106389 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 5026 | Open in IMG/M |
Ga0114292_108030 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 4023 | Open in IMG/M |
Ga0114292_109133 | Not Available | 3544 | Open in IMG/M |
Ga0114292_110582 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 3077 | Open in IMG/M |
Ga0114292_124766 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 1244 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0114292_100531 | Ga0114292_10053122 | F055775 | MAQIAQQDNLVIEVTTTADALDGDTKKKLIECIEGRTITDVILVTKEVEKKISHARVVSWLVDTTGDSPKYTIDIINASSGAVVTIALN* |
Ga0114292_101004 | Ga0114292_10100425 | F050793 | MRKQVLVPENVEMADIHAIQTRVKKHNLLTVVLLIPLFAGCALRESHETLGGIVFVLSMVLMVLGFVSGRGDEKRLPAYWAKYQQIHRMDNLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLADVREYATQEGYEPMAVDFETMTDEEFQQFLDRSRI* |
Ga0114292_102114 | Ga0114292_1021144 | F059982 | MTMRRMTRLLCLMLLFSLITFSCPLAEGTDAPTGTPAPTPMLTAVPESALAPFNVVLPEDAHVEMAEGRITLVRGDSRVVAMVISRVPDEDPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFGDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENASMDGAAVYVKEEATETASPEV* |
Ga0114292_102146 | Ga0114292_1021464 | F099269 | MQKEASAAMRKPPFFVHGVLLSVQVGELLLLDDLGDRAGGASVLASATGDAGVLVSDGSDVLELQNASGAGVDANATSDALVGINYGMSHGSFLSVDRRYRRCAPV* |
Ga0114292_103223 | Ga0114292_1032238 | F058154 | MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMMCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGTAIRQVMFRMVDYALPAYAVATLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES* |
Ga0114292_105361 | Ga0114292_1053612 | F088914 | VGRILPVCSGFFVFWSRKEGANYQISVKSFSNLDSYDIIQLYIMTELTDGRVSDSPFSVPYTPKENIKNPGKFIVFSAWYAAKALEMDVK* |
Ga0114292_106206 | Ga0114292_1062065 | F043945 | MAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLNGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAQYGLYVIAKDKTDEYQIVRFDVDTGKGEMMEFALEDEEDSHLLTDMPGIFQWSGTSSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNSAMYFVHDNTVWVMDYDEETSEFGEPHGIDKLPFFAEDTMCGFYWRGFYLIQTHDGLYLCHTGDETARAYLYGEVE* |
Ga0114292_106389 | Ga0114292_1063893 | F042910 | MLQLPQGGGLPGVHSHNGGLHLLRRQREQLSLRVRFLIEEYDIATRHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITALTISAAGKISFQGLTIAPGAAIRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC* |
Ga0114292_108030 | Ga0114292_1080306 | F068855 | VPTVEAQGPQVRKSLAPQGLQAEKESEDVNVRNAGWLHSFDADYHYRGSDLHYHVNVSAALSEGGAAW* |
Ga0114292_109133 | Ga0114292_1091338 | F067720 | MASTTYRHLGDVTGMFAAQEQFRDITKMVCARFRGLTKTYHLGNVNKLVTFCHRFAVIGNMVRNAGQLPQPFWLGAACGGGSRGLFARA |
Ga0114292_110582 | Ga0114292_1105824 | F074899 | MRGRKQWNKQAVTSSFLDKKPPKSLVQQGLEGSTAVGKDEVGSSNLPSSSNINRLNRLI* |
Ga0114292_124766 | Ga0114292_1247665 | F047126 | TNIPQTQAEFKSKSLGILCGFQGFLTQNPAALVETDDIFDVSDTPYGFSGLNTLRAAGVPHPLPPFAQRFIACFCSQTAAASQRKSAYILSSLESPCILCSLLRYFHILLKKLQKTC* |
Ga0114292_132556 | Ga0114292_1325562 | F101356 | VGELIPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPSSDFCVYIAGNTMELQDTTKVYVKYMYGQPNSDTYLKMKYETDHRISIYLTSDNSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ* |