| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300007802 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052825 | Ga0105667 |
| Sample Name | Human stool microbial communities from NIH, USA - visit 2, subject 159268001 reassembly |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 161361095 |
| Sequencing Scaffolds | 5 |
| Novel Protein Genes | 7 |
| Associated Families | 6 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F026592 | Metagenome / Metatranscriptome | 197 | Y |
| F029444 | Metagenome | 188 | Y |
| F074899 | Metagenome / Metatranscriptome | 119 | N |
| F078004 | Metagenome | 117 | N |
| F099269 | Metagenome | 103 | N |
| F101356 | Metagenome | 102 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0105667_100639 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 35705 | Open in IMG/M |
| Ga0105667_102897 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 10166 | Open in IMG/M |
| Ga0105667_103502 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 8165 | Open in IMG/M |
| Ga0105667_111909 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella | 1926 | Open in IMG/M |
| Ga0105667_125644 | All Organisms → cellular organisms → Bacteria | 805 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0105667_100639 | Ga0105667_10063916 | F074899 | MSGWKQWSKQAVTSSFLDKKPPKSLVQQGLEGSTPVGKDEVGSSNLPSSSNKAL* |
| Ga0105667_102897 | Ga0105667_1028977 | F029444 | MEVSEQLTGFELEYLMSWTVSNLQRPFREVFSLEKSGIIAEKESQIFGRRFVGFDGPKKAAPFFNF* |
| Ga0105667_103405 | Ga0105667_10340510 | F078004 | LSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDVHMLRRTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF* |
| Ga0105667_103502 | Ga0105667_1035021 | F074899 | MSGRKQWNKQTATSSFLDKKPPQSLIQQGLEGSTVVGKDEVGSSNLPSSSKKHRKLRFLV |
| Ga0105667_111909 | Ga0105667_1119093 | F099269 | VGELLLLDDLGDRASGASVLASAAGDAGVLVSDGSDVLELQNASGAGVDANATSDALVGINYGMSHGSFLSVDRRYRRCAPV* |
| Ga0105667_112764 | Ga0105667_1127642 | F101356 | VGELMPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFCVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDNSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ* |
| Ga0105667_125644 | Ga0105667_1256441 | F026592 | QTVCQNRLRTASPQGIAAQASQGGVATLTERSDATFSVMQYPPADGE* |
| ⦗Top⦘ |