| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006499 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052510 | Ga0100371 |
| Sample Name | Human stool microbial communities from NIH, USA - visit 1, subject 159268001 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 147067976 |
| Sequencing Scaffolds | 7 |
| Novel Protein Genes | 14 |
| Associated Families | 14 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F032312 | Metagenome / Metatranscriptome | 180 | N |
| F044555 | Metagenome / Metatranscriptome | 154 | N |
| F047125 | Metagenome / Metatranscriptome | 150 | N |
| F060985 | Metagenome / Metatranscriptome | 132 | N |
| F062845 | Metagenome | 130 | N |
| F064817 | Metagenome | 128 | N |
| F068811 | Metagenome | 124 | N |
| F068855 | Metagenome | 124 | N |
| F076189 | Metagenome | 118 | N |
| F094005 | Metagenome / Metatranscriptome | 106 | N |
| F099269 | Metagenome | 103 | N |
| F101356 | Metagenome | 102 | N |
| F101357 | Metagenome / Metatranscriptome | 102 | N |
| F105375 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0100371_100127 | Not Available | 86301 | Open in IMG/M |
| Ga0100371_101261 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 19167 | Open in IMG/M |
| Ga0100371_102579 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 9619 | Open in IMG/M |
| Ga0100371_107070 | All Organisms → cellular organisms → Bacteria | 3193 | Open in IMG/M |
| Ga0100371_107334 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3072 | Open in IMG/M |
| Ga0100371_119578 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium | 1095 | Open in IMG/M |
| Ga0100371_126203 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 805 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0100371_100127 | Ga0100371_10012776 | F094005 | MKKEVIKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLEGQFTRLDGKGNAIILPCTTENEQKYNSFVAYHQSKKSLEAIKKWLDDNGKHKDDETPEKVITLDKKLKKLIEKLNE* |
| Ga0100371_100127 | Ga0100371_10012777 | F032312 | MARIKDYDEDLSAPKLLRERARDSKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDALVCLGFTWFKLLGVALMILLWPIVFIYALNDGIEGYPFKKYAIPYIFILVVWFIIFLYGLVS* |
| Ga0100371_100127 | Ga0100371_10012778 | F060985 | MSNIDEKAKNNFTIEMRIFENYEKVKHEIIKVIDFLRHAETNLGMCRIFDNQNHEFWHSVIKPWFKPERFGITHLWFSSGFSHIGYGEYHTIKGNRWLKTPIDKIDRENRIFGYWFPPYKKYIPHRIKVLKLALKDLERIKEEYGKD* |
| Ga0100371_100127 | Ga0100371_10012784 | F044555 | MKTTNPSSRITLSQNGNQILTCKVYKEPNYILSMSNEEILEFISGLDYMGNIPMVPDLEKPIEIQVSTIRRIPLEQNKEVQTKIKEIIYNNLYDTLIDELKGTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHHKR* |
| Ga0100371_100127 | Ga0100371_10012792 | F101357 | MNLNNITTALKTGITIYQYEQWQNTGSVNLMQKESHMLSKVWLKTNIYNPDSLDKPFIQLSATFTSEFDIQEYNEWLNANQYKLYPLLLDILKISLKDAFYNYSNASNIHYEGGKFPSMLTIQLFNLEF* |
| Ga0100371_100127 | Ga0100371_10012795 | F105375 | MKNNETFQTTQHLDKLVTNLGLQIQELFSLDLEEILDYSNNLMNLLVNAYVENQCLALSAMISKQDGFAIYSFLFQTPDTSNGAADAMVNFAMNFTDGEANIKSINRISSNIMQITFTV* |
| Ga0100371_100127 | Ga0100371_10012797 | F064817 | MNIKNLFNRFRKREPELSYSLNLIYLEDTKVVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKNRYWVYDKEPYLLYYKVPLIVNTSRKLSGKSDMVITKEKYQAAKDLVPAHEVSDRFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQHNWLKEFELLVFKRNYDTDMLFLSHSLTYILDQTEEEGRRICIQNIIERNINQENQDENETI* |
| Ga0100371_101261 | Ga0100371_1012612 | F068855 | VPTVEAQGPQVRKSLAPQGLQAEKERENANVQNAGWLHSFDADYHYRGSDLHYHVNVSAALSEGGAVW* |
| Ga0100371_102579 | Ga0100371_10257910 | F062845 | MEKTPFILHEKFRSDNNEQRKEHFQKKFERYIVDELSDTAPSKSCA* |
| Ga0100371_106781 | Ga0100371_1067817 | F101356 | MPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFCVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDNSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ* |
| Ga0100371_107070 | Ga0100371_1070702 | F099269 | MRRPPFVHGVLLSVQVGELLLLDDLGDRAGGASVLASATGDAGVLISDGSDVLELQNASGAGVDANATSDALVGINYGMSHGSFLSVDRRYRRCAPV* |
| Ga0100371_107334 | Ga0100371_1073343 | F076189 | VLSAGHCFFLSPFIKPLLYVEKLQIGTVLPVVSALYREFAELSAYFDLYAIQSAQKQLRMLCNFHENTFGLLIFYTNYAIL* |
| Ga0100371_119578 | Ga0100371_1195782 | F068811 | MDQDGSEHNICSTREGLCPGKKQHGASGWKKIFQHGKEPLRNKDSISQYCNKKAAVSLILNENVSETLCIFSIDKTNCCRIK* |
| Ga0100371_126203 | Ga0100371_1262032 | F047125 | MDVVLLLMVLGVMLRGFWAADALDHMRKEILQQEGKRRGWWS* |
| ⦗Top⦘ |