Basic Information | |
---|---|
IMG/M Taxon OID | 3300008100 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053101 | Ga0114174 |
Sample Name | Human stool microbial communities from NIH, USA - visit 2, subject 158944319 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 94952779 |
Sequencing Scaffolds | 8 |
Novel Protein Genes | 12 |
Associated Families | 12 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F055775 | Metagenome | 138 | N |
F056682 | Metagenome | 137 | Y |
F068855 | Metagenome | 124 | N |
F073656 | Metagenome | 120 | N |
F075481 | Metagenome | 119 | N |
F080673 | Metagenome | 115 | N |
F085718 | Metagenome | 111 | N |
F089590 | Metagenome | 109 | N |
F090484 | Metagenome | 108 | N |
F092228 | Metagenome | 107 | N |
F099406 | Metagenome | 103 | N |
F101356 | Metagenome | 102 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0114174_100006 | Not Available | 144047 | Open in IMG/M |
Ga0114174_100007 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 128755 | Open in IMG/M |
Ga0114174_100255 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 36072 | Open in IMG/M |
Ga0114174_100552 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 23303 | Open in IMG/M |
Ga0114174_102245 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 7318 | Open in IMG/M |
Ga0114174_109196 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1794 | Open in IMG/M |
Ga0114174_110519 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1544 | Open in IMG/M |
Ga0114174_127936 | Not Available | 531 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0114174_100006 | Ga0114174_100006102 | F099406 | MAEVGYNSKFEGQEVDSRLENVVQAAPGTGSESGKGGLIPAPPAGSQDGSKTLLSDMTWGDYVNKKYIDDAVSAAGWKKQIVSKLPTVEEAKDNVMYLVKDDVASTETKNVYNEYILVTEEGGTKVLESLGMISTRVDSTYLDLSIFPSTSGTLDEDSYAKVIDAYNNRITLGKLSFYYFSLDYFLDNDNSELKIIAVLFNNTNSKEDVSGSYIDIEMVTYIVAQDKTYRAIANTATLFNTMLSYLKFMANTPNVVTTLASLPTDAHNIIANVASATNLSMSVSSEYAGREWQVRVNNTTGTDITQPLPTSGLFQSMSGDSVVVPKNSFIELSIWYINDKLVIRVGEQA* |
Ga0114174_100006 | Ga0114174_10000683 | F090484 | MIKISRRDSRIRFKNQKDLLHKLGIGYSKFKNMTGHPMFDELFRMTDSTLVARRYRVNGIQLTLGCGKVNIPKNRILIKIKKNEITNHEKVLDRIKEAMFVNLVRNNESVLNSGETNSQADVVDGSHSYYGLIDSTISNKTIALYLNVGLTKAKEIVGMAIQDKLVKRFENIQFITYVDNPRAYIEANEHNYPIGKLIPVYRHGAVFWQIANTWTLYKKGATNRWYFGEKDIEKGEKEKVSKKDDFNFFLKDNTHILRFLNAEEVVSEDGEILGIDRKKTKEEEARSLASVMAKEAHKDFWDGYERSTQNQIIRKYYRAIIAEDKKRRMDMFLNRLKQSYDKVSGWSKEKVATVKAGMADAEACCAEVGTSVAGVCGRVSRRMKSYNNTAPDKKAGFNEVRDMYAEFAGEMAKAVGSVSEDIYMYVKAEQFKEKIENMDISVQSLPNINITVDNDKELDGESVFKDIPFEELSFYNDTYLYPSSQYSSL* |
Ga0114174_100007 | Ga0114174_100007121 | F075481 | MRLRISLKAVFCLGLSLFLSSCGSRRQVSEAFIDNRLISRIETMIDEVMDRKIVEIRTSDLNADIVITERKFDTTKEVDPSTGERPVSSQTDAHIVIGRRDSTVTTDSLGVDKTITGIEDIDKKTDIKHKDIDDKEESRWPMAIIFMSILGILVVLFVLLKRFGLIK* |
Ga0114174_100007 | Ga0114174_100007122 | F089590 | MKDKDMIERVGALWNIALAYGASCWAYFQPVHHLLTVLLIVLIANFLARLAQSVRGWKLRRSRRRRFSFKRWFREVRFTDILKEFALSCFIVMTLCVIYKTLYPIEEEASMILTVTKYGVYIALVGYVMLFLNTIGDTFADAYLVKVFKAVFKRINVFKMFGFSKNIPDETFDDIRRIADDEVKDKS* |
Ga0114174_100255 | Ga0114174_1002552 | F073656 | MTMEQDQEQMQGALYVAVDDGNKIIAMERSRRGDEGFRALLDEFTDYAANCGAIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGAVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWAFVDPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRNGFDNIERKYASDLSDEGRAAKREEQYNLAAYILDRKFPIRRRPSSEIPPMQAEGIHTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGTEFHLKRH* |
Ga0114174_100552 | Ga0114174_10055214 | F080673 | MDEIMKLQDEALLYLRDNITKDEAYYILTTDKDMIEILISDKKDGSKRIKILDAEYTIEKDDMLFLFDTDGVIDECLLVASYIGVNMYFRGQDVNAILNNINREKVMKYPYIAIQLDNIQTIEKRRVIFEITGHRVDYDKVDFMFVYFMARML* |
Ga0114174_100552 | Ga0114174_1005527 | F085718 | MIRISVEVMRKSDVRSISGITSVLQGYMGISFISRRQRKVVRNNISIGGDKMVIEFDFEIYKNGDYDKVYLRNGKEARVLCDNGKGDRPMVVMIENNNVDDYIILRYNETGRRNIDGQSSLDLMLSVKEREPELWVVVISYIDNKDKRQKMVLPNFFSRNIGGNIYLQGSSKSNVSYYVGRLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDKATV* |
Ga0114174_102245 | Ga0114174_1022453 | F068855 | MVPTISRCPSSSVPTVEAQGPQVRKSLAPQGLQAEKERENANVQNAGWLHSFDADYHYRGSNLHYHVNVSAALSEGGATW* |
Ga0114174_109196 | Ga0114174_1091961 | F092228 | MKKKCTLVLISVLTVACIVFAYLLFFYNPSFNMVYDSDTDSYFNNSYLSYNDGTLAAADYRKTKVTAYDSKNNTTVNLPSNGSLINDNLFYINGDKLCCLDTTTNTRKTIDTDCRSFVCNNEVIAYTKNDSVILKNSDTLENIGDIKFDNQIYYINISDGNLYIAERIFEDKTDEYGYSFKVVKQYIFKKYDLKSCKLLKSKIANYVNEIRYVTVCKDTFYFFCDETQTVNNVCLDKDVNYPTIQHPDVKFITSNNDCVYYISEKTESAIIRKTVESPYNGIWKLEVGSNKPVKIAEKCDCDELLATKNFLYCYTINYIL |
Ga0114174_110519 | Ga0114174_1105194 | F056682 | VAEKSDKQNTMKTQSRAGKAANQPIRQGEIYSASFGAFPSKNRSTFPIQKLGKIYENQEVL* |
Ga0114174_115892 | Ga0114174_1158921 | F101356 | MPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFCVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDKSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ* |
Ga0114174_127936 | Ga0114174_1279361 | F055775 | LVIEVATTAATLDGDTKKKLIECIEGGTITDVILVTKEDEKKISHARVVSWLVDTTGNSPKYTIDIINANSGAVKAIELN* |
⦗Top⦘ |