| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300008482 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053237 | Ga0115187 |
| Sample Name | Human stool microbial communities from NIH, USA - visit 2, subject 159490532 reassembly |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 115628221 |
| Sequencing Scaffolds | 15 |
| Novel Protein Genes | 18 |
| Associated Families | 18 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis | 2 |
| Not Available | 4 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 6 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | National Institutes of Health, USA | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F050794 | Metagenome | 145 | N |
| F057001 | Metagenome | 137 | Y |
| F058555 | Metagenome | 135 | N |
| F067844 | Metagenome | 125 | N |
| F076653 | Metagenome | 118 | N |
| F078004 | Metagenome | 117 | N |
| F078005 | Metagenome | 117 | N |
| F078006 | Metagenome | 117 | N |
| F080673 | Metagenome | 115 | N |
| F081453 | Metagenome | 114 | N |
| F083451 | Metagenome | 113 | N |
| F085718 | Metagenome | 111 | N |
| F089591 | Metagenome | 109 | N |
| F089592 | Metagenome | 109 | N |
| F090515 | Metagenome | 108 | N |
| F093883 | Metagenome | 106 | N |
| F097172 | Metagenome / Metatranscriptome | 104 | Y |
| F102167 | Metagenome | 102 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0115187_100173 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis | 70996 | Open in IMG/M |
| Ga0115187_100400 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis | 42856 | Open in IMG/M |
| Ga0115187_102091 | Not Available | 9769 | Open in IMG/M |
| Ga0115187_103300 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 5571 | Open in IMG/M |
| Ga0115187_103396 | Not Available | 5370 | Open in IMG/M |
| Ga0115187_103475 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 5212 | Open in IMG/M |
| Ga0115187_103964 | Not Available | 4434 | Open in IMG/M |
| Ga0115187_105229 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 3112 | Open in IMG/M |
| Ga0115187_107252 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2013 | Open in IMG/M |
| Ga0115187_108206 | All Organisms → Viruses → Predicted Viral | 1722 | Open in IMG/M |
| Ga0115187_108268 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 1706 | Open in IMG/M |
| Ga0115187_109479 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 1441 | Open in IMG/M |
| Ga0115187_110078 | Not Available | 1332 | Open in IMG/M |
| Ga0115187_111741 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 1098 | Open in IMG/M |
| Ga0115187_120271 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 608 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0115187_100173 | Ga0115187_10017355 | F081453 | MYPFFLLAGENIIGKHTLANPVCGTNIKAPELAVKGAPEKDIRIP* |
| Ga0115187_100400 | Ga0115187_10040025 | F097172 | MSIIQIYLQNPKEGFIMKKNVFKKLMCAVLATACVATAVVPAMADDVVTAEAATRKVTSAYKYHIDGYDKKGYPIDGFSKTSFYKDLNSLPSVKTGKTTINVPAVTSSVKSVSKEKGEPCYESYVKFKAPKTGKYVVTLNNLQGTDDKSLKSLSCSLCEIAKTGKKYTLSGFEPDCNTVGKYDTLCENNYLARLRTILDNYKAEHPEYADVIEETYEYQKDFVNKYPVAKDKFTTKLKKGQTYVFVIDNRGMQKAVPPYFTTHGSDEQSCLWGGNYLKAYSFDMNIEYRK* |
| Ga0115187_100699 | Ga0115187_10069911 | F078004 | MIRHTLVIAVINTFYRLTVDTDGMAWMRQGITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF* |
| Ga0115187_102091 | Ga0115187_1020915 | F093883 | MEEDKDIKKEIRDYLKEEADTHIRHWIAIKRESKRLYSDIEDRTKKIALKSSSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFMIQLATSFGMDLDLDTTYASTAKGIIEDKTSGFVFYDDKERLRYADKELEDMFHDMSVTEVSKIGVVQSYELLMKQYNEFKDMKANATGKTKADE* |
| Ga0115187_103300 | Ga0115187_1033005 | F102167 | MDINQIKKYLPLGWDVVDLIDHGIIDLDIMNGKMMGEYVAVLMIKSYDKTNGHILTTFSFHDKDMDKLRMLIGNAIMAVGYRNNPLNGDGNTAIK* |
| Ga0115187_103396 | Ga0115187_1033961 | F050794 | MTEFNKRLSDKIGIECTMDILLPTDDDNANIIIEYNGIIKKLMREAEKLELDTDAIKDMMRDLLNELKDDVDLNILIFDVTQLLIKYNLFRLDAITEQEFKDSFVRMDSRNMEIKKLTLSDIKEVVTMIEDRYSYALYMTEECD* |
| Ga0115187_103475 | Ga0115187_1034758 | F083451 | MDRNEREKQVLDLLMSRKDIRKLVEKSNECYSKMDFVGAMKYRQEIKDIVDRESKIMLTKSESLVSLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNFKFVKFDNLDRLMTECKKEIDYLMKGTSKSFQISFAVRSDELREMIENMVGDNIREGYDIFKEEAEMVNETDRSKIEEFNKKLDHD* |
| Ga0115187_103964 | Ga0115187_1039642 | F089591 | MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLVKYYLNRKDDETIRSIQTLLVYWESLNKGNHIDDPESTSFGLPDDYLWFSNIKGSFSYKGCKVGDFVMWEAKNENVHELLGDDSNKPSFDYRETFYTIGDGKVVVYEDGFRTDEVRMTYYRNPVRVDLTGYINAAGMQSTDIDPELPDPLVEEILDMVAKQFSLNENELNRYSMDKDNVASFK* |
| Ga0115187_103964 | Ga0115187_1039643 | F089592 | MKEILKSRKVLAEVNGFNIMSDTLYEVVGKHDGSAPQAFQDANIAKAPFPENATHVCCPWDDFSKAYNTGFYPRSRCYNGLDKNEIDRLVKQRVDNIMKPFEEMSQMDLSQTNLEFWDDAKDKIFMGKVYNTANTVDLFYLYLAVFSGMLTPQEMDGDPVFMNSMFCFVEKDNMKDFVQQREINKMNISYKFISALKKGGDDRQAVIDLLLYIGIVTRPDFTEDEYYTGSLSNWMNEKKTNVDYLLDIWDRSLEGDFKEVLEFYRIVNVLQRNGRINMTPSGLQYNGQIIGPDVRTSAEFLATKKDFINIKANVLDEYEEIISMSNIDNKSKTKKVKDIKKKDGVEEGDKVKEE* |
| Ga0115187_105229 | Ga0115187_1052296 | F076653 | MTIRDKYFGWKDIFFDRFVHCCNEKSDQPQGSNIPLAKINFDNKTGYVEDGTINIAELLQYLWINNKVYGCEYAPIDISSALQTLIRLTENAKHMFEDQPGVYDMIPYRGFFLRDDFLSGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSILAKELGYDINSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRENKFKMTDKVKRGANNWYYSGGTISCVDSFLGKKYRKNLRTFIYRGIVFFLDRIWHTPLFEKMGVKMKYNAYYCYAATSGIWYNKGFKKRLAKRFNESLRGGGDLFGANLACMVCDHKDIDWEALRLWLDKYDEPNDKGMVNSPIQFMYLYLYYYFNK* |
| Ga0115187_107252 | Ga0115187_1072524 | F067844 | MNTLAAETASAWYLILNRKLQTLPIYITQLSSHLPRPLTMGTQQVSAGAAVITPQHRSY |
| Ga0115187_108206 | Ga0115187_1082061 | F057001 | MTSKDLNKVQSEVKKASEKTLTGAVKAWCQLFKSGKEVNEILKENDIKVDKDIVPALVSLAKDKEVVIQLCKDILPRINNTFCAYKEIEREYLDKLDQDKNVKMTVDKIESIAILGTTHKRFGYNEPVEYDGGVYYDVFNGSDKRIVKCAVPIKRYTYNLIAKCITYYLTHPKNER* |
| Ga0115187_108268 | Ga0115187_1082681 | F078005 | LIQKQNVYFCRHIKIAYMKKSEFVKKLEKIIDMVKTEDDGFEYGGKVIFYKEDDSNYEVSVMNIEMNLEVEANVMAGMDDMDFTCLMSEVYKQKAAKAIMMEKDDDEDN* |
| Ga0115187_109479 | Ga0115187_1094792 | F085718 | MVIEFDFEIYKNGDYDKVYLRNGKEARVLCDNGKGNSPMVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREPELWVVVISYMDNKDKRQKMALPNFFSRNIRGNIYLQGSSKSSVSYYVDKLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDETAV* |
| Ga0115187_109479 | Ga0115187_1094794 | F078006 | RVFAWQDDTYNRGWSKGDYIMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSRGSGSGCGIKEEATFDLATALDVMNGPLPRWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI* |
| Ga0115187_110078 | Ga0115187_1100781 | F058555 | MGTKISLLQKMKSNFDKILTEAYIPKDIQAKKDELGCLRLPAGSLVCPVDYKPVTNKDGKKVTAVKYSNKKDNIRGSGMVIEKKCKQVTAYLSIINVQKHVFLRNRMREGYRDRIEINTDDFIDILSDGIAYFCFRHVIENCHEDIDYQLKTLKAYAEGEIRIALPDIMIYSYKAKKNEDTKDIFVGKKTSVYKCLNKNLSSDERRNMANKSRKLDRVKILSKIIFRARTRNVHHIYKVTKRKTVKFNVAYLLNELNKKLIGIGMREISQSTIYRYISMFLDMCKKSISDLYDEVKKNNGVVNTKDRKNVTIGCLRLLYKGKYMHILISTEYIRDVFLGEKSSEMSKAG* |
| Ga0115187_111741 | Ga0115187_1117412 | F080673 | MDEIIKLQDEILSYLRNNITKDEAYYILTTDKDMIEVLISDKKDGSKRIKILDMEYTIEKDDMLLLFDTDGVIDECLLVASYIGVNMYFRRQDVNAILYNINREKVMKYPYIAIQLDNIQTIEKRRVVFEITGHRMDDNKERIDFMFIF* |
| Ga0115187_120271 | Ga0115187_1202711 | F090515 | QESVITCCKAGQKAALQGTAPGRVACLEFAGGEKRKTNIAADYAVKVPAFTALLCRHIGSIRTFLKS* |
| ⦗Top⦘ |