Basic Information | |
---|---|
IMG/M Taxon OID | 3300007650 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052761 | Ga0105532 |
Sample Name | Human stool microbial communities from NIH, USA - visit 1, subject 604812005 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 129959930 |
Sequencing Scaffolds | 19 |
Novel Protein Genes | 23 |
Associated Families | 23 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 2 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 11 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:74 | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F026592 | Metagenome / Metatranscriptome | 197 | Y |
F032312 | Metagenome / Metatranscriptome | 180 | N |
F041208 | Metagenome | 160 | N |
F042910 | Metagenome | 157 | N |
F044555 | Metagenome / Metatranscriptome | 154 | N |
F051934 | Metagenome | 143 | N |
F051935 | Metagenome | 143 | N |
F068856 | Metagenome | 124 | N |
F072366 | Metagenome | 121 | N |
F073656 | Metagenome | 120 | N |
F074898 | Metagenome | 119 | N |
F074899 | Metagenome / Metatranscriptome | 119 | N |
F076064 | Metagenome | 118 | N |
F077319 | Metagenome | 117 | N |
F089005 | Metagenome | 109 | N |
F094005 | Metagenome / Metatranscriptome | 106 | N |
F095494 | Metagenome | 105 | N |
F100397 | Metagenome | 102 | N |
F101192 | Metagenome | 102 | N |
F101193 | Metagenome | 102 | N |
F101357 | Metagenome / Metatranscriptome | 102 | N |
F105374 | Metagenome | 100 | N |
F105375 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0105532_100096 | Not Available | 84161 | Open in IMG/M |
Ga0105532_100460 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 37292 | Open in IMG/M |
Ga0105532_101816 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 10436 | Open in IMG/M |
Ga0105532_103227 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 5465 | Open in IMG/M |
Ga0105532_104007 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 4501 | Open in IMG/M |
Ga0105532_109398 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2140 | Open in IMG/M |
Ga0105532_109907 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2047 | Open in IMG/M |
Ga0105532_110554 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1936 | Open in IMG/M |
Ga0105532_111103 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:74 | 1855 | Open in IMG/M |
Ga0105532_111564 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1787 | Open in IMG/M |
Ga0105532_112877 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1631 | Open in IMG/M |
Ga0105532_113043 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1612 | Open in IMG/M |
Ga0105532_115065 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1421 | Open in IMG/M |
Ga0105532_115956 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1347 | Open in IMG/M |
Ga0105532_119980 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1095 | Open in IMG/M |
Ga0105532_123342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 939 | Open in IMG/M |
Ga0105532_128005 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 774 | Open in IMG/M |
Ga0105532_132126 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus | 668 | Open in IMG/M |
Ga0105532_134519 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 618 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0105532_100096 | Ga0105532_100096108 | F094005 | MKKEIVKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLDGQFTRLDGKGKGNAIILPCTTENEQKYNAFVAYHQSKKSLEAIKKWLDDNGKHKDEETFEKVITLDKKLKKLIEKLNE* |
Ga0105532_100096 | Ga0105532_100096109 | F032312 | MARIKDYDEDLSAPKLLKERARDSKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDAMVCLGFTWFKLLGVALMMLLWPIVFIYATLNDGIEGYPFKKYAIPYIFILVVWFIIFLYELVS* |
Ga0105532_100096 | Ga0105532_10009611 | F101357 | MNLNNITTVLKTGITIYQYEQWQNTGSVNLMQKESHMLSKVWLKTNIYNPDSLDKPFIQLSATFTSESDIQEYNEWLNANQYKLYPLLLDILKISLKDDFYNYSNASNIHYEGGKFPSMLTIQLFNLEF* |
Ga0105532_100096 | Ga0105532_10009614 | F105375 | MKNIETFQTTQHLDNLVTNLGLQIQELFSLDLEEILDYSNNLMNLLVNAYVENQCLALSAMISKQDGFAIYSFLFQTPDTSNGAADALVSFAMNFTDGEANIKSINRISSNIMQITFTV* |
Ga0105532_100096 | Ga0105532_1000965 | F044555 | MKTTNPSSRITLSQNGNQILTCKVYKEPNYILSMSNEEILEFISGLDYMGNLPTVPDLEKPIEIQVSTTRQIPLEQNKEVQTKIKEIIYNNLYDTLIDELKNTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHDK* |
Ga0105532_100460 | Ga0105532_10046013 | F073656 | MTMEQDQEQMQGALYVAVDDGNKIVAMERSRRGDEGFRALLDEFTDYAANCGAIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGTVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWTFVDPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRSGFDNIERKYASDLSDEGRAAKREEQYNLAAYILDRKFPIRRRPSSEIPPMQAEGIRTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGTEFHLKRH* |
Ga0105532_101816 | Ga0105532_1018163 | F076064 | LKKTIPGRALENLFYLQYDLPPEASFHATTEEAKRPDELYMRKLLPELTRLKLQPRHVVANDEVYYAAMKGVMLFTPEAEKLMLTEDYFSARRQIRLCAPDLKRRNETRRYPMPVLKLY* |
Ga0105532_103227 | Ga0105532_1032274 | F105374 | LRPGFGAAENIRYLVLSKGVFAMKKRVTLLVALCIWKVVAAQTPYGEMPERFRPDTLPCRLGGGVCFGMDGLDAAIPRGGGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRMRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVVRSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP* |
Ga0105532_104007 | Ga0105532_1040075 | F026592 | RTASPQGIAALAAQGGVATLTERSDATFSVMQFSSADGE* |
Ga0105532_109398 | Ga0105532_1093982 | F101193 | MARKCTEYHYFKHTYWDAMARYNWTRCPHCGKLCKPSRFKAATVLVPIELVIFVVCIFFRNSMNDAIGWFAAWLLFVLLLFLPQYIYVRFFMPYETLSEDETRKFHELQKN* |
Ga0105532_109907 | Ga0105532_1099072 | F074898 | MRKILSLLLMLALFLPCALAETPQGIDLALTSTYGDGLSLWMTAGLGETPFCSLTLPSGQINLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEVLGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHNLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFRQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDAIGTIELTQLLDGLF* |
Ga0105532_110554 | Ga0105532_1105542 | F100397 | MKKLIAILVLSLSLILCLTACGSKMPYDLSETASVELHAYNNDSTEPFAKIVVDGEDVATIVEMFNSLKLKEMKYTEPSIRGYEFWFRDENGSEIVKIELPYGPSPWLVVGGTEYPYQDVNGGVDVDYLAQLVDMTISAGPMQPEDGVDHNAPVEPLFYKSAPGLSIQHDGKSVRALPGTSSWQYMNPDGTSTGIEADSMHPLDAKEYMPVLPAVEGEAWLVFDTAPDEITVKAWPVSKWGDLSAVDEAIVVPVSGDKITLLEGGHIFE |
Ga0105532_111103 | Ga0105532_1111033 | F101192 | MQTEPSDKEGRNMTYRGWLLVDIALLLTALSGTQTSLCQRMRSVPVSRGLTYWEVQQLAKAAPTSITPCGEDTIRRWVSGRYQIALRFNRYDVCLGVEEEIDG* |
Ga0105532_111564 | Ga0105532_1115641 | F072366 | MLDILAIKADVYHLERQGKRLPVYRYLREVWQKEPPSEGLTVLALQQMVDYVEYVDDLTVLGEPWEAENEYDLYQDFLLDVISWGLQKYRAKKRFLWQICYYVNAWATFYYIFGREITQENVEQWKKTLFEEAKERYPDSMLFEFIPHAAQLDYVWFYRLTDEQRLQIRLEVGEWNLQKNDMDQAVQSYFDDAMTWYRDNGRKLLEAKNKTNN* |
Ga0105532_112877 | Ga0105532_1128771 | F041208 | MRKILAMLLSVILLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAIASYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDVAQTFFAVQADDADGETNLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDMLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDTFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGLWGPQDG |
Ga0105532_113043 | Ga0105532_1130431 | F074899 | WNKQAVTSSFLDKKPPESLILQGLEGSTTLGKDEVGSSNLPSSSK* |
Ga0105532_115065 | Ga0105532_1150652 | F068856 | VKDSGASAEGNVLAAAHAVLGKGRKGMKRLTSILLALLMLVGMALAEETPDAALGDWYALPSDETVLRLTLREDGTFFFGTEGISGIEGKWRKTTDGEYNLAYTNRSSSLLDVIMSMVDSQAPAPDMTMTARLTESGLDVFYGSTAEGAVVHMARDAEELRTERTPRTDTPLEAFAGTWTMETMFLGTMQLTYTPEMGERQVFCTIDGLTMFPGAGLESFPEGTSFPLTFEDGVLRTTIPLTVQMAASSALVKEIVVDYDLTFFQ |
Ga0105532_115956 | Ga0105532_1159563 | F051935 | MKRLLGLLLAMMVMMGGISCAVAENTNPIVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAENLKEMRFLPVVAQKGNQLRLLILRKQGDLWKVSEQNDRALMRDGWMLQNFSAMPYVNSDSTYIYFDFVDENQTEWELVLNLSTVYVSYFRMIYTAEEYGITEIVFNYERGVDFQFDAPFYFRLSYDVDPAKSLSFGVADFELATCPLSMREFLVPAVVSCGEEGAGLYIMAKQGIQPILVLADGEMIEAIPQRWQRDWVIVCYRGNYLFMKTENCKMEE* |
Ga0105532_119980 | Ga0105532_1199801 | F051934 | LKSLFRLLTVEGDVVASDGSFDARIDLSLTNAPEKTATRIRFFGLDSHWGIQTTDLGGETLMFNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGTDADIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTVIIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLSGPSGGTAPGGK |
Ga0105532_123342 | Ga0105532_1233421 | F042910 | GARCAGGCRRPGAGAGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITTLTISAEGKISFQGLTLAPGAAVRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC* |
Ga0105532_128005 | Ga0105532_1280051 | F095494 | ETQRRLLNIIQAEAARTDAPTDETKIHTCMDLLERLQGEQKPIAPARVDALRQHIAAAHQKNERKRQKRKKIIAAAACSAAAIVVAFAVSHPLLWYENWTTSDEQQHFVTSHEIAIEMLETAVADPTLPSGDTVEVQSIAALDALIGRKTGIPEMVNGQWELQHRYVNFTRSGISISLMYVNAADAQQTIVGVINLISNPQYMMLSFEQSYEGTIQQFDGLNFYITENINKPVALWQGHDQTLLFFRRTSPPPAAAL |
Ga0105532_132126 | Ga0105532_1321261 | F089005 | KKCLTKSKQCSIIALALLRLATSNEESKQALKVRRTLKIEQRDNSKETRNDFEESSKNYSEMYTKKHQE* |
Ga0105532_134519 | Ga0105532_1345191 | F077319 | MQENTVCAANCTVQEKRGGQKIMLIRNATLQMSATERTCMDVRVMNGCVWEMGAALVKGLYESETDLCGDVLMPGRMLETPIPAADEKALRLLCRRLYREGVRYFVADCPADALLRVQNHPERRGALPVTALPTPEPLRSGTGMPLTRWTAAGGVVCGWVLYCRRGRSARGEALRRS |
⦗Top⦘ |