Basic Information | |
---|---|
IMG/M Taxon OID | 3300008282 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053086 | Ga0114156 |
Sample Name | Human stool microbial communities from NIH, USA - visit 1, subject 159166850 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 151681049 |
Sequencing Scaffolds | 12 |
Novel Protein Genes | 14 |
Associated Families | 13 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2 |
Not Available | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 5 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium D5 | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F047755 | Metagenome | 149 | Y |
F051935 | Metagenome | 143 | N |
F055775 | Metagenome | 138 | N |
F056682 | Metagenome | 137 | Y |
F062845 | Metagenome | 130 | N |
F068941 | Metagenome | 124 | N |
F074898 | Metagenome | 119 | N |
F076064 | Metagenome | 118 | N |
F078003 | Metagenome | 117 | N |
F082714 | Metagenome | 113 | N |
F092227 | Metagenome | 107 | N |
F097493 | Metagenome | 104 | Y |
F101356 | Metagenome | 102 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0114156_1000002 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 427172 | Open in IMG/M |
Ga0114156_1000113 | Not Available | 88220 | Open in IMG/M |
Ga0114156_1000567 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 35337 | Open in IMG/M |
Ga0114156_1003912 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 5058 | Open in IMG/M |
Ga0114156_1009357 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium D5 | 2226 | Open in IMG/M |
Ga0114156_1010591 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1993 | Open in IMG/M |
Ga0114156_1010704 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1977 | Open in IMG/M |
Ga0114156_1012015 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1782 | Open in IMG/M |
Ga0114156_1019914 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1122 | Open in IMG/M |
Ga0114156_1020235 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1106 | Open in IMG/M |
Ga0114156_1022621 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 998 | Open in IMG/M |
Ga0114156_1035102 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium | 666 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0114156_1000002 | Ga0114156_1000002117 | F068941 | MKNNKGLIIGIIGLIIVMIGGTYAYYRWNSTSNINVSVKISGNTVTFVGGSNVTGTLTPVDSKEEGIKKDITVKASEAGSTMSLYMDLTTMPNELKEESFVYELYYNDTTLVKKGNFKAYNASSNASGITYASSGVTTLTLFTDRNVNTTTDKYTLYLWFNGKDFTNPDTMQNKTLSFDLYATGKNATLNG* |
Ga0114156_1000113 | Ga0114156_100011320 | F055775 | MAQIAQQDNLVIEVTTTAAALDGDTKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTTGDSPKYTIDIINANSETVEAIALN* |
Ga0114156_1000567 | Ga0114156_10005672 | F047755 | MENKKVEYLINMINDMDIKDKLRLAICMSQSKWSGLIYNTNENYEKFDNMLKSIDEEYRTTFINFGKYKVVMFAMAKLMEMETTEQNKVALYLSNILVKKLGL* |
Ga0114156_1000567 | Ga0114156_10005676 | F047755 | MENKKVQYLINIINDMDLKDKLRLAICMSQSKWAGLIYNTKENYEKFDKILKEVDEEYRTTLINFSKYKLVMFAMAKIMEMTKEEQNQVALFYLMQ* |
Ga0114156_1003315 | Ga0114156_10033152 | F101356 | LDEKLQTDVGELIPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFYVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDKSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ* |
Ga0114156_1003912 | Ga0114156_10039123 | F076064 | LKKTIPGRALDNLFYLQYDLPPEASFHATTEEAKRPDELYIRKLLPELTRLKLQPCHVVANDEAYYAAMKGISLFTPEAEKLMHRADYYSARRQIRLCAPDLKRRNEAKRPPKPALKFY* |
Ga0114156_1009357 | Ga0114156_10093574 | F056682 | MKMQSRAGKVANQPIGQSKTHSASFGAFPSKNRSTFPIQKLGKNYKNQEVL* |
Ga0114156_1010591 | Ga0114156_10105911 | F074898 | MEKCTVSFAACQDREAERTKRKQNKKKQCGTRNLSFSLTKRTKCCILYVAKAFVLYRKECFFMRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMMAGLGETPFCSLTLPSGQIDLAFSPDKGLCVQSGGQWAQLLVEGRPVDAALLTTPLKLLDGRSPSEVLGLLAEDINKMLDAIPSLYNLPXXXXMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHNLLASVVQTSEVARAYRSAMRGYLLSNFRLSGQIGIEKGELLFSQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQTGSNFSLRLDNENANVFALNANVFTLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQAGTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPEAIGTIELTQLLDVLF* |
Ga0114156_1010704 | Ga0114156_10107041 | F078003 | VNEHLAFAQCLQRVLNETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKHMGEQLTEAQWAALNRALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGTEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREEYEPNVAALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPYTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKISGQ* |
Ga0114156_1012015 | Ga0114156_10120151 | F051935 | MKRLMGLLLAMMVMMGGISCAVAENANPLVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAENLKEMRFLPVVAQKDNQLRLLILRRQGDLWKVSEQNDRALMRDGWMLQNFSAMPYGNSDWTYIYFDFVDENQKRWNLMLNLGDGYVSSFGTISHYVEGYGTTYINMNYDRGLEFLIDAPAYSRFSYEVYPVERYSFGVEDFDLATCPLTMQEFLVPAIVTCGEEGAGLYIMVQQDVQPIVTLADGDAIEAIPQKWELDWTIVYYQGNYLFMKTENCKMEE* |
Ga0114156_1019914 | Ga0114156_10199141 | F082714 | RNARVGEKSGTRNADDEGRELVMVVGIRFAADAPVRTVLQAVLPIFSTADVDFLVREYWVCTFGNGLPERRFTAQEMRRALDALTPDEHAELFTIYVLPHNAPGTPPSSCEDFCARGFTMAFYAYDGDGYALLAQSGEQLRAVIETLRKAVEIRSVEAVERKTLARWQF* |
Ga0114156_1020235 | Ga0114156_10202352 | F092227 | MNDKKRKRILRVGCLILACVFGLSLLGSLVMMLLV* |
Ga0114156_1022621 | Ga0114156_10226212 | F097493 | MKNGTAYFYEHGVEIDGTVYGIRTDRDTLRIKRSVVNDKFAETDDNFDMDTEIAKIQHTDVTFEQPTAEQLEQIQAKTYNSMSELKQHVQSVMNGDETMSQDEINAMLLLQIAELKAGVGGE* |
Ga0114156_1035102 | Ga0114156_10351022 | F062845 | MEKTPFILHEKFRSDNNEQRKEHFQKEFERYIVDGLSNTVPSKSCA* |
⦗Top⦘ |