| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300008472 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053283 | Ga0115373 |
| Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 159551223 reassembly |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 158364712 |
| Sequencing Scaffolds | 18 |
| Novel Protein Genes | 22 |
| Associated Families | 19 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 1 |
| Not Available | 3 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 2 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 2 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | National Institutes of Health, USA | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F018385 | Metagenome | 235 | Y |
| F027205 | Metagenome | 195 | N |
| F032313 | Metagenome | 180 | N |
| F033081 | Metagenome | 178 | Y |
| F036281 | Metagenome | 170 | N |
| F043991 | Metagenome | 155 | N |
| F045567 | Metagenome | 152 | N |
| F046431 | Metagenome | 151 | Y |
| F051213 | Metagenome | 144 | N |
| F073671 | Metagenome | 120 | N |
| F077405 | Metagenome | 117 | N |
| F080164 | Metagenome | 115 | N |
| F085820 | Metagenome | 111 | N |
| F089055 | Metagenome | 109 | Y |
| F097527 | Metagenome | 104 | N |
| F103431 | Metagenome | 101 | N |
| F103432 | Metagenome | 101 | N |
| F105378 | Metagenome | 100 | N |
| F105380 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0115373_1000897 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 19467 | Open in IMG/M |
| Ga0115373_1001205 | Not Available | 16026 | Open in IMG/M |
| Ga0115373_1001841 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 11727 | Open in IMG/M |
| Ga0115373_1002596 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 9242 | Open in IMG/M |
| Ga0115373_1003979 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 6668 | Open in IMG/M |
| Ga0115373_1004425 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 6152 | Open in IMG/M |
| Ga0115373_1007604 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 3822 | Open in IMG/M |
| Ga0115373_1015485 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1879 | Open in IMG/M |
| Ga0115373_1016452 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1767 | Open in IMG/M |
| Ga0115373_1016777 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1731 | Open in IMG/M |
| Ga0115373_1017835 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1625 | Open in IMG/M |
| Ga0115373_1020109 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1428 | Open in IMG/M |
| Ga0115373_1024626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1153 | Open in IMG/M |
| Ga0115373_1027788 | All Organisms → Viruses → Predicted Viral | 1017 | Open in IMG/M |
| Ga0115373_1033523 | Not Available | 831 | Open in IMG/M |
| Ga0115373_1034290 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 812 | Open in IMG/M |
| Ga0115373_1035271 | Not Available | 789 | Open in IMG/M |
| Ga0115373_1037725 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 734 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0115373_1000897 | Ga0115373_10008972 | F085820 | MYPRDLKVFAAGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDHLKQRIVLRLADGTEIEKELSEKGKK* |
| Ga0115373_1001205 | Ga0115373_100120510 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNAPITMQDHLENNQIGYDSANSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPLFVDVQCVEYTAGDLGEVSVSYTADSISIYNTGSFTGSFQCLIVYPLGSVNE* |
| Ga0115373_1001841 | Ga0115373_10018411 | F032313 | MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK* |
| Ga0115373_1001841 | Ga0115373_10018412 | F032313 | MACDNNTPQEKPHEQEKHEVPXXXXKPQFDEVGERIWYGRTPAMRLDSTDYGAGLISVFGMLTSKIPKQRFDSLFKQTVWEVKDIRVVETDLSLAKKNPGILGWVTTTEFTCRNGVILLHWQGIDVNQVDTVNYVYDEVDNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK* |
| Ga0115373_1002596 | Ga0115373_100259610 | F089055 | HKRTFWHLKVEKMNSTPECVTKTPEIKAREKEAREKLAVIFSDAEQRDNSKVNPELGKTAFDVANIPNNAAVDLCNKALGSYGKSLDRIKNSPLEAVWAIGTSLQHLRDEYKTEESCG* |
| Ga0115373_1003979 | Ga0115373_10039791 | F103432 | MKKLIHSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDSTTMFRRHTLPSVSAYSFQVVAVGDTIYRQKESDAQFNADLDALFQQSIGIAPRLFGVRELSVLGIDSRGKTRDLGNYSCPLLQGKRRNANFRTTEGVFHEHYEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDG |
| Ga0115373_1004425 | Ga0115373_100442512 | F018385 | MADLINSWLPYQELSIEKDRDPVTDDEIIYGSNVKHFTLTVYSPEGRVSKYWNARILKDQVGYCRVACPREKKILCFNWVNWTAYMFTHDGMNELVFMPDARRRTVSQLSFDN* |
| Ga0115373_1004425 | Ga0115373_10044255 | F036281 | MITLIKVDEGPVDIYELRMQYLAKLKQTDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYNAKEYYDYWTAREGKPAPFFYELRQYHVKSFMRVPGSTDLWITAEREHGRWYTFRMSDDQKSKFTRHTMTNEKGHQTYDWVLENVEWAADTIRYF* |
| Ga0115373_1004425 | Ga0115373_10044256 | F027205 | VASRLIVSADDILKAVKESEEFERKALNEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPRRIKRSTK* |
| Ga0115373_1004425 | Ga0115373_10044259 | F043991 | MSKKNPSVIDYFDLNGDLNEEAYEFEDVKLDEYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLGIVYADINFPNGIRTILFKCRQKKNLTRFISRVLELAQGDSSNIHPDFRA* |
| Ga0115373_1007604 | Ga0115373_10076042 | F103431 | MIDLDALVVGMLFFTQLFLQGIAWRIAIAHFLHTERGNAAAAAFDGAFGENIADCHAEDDKDKDTESKEEGFHVCIPEG* |
| Ga0115373_1015485 | Ga0115373_10154851 | F051213 | MKASKLLWAVIVAFTFVFTSCDRVGDEPTIEGKLDKFFDSQAQRKSFRILTGSGKPYNHKVDWHIIGITDPYSDTYLTKKVDTLSNGDLKISYDWVSFTVRENKSVIDVEVQKNETGKVRAVYLNTSTSGRQITLPDMRVTQRAE* |
| Ga0115373_1016452 | Ga0115373_10164521 | F045567 | GLDSLNMGDVAPLADTIAYDWERAMRQRDGRDTLTEELKGGITPLLYRAEGEARRPWVGMVTEDVVHTSTHRVEDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVRATVNEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEIPSLSSERGCVVERVVR* |
| Ga0115373_1016777 | Ga0115373_10167772 | F018385 | MMAEYENQWGPYKEHSIEKDRDPVLDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGYCRIACPRDGKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLYHEEVK* |
| Ga0115373_1017835 | Ga0115373_10178354 | F046431 | VIISRAAESELTLTPKPETNNIHLKWTGPQNSSYKVYQKKPGSNNFETIGLTDFSNNATDEEVKVLNIYPQESNADGRLWPTLNPSDVANIAKVIPKVQVTYLDGQTETIQKSALLKVWMEGRNSKRR* |
| Ga0115373_1020109 | Ga0115373_10201092 | F033081 | MYPPYLIMVHRPQKGVMAWLFKKGMPQDPKPVFVWPRLVTEIENAGYFSRRKFSILAVGLIIMTIATIKMLLLVPGLNQSVVSLLTRGLETFLPTRWATVTAWTVGMAGVFLMGDLTNYTPSQMFLHKIKATRFEVYNIILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIANIWYSFAAGIALSVTGFGFLLVYLWRYRKYRSQIIATAAATTVHALYNAIALSLIAVVLAIDIAKLL* |
| Ga0115373_1024626 | Ga0115373_10246261 | F046431 | KMTIVSILLIIMLTYTQTLVFAAESELTLTPKPETNNIHLKWTGPQNSSYKVFQKKPGATQFETIGLTDFSNTDEEVRVLNVYPVSIAEYNTPYVNVTYLDGTSEDIPKSALLKVWMEGRNSK* |
| Ga0115373_1027788 | Ga0115373_10277882 | F073671 | MNKEAEHELAELHEKERSLEKALELVREKIRELINYTNKNKAAR* |
| Ga0115373_1033523 | Ga0115373_10335231 | F077405 | RLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQKQKVRLK* |
| Ga0115373_1034290 | Ga0115373_10342902 | F097527 | MIYFKMEKIGNSTYNKEKKTRSENLVFITIPAAGV |
| Ga0115373_1035271 | Ga0115373_10352711 | F080164 | GLTLCAAPQVTLRERANAFPLITEKDESEIDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGFGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRGDLLLNAANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEMELEDGTKLSTSLQHYDEQTSILDHPDRS* |
| Ga0115373_1037725 | Ga0115373_10377251 | F105380 | VKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRNKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK* |
| ⦗Top⦘ |