| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006477 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052507 | Ga0100232 |
| Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 2 of subject 158883629 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 127012939 |
| Sequencing Scaffolds | 21 |
| Novel Protein Genes | 23 |
| Associated Families | 22 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 1 |
| Not Available | 2 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F032313 | Metagenome | 180 | N |
| F046433 | Metagenome | 151 | N |
| F054110 | Metagenome | 140 | N |
| F066860 | Metagenome | 126 | N |
| F068942 | Metagenome | 124 | N |
| F072446 | Metagenome | 121 | N |
| F073671 | Metagenome | 120 | N |
| F077405 | Metagenome | 117 | N |
| F080164 | Metagenome | 115 | N |
| F080166 | Metagenome | 115 | N |
| F081510 | Metagenome | 114 | N |
| F085820 | Metagenome | 111 | N |
| F089055 | Metagenome | 109 | Y |
| F092230 | Metagenome | 107 | N |
| F094007 | Metagenome | 106 | N |
| F095629 | Metagenome | 105 | N |
| F095633 | Metagenome | 105 | N |
| F097527 | Metagenome | 104 | N |
| F103431 | Metagenome | 101 | N |
| F103432 | Metagenome | 101 | N |
| F105376 | Metagenome | 100 | N |
| F105378 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0100232_100171 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 54737 | Open in IMG/M |
| Ga0100232_100249 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 42902 | Open in IMG/M |
| Ga0100232_100352 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 33072 | Open in IMG/M |
| Ga0100232_100523 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 25556 | Open in IMG/M |
| Ga0100232_100525 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 25436 | Open in IMG/M |
| Ga0100232_100829 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 18593 | Open in IMG/M |
| Ga0100232_101013 | Not Available | 15800 | Open in IMG/M |
| Ga0100232_101339 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 12533 | Open in IMG/M |
| Ga0100232_102770 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 7058 | Open in IMG/M |
| Ga0100232_102951 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 6701 | Open in IMG/M |
| Ga0100232_103542 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 5712 | Open in IMG/M |
| Ga0100232_104628 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4538 | Open in IMG/M |
| Ga0100232_105931 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 3644 | Open in IMG/M |
| Ga0100232_108878 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 2490 | Open in IMG/M |
| Ga0100232_113108 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1701 | Open in IMG/M |
| Ga0100232_117876 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1242 | Open in IMG/M |
| Ga0100232_118339 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1211 | Open in IMG/M |
| Ga0100232_125867 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 852 | Open in IMG/M |
| Ga0100232_125904 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 851 | Open in IMG/M |
| Ga0100232_126212 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 843 | Open in IMG/M |
| Ga0100232_136054 | Not Available | 605 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0100232_100171 | Ga0100232_10017154 | F081510 | MKFDLNTIKASTKSALVTTKILGKKYAPYVLLGAGLIGYGYSVYEGIKSGKKLEATKAKYEQMDAVGEEYTRMDVVKDITKDVAIPVAVATASTASIILGFAIQTNRLKAVSSALAIVTEEHARYRLRAKTVLDEETFKKVDAPLETKTVNVDGEDIEVESIIPNEGDLYGMWFKHSHKYASDSPEYNEGVIKEADKVLTEKMMRSGMLTFAEVLDILGFEVPRAALPFGWTDTDGFYIEWDAHEVWNDDKQETEIQFYVRWKTPRNLYATTSFKDFIPKKTRKELN* |
| Ga0100232_100249 | Ga0100232_10024938 | F105376 | MKMNKNKKGGDMEPDVSAKEFGALQAKVEYIKDGVDKHTATLERIENIARANVAQAQLKTYITEHEQESEKKYVKRSEIEGVMNFWSLVTSNLAKLFAVALVGLAIYATNNLIQQNKAITELQEEVQTQVRRK* |
| Ga0100232_100352 | Ga0100232_10035231 | F080164 | MRPTSFVLSLLLGMVGLAPCAAQQVTLRERALAFPLITEKAPSEIYEPYAWRLPVVPLSLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKVEKPLLYKGQAGQLVLCEYYESHRGDLFLDVANARPEIFGELCPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS* |
| Ga0100232_100352 | Ga0100232_10035232 | F103432 | MKLIHSLFSLSLLLALSGLFCTTACQDEVEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTAAGLRVSEPRRIVPMLPRQLHVEMEGKTLFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTHDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAARVDTFSVKDDWLLKTKAEPSLYVPSFRLLVWDQPADDYTKLRFMLTLVDGRSLVAEVPLY* |
| Ga0100232_100523 | Ga0100232_1005237 | F032313 | MYRFLILIFALTLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAIRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEEK* |
| Ga0100232_100523 | Ga0100232_1005238 | F032313 | MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGILGWVTTTEFTCRNGVILLHWQGIDVNHVDTVNYVYDEVDNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK* |
| Ga0100232_100525 | Ga0100232_10052512 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGE* |
| Ga0100232_100829 | Ga0100232_1008294 | F068942 | MIRKILSLPTLALCFTLGSAFFAGCNENYIEGFVTEVRWSNVKNPKYGEYINIRLKAEGETFTTVGNHSWISFSGSVSTLDTFTRHRFSEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFDFSIEVTPPAMYIFKVRQPALPAKAQ* |
| Ga0100232_101013 | Ga0100232_1010139 | F105378 | MKVSVYVDKLKKWVPISSDEILDRNKNLSDVKDKDAAITNLGLYDKFISKEALQSGFLPDVFTPENIQTDADHQFVSDSDKNNWNNKLNKPVEIQTNLEENQIGYDEVNEKFYIGLNNKNVLIGGASALDNIKIVNGFFSGNSQPTIIRNTKTREDGTLISPVFVDVQCVEYTGGNLGEVSVSYTSELINIYNTGSFTGAFQCMIVYPLGSVNR* |
| Ga0100232_101339 | Ga0100232_10133914 | F046433 | MIELPTSPDALSELSPMAPPKLLSQAQDASRNNTVTYVADDGYMGTMTSDPRFRKMRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRSELVFLGMSELHFAATALAKRLRHLLEVDNKPVYVDVGNSLSQCRVKKEMKSSQYILSLVLSKFLDDEFEEYEGRLKVYGGRGEIDKSSKILFIDDWIIGGDQVRERISVFGAYNNPGAHKVSVLVMAASSSYINNGIVADSLWGEVTYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSTFGCEVDDIAYRAIEGGILKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLEKG* |
| Ga0100232_102770 | Ga0100232_1027706 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDRE* |
| Ga0100232_102951 | Ga0100232_1029513 | F085820 | MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSEKRKK* |
| Ga0100232_103542 | Ga0100232_1035424 | F092230 | MRQAHKRMVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNTVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLIIILPPCMLFLLIV* |
| Ga0100232_104628 | Ga0100232_1046287 | F094007 | MLLIRTVMLCRWTDYLGEDFVPDANSYSDKDFTKRNRIIVEMCDLFGRIRRRAGFAECHRGRGDYDRARSIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA* |
| Ga0100232_105931 | Ga0100232_1059313 | F072446 | MSSKLHNLRSFTPRHIHYQNRSGLGMTTGPKHSTTNLTIPLRIYKLGDTLLKNIPPMKKLLFLLSSLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPERTTYRVPVIPRSVEDRTKKEYNDMELGKEAHLVFRATVHGDTINRHKKGLKALSLQLNRLTETSLGTSPVLCGVKSIEAVGIAENGNTYDLRAEMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLGSIREDELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVQQYYTPNGYEREATYFTTLWPLPDYKYNEREL* |
| Ga0100232_108878 | Ga0100232_1088782 | F095633 | MGNYENSTEVGRGEGLTEGELRTMGALAVEATEKLRKTTVRKETVLLGSVPFGSWDEFAKAVQEMAVHSYEPIPVEINTKRLIATAFLDDEGEMSVEENFVPEDVFIDLSRTRCDAEEDRNRKSYEFTCPALERYPDGELCPTRKAYVISAIDVNGSQEVDFNIIYGGLN* |
| Ga0100232_113108 | Ga0100232_1131084 | F097527 | MIYFKMEKIGNSTKTEKKKTRSENLVFITIPAAGVEPARPCGQ |
| Ga0100232_117876 | Ga0100232_1178762 | F073671 | MEKEHAEHELSELHEKERSLEKALEIVREKIRELVNYTDKNKVQK* |
| Ga0100232_118339 | Ga0100232_1183391 | F077405 | RPRPWQGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQKQKARLK* |
| Ga0100232_125867 | Ga0100232_1258672 | F095629 | MRELIICLCLLGCFSIANANNVEQSKEVKIVHNDDSIILHKKIYQLEKRIERLEELLKKEGK* |
| Ga0100232_125904 | Ga0100232_1259041 | F080166 | LNYYITFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSSRGEKVGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGSNRQNMLK* |
| Ga0100232_126212 | Ga0100232_1262123 | F089055 | MNSTPECVTKTPEIEAREKLAAIFSDAERCDNSKVNPELGKTAIDIENTSRMNSADDGAVYLCNQALGSYEKSLDHINNGPLETVQTIGISLQRLREYKIATRCLY* |
| Ga0100232_136054 | Ga0100232_1360542 | F103431 | MIDLDALIVGMLFFIQLFLQSIAWRVAIAHFLHAERGNAAAAAFDGAFGEDIADCHAEDDNDKDAESQKEGFHVCIPEG* |
| ⦗Top⦘ |