Basic Information | |
---|---|
IMG/M Taxon OID | 3300008621 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053325 | Ga0115613 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 763536994 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 172906680 |
Sequencing Scaffolds | 18 |
Novel Protein Genes | 21 |
Associated Families | 20 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 3 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 6 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → TM7 phylum sp. oral taxon 350 | 1 |
Not Available | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | National Institutes of Health, USA | |||||||
Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F032313 | Metagenome | 180 | N |
F033081 | Metagenome | 178 | Y |
F046432 | Metagenome | 151 | Y |
F068942 | Metagenome | 124 | N |
F077404 | Metagenome | 117 | N |
F080166 | Metagenome | 115 | N |
F081455 | Metagenome | 114 | N |
F085820 | Metagenome | 111 | N |
F089057 | Metagenome | 109 | N |
F092229 | Metagenome | 107 | N |
F092232 | Metagenome | 107 | N |
F095629 | Metagenome | 105 | N |
F097525 | Metagenome | 104 | N |
F097527 | Metagenome | 104 | N |
F099452 | Metagenome | 103 | N |
F099453 | Metagenome | 103 | N |
F103432 | Metagenome | 101 | N |
F103433 | Metagenome | 101 | N |
F103435 | Metagenome | 101 | N |
F105379 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0115613_1000088 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 77637 | Open in IMG/M |
Ga0115613_1000265 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 44014 | Open in IMG/M |
Ga0115613_1000459 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 32943 | Open in IMG/M |
Ga0115613_1001120 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 19103 | Open in IMG/M |
Ga0115613_1001651 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 14243 | Open in IMG/M |
Ga0115613_1001968 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 12507 | Open in IMG/M |
Ga0115613_1001991 | All Organisms → cellular organisms → Bacteria | 12394 | Open in IMG/M |
Ga0115613_1003176 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 8581 | Open in IMG/M |
Ga0115613_1003306 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 8341 | Open in IMG/M |
Ga0115613_1004137 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 6915 | Open in IMG/M |
Ga0115613_1005535 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5390 | Open in IMG/M |
Ga0115613_1009981 | All Organisms → Viruses → Predicted Viral | 3151 | Open in IMG/M |
Ga0115613_1020297 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1565 | Open in IMG/M |
Ga0115613_1023790 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1316 | Open in IMG/M |
Ga0115613_1024277 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1285 | Open in IMG/M |
Ga0115613_1032102 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → TM7 phylum sp. oral taxon 350 | 925 | Open in IMG/M |
Ga0115613_1039101 | Not Available | 741 | Open in IMG/M |
Ga0115613_1042363 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 677 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0115613_1000088 | Ga0115613_100008813 | F085820 | MYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPPNPDHLKQRIVLRLADGTEIEKELSEKGTK* |
Ga0115613_1000088 | Ga0115613_100008871 | F068942 | MIRKILSLPTLALCFTLGSAFFAGCNEDYIEDTETKVRWSNVKPPQYGDPINITLKAEGETFTTVGDYPWISFRSYASTLDTFTRHSFSEADKDTAYYKDIVIYLTRNKREQTATLKLVAPPNRTQQPKQFDFSIGVTPLGTYIFKVRQPALPAKAQ* |
Ga0115613_1000265 | Ga0115613_100026518 | F077404 | MTCSRSRNPPNTDGSADHYAPQAGPALSRKKPTQIGNTKNPFAMNTLKTIFTFFFTLCFMMVANGYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTTLLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR* |
Ga0115613_1000459 | Ga0115613_10004592 | F032313 | MACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILLHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK* |
Ga0115613_1000459 | Ga0115613_10004593 | F032313 | MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKVTTTEFTCRNGVILLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWYVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK* |
Ga0115613_1001120 | Ga0115613_100112012 | F092229 | MNDYRFDHIPEVVLRNIRFIRENGIDIGTGDDVLECMMDINPIIRTKIYDDYEFAKDVAECRFGSTIEGLDMVTLLQKCNTRPYNSILNNIYFRYFNSKLIDDLFELAQSPKILDLAIEYECEYYAINTAKTSIRRYNSDAYYNKFAADSNIVSSTRVLNNPQVNAVKSAEFTHELLMASRAEKFSPENVREIFIKYGLKPNHSRNLYNRINDNLNLFYYIEDYLDEYREEGKFIYGGKEYKGFKDIRSLPLMVVLIQLTRENASGYILNSKLELVKG* |
Ga0115613_1001651 | Ga0115613_100165112 | F095629 | MRELIICACLFGCFGVANAAAPVDQPKEVKVVHNDDNVALHKKIYKLEQRIERLEKLLAEKEGK* |
Ga0115613_1001968 | Ga0115613_100196811 | F033081 | MHTDITVVYRPKKGVIAWLFRRAMPQDTRPTFVWSRLVTAIENAGYFSRRKFSILAVGLIVMTIAMIKMLLFVPGLNQSVVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGNFTNYTPSQKLLHKIKATRCEVYNTILLLALLEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSATGFGFLLVYLWYYRKYRNQIIATAAAATVHALYNAIALSLITVVVAVYLAIDIAKLL* |
Ga0115613_1001991 | Ga0115613_10019919 | F103433 | MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTMKYE* |
Ga0115613_1002503 | Ga0115613_10025038 | F103435 | MKFVFCTEPIYQYYRSYLYTDDKDKLDKQLMIEYGDYKDIWDLKQQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYQYLIDSPDFETIFSEILFNQSEVEFYEFYKAIFRFYNGSEVFIIVSNDEYSDMVTQMMCNVIRRTYGIHPQIIYDMDDVYSIRDDIDFSPQGAQLAYLQRAAYYKLEAKKNFEPLQIWYPFDMNTYTNALE* |
Ga0115613_1003176 | Ga0115613_10031763 | F089057 | MVHPIKFGCTIYIVLEANMTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIITSNDKYSEVLSNVLGSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILSLFMTICIVNKDIDKLASLCTGYLAISKDEVLVKKLMNESATTAFQYMSEEDIHNVVDDINSRSVLSRYLSRM* |
Ga0115613_1003306 | Ga0115613_10033067 | F099452 | MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNIFNLYKDIINLDDISLLAELRHTEWYKDWFTSDKRNSDLIDLSKFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFNTLREDEDIELFKLAAENILINHGFFNNTDYNLYEIPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIVKDRICGSVYFTIFDSLNEDTRTRAR* |
Ga0115613_1004137 | Ga0115613_10041379 | F092232 | MNTQAKFIADYNDKNRPKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACAKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANSHMKSPFYISAVKTFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDAQLLGQAIAKVARTRAFEKQLRPALINMEDSCSIYFEEV* |
Ga0115613_1005535 | Ga0115613_10055356 | F081455 | MEITNTENKNLFQQLSSLGFDVNESLLELEKDYSPVEEIGMQNVEIIDSAEEAIQQPLANTDSSIAVNFSQMINKPEVEEVKTEVASVPDNGETKVNVFFPKNEHILSNYVDYDSFNKIKESNTETIVRAVRLLNYKMSDQNAAMKFGQFVSEFNSECDPNKRLRYELIRHQGREKDLVVRLSTVINGTTKYYADIYPDLNKIDVDHHLISSARK* |
Ga0115613_1009981 | Ga0115613_10099812 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKDNPEIAIHRPVVSVFNWDAEYVKACMNSLREYQVDDNIIARTDEFHNTEYYNELMAGSASTGACRISVGGYMIDIPKSAMPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFLPY* |
Ga0115613_1020297 | Ga0115613_10202971 | F099453 | MNRFDIIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLTNEVYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRVYINSANLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA* |
Ga0115613_1023790 | Ga0115613_10237901 | F097527 | MIYFKMEKIGNSTYNKEKKTRSENLVFNTIPAAGVEPARPCGHWILSPARL |
Ga0115613_1024277 | Ga0115613_10242772 | F080166 | YVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTIIDEAIDICDPGNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGDRLDLIPFVLIDDHSGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMK* |
Ga0115613_1032102 | Ga0115613_10321022 | F046432 | MQMTENELSEVISKFQMPEGRYSIEQEGSFGRGEFFWIIKNQSTNQKYLLMNTYSHHGVESELECYREEGFDNLEAIPRKIETLEIPSDAEDEISKYLFGFYSPCKLIFLFRFFIFYTSIYQIQ |
Ga0115613_1039101 | Ga0115613_10391011 | F103432 | NPTPRAGLISTDSLIHAAKVYDGKAFEHVVSTTTAGLRVSEPRRVVPMLPRQLHVEMEGKTLFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSCPLLLGKIQNVNYRTREGVFHEHYEAASVDTFSVKDDWLLKTKAEPSLYVPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY* |
Ga0115613_1042363 | Ga0115613_10423632 | F097525 | MQQKKIMFKIQNAYQKIIFSIHGHRERKDNFEDWLKVEVKVKDDLEGKYYTRVSECMLFSEVLGLLEWFEQISADKEKSTEIDFIEPELAFEYQNKKLTVLLCYDIAPVSYGEEPYQLTFSLDDKTLAMIIKEL |
⦗Top⦘ |