| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300008273 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053108 | Ga0114250 |
| Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit number 3 of subject 158883629 reassembly |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 126591385 |
| Sequencing Scaffolds | 19 |
| Novel Protein Genes | 20 |
| Associated Families | 20 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 5 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 3 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1 |
| All Organisms → Viruses → Predicted Viral | 2 |
| Not Available | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F033081 | Metagenome | 178 | Y |
| F043235 | Metagenome | 156 | N |
| F045567 | Metagenome | 152 | N |
| F046433 | Metagenome | 151 | N |
| F047508 | Metagenome | 149 | N |
| F053092 | Metagenome | 141 | N |
| F072446 | Metagenome | 121 | N |
| F080164 | Metagenome | 115 | N |
| F081455 | Metagenome | 114 | N |
| F089057 | Metagenome | 109 | N |
| F090517 | Metagenome | 108 | N |
| F092230 | Metagenome | 107 | N |
| F094006 | Metagenome | 106 | Y |
| F095631 | Metagenome | 105 | N |
| F095633 | Metagenome | 105 | N |
| F097527 | Metagenome | 104 | N |
| F099453 | Metagenome | 103 | N |
| F103433 | Metagenome | 101 | N |
| F103435 | Metagenome | 101 | N |
| F105379 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0114250_100275 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 31306 | Open in IMG/M |
| Ga0114250_100637 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 19218 | Open in IMG/M |
| Ga0114250_101076 | All Organisms → cellular organisms → Bacteria | 13385 | Open in IMG/M |
| Ga0114250_102116 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 8434 | Open in IMG/M |
| Ga0114250_102444 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 7631 | Open in IMG/M |
| Ga0114250_102678 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 7163 | Open in IMG/M |
| Ga0114250_103687 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 5728 | Open in IMG/M |
| Ga0114250_104213 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 5167 | Open in IMG/M |
| Ga0114250_104341 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5058 | Open in IMG/M |
| Ga0114250_104459 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 4973 | Open in IMG/M |
| Ga0114250_104634 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 4835 | Open in IMG/M |
| Ga0114250_106986 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 3466 | Open in IMG/M |
| Ga0114250_107503 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 3275 | Open in IMG/M |
| Ga0114250_108983 | All Organisms → Viruses → Predicted Viral | 2787 | Open in IMG/M |
| Ga0114250_108997 | All Organisms → Viruses → Predicted Viral | 2784 | Open in IMG/M |
| Ga0114250_113428 | Not Available | 1922 | Open in IMG/M |
| Ga0114250_117447 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1483 | Open in IMG/M |
| Ga0114250_124466 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1030 | Open in IMG/M |
| Ga0114250_129010 | Not Available | 850 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0114250_100275 | Ga0114250_1002754 | F072446 | MKKLLFLLSSLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPERTTYRVPVIPRSVEDRTKKEYNDMELGKEAHLVFRATVHGDTINRHKKGLKALSLQLNRLTETSLGTSPVLCGVKSIEAVGIAENGNTYDLRAEMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLGSIREDELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVQQYYTPNGYEREATYFTTLWPLPDYKYNEREL* |
| Ga0114250_100637 | Ga0114250_1006374 | F053092 | MQSDQGLILCLTHTLLVLGALILEPAEMEDPMDDHTVQLFGILVAKELGIATHRIKADKHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIAGSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK* |
| Ga0114250_101076 | Ga0114250_1010769 | F103433 | MKEWSKNKSGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVITGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVSVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSAIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE* |
| Ga0114250_101798 | Ga0114250_1017982 | F103435 | MKFVFTTEPIYQYYRAYLYPSDKDKLDKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDILNIRDDIDFSPQGAQLAYLQRSAYLKLEAKRSLEPLQIWYSFDMNTYTNALE* |
| Ga0114250_102116 | Ga0114250_1021166 | F047508 | LGMLRHRLVEGRIKYPYLRRIWEYLRHSFDTEDVGWVVKRSELCALMEHIYYLWGDTYALSKALCTVYEAVTNGIDLIEGLYEVLFFENVEDNLYAACVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQRGAATVEDQDFHKVLYYMVRCELILSSP* |
| Ga0114250_102444 | Ga0114250_1024446 | F043235 | MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIAPFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDTLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGETAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSAH* |
| Ga0114250_102678 | Ga0114250_1026781 | F090517 | SFCGLLLGLLFLASSCKKDTPRLRLSSVELRQTVWNGTLEYKSAKYGPYFVYLSFVSDSIVEVSTFATTDLTVAYDSKDLCSYTMDDRILTLRARDEPYLNVSIDRNSWYLIRKEPSLLVFQANAGNPAAEVTLTLRKKL* |
| Ga0114250_103687 | Ga0114250_1036874 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTISYMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY* |
| Ga0114250_104213 | Ga0114250_1042131 | F046433 | MIELSTSPDALSELSPVAPPKLLSQAQDASRGNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIEMIKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRRHLEVNNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASRDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGKNFDGLSRFRQLLEKE* |
| Ga0114250_104341 | Ga0114250_1043413 | F095631 | MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRIKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARKNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIDLVHKDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA* |
| Ga0114250_104459 | Ga0114250_1044592 | F033081 | MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPTRWATTTAWTVGVAGVFLMGNFTNYTPSQKFLHKIKATRYEVYNTILLLALLEEQAFRSGSEKWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL* |
| Ga0114250_104634 | Ga0114250_1046344 | F045567 | MRQRDGRDTLTEELKGDITPLLYRAEGEARRPWVRMVTEDVVHTSTHRVEDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVWATVNEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKETPSLSSERGCVVERVVR* |
| Ga0114250_106986 | Ga0114250_1069862 | F094006 | MAEELAKAPLTALSRARVAAMRGTLEPAGPTGALYEPAVCVEAHIGELQAGSIACRGLFAFGGRHLLAVAELGAEAPHAIDMYDIALCEPLSELFAEELEYAFNFGA* |
| Ga0114250_107503 | Ga0114250_1075034 | F095633 | MRNYENFTKVGRGEGLTEGELRTMGALAVEATEELKKTTIRKEAVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVEINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEHPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN* |
| Ga0114250_108983 | Ga0114250_1089833 | F099453 | MLRRKDMNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLTNLTNVIYTHKYPIAIAMAKLEYMLGRKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA* |
| Ga0114250_108997 | Ga0114250_1089973 | F089057 | IVLEANMINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSNDEYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVVSQIKHTASSKEGRALGIFMTICILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRTVLSRYLNKM* |
| Ga0114250_113428 | Ga0114250_1134282 | F080164 | MVGLAPCAAQQVTLRERALAFPLITEKAPSEIYEPYAWRLPVVPLSLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKVEKPLLYKGQAGQLVLCEYYESHRGDLFLDVANARPEIFGELCPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS* |
| Ga0114250_117447 | Ga0114250_1174471 | F097527 | MIYFKMEKIGNSTKTEKKKTRSENLVFITIPAAGGEPARPCGHWILSSITPLFK |
| Ga0114250_124466 | Ga0114250_1244661 | F092230 | MVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNTVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPS |
| Ga0114250_129010 | Ga0114250_1290101 | F081455 | VLFKVKEKHIMETMKRNLELEQILEEATNTLNRIGDFVICGKKPKTEEKDIPPAEEIGAETVDIIDSAEEAIQQPLQNKDASIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEYILGNYVDYDSFKKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECNPNKRLRYELIRHQGREKDLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK* |
| ⦗Top⦘ |