| Basic Information | |
|---|---|
| IMG/M Taxon OID | 7000000163 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0063646 / Gp0052664 / Ga0031254 |
| Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764305738 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 98778510 |
| Sequencing Scaffolds | 21 |
| Novel Protein Genes | 24 |
| Associated Families | 22 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 3 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 2 |
| All Organisms → Viruses → Predicted Viral | 4 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |

| Location Information | |
|---|---|
| Location | USA: Maryland: National Institute of Health |
| Coordinates | Lat. (°): 39.0042816; Long. (°): -77.1012173; Alt. (m): N/A; Depth (m): N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F032313 | Metagenome | 180 | N |
| F041827 | Metagenome | 159 | Y |
| F042387 | Metagenome | 158 | N |
| F054109 | Metagenome | 140 | N |
| F054110 | Metagenome | 140 | N |
| F071327 | Metagenome | 122 | N |
| F072446 | Metagenome | 121 | N |
| F080164 | Metagenome | 115 | N |
| F081455 | Metagenome | 114 | N |
| F085820 | Metagenome | 111 | N |
| F092229 | Metagenome | 107 | N |
| F092230 | Metagenome | 107 | N |
| F095629 | Metagenome | 105 | N |
| F095633 | Metagenome | 105 | N |
| F099452 | Metagenome | 103 | N |
| F103432 | Metagenome | 101 | N |
| F103433 | Metagenome | 101 | N |
| F103435 | Metagenome | 101 | N |
| F103436 | Metagenome | 101 | Y |
| F105378 | Metagenome | 100 | N |
| F105379 | Metagenome | 100 | N |
| F105380 | Metagenome | 100 | N |

| Scaffold | Taxonomy | Length |
|---|---|---|
| C2315100 | Not Available | 600 |
| C2347415 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 842 |
| C2361587 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1051 |
| C2365675 | All Organisms → Viruses → Predicted Viral | 1137 |
| C2369149 | Not Available | 1225 |
| C2380545 | All Organisms → Viruses → Predicted Viral | 1700 |
| C2382793 | All Organisms → Viruses → Predicted Viral | 1869 |
| C2383865 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1952 |
| C2383969 | All Organisms → Viruses → Predicted Viral | 1962 |
| C2386437 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2210 |
| C2389485 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 2687 |
| C2390833 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae | 3018 |
| C2394253 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 4698 |
| C2396639 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella | 10632 |
| SRS015893_WUGC_scaffold_14335 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13629 |
| SRS015893_WUGC_scaffold_2564 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2513 |
| SRS015893_WUGC_scaffold_29254 | Not Available | 24148 |
| SRS015893_WUGC_scaffold_40214 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 19527 |
| SRS015893_WUGC_scaffold_41295 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 3611 |
| SRS015893_WUGC_scaffold_41707 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 7763 |
| SRS015893_WUGC_scaffold_6970 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 859 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| C2315100 | C2315100__gene_118622 | F054109 | MTGRPKSKKGVKVHTAFKIYPADKARAQAMADKLDLTLSAYVNKAVLEKVARDEKSED |
| C2347415 | C2347415__gene_134044 | F042387 | MVGLFLVSCSRENDKMTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKTNYEYSNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPGGHFYLNSNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDENENTIADYLNFRWVSHNPTLIQKSTNLYGSGADSSEYKFQYEYKNNFPIKTKLNINN |
| C2361587 | C2361587__gene_141565 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILACAGMITYVYMFLKNNQILTLKVLIIALAAALLFEYAYPWRIIFG |
| C2365675 | C2365675__gene_143942 | F054110 | LLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPEKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKAYKGGD |
| C2365831 | C2365831__gene_144046 | F103435 | LPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDILSIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE |
| C2369149 | C2369149__gene_146032 | F085820 | MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVANECHFHRTWLTQGVKAIRVVRTHADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLANGTEIEKELSKKRTK |
| C2380545 | C2380545__gene_153403 | F092229 | DVLECMMDINPIVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLVASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGRFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG |
| C2382793 | C2382793__gene_154996 | F054110 | VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPYKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD |
| C2383865 | C2383865__gene_155844 | F095629 | MTFKERMMRELIICVCLLGCFSIANANNIEQPKEVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEGK |
| C2383969 | C2383969__gene_155924 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLDKITQLQPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAMPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY |
| C2386437 | C2386437__gene_157977 | F081455 | MENFLTKEISALISKHFEFTNNSDLIKDPIISDDNIIDIPPAEEIGAEKVNMSDIIDCVQEALQQPLENKDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPECNPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK |
| C2389485 | C2389485__gene_160487 | F041827 | MKHFLSALALGGLLLSCNRGLENNENNETPAPPKEERLVLASLYEFGSNVRFQYKNENEINRMTIDGPHREASMDFEYDTYGRIVKERRFDHKSDYGETNITYQYDNQSRLTSSHAISTQYYPDTGYTPRCSVEKKHTYTYQGNKVTVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIEQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNLGIQRIEDLRYIDNIKTRDFHNGSYWEYRY |
| C2390833 | C2390833__gene_161784 | F072446 | MRKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDKTEKEYNDMDLGKEAHLVFRATVHGDTINRRRKELDAIALQLGRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLSWEIKLRIRDYFGRVKHRSSGIITIDCEDTRSKTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVLENYQQYGYEREGTYFTTLWPVPDYKYNEREW |
| C2394253 | C2394253__gene_165519 | F032313 | MACDNNTPQEKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMLTSKISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEEK |
| C2394253 | C2394253__gene_165520 | F032313 | MACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAIRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK |
| C2396639 | C2396639__gene_169383 | F103432 | MKLIHSLFSLPLLLVLGGFLCLTACQDDAEPTQRTGLISTDSLFHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVEMEGKTIFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAASVDTFSVKSNWLFKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY |
| C2396639 | C2396639__gene_169384 | F080164 | MVGLTLCAAPQVTLRERANAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLVVGDTVAVHRDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDKSRQAVDEQVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYESHRGDLLLNAANARPEIFGELCPVVDFHFPVELRRAYAWLLLEMELEDGTKLSTSLQHYDEQTSILDHPDRS |
| SRS015893_WUGC_scaffold_14335 | SRS015893_WUGC_scaffold_14335__gene_17708 | F103433 | MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAVDSKEMMSQDCNPSQVEPYVSQYGLQARVIAGLSPNDTSKIPAYIKRVSIFLALLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLAPIIFFELIHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINAKASYRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGYALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE |
| SRS015893_WUGC_scaffold_2564 | SRS015893_WUGC_scaffold_2564__gene_3088 | F095633 | MGNYEKSTEAWRREGLTEGELRTMGALAVEATEELKKTTIRKEVVLLGGVPFNSWDEFAKAVQEMAAHSYEPISVKINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEHPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN |
| SRS015893_WUGC_scaffold_29254 | SRS015893_WUGC_scaffold_29254__gene_37114 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNVPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVINGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTTDAISIYNTGSFTGSFQCLIVYPLGSVNE |
| SRS015893_WUGC_scaffold_40214 | SRS015893_WUGC_scaffold_40214__gene_54903 | F105380 | MSILQLDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLNKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSYAEDYFPNGDRLTLKRLCKGSMMSDVSSSDYDEWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPRHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK |
| SRS015893_WUGC_scaffold_41295 | SRS015893_WUGC_scaffold_41295__gene_57900 | F103436 | CSKSDAEILESLKDWVSKCDVSSKREALKKIDSAFALWGGAEYEAAVHLLDENEVFLKKSDWPYYALGIEILKARKHEFFNE |
| SRS015893_WUGC_scaffold_41707 | SRS015893_WUGC_scaffold_41707__gene_59602 | F099452 | MDKTYAALLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSMNIFNLYKEIVDLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSRFNFNSLARFEKEEYLRNVERYDFEAVNPVDGYGLFDTLTKDNDVELFKLAAENILINHGFFNKTDYNFCDVPNEYMGDKEVSVYMCLLNIENMIFVDKKTLDTTILYNIVKDHICGFIYFTLFDRLNKDTRTRAM |
| SRS015893_WUGC_scaffold_6970 | SRS015893_WUGC_scaffold_6970__gene_8617 | F092230 | MRQAHKRTVDKLKAYLLKVFFPLFIVCIIFVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMFLIVIAMTIGYYSYLKPETVYGFYCKLRRKEKYPSDVNLVKRIGFLIIILP |