Basic Information | |
---|---|
IMG/M Taxon OID | 3300008138 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053176 | Ga0114843 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 764892411 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 131286902 |
Sequencing Scaffolds | 20 |
Novel Protein Genes | 24 |
Associated Families | 17 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 2 |
All Organisms → Viruses → Predicted Viral | 2 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → environmental samples → Prevotella sp. CAG:1124 | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1 |
Not Available | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F054109 | Metagenome | 140 | N |
F054110 | Metagenome | 140 | N |
F068942 | Metagenome | 124 | N |
F073671 | Metagenome | 120 | N |
F080164 | Metagenome | 115 | N |
F080166 | Metagenome | 115 | N |
F081455 | Metagenome | 114 | N |
F089057 | Metagenome | 109 | N |
F092229 | Metagenome | 107 | N |
F092232 | Metagenome | 107 | N |
F095629 | Metagenome | 105 | N |
F095631 | Metagenome | 105 | N |
F099452 | Metagenome | 103 | N |
F099453 | Metagenome | 103 | N |
F103432 | Metagenome | 101 | N |
F103436 | Metagenome | 101 | Y |
F105379 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0114843_100054 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 68283 | Open in IMG/M |
Ga0114843_100165 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 43765 | Open in IMG/M |
Ga0114843_100194 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 41191 | Open in IMG/M |
Ga0114843_100232 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 38042 | Open in IMG/M |
Ga0114843_100243 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 37171 | Open in IMG/M |
Ga0114843_100338 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 31172 | Open in IMG/M |
Ga0114843_100510 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 24923 | Open in IMG/M |
Ga0114843_100947 | All Organisms → cellular organisms → Bacteria | 17031 | Open in IMG/M |
Ga0114843_101914 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 10670 | Open in IMG/M |
Ga0114843_102626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes | 8331 | Open in IMG/M |
Ga0114843_106756 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 3650 | Open in IMG/M |
Ga0114843_106770 | All Organisms → Viruses → Predicted Viral | 3638 | Open in IMG/M |
Ga0114843_108552 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 2871 | Open in IMG/M |
Ga0114843_111003 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → environmental samples → Prevotella sp. CAG:1124 | 2206 | Open in IMG/M |
Ga0114843_111493 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 2107 | Open in IMG/M |
Ga0114843_112512 | Not Available | 1922 | Open in IMG/M |
Ga0114843_114930 | All Organisms → Viruses → Predicted Viral | 1584 | Open in IMG/M |
Ga0114843_117241 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1350 | Open in IMG/M |
Ga0114843_119276 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1198 | Open in IMG/M |
Ga0114843_129028 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 781 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0114843_100054 | Ga0114843_10005410 | F080166 | VNILANFENYTKVVEQIFELNYQLTLKMEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGIIIDWDNYDDLCNIIDEAIDICDPNNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGSMARYMAITPLLGNNRQNMMR* |
Ga0114843_100054 | Ga0114843_10005440 | F095631 | MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELVLNTISDTFRFDRDIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGVSNFCNYTYIDSSTIRVMFDNTSYLPTANTEVTVNLYTSQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNVIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYLSDRDTYFGDISIMQNIQSDIGLVHKDDPHDPEKITGVDIKVLAVFYTDEKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDQIFTANFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIVDRVPVISYDYVNTEERIQDFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA* |
Ga0114843_100054 | Ga0114843_1000547 | F099453 | MLRRKDMNRFDIIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVELHGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLANVAYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRIYINSANLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA* |
Ga0114843_100165 | Ga0114843_10016562 | F073671 | MEKQQAEHELAELHEKERSLEKALELVREKIRELINYTDKNKEQK* |
Ga0114843_100194 | Ga0114843_10019446 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDTRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKPKQLYKKESEYSSTKQHEVLLFLVKTYKGGD* |
Ga0114843_100232 | Ga0114843_10023230 | F092232 | MNTQAKFIADYNDKNRHKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDAHLLGRTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV* |
Ga0114843_100243 | Ga0114843_10024317 | F054110 | VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD* |
Ga0114843_100338 | Ga0114843_10033811 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQVDDNIIARTDEFHNTDDYNELMAGSASTGAYRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFLPY* |
Ga0114843_100510 | Ga0114843_1005103 | F099452 | MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSVNIFNLYKDIINLDDVSLLTDLRKTEWYKDWFTNDKRNSDLIDLSRFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFDTLREDEDIELFKLAAENILINHGFFNNTDYNLYEVPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIIKDRICGSVYFTIFDSLNEDTRTRAR* |
Ga0114843_100947 | Ga0114843_1009473 | F054109 | MAGRPKSKKGVKVHTAFKIYPDDKARVQVMADKLELSLSAYINKAVLEKVERDEKSED* |
Ga0114843_101914 | Ga0114843_10191414 | F092229 | MNKEYRFKHIPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFNSTIEDLDLTTILQKCTTRPYIAILNNIYFRYFNSKLIEDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLTDPQVNAVKSAEFTYDLLMAARSENFNPEMVRDIFLKYGLKTNSSRNLYNRMDNNLSLFYYLEDYLEEYVNTGKFTYGSQEYHTIKEFKYLPLMNVLTQLTSSNPSGYILNHKLELVKENK* |
Ga0114843_102626 | Ga0114843_1026261 | F089057 | MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIISSNDKYSEVLSNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILGLFMTICIINKDIDKLASLCTGYLAITKDEVLVKKLMNESATMAFQYM |
Ga0114843_106756 | Ga0114843_1067562 | F103436 | MITTSKDGWCDMSDAEILNSLRDWVLKCDLKYSKREALKKIDSAFALWGGRQYVAAVDLLDENEVYFSKEDWPYYALGIEILKARKYTYFY* |
Ga0114843_106770 | Ga0114843_1067703 | F092229 | MNKEYRFNHIPEVVLRNIRFIRENNIDIGTGDDVLECMMDSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSPKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLEEYCEEGKFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG* |
Ga0114843_108552 | Ga0114843_1085523 | F103436 | MMATSKYGWGNKSDXXXXSKYSWCNKSDAEILENLKDWVSKCDVSSKREALKKIDSAFALWGARQYVAAVHLLDENEVFLEKSDWPYYALGIEILKARKHEFFNE* |
Ga0114843_111003 | Ga0114843_1110032 | F080164 | MRPTSFVLSLLLGVIGLAPCAARQVTLRERALAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGILTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYESHRGDLFLDVANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS* |
Ga0114843_111003 | Ga0114843_1110033 | F103432 | MKLIHSLFSLPLLFVLGGLFCTTACQDDVEPTQRTGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDGKTIFRRHTLPSVSAYSLQVVAVGDTIYRQKESDAQFNADLDALFRQSIGMAPRLFGVKELSVGGIDRKGKPRDLGNYSCPLLQGKRRNVNFRTKEGVFHEYFEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLR* |
Ga0114843_111493 | Ga0114843_1114932 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENNLGFVTEVRWSNVKNPKYGDDINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFPEMDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFKFSVSVTPPAMYMFKVRQPALPAKAQ* |
Ga0114843_112512 | Ga0114843_1125122 | F099452 | MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSVNIFNLYKDVINLDDVSLLTDLRKTPWYKDWFIDDKRNSDLIDLSRFNFRSLERFEKAEYLRDAEHYDFEGVIEVDSYSLYDILAEDNGISLFGLAAMNILLNHGFFNNTDYQLYDIPDAYINDQEVCLYMCLLNKDNLDFMDKKTFDDTLLYDIVKDRICGAIYFSIYDSLNEDTRTRAM* |
Ga0114843_114930 | Ga0114843_1149301 | F081455 | METTNIQEINMKAAEKLGELFDYVFCGKKPNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDTSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFKKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK* |
Ga0114843_114930 | Ga0114843_1149302 | F080166 | VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLYTVIEEAINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGTNRQNMLK* |
Ga0114843_117241 | Ga0114843_1172411 | F099453 | QQTLTFVYNTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNMQYIRSLGLVVIPEVYQARLTNLTDIIYTPKYPIAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA* |
Ga0114843_119276 | Ga0114843_1192762 | F095629 | MIFKERMMRELIICVCLLGCFSIVNANNVEQPKDVKIVHNDDSVVLHKKIYQLEKRIERLELLLQKEGK* |
Ga0114843_129028 | Ga0114843_1290282 | F054110 | VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQ |
⦗Top⦘ |