NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300006477

3300006477: Human tongue dorsum microbial communities from NIH, USA - visit 2 of subject 158883629



Overview

Basic Information
IMG/M Taxon OID3300006477 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052507 | Ga0100232
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2 of subject 158883629
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size127012939
Sequencing Scaffolds21
Novel Protein Genes23
Associated Families22

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4733
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
Not Available2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00401
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ81
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4161

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F046433Metagenome151N
F054110Metagenome140N
F066860Metagenome126N
F068942Metagenome124N
F072446Metagenome121N
F073671Metagenome120N
F077405Metagenome117N
F080164Metagenome115N
F080166Metagenome115N
F081510Metagenome114N
F085820Metagenome111N
F089055Metagenome109Y
F092230Metagenome107N
F094007Metagenome106N
F095629Metagenome105N
F095633Metagenome105N
F097527Metagenome104N
F103431Metagenome101N
F103432Metagenome101N
F105376Metagenome100N
F105378Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0100232_100171All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes54737Open in IMG/M
Ga0100232_100249All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria42902Open in IMG/M
Ga0100232_100352All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47333072Open in IMG/M
Ga0100232_100523All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47325556Open in IMG/M
Ga0100232_100525All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae25436Open in IMG/M
Ga0100232_100829All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae18593Open in IMG/M
Ga0100232_101013Not Available15800Open in IMG/M
Ga0100232_101339All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus12533Open in IMG/M
Ga0100232_102770All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus7058Open in IMG/M
Ga0100232_102951All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4736701Open in IMG/M
Ga0100232_103542All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae5712Open in IMG/M
Ga0100232_104628All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus4538Open in IMG/M
Ga0100232_105931All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00403644Open in IMG/M
Ga0100232_108878All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus2490Open in IMG/M
Ga0100232_113108All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1701Open in IMG/M
Ga0100232_117876All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1242Open in IMG/M
Ga0100232_118339All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1211Open in IMG/M
Ga0100232_125867All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8852Open in IMG/M
Ga0100232_125904All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416851Open in IMG/M
Ga0100232_126212All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria843Open in IMG/M
Ga0100232_136054Not Available605Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0100232_100171Ga0100232_10017154F081510MKFDLNTIKASTKSALVTTKILGKKYAPYVLLGAGLIGYGYSVYEGIKSGKKLEATKAKYEQMDAVGEEYTRMDVVKDITKDVAIPVAVATASTASIILGFAIQTNRLKAVSSALAIVTEEHARYRLRAKTVLDEETFKKVDAPLETKTVNVDGEDIEVESIIPNEGDLYGMWFKHSHKYASDSPEYNEGVIKEADKVLTEKMMRSGMLTFAEVLDILGFEVPRAALPFGWTDTDGFYIEWDAHEVWNDDKQETEIQFYVRWKTPRNLYATTSFKDFIPKKTRKELN*
Ga0100232_100249Ga0100232_10024938F105376MKMNKNKKGGDMEPDVSAKEFGALQAKVEYIKDGVDKHTATLERIENIARANVAQAQLKTYITEHEQESEKKYVKRSEIEGVMNFWSLVTSNLAKLFAVALVGLAIYATNNLIQQNKAITELQEEVQTQVRRK*
Ga0100232_100352Ga0100232_10035231F080164MRPTSFVLSLLLGMVGLAPCAAQQVTLRERALAFPLITEKAPSEIYEPYAWRLPVVPLSLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKVEKPLLYKGQAGQLVLCEYYESHRGDLFLDVANARPEIFGELCPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0100232_100352Ga0100232_10035232F103432MKLIHSLFSLSLLLALSGLFCTTACQDEVEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTAAGLRVSEPRRIVPMLPRQLHVEMEGKTLFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTHDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAARVDTFSVKDDWLLKTKAEPSLYVPSFRLLVWDQPADDYTKLRFMLTLVDGRSLVAEVPLY*
Ga0100232_100523Ga0100232_1005237F032313MYRFLILIFALTLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAIRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEEK*
Ga0100232_100523Ga0100232_1005238F032313MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGILGWVTTTEFTCRNGVILLHWQGIDVNHVDTVNYVYDEVDNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0100232_100525Ga0100232_10052512F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGE*
Ga0100232_100829Ga0100232_1008294F068942MIRKILSLPTLALCFTLGSAFFAGCNENYIEGFVTEVRWSNVKNPKYGEYINIRLKAEGETFTTVGNHSWISFSGSVSTLDTFTRHRFSEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFDFSIEVTPPAMYIFKVRQPALPAKAQ*
Ga0100232_101013Ga0100232_1010139F105378MKVSVYVDKLKKWVPISSDEILDRNKNLSDVKDKDAAITNLGLYDKFISKEALQSGFLPDVFTPENIQTDADHQFVSDSDKNNWNNKLNKPVEIQTNLEENQIGYDEVNEKFYIGLNNKNVLIGGASALDNIKIVNGFFSGNSQPTIIRNTKTREDGTLISPVFVDVQCVEYTGGNLGEVSVSYTSELINIYNTGSFTGAFQCMIVYPLGSVNR*
Ga0100232_101339Ga0100232_10133914F046433MIELPTSPDALSELSPMAPPKLLSQAQDASRNNTVTYVADDGYMGTMTSDPRFRKMRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRSELVFLGMSELHFAATALAKRLRHLLEVDNKPVYVDVGNSLSQCRVKKEMKSSQYILSLVLSKFLDDEFEEYEGRLKVYGGRGEIDKSSKILFIDDWIIGGDQVRERISVFGAYNNPGAHKVSVLVMAASSSYINNGIVADSLWGEVTYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSTFGCEVDDIAYRAIEGGILKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLEKG*
Ga0100232_102770Ga0100232_1027706F066860MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDRE*
Ga0100232_102951Ga0100232_1029513F085820MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSEKRKK*
Ga0100232_103542Ga0100232_1035424F092230MRQAHKRMVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNTVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLIIILPPCMLFLLIV*
Ga0100232_104628Ga0100232_1046287F094007MLLIRTVMLCRWTDYLGEDFVPDANSYSDKDFTKRNRIIVEMCDLFGRIRRRAGFAECHRGRGDYDRARSIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0100232_105931Ga0100232_1059313F072446MSSKLHNLRSFTPRHIHYQNRSGLGMTTGPKHSTTNLTIPLRIYKLGDTLLKNIPPMKKLLFLLSSLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPERTTYRVPVIPRSVEDRTKKEYNDMELGKEAHLVFRATVHGDTINRHKKGLKALSLQLNRLTETSLGTSPVLCGVKSIEAVGIAENGNTYDLRAEMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLGSIREDELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPSKSVQQYYTPNGYEREATYFTTLWPLPDYKYNEREL*
Ga0100232_108878Ga0100232_1088782F095633MGNYENSTEVGRGEGLTEGELRTMGALAVEATEKLRKTTVRKETVLLGSVPFGSWDEFAKAVQEMAVHSYEPIPVEINTKRLIATAFLDDEGEMSVEENFVPEDVFIDLSRTRCDAEEDRNRKSYEFTCPALERYPDGELCPTRKAYVISAIDVNGSQEVDFNIIYGGLN*
Ga0100232_113108Ga0100232_1131084F097527MIYFKMEKIGNSTKTEKKKTRSENLVFITIPAAGVEPARPCGQ
Ga0100232_117876Ga0100232_1178762F073671MEKEHAEHELSELHEKERSLEKALEIVREKIRELVNYTDKNKVQK*
Ga0100232_118339Ga0100232_1183391F077405RPRPWQGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQKQKARLK*
Ga0100232_125867Ga0100232_1258672F095629MRELIICLCLLGCFSIANANNVEQSKEVKIVHNDDSIILHKKIYQLEKRIERLEELLKKEGK*
Ga0100232_125904Ga0100232_1259041F080166LNYYITFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSSRGEKVGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGSNRQNMLK*
Ga0100232_126212Ga0100232_1262123F089055MNSTPECVTKTPEIEAREKLAAIFSDAERCDNSKVNPELGKTAIDIENTSRMNSADDGAVYLCNQALGSYEKSLDHINNGPLETVQTIGISLQRLREYKIATRCLY*
Ga0100232_136054Ga0100232_1360542F103431MIDLDALIVGMLFFIQLFLQSIAWRVAIAHFLHAERGNAAAAAFDGAFGEDIADCHAEDDNDKDAESQKEGFHVCIPEG*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.