NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008133

3300008133: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763840445 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008133 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052982 | Ga0111365
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 763840445 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size154875861
Sequencing Scaffolds21
Novel Protein Genes23
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4731
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis1
Not Available2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium3
All Organisms → Viruses → Predicted Viral1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F033081Metagenome178Y
F040685Metagenome161N
F041827Metagenome159Y
F042387Metagenome158N
F043990Metagenome155N
F051212Metagenome144N
F064818Metagenome128N
F071327Metagenome122N
F072446Metagenome121N
F073671Metagenome120N
F080164Metagenome115N
F081454Metagenome114N
F081510Metagenome114N
F084342Metagenome112N
F084362Metagenome112N
F094006Metagenome106Y
F094007Metagenome106N
F097527Metagenome104N
F098763Metagenome103N
F103431Metagenome101N
F103432Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0111365_100238All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae48058Open in IMG/M
Ga0111365_100621All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes26836Open in IMG/M
Ga0111365_100726All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47324393Open in IMG/M
Ga0111365_100837All Organisms → cellular organisms → Bacteria22278Open in IMG/M
Ga0111365_101014All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales19311Open in IMG/M
Ga0111365_102131All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae11546Open in IMG/M
Ga0111365_102989All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis8747Open in IMG/M
Ga0111365_106815All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae4189Open in IMG/M
Ga0111365_107447All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae3845Open in IMG/M
Ga0111365_108319Not Available3433Open in IMG/M
Ga0111365_111917All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales2340Open in IMG/M
Ga0111365_114881All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae1852Open in IMG/M
Ga0111365_114902All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1849Open in IMG/M
Ga0111365_117151All Organisms → Viruses → Predicted Viral1593Open in IMG/M
Ga0111365_118232All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria1493Open in IMG/M
Ga0111365_118675All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1456Open in IMG/M
Ga0111365_126936All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae974Open in IMG/M
Ga0111365_129196All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium892Open in IMG/M
Ga0111365_133701All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium760Open in IMG/M
Ga0111365_145708Not Available516Open in IMG/M
Ga0111365_146510All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas502Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0111365_100238Ga0111365_10023855F073671MNKEQAEHELAELHAQERSLEKALELVREKIRELVNYTDKNKEQK*
Ga0111365_100621Ga0111365_10062129F081510MKFNVNAIKSTAKTTWVTTKILGKKYAPYILLGAGLVGYGYSVYEGIKSGKKLEATKAKYEQMDAQGDQYSRMEVVADIAKDVAVPVAVATASTAAIVLGFAIQTNRLKAVSAALAMVTEEHARYRLRAKTVLDEETFKKIDTPLETKTVEVDGEEIEVESIVPNEGDFYGMWFKKSHKYASDSPEYNEGVIKEADNVLTEKMMRKGMLTFAEVLDILGFEVPKAALPFGWTDTDGFYLEWDAHEVWNDDKQETEIQFYVRWKLPRNLYATTNFHDFIPKKTRKELN*
Ga0111365_100726Ga0111365_1007267F103432MKLIHSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLQVEMVGKTLFRRHPLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTHDLGNYSYPLLQGKIKNVNYRTREGVFHEHYEAASVDTFSVKDDWLLKTKAEPSLYVPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY*
Ga0111365_100726Ga0111365_1007268F080164MVGLTLCAAPQVTLRERASAFPLITEKDESEIIAPYAWRLPVVPLSLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRITLGFGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRGDLFLDVTNAHPEIFGELRPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0111365_100837Ga0111365_10083711F094007MKSKTVEVLELARPNRAGIIDVVDSDGNVVPLDYSGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAEYHRGRGNYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKTGKKRSGWNKGCIDNNA*
Ga0111365_101014Ga0111365_10101420F071327MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILTCAGMITYVYMFLKNNQILTLKMLIIALVAALLFEYAYPWRIIFG*
Ga0111365_102131Ga0111365_1021313F043990MIKKLGIIFTFGVIILGIVVYVNHKIERSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDKLDSSFTKLNKLPIKEDLPPNGIPKQFLNIANGYYKYVVDENNDRNFDILIVDTTRKEICIYYQIL*
Ga0111365_102989Ga0111365_10298910F033081MHTDITVVYRPKKGVMAWLFRRALPQDTRPTFVWSRIVTEIENAGYFSRLKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPAGWAKVTAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDTLLLLALIEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIAISLIAVVLAIDIAKLL*
Ga0111365_106815Ga0111365_1068153F081454MKTFKLVLLLFITSASLVFGQEKRYFFKHEFQPNSKYLIKYKTDMDGGYKFVGSKEVIDKIGMDGVKMTINSDIESAISTQKKQGNNVPFILEYTKYFYKAEINGETVNRKIPLQGVKLIGDIVNGKKMEVKNVEGNIDENTKKILIESIKQFSAIDTDFPKEGLKIGDSFDVVVPYKQSTQMGDIEMIMNIKYTLLKVEKEEAYFDMLIDFVMGDKNVKNMDLSASGDGKGFLLFDMKNNYFTSQNIDMTINLKLKTELLTLENTSKAKSVITQQKIK*
Ga0111365_107447Ga0111365_1074472F042387MKRIILFFMAGLFLVSCSRENDKMTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEEHGKTNYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYSYSGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPNGHFYLNPNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDEDENTIADYLNFRWVSHNPTLIQKSTNLYGSGADSSEYKFQYEYKNNFPIKTKLNINNQTVIMMVYEYNK*
Ga0111365_108319Ga0111365_1083193F072446MHEFNAQAKNKFNSPTHPQMKKLVFLLFGLCLYCFTACDSDHEPTKPVRPFNGDTLAQIAWNFPYIVEEHYHSISGIVPEGTTYRVPVIPRSVEDRTKKEYNEQDVDKVAHLVFRATVHGDTINRHKKEFETLARQLHALTLPTVGTSPVLCGVKSIKAVGVAENGRTYDLSWEMKLRIRDYKSRRKYDSGRIVTLECEDTESLTARYVVQLGQIRKAELAEHIQPELKFYLPVKRCMDFSSIRFDITLFNGEVLSFQHKLPSKSVLQELPNNSVLENYKQYGFEREATYFTTLWPVPEKEIGR*
Ga0111365_111917Ga0111365_1119174F051212MKKFFFIFVLYWLHSCNGTEKVNADTIKTSISEKQNAEKIERIIYSETGGDTGGKNVHLVITKDSIIYRLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLKDFEKGKPSKELIMDLPTTKIIIKTDKKEYSKTDIQNNTTWDYITKQVMDIKYSQLYNHLNLEK*
Ga0111365_114881Ga0111365_1148811F094006MWSALEPAGPTGALYESAVCVEAHIGELQTGGIACRGLFAFGGSNILAVAELGAEAPHAINMYDIALCKPLSELFAKELEYAFNFGA*
Ga0111365_114902Ga0111365_1149021F041827FFMKHFLSALALGCLLLSCNRDLENNENNETPAPPKEERLVLASLFEYISNVRFQYKNGNEINRMTINEASIDFEYDTYGRIVKERRFDHKSDYGETIITYQYDNQSRLTSSHAISTQYYPDTGYTPRCSVEKKHTYTYQGNKVTVKIEMGTDTCSAIPETGKEKTITLFVENGKVVKSLDENNQIIETIEYLNTKNTLRNMKGFPPLVVEFYIRPLTYELPFYNDIEHIEDLRFIDNIKTRDFHNGSYWEYRYSYNKKKTYDNDYLEREKITVYEKSHNDPTHDNYLFSVSPSRYYIKEK*
Ga0111365_114902Ga0111365_1149022F041827MKHFLSALALGCLLLSCNRDLENNETPAPQNEKLVLLDELREGSTITFQYKNRNEIESVNIDGVGNSDIDYEYDTYGRIVKERRFHRRYDYGETNITYQYDSQGRLASSHAISTEFYPGTGLTPRCSVEKKHTYTYQGNKVTVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIKQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNLGIERIEDLRYIDNIKTRDFHDGDYWEYRYSYDKENTYNGD
Ga0111365_117151Ga0111365_1171512F084362MSLMNCAFTVRWSDEKNKPHAKTYATEADAKRAKKWLLEHGVWSVDIAVKINNKPAGSLEDGDKPSETEAEQKGFWWEK*
Ga0111365_118232Ga0111365_1182321F103431FIQLFLQGIAWRVAIAHFFHAERGNAAAATFDGAFRENIADCHAEDDNDKNAESKEEGFHVCIPEG*
Ga0111365_118675Ga0111365_1186753F097527MIYFKMEKIGNSTYNKEKKTRSENLVFNTIPAAGVEPARPCG
Ga0111365_126936Ga0111365_1269361F084342VSGQCGAKLFLLGGYLAIPIRCPFFAQALKKTLGQARSLGDAYDKDYQNEKALQWIHRRRILGICKEE*
Ga0111365_129196Ga0111365_1291961F040685MKKLLFKLFFALAFASISLHGQEKIQQVEVHIFGGMALYSSQYTLNSLKKEFSAKPLMGQKEELPKEISLPNTPKNWEAFTKKINLDKFKKLRDGPSEQAFDGQDKVIIIKTDKKTYRKMNASGNDHDREVWYDLLQIIPKEFGKKGVYE*
Ga0111365_133701Ga0111365_1337011F064818KFIVVLLPYFTYMNKDEFLKKLIAFIADNSSELHPKVFKNKIRFGINKSSYTEMRFDYSQNYKGFYLQLASYNREVGDFFEQEMGNAFLKMLEDESKEFRNLFFAQNSFQLSHYYYGFPIMTNDNTGHLYPEMGTTIFNDVLRNLQANHFKFIQAAEVLSPDLLHYIKRFPSCFFNTALVALLTIEKNLLSLDDERVQGLFEYDNMVTKNECKLFSPFDLIFGKKDYQQTAKQRILQRK*
Ga0111365_145708Ga0111365_1457082F073671KEQTEHELAELHEKEQSLEKALEIVREKIRELINYTNKNKAAR*
Ga0111365_146510Ga0111365_1465102F098763MDRRTSLRILSTVLYLATKLFEAVVWTTISYDPSDEVEGKGGMNAIPTAGGGEALG

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.