NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008472

3300008472: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 159551223 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008472
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053283 | Ga0115373
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 159551223 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 158364712
Sequencing Scaffolds: 18
Novel Protein Genes: 22
Associated Families: 19

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 1
Not Available | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1
All Organisms → Viruses → Predicted Viral | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: National Institutes of Health, USA
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F018385 | Metagenome | 235 | Y
F027205 | Metagenome | 195 | N
F032313 | Metagenome | 180 | N
F033081 | Metagenome | 178 | Y
F036281 | Metagenome | 170 | N
F043991 | Metagenome | 155 | N
F045567 | Metagenome | 152 | N
F046431 | Metagenome | 151 | Y
F051213 | Metagenome | 144 | N
F073671 | Metagenome | 120 | N
F077405 | Metagenome | 117 | N
F080164 | Metagenome | 115 | N
F085820 | Metagenome | 111 | N
F089055 | Metagenome | 109 | Y
F097527 | Metagenome | 104 | N
F103431 | Metagenome | 101 | N
F103432 | Metagenome | 101 | N
F105378 | Metagenome | 100 | N
F105380 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0115373_1000897 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 19467
Ga0115373_1001205 | Not Available | 16026
Ga0115373_1001841 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 11727
Ga0115373_1002596 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 9242
Ga0115373_1003979 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 6668
Ga0115373_1004425 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 6152
Ga0115373_1007604 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 3822
Ga0115373_1015485 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1879
Ga0115373_1016452 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1767
Ga0115373_1016777 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1731
Ga0115373_1017835 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1625
Ga0115373_1020109 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1428
Ga0115373_1024626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1153
Ga0115373_1027788 | All Organisms → Viruses → Predicted Viral | 1017
Ga0115373_1033523 | Not Available | 831
Ga0115373_1034290 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 812
Ga0115373_1035271 | Not Available | 789
Ga0115373_1037725 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 734
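
The per-taxon scaffold counts under Dataset Phylogeny appear to be tallies of the Associated Scaffolds rows grouped by taxonomy. The following is a minimal sketch of that tally, assuming a handful of rows copied by hand from the scaffold list; the `tally` helper and the tuple layout are illustrative and not part of NMPFamsDB.

```python
from collections import Counter

# (scaffold ID, deepest assigned taxon, length) for a few rows of the
# Associated Scaffolds list. The tuple layout is an assumption made for
# this sketch, not a format the database exports.
rows = [
    ("Ga0115373_1001205", "Not Available", 16026),
    ("Ga0115373_1001841", "Prevotellaceae", 11727),
    ("Ga0115373_1003979", "Prevotellaceae", 6668),
    ("Ga0115373_1033523", "Not Available", 831),
    ("Ga0115373_1035271", "Not Available", 789),
]


def tally(rows):
    """Return (scaffold count per taxon, summed scaffold length per taxon)."""
    counts = Counter()
    lengths = Counter()
    for _scaffold, taxon, length in rows:
        counts[taxon] += 1
        lengths[taxon] += length
    return counts, lengths


counts, lengths = tally(rows)
for taxon in counts:
    print(taxon, counts[taxon], lengths[taxon])
```

For these rows the tally gives three scaffolds for "Not Available" and two for Prevotellaceae, which agrees with the Dataset Phylogeny counts for those groups.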

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0115373_1000897 | Ga0115373_10008972 | F085820 | MYPRDLKVFAAGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDHLKQRIVLRLADGTEIEKELSEKGKK*
Ga0115373_1001205 | Ga0115373_100120510 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNAPITMQDHLENNQIGYDSANSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPLFVDVQCVEYTAGDLGEVSVSYTADSISIYNTGSFTGSFQCLIVYPLGSVNE*
Ga0115373_1001841 | Ga0115373_10018411 | F032313 | MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK*
Ga0115373_1001841 | Ga0115373_10018412 | F032313 | MACDNNTPQEKPHEQEKHEVPXXXXKPQFDEVGERIWYGRTPAMRLDSTDYGAGLISVFGMLTSKIPKQRFDSLFKQTVWEVKDIRVVETDLSLAKKNPGILGWVTTTEFTCRNGVILLHWQGIDVNQVDTVNYVYDEVDNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0115373_1002596 | Ga0115373_100259610 | F089055 | HKRTFWHLKVEKMNSTPECVTKTPEIKAREKEAREKLAVIFSDAEQRDNSKVNPELGKTAFDVANIPNNAAVDLCNKALGSYGKSLDRIKNSPLEAVWAIGTSLQHLRDEYKTEESCG*
Ga0115373_1003979 | Ga0115373_10039791 | F103432 | MKKLIHSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDSTTMFRRHTLPSVSAYSFQVVAVGDTIYRQKESDAQFNADLDALFQQSIGIAPRLFGVRELSVLGIDSRGKTRDLGNYSCPLLQGKRRNANFRTTEGVFHEHYEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDG
Ga0115373_1004425 | Ga0115373_100442512 | F018385 | MADLINSWLPYQELSIEKDRDPVTDDEIIYGSNVKHFTLTVYSPEGRVSKYWNARILKDQVGYCRVACPREKKILCFNWVNWTAYMFTHDGMNELVFMPDARRRTVSQLSFDN*
Ga0115373_1004425 | Ga0115373_10044255 | F036281 | MITLIKVDEGPVDIYELRMQYLAKLKQTDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYNAKEYYDYWTAREGKPAPFFYELRQYHVKSFMRVPGSTDLWITAEREHGRWYTFRMSDDQKSKFTRHTMTNEKGHQTYDWVLENVEWAADTIRYF*
Ga0115373_1004425 | Ga0115373_10044256 | F027205 | VASRLIVSADDILKAVKESEEFERKALNEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPRRIKRSTK*
Ga0115373_1004425 | Ga0115373_10044259 | F043991 | MSKKNPSVIDYFDLNGDLNEEAYEFEDVKLDEYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLGIVYADINFPNGIRTILFKCRQKKNLTRFISRVLELAQGDSSNIHPDFRA*
Ga0115373_1007604 | Ga0115373_10076042 | F103431 | MIDLDALVVGMLFFTQLFLQGIAWRIAIAHFLHTERGNAAAAAFDGAFGENIADCHAEDDKDKDTESKEEGFHVCIPEG*
Ga0115373_1015485 | Ga0115373_10154851 | F051213 | MKASKLLWAVIVAFTFVFTSCDRVGDEPTIEGKLDKFFDSQAQRKSFRILTGSGKPYNHKVDWHIIGITDPYSDTYLTKKVDTLSNGDLKISYDWVSFTVRENKSVIDVEVQKNETGKVRAVYLNTSTSGRQITLPDMRVTQRAE*
Ga0115373_1016452 | Ga0115373_10164521 | F045567 | GLDSLNMGDVAPLADTIAYDWERAMRQRDGRDTLTEELKGGITPLLYRAEGEARRPWVGMVTEDVVHTSTHRVEDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVRATVNEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEIPSLSSERGCVVERVVR*
Ga0115373_1016777 | Ga0115373_10167772 | F018385 | MMAEYENQWGPYKEHSIEKDRDPVLDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGYCRIACPRDGKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLYHEEVK*
Ga0115373_1017835 | Ga0115373_10178354 | F046431 | VIISRAAESELTLTPKPETNNIHLKWTGPQNSSYKVYQKKPGSNNFETIGLTDFSNNATDEEVKVLNIYPQESNADGRLWPTLNPSDVANIAKVIPKVQVTYLDGQTETIQKSALLKVWMEGRNSKRR*
Ga0115373_1020109 | Ga0115373_10201092 | F033081 | MYPPYLIMVHRPQKGVMAWLFKKGMPQDPKPVFVWPRLVTEIENAGYFSRRKFSILAVGLIIMTIATIKMLLLVPGLNQSVVSLLTRGLETFLPTRWATVTAWTVGMAGVFLMGDLTNYTPSQMFLHKIKATRFEVYNIILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIANIWYSFAAGIALSVTGFGFLLVYLWRYRKYRSQIIATAAATTVHALYNAIALSLIAVVLAIDIAKLL*
Ga0115373_1024626 | Ga0115373_10246261 | F046431 | KMTIVSILLIIMLTYTQTLVFAAESELTLTPKPETNNIHLKWTGPQNSSYKVFQKKPGATQFETIGLTDFSNTDEEVRVLNVYPVSIAEYNTPYVNVTYLDGTSEDIPKSALLKVWMEGRNSK*
Ga0115373_1027788 | Ga0115373_10277882 | F073671 | MNKEAEHELAELHEKERSLEKALELVREKIRELINYTNKNKAAR*
Ga0115373_1033523 | Ga0115373_10335231 | F077405 | RLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQKQKVRLK*
Ga0115373_1034290 | Ga0115373_10342902 | F097527 | MIYFKMEKIGNSTYNKEKKTRSENLVFITIPAAGV
Ga0115373_1035271 | Ga0115373_10352711 | F080164 | GLTLCAAPQVTLRERANAFPLITEKDESEIDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGFGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRGDLLLNAANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEMELEDGTKLSTSLQHYDEQTSILDHPDRS*
Ga0115373_1037725 | Ga0115373_10377251 | F105380 | VKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRNKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK*
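
Each Sequences row carries a scaffold ID, a protein ID, a family ID and an amino-acid sequence. The following is a minimal sketch of splitting one row into those fields and writing it as a FASTA record. The ID patterns (`Ga…_…` for scaffolds and proteins, `F` plus six digits for families) are inferred from the rows on this page, the trailing `*` is assumed to be a stop marker, and the `parse_row` and `to_fasta` helpers are illustrative names, not an NMPFamsDB API.

```python
import re

# Assumed row layout, inferred from this page: scaffold ID, protein ID,
# family ID (F followed by six digits), then the amino-acid sequence.
ROW = re.compile(r"^(Ga\d+_\d+)(Ga\d+_\d+)(F\d{6})([A-Z*]+)$")


def parse_row(line):
    """Split one Sequences row into its four fields.

    Whitespace and '|' separators are removed first, so the same pattern
    handles rows with or without column delimiters.
    """
    compact = re.sub(r"[\s|]+", "", line)
    match = ROW.match(compact)
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    scaffold, protein, family, sequence = match.groups()
    return {
        "scaffold": scaffold,
        "protein": protein,
        "family": family,
        # Assumption: a trailing '*' marks a stop and is not a residue.
        "sequence": sequence.rstrip("*"),
    }


def to_fasta(record, width=60):
    """Format a parsed row as a FASTA record wrapped at `width` columns."""
    header = (
        f">{record['protein']} family={record['family']} "
        f"scaffold={record['scaffold']}"
    )
    seq = record["sequence"]
    body = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header] + body)


row = ("Ga0115373_1027788 | Ga0115373_10277882 | F073671 | "
       "MNKEAEHELAELHEKERSLEKALELVREKIRELINYTNKNKAAR*")
print(to_fasta(parse_row(row)))
```

Rows whose sequence has no trailing `*`, such as the F097527 entry, parse the same way; the pattern accepts `X` residues as well.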

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice