NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 7000000052

7000000052: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 404239096



Overview

Basic Information
IMG/M Taxon OID: 7000000052
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052517 | Ga0031291
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 404239096
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 125496045
Sequencing Scaffolds: 24
Novel Protein Genes: 25
Associated Families: 23

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → Neisseria sicca | 1
Not Available | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816, Long. (°) -77.1012173, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F030786 | Metagenome | 184 | N
F032313 | Metagenome | 180 | N
F042387 | Metagenome | 158 | N
F043990 | Metagenome | 155 | N
F046431 | Metagenome | 151 | Y
F046432 | Metagenome | 151 | Y
F047508 | Metagenome | 149 | N
F051211 | Metagenome | 144 | N
F051212 | Metagenome | 144 | N
F057446 | Metagenome | 136 | N
F061925 | Metagenome | 131 | N
F063777 | Metagenome | 129 | N
F064819 | Metagenome | 128 | N
F071328 | Metagenome | 122 | N
F072446 | Metagenome | 121 | N
F077404 | Metagenome | 117 | N
F080164 | Metagenome | 115 | N
F084342 | Metagenome | 112 | N
F085820 | Metagenome | 111 | N
F094007 | Metagenome | 106 | N
F097490 | Metagenome | 104 | N
F103432 | Metagenome | 101 | N
F103433 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
C3017007 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 513
C3041294 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 596
C3041556 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → Neisseria sicca | 597
C3054022 | Not Available | 657
C3093981 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1029
C3097639 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 1091
C3104310 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1233
C3105828 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1273
C3114693 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 1578
SRS042643_WUGC_scaffold_16891 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 951
SRS042643_WUGC_scaffold_17039 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13515
SRS042643_WUGC_scaffold_18759 | Not Available | 1936
SRS042643_WUGC_scaffold_37724 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella | 2742
SRS042643_WUGC_scaffold_45213 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2906
SRS042643_WUGC_scaffold_47985 | Not Available | 1066
SRS042643_WUGC_scaffold_54110 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 562
SRS042643_WUGC_scaffold_57256 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 854
SRS042643_WUGC_scaffold_57391 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 5309
SRS042643_WUGC_scaffold_57913 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 564
SRS042643_WUGC_scaffold_63277 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 6467
SRS042643_WUGC_scaffold_63504 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2111
SRS042643_WUGC_scaffold_63802 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1493
SRS042643_WUGC_scaffold_65003 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 2536
SRS042643_WUGC_scaffold_9962 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 9663

Sequences

Scaffold ID | Protein ID | Family | Sequence
C3017007 | C3017007__gene_159465 | F051211 | LPIMRVKKAIKVFEKIRDLPYGTSGSDEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLQIPEHILKIRRQQYERHVILRVFIDEFAYDVDPSIDIGLTPMLPMACWDGKSSTTTMVPLRRLRVYRPHSLHERILSQLRRKIFRSNPESFYTAIDIWLATIRVS
C3041294 | C3041294__gene_168609 | F084342 | AKLFLLGGYLAIPICRPFFAQALKETLGQERSLGDAYDKDYQNEKALQWIHRHHILGICKEE
C3041556 | C3041556__gene_168709 | F047508 | IGMLGHRLVEGRVKYPYLRNLGEYLRHGFDTEDVGWVVKRSELCALMEHIYYLWGDTYALCKALCTVYEAVTDGVDLIEGLYEVLFFENVEDNLYAACVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATVEDQDFHKVLYYMVRCELIPSSP
C3054022 | C3054022__gene_173570 | F071328 | MKWEFILEGEYIVEFVKLCKHLVLERTIAPKKHQAAAIYLRYSSQLLLKKRRAIRRLGIEKEYVSAILRQYGIHYREYGDNEHRVFFLDTGINLYFSKHDQSSIYIIQRIEFSN
C3093981 | C3093981__gene_189744 | F064819 | MILKDSQGKDRIKFSDIVRLNKEEPVFVKMEVRISTETMVSEENLDIERSDLEDLTRNLKDLSECKIRKFFFQNIDETIEIVFSINDIGTIAVEGKMYDESYMNSINFSFQTDLNGIAIFSKEISQELEKYK
C3097639 | C3097639__gene_191372 | F061925 | MSWEYSINLDSEESVSSVVTDLKICELFSSSTTDYIDWKNPKSIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI
C3104310 | C3104310__gene_194277 | F030786 | MKRTKIHNVVFQMLVVMIVTGSLQLLLKNGSAAKGGSATGTKKISLADITGGDDSEIGVLKVKYFDDEGADKIFENNNNRILNTINSQHISYNSQSAEYSKPQLFLLYQSLKVDC
C3105828 | C3105828__gene_194869 | F043990 | MIKKFGIIFIFGVIILGIAVYANHKIERSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDSSFTKLNKLPIKEDLPPNGIPEQFLNIANGYYKYVVDENDDRSFDILIVDTTRKEICIYYQIL
C3114693 | C3114693__gene_198916 | F063777 | LKVLNIKISALPPENNSFWMKLLLYLCIPFLLIFGILLLIGWGIYSGISSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLKKEDYFPDENSQEYKIFDDFCAKSNEYLDDGYLFYKLTDKKSATDLNRAIISEFQEDIGDYILLQNLILEDNQLKNQLISFDKNTGKITVLTDIKDFFWLDFDSETKIINGYNNKEQIEIAISE
SRS042643_WUGC_scaffold_16891 | SRS042643_WUGC_scaffold_16891__gene_23119 | F046432 | MWEMTESKLRNIISKYQLPMDDYLAEVDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRKIETLENASDAEDEISKYLFGMYSIFEMKS
SRS042643_WUGC_scaffold_17039 | SRS042643_WUGC_scaffold_17039__gene_23320 | F103433 | MVFVMKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRVLFGGITASVFVVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE
SRS042643_WUGC_scaffold_18759 | SRS042643_WUGC_scaffold_18759__gene_25511 | F080164 | LCAAPQVTLRERASAFPLITEKDESEVDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGKLTVRVLVVGDTVAVHRDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKAEKPLLYKGQTGRLVLCEYYDSHRGDLLLNAANARPEIFGELCPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYGEQTSILDHPARS
SRS042643_WUGC_scaffold_18759 | SRS042643_WUGC_scaffold_18759__gene_25512 | F103432 | MKPINSLFSLSLLLALSGLFCTTACQDDAEPTQRTGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDSTTMFRRHNLPSVSAYSLRVVAVGDTIYRQKESDAQFNADLDALFQQSIGIAPRLFGVRELSVLGIDRKGKPRDLGNYSCPLLQGKRRNVNYRTREGVFHEHYEAASVDTFSVKDDWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY
SRS042643_WUGC_scaffold_37724 | SRS042643_WUGC_scaffold_37724__gene_52391 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDRTEKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTCGLCLIISIMKGNCKRAIAQTF
SRS042643_WUGC_scaffold_45213 | SRS042643_WUGC_scaffold_45213__gene_63612 | F046431 | MKIFKKITIVSILLIIMLTYTQTLVFAAESELTLTPKPETNNIHLKWTGPQNSSYKVFQKKPGATQFETIGLTDFSNTDEEVRVLNVYPVSIAEYNTPYVNVTYLDGTSEDIPKSALLKVWMEGR
SRS042643_WUGC_scaffold_47985 | SRS042643_WUGC_scaffold_47985__gene_67749 | F032313 | MYRFLILLFAIPLIACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGINVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK
SRS042643_WUGC_scaffold_54110 | SRS042643_WUGC_scaffold_54110__gene_77392 | F094007 | VFCGVDYLWILWYNSYYMKSKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA
SRS042643_WUGC_scaffold_57256 | SRS042643_WUGC_scaffold_57256__gene_82867 | F046432 | MWEMTENELSEIISKYQMPEGRYSVVEEGSFGESEFFWVIKNESTNKKYLLMNTYSHHGVEAEAEYYREEGFDNLEAIPRKIETLENVSDAEDAIFKYLFGLYSIFEIKP
SRS042643_WUGC_scaffold_57391 | SRS042643_WUGC_scaffold_57391__gene_83125 | F085820 | MKSTFYLFAILFLVTTFFSCETVEPSPRATWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVIPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECNFRRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSDKRKK
SRS042643_WUGC_scaffold_57913 | SRS042643_WUGC_scaffold_57913__gene_84103 | F057446 | MNSISFGKTTITSYPEYFEIADNKKTSKLLYLSASFVFIAIYLFDLYQNDFDFGKVAHFKTISAVLWLVIFALQFWLINTESKIEKSKIKEIVVRKNRWASIVIHYGDKKRKIDGFSQDEAEQIIKFLMNNR
SRS042643_WUGC_scaffold_63277 | SRS042643_WUGC_scaffold_63277__gene_95659 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLQRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTPNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR
SRS042643_WUGC_scaffold_63504 | SRS042643_WUGC_scaffold_63504__gene_96259 | F051212 | MKKFFFIFVLYWLHSCNGTEKAMPTSPDTQKTSISEKQNAEKIERIIYSQTGGDTGGKNVHLVITKDSIIYHLTEDVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTNKKEYSKTNIQNNKTWDYITKQIIDIKYSQLNNHLNLEK
SRS042643_WUGC_scaffold_63802 | SRS042643_WUGC_scaffold_63802__gene_97189 | F042387 | MKRIILFFMTGLFLVSCSRENDKMTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKANYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPGGHFYLNPNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDEDENTIADYLNFRWVSHNPALIQKSTNLYGSGADSSE
SRS042643_WUGC_scaffold_65003 | SRS042643_WUGC_scaffold_65003__gene_100802 | F097490 | MELDEEKKEVLVEIRNNTENNYYLLSPIVSIMTKHLQDIGGEMIEGQIHHKKLDSIVCSVCIWDDICKEEYYAMRDIVLLPKKSVKKIKYKYDNEEYIEIETVHIGFPYNGYYNEIGKKMQFMLKKKLDSSNIIKGYEFYNKDIETMTIKM
SRS042643_WUGC_scaffold_9962 | SRS042643_WUGC_scaffold_9962__gene_13856 | F046431 | MLTYTQILVFAAESELTLTPKPETNNIHLKWTGPLNSKYRVYQKKPGSSNFETIGLTDFSPEAIDEEVKVLNVYPHSKNVGRLWPGSTAFQSLPMVNVTYLDGRTETIEKSALLKVWMEGRNIK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice