NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008515

3300008515: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765074482 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008515 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053239 | Ga0115189
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 765074482 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size171284191
Sequencing Scaffolds10
Novel Protein Genes10
Associated Families10

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21
Not Available1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria6
All Organisms → cellular organisms → Bacteria → Proteobacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationNational Institutes of Health, USA
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F033081Metagenome178Y
F046432Metagenome151Y
F046433Metagenome151N
F084362Metagenome112N
F094007Metagenome106N
F095630Metagenome105N
F095633Metagenome105N
F103431Metagenome101N
F103433Metagenome101N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0115189_1000001All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2335433Open in IMG/M
Ga0115189_1000342Not Available30986Open in IMG/M
Ga0115189_1004100All Organisms → cellular organisms → Bacteria5569Open in IMG/M
Ga0115189_1006183All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria4165Open in IMG/M
Ga0115189_1008599All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria3246Open in IMG/M
Ga0115189_1023648All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1400Open in IMG/M
Ga0115189_1026328All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1271Open in IMG/M
Ga0115189_1029059All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1161Open in IMG/M
Ga0115189_1031717All Organisms → cellular organisms → Bacteria → Proteobacteria1068Open in IMG/M
Ga0115189_1052010All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria662Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0115189_1000001Ga0115189_100000177F105380MPILELDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLSELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWHDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKKCAVFLGKITKEQENVNTYEEFMEWTKNTKWFK*
Ga0115189_1000342Ga0115189_100034236F084362MNCTFTVRWSDDKNKPHAKTYATESDAKRARKWLLEHGVRSVDIAVKINNKPAGSLEDGDKPSEAEAEQKGFWWQE*
Ga0115189_1004100Ga0115189_10041006F095630MIISSLYKTVENNGLLAHIYEHLLAQYVMKALQGRGFFISSDIILTAKTYGDTCFMDVEFYSPEVQDAYNEALRLFDKWNIPRSAALRAATECGIEMNRLVAELAQDELLHELSAVQSSPWRQQSDMTYRKADSKSSVNTLFHAPYIKYGLESKNLFPEYVLEYSVDEKYIQSPVDQALAAVVMQAVALNFLVMIRKKHTVYDRGDQWSEASKSVGYRTFLGISKKDGSIVHQLKNEFMEYVQFLSASPFCGNLQVALVRCSHNYEQVLLGRDTLNSILGGCVVGGRGWLEMADDTRIRKMIDLIEIDIYEIDY*
Ga0115189_1006183Ga0115189_10061833F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVITGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVSVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGK
Ga0115189_1008599Ga0115189_10085991F094007MKLKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGHGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0115189_1023648Ga0115189_10236482F095633MKNYENFTEVGHREGLTEGELRTMGMLAMEATEELKKTTIRKEAVLLGGVPFNSWDEFAKAVQEMAAHSYEPIPVKINTKRLIATAFLDDRGEMSVEEHSVPEEVFIDLSRTRCVVDADRSHKSYKFTCPVLKKYPDGELYPIREAYVISVIDVNGSQEVDFKII*
Ga0115189_1026328Ga0115189_10263282F046433MIELPTSPDALSELSPVAPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPSFMESRCETTEYEAINDFVQFIEMTKHYLPDYMEDCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQCRTKNKMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASGDYLDNGISAYSQYGGATYSVEACYRLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGKNFDGLSRFRQLLEKE*
Ga0115189_1029059Ga0115189_10290592F033081MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIVTIATIKILLFVPGLNQSVVSLLTRGLETFLPTGWATIAAWVVGTTGVFLIGSFTSNYTPSQRLLYSLEATGCGVYDTLLLLALIEEQAFRSGSERWNWRERVRASVCFGLLHIANIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL*
Ga0115189_1031717Ga0115189_10317172F103431MIDFDTLVVGMLFFIQLFLQGIAWRVAIAHFLHAERGNAAAAAFDGAFGENIADCHAEDDNDKNAESKEEGFHVCIPEG*
Ga0115189_1052010Ga0115189_10520101F046432MWEMTENELSEVISKYQMPEGRYLVEQEGSFGESEFFWVIKNQLTNQKYLLMNTYSHHGVEDEVEYYREEGFDNLEAIPRKIETLENASDADDEISKYLFGMYSIFEIKS*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.