NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008281

3300008281: Human buccal mucosa microbial communities from NIH, USA - visit 1, subject 160218816 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008281
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053059 | Ga0113996
Sample Name: Human buccal mucosa microbial communities from NIH, USA - visit 1, subject 160218816 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 149476067
Sequencing Scaffolds: 13
Novel Protein Genes: 18
Associated Families: 15

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 1
Not Available | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Buccal Mucosa → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F018385 | Metagenome | 235 | Y
F022002 | Metagenome | 216 | Y
F027205 | Metagenome | 195 | N
F033081 | Metagenome | 178 | Y
F036281 | Metagenome | 170 | N
F046431 | Metagenome | 151 | Y
F049707 | Metagenome | 146 | N
F051211 | Metagenome | 144 | N
F063778 | Metagenome | 129 | Y
F074985 | Metagenome | 119 | N
F077403 | Metagenome | 117 | N
F095630 | Metagenome | 105 | N
F097526 | Metagenome | 104 | Y
F101358 | Metagenome | 102 | Y
F101360 | Metagenome | 102 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0113996_1000006 | All Organisms → cellular organisms → Bacteria | 281903
Ga0113996_1000154 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 60533
Ga0113996_1000478 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 31661
Ga0113996_1001768 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 12152
Ga0113996_1001813 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 11907
Ga0113996_1002086 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 10447
Ga0113996_1002663 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 8391
Ga0113996_1003419 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces | 6571
Ga0113996_1003695 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 6107
Ga0113996_1004976 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 4541
Ga0113996_1005109 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 4427
Ga0113996_1020874 | Not Available | 1137
Ga0113996_1022207 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1069

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0113996_1000006 | Ga0113996_100000670 | F051211 | MKAKKAIRIFEKIRDLPYGTSGSDEVWSCYRKCVLLKQELQHIGITSQLLIGVFDWQDLPIPEHTLNLRRQRHERHVILRVFIDGSAYDIDPSIDIGLAPTLPIAHWDGTSSTATMASLKHLRIYRPHSLHERILSRLRRKLFRGNPKEFYTAIDKWLADTRAHQLS*
Ga0113996_1000006 | Ga0113996_100000697 | F095630 | MYSSLYKISKDNGLLAHVYEHLLAQYILKYLQDKGLFISSDIILTAKTYGDTCYMDVELYNPAAPNAYNEALQVFDKHTIPEKAVRRAVSECGIEMNRAVLELKQDELMSNLSRMQSSDWRQQSEMTYRRSYDKSSVNTLFCVPYLKYGKKSKKLFPEYVLEYSIDEEYINSPIDQALAAIIMQAVALNFLVAVRENYTVYDRGDQWSEASLSVGYRMFLGLAKEDKQITSQLKHEFTVYIQYLLKSPFCSNLQKALLRCSCNLEQVLLGRSTLNNILGGCVIGGRGWLEMADDARIKQMIAAIQLDVYDI*
Ga0113996_1000154 | Ga0113996_100015450 | F101358 | MTEKHSASPTAKEETLAPLPPRQPEHLLTQPITVDGAPEKRATHPDTHSIPKRQDVPQYLWNVLRSSGQLDEGWIDREIIDKDGTVLVRMTKLVNRSNIHRLEKCVSLAVLQEQNLDFTNAYLQRAYGVMIKDGRLQPVADTTHATTSSQEQQTEILPAQGGDTYEHLGGDSARQLVGSVAAKASDGEVDNSLLAQRALERIR*
Ga0113996_1000478 | Ga0113996_100047818 | F033081 | MAWLFNRVMPTDSRPAFVWPRLVVAIEDTRHFDRRELSFIAVVLIVMTIATIKVLLMIPGLDSSVVNLLTRGFATFLPRGWATGAAWVAGMTGAFLIGDFTNYTPSQKLLHKTKATRYEAYNTLLLFALMEEQAFRSGSEKWSWCERVRASVCFGLAHVVNIWYSFAAGTALSMTGFGFLLVYLWYYRKYRSQIVATAAAATLHTLYNVIALSLIAAAAVLYMAISVIKML*
Ga0113996_1001768 | Ga0113996_10017683 | F046431 | MKKISRISITIILILSIIISYGSVIISMAAESELTLTPKPETNNIHLKWTGPQNSSYKVYQKKPGSSNFETIGLTDFSNNATDEEVKVLNIYPQESNADGRLWPTLNPSDVANIAKVLPKVQVTYLDGQTETIQKSALLKVWMEGRNSKRR*
Ga0113996_1001813 | Ga0113996_10018134 | F101360 | VNKENALRRAAIAAHVAKVASQEKKKALNELMEVMAPGDRSYATVNGEQVGAISVTTATPAYQVTDEQALVRWLEWNKPDAIHRVPAPWFTAKAALDGFIKQTGEIPDGVELVTPTPRISARVSPAQEEVIRELIAIGDISLIEIEGAE*
Ga0113996_1002086 | Ga0113996_10020869 | F063778 | MPETISEEAQQELLRQLQDALGLVKNADTSALDVAAITHSAADGHQLTEVMLQQMTAIEAYLKNCQVSINDAISNIEAIPLDPPPDD*
Ga0113996_1002663 | Ga0113996_10026631 | F074985 | MELTDGGWYKTPRIIKGKDFLAHIHDTCASGNAMYVEFKASEGEVRILEYQRLYEVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQKAWKAIHMGSTKRLNLEQFDQLWLDQTFQKLHPVIVNHDGKFWHVMGLKLDVDADGSFWGLYLKRQDSDFMKEIRMPLTQKFIYNPISGSWSLDDPTQEIKDLEEIKQTLRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDSETGEQKYLLDHIKAMHID*
Ga0113996_1002663 | Ga0113996_10026632 | F036281 | MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEEQKTYSAKEYYDYWAAREGRPAPFFYEARQYHVKSFMRVPGSTDLWITAERETGHWYTFRMSDDQKSKFTRHTMTNEKGHQSYDWVLENVEWAADTIRYF*
Ga0113996_1002663 | Ga0113996_10026633 | F027205 | VASRLIVTADDIMRAVKESEEFEKKALSEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYILDEYDIPRRIKRSTK*
Ga0113996_1002663 | Ga0113996_10026639 | F018385 | MMAEYENQWGPYKEHSIENDRDPVLDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGYCRIACPRDGKILCFNWVHWTAYMFTYDGLNELVFMPGSSRKTISRLYYEEVK*
Ga0113996_1003419 | Ga0113996_10034198 | F063778 | MPETISEGAQQQLLQQLQDALGLVAEADTSAHDVAAITHSAADGHQLTELMLQELTTAREYLKSCADQIEHAISNVKAIPLDPPPDD*
Ga0113996_1003695 | Ga0113996_10036951 | F022002 | ALLLSLPSYAQDGQSKEPINRTISGFTLGVTTPAEARAIIQRQGGEIEETQAWSDEVVYAITGLKYARRPTLSVRLYFYKGHLRSISFVFGDLKIFEQIESGLENKYGTMAEGKATSKMRVKGIADAFTSLEVVVHSFEDDGHVGFAYAYISYTDLELDRAYSAENENEI*
Ga0113996_1004976 | Ga0113996_10049764 | F077403 | MSAIYAEERISVHFEVRWHRERNPMDTLRRELDVPYLHVVYQNHTDTAYYLVRQDQSNWIFPRLRYYTVIEAIPRTEERSLTHYVPRWATFTLHARGAELQKRKVLTRQVLLQDQAWEVELEPSILDGKPKPRITYEEGVSNWSERAYYLQGYLYYLMNPQQAHDWYQVSYLPDHAMHSVATEAVVTEGELRPYVHRLAFLPARSRREYVYSLLAFRLIRGRWHFMLPDWQASARLRDEGTSLNVYLLRDQEAPTSPQGYALPDHYEGYQLYQGQVRGDSIWIEM*
Ga0113996_1005109 | Ga0113996_10051096 | F097526 | MATTIYTQDDYCRLCERLDDDTVAALLHAHIRAFAADGNWQLLKRLTEAMRIAAEFEQACRNENPDPAEQARRARWDNQFRELQQQARMAHCEIVNADNAAVSHLTAQCERNGRCDTDTELSVARDGAYGFAAALRDIPLNEKQTVLLWRMAVLTIAEITDATPSLVAHYLNGTGGEHLGRALAGKTVYPVTVVSNLAWLLHEQHKSGQLQRDLSHTAKLLRLTEVAEMRGAMPHRAVWQADNPKSRQSTLEPGE*
Ga0113996_1020874 | Ga0113996_10208742 | F022002 | MSSRLALTLGLLLALLLSLPSYAQDGQSKEPIDRTISGFTLGVTTPAEARAIIQRQGGKIILEVGIHAGSKDVSYTVTGLNYARCPTQTVYMFFYKGHLQSIFFSFDGWDVLEQIESELENKYGMMIESEGMFKKKGIVDAFTSLEVVRTFEYEGHSMSKNAYIAYTDKELDRARSAEKENEN*
Ga0113996_1022207 | Ga0113996_10222073 | F049707 | VVSEYRSPHNDGHDPYILIWEYGNDIRRAEFTERWAECEPDTGWTVWYFRLEGGEVMKFRALEWEQKDDVNHLTTIWMRPSLHDIERKEN*
Ga0113996_1026004 | Ga0113996_10260042 | F077403 | VGHAEEGISVRFEVHWHREHNPMDTLRRELDVPYLHVVYQNHTDTAYYLVRQDQSNWIFPRLRYYTVIEDASRPEKYNLTYYVPRWATYPLKGAGHSVRERRHYTHHVLLRDQAWEVELEPSIVDGQPKPRIAYGEGMDNWSECAYYLQGYLYYLMTPQLDHDWYQVTYLSEHMEYSVGMEMVGTEGTLKPFVHRLAFVPAHSRREYVYSLLAFAVIHGGWQFLLPDWEASGRLRDER




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice