NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300006457

3300006457: Human supragingival plaque microbial communities from NIH, USA - visit 1, subject 765337473



Overview

Basic Information
IMG/M Taxon OID3300006457 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052497 | Ga0100214
Sample NameHuman supragingival plaque microbial communities from NIH, USA - visit 1, subject 765337473
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size78283055
Sequencing Scaffolds13
Novel Protein Genes20
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium3
All Organisms → cellular organisms → Bacteria → Proteobacteria1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Supragingival Plaque → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F018385Metagenome235Y
F027205Metagenome195N
F036281Metagenome170N
F040685Metagenome161N
F041827Metagenome159Y
F043990Metagenome155N
F043991Metagenome155N
F047127Metagenome150N
F049707Metagenome146N
F061925Metagenome131N
F071329Metagenome122N
F073671Metagenome120N
F074985Metagenome119N
F077781Metagenome / Metatranscriptome117N
F081456Metagenome114N
F089056Metagenome109N
F095632Metagenome105N
F101360Metagenome102N
F103434Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0100214_100053All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales44089Open in IMG/M
Ga0100214_100054All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes43813Open in IMG/M
Ga0100214_100163All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae22185Open in IMG/M
Ga0100214_100292All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes16345Open in IMG/M
Ga0100214_100373All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes14927Open in IMG/M
Ga0100214_100655All Organisms → cellular organisms → Bacteria10889Open in IMG/M
Ga0100214_110079All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1371Open in IMG/M
Ga0100214_110420All Organisms → cellular organisms → Bacteria1331Open in IMG/M
Ga0100214_112714All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1117Open in IMG/M
Ga0100214_114401All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1004Open in IMG/M
Ga0100214_116497All Organisms → cellular organisms → Bacteria → Proteobacteria892Open in IMG/M
Ga0100214_124665All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan636Open in IMG/M
Ga0100214_131280All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Apibacter → Apibacter adventoris514Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0100214_100053Ga0100214_10005310F074985MELTDGGWYKTPRIIKGKDFLAHIHDTYASGNAIYVEFKVSEGEVRILEYQRLYEVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQKAWKAIHMGSTKRFNLEQFDQLWLDQTFQKLHPVIVNHDGNFWHVMGLKLDVDADGSFWGLYLKRQDSDFMKEIRMPLTQKFLYNPISGSWSLDDPTQEIKDLEEIKQTLRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDSKTGEQKYLLDHIKAMHID*
Ga0100214_100053Ga0100214_10005311F036281MITLIKVDEGPVDIYELRMQYLAKLKQTDGVMLPTFIYRNKDLFITEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLISNGSVLFLAEERKTYNAKEYYDYWSAREGKPAPFFYESRQYHVKSFMRVPGSTDLWITAERETGHWYTFHMSDDQKSKFTRHTMTNEKGHQSYDWVLENVEWAADTIRYF*
Ga0100214_100053Ga0100214_10005312F027205VASRLIVSADDILKAVKESEAFERKALSEARKRDRAEGKEPRETLYPNADLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPRRIKRSAK*
Ga0100214_100053Ga0100214_10005315F043991MSKKNPSVIDYFDLNGDLNEEAYEFEDVKLDEYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLNIVYADINFAGGIRTILFKCRQKKNLTRFISRVLEIAQGDPSNVHPDFRA*
Ga0100214_100053Ga0100214_10005317F071329MLPVAKIIISGLTSIGAGMIASKLTKPIVSNANGIAKILLWFGSVGTGVAASAIVAREVELQFDATVKAVQEARDHVEIED*
Ga0100214_100053Ga0100214_10005318F018385MAEYENQWGPYKEHSIEKDRDPVLDDPIIYGVNTKHFTVTVYSQDGRVNKYWNARILKDDLGYCRIACPRDGKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLHHEEVK*
Ga0100214_100053Ga0100214_10005329F081456MFEEPPIYYILISLIFLIVFGAISMATWMVWLTSIAFFAKLVITAIGFLLAAMTVILYTISAE*
Ga0100214_100054Ga0100214_10005437F101360VSKESALRRAAIAAHIAKVASQEKKKALKELEEYMAPGDTSKPMIDGLQVGTVSVSSPQPRYQVVDEKALVAWLEWNKPDAVHKVPAPWFVAAAALDGFIKQTGEVPDGVEAVQGDPRISVRISTAQEEAIRELISTGDISILEIESGDA*
Ga0100214_100054Ga0100214_1000546F103434VVGFRPRRGRYLENGPHVVEVTLAVVKEGRTGRRFERGETFMIDKVLVQPSAGNALKATENRVIRGDLTDETTLKVFGTGRKWPGGPHSWVKIIKGPESLVGKTFQQAGEPLTYDASPMTRHWSVRCDTLGTEAK*
Ga0100214_100163Ga0100214_10016337F073671MTNNKQAEHELAELHEKERSLEKALELVREKIRELINYTNKNKSAR*
Ga0100214_100292Ga0100214_10029212F043991MGKKKPSAVDGFDLNGNIIEEANEFDGVLIEDWVNQRSPLKPSWVGRYSDNMHFDLKDGTEVSFYKRPDIVYGDILFAEGIRTILFKCRQKKNLTRFISRVLKLAEMGPSSVHPDLRA*
Ga0100214_100373Ga0100214_10037310F049707VSEYKSPHNDGHDPYILIWEYGNDIRRAEFSERWAEYDETGWTVWYFRLVDGGIMTFSSREWEQKDDVNHLTTVWMKPSLYDIERKEN*
Ga0100214_100655Ga0100214_10065510F095632MSFNEKTVYQVVSLTASTCASITAGAVVSALCPPAGAALTVIYSLGSGVLGSYVGEKAGQQYAESLADTIDSMKKSQTN*
Ga0100214_110079Ga0100214_1100791F041827MKHFLSALALGCLLLSCNRDLENNETHETPAPQKEKLVLINNLSGNTSVDFWYKNGNEIESARTDGMGKSNIDYEYDTYGRIVKERRFHRRYDRGETNITYQYDNQGRLVSSHAISTKFYPGTGLTPRCSVEKKHTYTYQGNKVIVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIEQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNLGIQRIEDLRYIDNIKTRDFHNGSYWEYRYRYDKENTYNGDYPNGVGIHARSHNDPTYDEYLYEISANRSYIKEE
Ga0100214_110420Ga0100214_1104203F047127MKKTFAFILLSIISLAKAQLTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERREAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEIMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDKPEIAKEVDQMVGDRDIREILEKYNSK*
Ga0100214_112714Ga0100214_1127142F040685MKKLLFKLFFTLAFASISLHGQEKIQQVEVHIFGGMALYSSYYTINFLYKEFEAKQVMGEPAELPKKILLLNPPDKWRVFTKKINLDRFKKLRNGPSEQAVDGQDEVIIIKTDKKTYRKMNASGNDHDREVWYDLLQIIAKEFGKKGIYE*
Ga0100214_114401Ga0100214_1144011F043990FGVIILGIVVYANHKIERSWIEGEFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDNSFTKLNKLPIKEDLPPNGTPEQFLNTTNGYYKYVVDENDDWNFGILIVDTARKEICIYNQILQFPDR*
Ga0100214_116497Ga0100214_1164973F061925MSWEYSINLDSEKAVSSVVTDLKICELFSSSTTDYIDWKNPKSIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHITDCDTDEEVTLEHIFRSVI*
Ga0100214_124665Ga0100214_1246651F077781LRPTFCSATGTPPPIAAPARPAPAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGALSVRTADSLYLGFPRPHPGTPGLGRFWPFLALQSLSETPSHARMPRVTVARTSPGTLEISPLRAAT*
Ga0100214_131280Ga0100214_1312801F089056SKFNPLIIKKMKVFCRIILPLLCIISCSKRKEADNTMVLEKNHTFFLWNNDSLGCKHERTIEMGEELYNTFKKSNKNDSILLKEYLGTPTRRFKDKEVIIFMYYINSCCDNGQLLEECDVSFISITFTNKNKILFGKGIQ*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.