NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097079


Overview

Basic Information
Family ID F097079
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 45 residues
Representative Sequence IGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG
Number of Associated Samples 99
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 93.27 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.58

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[Sequence logo of the family HMM, rendered with Skylign]

Most Common Taxonomy
Group Bacteria (99.038 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (28.846 % of family members)
Environment Ontology (ENVO) Unclassified (25.962 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (49.038 % of family members)
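
The percentages reported in the Overview can be turned back into member counts. The following is a minimal sketch, assuming that every "% of family members" figure is a fraction of the 104 sequences in the family; the dictionary labels and the function name `members_from_percentage` are illustrative and not part of NMPFamsDB.

```python
# Assumption: each "% of family members" value on this page is a fraction
# of the family's 104 sequences (Number of Sequences in Basic Information).
N_SEQUENCES = 104

# Percentages copied from the Overview; the labels are illustrative only.
reported_percentages = {
    "Taxonomy: Bacteria": 99.038,
    "GOLD: Forest Soil -> Soil": 28.846,
    "ENVO: Unclassified": 25.962,
    "EMPO: Soil (non-saline)": 49.038,
}


def members_from_percentage(pct: float, total: int = N_SEQUENCES) -> int:
    """Return the whole number of members closest to pct percent of total."""
    return round(pct / 100 * total)


counts = {
    label: members_from_percentage(pct)
    for label, pct in reported_percentages.items()
}

for label, n in counts.items():
    print(f"{label}: {n} of {N_SEQUENCES} sequences")
```

Under that assumption, the three ontologies describe overlapping subsets of the same 104 sequences, so their counts are not expected to add up to the family size.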




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment rendered with MSAViewer: 104 rows, with a column ruler running to 72; the row labels are the protein IDs listed under Family Sequences]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 41.43%, β-sheet: 0.00%, Coil/Unstructured: 58.57%
[Feature Viewer track plot: representative sequence IGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG (positions 1 to 42), with tracks for α-helices, β-strands, coil, and secondary-structure confidence score]
Powered by Feature Viewer

Predicted 3D Structure

Per-residue confidence (pLDDT) color bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.58
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
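
The low-confidence rule and the pLDDT legend above can be expressed as two small checks. This is a minimal sketch: the 0.7 cutoff and the four bins come from this page, while the function names and the handling of fractional pLDDT values that fall between bins (for example 50.5) are assumptions made for illustration.

```python
PTM_CUTOFF = 0.7  # cutoff quoted on this page for a low confidence model


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score is below the cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend bins.

    Assumption: a value between two bins (such as 50.5) is assigned to the
    upper bin; the page does not say how such values are handled.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


family_ptm = 0.58  # pTM-score reported for family F097079
print(is_low_confidence(family_ptm))
print(plddt_bin(83.2))
```

With the reported pTM-score of 0.58, the check agrees with the page's classification of this model as low confidence.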



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart of the NCBI taxonomy distribution, rendered with ApexCharts. Legend entries: All Organisms, Unclassified. Labeled slice: 99.0%]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart of associated habitat types, rendered with ApexCharts. Distinct legend labels: Peatland; Freshwater Wetlands; Freshwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Serpentine Soil; Bulk Soil; Grasslands Soil; Surface Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Hardwood Forest Soil; Tropical Peatland; Forest Soil; Corn Rhizosphere; Groundwater Sand; Biofilm; Termite Gut; Ectomycorrhiza; Switchgrass Rhizosphere; Arabidopsis Rhizosphere. Labeled slice percentages, in chart order and not matched to legend labels: 5.8%, 2.9%, 2.9%, 9.6%, 28.8%, 6.7%, 8.7%, 2.9%, 2.9%, 4.8%]



Associated Samples


Geographical Distribution
[Map of sample locations, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12648J13191_10120411 | 3300001084 | Forest Soil | PYYVPRVTDSAILARLQGEMEQELRRLYGVARAALRAPGRAAG*
JGI12627J18819_103718711 | 3300001867 | Forest Soil | CYVPRVTDAAELGRWQERMEKELKRLFGVARAALEGDSVN*
Ga0066678_101898112 | 3300005181 | Soil | FSRIAIAIGPPYYVPRVSDASSLARLQSQMEQELKRLYGVARAALHPG*
Ga0066671_103391952 | 3300005184 | Soil | IAIGPPCYVPRVTDPPSLARLQSQMEQELKRLYGVARAALCRD*
Ga0066676_104717562 | 3300005186 | Soil | PFSRIAIAIGPPCYVPRVSDASSLARLQSQMEQELKRLYGVARAALHPD*
Ga0070666_114693962 | 3300005335 | Switchgrass Rhizosphere | FVIPVPFSRVAIAIGPPRYVPRTTAAAGVESLQGEMEQELKRLYGVAQDALGKR*
Ga0070711_1002425511 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | IGPPCYVPRVTDAATLEQLQGKMEEELRRLFAVARQALQRGR*
Ga0066682_102776861 | 3300005450 | Soil | AIAIGPPCYVPRVTDPPSLARLQSQMEQELKRLYGVARAALCRD*
Ga0070679_1007441471 | 3300005530 | Corn Rhizosphere | VIPVPFSRVAIAIGPPRYVPRTTGAGIESLQGEMEQELERLYRVARDALAKR*
Ga0070734_104456441 | 3300005533 | Surface Soil | FSRVAIAIGPPRYVPRTSSAGGIEALQGEMEQELKRLYGVARDALVKP*
Ga0070731_105179482 | 3300005538 | Surface Soil | FLARIAIAIGPPRYVARVNGAAGLAQLQGEMERELHRVYGVARDALGRQR*
Ga0066693_103233111 | 3300005566 | Soil | ARVALAIGPPRYVPRVTHAAALEALQAQMEQELKRLFILAKGALNTH*
Ga0070761_101558941 | 3300005591 | Soil | FVIPMPFSRIAIAIGAPYYVPRVTDSAILARLQAEMEQELRRLYGVARAALRAPGRAAG*
Ga0066903_1002806181 | 3300005764 | Tropical Forest Soil | APCYVPRVTDATTLGKLQGKMEEELRRLFAVAREALQRRR*
Ga0070766_111302171 | 3300005921 | Soil | RVVIAIGPPRYIPRTTAAPGIEALQVEMERELQRLYGVARDALGQR*
Ga0079222_114559331 | 3300006755 | Agricultural Soil | IAIGPPRYVPRTTGAGIESLQGEMEQELKRLYGVARDALANS*
Ga0066659_102586731 | 3300006797 | Soil | VPRVTDPPSLARLQSQMEQELKRLYGVARAALCRD*
Ga0066660_108841902 | 3300006800 | Soil | VPFSRIAIAIGPPFYVPRVTDAALLERLAGEMEGELKRLYGVARAALRAR*
Ga0079220_108558541 | 3300006806 | Agricultural Soil | IPVPFLARIAIAVGPPHYVPRVMDAVGLARLEGEMERELHRLFGVARAALAQAR*
Ga0105237_103496302 | 3300009545 | Corn Rhizosphere | IPVPFSRVAIAIGPPRYVPRTTAAAGVESLQVAMEQELQRLYGVAKDALEKR*
Ga0105085_10439391 | 3300009820 | Groundwater Sand | PRYVPRVTDSASLERMQGELKSELKRLYGVARASLQGHAP*
Ga0123355_119771281 | 3300009826 | Termite Gut | IAIGPPFQVPRVTDAATLARLQNQMEAELHRLYGVARDALR*
Ga0126310_109361051 | 3300010044 | Serpentine Soil | IAIVIGAPRYVARVTDAAAIEKMQGEMELELKRLFEVARAAL*
Ga0126378_104297652 | 3300010361 | Tropical Forest Soil | PVPFARIAIAIGPPCYVPRVTDAASLERLQRQMEAELKRLYEVARAALGTRG*
Ga0126381_1009691312 | 3300010376 | Tropical Forest Soil | YVPRVMDATGLERLEGEMERELHRLYGTAREALGGRG*
Ga0150983_126957751 | 3300011120 | Forest Soil | IGPPCYVPRVTDAPSLTRLQSQMEQELKRLYGVARAALQPNPGGFC*
Ga0137364_102056461 | 3300012198 | Vadose Zone Soil | GPPCYVPRVTDPPSLARLQSQMEQELKRLYGVARGALCRD*
Ga0137395_109091762 | 3300012917 | Vadose Zone Soil | VPFSRIAIAIGPPCYVARTTDPATLEALQSQMERELKRLFAEARAALR*
Ga0137416_104506952 | 3300012927 | Vadose Zone Soil | VTDAPSLARLQSQMEQELKRLYAVARAALQPHGEGFC*
Ga0137407_105582042 | 3300012930 | Vadose Zone Soil | VPFARVALAIGPPRYVPRVTDAAALEALQAQMEQELKRLFTVAKGALNTD*
Ga0153915_115387771 | 3300012931 | Freshwater Wetlands | PMPFAKIAIAIGAPRYVPRVTDPAGLESLQAEMESELKRLYETARSALHGKRS*
Ga0134110_103193561 | 3300012975 | Grasslands Soil | VPFSRIAIAIGPPFYVPRVTDAALLERLAGEMEGELKRLYAVARAALRAR*
Ga0157369_103394451 | 3300013105 | Corn Rhizosphere | FSRVAIAIGPPRYVPRTTAVASVESLQGEMEQELKRLYGVARDALAKR*
Ga0132256_1003939832 | 3300015372 | Arabidopsis Rhizosphere | GPPCYVPRVTDAPTLERLQIQMEEELGRLFGVAREALREGG*
Ga0182036_104508441 | 3300016270 | Soil | VPRVTDAAMLERLQGQLELELRRLYEVARDALTRRD
Ga0182041_115050892 | 3300016294 | Soil | GPPCYVPRVTDAASLERLQGRLEVELKRLYEVAREALVRRT
Ga0182035_110895882 | 3300016341 | Soil | VAIAIGPPCYVPRVTDAASLARLQQQMEEELKRLYGVARDALEAP
Ga0182037_105136842 | 3300016404 | Soil | GPPCYVPRVTDAASLARLQQQMEEELKRLYGVARDALEAP
Ga0182039_113061871 | 3300016422 | Soil | IGPPCYVPRVSDAASLEKLQGKMEEELRRLFAVAREALQRRR
Ga0187824_100036853 | 3300017927 | Freshwater Sediment | PFLARIAIAIGPPRYVARVNGAAGLAQLQGEMERELHRVYGVARDALGRQR
Ga0187825_101308762 | 3300017930 | Freshwater Sediment | IGPPCYVPRVTDAPTLERLQRRMEEELKRVFGVAQEALRKVR
Ga0187785_102608222 | 3300017947 | Tropical Peatland | FVIPVPFSRIAIAVGPPRYVPRVSDRKALERLQAEMELELKRLYLEARAAL
Ga0187863_101725061 | 3300018034 | Peatland | PPRYIPRVTDAAVLVAVQAEMEAELKRLFGVARAALGSA
Ga0187766_101944351 | 3300018058 | Tropical Peatland | FSRIAIAIGPPCYVPRVTDAATLERLQGQMEQELKRLYGVARAALRGDSVK
Ga0187769_100961522 | 3300018086 | Tropical Peatland | VPFSRIAIAIGPPRYVPRVSDAAGLVRLQGQMEEELARLYAVARAALDGKR
Ga0179594_101278532 | 3300020170 | Vadose Zone Soil | AIAIGPPRYVPRVTDAPSLTRLQSQMEQELKRLYGVARAALQPDPGGFC
Ga0210401_103737571 | 3300020583 | Soil | KFVIPVPFLARIAIAIGPPRYVPRVTDAATLATLQAEMERELQRLYGVAREALR
Ga0210401_114522412 | 3300020583 | Soil | IPMPFSRIAIAIGPPCYVPRVTDGATLERLQGRMEEELRRLFEVAREALQARR
Ga0210406_103643742 | 3300021168 | Soil | VAIAIGPPRYVPRTSAAAGIEALQVEMEQELKRLYGVAREALGR
Ga0210408_100429371 | 3300021178 | Soil | GPPCYVPRVTDPPSLARLQSQMEQELKRLYGVARAALHPNPGRFC
Ga0210396_102736712 | 3300021180 | Soil | AWLVKWDKFVIPVPFLARIAIAIGTPRYVARVTDAAGLERLQEEMERELQRLYGVARDVLGERA
Ga0210388_107198282 | 3300021181 | Soil | AIGPPRYVPRVTDAATLATLQAQMERELQRLYGVAREALR
Ga0210385_110702321 | 3300021402 | Soil | PRSNDGAALEVLQGEMERELKRLYGVARAALAPRT
Ga0210387_113225502 | 3300021405 | Soil | PVPFSRVVIAIGPPRYIPRTTAAPGIEALQVEMERELQRLYGVARDALGQR
Ga0210384_103938551 | 3300021432 | Soil | RAWLVKWDKFVIPVPFLARIAIAIGTPRYVARVTDAAGLERLQEEMERELQRLYGVARDVLGERA
Ga0213878_102889011 | 3300021444 | Bulk Soil | PMPFARIAIAIGPPCYVPRVTDAASLERLQLELEQELGKLFGIAREALQAPR
Ga0187846_102699771 | 3300021476 | Biofilm | IGPPRYVSRVTDAAGLARLEEEMEHELHRLYGVARDALAERP
Ga0210398_113298122 | 3300021477 | Soil | PRYIPRATAAPGIEALQAEMEQELKRLYGVARDALVRR
Ga0126371_110433182 | 3300021560 | Tropical Forest Soil | IAIAIGPPCYVPRVTDAAALQRLQEQMEEELKRLYGVARAALDAPA
Ga0224712_105011262 | 3300022467 | Corn, Switchgrass And Miscanthus Rhizosphere | RITIAIGPPRYVPRAMDAATLERLQAEMEQELGRVYRLAQSRLRRFERPAEPLE
Ga0207671_102914731 | 3300025914 | Corn Rhizosphere | GPPRYVPRTTAAAGVESLQSEMEQELKRLYGVARDALGKR
Ga0209235_12799491 | 3300026296 | Grasslands Soil | VVPRVSDASSLARLQSQMEQELKRLYGVARAALHPG
Ga0209473_10881582 | 3300026330 | Soil | YVPRVTDPPSLARLQSQMEQELKRLYGVARAALCRD
Ga0209158_13422692 | 3300026333 | Soil | PRYVPRVLDAPSLARLQSHMERELKRLYGVARAALHAE
Ga0209156_103371391 | 3300026547 | Soil | CYVPRVTDPPSLARLQSQMEQELKRLYGVARAALCRD
Ga0209274_102156312 | 3300027853 | Soil | AIGPPVYVPRVIGAAGLEGLQRQMEQELKRLYGVARAAVDLGT
Ga0209169_103779861 | 3300027879 | Soil | IPVPFARIALAIGPPVYVPRVVDAAGLERLQRQMEQELKRLYAVARAALST
Ga0209168_101492061 | 3300027986 | Surface Soil | LARIAVAVGPPRYVPRVTDAATLQQLQSEMETELKRLYAQARGMLSGTDS
Ga0137415_112799842 | 3300028536 | Vadose Zone Soil | AIAIGPPRYVPRTSAAAGIEALQVEMEQELDRLYGVARDALGR
Ga0308309_106516991 | 3300028906 | Soil | VIAIGPPRYIPRTTAAPGIEALQVEMEQELQRLYGVARDALGQR
Ga0307509_102603431 | 3300031507 | Ectomycorrhiza | GPPVYVPRVMDAASLERKQVELAAELKRLYGVARASLDERAA
Ga0318571_104024101 | 3300031549 | Soil | YVPRVTDAATLKQLQAEMEQELRRLYQQARDALGRS
Ga0318573_104632362 | 3300031564 | Soil | AIGPACYVPRVTDAASLERLRRQMEEELRRLYGVARGALEMHR
Ga0318542_101536962 | 3300031668 | Soil | YVPRVTDAAGLARLQEQMEEELKRLFGVAREALRAAR
Ga0310686_1141455682 | 3300031708 | Soil | ARIAIAIGPPFYVPRVTDATTLARLQTQMEEELLRLYRVARAALGTV
Ga0307476_100627182 | 3300031715 | Hardwood Forest Soil | FVIPMPFARIAIAIGPPCYVPRVTDAASLARLQGQMEEELRRLFGVAHEALRAAR
Ga0307469_112389951 | 3300031720 | Hardwood Forest Soil | PRVTDAASLTRLQSQMEQELKRLYGVARAALHPNPGGFC
Ga0307469_123670442 | 3300031720 | Hardwood Forest Soil | RIAIAIGEPRYVPRVTDAAALERMQGEMEAELRRLFGVARAALES
Ga0307468_1012111931 | 3300031740 | Hardwood Forest Soil | AIAIGAPVYVPRVTDAASLERLQGQMEGTLKGLFGVARAALSGG
Ga0307468_1024238131 | 3300031740 | Hardwood Forest Soil | PPYYVPRVTDAASLERLQRHMEGELQRLYGVARAALSPAARVAG
Ga0306918_104078181 | 3300031744 | Soil | ARIAIAIGPPCYVPRVTDAASLERLQGQLEVELKRLYEVARQALVRRT
Ga0307477_108938822 | 3300031753 | Hardwood Forest Soil | FVIPMPFSRIAIAIGPACYVPRVTDAPSLARLQSQMEQELKRLYGVARAALHPNPGAFC
Ga0318498_104559652 | 3300031778 | Soil | VIPIPFSRIALAIGPPCYVPRVTDAAMLERLQGQMEQELRRLYGVARDALARRD
Ga0318552_101545792 | 3300031782 | Soil | VPRVTDAASLERLQGRLEVELKRLYEVAREALVRRT
Ga0318529_100658391 | 3300031792 | Soil | PMPFSRIALAIGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG
Ga0318503_102530011 | 3300031794 | Soil | PCYVPRVTDAAGLARLQEQMEEELKRLFGVAREALRAAR
Ga0318568_106900231 | 3300031819 | Soil | CYVPRVTDAASLERLQGRMEEELKRLYAVARAALEAHR
Ga0307478_103221191 | 3300031823 | Hardwood Forest Soil | PPRYVPRATGAAALTQLQGQMQEELKRLYGVARAAL
Ga0318564_104605692 | 3300031831 | Soil | FARIAIAIGAPCYVPRVTDAAAIERLQGQMEADLKRLYEVAREALERRG
Ga0318564_105179881 | 3300031831 | Soil | IGPACYVPRVTDAASLERLRRQMEEELRRLYGVARGALEMHR
Ga0318499_101294582 | 3300031832 | Soil | IGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG
Ga0318517_101229952 | 3300031835 | Soil | GPPCYVPRVTDAASLERLQGRMEEELKRLYAVARAALEAHR
Ga0310916_105513771 | 3300031942 | Soil | AIGPPCYVPRVTDAASLERLQGRLEVELKRLYEVAREALVRRT
Ga0310913_100514441 | 3300031945 | Soil | YVPRVTDAATLERLQRQMEEELRRLYGVAREALERHR
Ga0306922_123381091 | 3300032001 | Soil | GPPCYVPRVTDAAALAGLQSQLELELQRLFAVARAALGPR
Ga0318562_104399521 | 3300032008 | Soil | PCYVPRVSDAASLEKLQGKMEEELRRLFAVAREALQRRR
Ga0310911_108163532 | 3300032035 | Soil | VIPMPFSRIAIAIGPPCYVPRVSDAASLEKLQGKMEEELRRLFAVAREALQRRR
Ga0318558_102691271 | 3300032044 | Soil | FYVPRVTDTALLERLQDQMAEELRKLFGVAREALQKAR
Ga0318532_102862772 | 3300032051 | Soil | YVPRVTDAASLERLQGRLEVELKRLYEVAREALVRRT
Ga0318553_100074215 | 3300032068 | Soil | SRIALAIGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG
Ga0307471_1004340912 | 3300032180 | Hardwood Forest Soil | PVPFSSIAIAIGPACYVPRVTDAAALARLQLKMEGELKRLYGVAREALRAR
Ga0307471_1010468262 | 3300032180 | Hardwood Forest Soil | IPMPFSRIAIAIGPPCYVPRVTDAPTLTRLQSQMEQELKRLYGVARAALQPSPGAFC
Ga0318519_106752041 | 3300033290 | Soil | FARVAIAIGPPCYVPRVTDAASLARLQQQMEEELKRLYGVARDALEAP
Ga0314866_033435_1_138 | 3300033807 | Peatland | RIAIAIGPPCYVPRVTDGATLERLQGEMEQELKRLFGVARDALRG
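
Records from this table can be summarized once the four fields (protein ID, sample taxon ID, habitat, sequence) have been separated. The following is a minimal sketch over four rows copied from the table; the tuple layout and the helper name `residues` are illustrative, and treating a trailing `*` as a stop marker to strip before measuring length is an assumption, since the page does not define it.

```python
from collections import Counter
from statistics import mean

# Four rows copied from the Family Sequences table, already split into
# (protein ID, sample taxon ID, habitat, sequence). Layout is illustrative.
rows = [
    ("JGI12648J13191_10120411", "3300001084", "Forest Soil",
     "PYYVPRVTDSAILARLQGEMEQELRRLYGVARAALRAPGRAAG*"),
    ("Ga0105085_10439391", "3300009820", "Groundwater Sand",
     "PRYVPRVTDSASLERMQGELKSELKRLYGVARASLQGHAP*"),
    ("Ga0318499_101294582", "3300031832", "Soil",
     "IGPPCYVPRVTDAALLERLQGQMEQELRRLYGVARDALVRRG"),
    ("Ga0314866_033435_1_138", "3300033807", "Peatland",
     "RIAIAIGPPCYVPRVTDGATLERLQGEMEQELKRLFGVARDALRG"),
]


def residues(sequence: str) -> str:
    """Drop a trailing '*' (assumed to be a stop marker, not a residue)."""
    return sequence.rstrip("*")


# Sequence lengths in residues, one per row.
lengths = [len(residues(seq)) for _, _, _, seq in rows]

# Number of rows per habitat label.
habitat_counts = Counter(habitat for _, _, habitat, _ in rows)

print("mean length:", mean(lengths))
print("habitats:", dict(habitat_counts))
```

Run over all 104 rows, the same two summaries could be compared with the average sequence length and the habitat percentages reported elsewhere on this page.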




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice