NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F105748



Overview

Basic Information
Family ID F105748
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 41 residues
Representative Sequence GVVYLPNADKYRRLTRRPSQEQKGLDERALMASQRPSPPW
Number of Associated Samples 83
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 2.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(17.000 % of family members)
Environment Ontology (ENVO) Unclassified
(26.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.000 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 42.65%, β-sheet: 0.00%, Coil/Unstructured: 57.35%

Feature Viewer
Sequence: GVVYLPNADKYRRLTRRPSQEQKGLDERALMASQRPSPPW
Position axis ticks: 5, 10, 15, 20, 25, 30, 35, 40
Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, Disordered Regions
Powered by Feature Viewer

Predicted 3D Structure


Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
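
The legend bands and the pTM cutoff quoted above can be applied programmatically. The following is a minimal Python sketch, not part of NMPFamsDB: the function names `plddt_band` and `is_low_confidence_model` are hypothetical, and assigning fractional values such as 50.5 to the next band up is an assumption, since the legend only lists integer ranges.

```python
def plddt_band(score: float) -> str:
    """Return the legend band (as printed on the page) for a pLDDT value.

    Assumption: each band's upper bound is inclusive, so a fractional
    value just above a bound (e.g. 50.5) falls into the next band.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


def is_low_confidence_model(ptm: float, cutoff: float = 0.7) -> bool:
    """Apply the page's stated rule: a model is low confidence if pTM < 0.7."""
    return ptm < cutoff


if __name__ == "__main__":
    # pTM-score reported for family F105748 on this page.
    print(is_low_confidence_model(0.32))
    print(plddt_band(42.0))
```

For this family, `is_low_confidence_model(0.32)` returns `True`, which agrees with the Low Quality Model note.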



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms
Unclassified: 98.0%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Serpentine Soil; Grasslands Soil; Switchgrass Rhizosphere; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Groundwater Sand; Corn Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere
Slice percentages shown in the chart (labels not attached in the source): 3.0%, 17.0%, 5.0%, 17.0%, 3.0%, 9.0%, 7.0%, 6.0%, 16.0%



Associated Samples


Geographical Distribution

Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
JGI11615J12901_105885021	3300000953	Soil	VVSLQRAGKYQRLAWRPSQEQKGLDERALMAALQPSPPS*
JGI1027J12803_1094462983	3300000955	Soil	VDGTSLRVVYLYAAAKYHRRTRRLSQEQKGRDERALMAS*
JGI25384J37096_101588422	3300002561	Grasslands Soil	RAVYLSPAAKYHRLTRRLSQGQKGLKERALMASQRPSHPW*
Ga0066397_100496282	3300004281	Tropical Forest Soil	AVYLPTADKYRRLTRRLSQGQKGLEERALMASQQPSEPW*
Ga0066397_100860652	3300004281	Tropical Forest Soil	PVMYLAAAGKYLRLTHRPSQGQKDLDERGIIASQQLSDPW*
Ga0066395_100332745	3300004633	Tropical Forest Soil	WVVYLRPADKYPRFTRRLSQEQKGLEERALMASQRPSDPW*
Ga0066395_101462562	3300004633	Tropical Forest Soil	VVYLPNADKYRGLTRRPPQGHKGLDERALMASQRPLHPW*
Ga0066809_100131872	3300005168	Soil	MSHWVVYLPNADKYRRLTRRPSQGQKGLDERALMASQRPS
Ga0066683_107419852	3300005172	Soil	LQQAVYLAAAAKYRRLTHRLSQEQKGLDERALMVSQRPSHLW*
Ga0066685_110966171	3300005180	Soil	VVYLSPADKYHRLTRRLPQGQKGLKERAIMASQRPSPPW*
Ga0066675_108278852	3300005187	Soil	MVQKPVDNLTLSAAEVVYLLRADKSPRLTCRPSQGQKGLKERALMASQRPSEPW*
Ga0065705_101547741	3300005294	Switchgrass Rhizosphere	VVYLSPAGKYKKPAWRLSQEQKGLDERPLKASSRPSEPW*
Ga0065705_103985704	3300005294	Switchgrass Rhizosphere	VYLAAAAKYRRLTPRLSQEQKGLDERARTASQWPLPPVDV
Ga0070668_1017634831	3300005347	Switchgrass Rhizosphere	AVVYLSPADKYHRLTRRLSQGQKSLDERALLASQWPSEPW*
Ga0070667_1007693092	3300005367	Switchgrass Rhizosphere	VVYLLRADKSPRLTRRPSQEQKGLGERARMASQRSSEAW*
Ga0070700_1000544174	3300005441	Corn, Switchgrass And Miscanthus Rhizosphere	VVYLLRADKSPRLTRRPSQEQKGLDERARMASQRSSEAW*
Ga0066682_106517421	3300005450	Soil	VVYLPNADKYRGLTRRPSQGHKGLDERTLMASQRPFHPW*
Ga0070706_1016540101	3300005467	Corn, Switchgrass And Miscanthus Rhizosphere	LVGGIAVVYLEPADKYPRLIRRLPQGQKGLDERALMASRWPSHPW*
Ga0070706_1017823441	3300005467	Corn, Switchgrass And Miscanthus Rhizosphere	MIFQLIEYWVVYLGPADKYHRFTRRPSQEHKGLDERAIMASQWPS
Ga0070697_1018291881	3300005536	Corn, Switchgrass And Miscanthus Rhizosphere	VYLPTADKYRRLTRRLSKGQKGLEERARMASQRPSPRMVKKL
Ga0070686_1000859831	3300005544	Switchgrass Rhizosphere	MEWVVYLLRADKSPRLTRRPSQEQKGLDERARMASQRSSEAW*
Ga0070695_1011696561	3300005545	Corn, Switchgrass And Miscanthus Rhizosphere	VVYLPNADKYRRLTRRPSQGQKGLDERTLMASQRPFHPW*
Ga0066701_108201701	3300005552	Soil	VVYLSPADKYHRLIRRLSQGQKGLEERDLMASQRPFEPW*
Ga0066701_108234631	3300005552	Soil	MIQYKVVYLSPADKYRRLTRRLSQDQKGLEERALMAS
Ga0066695_107614211	3300005553	Soil	TVRLGASASPERVVYLPNADKYRGLTHRLLQEQKGLDERALMASQRPSAPW*
Ga0066695_107720211	3300005553	Soil	YLSPADKYHRLTRRLSQAQEGLEERALRTSQRPSEPW*
Ga0066707_103367562	3300005556	Soil	AVYLAAADKYRRHTHRLSQEHKGLDERALMVSQRRSHLW*
Ga0066707_109485981	3300005556	Soil	MQEVVYLSPADKSHRLTRRLSQEQKGLEERALMASQRPSEP
Ga0066698_101231392	3300005558	Soil	EAVYLAAAAKYRRLTHRLSQEQKGLDERALMVSQRPSHLW*
Ga0066698_101491503	3300005558	Soil	VVYLPNADKYRGLTHRLLQEQKGLDERALMASQRPSAPW*
Ga0066694_104380651	3300005574	Soil	RAVYLAAAAKYRRLTHRLSQEQKGLDERALMVSQRPSHLW*
Ga0068852_1023180192	3300005616	Corn Rhizosphere	VVYLPNADKYRRLTRRPSQEQKGLDERARMASQRSSEAW*
Ga0066905_1020088592	3300005713	Tropical Forest Soil	AAVYLSPADKSHRLTRRLSQEQKGLEERALMASQRPSAPW*
Ga0066903_1011370771	3300005764	Tropical Forest Soil	MYLWAADKSPRLTCQLSQEQKGLDERARMASQRPS
Ga0066903_1012143893	3300005764	Tropical Forest Soil	VYLPTADKYRRLTRRLSQGQKGLEEWALMASQQPSEPW
Ga0066903_1029327981	3300005764	Tropical Forest Soil	AVVYLSPADKYHRLTHRLSQEQKGLEERALMASQRPFHPW*
Ga0070716_1005745372	3300006173	Corn, Switchgrass And Miscanthus Rhizosphere	AVYLHPADKYRRLTYRLSKGQKGLGERALMVSQRLSDS*
Ga0066665_101121974	3300006796	Soil	VVYLSPADKYRRLTHRSSQGQKGLAERALMASQWPSHLW*
Ga0075428_1003860211	3300006844	Populus Rhizosphere	MTHRRVVYLSPADKYHRLTHRPSQGQKGLDERAIITSPWSSH
Ga0075421_1008818873	3300006845	Populus Rhizosphere	AVYLHPADKYYRLTRRPSQGQKGLDERAIMASQWPSHPW*
Ga0075421_1010985972	3300006845	Populus Rhizosphere	WVVYLSPADKYRMLTRRLSKDQKGLEERALMASQRPFHPW*
Ga0075430_1008509812	3300006846	Populus Rhizosphere	VYLHPAAKYRRLTPRLSQEQKGLDERTRMASQWPSPPGDAFDAGHVT
Ga0075430_1014277662	3300006846	Populus Rhizosphere	RHVAVYLHTADKYRRFTRRPPQGQKDLDERARMVS*
Ga0075433_108460752	3300006852	Populus Rhizosphere	VVYLSPADKYHRLTHRLSQEQKGLEERALMASQRSFHPW*
Ga0075433_115663182	3300006852	Populus Rhizosphere	RVVYLSPADKYRMLTRRLSKDQKGLEERALMASQRPFHPW*
Ga0075420_1019224261	3300006853	Populus Rhizosphere	GVVYLPNADKYRRLTRRPSQEQKGLDERALMASQRPSPPW*
Ga0075425_1012692873	3300006854	Populus Rhizosphere	VVYLPNADKYRRLTRRPSQGQKGIDERTLMTSQRPFHPW*
Ga0075434_1005248184	3300006871	Populus Rhizosphere	MEWAVYLHTADKYRRSTRRPPQGQKGLDERARMVSSR
Ga0075429_1007014401	3300006880	Populus Rhizosphere	WAVYLHPADKYRRLTYRLSKGQKGLGERALMVSQRLSDS*
Ga0075419_109868862	3300006969	Populus Rhizosphere	MSKLTVVYLPTADKYRGLTRRPLQEQKGLDERALMASQRPSAPW*
Ga0075435_1019621171	3300007076	Populus Rhizosphere	RKTPRTRGKWVVYLRPAGKYQRPAWRLSQEQKSLDERPLKASQRPSEPW*
Ga0066710_1004911842	3300009012	Grasslands Soil	AVYLAAAAKYRRLTHRLSQEQKGLDERALMVSQRPSHLW
Ga0099829_104782871	3300009038	Vadose Zone Soil	MAADNWVVYLHPADNYPSLIRRPSQGQKDLYEKALMASQRPSPP
Ga0075418_109555662	3300009100	Populus Rhizosphere	VVYLSPADKYRMLTRRLSKDQKGLEERALMASQRPFHPW*
Ga0114129_102700943	3300009147	Populus Rhizosphere	MSKLTVVYLPTADKYRGLTRRPPQEQKGLDERTRMASQRPFPAS*
Ga0105069_10348691	3300009800	Groundwater Sand	MKEVVYLATADKYPRLIRRPSQGQKSREERALMASQRPSDPG*
Ga0105062_10506141	3300009817	Groundwater Sand	VVYLRSADKYHSLTRRLSQGQKGLHEMALLASQRPSHPW*
Ga0105064_10263134	3300009821	Groundwater Sand	ALVVYLPTADKYRRLTRRPLQGQKGLDEKTLLAAQRPLHPW*
Ga0126312_114771961	3300010041	Serpentine Soil	VYLAAADKYRRLTHRLSQEPKGIDERAFMAFQRASHSW*
Ga0126384_115678932	3300010046	Tropical Forest Soil	WAVYLPTADKYRRLTRRLSQGQKGLEEWALMASQQPSEPW*
Ga0134071_103595502	3300010336	Grasslands Soil	DAMPLRVVYLHTADKYHGFTHRPSPGQKGMDEKAIMASQWPSEPW*
Ga0126370_119730791	3300010358	Tropical Forest Soil	MVLEARCRVVYLSPADNSLRLTRQLSQGLKVLDEWALMTSQQP
Ga0126376_109927601	3300010359	Tropical Forest Soil	AVYLPTADKYRRLTRRLSQGQKGLEERALMASQQPAEPW*
Ga0126379_110961132	3300010366	Tropical Forest Soil	VYLHHAAKYRRLTPRLSQEQKGLDERTRMASQWPSPPGDAFDA
Ga0134123_132009101	3300010403	Terrestrial Soil	AKRPVVYLHSADKYHRLTRRPLQGQKGLDERPLLASQ*
Ga0137391_102514471	3300011270	Vadose Zone Soil	MVSLSPADTYHMRTRRPSQEQKGLDERAIMTSQWPSHPW
Ga0137388_116960711	3300012189	Vadose Zone Soil	VVSLGFHRPVVYLHTADKYHRLTRRPSQDQKGLEERPIMVS
Ga0137363_107764551	3300012202	Vadose Zone Soil	RSRVALLGVVYLHHADKYHRLTRRLSQGQKSLDERAIMASQRPSHPW*
Ga0137380_102609081	3300012206	Vadose Zone Soil	VVYLAAADKYRRLTQRLSQEQKGLDERVLMVSQWPSHLW*
Ga0137380_112496891	3300012206	Vadose Zone Soil	HLAVYLHPADKYYRLTRRPSQGQKGLDERAIMASQWPSHPW*
Ga0137372_100742343	3300012350	Vadose Zone Soil	VVYLAPADKYRRLTQRLSQEQKGLDERVLMVSQWPSHLW*
Ga0137367_100538105	3300012353	Vadose Zone Soil	VVYLHHTDKYPRLTHRLSQEQKGLDKRALMASQRPSEP*
Ga0137369_103651253	3300012355	Vadose Zone Soil	VVYLPNADKYRGLTHRLSQGQKGLDERTLMASQRPFHPW*
Ga0137360_102607042	3300012361	Vadose Zone Soil	MITVELVVYLSPADKYHRFTRRLSKGQKVLQERALLASQRPSEPW*
Ga0137395_105307501	3300012917	Vadose Zone Soil	KFEAESVVYLHPADKYPRLTHRPSQEQKGLYERALMAS*
Ga0137395_110803313	3300012917	Vadose Zone Soil	YLLNADKYRGLTRRPLQEQKGLEERTLMASQRPFHPW*
Ga0137394_113813821	3300012922	Vadose Zone Soil	VYLPTADTYRGLTRRPSQEQKGLDKRTLMASQQPFHP
Ga0126369_119225742	3300012971	Tropical Forest Soil	STAIHMRWAVYLSPAEKYHRLTRRPSQEQKGLDERTLIASQRPFHPW*
Ga0157377_103975971	3300014745	Miscanthus Rhizosphere	DIVRRVVYLLRADKSPRLTRRPSQEQKGLDERARMASQRSSEAW*
Ga0137403_109403661	3300015264	Vadose Zone Soil	VYLLLADKSHRNTHRLSQGQKGLDERALMASQRPSEPW*
Ga0182032_100814011	3300016357	Soil	TWYWVVYLHHADKSHRLTPRLSQEQKGLDERTLMASQRPSEPW
Ga0163161_110369441	3300017792	Switchgrass Rhizosphere	NSGALKAAVYLHTADKYRRFTRRPPQEQKGLDERARMVS
Ga0190269_115907062	3300018465	Soil	VSASIDELVVSLEPADNSPRLTRRLSQGLKVLDELALMTSQQPSEPW
Ga0179596_102067811	3300021086	Vadose Zone Soil	APSVVYLAAADKDRGLTRRPLQEQKGLDKRTLIASQQPFHPW
Ga0207684_101924522	3300025910	Corn, Switchgrass And Miscanthus Rhizosphere	SLHLVVYLPNADKYRGLTRRPSEGQKGLDERALMASQRPFHPW
Ga0207689_107651801	3300025942	Miscanthus Rhizosphere	VVYLLRADKSPRLTRRPSQEQKGLGERARMASQRSSEAW
Ga0257170_10196582	3300026351	Soil	STVVYLPNADKYRGLTRRPSKGQKGLDERALMASQRPFHPW
Ga0209157_12479452	3300026537	Soil	GKSVVYLSPADNSPRLTRRLSQGLKVLDEWALMTSQQLSEPW
Ga0209879_10830822	3300027056	Groundwater Sand	HRAGSKEVVYLPTADKYPRLTRRPSQGHKGIDERVLMAS
Ga0209843_10271861	3300027511	Groundwater Sand	VVYLAAADKYRRLTHRLSQGQKGLDERALMVSQRLSDP
Ga0209843_10359581	3300027511	Groundwater Sand	MVKEVVYLHPADKSPRLTRRLSQGQKGLEERALMASQRPSH
Ga0209799_10394751	3300027654	Tropical Forest Soil	NRTAVYLPTADKYRRLTRRLSQGQKGLEERALMASQQPSEPW
Ga0209382_101098218	3300027909	Populus Rhizosphere	MCKESLCSAVYLSPADKYHRLTRRLSQGQKGLDERALMAFQRPSEPW
Ga0137415_103089561	3300028536	Vadose Zone Soil	GLGAVYLAAAAKYRRLTHRLSQGHKGLDERTLMASQQPFHPW
Ga0137415_106175822	3300028536	Vadose Zone Soil	LPAAEPVVYLLAADKYRRLTRRLSQEQKGLDERALMASQRPSHPW
Ga0307305_105413141	3300028807	Soil	VVYLPNADKYRGLTHRPLQEQKGLDERALMASQRPSAP
Ga0307278_101218612	3300028878	Soil	MSELHDFNQRVVYLSPADKYHRFTRRLSKGQKVLQERALLASQRPSEPW
Ga0307473_109234531	3300031820	Hardwood Forest Soil	HWVVYLPNADKYRGLTRRPLQEHKGLDERALMASQRPSAPW
Ga0306926_102255513	3300031954	Soil	MEFQGAERWVVYLHPADKYFRLTHRPSQEQKGLDERALMASQRPSPPW
Ga0306924_113587152	3300032076	Soil	PQQVVYLSPADKYHRLTRRLSQEQKGLEERALMASQRPSPPW
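
Sequences from the table above can be summarized with a few lines of Python. This is a minimal sketch under stated assumptions: a trailing `*` is treated as a stop-codon marker (the usual convention for translated protein sequences; the page itself does not state this), and the names `SeqSummary`, `summarize`, and `mean_length` are hypothetical helpers, not part of any NMPFamsDB API.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class SeqSummary:
    residues: str   # sequence with any trailing '*' removed
    length: int     # number of residues, excluding '*'
    has_stop: bool  # True if the raw sequence ended with '*'


def summarize(seq: str) -> SeqSummary:
    """Split a raw table sequence into residues and a stop flag.

    Assumption: a trailing '*' marks a stop codon and is not a residue.
    """
    raw = seq.strip()
    has_stop = raw.endswith("*")
    residues = raw.rstrip("*")
    return SeqSummary(residues=residues, length=len(residues), has_stop=has_stop)


def mean_length(seqs: Iterable[str]) -> float:
    """Average residue count over the given sequences, ignoring '*'."""
    lengths = [summarize(s).length for s in seqs]
    if not lengths:
        raise ValueError("no sequences given")
    return sum(lengths) / len(lengths)


if __name__ == "__main__":
    # Two rows copied from the table: Ga0075420_1019224261 and Ga0066809_100131872.
    examples = [
        "GVVYLPNADKYRRLTRRPSQEQKGLDERALMASQRPSPPW*",
        "MSHWVVYLPNADKYRRLTRRPSQGQKGLDERALMASQRPS",
    ]
    for s in examples:
        print(summarize(s))
    print(mean_length(examples))
```

Both example rows contain 40 residues once the `*` is dropped; only the first carries the stop marker.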




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice