NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102740

Metagenome Family F102740

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102740
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 39 residues
Representative Sequence MSALGQKQTYALQKAMSALPPIATAKADIRKRSCPLYP
Number of Associated Samples 72
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.67 %
% of genes near scaffold ends (potentially truncated) 2.97 %
% of genes from short scaffolds (< 2000 bps) 2.97 %
Associated GOLD sequencing projects 68
AlphaFold2 3D model prediction Yes
3D model pTM-score0.23

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.020 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(20.792 % of family members)
Environment Ontology (ENVO) Unclassified
(40.594 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.455 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Fibrous Signal Peptide: Yes Secondary Structure distribution: α-helix: 39.39%    β-sheet: 0.00%    Coil/Unstructured: 60.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MSALGQKQTYALQKAMSALPPIATAKADIRKRSCPLYPSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.23
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
98.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Tropical Forest Soil
Soil
Soil
Soil
Forest Soil
Soil
Soil
Tropical Peatland
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Rhizosphere Soil
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
3.0%20.8%4.0%5.9%18.8%5.0%4.0%7.9%3.0%3.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
ARSoilOldRDRAFT_00015213300000044Arabidopsis RhizosphereMKTTSALGQKQTYALQKAMSALPPIATAKADSRKRS
ICChiseqgaiiFebDRAFT_1443853213300000363SoilMSALGHEQTFALHNRMSAFPIATAKADFGNLSCLLS
AF_2010_repII_A100DRAFT_102189713300000655Forest SoilFYGSSADRAMSALGHKQTYAAQNGMSALAPLATAKAEFRKRSCPLYP*
JGI11643J12802_1075249423300000890SoilMLFRKMSTLGHKPAFRSAIVMSALPPIATAKADSRKRSC
Ga0062593_10048963333300004114SoilMSALGHKRTFAPQHVTSALPPIATAKADFGKPSCLVYPQEQTCAV*
Ga0062595_10014332123300004479SoilMILWRPMSALGHKQTYALQKAMSAITPIATAKADIGRLSCLLYP
Ga0070676_1019313413300005328Miscanthus RhizosphereMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCPLCLRKRTC
Ga0066388_10148467313300005332Tropical Forest SoilMSALGQKRTYAVQNGMSALPPKATAKADIRKRSCLLYTRKRTC
Ga0066388_10218241333300005332Tropical Forest SoilANSGILRRGMMSALGQKQTCALQNVMSAVPPTATAKADIGNPSCLLYPQ*
Ga0066388_10308377823300005332Tropical Forest SoilMSALGQKQTYAAHKLMSALPPIATKKADIGNLSCLLYPQ*
Ga0068859_10194114623300005617Switchgrass RhizosphereMSALGHKQTFALRNAMSALPPKATAKADIRKRSCPL
Ga0066905_10021355423300005713Tropical Forest SoilMSALGQKQTFAAQKVMSALPPKATAKADFGNPSCLLYPQ*
Ga0066905_10021682633300005713Tropical Forest SoilMSALGQKQTYASQQAMSALPPIATAKADIGNPSCLLYPR
Ga0066905_10029038523300005713Tropical Forest SoilMSALGQKQTCALQNVMSALLPIATTKADIGKPSCLLYP
Ga0066905_10031300923300005713Tropical Forest SoilMSALGHKQTYALQKAMSALPPIATAKADFGKPPCLLYP*
Ga0066905_10087673223300005713Tropical Forest SoilMSALGQKQTHAVQQRMSALPPIATAKADMSLASCPLFP
Ga0066905_10120298913300005713Tropical Forest SoilGQKQTHAVQQRMSASPPIATVKADIRKRSCLLYPQKQTCAVH*
Ga0066905_10144324013300005713Tropical Forest SoilSALGHKQTYAVQQPMSALPPIATAKADSCKQSCLLYP*
Ga0066903_10035619323300005764Tropical Forest SoilMSALGQKQTYAAHKLMSALPPIATAKADFPQKSCLLYP
Ga0066903_10089203533300005764Tropical Forest SoilMSALGHKQTYAAHKLMSALPPIATAKADPRKVMSALP
Ga0066903_10151651523300005764Tropical Forest SoilVMSALGKKRAYAAHKVMSALPMIATAKADSRNRSCLLY
Ga0066903_10199144813300005764Tropical Forest SoilMSALGHKQTYAVQQTMSASPPIATSKADIRKRSCPLYPESGHVRRK*
Ga0066903_10266499223300005764Tropical Forest SoilMSALGQKQTYALQKAMSALPPIATMKADIGKPSGLLYP
Ga0066903_10391244723300005764Tropical Forest SoilMSALGQKQTYALQKAMSALPPIATAKADMTVCGCLLYP
Ga0066903_10409935823300005764Tropical Forest SoilMSALGQKQTYALQKAMSALPPIATAKADIRKRSCPLYP
Ga0066903_10572307013300005764Tropical Forest SoilMSALGQKQTYAMQKRMSALAPIATAKADVAQKSCLLYPQ
Ga0066903_10595084213300005764Tropical Forest SoilLGQKQTYAAHKLMSALPPIATAKADFRKRSCPLYP
Ga0068858_10205000913300005842Switchgrass RhizosphereMSALGHKQTYAVQKAMSASPLIATAKADLRKKRTCAAQQVMSALGQ
Ga0075417_1019737423300006049Populus RhizosphereMSALGQKQTFAPQKAMSALPPIATAKADSRKRSCPLYP
Ga0075433_1167108713300006852Populus RhizosphereMSALGHKRTYAVQQAMSALHPIATAKADFRKTSCLLYPGLRSDLP
Ga0075429_10004438513300006880Populus RhizosphereMSALGHKRTYAVQKGMSALHPIATAKADFRTSPCPLYP
Ga0075426_1101577513300006903Populus RhizosphereMSALGHKPAYAVHNVMSAWPPIATAKADIRKGHVCF
Ga0075424_10164890013300006904Populus RhizosphereMSALGQKQTFAPQKVMSALPAKVDIRQRSDLMSFRSSIVS
Ga0105245_1051707223300009098Miscanthus RhizosphereMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCPLCLR
Ga0111538_1021959043300009156Populus RhizosphereMKTTSALGQKQTYALQKAMSALPPIATAKADSRKRSVCF
Ga0075423_1123125723300009162Populus RhizosphereMSALGHKQTFALQKAMSALPPKATAKADIRKRSCPLHP
Ga0105242_1243800713300009176Miscanthus RhizosphereMSALGQKLTFAPQQGMSALPPIATAKADIRKTPCLLCP
Ga0105248_1093173813300009177Switchgrass RhizosphereMSALGHKRTYAPQQAMSAIPPISTAKADIRKSPCPL
Ga0126380_1043654113300010043Tropical Forest SoilMSALGQKQTFEVQKGMSALPRIAIAKADSRNGHVRF
Ga0126380_1207720123300010043Tropical Forest SoilMSALGQKQTYAVQNAMSALPPIATAKADIGKPSCLLYP
Ga0126384_1171855523300010046Tropical Forest SoilMSALGQKQTCARQKAMSALPPIAIAKADFLKRSCPLYPQER
Ga0126382_1090706823300010047Tropical Forest SoilMSALGQKQTYAVQQGMSALPPIATGKADISKRSCLLYA
Ga0126382_1135007823300010047Tropical Forest SoilMSALGQKQTFAVQNGMSALPPIATAKVDFPQTICLTRKAD
Ga0126370_1024812213300010358Tropical Forest SoilMMSALGQKQTYALQKAMSALPPIATAKADIRKRSCLLYPQKR
Ga0126370_1250094723300010358Tropical Forest SoilMSALGQKRTFAVQDGMSALIPIATMKADFRKRSCLLYP
Ga0126376_1021207613300010359Tropical Forest SoilMSALGQKQTCARQKAMSALPPIATAKADFRKSSSLLYPRKRTFTV
Ga0126372_1108056823300010360Tropical Forest SoilLMIVMSALGQKQTCALQNVMSALFPIATAKADFGNPSCLL*
Ga0126372_1125542523300010360Tropical Forest SoilMSALGQKQTYAVQNVMSALPLIATAKANSRKRSSPLH
Ga0126378_1061731813300010361Tropical Forest SoilMSALGQKQTYAVQNGMSALRPKATAKADIRKSLCPLHPQKLT
Ga0126378_1338418013300010361Tropical Forest SoilMSALGQKQTYAPQKGTSALPPIATAKADFGKPSCLLYPQ
Ga0126377_1306849013300010362Tropical Forest SoilVDLLLSRMSALGQKQTYAPQNVMSALPPIATEKADFRKRSC
Ga0126379_1016107333300010366Tropical Forest SoilMSALGHKRTHAAQNGMSALPPIATAKADSRKRSCLLYPH
Ga0126379_1045196813300010366Tropical Forest SoilMSALGQNQTYAVQKAMSALPPIATTKAEIGKTSCPLYP
Ga0126381_10077578413300010376Tropical Forest SoilMSALGQKQTCALQNVMSALPPIATAKADFRKRSCPLYP
Ga0157318_103494613300012482Arabidopsis RhizosphereMSALGQKQTFAAQKAMSALPPIATAKADIRKRSCLLYPQERT
Ga0126375_1034344723300012948Tropical Forest SoilMSALGQKRTYAVHQLMSALPPIATAKADFRKSSCLLYP
Ga0126369_1264509223300012971Tropical Forest SoilQKQTYAAHKLMSALPPIATKKADIGNLSCLLYPQ*
Ga0164305_1173885413300012989SoilMIRSRMSALGHKRTYAVQKSMSALPPIATAKPDISPGKCPLYPRKQT
Ga0157380_1015109833300014326Switchgrass RhizosphereMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCPLCLRKRT
Ga0173483_1087347923300015077SoilMSALGHKQTFAVQKGMSALPPIATAKADFGKPSCLLYPQ
Ga0132258_1122773313300015371Arabidopsis RhizosphereMSALGHKRTYALQKAMSALPPIATEKADIRKRSYLL
Ga0132256_10188039113300015372Arabidopsis RhizosphereMSALGHKPTFALQKAMFALAQTATAKADFRAMSALL
Ga0132257_10181139323300015373Arabidopsis RhizosphereMSALGQKQTYAVQNVMSALHPIATEKADFRKRSCPLNPQERT
Ga0132257_10190286833300015373Arabidopsis RhizosphereMSALGQERTYAPQQVMSALPPIGTAKADVGNPSCLLYLRKRTCAAQ
Ga0132257_10364554613300015373Arabidopsis RhizosphereMSALGQKQICAAHKLMSALPPIATAKADSRKGSCPLKADMC
Ga0132255_10024043133300015374Arabidopsis RhizosphereMSAMGHKRTYAVQKGMSALLPIATAKADFGKPSCLLTPE
Ga0182036_1051155613300016270SoilMSALGQKQTYAVQKAMSALPPIATAKADSRKRSCLLYPQKRT
Ga0182033_1214733613300016319SoilMSALGQKQTCALQNVMSALPPIATAKADSRKGVCLLYP
Ga0182035_1159740423300016341SoilMSALGQKRTYAVHNGMSALLPKATAKADSRKGACLLY
Ga0182034_1051976423300016371SoilSALGQKQTYAVHNGMSALPPIATAKADSRKGACLLYPESGHVQCN
Ga0187779_1099815823300017959Tropical PeatlandMPALGQKRTFAMQDVMSALPHIATAKADFRKRPCLLYPR
Ga0173482_1001204153300019361SoilVRLSHKQTYAMQKGMSALPLIATAKANSRKGSCLLYP
Ga0126371_1046265023300021560Tropical Forest SoilMPTMSALGQKQTYAAHKLMSALPPIATPKADSRKRPCPLYPRK
Ga0126371_1064906613300021560Tropical Forest SoilMSALGHKQTCAARNGMSALPPKATAIANFRKNHVRF
Ga0126371_1197335523300021560Tropical Forest SoilMSALGHKQTYALQNAMSALSPIATAKADLRTTSCLLYTRKQT
Ga0207645_1006870013300025907Miscanthus RhizosphereMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCPL
Ga0207671_1072447823300025914Corn RhizosphereMSALGQKQTYALQKAMSALPPIATAKADFGNPSCLLYP
Ga0207671_1080415513300025914Corn RhizosphereMSALGQKPTYELQQAMSALPPIATAKADMCLRSCLLYP
Ga0207693_1062333313300025915Corn, Switchgrass And Miscanthus RhizosphereMFTEDVQMSALGQRRTYAAHKLMSAFPPIATAKADMPH
Ga0207662_1053999113300025918Switchgrass RhizosphereMSALGQKRTYALQKAMSALPPKATAKADIRKTSCLLYLRKR
Ga0207650_1075511023300025925Switchgrass RhizosphereMSALGRKRTYAVQKGMSALLPIATVKADSRKRSCLL
Ga0207690_1028995423300025932Corn RhizosphereMSALGQKPTCALQNLMSALHPIATAKADIRKTPCLLYPRKRT
Ga0207669_1001357143300025937Miscanthus RhizosphereMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCPLCLRKRTCAV
Ga0207679_1033670523300025945Corn RhizosphereMSALGQKPTCALQNLMSALHPIATAKADIRKTPCLLYPRKR
Ga0207712_1035308813300025961Switchgrass RhizosphereMSALGQKRTYALQKATSALPPKATAKADIRKTSCLLYL
Ga0207668_1050585623300025972Switchgrass RhizosphereMSALGQKQTYAVQEGMSALSPIATAKADFRKTSCLLAP
Ga0207658_1092952223300025986Switchgrass RhizosphereMSALGHKQTYAVQKAMSASPLIATAKADLRKKRTCAA
Ga0207708_1054356823300026075Corn, Switchgrass And Miscanthus RhizosphereLGHKQTFAVQKGMSALPPIATAKADFGKPSCLLYP
Ga0209481_1031989313300027880Populus RhizosphereMSALGQKQTFALQKAMSPLPPIATVKADIRKRSCLLSLRKR
Ga0268265_1129336323300028380Switchgrass RhizosphereMSALGQKQTCAPQKSMSALVPVATAKADMSLASCPLFPRKRT
Ga0268265_1138379323300028380Switchgrass RhizosphereMSAWGHKQTYAPQKAMSALPRIATAKADSRKGACLPNP
Ga0310893_1035444223300031892SoilMSALGHKQTYAPQKAMSALPRIATAKADSRKGACLP
Ga0310916_1117134523300031942SoilMFALGQKQTYAVQNVMSALAPLATAKADSRKQSCPLYPQKQTC
Ga0310913_1042254813300031945SoilMSALGQKQTFASQNGMSALPPIATAKADFRKRSCL
Ga0310913_1125071513300031945SoilLHSSNPELLMSALGQKPTYAAHKGMSALPPIATAKADFGKPSCPLY
Ga0306922_1083562123300032001SoilMSALGQKQTYAVHKRMSALPPKATAKADIRESSCL
Ga0318504_1053278923300032063SoilMSALGQKQTFASQNGMSALPPIATAKADFRKRSCLLYPQKQT
Ga0310896_1064608113300032211SoilMSALGQKRTCAVQEAMSALLPIATEKADIRKTSCLLYPR
Ga0306920_10384605013300032261SoilALGQKQTYALQKAMSALPPIATAKADFRNRACLLCSALANVC
Ga0373948_0207661_2_1243300034817Rhizosphere SoilMSALGQKRTYAAHKSMSALPPIATAKADIGKLSCLLYPRKR
Ga0373958_0137755_498_6023300034819Rhizosphere SoilMSALGQKQTCAPQNAMSALPLIATAKADIRKRSCP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.