NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F082943



Overview

Basic Information
Family ID F082943
Family Type Metagenome
Number of Sequences 113
Average Sequence Length 55 residues
Representative Sequence MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDT
Number of Associated Samples 85
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 2.65 %
% of genes from short scaffolds (< 2000 bps) 1.77 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.31


Most Common Taxonomy
Group Unclassified (97.345 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (41.593 % of family members)
Environment Ontology (ENVO) Unclassified (37.168 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (48.673 % of family members)
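
The percentages in this overview appear to be fractions of the family's 113 sequences, so each one can be turned back into a member count. The sketch below does that arithmetic in Python; the dictionary labels and the `implied_count` helper are illustrative names, and treating 113 as the denominator for every figure is an assumption that the rounded values happen to support.

```python
# Convert the reported percentages back into implied member counts.
# Assumption: every percentage is relative to the 113 family sequences.
FAMILY_SIZE = 113  # "Number of Sequences" under Basic Information

reported_percent = {
    "genes with valid RBS motifs": 100.00,
    "genes near scaffold ends": 2.65,
    "genes from short scaffolds": 1.77,
    "taxonomy: Unclassified": 97.345,
    "GOLD: Vadose Zone Soil": 41.593,
    "ENVO: Unclassified": 37.168,
    "EMPO: Plant rhizosphere": 48.673,
}


def implied_count(percent, total=FAMILY_SIZE):
    """Return the nearest whole number of members for a percentage."""
    return round(percent / 100 * total)


counts = {label: implied_count(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    # Recompute the percentage from the count as a consistency check.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.3f} %)")
```

Read this way, for example, 2.65 % of genes near scaffold ends corresponds to 3 of the 113 genes.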




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary structure distribution: α-helix 61.73%, β-sheet 0.00%, coil/unstructured 38.27%
Feature Viewer tracks (shown along the representative sequence, positions 1-53): sequence, α-helices, β-strands, coil, secondary structure confidence score, signal peptide.

Predicted 3D Structure

pTM-score: 0.31

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All organisms: Unclassified, 97.3%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Chart: distribution of family members across habitat types.
Habitat types shown: Soil, Vadose Zone Soil, Tropical Forest Soil, Grasslands Soil, Agricultural Soil, Forest Soil, Hardwood Forest Soil, Corn, Switchgrass And Miscanthus Rhizosphere, Arabidopsis Rhizosphere, Populus Rhizosphere.
Labelled slices: 41.6%, 6.2%, 10.6%, 9.7%, 6.2%, 7.1%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AF_2010_repII_A01DRAFT_10726412 | 3300000580 | Forest Soil | MNKNKVAIVAIGLTALLAFSPAASFAQATTFSGEAVGLKANVVGVSLSLADT
AF_2010_repII_A1DRAFT_101165911 | 3300000597 | Forest Soil | MNKNKVAIVAIGLTALLAFSPAASFAQATTFSGEAVGLKANVVGVSLSLADTGALPSSGGNLSNSLASVNVAGI
JGIcombinedJ26739_1000583351 | 3300002245 | Forest Soil | MNANKNSTIVIFLTAILVVSPVASFAQAATTFSGEAVALRRASAVGISLAVSDTGPLPASGGNLKTSVGSV
JGIcombinedJ26739_1005083921 | 3300002245 | Forest Soil | MNANKKSTIVIFLTAILVVSPVASFAQAATTFSGEAVALRASAPGISLAVSDTGPLPASGGNLKTSVGSV
JGI25613J43889_100036625 | 3300002907 | Grasslands Soil | MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGPLAA
JGI25388J43891_10043174 | 3300002909 | Grasslands Soil | MNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGA
JGI25615J43890_10075152 | 3300002910 | Grasslands Soil | MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDT
JGI25616J43925_100128825 | 3300002917 | Grasslands Soil | MNTKKNSTIAICLTAXLALSPVASFAQATTTFSGEAVALRANALGISLALSDTGPLAA
JGI25616J43925_102553721 | 3300002917 | Grasslands Soil | MNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLASVNVLGLAS
JGI25616J43925_102865561 | 3300002917 | Grasslands Soil | MNTRKNTTIAIFLTAILAFSPLAGFAQAATTFSGEAVALRANALGISASISDTGPLPSS
Ga0066673_100178504 | 3300005175 | Soil | MKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISLDLSDT
Ga0066679_104195912 | 3300005176 | Soil | MQMNAKNTPMIAICLAAVLALGPVASFAQATTTFSGRAVALRA
Ga0070713_1015176971 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | LNTKKNATLAICLTALLVFSPLAGYAQATTTFSGEGVALKANALGISLSAADTGALPSSGGNLSTSLASVNVLGL
Ga0066681_103020901 | 3300005451 | Soil | LNAKKNATIALCLTALLAFSPLAGFAQAATTYSGDATALQASAVGISLALSHAG
Ga0070706_1014153011 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKKPSTIAICLIGALAFGPVTSFAQATATFSGQAVALRASAVGLALA
Ga0066701_101172303 | 3300005552 | Soil | MKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGNLST
Ga0066707_100483451 | 3300005556 | Soil | MKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGLLAASGGNLSTSLASVNVLGLASADALKSTTSGSG
Ga0066704_101235755 | 3300005557 | Soil | MQMSTKKNSTIAICLTAILALSPVASFAQATATFSGEAVALRANALGISLALSDTGALQSSGGNLSRSLASVNV
Ga0066702_102147223 | 3300005575 | Soil | MNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVAL
Ga0066651_108369321 | 3300006031 | Soil | MNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALR
Ga0070765_1021261533 | 3300006176 | Soil | MNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVALRASVVGISLSLADT
Ga0079222_100440592 | 3300006755 | Agricultural Soil | MNKKRTSMLAICLTALLAFSPLITFAQSATTFSGEAVALKANALGVSLALRCTSPGQVST
Ga0079222_105428791 | 3300006755 | Agricultural Soil | MNTNMHANKNSIIATCLVAILTLGPLRGFAQSATTFSGEAVALRANAVGISLALSDTGALPSSGGSL
Ga0066658_100257651 | 3300006794 | Soil | MNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGALPSSGGSLSTS
Ga0066665_105942072 | 3300006796 | Soil | MNMQKNSTIAICLTAILALSPIASLAQATTTFSGEAVALRA
Ga0066665_114037032 | 3300006796 | Soil | MNTKKNSAITICLVAILTLSPAFSFAQAATTFSGE
Ga0079220_100891541 | 3300006806 | Agricultural Soil | MLAMCLTALLAFSPLITFAQSATTFSGEAVALKANALGVSLALR
Ga0075431_1001559213 | 3300006847 | Populus Rhizosphere | VHMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISL
Ga0075424_1023006981 | 3300006904 | Populus Rhizosphere | VHMNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGIS
Ga0099791_101903161 | 3300007255 | Vadose Zone Soil | MNAKKNSTIAICLMVLVTFSPVASFAQATITFSGQAVALRVSAVGL
Ga0099791_105107701 | 3300007255 | Vadose Zone Soil | MQMSTKKNSTIAICLTAILALSPVASFAQATTTFNGEAVALRANALGISLALSDTGALQSSGGNLSRSLASVNVLGLASADA
Ga0099794_106593181 | 3300007265 | Vadose Zone Soil | MQMSTKKNSTIAICLTAILALSPVASFAQATATFSGEAVALRAN
Ga0099830_110779131 | 3300009088 | Vadose Zone Soil | MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRASALGISLALSD
Ga0099828_103138331 | 3300009089 | Vadose Zone Soil | MQMNANKNSTITIAICLIAILVFSPVAGFAQAAITFSGEAVALRASAAGISLAVSDTGPLPA
Ga0075418_111123702 | 3300009100 | Populus Rhizosphere | MNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISLSLSDT*
Ga0126382_103603491 | 3300010047 | Tropical Forest Soil | MNMKNNAAIAVCLTAILAFSPLAGFAQAATTFSGEAVALRANALGISLSLSDAG
Ga0126382_104369471 | 3300010047 | Tropical Forest Soil | MNTSKNAMIAICLTAILAFAPLAGFAQAATTFSGEAVALRANALGISLSLSDAGPLP
Ga0134082_100551071 | 3300010303 | Grasslands Soil | LEEENVQMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKAL
Ga0126370_102922991 | 3300010358 | Tropical Forest Soil | MITMKNPSIAILLAAILAFSQFAGLAQATTFSGEAVALKANALGVSLTASDTGPLP
Ga0126378_114558141 | 3300010361 | Tropical Forest Soil | MNTKKNAMIAILLTAILAFSPLAGFAQAATTFSGQAVALKANALGISLDVSDTGALPSSGGNLST
Ga0134066_100611941 | 3300010364 | Grasslands Soil | MITKKNSTMAICLMTILVFSPVATFAQATITFSGEA
Ga0126381_1047529311 | 3300010376 | Tropical Forest Soil | MNEKKAIIAICLAAVLVLGLSPGVSLAQGTTTFSGEAVALKASVAGISLDLGDTGALPSSGGNLSTSLAS
Ga0137391_115213661 | 3300011270 | Vadose Zone Soil | MQMNPKKNSTIAICLMAILAFSPVATFAQASTTFSGQAVALRASAVGLALALSDTGPLPA
Ga0137388_119766061 | 3300012189 | Vadose Zone Soil | MNAKKNSTIAICLMVLVAFGPVASFAQAPNTFRDQAVALLVSAVGLGLALSDTGPIPAS
Ga0137364_100787274 | 3300012198 | Vadose Zone Soil | MQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDL
Ga0137364_101006063 | 3300012198 | Vadose Zone Soil | MNTKKNSTIAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137364_105465962 | 3300012198 | Vadose Zone Soil | MNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDL
Ga0137383_103292162 | 3300012199 | Vadose Zone Soil | MNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137382_102502951 | 3300012200 | Vadose Zone Soil | MNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKA
Ga0137382_109386392 | 3300012200 | Vadose Zone Soil | MNAKNSTIAICLIAILVFSPVASFAQAPVTFSGEAVALPARALGLSLDLSDTGPLAASG
Ga0137363_107335611 | 3300012202 | Vadose Zone Soil | MNTKKNAGIAVFLTALLLYSPLAGFAQTATSFSGEGIALK
Ga0137399_100916021 | 3300012203 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANAL
Ga0137399_102028541 | 3300012203 | Vadose Zone Soil | MNAKKNSTIAICLMAILAFGSVTSFAQARTTYSGQAVALRASAT
Ga0137399_106929031 | 3300012203 | Vadose Zone Soil | MNTKKNSTMAICLTAILALSPIASLAQAATTFSGEAVALKANALGISLSLADTEALPSS
Ga0137399_108880021 | 3300012203 | Vadose Zone Soil | MNAKKNSTIAICLMAILAFGPVVSFAQARTTYSGQAVALRASA
Ga0137399_110652961 | 3300012203 | Vadose Zone Soil | MITKKNSTIAICLMTILVFGPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGN
Ga0137399_111490521 | 3300012203 | Vadose Zone Soil | MNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGAVPARGGNLSTSLATFRRKNACCR*
Ga0137377_112800181 | 3300012211 | Vadose Zone Soil | MNTKKNSAIAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137370_100471724 | 3300012285 | Vadose Zone Soil | MQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAA
Ga0137370_102143932 | 3300012285 | Vadose Zone Soil | MNAKNSTIAICLIAILVFSPVASFAQTTITFSGEAVALRAKALGISLDLSDTGPLAARGGNLSTSLA
Ga0137387_107543581 | 3300012349 | Vadose Zone Soil | MNTRKNSTIAICLTAVLGLSPVASFAQATTTFSGEAVALKANALGISLALSD
Ga0137361_107325702 | 3300012362 | Vadose Zone Soil | MQMNANKNSTITIAICLIAILVFSPVAGFAQAAITFSGEAVALRASAAGISLALSDTGPLPASDGNLKTSVGSVSVLGL
Ga0137361_110337592 | 3300012362 | Vadose Zone Soil | MQMNAKNSTIAICLIAILVFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASGGSLSTSLA
Ga0137398_100220824 | 3300012683 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISL
Ga0137398_102181062 | 3300012683 | Vadose Zone Soil | MNMQKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGALPSNGGN
Ga0137398_102725401 | 3300012683 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIVSFAQATTTFSGEAVALRANALGISLA
Ga0137397_110847101 | 3300012685 | Vadose Zone Soil | MKNKKLSTIAICLIAVLAFGPVTSFAQATTTFSGQAVALRAS
Ga0137395_100600001 | 3300012917 | Vadose Zone Soil | MNMQKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDTGA
Ga0137395_101037201 | 3300012917 | Vadose Zone Soil | MNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLALSDTGA
Ga0137396_101942712 | 3300012918 | Vadose Zone Soil | MNAKKNSTIAICLMAILAFGSVTSFAQARTTYSGQAVALHASATGLALALSDT
Ga0137396_102443791 | 3300012918 | Vadose Zone Soil | MNTRKNSMMAACLTAILAFSPVASFSQATITFSGEAVALKANALG
Ga0137394_106670991 | 3300012922 | Vadose Zone Soil | MNTKNSTVAIGLVAALAFNPVSTFGQANTFSGQAVALRASAVGIALAL
Ga0137419_101570983 | 3300012925 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLA
Ga0137416_105560091 | 3300012927 | Vadose Zone Soil | MNTKKNSTIAICLTAILALSPVVGFAQATITFSGEAVALRASALGISLALSDTGP
Ga0137410_108518722 | 3300012944 | Vadose Zone Soil | MSAKKNSTIAICLMVLTTFSPVAAFAQATNTFSGQAVALRV
Ga0137410_114088291 | 3300012944 | Vadose Zone Soil | MQMSTKKNSTMAICLTAILALSPVASFAQATTTFSGEAVALRAS
Ga0126375_105228841 | 3300012948 | Tropical Forest Soil | MNTSKNAMIAICLTAILAFAPLAGFAQTATAFSGEAVALRANA
Ga0126369_118278781 | 3300012971 | Tropical Forest Soil | MNTKKNAVIAIFLTAILAFSSLAGFAQTATSFSGEAVALKANALG
Ga0134079_104861721 | 3300014166 | Grasslands Soil | LEEENVQMKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISLDLSDT
Ga0137418_102280842 | 3300015241 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALR
Ga0137418_108077901 | 3300015241 | Vadose Zone Soil | MNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPARGGNLSTSVASVNVL
Ga0132258_115388462 | 3300015371 | Arabidopsis Rhizosphere | MKTHKPSTLAICLMAALTFGPLDVFAQTTTFSGQAVALRASAVGLALALSDTGALPAAGGNLATSLASV
Ga0190272_101121471 | 3300018429 | Soil | MPAKHNSTIAICLTAILVLGPVAASAQTNTFSGQAVALRASVIGVALALSDTGPLPATGGDLKTS
Ga0066669_123482101 | 3300018482 | Grasslands Soil | MNTTKNSTIVICLIAILAFSPVATFAQASATFSGRAVALRASAVGLALALS
Ga0179590_12292842 | 3300020140 | Vadose Zone Soil | VHTNTKENSTIAICLMAILVFSPVASFAQATITFSGEAVALRAKALGISLD
Ga0210407_102291581 | 3300020579 | Soil | MNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSSGGN
Ga0210401_109933881 | 3300020583 | Soil | MNTKKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSS
Ga0210388_113992091 | 3300021181 | Soil | MNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVALRASVVGISLSLADTGALPSRGGNLS
Ga0210402_109738081 | 3300021478 | Soil | MNTKKNSTMAICLTAILVLSPIASFAQATTTFSGEAVAL
Ga0207700_115241941 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | LNTKKNATLAICLTALLVFSPLAGYAQATTTFSGEGVALKANALGISLSAADTGAL
Ga0209238_11667612 | 3300026301 | Grasslands Soil | MKTKKKSTMAICLTAILAFSPVASFAQATITFSGEAVALRAKALGISLDLSDTGPLAASG
Ga0209240_10258141 | 3300026304 | Grasslands Soil | MNTKKNTTIAIFLTAILAFSPLAGFAQAATTFSGEAVALRVNALGISA
Ga0209240_12693221 | 3300026304 | Grasslands Soil | MNTKKNSTVAICLTAILALSPIASFAQATTTFSGEA
Ga0209152_100866873 | 3300026325 | Soil | MNTKKNSAITICLVAILTLSPAFSFAQAATTFSGEAVALRASAVGISLALADTGALPSSGGSLSTSL
Ga0257163_10799971 | 3300026359 | Soil | MNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLASVNVLGL
Ga0257172_11101041 | 3300026482 | Soil | MNTKKNSTIAICLTAILALSPVASFAQATTTFSGQAVA
Ga0257158_11078881 | 3300026515 | Soil | MNTKKNSMMAACLTAMLAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPASGGNLSTSLAS
Ga0209474_106827641 | 3300026550 | Soil | MKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISL
Ga0209648_106573812 | 3300026551 | Grasslands Soil | MNTKKNSTIAICLTAILALSPVASLAQATTTFSGEAVALRANALGISL
Ga0179587_104179662 | 3300026557 | Vadose Zone Soil | MNTQKNSTIAICLTAILALSPIASFAQATTTFSGEAVALRANALGISLALSDTGALPSSGGNLSMSLAN
Ga0209076_10781911 | 3300027643 | Vadose Zone Soil | MNTRKNSMMAACLTAILAFSPVASFSQATITFSGEAVALK
Ga0209076_10909241 | 3300027643 | Vadose Zone Soil | MNTRKNSMMAACLTAILAFSPVASFSQATTTFSGEAVALKANALGISLALSDTGALPARGGNLS
Ga0137415_101392133 | 3300028536 | Vadose Zone Soil | MNTKKNSTIALCLTAILAFSPVASFAQATTTFSGEAGALRGNALGISLALSDTGPLASGGNLNTSLASVNVLGLASA
Ga0137415_103896311 | 3300028536 | Vadose Zone Soil | MKMNAKKNSTIAICLTAILAFSPVASFAQATTTFTGEAVTLRASAVGISLALSDTGALLSSGGN
Ga0307476_103151821 | 3300031715 | Hardwood Forest Soil | MNTKKNATIASCLMAVLVFSPVATFAQATTTFTGEAV
Ga0307477_110998042 | 3300031753 | Hardwood Forest Soil | MNEKRTSMMTICLRVVLALSPSLSLAQGTTTFSGEAVALKANVLGISLTLADTGQLPSTG
Ga0307475_103966731 | 3300031754 | Hardwood Forest Soil | MNEKRTSMMTICLLVGLALSPALGFAQGTTTFSGEAVALKANALGISLSIADTG
Ga0307478_109890991 | 3300031823 | Hardwood Forest Soil | MNAKRTSTIAICLTAIVALSPVAGFAQATTTFSGEAVALRANVLGTSLALSDTGALPS
Ga0307478_115152841 | 3300031823 | Hardwood Forest Soil | MNTKKNSTMAIGLTAILALSPVASFAQATTTFSGEAVALRASALGISLTLADTGA
Ga0307479_101989061 | 3300031962 | Hardwood Forest Soil | VQINAKTNSTIAICLTAILALIPIASFAQATITFSGEAVALRANAAGIALALSDTGALPSSGGNLSTSLASVN
Ga0307479_103523211 | 3300031962 | Hardwood Forest Soil | MNSKKSSTIAICFVAILAFGPVASFAQATTTFSGEAVALRANALGISLDLSDTGPLAAS
Ga0307471_1001176713 | 3300032180 | Hardwood Forest Soil | MNTKTNSTIAICLTAILALSPVASFARGTTTFSGEAVALRASAVGISLALSDTGAFRLAAGI
Ga0306920_1012229891 | 3300032261 | Soil | MNEKKAIIAVCLAAVLAFSPVVSLAQGTTTFSGEAVALKATAAGISLALG
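
Each row of the sequence table carries the four fields named in its header (protein ID, sample taxon ID, habitat, sequence), which map naturally onto FASTA records. The Python sketch below shows one way to write them out, using two rows copied from the table. The `to_fasta` helper and the `taxon=` / `habitat=` header layout are illustrative assumptions, not a format defined by NMPFamsDB, and stripping a trailing `*` as a stop marker is likewise an assumption about what that character means here.

```python
# Hypothetical helper: format (protein_id, taxon_id, habitat, sequence)
# rows from the Family Sequences table as FASTA text.
records = [
    ("JGI25615J43890_10075152", "3300002910", "Grasslands Soil",
     "MNTKKNSTIAICLTAILALSPVASFAQATTTFSGEAVALRANALGISLALSDT"),
    ("Ga0066673_100178504", "3300005175", "Soil",
     "MKTKKKSTIAICLMTILVFSPVATFAQATITFSGEAVALRAKALGISLDLSDT"),
]


def to_fasta(rows, width=60):
    """Return FASTA text, wrapping each sequence at `width` residues."""
    lines = []
    for protein_id, taxon_id, habitat, sequence in rows:
        # Assumed header layout: ID first, then key=value annotations.
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        residues = sequence.rstrip("*")  # assumption: '*' marks a stop
        for start in range(0, len(residues), width):
            lines.append(residues[start:start + width])
    return "\n".join(lines) + "\n"


fasta_text = to_fasta(records)
print(fasta_text, end="")
```

Keeping the habitat and taxon ID in the header line means the ecosystem annotation travels with each sequence into downstream tools that only read FASTA.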




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice