NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F101935



Overview

Basic Information
Family ID F101935
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 49 residues
Representative Sequence MGLFRFAPATRREPSTDALIRIFQSEAAEIREDPEPARLRLTLHALAGLF
Number of Associated Samples 87
Number of Associated Scaffolds 102
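
The length figures above can be checked with a few lines of Python. This is a minimal sketch, not part of NMPFamsDB: the function names `residue_count` and `mean_length` are illustrative, and it assumes sequences are plain one-letter amino-acid strings in which a trailing `*` marks a stop rather than a residue.

```python
# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = "MGLFRFAPATRREPSTDALIRIFQSEAAEIREDPEPARLRLTLHALAGLF"


def residue_count(seq: str) -> int:
    """Count residues, ignoring surrounding whitespace and a trailing '*'.

    Treating '*' as a stop marker is an assumption of this sketch.
    """
    return len(seq.strip().rstrip("*"))


def mean_length(seqs: list[str]) -> float:
    """Average residue count over a non-empty list of sequences."""
    if not seqs:
        raise ValueError("need at least one sequence")
    return sum(residue_count(s) for s in seqs) / len(seqs)


print(residue_count(REPRESENTATIVE))  # 50
```

Passing all family sequences to `mean_length` would give the family-wide average; the representative sequence alone is 50 residues long.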

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 82.35 %
% of genes near scaffold ends (potentially truncated) 99.02 %
% of genes from short scaffolds (< 2000 bps) 93.14 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.31


Most Common Taxonomy
Group Unclassified (83.333 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(44.118 % of family members)
Environment Ontology (ENVO) Unclassified
(50.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.000 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all 102 sequences in the family, shown in an interactive viewer (MSAViewer). The aligned sequence IDs are the Protein IDs listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 37.18%, β-sheet: 0.00%, Coil/Unstructured: 62.82%
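
A distribution like the one above can be derived from a per-residue secondary-structure assignment. The sketch below is illustrative only: the H/E/C label alphabet, the function name `ss_distribution`, and the example string are assumptions, and the page does not state which predictor or sequence length produced its percentages.

```python
from collections import Counter


def ss_distribution(labels: str) -> dict[str, float]:
    """Percentage of helix (H), strand (E) and coil (C) labels.

    `labels` is a per-residue string such as "CCHHHHHCCC"; the H/E/C
    alphabet is an assumption of this sketch.
    """
    if not labels:
        raise ValueError("empty assignment")
    unknown = set(labels) - set("HEC")
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    counts = Counter(labels)
    return {k: round(100 * counts[k] / len(labels), 2) for k in "HEC"}


# Hypothetical 10-residue assignment, not taken from this family.
print(ss_distribution("CCHHHHHCCC"))  # {'H': 50.0, 'E': 0.0, 'C': 50.0}
```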
Feature Viewer: per-residue tracks (sequence, α-helices, β-strands, coil, secondary-structure confidence score) for the representative sequence MGLFRFAPATRREPSTDALIRIFQSEAAEIREDPEPARLRLTLHALAGLF, positions 1 to 50.

Predicted 3D Structure


Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.31

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
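
The rule in the note above (pTM < 0.7 means low confidence) is simple enough to express directly. This is a minimal sketch with a hypothetical function name; only the 0.7 cutoff comes from the page.

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return ptm_score < cutoff


print(is_low_confidence(0.31))  # True for this family's model
```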



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

Taxonomy chart (all organisms): Unclassified 83.3 %; the remaining 16.7 % falls in other groups.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat type chart. Habitat labels shown: Bog Forest Soil; Vadose Zone Soil; Tropical Forest Soil; Agricultural Soil; Soil; Grasslands Soil; Hardwood Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Plant Roots; Populus Rhizosphere.
Slice percentages as displayed (not matched to individual labels): 2.9 %, 2.9 %, 6.9 %, 2.9 %, 44.1 %, 14.7 %, 4.9 %, 2.9 %, 5.9 %, 3.9 %.
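
Habitat shares of this kind can be tallied from the Habitat column of the Family Sequences listing. The sketch below is an illustration under assumptions: the function name `habitat_percentages` is hypothetical, the sample list is made up, and the chart on the page may group habitats at a different ecosystem level than the raw labels.

```python
from collections import Counter


def habitat_percentages(habitats: list[str]) -> dict[str, float]:
    """Share of each habitat label, as a percentage rounded to one decimal."""
    if not habitats:
        raise ValueError("need at least one habitat label")
    counts = Counter(habitats)
    total = len(habitats)
    # most_common() orders by count, most frequent first.
    return {name: round(100 * n / total, 1) for name, n in counts.most_common()}


# Made-up sample, not the family's real distribution.
sample = ["Soil", "Soil", "Forest Soil", "Plant Roots"]
print(habitat_percentages(sample))
```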



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1017498961 | 3300002245 | Forest Soil | MRLFRFAPALRRNPPPDALIRIFQSEVSEVREDPEPAQLRVTLHVLAGLLTA
JGI26346J50198_10305521 | 3300003351 | Bog Forest Soil | MGLFRFAPATRREPSPDALIQIFQSETAEIREGSEPVQLRLTLHALAG
JGIcombinedJ51221_102805822 | 3300003505 | Forest Soil | MRLFRFAPALRRNPPPDALIRVFQSEVSEVREDPEPAQLRVTLHVLAGLLAAL
Ga0062388_1003704943 | 3300004635 | Bog Forest Soil | MQVSRFVPSLRPKPSTDRLIRLSQSESAEITEGPEPIELRLTL*
Ga0066388_1054960811 | 3300005332 | Tropical Forest Soil | MRFFRFAAAPQRNPAPDALIRMFQSEVSEVREDAEPAQLR
Ga0070711_1002609912 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MDLFPSVLQRRREPSSAALIRIFQSETAEVRENPEPFQLRLTLYALAGLVVALVAVSL
Ga0070700_1019712211 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MGLFRFAPAPRHSAPPEALIRVFQSETGEIREDPEPAQLRLTLYALVGLGTALLAVSLFMQL
Ga0066694_104859862 | 3300005574 | Soil | MHLFRFAPALRRNPEPDALIRIFQSEVSEVREDPEPAQLRITLHVLA
Ga0066702_109666131 | 3300005575 | Soil | MLWFRFAPALRRDPPHRTLIRIFQSEVGEVREDPEPAQLRL
Ga0066903_1019726982 | 3300005764 | Tropical Forest Soil | MGWFRLAPMRHEQSSDPLIRVFQSETAEIRENPEPARLRLTLHLLAVL
Ga0066903_1064062491 | 3300005764 | Tropical Forest Soil | MGLYRFALAPRREPSPEPLIRIFQSETAEIREGREPAQVRLTLYAVAGLFVSLLAVSTVM
Ga0070715_107402772 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MGLFRFAPAPRRNSSPETLIRIFQSETGEVRENPG
Ga0079221_102817591 | 3300006804 | Agricultural Soil | MGLFRFALGPRRNPSPDALIRVFQSESAEVREDPEPGQVRLTLYALAGLF
Ga0075425_1026996051 | 3300006854 | Populus Rhizosphere | MGLFRFAPAPRHSAPPEALIRVFQSETGEIREDPEPAQLRLTLYALVGLGT
Ga0075423_124755031 | 3300009162 | Populus Rhizosphere | MGWFRFAPATPRESSADALIRVFQSETAEIREDPEPAQLRLTLPALAGLFVALL
Ga0126373_130576541 | 3300010048 | Tropical Forest Soil | LYRFALAHRREPSLDPLIRIFQSEAAEIREGREPAQVRLTLYAVAGLFVSLLAVSIVMPMNR
Ga0074046_103163591 | 3300010339 | Bog Forest Soil | MGLFRSAAALRREPSPDAVIRIFQSEAAEIREDPEPAQLRLTFYALAALFVAML
Ga0126370_107152591 | 3300010358 | Tropical Forest Soil | VPRHSSAPDALIRIFQSEVGEVRENPEPAQLRLTLYAMCGL
Ga0126370_125789082 | 3300010358 | Tropical Forest Soil | MGLFRFAPAPQRNPAPNALIRIFQSEVGEVREDPEPAQLRLTLYALAGLFVALLA
Ga0126372_114736782 | 3300010360 | Tropical Forest Soil | MGLFRSAPQRQREPPSAALIRIFQSETAEIREDPEPFQLRLTLHALAGLVVALVAVSVLMQMN
Ga0137376_113438272 | 3300012208 | Vadose Zone Soil | MGLFRCASASRDPSPDALIRVFQSETAEIREDSAPAQLRLTLYAFAGLLVSLLAVT
Ga0182036_107531912 | 3300016270 | Soil | MGLYRFALAPRREPPTDPLIRIFQSETAEIREGREPAQLRLTLYAAAGLFVSLLAVSIVM
Ga0182033_100925271 | 3300016319 | Soil | MGLFRFVPAPRREPSPDALIRVFQSETAEIREGPEPAQLRLTLYALAGLFAGLVA
Ga0182035_117523612 | 3300016341 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIREDPEPARLRLTLHALAGLFAALLVVS
Ga0182032_109023642 | 3300016357 | Soil | MGLFRFVAAVHEPSPDALIRIFQSETAQIREDSGPAQLRLTLHALAGLFVALLAVS
Ga0182040_101851862 | 3300016387 | Soil | MGLFRFALGPRRNASPDALIRIFQSETSEVREDPEPAQL
Ga0182037_103535991 | 3300016404 | Soil | MGFFRFAPALQRNQPADALIRVFQSEVGEVRENPEPAQLRITLHVLAGLFVVLLAVSF
Ga0066667_103727791 | 3300018433 | Grasslands Soil | MHLFRFAPALRRNPEPDALIRIFQSEVSEVREDPEPAQLRIT
Ga0066662_117904131 | 3300018468 | Grasslands Soil | MHLFRFAPALRRNPEPDALIRIFQSEVSEVREDPEPAQLRITLHVLAGLFIALL
Ga0066669_105120041 | 3300018482 | Grasslands Soil | MHLFRFAPALRRNPPPDALIRIFQSEVSEVREDPERAQLRVTLSALAGLFVALLAVSFF
Ga0210403_107739852 | 3300020580 | Soil | MHLFRFRFAPALRRNPSPDALIRIFQSEVSEVREDPEP
Ga0210403_108967822 | 3300020580 | Soil | MRLFRFAPVPQRSPPPDALIRIFQSEVGEVREDPEPAQLRITLHVLAGLFTALLA
Ga0210403_109965222 | 3300020580 | Soil | MGLFRFVPARRRNTSPDTLIRIFQSETGEVREDSEPAQL
Ga0210399_108038351 | 3300020581 | Soil | MGLFRFVPATRREPPTDALIRIFQSETAEIREGPEPAQLRLTLYAFTGLLAGLLAVSLV
Ga0210401_106927581 | 3300020583 | Soil | MNLFPSALQRRREPPSAALIRIFQSETAEIREDPEPFQLRLTLHALAGLVIALVAVSLLMQLNRVVAS
Ga0210400_115934391 | 3300021170 | Soil | MGWFRFAPATQREPSADALIRVFQSETAEIREDPEPAQLRLTLPALAGLFFA
Ga0213876_105501712 | 3300021384 | Plant Roots | MDLFRSALQRRREPPSAALIRIFQSETAEIREDPEPF
Ga0210394_116503082 | 3300021420 | Soil | MRLFRFAPVPQRSPPPDALIRIFQSEVGEVREDPEPAQLRITLHVLAGLFTA
Ga0210402_101584812 | 3300021478 | Soil | MRLFRFAPALRRNPPPDALIRIFQSEVSEVREDPEPAQLRVTLHVLAG
Ga0126371_122199301 | 3300021560 | Tropical Forest Soil | MALLRFVPAARREGSPDALLRVFQSETGEIREDPEPAQLRITLYAVV
Ga0126371_126474391 | 3300021560 | Tropical Forest Soil | MGLFSFASIRQREPSTDALIRIFQSETAEIREDPEPARLRFTLHA
Ga0126371_127114082 | 3300021560 | Tropical Forest Soil | MGLYRFALAPRREPSLDPLIRIFQSETAEIREGREPAQLRITL
Ga0179589_104366431 | 3300024288 | Vadose Zone Soil | MDLFPSALQRRREPSSAALIRIFQSETAEVREDPEPFQLRLTLHALAGLVVALVAVSLL
Ga0207663_112461342 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MGLFRFAPAPQRKPPPNALIRIFQSEVDEVREDPEPAQLRLTLYALAGLF
Ga0257146_10302701 | 3300026374 | Soil | MGLFRFVPARRRNTSPDSLIRIFQSETGEVREDCEPAQLRVTLHVLVGLLVAM
Ga0208369_10020932 | 3300026998 | Forest Soil | MHLFRFRFAPALRRNPSPDALIRIFQSEVSEVREDPEPAQLRVTLHVLAG
Ga0208603_10154972 | 3300027109 | Forest Soil | MHLFRFRFAPALRRNPPPDALIRIFQSEVSEVREDPEPAQLR
Ga0208097_10297012 | 3300027173 | Forest Soil | MNLFPSALQRRREPPSAALIRIFQSETAEIREDPEPFQL
Ga0209625_10058731 | 3300027635 | Forest Soil | MHLFRFRFAPALRRNPPPDALIRIFQSEVNEVREDPE
Ga0209656_102563041 | 3300027812 | Bog Forest Soil | MGLFRFAPAQQRNPPPDALIRIFQSEVGEVREDPEPAQLRLTLHLL
Ga0209380_108890811 | 3300027889 | Soil | MNLFPSALQRRREPPSAALIRIFQSETAEIREDPEPFQLRLTLHALAGLVIALVAVSL
Ga0209488_104730131 | 3300027903 | Vadose Zone Soil | MGLFRFAPAPRRNSSPETLIRIFQSETGEVRADPEPAQLRLTLHALAGL
Ga0318516_106934442 | 3300031543 | Soil | MGLFRFVLGTRRNSSPDALIRIFQSESAEVEEDPEPAQTRLTLFTL
Ga0318541_105551922 | 3300031545 | Soil | MGLFRFALGPRRNSSPDVLIRIFQSESAEVQEDPEPAQTRLTLFTLAGLFI
Ga0318538_101502492 | 3300031546 | Soil | MGLFRFALGPRRNSSPDVLIRIFQSESAEVQEDPEPAQTRLTLFTLAGLF
Ga0318538_103613741 | 3300031546 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQLRITLHVLTGLLIALLAVS
Ga0318528_103745222 | 3300031561 | Soil | MGLFRFALGPRRNSSPDALIRIFQSESAEVQEDPEPAQTRIT
Ga0318515_103207091 | 3300031572 | Soil | MGLFRFAPAVQREPSPDALIRIFQSETAEIREDSEPAQLRLTLQA
Ga0318542_103170822 | 3300031668 | Soil | MGFFRFAPAPRRNPEPDALVRIFRSEVDEVREDPEPAQLRLTLYVLAGLFLALLGVSIFM
Ga0318561_106347601 | 3300031679 | Soil | MGFFRFAPAPRRNPEPDALVRIFRSEVDEVREDPEPAQLRLTLYVLAGLF
Ga0318574_109085972 | 3300031680 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQLRITLHVLTGLLIALLAV
Ga0318560_100039816 | 3300031682 | Soil | MGLRRFVPALRPEPSTDALIRIFQSESAEIREDPEPARLRLTVHALGGLFAALLAVSLV
Ga0318496_104741852 | 3300031713 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIREDPEPARLRLTL
Ga0307469_111127952 | 3300031720 | Hardwood Forest Soil | MGWFRFAPATPREPSADALIRVFQSETAEIREDPEPA
Ga0318500_101710752 | 3300031724 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIREDPEPARLRLTLHALAGLFAALLVVSAVMPMD
Ga0318501_101792471 | 3300031736 | Soil | MGLFRFVLGTRRNSSPDALIRIFQSESAEVQEDPEPAQTRITLYALAGLFVALVAVTVFM
Ga0318501_107973091 | 3300031736 | Soil | MGLYRFALAPRREPPTDPLIRIFQSETAEIREGREPAQ
Ga0318502_108533992 | 3300031747 | Soil | MGFFRFAPAPRRNPEPDALVRIFRSEVDEVREDPEPAQLRLTLYV
Ga0318492_107243182 | 3300031748 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQL
Ga0307477_106367012 | 3300031753 | Hardwood Forest Soil | MGLRRFVPALRPEPSTDALIRTFQSESAEIREDPEPARLRLTLHALGGLFA
Ga0318554_106327971 | 3300031765 | Soil | MGSFRLAPMRHEQSSDPLIRVFQSETAEIREDPEPARLRLTLHLLAVLFLSLLAVSVFMP
Ga0318526_102750601 | 3300031769 | Soil | MGLRRFVPALQPEPSTDALIRIFQSESAEIREDPEPARLRLTLHALGGLFA
Ga0318521_109628432 | 3300031770 | Soil | MGFFRFAPAPRRNPEPDALVRIFRSEVDEVREDPEPAQLRLTLYVLAGLFLALLGVSIV
Ga0318546_108740902 | 3300031771 | Soil | MGLFRFALGPRRNSSPDVLIRIFQSESAEVQEDPEPAQT
Ga0318547_107806322 | 3300031781 | Soil | MGLFSFAPIPRREPSTDALIRIFQSETAEIREDAE
Ga0318529_101394651 | 3300031792 | Soil | MGLRRFVPALQPEPSTDALIRIFQSESAEIREDPE
Ga0318550_103615953 | 3300031797 | Soil | MGLFRFALGPRRNASPDALIRIFQSETSEVREDPEPAQLRLT
Ga0318568_100683552 | 3300031819 | Soil | MGSFRLAPMRHEQSSDPLIRVFQSETAEIREDPEPARLRLTLHLLAVLFLSLLAV
Ga0318568_105226491 | 3300031819 | Soil | MGLFRFALGPRRNASPDALIRIFQSETSEVREDPEPAQLRLTLYAVAGL
Ga0318567_103236602 | 3300031821 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIRECPEPAQLRLTLYAFTG
Ga0310917_108771562 | 3300031833 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQLRITLHVLTGLLIALLAVSVFM
Ga0306925_106646672 | 3300031890 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQLRITLHVLTGL
Ga0306925_119062201 | 3300031890 | Soil | MGFWRFVPALQRNPERDALLRIFRSEVDEVRESREPGQLRLTLYV
Ga0306921_120283791 | 3300031912 | Soil | MGLYRFALAPRREPSPDPLIRIFQSETAEIREGREPAQIRLTLFGVAGLFV
Ga0306921_125343322 | 3300031912 | Soil | MGLFRFALGPRRNSSPDVLIRIFQSESAEVQEDPEPAQTRLTLFTLAGL
Ga0310913_104482381 | 3300031945 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIREDPEPARLRLTLHALAGLFAALLVVSAVMPMDR
Ga0310910_110031743 | 3300031946 | Soil | MGLFRFALGPRRNASPDALIRIFQSETSEVREDPEPAQLRLTLYA
Ga0306922_101965602 | 3300032001 | Soil | MGLFRFVLGTRRNSSPDALIRIFQSESAEVQEDPAPAQTRLTLY
Ga0306922_103161702 | 3300032001 | Soil | MGFFRFAPAVQRKPPADALTRVFQSEVGEVREDSEPAQLRITLHVL
Ga0306922_115301202 | 3300032001 | Soil | MGLYRFALAPRREPPTDPLIRIFQSETAEIREGREPAQLRLTLYAAA
Ga0318532_100971501 | 3300032051 | Soil | MGLFRFVPAPRREPSTDALIRIFQSETAEIREDPEPARLRLTLHA
Ga0318506_100516992 | 3300032052 | Soil | MGLRRFVPALQPEPSTDALIRIFQSESAEIREDPEPARLRLTLHALGGLFAALLAVSL
Ga0318570_103005262 | 3300032054 | Soil | MGLFRFALGPRRNSSPDALIRIFQSESAEVQEDPE
Ga0318553_101076292 | 3300032068 | Soil | MGLFRFAPATRREPSTDALIRIFQSEAAEIREDPEPARLRLTLHALAGLF
Ga0318525_104544812 | 3300032089 | Soil | MGLRRFVPALQPEPSTDALIRIFQSESAEIREDPEPARLRLTL
Ga0318577_103511781 | 3300032091 | Soil | MGLFRFVLGTRRNSSPDALIRIFQSESAEVQEDPEPAQTRLTLYTLAGLFIALVAVTVFM
Ga0318540_102418012 | 3300032094 | Soil | MGLFRFVLGTRRNSSPDALIRIFQSESAEVQEDPEPAQTRLTLYT
Ga0307471_1008399272 | 3300032180 | Hardwood Forest Soil | MGLFRFAPAPQRKPPPNALIRIFQSEVDEVREDPEPAQLRLTLYALAGLFVALLAISVFM
Ga0307472_1007740261 | 3300032205 | Hardwood Forest Soil | MGWFHFAPATRREPSADALIRVFQSETAEIREDPEPAQLRLTLPALAGLFVA
Ga0307472_1015920312 | 3300032205 | Hardwood Forest Soil | MGLFRFAPATPPEPSADALIRIFQSETAEIREFPEP
Ga0306920_1002131543 | 3300032261 | Soil | MGFFRFAPAPRRNPEPDALLRIFRSEVDEIREDAEPAQLRLTLYVLAGLFLALLGVS
Ga0306920_1013428501 | 3300032261 | Soil | MRHEQSSDPLIRVFQSETAEIREDPEPARLRLTLHLLAVLFLS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice