NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F092761

Metagenome Family F092761

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F092761
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 42 residues
Representative Sequence VNLINQISCEAILIWRGHEGRTGWELGLELQEPSPDFWGLDF
Number of Associated Samples 99
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 4.35 %
% of genes near scaffold ends (potentially truncated) 20.56 %
% of genes from short scaffolds (< 2000 bps) 14.95 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score0.50

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (78.505 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(21.495 % of family members)
Environment Ontology (ENVO) Unclassified
(27.103 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.140 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52
1JGI12637J13337_10167811
2Ga0066688_102637482
3Ga0066688_102660373
4Ga0066684_100502501
5Ga0066681_101408673
6Ga0070730_105057713
7Ga0070733_111675851
8Ga0070695_1017556952
9Ga0070704_1002522713
10Ga0066670_108897122
11Ga0066699_108324721
12Ga0066703_100329115
13Ga0066691_102122671
14Ga0066903_1090564252
15Ga0068860_1005778631
16Ga0075296_10300792
17Ga0080027_101959732
18Ga0070715_109216032
19Ga0070716_1015635111
20Ga0066660_104727822
21Ga0075433_105414772
22Ga0099794_101549842
23Ga0099830_100901754
24Ga0099830_103634951
25Ga0099830_104657302
26Ga0099828_102928621
27Ga0099792_108368952
28Ga0116135_11201842
29Ga0126380_102533363
30Ga0126382_104159251
31Ga0126378_122734031
32Ga0126379_100079111
33Ga0105239_122844711
34Ga0126381_1023053182
35Ga0126383_104117311
36Ga0137389_104462012
37Ga0137388_101958963
38Ga0137399_115877232
39Ga0137362_103437611
40Ga0137380_101233551
41Ga0137378_105882091
42Ga0137361_105759521
43Ga0137358_102849192
44Ga0137413_106578112
45Ga0137413_107196501
46Ga0137419_113466122
47Ga0137416_104483441
48Ga0137404_103905172
49Ga0126375_111111201
50Ga0157372_132705422
51Ga0182024_105288601
52Ga0182038_100857974
53Ga0182038_101783531
54Ga0187802_100682941
55Ga0187780_101561971
56Ga0066669_105529521
57Ga0210407_111763822
58Ga0210403_105568752
59Ga0210399_106065722
60Ga0210405_110845662
61Ga0210393_113345502
62Ga0210389_107186112
63Ga0210387_111859751
64Ga0210394_109589642
65Ga0210384_116053261
66Ga0210391_103019023
67Ga0210398_114319151
68Ga0210402_100302881
69Ga0210402_109699271
70Ga0126371_127815861
71Ga0137417_13442073
72Ga0207671_115446431
73Ga0209236_10228945
74Ga0209647_10604963
75Ga0209647_12083621
76Ga0209158_10990441
77Ga0257150_10429281
78Ga0257179_10208182
79Ga0257165_10894831
80Ga0209806_12818421
81Ga0209807_12457051
82Ga0179587_102436251
83Ga0208365_10259962
84Ga0209004_10296802
85Ga0209117_11377652
86Ga0209060_100326941
87Ga0209693_100794443
88Ga0209166_106672191
89Ga0209701_100734181
90Ga0209488_103185691
91Ga0265352_10024811
92Ga0209526_100578541
93Ga0268264_100457725
94Ga0302233_101999301
95Ga0170834_1023374191
96Ga0170823_131402841
97Ga0170824_1276714302
98Ga0318538_104149071
99Ga0307468_1013908931
100Ga0307468_1016675721
101Ga0307479_105419811
102Ga0310911_103193391
103Ga0318505_103855751
104Ga0306924_118537422
105Ga0318540_104492732
106Ga0335085_114755352
107Ga0335083_105062662
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 4.29%    β-sheet: 20.00%    Coil/Unstructured: 75.71%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540VNLINQISCEAILIWRGHEGRTGWELGLELQEPSPDFWGLDFSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
21.5%78.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Freshwater Sediment
Soil
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Rice Paddy Soil
Tropical Peatland
Prmafrost Soil
Permafrost
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Palsa
Switchgrass Rhizosphere
Populus Rhizosphere
Corn Rhizosphere
Corn Rhizosphere
21.5%7.5%3.7%11.2%3.7%18.7%2.8%2.8%2.8%4.7%3.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12637J13337_101678113300001137Forest SoilNSSRCEAVLIWRGHEGRKGWELGLELIEPSQSFWGVDL*
Ga0066688_1026374823300005178SoilQKLDLVNLVNKNVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF*
Ga0066688_1026603733300005178SoilNKVNGNSCDAILIWRGHEGRGGWELGLELQGQQDDFWGVDF*
Ga0066684_1005025013300005179SoilKLRLVNLTNQISCVAVLVWRGHEGRTGWELGLELQEPLADFWGLDF*
Ga0066681_1014086733300005451SoilAVGQKLDLVNLVNKNVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF*
Ga0070730_1050577133300005537Surface SoilKLINLINKNECEAVLIWRGHEGRTGWELGLELQGASMDFWGLDF*
Ga0070733_1116758513300005541Surface SoilKNACEAVLIWRGHEGRTGWELGLELQEASMEFWGVEF*
Ga0070695_10175569523300005545Corn, Switchgrass And Miscanthus RhizosphereLVNLVNKNSVDAILIWRGYEGRTGWELGLELQGPGEEFWGVDF*
Ga0070704_10025227133300005549Corn, Switchgrass And Miscanthus RhizosphereLINLVNKNSVGAILIWRGHEGRAGWELGLELQDAGEEFWGVDF*
Ga0066670_1088971223300005560SoilQISCEAVLVWRGHEGRTGWELGLELQEPLADFWGLDF*
Ga0066699_1083247213300005561SoilVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF*
Ga0066703_1003291153300005568SoilGQKLRLVNLTNQISCDAVLVWRGHEGRTGWELGLELQEPSPDFWGLDF*
Ga0066691_1021226713300005586SoilLRLVNLTNQISCEAVLVWRGHEGRTGWELGLELQEPSPDFWGLDF*
Ga0066903_10905642523300005764Tropical Forest SoilQNACESVLVWRGHEGRAGWELGLELQKMPADFWGLDF*
Ga0068860_10057786313300005843Switchgrass RhizosphereKNSVDAVLIWRGHEGRTGWELGLELQDAGEEFWGVDF*
Ga0075296_103007923300005877Rice Paddy SoilNLINQISCEATLVWRGHEGPTGWELGLELQEPSPDFWGLDF*
Ga0080027_1019597323300005993Prmafrost SoilGNISAARLIWRGHEGRTGWELGLELDNPPHDFWGLEF*
Ga0070715_1092160323300006163Corn, Switchgrass And Miscanthus RhizosphereQKLRLINLTNQHECDSVLVWRGHEGRSGWELGLELQKLPADFWGLDF*
Ga0070716_10156351113300006173Corn, Switchgrass And Miscanthus RhizosphereKNVCKAILIWRGHEGRTGWELGLELQNPPEDYWGLDF*
Ga0066660_1047278223300006800SoilINLMNQQACDSVLIWRGHEGRSGWELGLELQNTPADFWGLDF*
Ga0075433_1054147723300006852Populus RhizosphereNKNSVDAILIWRGHEGRTGWELGLELQDAGDEFWGVDF*
Ga0099794_1015498423300007265Vadose Zone SoilVNLINQISCEAVLIWRGHEGRTGWELGLELQEPSPDFWGLDF*
Ga0099830_1009017543300009088Vadose Zone SoilFPVGQKLRLVNLINQISCEAILIWRGHEGRTGWELGLELLEPSRDFWGLDF*
Ga0099830_1036349513300009088Vadose Zone SoilHISCEAILVWRGHEGRAGWELGLELQQPSPDFWGLDF*
Ga0099830_1046573023300009088Vadose Zone SoilLVNLINQISCEAVLVWRGHEGRTGWELGLELQEPSPGFWGLDI*
Ga0099828_1029286213300009089Vadose Zone SoilTNKKTSDAILIWRGHEGRAGWELGLEIQDKPEDFWGIQF*
Ga0099792_1083689523300009143Vadose Zone SoilGVGQKLRLVNLLNQISCEATLVWRGHEGRAGWELGLELQDPSPDFWGLDF*
Ga0116135_112018423300009665PeatlandTVGQRLNLVNLTNQSVCEAVLVWRGHEGRSGWELGLELQRMPSDFWGVDF*
Ga0126380_1025333633300010043Tropical Forest SoilGNVADAVLIWRGHEGRAGWELGIELQGFQEEFWGIDF*
Ga0126382_1041592513300010047Tropical Forest SoilLVNLVNKNSVDAVLIWRGHEGRTGWELGLELQGPGDEFWGVDF*
Ga0126378_1227340313300010361Tropical Forest SoilLGQKLELVNLVNKNVSKAILIWRGHEGRTGWELGLELENPPDDFWGLDF*
Ga0126379_1000791113300010366Tropical Forest SoilKNASNQKESDATLIWRGHEGRTGWELGLELLNPPADFWGLEF*
Ga0105239_1228447113300010375Corn RhizosphereHLINLTNQNVCEAILVWRGHEGRSGWELGLELQHATEEFWGVDF*
Ga0126381_10230531823300010376Tropical Forest SoilESDATLIWRGHEGRTGWELGLELLNPPADFWGLEF*
Ga0126383_1041173113300010398Tropical Forest SoilKNSVDAILVWRGHEGRTGWELGLELQGPGEEFWGVDL*
Ga0137389_1044620123300012096Vadose Zone SoilGQKLRLVNLINQISCEAVLVWRGHEGRAGWELGLELQQPSPDFWGLDF*
Ga0137388_1019589633300012189Vadose Zone SoilFTVGQKLRLVNLINQISCEAILIWRGHEGRTGWELGLELQEPSADFWGLDF*
Ga0137399_1158772323300012203Vadose Zone SoilGQRLRLVNKVNGNSCDAILIWRGHEGRSGWELGLELQGQQDDFWGVDF*
Ga0137362_1034376113300012205Vadose Zone SoilVNKNVAKAILIWRGHEGRTGWELGLELVNPPDDYWGLDF*
Ga0137380_1012335513300012206Vadose Zone SoilRLVNLINQISCEAVLVWRGHEGRTGWELGLELQEPSPDFWGLDF*
Ga0137378_1058820913300012210Vadose Zone SoilNKNVSKATLIWRGHEGRTGWELGLELQDPPEVFWGLDF*
Ga0137361_1057595213300012362Vadose Zone SoilVNLVNKNVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF*
Ga0137358_1028491923300012582Vadose Zone SoilVGQKLDLVNLVNKNVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF*
Ga0137413_1065781123300012924Vadose Zone SoilACEAILVWRGHEGRAGWELGLELQDPSPDFWGLDF*
Ga0137413_1071965013300012924Vadose Zone SoilNSCDAILIWRGHEGRSGWELGLELQGQQDDFWGVDF*
Ga0137419_1134661223300012925Vadose Zone SoilNVAKAILIWRGHEGRAGWELGLELVNPPDDYWGLDF*
Ga0137416_1044834413300012927Vadose Zone SoilNQNSCEAVLVWRGHEGRKGWELGLELQDATRDFWGLDV*
Ga0137404_1039051723300012929Vadose Zone SoilNSCEAVLVWRGHEGRKGWELGLELQDATLDFWGLDV*
Ga0126375_1111112013300012948Tropical Forest SoilTLINLVNQKSSDAILIWRGHEGHTGWELGLELQGPSEDFWGLDF*
Ga0157372_1327054223300013307Corn RhizosphereVCDSILVWRGHEGRSGWELGLELQKMPADFWGLDF*
Ga0182024_1052886013300014501PermafrostQSACESVLVWRGHEGRSGWELGLELQKMPADFWGVDF*
Ga0182038_1008579743300016445SoilLVNQANKNSVDAILVWRGQEGRTGWEVGLELQGAGEEFWGMDF
Ga0182038_1017835313300016445SoilEAADAILIWRGHEGRAGWELGIELQNAAEAFWGVEF
Ga0187802_1006829413300017822Freshwater SedimentNGSNARQSEAVLIWRGHEGRSGWELGLELVNPPEEFWGVDF
Ga0187780_1015619713300017973Tropical PeatlandMRLINLTNQISTEATVIWRGHEGPTGWELGVELLEPSPDFWGLDF
Ga0066669_1055295213300018482Grasslands SoilQKLDLVNLVNKNVSKATLIWRGHEGRTGWELGLELQDPPEDFWGLDF
Ga0210407_1117638223300020579SoilQSEAVLIWRGHEGRTGWELGLELRDPSPEFWGPDL
Ga0210403_1055687523300020580SoilQRLRLVNLVNSSQSEAVLIWRGHEGRTGWELGLELRDPSPEFWGPDL
Ga0210399_1060657223300020581SoilKLRLINLINQNACNSVLVWRGHEGRSGWELGLELESIPSDFWGLDF
Ga0210405_1108456623300021171SoilINKNASEAVLIWRGHEGRAGWELGLELQEASMDFWGVEF
Ga0210393_1133455023300021401SoilINKNACEAILIWRGHENRTGWELGLELQGASMDFWGVDF
Ga0210389_1071861123300021404SoilLVNLINKNACEAILIWRGHENRTGWELGLELQGAPMDFWGVDF
Ga0210387_1118597513300021405SoilVNLVNKNVAQALLVWRGHEGRTGWELGLELQDPPEDFWGLDF
Ga0210394_1095896423300021420SoilLLNQISCEAVLVWRGHEGRKGWELGLELQEPTADFWGLDF
Ga0210384_1160532613300021432SoilVGQRLRVINLTNQSACEAVLVWRGHEGRSGWELGLELQKMPAEFWGVDF
Ga0210391_1030190233300021433SoilLINQISCEAVLVWRGHEGRKGWELGLELQEPPLDFWGLDF
Ga0210398_1143191513300021477SoilACEAVLIWRGHEGRTGWELGLQLQDASMDFWGLDF
Ga0210402_1003028813300021478SoilNVAQALLVWRGHEGRTGWELGLELQDPPEDFWGLDF
Ga0210402_1096992713300021478SoilGQKLRLVNLTNQNACSSVLVWRGHEGRSGWELGLELESIPSDFWGLDF
Ga0126371_1278158613300021560Tropical Forest SoilNACESVLVWRGHEGRAGWELGLELQKMPADFWGLDF
Ga0137417_134420733300024330Vadose Zone SoilVGQKLRLVNLINQISCEAVLIWRGHEGRAGWELGLELQEPSPDFWGLDF
Ga0207671_1154464313300025914Corn RhizosphereTNQHVCDSILVWRGHEGRSGWELGLELQKMPADFWGLDF
Ga0209236_102289453300026298Grasslands SoilVGQKLRLVNLINQISCEAILIWRGHEGRTGWELGLELQEPSPDFWGLDF
Ga0209647_106049633300026319Grasslands SoilVNKVNGNSCDAILIWRGHEGRSGWELGLELQGQQDDFWGVDF
Ga0209647_120836213300026319Grasslands SoilINQISCEAILIWRGHEGRAGWELGLELREPSPDFWGLDF
Ga0209158_109904413300026333SoilLVNLTNQISCEAVLVWRGHEGRTGWELGLELQEPSPDFWGLDF
Ga0257150_104292813300026356SoilQKLRLVNLINQISCEAVLIWRGHEGRAGWELGLELQEPSPDFWGLDF
Ga0257179_102081823300026371SoilNLISCEATLIWRGHEGRAGWELGLELQEPSPDFWGLDF
Ga0257165_108948313300026507SoilISCEAILIWRGHEGRTGWELGLELQEPSPDFWGLDF
Ga0209806_128184213300026529SoilFNVGQKLRLVNLTNQISCDAVLVWRGHEGRTGWELGLELQEPSPDFWGLDF
Ga0209807_124570513300026530SoilLINLTNQHVCDSILVWRGHEGRSGWELGLELQKMPADFWGLDF
Ga0179587_1024362513300026557Vadose Zone SoilVNLINSSKSEAVLIWRGHEGRTGWELGLELHDPSPEFWGLDF
Ga0208365_102599623300027070Forest SoilQFSSDASVIWRGHEGPAGWELGVELLEPSPDFWGLDF
Ga0209004_102968023300027376Forest SoilDLVNLVNKNVCKAILIWRGHEGRTGWELGLELQNPPEDYWGLDF
Ga0209117_113776523300027645Forest SoilNLVNKNVAKAILIWRGHEGRTGWELGLELVNPPDDYWGLDF
Ga0209060_1003269413300027826Surface SoilVGQRLHLVNLTNQNVCEAILVWRGHEGRSGWELGLELQHATEEFWGVDF
Ga0209693_1007944433300027855SoilAVCEAILVWRGHEGRSGWEIGLELQHATEEFWGMDF
Ga0209166_1066721913300027857Surface SoilGFAVGQRLKLINLINKNECEAVLIWRGHEGRTGWELGLELQEASMEFWGLDF
Ga0209701_1007341813300027862Vadose Zone SoilFPVGQKLRLVNLINQISCEAILIWRGHEGRTGWELGLELLEPSRDFWGLDF
Ga0209488_1031856913300027903Vadose Zone SoilINQISCEAILIWRGHEGRTGWELGLELQEPSADFWGLDF
Ga0265352_100248113300028021SoilLINKNACEAILIWRGHEGRTGWELGLQLQEASMDFWGLDF
Ga0209526_1005785413300028047Forest SoilLINLTNQSTCAAILVWRGHEGRSGWELGLELQKMPVDFWGVDF
Ga0268264_1004577253300028381Switchgrass RhizosphereSVDAVLIWRGHEGRTGWELGLELQDAGEEFWGVDF
Ga0302233_1019993013300028746PalsaNKISEATLIWRGHEGRQGWELGLELLNPPDGFWAIDL
Ga0170834_10233741913300031057Forest SoilLVNKNVCKAVLIWRGHEGRTGWELGLELQHPPDDYWGLDF
Ga0170823_1314028413300031128Forest SoilNVAKAILIWRGHEGRTGWELGLELVNPPDDYWGLDF
Ga0170824_12767143023300031231Forest SoilVNLINQISCEAILIWRGHEGRTGWELGLELQEPSPDFWGLDF
Ga0318538_1041490713300031546SoilVLNMVNKEAADAILIWRGHEGRAGWELGIELQNAAEAFWGVEF
Ga0307468_10139089313300031740Hardwood Forest SoilVSEATLIWRGHEGRTGWELGLELQNAPAEFWGVDF
Ga0307468_10166757213300031740Hardwood Forest SoilAADAVLIWRGHEGRTGWELGLELREPPAEFWGVEF
Ga0307479_1054198113300031962Hardwood Forest SoilKLRLVNLTNQISCEAVLVWRGHQGRTGWELGLELREPSADFWGLDF
Ga0310911_1031933913300032035SoilVNRETADAILIWRGHEGRTGWELGLELQDAGQAFWGVEF
Ga0318505_1038557513300032060SoilANKNSVDAILVWRGQEGRTGWEVGLELQGAGEEFWGMDF
Ga0306924_1185374223300032076SoilFALGQRLNLVNLLNSNRSVVVLIWRGHEGRAGWELGLELQEPPADFWGVEF
Ga0318540_1044927323300032094SoilVNKQTADAILIWRGHEGRAGWELGVELQDAGEEFWGVEF
Ga0335085_1147553523300032770SoilNLVNKHQCEAIVVWRGHEGRKGWELGIELQNPSVEFWEVDF
Ga0335083_1050626623300032954SoilQSVCEAILVWRGHEGRSGWELGLELQHATEEFWGLDF


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.