NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101450



Overview

Basic Information
Family ID: F101450
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 102
Average Sequence Length: 48 residues
Representative Sequence: MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL
Number of Associated Samples: 90
Number of Associated Scaffolds: 102

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 63.73 %
% of genes near scaffold ends (potentially truncated): 99.02 %
% of genes from short scaffolds (< 2000 bps): 96.08 %
Associated GOLD sequencing projects: 85
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.43

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group: Unclassified (60.784 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (17.647 % of family members)
Environment Ontology (ENVO): Unclassified (37.255 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (41.176 % of family members)
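The percentages in the Overview are consistent with whole-number counts out of the 102 family sequences. The short Python sketch below checks this. It assumes, which the page does not state explicitly, that every percentage is computed over the 102 family members; the names FAMILY_SIZE, implied_count and the dictionary labels are illustrative only.

```python
# Number of sequences reported for family F101450 in the Overview.
FAMILY_SIZE = 102


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent * total / 100)


# Percentages copied from the Overview (labels shortened for illustration).
reported = {
    "valid RBS motifs": 63.73,
    "near scaffold ends": 99.02,
    "short scaffolds (< 2000 bps)": 96.08,
    "taxonomy: Unclassified": 60.784,
    "GOLD: Soil": 17.647,
    "ENVO: Unclassified": 37.255,
    "EMPO: Plant rhizosphere": 41.176,
}

for label, percent in reported.items():
    count = implied_count(percent)
    recomputed = 100 * count / FAMILY_SIZE
    print(f"{label}: {percent} % ~ {count}/{FAMILY_SIZE} = {recomputed:.3f} %")
```

Under that assumption, each reported value matches a count between 18 and 101 of the 102 sequences to within rounding.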




Multiple Sequence Alignments

Full Alignment
Alignment of all 102 sequences in the family. The alignment rows are labeled with the protein IDs listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 47.30%, β-sheet: 0.00%, Coil/Unstructured: 52.70%
Feature Viewer tracks for the representative sequence (MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL): Sequence, α-helices, β-strands, Coil, SS Conf. score

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.43

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
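The note above applies a simple cutoff: a model with pTM below 0.7 is treated as low confidence. A minimal Python sketch of that rule follows. The function name and the "not low" label for scores at or above the cutoff are assumptions made for illustration; the page only defines the low-confidence case.

```python
# Cutoff quoted on the page: pTM < 0.7 means a low confidence model.
PTM_CUTOFF = 0.7


def model_confidence(ptm: float) -> str:
    """Classify a pTM score with the page's cutoff (hypothetical helper)."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM scores lie between 0 and 1")
    # Only the 'low' case is defined on the page; 'not low' is our label.
    return "low" if ptm < PTM_CUTOFF else "not low"


print(model_confidence(0.43))  # pTM-score reported for family F101450
```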



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

Chart values, in the order shown:
All Organisms: 39.2%
Unclassified: 60.8%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Natural And Restored Wetlands; Soil; Terrestrial Soil; Grasslands Soil; Rice Paddy Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Corn Rhizosphere; Switchgrass Rhizosphere; Tabebuia Heterophylla Rhizosphere; Arabidopsis Thaliana Rhizosphere; Miscanthus Rhizosphere; Arabidopsis Rhizosphere; Populus Endosphere; Populus Rhizosphere; Rhizosphere.
Slice percentages, in chart order: 3.9%, 17.6%, 4.9%, 3.9%, 5.9%, 2.9%, 14.7%, 2.9%, 2.9%, 3.9%, 2.9%, 4.9%, 4.9%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPICI_01028750 | 2088090015 | Soil | MTDVREVGEDELERWIGVARAALDEADTVEGYLDWKRQARQTVWLLA
JGI11643J12802_100844963 | 3300000890 | Soil | MTDVREVGEDELERWIGVARAALDEADTVEGYLDWKRQARQTVWLLASDA
JGI10216J12902_1130392503 | 3300000956 | Soil | MTDVREVGEDELERWIGVARAALDEADTVEGYLDWKRQARQTVWLLASDAG
JGI24746J21847_10381381 | 3300001977 | Corn, Switchgrass And Miscanthus Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWK
Ga0055471_101456852 | 3300003987 | Natural And Restored Wetlands | VTSIREIHEEELGRWVEAMRAALDEADTVEGYLDWKRQAQQTVWL
Ga0055470_100855611 | 3300003992 | Natural And Restored Wetlands | VTEITEIHEPELERWVQTRKAAADEADTVEGYLDWKRQARE
Ga0055465_101173771 | 3300004013 | Natural And Restored Wetlands | MVAIREIHEDELGRWVDVTRAAFHETDTVEGYLDWKRQARETV
Ga0062593_1034241831 | 3300004114 | Soil | VPEIREIEEDELARWVAVTKAATNEADTVDGYLDWKRQARETVWLLASEGRDDVGAAIGI
Ga0063356_1024364261 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VPEVTEINEDELARWVAVVKAATNEADTAEGYLDWKRQARETVWLLASEGKD
Ga0062594_1031236652 | 3300005093 | Soil | VPEIREIEEDELARWVAVTKAATNEADTVEGYLDWKRQARETVWLLASEGKEDVGAAIGI
Ga0068869_1016699581 | 3300005334 | Miscanthus Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWRRQARETTWLLASSDD
Ga0070680_1008110542 | 3300005336 | Corn Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWL
Ga0070660_1007609102 | 3300005339 | Corn Rhizosphere | MTDVREVRENELERWIGIARAALDEADTVEGYLDWKRQARQTVWLLASDA
Ga0070687_1001435981 | 3300005343 | Switchgrass Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL
Ga0070675_1018590301 | 3300005354 | Miscanthus Rhizosphere | MTEIREVAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL
Ga0070693_1009041891 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | VAEIREIHEGELERWVAATGAATHEADTVEGYLDWKKQARETIWLLASD
Ga0068854_1002026401 | 3300005578 | Corn Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGR
Ga0068852_1022805931 | 3300005616 | Corn Rhizosphere | MTEIREIAEDELERWIVTAKAALDEADTVEGYLDW
Ga0075291_10381812 | 3300005884 | Rice Paddy Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARQTAWLLAS
Ga0081455_106569371 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MKRRYCHCVTSIREIHEDELDRWVATTRAALDEADTVEGYLDWKRQARETI
Ga0075365_102263421 | 3300006038 | Populus Endosphere | VLEIREIDERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLASDGK
Ga0075365_108060032 | 3300006038 | Populus Endosphere | MEIREIHEDELGRWVEAMRAGLDEADTADGYLDWKRQSRETVWILASED
Ga0075364_111205661 | 3300006051 | Populus Endosphere | MTEIREIAEDELERWIATAKEALDEADTVEGYLDWKRQARETIWLLASDEGRDVGT
Ga0075422_103709552 | 3300006196 | Populus Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDVGT
Ga0074053_118821152 | 3300006575 | Soil | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWKRQARE
Ga0074053_120183831 | 3300006575 | Soil | MAEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL
Ga0074047_119823301 | 3300006576 | Soil | MAEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARE
Ga0111539_101557171 | 3300009094 | Populus Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDW
Ga0075418_112431101 | 3300009100 | Populus Rhizosphere | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLASDGRSDVGAAIGI
Ga0114129_118969201 | 3300009147 | Populus Rhizosphere | VPEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLASD
Ga0111538_137639041 | 3300009156 | Populus Rhizosphere | VPEIREVDEDELDRWVAAMRTGLDEADTAEGYLDWKRQARETVWLLA
Ga0134122_122211641 | 3300010400 | Terrestrial Soil | VPEVREIHEDELARWVAGVKGATNEADTAEGYLDWKRQARETVWLLASDGKDDLGAAI
Ga0151489_13661021 | 3300011106 | Soil | MAEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDE
Ga0151490_12902742 | 3300011107 | Soil | VTEIREIVERELERWVATTKAAVDEADTVEGYLDWRRQARET
Ga0105246_111626841 | 3300011119 | Miscanthus Rhizosphere | VLEIREIGERDLARWVEVTKAATNEADTVEGYLDWKRQARQTAW
Ga0137424_10287391 | 3300011412 | Soil | VAEIREIHENELARWVAAMRSALDETDTVEGYLDWKRQAR
Ga0157346_10142281 | 3300012480 | Arabidopsis Rhizosphere | MTEIREIAEDELERWIATAKEALDEADTVEGYLDWKRQA
Ga0157285_102952112 | 3300012897 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARQTAWLLA
Ga0157291_102244472 | 3300012902 | Soil | VLEIREIGERDLARWVAVTSAATNEADTAEGYLDWRRQAHETAWLL
Ga0157282_102363181 | 3300012904 | Soil | VLEIRETDERDLARWVAVTSAATNEADTAEGYLDWKRQ
Ga0157283_103266751 | 3300012907 | Soil | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIW
Ga0157301_101478481 | 3300012911 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLASDGKADV
Ga0164241_114256651 | 3300012943 | Soil | VPEIREVDEDELDRWVAAMRTGLDEADTAEGYLDWKRQARETVWLLATEDGVDVGA
Ga0164301_114421942 | 3300012960 | Soil | VPEVREIHEDELARWVSVVAAASNETDTADGYLDWKRQARETVWQLASEGKD
Ga0164309_108869741 | 3300012984 | Soil | MPEIREIREDELARWVATTKEALDEADTVEGYLDWKRQARE
Ga0164305_109922341 | 3300012989 | Soil | MTGIREVREDELERWVATTKEALDEADTVEGYLDWKRQARETIW
Ga0163162_133296491 | 3300013306 | Switchgrass Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDVGTA
Ga0157377_112187561 | 3300014745 | Miscanthus Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLD*
Ga0157377_116641072 | 3300014745 | Miscanthus Rhizosphere | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLASDGRSDV
Ga0173480_104237943 | 3300015200 | Soil | VLEVREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARQTA
Ga0173480_111270832 | 3300015200 | Soil | VPEVREIHEDELARWVAVVKAATNEADTVEGYLDWKRQARETVWLLASEGKD
Ga0132256_1038207261 | 3300015372 | Arabidopsis Rhizosphere | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQARETAWLLA
Ga0190270_106477511 | 3300018469 | Soil | VTEIREIHEDELGRWVAAMRSALDETDTVEGYLDWKRQARETGWFLASDDGRD
Ga0190270_129696361 | 3300018469 | Soil | VTEIREIHEDELERWVDTMRVALDEADTAEGYLDWKRQARETAWFLASLDGRDVGAAI
Ga0066669_105133481 | 3300018482 | Grasslands Soil | VDEIRAIDDGELERWVAAMRAVDEHTDTVEGYLDWRRQAEATVWLLASDGEADIGAGL
Ga0173481_100039881 | 3300019356 | Soil | MTDVREVREDELERWIGVARAALDEADTVEGYLDWKRQARQTVWLLASDAGREVGTAIG
Ga0247753_10509122 | 3300022892 | Soil | VAEIREIHEGELERWVAATGAATHEADTVEGYLDWKKQARETIWLLASDGRDVGTAIG
Ga0247788_10728351 | 3300022901 | Soil | VAEIREIHEGELERWVAATGAATHEADTVEGYLDWK
Ga0247802_10139643 | 3300023077 | Soil | MTDVREVREDELERWIGVARAALDEADTVEGYLDWKRQARQTVWLLASDAG
Ga0247802_10391551 | 3300023077 | Soil | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDVGTAI
Ga0247748_10065613 | 3300023168 | Soil | MTEIREIAEDELERWIATAKAALDEADTVGGYLDWKRQARETIWLLASDEGRDVGTAIGV
Ga0247798_10221121 | 3300023260 | Soil | MTDVREVREDELERWIGVARAALDEADTVEGYLDWKRQARQTVWL
Ga0210087_10392693 | 3300025559 | Natural And Restored Wetlands | VTEIREIHEPELERWVQTRKAAADEADTVEGYLDWKRQAHQTVWLLASRGGQDVGVAIGV
Ga0207642_105625071 | 3300025899 | Miscanthus Rhizosphere | VAEIREIYEDELDRWVDAMRAALDETDTAEGYLDWKRQSRETGWFLATDGDE
Ga0207642_105787081 | 3300025899 | Miscanthus Rhizosphere | MEIREIHEDELGRWVEAMRAGLDEADTADGYLDWKRQSRETVWILASEDGQDVGAAI
Ga0207680_111442362 | 3300025903 | Switchgrass Rhizosphere | VTEIHEIDERELERWVATTKAAVDEADTVEGYLDWR
Ga0207654_103770363 | 3300025911 | Corn Rhizosphere | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDVG
Ga0207662_102463733 | 3300025918 | Switchgrass Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWRRQARETTWLLASSDDVDV
Ga0207681_116443842 | 3300025923 | Switchgrass Rhizosphere | MTDVREVREDELERWIGVARAALDEADTVEGYLDWKRQA
Ga0207650_106144101 | 3300025925 | Switchgrass Rhizosphere | VTKIREIDERELERWVATTKAAVDEADTVEGYLDWRRQARETTWLLASSDDVDV
Ga0207659_111435391 | 3300025926 | Miscanthus Rhizosphere | VAEIREIHEGELERWVAATGAATHEADTVEGYLDWKKQARETI
Ga0207670_110316592 | 3300025936 | Switchgrass Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWKRQARETAWLLASSDDAD
Ga0207711_110060843 | 3300025941 | Switchgrass Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYRDWRRQARETTWLLA
Ga0207689_115744271 | 3300025942 | Miscanthus Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYRDWR
Ga0207651_101435561 | 3300025960 | Switchgrass Rhizosphere | MTEIREIAEDELERWIVTAKAALDEADTVEGYLDWKRQARETIWLLASDKGRDVGTAIGV
Ga0207712_108490361 | 3300025961 | Switchgrass Rhizosphere | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWRRQA
Ga0207668_105592153 | 3300025972 | Switchgrass Rhizosphere | VAEIREIHEGELERWVAATGAATYEADTVEGYLDWKKQARE
Ga0207639_111699932 | 3300026041 | Corn Rhizosphere | VPEVREIHEDELARWVAVVKAATNEADTAEGYLDW
Ga0208912_10085881 | 3300026090 | Natural And Restored Wetlands | VTEIREIHEGELGRWVDAMRAALDEADTVEGYLDWKRQAQQTVW
Ga0207434_10223661 | 3300026952 | Soil | MTEIREIAEDELERWIVTAKAALDEADTVEGYLDWKRQ
Ga0207582_10228972 | 3300026960 | Soil | MTEIREIAEDELERWIATAKAALDEADTIEGYLDWKRQARE
Ga0247828_104413552 | 3300028587 | Soil | VAEIREIHEGELERWVAATGAATHEADTVEGYLDWKKQARETIWLL
Ga0247818_104676083 | 3300028589 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKKQPR
Ga0247818_105102673 | 3300028589 | Soil | VPEIREVNEDELDRWVAAMRTGLDEADTAEGYLDWKRQAR
Ga0247823_110398682 | 3300028590 | Soil | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARE
Ga0247821_102817983 | 3300028596 | Soil | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDV
Ga0247825_105853453 | 3300028812 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKKQARETAWLLL
Ga0247825_112553921 | 3300028812 | Soil | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWRRQARETGWFLASDDGHDVG
Ga0247826_112825072 | 3300030336 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKKQARETAWLLASDGKA
Ga0307495_101359292 | 3300031199 | Soil | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWKRQARETAWLLASSDDADVG
Ga0307497_100771153 | 3300031226 | Soil | VTEIREIDERELERWVATTKAAVDEADTVEGYLDWKRQARETAWLLASSDDADVGTAI
Ga0307497_103384771 | 3300031226 | Soil | MTEVREVTEDELERWVATTKEALDEADTVEGYLDWKRQARETIWLLATDR
Ga0307408_1020552962 | 3300031548 | Rhizosphere | MTEIREVAEEELERWIATAKAALDEADTVEGYLDWKR
Ga0310886_102596813 | 3300031562 | Soil | VAEIREIYEDELDRWVDAMRAALDETDTAEGYLDWKRQS
Ga0310813_113379122 | 3300031716 | Soil | VPEIREIGERDLARWVAVTKAATNEADTVEGYLDWKRQA
Ga0310884_103990292 | 3300031944 | Soil | VPEIREVDEDELDRWVAAMRTGLDEADTAEGYLDWKRQARETVWLLATGDG
Ga0307409_1029444671 | 3300031995 | Rhizosphere | MELREIHEDELGRWVDAMRAALDEADTVEGYLDWKRQAR
Ga0310897_102889292 | 3300032003 | Soil | MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLLASDEGRDVGTAIGV
Ga0310895_103373622 | 3300032122 | Soil | VAEIREIYEDELDRWVDAMRAALDETDTAEGYLDWKRQSRETGWFLATDG
Ga0247830_100871643 | 3300033551 | Soil | MPEVREIHEDELGRWVEAMRSALDEADTVEGFLDWKRQARETGWFLASHD
Ga0247830_103763941 | 3300033551 | Soil | VLEIREIGERDLARWVAVTKAATNEADTVEGYLDWKKQARETAW
Ga0247830_109337921 | 3300033551 | Soil | VAEIREIHEGELERWVAATGAATYEADTVEGYLDWKKQARETIWLLASDGRDVGTAIGIG
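Sequence lengths in this table vary because most genes are truncated at scaffold ends, and one entry ends in "*". The Python sketch below shows one way to measure residue lengths for a few rows copied from the table. It assumes the trailing "*" marks a translation stop, as is conventional in protein FASTA files; the page itself does not say so, and the helper name residue_length is hypothetical.

```python
from statistics import mean


def residue_length(sequence: str) -> int:
    """Count residues, ignoring a trailing '*' (assumed to be a stop marker)."""
    return len(sequence.rstrip("*"))


# Three rows copied from the Family Sequences table (protein ID -> sequence).
examples = {
    "Ga0070687_1001435981": "MTEIREIAEDELERWIATAKAALDEADTVEGYLDWKRQARETIWLL",
    "Ga0157377_112187561": "MTEIREIAEDELERWIATAKAALDEADTVEGYLD*",
    "Ga0111539_101557171": "MTEIREIAEDELERWIATAKAALDEADTVEGYLDW",
}

lengths = {pid: residue_length(seq) for pid, seq in examples.items()}
for pid, length in lengths.items():
    print(f"{pid}: {length} residues")
print(f"mean over these examples: {mean(lengths.values()):.2f}")
```

The mean printed here covers only the three example rows, not the family-wide average of 48 residues reported in the Overview.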




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice