NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F050633

Metagenome / Metatranscriptome Family F050633

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F050633
Family Type Metagenome / Metatranscriptome
Number of Sequences 145
Average Sequence Length 88 residues
Representative Sequence MDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQMVMEGLSGKDRGVQFHAACFYVWDATRQVLGYRPSGPAD
Number of Associated Samples 121
Number of Associated Scaffolds 145

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 79.31 %
% of genes near scaffold ends (potentially truncated) 32.41 %
% of genes from short scaffolds (< 2000 bps) 77.24 %
Associated GOLD sequencing projects 116
AlphaFold2 3D model prediction Yes
3D model pTM-score0.67

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (94.483 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(11.724 % of family members)
Environment Ontology (ENVO) Unclassified
(40.690 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(48.276 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.
1GPICI_02706260
2GPICI_00684480
3JGI10220J13317_117424011
4JGI24751J29686_101090051
5Ga0062593_1000180204
6Ga0062593_1020580271
7Ga0058897_109673513
8Ga0062589_1013657491
9Ga0063356_1045483412
10Ga0062595_1000214493
11Ga0062595_1016401521
12Ga0062592_1010305831
13Ga0058863_113085692
14Ga0058862_121124802
15Ga0068995_100876071
16Ga0065712_105506563
17Ga0065705_103468532
18Ga0065705_110657081
19Ga0065707_101822722
20Ga0070677_106596271
21Ga0068868_1000139675
22Ga0068868_1003781312
23Ga0070689_1001274593
24Ga0070667_1013337492
25Ga0070667_1017355002
26Ga0070708_1000633073
27Ga0070708_1001908802
28Ga0070663_1001455022
29Ga0070706_1010612021
30Ga0070698_1008035262
31Ga0070697_1014999892
32Ga0070695_1000282601
33Ga0070695_1012450301
34Ga0070704_1000492021
35Ga0068859_1009049741
36Ga0068858_1000659035
37Ga0075024_1003713152
38Ga0075028_1003786523
39Ga0075018_103500742
40Ga0079222_100540174
41Ga0079222_101730701
42Ga0079222_107782691
43Ga0068865_1011189302
44Ga0079219_100041983
45Ga0099791_100234242
46Ga0099794_102591263
47Ga0105107_108324812
48Ga0105240_111500522
49Ga0105240_117449861
50Ga0105245_120287581
51Ga0114129_104663333
52Ga0105243_101466795
53Ga0075423_104037083
54Ga0075423_104563561
55Ga0105100_108969211
56Ga0105242_109094832
57Ga0105248_127680931
58Ga0105061_11045371
59Ga0134128_101657303
60Ga0134121_130863301
61Ga0134123_100453705
62Ga0105246_102663131
63Ga0150983_102045862
64Ga0150983_137975621
65Ga0137397_102347501
66Ga0137404_100568026
67Ga0164303_1000130913
68Ga0164304_102387762
69Ga0164305_100308305
70Ga0157371_101336092
71Ga0157374_104043032
72Ga0157378_100440623
73Ga0163162_126523842
74Ga0157372_100262526
75Ga0157377_102217212
76Ga0157377_104072641
77Ga0180063_10304361
78Ga0180085_12217231
79Ga0137403_100331149
80Ga0132256_1004392432
81Ga0184610_12363461
82Ga0184608_104161881
83Ga0184620_101151112
84Ga0184621_100862172
85Ga0184632_102420341
86Ga0190272_113157471
87Ga0180116_11784011
88Ga0193715_10499392
89Ga0193707_10227411
90Ga0193707_10295952
91Ga0193713_10915832
92Ga0193713_11646781
93Ga0193728_12286212
94Ga0193739_10063863
95Ga0180109_10198641
96Ga0206356_108224022
97Ga0206356_115686231
98Ga0210407_100644615
99Ga0210403_102741581
100Ga0210401_110744012
101Ga0210378_100139752
102Ga0193719_102759721
103Ga0210384_1000085227
104Ga0210384_100518375
105Ga0222622_104069521
106Ga0207697_101663742
107Ga0207656_102518411
108Ga0207653_100215672
109Ga0207653_100441672
110Ga0207682_100469151
111Ga0207688_1000204411
112Ga0207647_105753991
113Ga0207645_100072009
114Ga0207645_100523133
115Ga0207684_107725282
116Ga0207690_109445721
117Ga0207706_108412361
118Ga0207669_109365911
119Ga0210089_10209021
120Ga0207658_112253881
121Ga0207703_100581322
122Ga0207639_109618102
123Ga0207641_101107895
124Ga0257176_10621941
125Ga0257172_10227231
126Ga0207467_10118522
127Ga0209329_10563261
128Ga0209626_11975481
129Ga0209974_100707691
130Ga0209526_1001072911
131Ga0307504_100261533
132Ga0307504_100720982
133Ga0307299_100640191
134Ga0307312_102129312
135Ga0299907_106606451
136Ga0247612_10933651
137Ga0247651_101349431
138Ga0307469_106120622
139Ga0307471_1003245031
140Ga0307471_1003918281
141Ga0307471_1005609872
142Ga0307471_1007277672
143Ga0307472_1015932292
144Ga0335081_112923332
145Ga0370495_0144803_6_269
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 20.87%    β-sheet: 16.52%    Coil/Unstructured: 62.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

1020304050607080MDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQMVMEGLSGKDRGVQFHAACFYVWDATRQVLGYRPSGPADSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.67
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
94.5%5.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Groundwater Sediment
Natural And Restored Wetlands
Soil
Groundwater Sediment
Watersheds
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Switchgrass Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Agricultural Soil
Soil
Soil
Hardwood Forest Soil
Soil
Untreated Peat Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Groundwater Sand
Host-Associated
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
3.4%11.7%3.4%4.8%6.2%4.1%4.1%7.6%3.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPICI_027062602088090015SoilMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG
GPICI_006844802088090015SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA
JGI10220J13317_1174240113300001139SoilMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLG
JGI24751J29686_1010900513300002459Corn, Switchgrass And Miscanthus RhizosphereRLPHNHIPRMWGGSGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0062593_10001802043300004114SoilMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0062593_10205802713300004114SoilMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0058897_1096735133300004139Forest SoilMDKPVLRLMIREKLADGRLPHDSIPRMWGGPGNGETCDGCGEIVTKTQMVMEGLSTKNHGVQLHVTCFHVWDVERQVLGHEPSGPA*
Ga0062589_10136574913300004156SoilMEKPALRIMIQERLADGRLPHDHIPRIWGGPGNGETCDGCDEPVTSTQMVMEGLSTTNGGVQFHVACFYVWDVERQVLGHEPSGPA*
Ga0063356_10454834123300004463Arabidopsis Thaliana RhizosphereMDRLTLRLMIQGKVADGRLPHHYIPRVGGGLGNGETCDGCGETVTKTQVVMGGLSGSDRGVQFHDVCFYVWDATRQVRGYGPSGPAD*
Ga0062595_10002144933300004479SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0062595_10164015213300004479SoilLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKGSGADGTGVQFHITCFHLWDVERQVLGHEPSGPAE*
Ga0062592_10103058313300004480SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFH
Ga0058863_1130856923300004799Host-AssociatedMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHEPSGPA*
Ga0058862_1211248023300004803Host-AssociatedMDRPALTLQIQAKLADGRLPHNHIPRMWGGSGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0068995_1008760713300005206Natural And Restored WetlandsMEKPTLRLMIQQKLADGRLPNNHIPRMWGGPGNGEICDGCDEIVTKAQMIMEGLSGKDSGVQFHVACFYVWDVERQVLGHEPSGPA*
Ga0065712_1055065633300005290Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQ
Ga0065705_1034685323300005294Switchgrass RhizosphereMDKSTLRLMIQDKVADGRLPHHYIPRVGGGLGNGETCDGCAEPVTKAQVLMEGLSGNARRGVKFHGACFYVWDATRQVFGYRASGPAD*
Ga0065705_1106570813300005294Switchgrass RhizosphereMDRPALTLLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0065707_1018227223300005295Switchgrass RhizosphereMDRLTLTRMIQKKLADGRLPHNHIPRLWGGPGNGETCDGCEETVTKAQMLMEGLSAKSMGVQLHVTCFHVWDAERQVLGHEPSGPA*
Ga0070677_1065962713300005333Miscanthus RhizosphereMDKPTLRLLIRAKLADGRVPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHEPSGPA*
Ga0068868_10001396753300005338Miscanthus RhizosphereMDRPALTLLIQARLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0068868_10037813123300005338Miscanthus RhizosphereKLADGRLPHNHIPRMWGGSGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0070689_10012745933300005340Switchgrass RhizosphereTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0070667_10133374923300005367Switchgrass RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGETVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0070667_10173550023300005367Switchgrass RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHET
Ga0070708_10006330733300005445Corn, Switchgrass And Miscanthus RhizosphereMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEICDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHRPSGPLD*
Ga0070708_10019088023300005445Corn, Switchgrass And Miscanthus RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHLWDVERQVIGHEPSGPA*
Ga0070663_10014550223300005455Corn RhizosphereMDEPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0070706_10106120213300005467Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDME
Ga0070698_10080352623300005471Corn, Switchgrass And Miscanthus RhizosphereMNRATLRLLIKDKLADGRLPHNHIPRMGGGPGDGEICDGCGEIVAKSQMIMEGLSGKDRGVEFHVACFYVWDATRQVFGHRPSGPLD*
Ga0070697_10149998923300005536Corn, Switchgrass And Miscanthus RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTETQMLMEGLSKGSGADGTGVQFHITCFHLWDVERQVLGHEP
Ga0070695_10002826013300005545Corn, Switchgrass And Miscanthus RhizosphereLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0070695_10124503013300005545Corn, Switchgrass And Miscanthus RhizosphereMDRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHKPSGPLD*
Ga0070704_10004920213300005549Corn, Switchgrass And Miscanthus RhizosphereMDKPTLRLMIQDKVADGRLPHHYIPRVGGGLGNGETCDACAEPVTKAQVLMEGLSGNARRGVKFHGACFYVWDATRHVFGYKASGPAD*
Ga0068859_10090497413300005617Switchgrass RhizosphereADGRLPHDHIPRIWGGPGNGETCDGCDEPVTSTQMVMEGLSTTNGGVQFHVACFYVWDVERQVLGHEPSGPA*
Ga0068858_10006590353300005842Switchgrass RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDAIGVQLHVTCFHLWDVERQVIGHDPSGPA*
Ga0075024_10037131523300006047WatershedsMDRPALRIMIRERLADGRLPHNHIPRLWGGPGNGETCDGCGETVTKGQMLMEGLSAKSSGVQLHVACFHVWDVERQVLGHEPSQPA*
Ga0075028_10037865233300006050WatershedsMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHEPSGPLD*
Ga0075018_1035007423300006172WatershedsMDKPTLRILIREKLADGRLPHNHIPRMWGGPGSGETCDGCDEIVTSTQMLMEGLSKDSGTKDSGVQFHITCFHLWDVERQVAGHDPSGLA*
Ga0079222_1005401743300006755Agricultural SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGNGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0079222_1017307013300006755Agricultural SoilMDRPALTLLIQVKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0079222_1077826913300006755Agricultural SoilDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0068865_10111893023300006881Miscanthus RhizosphereIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHEPSGPA*
Ga0079219_1000419833300006954Agricultural SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGNGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHDPSGPA*
Ga0099791_1002342423300007255Vadose Zone SoilMDRSILRHRIQEKLADGRLPHEHIPSIFWGGPGNGETCDGCGETVTKGQMVMELSTKDSGAQFHVACFHVWDEERRALGQDPSGPA*
Ga0099794_1025912633300007265Vadose Zone SoilMDRSILRHLIQEKLADGRLPHEHIPSIFWGGPGNGETCDGCGETVTKGQMVMELSTKDSGAQFHVACFHVWDEERRALGQDPSGPA*
Ga0105107_1083248123300009087Freshwater SedimentMDKPTLRLMIQDKLTDGRLPLHHIPRVGGGLGNGETCDGCGETVTKAQVVMEGLSGKDRDVQFHVACFYVWDATRQVLGQKPSGPADN*
Ga0105240_1115005223300009093Corn RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECF
Ga0105240_1174498613300009093Corn RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVELQVIGHDPSGPA*
Ga0105245_1202875813300009098Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSVTDAAHADGIGVQFHVECFQVWDAERQVLGLDPSQPA*
Ga0114129_1046633333300009147Populus RhizosphereMDRPALTLLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAE
Ga0105243_1014667953300009148Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0075423_1040370833300009162Populus RhizosphereMDRPALTLLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0075423_1045635613300009162Populus RhizosphereLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0105100_1089692113300009166Freshwater SedimentMIQDKLTDGRLPLHHIPRVGGGLGNGETCDGCGETVTKAQVVMEGLSGKDRDVQFHVACFYVWDATRQVLGQKPSGPADN*
Ga0105242_1090948323300009176Miscanthus RhizosphereMDRPALTLQMQAKLADGRLPHNHIPRMWGGSGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0105248_1276809313300009177Switchgrass RhizosphereMNQDALRTLVRQKLADGRLPNNHIPRVWGGPGAGETCDACEEVVTKAQLIMEGITLSVGRESVQFHVMCFNVWDAERQVAGHDP
Ga0105061_110453713300009807Groundwater SandMIQDKVADGRLPHNHIPRVGGGRGNGETCDGCGEAVTKVQVVMEGLSGKDRGVHFHAACFYVWDATRQVLGRKPSGPADN*
Ga0134128_1016573033300010373Terrestrial SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDLERQVIGHEPSGPA*
Ga0134121_1308633013300010401Terrestrial SoilMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQLHVACFYVWDATRQVFGHKP
Ga0134123_1004537053300010403Terrestrial SoilMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMGRQVLGHDPSRPPMGHETGGLHG*
Ga0105246_1026631313300011119Miscanthus RhizosphereRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0150983_1020458623300011120Forest SoilMDKPVLRLMIREKLPDGRLPHDSIPRMWGGPGNGETCDGCGEIVTKTQMVMEGLSSKDLGVQFHVACFYVTSSAKSSGMTRAVNA*
Ga0150983_1379756213300011120Forest SoilMIRERLADGRLPHNHIPRLWGGPGNGETCDGCGETVTKGQMLMEGLSAKSSGVQLHVACFHVWDVERQVLGHEPSQPA*
Ga0137397_1023475013300012685Vadose Zone SoilMDRCILRHLIQEKLADGRLPHEHIPSIFWGGPGNGETCDGCGETVTKGQMVMELSTKDSGAQFHVACFHVWDEERRALGQDPSGPA*
Ga0137404_1005680263300012929Vadose Zone SoilMIQDKVADGRLPHHYIPRVGGGLGNGETCDGCAEPVTKAQVLMEGLSGNARRGVKFHGACFYVWDATRQVLGYKASGPAD*
Ga0164303_10001309133300012957SoilMNRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0164304_1023877623300012986SoilLSDATRFSMDRPALTLLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0164305_1003083053300012989SoilMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGIGGVGGRFPFDGFPCWDLGGQSSAHDQVRQRWG
Ga0157371_1013360923300013102Corn RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGRGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0157374_1040430323300013296Miscanthus RhizosphereMDRPALTLLIQAKLADGSLPNNHIPRMWGGPANGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0157378_1004406233300013297Miscanthus RhizosphereMDRPARTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0163162_1265238423300013306Switchgrass RhizosphereLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDVERQVIGHEPSGPA*
Ga0157372_1002625263300013307Corn RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDAIGVQLHVTCFHLWDVERQVIGHEPSGPA*
Ga0157377_1022172123300014745Miscanthus RhizosphereRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG*
Ga0157377_1040726413300014745Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA*
Ga0180063_103043613300014885SoilMDKPTLRLMIQDKLADGRLPLHHIPRVGGGLGNGETCDGCGQTVTKAQVVMEGLSGKDRDVRFHVACFYVWDATRQVRGHKPSGPADN*
Ga0180085_122172313300015259SoilMDKPTLRLMIQDKLADGRLPLHHIPRVGGGLGNGETCDGCGETVRKTQVVMEGLSGRNRDVQFHVACFYVWDATRRVF
Ga0137403_1003311493300015264Vadose Zone SoilMDKPTLRLMIQDKVADGRLPHHYIPRVGGGLGNGETCDGCAEPVTKAQVLMEGLSGNARRGVKFHGACFYVWDATRQVLGYKASGPAD*
Ga0132256_10043924323300015372Arabidopsis RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGETVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHEPSGPA*
Ga0184610_123634613300017997Groundwater SedimentMDKSTLRLMIHDKLADGRLPHNYIPRVGGGLGNGETCDGCGEAVTKTQVVMEGLSGTARRGVKFHAACFYVWDATRQVL
Ga0184608_1041618813300018028Groundwater SedimentMDKSTLRLMIHDKLADGRLPHNYIPRVGGGLGNGETCDGCGELVTKTQVVMEGLSGKDRDVQFHVACFYVWDAARRVLGYKASGPADN
Ga0184620_1011511123300018051Groundwater SedimentMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQVVMEGLSGKDRDVQFHVACFYVWDATRRVLGYKASGPADN
Ga0184621_1008621723300018054Groundwater SedimentMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQVVMEGLSGKDRDVQFHVACFYVWDAPSPWVQGERAGG
Ga0184632_1024203413300018075Groundwater SedimentMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQVVMEGLSGKDRDVQFHVACFDVWDATRRVLGYKASGPAD
Ga0190272_1131574713300018429SoilMDQPTLRLMIHDKVADGRLPHHYIPRVGGGLGNGETCDGCGETVTKTQVVMEGLSGKARRGVKFHAACFYVWDATRQVLGYRPSGPAD
Ga0180116_117840113300019229Groundwater SedimentPDSGEASRRPLPHDHIPRMWGGHGDGETCDGCGEIVAKAQMVMEGVDARGGGVQFHVECFYLWDAERQPPGHEPSAPA
Ga0193715_104993923300019878SoilMDKSTLRLMIHDKLADGRLPHNYIPRVGGGLGNGETCDGCGELVTKTQVVMEGLSGKDRDVQFHVTCFHVWDATRQVLGHRASGPAD
Ga0193707_102274113300019881SoilMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQMVMEGLSGKDRDVQFHVACFYVWDATRRVLGYKASGPADN
Ga0193707_102959523300019881SoilMDKSTLRLMIQDKVADGRLPHHYIPRVGGGLGNGETCDGCGEAVTKTQVVMEGLSGKDRGVQFHAACFYIWDATRQVLGYRPSGPAD
Ga0193713_109158323300019882SoilMDQPTLRLMIHDKVADGRLPHHYIPRVGGGLGNGETCDGCGEAVTKTQVVMEGLSGKDRGVQFHAACFYIWDATRQVLGYRPSGPAD
Ga0193713_116467813300019882SoilPRHIARLSMERPTLRLMIQEKLADGRLPNNHIPRIWGGPGNGEICDGCDEIVTKGQMIMEGLSGKDSGVQFHVACFYVWDVERQVLGHEPSGPA
Ga0193728_122862123300019890SoilMNRATLRLLIKDKLADGRLPHNHVPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHKPSGPLD
Ga0193739_100638633300020003SoilMDQPTLRLMIHDKVADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQMVMEGLSGKDRDVQFHVACFYVWDATRRVLGYKASGPADN
Ga0180109_101986413300020067Groundwater SedimentLTILRLLIREKLADGRLPHDHIPRMWGGHGDGETCDGCGEIVAKAQMVMEGVDARGGGVQFHVECFYLWDAERQPPGHEPSAPA
Ga0206356_1082240223300020070Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECSHVWDMERQVLGHDPSRPPMGHETGGLHG
Ga0206356_1156862313300020070Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0210407_1006446153300020579SoilMDKPVLRLMIREKLADGRLPHDSIPRMWGGPGNGETCDGCGEIVTKTQMVMEGLSTKNHGVQLHVTCFHVWDVERQVLGHEPSGPA
Ga0210403_1027415813300020580SoilALRIMIRERLADGRLPHNHIPRLWGGPGNGETCDGCGETVTKGQMLMEGLSAKSSGVQLHVACFHVWDVERQVLGHEPSQPA
Ga0210401_1107440123300020583SoilRIMIRERLADGRLPHNHIPRLWGGPGNGETCDGCGETVTKGQMLMEGLSAKSSGVQLHVACFHVWDVERQVLGHEPSQPA
Ga0210378_1001397523300021073Groundwater SedimentMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQVVMEGLSGKDRDVQFHVACFYVWDATRRVLGYKASGPAD
Ga0193719_1027597213300021344SoilLERPTLRLMIREKLADGRLPNNHIPRIWGGPGNGEICDGCDELVTKGQMIMVGLSGKDSGVQFHVACFYVWDVDRQVLGHEPSGPA
Ga0210384_10000852273300021432SoilMDRPALRIMIRERLADGRLPHNHIPRLWGGPGNGETCDGCGETVTKGQMLMEGLSAKSSGVQLHVACFHVWDVERQVLGHEPSQPA
Ga0210384_1005183753300021432SoilMDKPVLRLMIREKLPDGRLPHDSIPRMWGGPGNGETCDGCGEIVTKTQMVMEGLSSKDLGVQFHVACFYVTSSAKSSGMTRAVNA
Ga0222622_1040695213300022756Groundwater SedimentMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQMVMEGLSGKDRGVQFHAACFYVWDATRQVLGYRPSGPAD
Ga0207697_1016637423300025315Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGSGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0207656_1025184113300025321Corn RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLH
Ga0207653_1002156723300025885Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0207653_1004416723300025885Corn, Switchgrass And Miscanthus RhizosphereMENPALRIMIQERLADGRLPHDHIPRIWGGPGNGETCDGCDEPVTSTQMVMEGLSTTNGGVQFHVACFYVWDVERQVLGHEPSGPA
Ga0207682_1004691513300025893Miscanthus RhizosphereLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG
Ga0207688_10002044113300025901Corn, Switchgrass And Miscanthus RhizosphereMDRPALTLLIQVKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0207647_1057539913300025904Corn RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHAWDV
Ga0207645_1000720093300025907Miscanthus RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQVIGHEPSGPA
Ga0207645_1005231333300025907Miscanthus RhizosphereMEKPALRIMIQERLADGRLPHDHIPRIWGGPGNGETCDGCDEPVTSTQMVMEGLSTTNGGVQFHVACFYVWDVERQVLGHEPSGPA
Ga0207684_1077252823300025910Corn, Switchgrass And Miscanthus RhizosphereMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEICDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHRPSGPLD
Ga0207690_1094457213300025932Corn RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDATGVQLHVTCFHVWDVERQ
Ga0207706_1084123613300025933Corn RhizosphereLRIIIQERLADVRLPHDHIPRIWGGPGNGETCDGCDEPVTSTQMVMEGLSTTNGGVQFHVACFYVWDVERQVLGHEPSGPA
Ga0207669_1093659113300025937Miscanthus RhizosphereLTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETGGLHG
Ga0210089_102090213300025957Natural And Restored WetlandsMEKPTLRLMIQQKLADGRLPNNHIPRMWGGPGNGEICDGCDEIVTKAQMIMEGLSGKDSGVQFHVACFYVWDVERQVLGHEPSGPA
Ga0207658_1122538813300025986Switchgrass RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGISGVGVQFHVECFHVWDMERQVLGHDPSRPPMGHETG
Ga0207703_1005813223300026035Switchgrass RhizosphereMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGSGETCDGCGEIVTKTQMLMEGLSKDGGPDAIGVQLHVTCFHLWDVERQVIGHDPSGPA
Ga0207639_1096181023300026041Corn RhizosphereLTLQIQAKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSKSQMVMEGLSVTDAAHADGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0207641_1011078953300026088Switchgrass RhizosphereMDRPALTLLIQTKLADGRLPHNHIPRMWGGPGNGETCDGCGETVSPSQMVMEGLSVTDAASVNGIGVQFHVECFQVWDAERQVLGHDPSQPA
Ga0257176_106219413300026361SoilMDRSILRHLIQEKLADGRLPHEHIPSIFWGGPGNGETCDGCGETVTKGQMVMELSTKDSGAQFHVACFHVWDEERRALGQDPSGPA
Ga0257172_102272313300026482SoilSTMDRSILRHLIQEKLADGRLPHEHIPSIFWGGPGNGETCDGCGETVTKGQMVMELSTKDSGAQFHVACFHVWDEERRALGQDPSGPA
Ga0207467_101185223300027036SoilMDKPTLRLLIRAKLADGRLPQDHIPRMWGGPGNGESCDACEEIITRAQFVIEGVSTSPGGLGVQLHVRCFQVWDAERQVPGHEASGPV
Ga0209329_105632613300027605Forest SoilMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFY
Ga0209626_119754813300027684Forest SoilMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQ
Ga0209974_1007076913300027876Arabidopsis Thaliana RhizosphereMDRPALTLLIQAKLADGRLPNNHIPRMWGGPGNGETCDGCGETVTKSQMVMEGLSVTDATGVGDVGVQFHVECFHVWDVERQVLGHDPSRPPMGHETGGLHG
Ga0209526_10010729113300028047Forest SoilMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHEPSGPLD
Ga0307504_1002615333300028792SoilMNRATLRLLIKDKLADGRLPHNHIPRMWGGPGNGEVCDGCGEIVAKSQMIMEGLSGKDRGVQFHVACFYVWDATRQVFGHKPSGPLD
Ga0307504_1007209823300028792SoilMDKPALRILIREKLADGRLPHNHIPRMWGGPGSGETCDGCDEVVTSTQMLMEGLSKDSGTKDSGVQFHITCFHLWDVERQVAGHDPSGLA
Ga0307299_1006401913300028793SoilMDQPTLRLMIHDKLADGRLPHNHIPRMWGGPGNGEICDGCGETVTKTQVVMEGLSGKDRDVQLHVACFYVWDATRRVLGYKASGPADN
Ga0307312_1021293123300028828SoilMDRPTLRLMIQDKVADGRLPHHYIPRVGGGLGNGETCDGCGETVTKTQVVMEGLSGKDRGVQFHTACFYIWDATRQVLGYKPSGPAD
Ga0299907_1066064513300030006SoilMDKPTLRLMIHDKVADGRLPHHYIPRVGGSLGDGGTCDGCGETVTKTQVVMEGLSGKDRDVQFHVACFYVWDATRRVLGYKASGPADN
Ga0247612_109336513300030592SoilALRLMIQGKLVDGRLPNNHIPRMWGGPGNGEICDGCDEVVTKAQMIMEGLSSKGLRRAVPRRLLYVWDLERQVLGHEPSGPA
Ga0247651_1013494313300030608SoilLESWKHREVTHGKLALRLMIQGKLVDGRLPNNHIPRMWGGPGNGEICDGCDEVVTKAQMIMEGLSSKGLRRAVPRRLLYVWDLERQVLGHEPSGPA
Ga0307469_1061206223300031720Hardwood Forest SoilLERPTLRLMIREKLADGRLPNNHIPRIWGGPGNGEICDGCDELVTKGQMIMEGLSGKDSGVQLHVACFYVWDVERQVLGHEPSGPA
Ga0307471_10032450313300032180Hardwood Forest SoilVLAPRGTGSRCSNFHIKRDTTEKLTDGRLPHNHIPRMWGGPGNGETCDGCGEIVTKPQMIMEGLSGKDRGVQFHVACFYVWDATRLVLAHKP
Ga0307471_10039182813300032180Hardwood Forest SoilMDKPILRLLIHEKLVDGRLPHDRIPRTRGGPGNGETCDGCGEIVTQAQMIMEGTGSGGGTVQFHIACFYVWEVERLVSARGPSGPS
Ga0307471_10056098723300032180Hardwood Forest SoilVDGSLLRLLIQEKLADGRLPHEHIPSIFWGGPGNGEICDGCGEIVTKAQMVMELSTKDRGARFHVACFHVWDEERRALGHDPSGPA
Ga0307471_10072776723300032180Hardwood Forest SoilMDKPTLRILIREKLADGPLPHDHIPRMWGGPGDGETCDACGEIVTKTQMLMEGLSKDSTSEGPGVQFHVQCFYVWDVERQVLGHDPSGPAE
Ga0307472_10159322923300032205Hardwood Forest SoilVLAPRGTGSRCSNFHIKRDTTEKLTDGRLPHNHIPRMWGGPGNGETCDGCGEIVTKPQMIMEGLSGKDRGVQFHVACFYVWDATRQVLAHKPSGPAD
Ga0335081_1129233323300032892SoilMDKPILRLLIHGKLVDGRLPHDRIPRTWGGPGNGETCDGCGEIVTQAQTMMAGTGSGGGAVQFHVACFYVWEVERQVLAHEPSAPA
Ga0370495_0144803_6_2693300034257Untreated Peat SoilMNKPTLRLMIQEKLADGRLPHNYIPRVGGGLGTGEICDGCGETVTKTQVVMEGLSGEDRGVKFHVACFYVWDATRQVFGYKPSGPAD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.