NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F017546

Metagenome / Metatranscriptome Family F017546

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F017546
Family Type Metagenome / Metatranscriptome
Number of Sequences 240
Average Sequence Length 77 residues
Representative Sequence MDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR
Number of Associated Samples 202
Number of Associated Scaffolds 240

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 85.42 %
% of genes near scaffold ends (potentially truncated) 27.08 %
% of genes from short scaffolds (< 2000 bps) 74.17 %
Associated GOLD sequencing projects 185
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (60.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(13.333 % of family members)
Environment Ontology (ENVO) Unclassified
(30.833 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(30.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98
1F24TB_100681914
2F14TC_1004389112
3JGI10216J12902_1011887911
4F14TB_1000789725
5C687J26623_100484622
6JGI25612J43240_10213692
7JGI26141J51220_10011412
8Ga0055438_102454901
9Ga0055437_100070543
10Ga0055439_102160531
11Ga0055490_102672231
12Ga0055500_100009012
13Ga0062593_1008613492
14Ga0055489_102006522
15Ga0063356_1005310371
16Ga0063356_1022357511
17Ga0066672_107892822
18Ga0066680_100493704
19Ga0068993_100124393
20Ga0065704_104541413
21Ga0065705_108878663
22Ga0065707_104178611
23Ga0065707_106316003
24Ga0070676_101906161
25Ga0070676_104219132
26Ga0066388_1021641481
27Ga0070691_100518822
28Ga0070692_111559591
29Ga0070668_1005254321
30Ga0070709_101551631
31Ga0070711_1005791561
32Ga0070694_1008717822
33Ga0070708_10000716611
34Ga0070708_1000644845
35Ga0070681_101823272
36Ga0068867_1001902731
37Ga0070706_1000075957
38Ga0070707_1013886621
39Ga0070732_110136141
40Ga0070696_1008258842
41Ga0066701_109542791
42Ga0066905_1000005643
43Ga0074479_106720062
44Ga0075293_10005573
45Ga0075293_10078091
46Ga0075297_10073133
47Ga0075294_10205752
48Ga0081455_1000052644
49Ga0081455_102337682
50Ga0075023_1000501222
51Ga0075023_1000709361
52Ga0075023_1006067851
53Ga0075024_1006527451
54Ga0075417_100332582
55Ga0075028_1001696373
56Ga0075028_1002992032
57Ga0075028_1006652993
58Ga0075018_103768782
59Ga0075021_111788662
60Ga0079222_108832972
61Ga0079219_103389063
62Ga0099791_100022135
63Ga0099791_104726431
64Ga0099829_100017854
65Ga0105095_101223652
66Ga0105095_101423922
67Ga0099828_111765912
68Ga0099827_108275112
69Ga0105245_101921343
70Ga0114129_100147617
71Ga0114129_110699452
72Ga0105242_117276411
73Ga0105065_10222381
74Ga0105088_10204851
75Ga0105082_10502522
76Ga0105087_10092232
77Ga0105064_10175421
78Ga0126376_100400625
79Ga0126372_109517793
80Ga0126377_116921092
81Ga0134127_102094265
82Ga0134122_102534703
83Ga0137446_10362562
84Ga0137458_10520373
85Ga0137457_10763771
86Ga0137461_11544442
87Ga0137338_10780452
88Ga0137399_104677912
89Ga0137399_110843633
90Ga0137434_10866681
91Ga0137447_10718031
92Ga0137375_103318203
93Ga0137390_117773631
94Ga0137397_100777813
95Ga0157294_101616321
96Ga0137419_108173152
97Ga0153915_101341785
98Ga0157371_103835351
99Ga0157370_121116651
100Ga0075351_10280842
101Ga0180066_10002683
102Ga0180104_10097432
103Ga0180063_10271663
104Ga0180063_11585162
105Ga0132257_1043399111
106Ga0132255_1000968896
107Ga0187825_100890783
108Ga0187775_101510662
109Ga0187779_111189643
110Ga0184610_10185963
111Ga0184604_100865752
112Ga0184608_102807251
113Ga0184621_101428292
114Ga0184636_10765381
115Ga0184618_100040314
116Ga0184640_100232493
117Ga0184632_100203435
118Ga0190265_101226313
119Ga0190265_101343782
120Ga0190265_101661633
121Ga0190265_119888902
122Ga0190272_111848512
123Ga0184642_16002742
124Ga0187892_1000538315
125Ga0187892_100092718
126Ga0187893_102625422
127Ga0187893_102811922
128Ga0187893_104532513
129Ga0137408_11737012
130Ga0193748_10107311
131Ga0193722_10617382
132Ga0193715_10090851
133Ga0193723_10462802
134Ga0193725_11295751
135Ga0193727_11935762
136Ga0193743_10407083
137Ga0193728_10592712
138Ga0193731_10411392
139Ga0193755_10114605
140Ga0193755_10141194
141Ga0193735_10725443
142Ga0210407_100061968
143Ga0210407_104960352
144Ga0210401_113712712
145Ga0210378_100125424
146Ga0210404_100090777
147Ga0210377_100318084
148Ga0210400_100612101
149Ga0210400_106160211
150Ga0210400_107825103
151Ga0210389_113678411
152Ga0210384_100088605
153Ga0222625_11677062
154Ga0247799_10008751
155Ga0209109_100309643
156Ga0209640_1000786612
157Ga0209640_103738912
158Ga0207423_10131842
159Ga0210094_10929771
160Ga0207653_100184962
161Ga0207680_108309641
162Ga0207645_106774181
163Ga0207684_1000150013
164Ga0207684_1000960110
165Ga0207684_100269252
166Ga0207684_100503715
167Ga0207693_111236743
168Ga0207660_116411431
169Ga0207646_113836211
170Ga0207709_112288432
171Ga0210090_10055033
172Ga0207668_114713892
173Ga0208000_1029062
174Ga0208000_1054052
175Ga0208907_1045732
176Ga0208285_10115262
177Ga0208532_10020072
178Ga0207703_103230442
179Ga0207678_119141792
180Ga0257180_10011872
181Ga0257173_10374972
182Ga0257155_10032442
183Ga0257168_10015594
184Ga0209845_10786512
185Ga0209969_10643191
186Ga0209854_10485791
187Ga0209984_10526432
188Ga0209899_10311492
189Ga0209843_10375921
190Ga0209074_103744152
191Ga0233416_100955532
192Ga0209726_101037662
193Ga0209580_106350101
194Ga0209701_102736772
195Ga0209814_100244945
196Ga0209590_104468053
197Ga0209068_100631393
198Ga0209583_100277394
199Ga0209583_107475361
200Ga0209859_10461541
201Ga0268265_126422881
202Ga0137415_101412373
203Ga0137415_102117672
204Ga0307504_100191982
205Ga0307504_100442501
206Ga0307504_100862683
207Ga0307281_102029882
208Ga0307305_103724391
209Ga0247824_110489391
210Ga0307302_106691361
211Ga0307296_104493471
212Ga0307312_104205842
213Ga0307278_105158711
214Ga0307304_102862052
215Ga0299907_106100352
216Ga0268386_100047157
217Ga0302046_100736252
218Ga0255311_10286353
219Ga0255310_100059066
220Ga0255310_100357431
221Ga0255310_100546661
222Ga0255312_10882692
223Ga0307469_101566685
224Ga0307468_1005560761
225Ga0307475_102127272
226Ga0307473_100283836
227Ga0214473_1001031813
228Ga0307479_110170562
229Ga0307470_103562483
230Ga0307471_1002168484
231Ga0307471_1005073791
232Ga0335085_1000197130
233Ga0334722_103342602
234Ga0310810_100084847
235Ga0214471_100811112
236Ga0326729_10048233
237Ga0326729_10486291
238Ga0316624_110737153
239Ga0364942_0039916_590_826
240Ga0364934_0404316_306_503
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 58.88%    β-sheet: 1.87%    Coil/Unstructured: 39.25%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRSRExtracel.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
60.0%40.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Groundwater Sediment
Sediment
Freshwater Wetlands
Freshwater Sediment
Groundwater
Natural And Restored Wetlands
Soil
Sediment (Intertidal)
Groundwater Sediment
Watersheds
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Surface Soil
Switchgrass Rhizosphere
Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Soil
Soil
Natural And Restored Wetlands
Rice Paddy Soil
Tropical Peatland
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Soil
Groundwater Sand
Sandy Soil
Peat Soil
Microbial Mat On Rocks
Bio-Ooze
Sediment
Arabidopsis Rhizosphere
Switchgrass Rhizosphere
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Populus Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
4.2%4.6%3.3%5.0%13.3%6.7%5.4%3.3%3.8%7.1%4.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
F24TB_1006819143300000550SoilMDTPVKDGLVLEKMMKRGAVVCVVALGGAAVGLAVWALRRWQDERQYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
F14TC_10043891123300000559SoilMETPVKDGLVLEKIMKRGAVVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
JGI10216J12902_10118879113300000956SoilMDTPVKDGLMLEKVMKGGAIVCVIALGGVAVGLAVWAVRRWQGERQYQAWRAAVDADPYRRDRNGYPIGAQLGLSRVR*
F14TB_10007897253300001431SoilMDTPVKDGLVLEKIMKRGAVVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
C687J26623_1004846223300002122SoilMDTPLEQEQGVSLDTVMKRGALVCVVALGGAAVGLAVWALRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSSSH*
JGI25612J43240_102136923300002886Grasslands SoilMDTPVSEQGSSLELIMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR*
JGI26141J51220_100114123300003503Arabidopsis Thaliana RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
Ga0055438_1024549013300003995Natural And Restored WetlandsMDTPVSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGY
Ga0055437_1000705433300004009Natural And Restored WetlandsMDTPVSEQGNSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR*
Ga0055439_1021605313300004019Natural And Restored WetlandsMDTPVSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR*
Ga0055490_1026722313300004052Natural And Restored WetlandsTPVEQGVLLETVMKRATFVCAVALGGVAVGLAVWAIRRWHDERQYQAWRASVSADPYRRDHNGYPVGAQLGYSRAR*
Ga0055500_1000090123300004062Natural And Restored WetlandsMDTPVSEQGSSLDTMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR*
Ga0062593_10086134923300004114SoilMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK*
Ga0055489_1020065223300004145Natural And Restored WetlandsTVMKRGGLVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGVQVGAELGYSRR*
Ga0063356_10053103713300004463Arabidopsis Thaliana RhizosphereMDMPAAEQRSMLDTMMKRGAALCVVALGGVAVGLAVWALRRWQDERAYRAGRDSVSGDPYRRDRNGYPVGAQLGYSRSR*
Ga0063356_10223575113300004463Arabidopsis Thaliana RhizosphereMAAMDTPVTEQDSSLETAMKRGALICVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR*
Ga0066672_1078928223300005167SoilMETRVDEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0066680_1004937043300005174SoilMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR*
Ga0068993_1001243933300005183Natural And Restored WetlandsSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR*
Ga0065704_1045414133300005289Switchgrass RhizosphereMAAMDTPVSEKGSTLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNG
Ga0065705_1088786633300005294Switchgrass RhizosphereMAAMDTPVSEKGSTLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLG
Ga0065707_1041786113300005295Switchgrass RhizosphereAPSRADLEDMAAMDTPSEQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0065707_1063160033300005295Switchgrass RhizosphereMAAMDTPVSEKGSTLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGA
Ga0070676_1019061613300005328Miscanthus RhizosphereLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK*
Ga0070676_1042191323300005328Miscanthus RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLS
Ga0066388_10216414813300005332Tropical Forest SoilMDTPEETGLSLETVMKRGALVCVVALGAAAVGLTIWAVRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0070691_1005188223300005341Corn, Switchgrass And Miscanthus RhizosphereMDMPVDEKVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR*
Ga0070692_1115595913300005345Corn, Switchgrass And Miscanthus RhizosphereMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRN
Ga0070668_10052543213300005347Switchgrass RhizospherePTPGAASLEAVMKGGALVAVVALGGVAVGLAIWALRRWMDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSRPR*
Ga0070709_1015516313300005434Corn, Switchgrass And Miscanthus RhizosphereMETRVDEKVSLEAVMRRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0070711_10057915613300005439Corn, Switchgrass And Miscanthus RhizosphereMETRVDEKVSLEVVMRRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0070694_10087178223300005444Corn, Switchgrass And Miscanthus RhizosphereMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLG
Ga0070708_100007166113300005445Corn, Switchgrass And Miscanthus RhizosphereMDTPVEAGLSMETVMRRGALVCVIALGGVAVGLAVWAIRRWREEREYQAWAASAAGDPHRRDRNGYPVGAQLGFSRSR*
Ga0070708_10006448453300005445Corn, Switchgrass And Miscanthus RhizosphereMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR*
Ga0070681_1018232723300005458Corn RhizosphereMDMPAAEQRSTLDTMMKRGAALCVVALGGVAVGLAVWALRRWQDERAYRAGRDSVSGDPYRRDRNGYPVGAQLGYSRSR*
Ga0068867_10019027313300005459Miscanthus RhizosphereMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQ
Ga0070706_10000759573300005467Corn, Switchgrass And Miscanthus RhizosphereMETRVNGKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0070707_10138866213300005468Corn, Switchgrass And Miscanthus RhizosphereMDTPSGQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWQDEREYQAWRDSVNADPYRRDRNGFPVGAQLGYSRTR*
Ga0070732_1101361413300005542Surface SoilMDMPVDDGISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0070696_10082588423300005546Corn, Switchgrass And Miscanthus RhizosphereMDTPVAEQSGSLDTMMKRGALVCVVALGGVAVGLAVWALRRWQDERAYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR*
Ga0066701_1095427913300005552SoilMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR*
Ga0066905_10000056433300005713Tropical Forest SoilMDTPVKDGLVLEKMMKRGAVVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
Ga0074479_1067200623300005829Sediment (Intertidal)MDTPVSEQGSSLETMMKRGALVCVVAMGGIAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGFPVGAQLGYSRSR*
Ga0075293_100055733300005875Rice Paddy SoilMNKPGGGELSLETVMKRSALVCVVALGGVAVGLAVWALRRWQDEREYRAWRRSVSADPDRRDRNGYPVGTQLGFSRAR*
Ga0075293_100780913300005875Rice Paddy SoilVDEKVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR*
Ga0075297_100731333300005878Rice Paddy SoilMNKPAGGELSLETVMKRGALVCVVALGGVAVGLALWALRQWRDEREYRAWRTSVSADPDRRDRNGYPVGARLGFSRAR*
Ga0075294_102057523300005881Rice Paddy SoilSAMNKPAGGELSLETVMKRGALVCVVALGGVAVGLALWALRQWRDEREYRAWRTSVSADPDRRDRNGYPVGARLGFSRAR*
Ga0081455_10000526443300005937Tabebuia Heterophylla RhizosphereMDTPAKEGLRLETMMRRSAIVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPIGAQLGLSRVR*
Ga0081455_1023376823300005937Tabebuia Heterophylla RhizosphereMDTPVKDGLVLEKMMKRGAVVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPIGAQLGLSRAR*
Ga0075023_10005012223300006041WatershedsMDMPVDDKISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQSWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0075023_10007093613300006041WatershedsMDTRMDEKISLETVMKRGALVCVVAMGGVAVGLAVWALRRWNDERAYQAWRASVSADPYRRDRTGYPVGAQLGFSRSR*
Ga0075023_10060678513300006041WatershedsMDARAEEGLSLETVMKRGALVCVVALGGVAVGLAVWALRRWHDERTYQAWRASVSADPYRRDRNGY
Ga0075024_10065274513300006047WatershedsMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGAQLGF
Ga0075417_1003325823300006049Populus RhizosphereMETRVDERVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0075028_10016963733300006050WatershedsMDTRMDEKISLETVMKRGALVCVVAMGGVAVGLAVWALRRWNDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0075028_10029920323300006050WatershedsVMKRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0075028_10066529933300006050WatershedsMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0075018_1037687823300006172WatershedsMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0075021_1117886623300006354WatershedsMDTPVEAGLSMETAMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWWASANADPYRRDRNGYPVGAQLGIPRSR*
Ga0079222_1088329723300006755Agricultural SoilMETRVDEKVPLEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0079219_1033890633300006954Agricultural SoilMETRVDEKVPLEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVNSDPYRRDRNGYPVGA*
Ga0099791_1000221353300007255Vadose Zone SoilMDTRVEEKVSLETVMRRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0099791_1047264313300007255Vadose Zone SoilMAVMDTPVSEQGSSLELIMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR*
Ga0099829_1000178543300009038Vadose Zone SoilMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR*
Ga0105095_1012236523300009053Freshwater SedimentMAAMDTPASEQGSSLETVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGVPVGAQLGYSRR*
Ga0105095_1014239223300009053Freshwater SedimentMAAMDTPVTEQGSSLETALKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR*
Ga0099828_1117659123300009089Vadose Zone SoilMDTPVEAGLSMETAMKRGALVCVVALGGVAVGLAVWALRRWRDEREYQAWRASVRADPYRRDRNGYPVGTQLGFSRSR*
Ga0099827_1082751123300009090Vadose Zone SoilMDTPVEAGLSMETVMKRGALICVVALGGVAVGLAVWALRRWRDEREYQAWRSSVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0105245_1019213433300009098Miscanthus RhizosphereMAVMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK*
Ga0114129_1001476173300009147Populus RhizosphereMAAMDTPSEQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0114129_1106994523300009147Populus RhizosphereVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0105242_1172764113300009176Miscanthus RhizosphereMAVMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRTR
Ga0105065_102223813300009803Groundwater SandMDTPVEEGLSMETVMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0105088_102048513300009810Groundwater SandMDTPVEEGLSMETVMRRGSLVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGF
Ga0105082_105025223300009814Groundwater SandMDTPVDAGPSLEKVMNRGALVCVVALGGVAVGLTVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0105087_100922323300009819Groundwater SandLETVMNRGALVCVVALGGVAVGLAVWALRRWRDERESQAWWASVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0105064_101754213300009821Groundwater SandMDTPVKEGLSMETVMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGFSRSR*
Ga0126376_1004006253300010359Tropical Forest SoilMDTPEETGISIETVMKRGALVCVVALGAAAVGLTIWAVRRWQDEREYQAWRASVTDDPYRRDRNGYPVGAQLGLSRSR*
Ga0126372_1095177933300010360Tropical Forest SoilMDTPEETGLSLETVMKRGALVCVVALGAAAVGLTIWAVRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQ
Ga0126377_1169210923300010362Tropical Forest SoilMDTPVKDGLMLGKVMKGGAIVCVVALGGMAVGLAVWALRRWQDERQYQAWRAAVAADPYRRDRNGYPIGAQLGLSRAR*
Ga0134127_1020942653300010399Terrestrial SoilMDMPAAEQRSMLDTMMKRGAALCVVALGGVAVGLAVWALRRWQDERAYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR*
Ga0134122_1025347033300010400Terrestrial SoilVRKVAMDMPAAEQRSTLDTMMKRGALVCVVALGGVAVGLAVWALRRWQDERAYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR*
Ga0137446_103625623300011419SoilMDTPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0137458_105203733300011436SoilMAAMDTPVSEQGSSLETMMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR*
Ga0137457_107637713300011443SoilMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0137461_115444423300012040SoilMDTPLSEQGSALETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPIGAQLGYSRSR*
Ga0137338_107804523300012174SoilMDTPIAEHGGSLETMMKRGALVCVVAIGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0137399_1046779123300012203Vadose Zone SoilMAAMDTPSEQSSPLETVMKRGALVCIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0137399_1108436333300012203Vadose Zone SoilMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0137434_108666813300012225SoilMDTPVSEQGSSLETAMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR*
Ga0137447_107180313300012226SoilMDTPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGARLGYSRSR*
Ga0137375_1033182033300012360Vadose Zone SoilMAAMDTPSEQSTPLEMVMKRGALICIVAAGGVAVGLAVWALRRWQDEREYRAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0137390_1177736313300012363Vadose Zone SoilMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0137397_1007778133300012685Vadose Zone SoilMEAMDTPSEQSSPLEMVMKRGALICIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0157294_1016163213300012892SoilMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQFGLSRAR*
Ga0137419_1081731523300012925Vadose Zone SoilMAAMDTPSEQSSPLEMVMKRGALICIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR*
Ga0153915_1013417853300012931Freshwater WetlandsMDMPVDDKVSLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREWQAWRASVSADPYRRDRNGYPVGAQLGFSRSR*
Ga0157371_1038353513300013102Corn RhizosphereKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
Ga0157370_1211166513300013104Corn RhizosphereAAVSSETPTPGAASLEAVMKGGALVAVVALGGVAVGLAVWALRRWMDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSRPR*
Ga0075351_102808423300014318Natural And Restored WetlandsMVAMDTPVSEQGSPLETLLKRGALVCVVAAGGVAVGLAIWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR*
Ga0180066_100026833300014873SoilMDTPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPIGAQLGYSRSR*
Ga0180104_100974323300014884SoilMDTPIAEHGGSLETMMKRGALVCVVAVGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0180063_102716633300014885SoilMAAMDTPVSEQGSSLETLMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRPR*
Ga0180063_115851623300014885SoilMDTPLSEQGSALETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR*
Ga0132257_10433991113300015373Arabidopsis RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGIAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGA
Ga0132255_10009688963300015374Arabidopsis RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGIAGGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR*
Ga0187825_1008907833300017930Freshwater SedimentMDMPVDDGISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDERQYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0187775_1015106623300017939Tropical PeatlandMKTPVEEGPSLEKVLRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVTADPYRRDRNGYPVGAQLGLSRAR
Ga0187779_1111896433300017959Tropical PeatlandMNTPVVSTATAVEEEGPSLETVVKRGALVCVVALGGIAVGLAIWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGAQLGLSRSR
Ga0184610_101859633300017997Groundwater SedimentMDTPLSEQGSALETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0184604_1008657523300018000Groundwater SedimentMDTPSQQSSPLETLMKRGALVCIVAAGGVAVGRAVWALRRWQDEREYHAWRESVTADPYRRDRNGFPVGAQLGYSRNR
Ga0184608_1028072513300018028Groundwater SedimentMDTPSEQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWEEEREYQAWRDSVNADPYRRDRNGFPVGAQLGYSRTR
Ga0184621_1014282923300018054Groundwater SedimentMDTPSHQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRDSVNADPYRRDRNGFPVGAQLGYSRTR
Ga0184636_107653813300018068Groundwater SedimentMDTPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0184618_1000403143300018071Groundwater SedimentMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVNADPYRRDRNGFPVGAQLGYSRTR
Ga0184640_1002324933300018074Groundwater SedimentMDTPLSEQGGSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0184632_1002034353300018075Groundwater SedimentMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0190265_1012263133300018422SoilMDSPGSPASVESTSLETMMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRESLTADPYRRDRNGFPVGAQLGSTRSR
Ga0190265_1013437823300018422SoilMDTGTSESGVSFDTVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSADPYRRDRNGYPVGAQLGYTSSR
Ga0190265_1016616333300018422SoilMDTSVPEQGGSLDTLMKRGALVCVVALGGVAVGLAVWALRRWQDEREFQAWRESVDADPYRRDRNGYPVGTRLGYSRSR
Ga0190265_1198889023300018422SoilMDTPVSKQGSSLETMMKRGALVCVVAAGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGFPVGAQLGYSRR
Ga0190272_1118485123300018429SoilMDTPRSEQGSALETMMKRGAVVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0184642_160027423300019279Groundwater SedimentDFEDMAAMDTPSQQSSPLETVMKRGALVCIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLRYSRTR
Ga0187892_10005383153300019458Bio-OozeMDTPTQEQGSSLETMMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR
Ga0187892_1000927183300019458Bio-OozeMETPLEAGVSLETVMKRGALVCVVALGGVAVGLAVWALRRWRDEREYQAWRSSVTADPDRRDRNGYPVGAQLGFSRSR
Ga0187893_1026254223300019487Microbial Mat On RocksMDTPTLEQGSSLETMMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRTR
Ga0187893_1028119223300019487Microbial Mat On RocksMDTPTLEQGSSLETMMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVSADPSRRDRNGYPVGTQLGYSRSR
Ga0187893_1045325133300019487Microbial Mat On RocksMDTPVEKGLSLDTMMKRGALVCVVALGGVAVGLAVWALRRWQDERQYQAWRASVTADPYRRDRNGYPVGAQLSYSRSH
Ga0137408_117370123300019789Vadose Zone SoilMDTPVSEQGSSLELIMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
Ga0193748_101073113300019865SoilMDTRVEEKISLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0193722_106173823300019877SoilMDTRVEETVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0193715_100908513300019878SoilMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQ
Ga0193723_104628023300019879SoilMDTPASEQGSSFELIMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
Ga0193725_112957513300019883SoilMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0193727_119357623300019886SoilMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQASRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0193743_104070833300019889SoilMDTPLSEQGSALVTMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0193728_105927123300019890SoilMDTRVEEKVSLETVMKRGALVCVVAMGGIAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0193731_104113923300020001SoilMDTPSQQSSPLETVMKRGALVCIVAAGGVGVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0193755_101146053300020004SoilMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGYPVGAQLGYSRSR
Ga0193755_101411943300020004SoilLEDMAALDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVNADPYRRDRNGFPVGAQLGYSRTR
Ga0193735_107254433300020006SoilMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0210407_1000619683300020579SoilMETRVDEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVTSDPYRRDRNGYPVGAQLGYSRSR
Ga0210407_1049603523300020579SoilMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYRAWRESVSADPYRRDRNGYPVGAQLGFSRSR
Ga0210401_1137127123300020583SoilMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGAQLGFSRS
Ga0210378_1001254243300021073Groundwater SedimentMDTPLSEQRGSLDTMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0210404_1000907773300021088SoilMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGAQLGFSRSR
Ga0210377_1003180843300021090Groundwater SedimentMDTPVPEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0210400_1006121013300021170SoilLEAVMKRGALVCIVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0210400_1061602113300021170SoilNTRLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYRAWRESVSADPYRRDRNGYPVGAQLGFSRSR
Ga0210400_1078251033300021170SoilMETRVDEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRN
Ga0210389_1136784113300021404SoilMETRVDEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGFSRSR
Ga0210384_1000886053300021432SoilMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGTQLGFSRSR
Ga0222625_116770623300022195Groundwater SedimentMDTPLSEQRGSLDTMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0247799_100087513300023072SoilMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQFGLSRAR
Ga0209109_1003096433300025160SoilMDTPLEQEQGVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSSSH
Ga0209640_10007866123300025324SoilMDTPLEQEQGVSLDTVMKRGALVCVVALGGAAVGLAVWALRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSSSH
Ga0209640_1037389123300025324SoilMDTPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRPR
Ga0207423_101318423300025535Natural And Restored WetlandsMDTPVSEQGNSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR
Ga0210094_109297713300025549Natural And Restored WetlandsMDTPVSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLG
Ga0207653_1001849623300025885Corn, Switchgrass And Miscanthus RhizosphereMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK
Ga0207680_1083096413300025903Switchgrass RhizosphereLEAVMKGGALVAVVALGGVAVGLAIWALRRWMDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSRPR
Ga0207645_1067741813300025907Miscanthus RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLGLSRAR
Ga0207684_10001500133300025910Corn, Switchgrass And Miscanthus RhizosphereMDTPVEAGLSMETVMRRGALVCVIALGGVAVGLAVWAIRRWREEREYQAWAASAAGDPHRRDRNGYPVGAQLGFSRSR
Ga0207684_10009601103300025910Corn, Switchgrass And Miscanthus RhizosphereMDTRVEEKVSLETVMRRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0207684_1002692523300025910Corn, Switchgrass And Miscanthus RhizosphereMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR
Ga0207684_1005037153300025910Corn, Switchgrass And Miscanthus RhizosphereMETRVNGKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0207693_1112367433300025915Corn, Switchgrass And Miscanthus RhizosphereMETRVDEKVSLEAVMKRGTLVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGY
Ga0207660_1164114313300025917Corn RhizosphereMDMPAAEQRSTLDTMMKRGAALCVVALGGVAVGLAVWALRRWQDERAYRAGRDSVSGDPYRRDRNGYPVGAQLGYSRSR
Ga0207646_1138362113300025922Corn, Switchgrass And Miscanthus RhizosphereMDTPSGQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWQDEREYQAWRDSVNADPYRRDRNGFPVGAQLGYSRTR
Ga0207709_1122884323300025935Miscanthus RhizosphereVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR
Ga0210090_100550333300025965Natural And Restored WetlandsMDTPVSEQGSSLDTMMKRGALVCVVAMGGVAVGLAVWALRRWQDERQYQAWRESVTADPYRRDRNGYPVGAQLGYSRSR
Ga0207668_1147138923300025972Switchgrass RhizosphereQAAVSSETPTPGAASLEAVMKGGALVAVVALGGVAVGLAVWALRRWMDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSRPR
Ga0208000_10290623300026001Rice Paddy SoilMDMPVDEKVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR
Ga0208000_10540523300026001Rice Paddy SoilMNKPAGGELSLETVMKRGALVCVVALGGVAVGLALWALRQWRDEREYRAWRTSVSADPDRRDRNGYPVGARLGFSRAR
Ga0208907_10457323300026002Rice Paddy SoilGSAMNKPAGGELSLETVMKRGALVCVVALGGVAVGLALWALRQWRDEREYRAWRTSVSADPDRRDRNGYPVGARLGFSRAR
Ga0208285_101152623300026005Rice Paddy SoilMNKPGGGELSLETVMKRSALVCVVALGGVAVGLAVWALRRWQDEREYRAWRRSVSADPDRRDRNGYPVGTQLGFSRAR
Ga0208532_100200723300026011Rice Paddy SoilKPGGGELSLETVMKRSALVCVVALGGVAVGLAVWALRRWQDEREYRAWRRSVSADPDRRDRNGYPVGTQLGFSRAR
Ga0207703_1032304423300026035Switchgrass RhizosphereMDTPASEQGSLELIMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRDSVTADPYRRDRNGFPVGAKLGYSRK
Ga0207678_1191417923300026067Corn RhizosphereMAAMDTPVSEKGSTLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK
Ga0257180_100118723300026354SoilMDTRVEEKVSLETVMRRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR
Ga0257173_103749723300026360SoilMDTPLSEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQASRESVSADPYRRDRNGYPVGAQLGYSRSR
Ga0257155_100324423300026481SoilMDMRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0257168_100155943300026514SoilMDTRVEDKVSLETVMRRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0209845_107865123300027324Groundwater SandMDTPVEEGLSMETVMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGFSRSR
Ga0209969_106431913300027360Arabidopsis Thaliana RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQFGLS
Ga0209854_104857913300027384Groundwater SandMDTPESEQGSSLETVMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
Ga0209984_105264323300027424Arabidopsis Thaliana RhizosphereMDTPVKDGLVLEKLMKRGAAVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQ
Ga0209899_103114923300027490Groundwater SandMDTPVDAGPSLEKVMNRGALVCVVALGGVAVGLTVWALRRWRDEREYQAWRSSVTADPYRRDRNGYPVGAQLGFSRSR
Ga0209843_103759213300027511Groundwater SandMDTPVEEGLSMETVMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQ
Ga0209074_1037441523300027787Agricultural SoilMETRVDEKVPLEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR
(restricted) Ga0233416_1009555323300027799SedimentPQGPSPQGGSLETVMNRGALVCVVALGGVAVGLAIWALRRWQDEREYQAWRASVTADPYRRDSNGYPVGAQLGYSRSH
Ga0209726_1010376623300027815GroundwaterMATPISEQSSSLETMTKRGALVCVVAIGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0209580_1063501013300027842Surface SoilMDMPVDDGISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0209701_1027367723300027862Vadose Zone SoilMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0209814_1002449453300027873Populus RhizosphereMETRVDERVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0209590_1044680533300027882Vadose Zone SoilMDTPVEAGLSMETVMKRGALICVVALGGVAVGLAVWALRRWRDEREYQAWRSSVTADPYRRDRNGYPVGAQLGFSRSR
Ga0209068_1006313933300027894WatershedsMDTRMDEKISLETVMKRGALVCVVAMGGVAVGLAVWALRRWNDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0209583_1002773943300027910WatershedsMDTRMDEKISLETVMKRGALVCVVAMGGVAVGLAVWALRRWNDERAYQAWRASVSADPYRRDRTGYPVGAQLGFSRSR
Ga0209583_1074753613300027910WatershedsMDARAEEGLSLETVMKRGALVCVVALGGVAVGLAVWALRRWHDERTYQAWRASVSADPYRRDRNGYPV
Ga0209859_104615413300027954Groundwater SandGLSMETVMRRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRSAVTADPYRRDRNGYPVGAQLGFSRSR
Ga0268265_1264228813300028380Switchgrass RhizosphereMDMPAAEQRSTLDTMMKRGAALCVVALGGVAVGLAVWALRRWQDERAYRAGRDSVSGDPYRRDRNGYPVGARLGYSRSR
Ga0137415_1014123733300028536Vadose Zone SoilMDTPSEQSSPLETVMKRGALVCIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0137415_1021176723300028536Vadose Zone SoilMDTRVDDRISLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDERAYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0307504_1001919823300028792SoilMDARAEEGLSLETVMKRGALVCVVALGGVAVGLAVWALRRWHDERTYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0307504_1004425013300028792SoilEDMAAMDTPTQQSSPLETVMKRGALVCIVAVGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRK
Ga0307504_1008626833300028792SoilMDTRVEEKVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGF
Ga0307281_1020298823300028803SoilPISEQGSSLETMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0307305_1037243913300028807SoilVMKRGALVCIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0247824_1104893913300028809SoilMAAMDTPVSEKGSTLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
Ga0307302_1066913613300028814SoilMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRDSVNADPYRRDRNGFPVGAQ
Ga0307296_1044934713300028819SoilMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRT
Ga0307312_1042058423300028828SoilMAAMDTPSQQSSPLETVMKRGALVCIVAAGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0307278_1051587113300028878SoilMDTPSEQSSPLETVMKRGALICIVAVGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRTR
Ga0307304_1028620523300028885SoilRVEETVSLETVMKRGALVCVVAMGGVAVGLAIWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0299907_1061003523300030006SoilMGTPVAEQGISLETVMKRGALVCVVAVGGVAVGLAVWALRRWHDERQYQPWRASVSADPYRRDRNGYPVGAQLGYSRR
Ga0268386_1000471573300030619SoilMGTPTAEQGISLETVMKRGALVCVVAAGGVAVGLAVWALRRWQDERQHQAWRASVSADPYRRDRNGYLVGAQLGYARR
Ga0302046_1007362523300030620SoilMGTPIAEQGISLERMMKRGALVCVVAVGGVAVGLAVWALRRWQDERQYQAWRASVSADPYRRDRNGYPVGAQLGYSRR
(restricted) Ga0255311_102863533300031150Sandy SoilMAAMDTPVSGQGSSLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
(restricted) Ga0255310_1000590663300031197Sandy SoilMDTPASEQGSSLETVMKRGAVVCVVALGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGVPVGAQLGYSRR
(restricted) Ga0255310_1003574313300031197Sandy SoilSSLELMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRDSVTADPYRRDRNGFPVGAQLGYSRR
(restricted) Ga0255310_1005466613300031197Sandy SoilMDTPLEREQGVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVTADPYRRDRNGYPVGAQLGYSSSH
(restricted) Ga0255312_108826923300031248Sandy SoilMDMPVDEKVSLDTVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0307469_1015666853300031720Hardwood Forest SoilMDTPVAEQSGTLETMMKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0307468_10055607613300031740Hardwood Forest SoilMDTPVAEQSGTLETMMKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVSSDPYRRDRNGYPVG
Ga0307475_1021272723300031754Hardwood Forest SoilMETRVNEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPYRRDRNGYPVGAQLGYSRSR
Ga0307473_1002838363300031820Hardwood Forest SoilMETRVNEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPSRRDRNGYPVGAQLGY
Ga0214473_10010318133300031949SoilMDTPVDAGLSMETAMKRGALVCAVALGGVAVGLAVWALRRWHDEREYQAWWASVNADPYRRDRNGYPIGAQLGFSRSR
Ga0307479_1101705623300031962Hardwood Forest SoilMETRVNEKVSLEAVMKRGALVCVVALGGVAVGLAVWALRRWQDEREYQAWRASVSSDPSRRDRNGYPVGAQLGYSRSR
Ga0307470_1035624833300032174Hardwood Forest SoilMDTPVSEQGSSLETLMKRGALVCVVAAGGVAVGLAVWALRRWQDEREYQAWRESVSADPYRRDRNGFPVGAQLGYSRR
Ga0307471_10021684843300032180Hardwood Forest SoilMNTPLVSTPTPVEEGLSLETVLKRGALVCVVALGGVAVGLAVWAVRRWQDEREYQAWRESVTADPYRRDRNGYPVGAQLGLSRSR
Ga0307471_10050737913300032180Hardwood Forest SoilMDTPVKNGLVLEKMMKRGAVVCVVALGGVAVGLAVWALRRWQDERRYQAWRAAVTADPYRRDRNGYPVGAQLG
Ga0335085_10001971303300032770SoilMDTPVEKRFSLETVMKRGAIVCAVALGGVAVGLAVWAVRRWQDEREYQAWRASVSADPYRRDRNGYPVGAQLGLSRAR
Ga0334722_1033426023300033233SedimentMDTPASEQGSSLETVVKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVTADPYRRDRNGFPVGAQLGYSRR
Ga0310810_1000848473300033412SoilMDMPVDEKISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0214471_1008111123300033417SoilMDTPISEQGSSLETMMKRGALVCVVAIGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPVGAQLGYSRPR
Ga0326729_100482333300033432Peat SoilMDMPVDDKISLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREYQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0326729_104862913300033432Peat SoilMDMPVDDKVSLDAVMKRGALVCVVALGGVAVGLAVWALRRWHDEREWQAWRASVSADPYRRDRNGYPVGAQLGFSRSR
Ga0316624_1107371533300033486SoilMDMPVDEKVSLDAVMKRGALVCVVALGGVAVGLAVWALRRWYDERQYQAWRASVSADPYRRDRNGYPVGTQLGFSRSR
Ga0364942_0039916_590_8263300034165SedimentMDTPVEAGLSMETVMKRGALVCVVALGGVTVGLAVLALRRWREGREYQAWQSSVTADPYRRDRNGYPVGAQLRFSRLR
Ga0364934_0404316_306_5033300034178SedimentMMKRGALVCVVAMGGVAVGLAVWALRRWQDEREYQAWRESVSSDPYRRDRNGYPIGAQLGYSRSR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.