NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F022054

Metagenome Family F022054

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F022054
Family Type Metagenome
Number of Sequences 216
Average Sequence Length 95 residues
Representative Sequence MWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Number of Associated Samples 177
Number of Associated Scaffolds 216

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 76.85 %
% of genes near scaffold ends (potentially truncated) 21.30 %
% of genes from short scaffolds (< 2000 bps) 72.69 %
Associated GOLD sequencing projects 153
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(18.982 % of family members)
Environment Ontology (ENVO) Unclassified
(26.852 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(37.500 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112
1JGI25385J37094_101994222
2JGI25383J37093_100565182
3JGI25382J37095_100061945
4JGI25382J37095_100837832
5JGI25382J43887_100413212
6JGI25382J43887_104900731
7JGI25386J43895_101505042
8JGI25406J46586_100197641
9rootL2_102559991
10Ga0062593_1017105692
11Ga0062589_1003034542
12Ga0062590_1009225792
13Ga0066674_101198652
14Ga0066680_100840682
15Ga0066679_103201272
16Ga0066685_109231731
17Ga0066676_100557072
18Ga0070676_105863631
19Ga0066388_1000060313
20Ga0066388_1011486942
21Ga0070680_1012370772
22Ga0070691_105517382
23Ga0070692_107469892
24Ga0070701_106737092
25Ga0070694_1005629231
26Ga0066686_100224903
27Ga0066686_101956322
28Ga0066689_100827592
29Ga0066697_100822264
30Ga0070704_1007849171
31Ga0066692_103161482
32Ga0066707_103947081
33Ga0066698_100457902
34Ga0066698_103212082
35Ga0066705_101010443
36Ga0070702_1006508112
37Ga0066905_1007242012
38Ga0081455_101207533
39Ga0081539_10000009261
40Ga0066656_100689882
41Ga0066652_1017503861
42Ga0075417_100697752
43Ga0070716_1009715661
44Ga0066658_108523032
45Ga0066665_101108232
46Ga0075428_1002758392
47Ga0075428_1004058672
48Ga0075428_1015868852
49Ga0075428_1020098272
50Ga0075430_1001438862
51Ga0075433_100146282
52Ga0075420_1001160373
53Ga0075425_1011480482
54Ga0075434_1000144538
55Ga0079217_100267482
56Ga0079215_101039602
57Ga0079215_111509432
58Ga0079216_105527252
59Ga0075419_108762812
60Ga0079218_100178842
61Ga0099791_100157962
62Ga0099794_100213732
63Ga0099794_100544343
64Ga0066710_1002255996
65Ga0099829_100130373
66Ga0099829_104547172
67Ga0099830_101550101
68Ga0099828_103382742
69Ga0099827_103540802
70Ga0099827_107657151
71Ga0075418_110304021
72Ga0066709_1001164762
73Ga0066709_1014531861
74Ga0105091_105222532
75Ga0114129_101724032
76Ga0105092_104186632
77Ga0105092_104241292
78Ga0105092_106332761
79Ga0105104_104982801
80Ga0105248_112350232
81Ga0105249_118202072
82Ga0105347_11969672
83Ga0105065_10183482
84Ga0105081_10070762
85Ga0105061_10195972
86Ga0105061_11001372
87Ga0105070_10053732
88Ga0105076_10030772
89Ga0105087_10113692
90Ga0105085_10133022
91Ga0105064_10721152
92Ga0105074_10884492
93Ga0134088_101093323
94Ga0134088_103029662
95Ga0134071_100012012
96Ga0134071_100421282
97Ga0136847_104917361
98Ga0134127_113018821
99Ga0134122_100073789
100Ga0134121_100641595
101Ga0134121_103152581
102Ga0137389_100054465
103Ga0137388_102217541
104Ga0137374_100222764
105Ga0137380_1000251114
106Ga0137381_102268712
107Ga0137387_101532272
108Ga0137372_108760421
109Ga0137369_100456963
110Ga0137397_100157497
111Ga0137397_100535142
112Ga0137394_105880202
113Ga0137419_107794382
114Ga0137416_109569991
115Ga0137404_100050783
116Ga0137404_103210522
117Ga0137404_119952801
118Ga0137407_100337653
119Ga0137410_108827932
120Ga0137410_114926252
121Ga0134077_100297252
122Ga0134075_100017211
123Ga0134075_100037968
124Ga0180066_10438912
125Ga0180094_10728542
126Ga0180104_12283381
127Ga0134089_101501362
128Ga0134089_103923892
129Ga0134085_102207331
130Ga0134112_100199212
131Ga0184610_10038992
132Ga0184608_100316373
133Ga0184634_100031402
134Ga0184634_101076682
135Ga0184634_102122852
136Ga0184637_100066783
137Ga0184637_100190132
138Ga0184637_100263294
139Ga0184609_101795562
140Ga0184612_101020122
141Ga0184627_100024055
142Ga0184639_100141336
143Ga0184639_100667271
144Ga0066667_101275692
145Ga0066669_104935781
146Ga0187892_1000251327
147Ga0137408_10810072
148Ga0179594_101163973
149Ga0179594_101580042
150Ga0196964_103305252
151Ga0210379_100542592
152Ga0210379_105622132
153Ga0179596_105515512
154Ga0222623_100117242
155Ga0207642_101697021
156Ga0207662_103743361
157Ga0207670_113175581
158Ga0207669_108334801
159Ga0207665_111199671
160Ga0207712_114751272
161Ga0207703_108046721
162Ga0207641_103304381
163Ga0207648_103467072
164Ga0209438_12117701
165Ga0209235_10060692
166Ga0209235_11671231
167Ga0209237_10304524
168Ga0209761_10849082
169Ga0209470_10985263
170Ga0257166_10241573
171Ga0209690_11355321
172Ga0209058_10046596
173Ga0209157_10385004
174Ga0209157_12230402
175Ga0209056_100351401
176Ga0179593_10137114
177Ga0209869_10208461
178Ga0209387_11515902
179Ga0209588_10647382
180Ga0233416_100344572
181Ga0209701_100737492
182Ga0209701_102088932
183Ga0209814_100612652
184Ga0209283_100400892
185Ga0209481_106572241
186Ga0209590_108142141
187Ga0209486_106317421
188Ga0209488_101513082
189Ga0207428_106437332
190Ga0209885_10119872
191Ga0209889_10126732
192Ga0209859_10149821
193Ga0233417_101145922
194Ga0268265_103081112
195Ga0137415_102533181
196Ga0137415_103656354
197Ga0307312_104203072
198Ga0307495_102218422
199Ga0307408_1000538801
200Ga0307408_1000555622
201Ga0307408_1001881472
202Ga0307469_100173882
203Ga0307469_121021312
204Ga0307468_1006025352
205Ga0307407_105621312
206Ga0307412_106656692
207Ga0307471_1032953382
208Ga0307472_1004488642
209Ga0247829_102856834
210Ga0364924_017225_313_615
211Ga0364937_010866_746_1048
212Ga0364938_102753_143_397
213Ga0364940_0088590_236_490
214Ga0364931_0321253_93_392
215Ga0364934_0364712_1_237
216Ga0364923_0180726_306_560
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 60.94%    β-sheet: 0.00%    Coil/Unstructured: 39.06%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100MWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGRSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Groundwater Sediment
Sediment
Freshwater Sediment
Soil
Groundwater Sediment
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Groundwater Sand
Soil
Bio-Ooze
Sediment
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Sugarcane Root And Bulk Soil
Populus Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
6.0%19.0%5.1%3.2%11.1%7.9%3.7%6.5%3.2%7.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1019942223300002558Grasslands SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVXCHSVEVAVQQRFGPRDWSDTLDRMIKYGAPIPPEDKETL
JGI25383J37093_1005651823300002560Grasslands SoilMWRSGSLAAALALGIGAAAVSLPRAGPAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR*
JGI25382J37095_1000619453300002562Grasslands SoilMWRSLRXAALLALTAGMALGLRLAXASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
JGI25382J37095_1008378323300002562Grasslands SoilMWRSLRAPAILAFTAGIAVGLRLAXASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR*
JGI25382J43887_1004132123300002908Grasslands SoilMWRSLRPAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
JGI25382J43887_1049007313300002908Grasslands SoilALTAAPGAAQEAPIPKTAGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMVYLMQHYRDPNGR*
JGI25386J43895_1015050423300002912Grasslands SoilMWRSLRTAALLALTXGMALGLRLALASAQNTPIPQTPGWELVMRCVXCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
JGI25406J46586_1001976413300003203Tabebuia Heterophylla RhizosphereAVALALGVVAPSAAAQSPSIPQTPGWELVMRCVICHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPDDKQKLMAYLLQHYRDPNGR*
rootL2_1025599913300003322Sugarcane Root And Bulk SoilMSASSKMFLVAGGLAVLVLTAAAATPAQDAQIPQTPGWELVMRCVMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPAEDKQRLM
Ga0062593_10171056923300004114SoilMWRSRALAAALLGVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0062589_10030345423300004156SoilMWRSLALAAALGAVTAAHAQDVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0062590_10092257923300004157SoilMSASSKVVVLASGLAVLMLTAAAATRAQDAPLPQTPGWELVMRCVMCHSVEIAVQQRLGPEGWSDTLDRMIKYGAPIPAEDKQRLMAYLLRHFRDPAGR*
Ga0066674_1011986523300005166SoilMWRSLRAPAILAFTAGIAVGLRLAPASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR*
Ga0066680_1008406823300005174SoilMWRSLRTAALLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR*
Ga0066679_1032012723300005176SoilMWRSVALAAVLALAGAVVAHAQDVEIPKTPGWELIMRCAICHSVEIAVQQRFGPQGWSETHDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR*
Ga0066685_1092317313300005180SoilLLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0066676_1005570723300005186SoilWRSLRPAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0070676_1058636313300005328Miscanthus RhizosphereMWRSLALAAALLGVSLVVGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMTYLLRHFRDPSGR*
Ga0066388_10000603133300005332Tropical Forest SoilMWRSVALAALLASVSPARAQDADIPKTPGWELIMRCVICHSVEVAVQQRLGPQGWSDTLDRMIKYGAPIPPEDKAVLMAYLLRHYRDPAGR*
Ga0066388_10114869423300005332Tropical Forest SoilMWRSLALAAALVATPAAQAQDVGIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSETLDRMIAYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0070680_10123707723300005336Corn RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR*
Ga0070691_1055173823300005341Corn, Switchgrass And Miscanthus RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMTYLLRHFRDPSGR*
Ga0070692_1074698923300005345Corn, Switchgrass And Miscanthus RhizosphereMWRSLALAAALLGVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR*
Ga0070701_1067370923300005438Corn, Switchgrass And Miscanthus RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0070694_10056292313300005444Corn, Switchgrass And Miscanthus RhizosphereMWRSALLGAAAIAAGALPASAQEIPRTPGWELVMRCVMCHSVEIAVQQRLGPQGWSETLDRMIEYGAPIPPTDREQLLAYLLRHFRDPDGR*
Ga0066686_1002249033300005446SoilMRSGSMWRNTLALAVALTAAVAPVGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0066686_1019563223300005446SoilMWRSVVALAAGLATVAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPDGR*
Ga0066689_1008275923300005447SoilMWRSLRAPAILAFTAGIAVGLRLALASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR*
Ga0066697_1008222643300005540SoilMWRSLALAAALGTAAAAHAQEVEVPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR*
Ga0070704_10078491713300005549Corn, Switchgrass And Miscanthus RhizosphereMWRSLALAAALGTSAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR*
Ga0066692_1031614823300005555SoilMWRSGSLAAALALGVGAAAVSLPRAGPAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR*
Ga0066707_1039470813300005556SoilMWRSVVALAAGLATVAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR*
Ga0066698_1004579023300005558SoilMWRSLRPAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0066698_1032120823300005558SoilMWRSVVALAAGLATVAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR*
Ga0066705_1010104433300005569SoilMWRSLALAAALATASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR*
Ga0070702_10065081123300005615Corn, Switchgrass And Miscanthus RhizosphereWMWRSLALAAALLGVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMTYLLRHFRDPSGR*
Ga0066905_10072420123300005713Tropical Forest SoilMWRSTAVAVLALAAGTAAAPARRPPSAQEVPVPQTPGWELVMRCVICHSVEVAVQQRLGPQGWSDTLDRMIRYGAPIPP
Ga0081455_1012075333300005937Tabebuia Heterophylla RhizosphereMWRSALGLFVALAVTATPAAAQNAPIPQTPGWELVMRCVICHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0081539_100000092613300005985Tabebuia Heterophylla RhizosphereMRSESMWRSTLAVALALGVVAPSAAAQSPSIPQTPGWELVMRCVICHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPDDKQKLMAYLLQHYRDPNGR*
Ga0066656_1006898823300006034SoilMWRSLRPAALLAVTAWMVLGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0066652_10175038613300006046SoilMWRSLALAAALATASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0075417_1006977523300006049Populus RhizosphereMWRSALAVALALGVVAPSAAAQSPSIPQTPGWELVMRCVMCHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPDDKQKLMAYLLQHYRDPNGR*
Ga0070716_10097156613300006173Corn, Switchgrass And Miscanthus RhizosphereAVLLAAGGAAGACAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPDDKAQLMAYLLRHYRDPNGR*
Ga0066658_1085230323300006794SoilMWRSLALAAALGTAAAAHAQEVEVPKTPGWELIMRCVMCHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRH
Ga0066665_1011082323300006796SoilMWRSVVALAAGLATVAAAHAHEVEIPKMPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR*
Ga0075428_10027583923300006844Populus RhizosphereMSSSLRAGLAAGAVGLVVLGAGAAARPAAQEVPIPQTEGWELVIRCAICHSVEIAVQQRFGPRGWSETLDRMIRYGAPIPPEDKARLMVYLLRHYRDPDGP*
Ga0075428_10040586723300006844Populus RhizosphereMWRSLALAVALATAPAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR*
Ga0075428_10158688523300006844Populus RhizosphereMWRSLVVAACIAWPAAAQEPPIPQTPGWELVMRCLMCHSVEVAVQQRFGPKGWSDTLDRMIRYGAPIPPEDKEQLMVYLLRHFRDPDGR*
Ga0075428_10200982723300006844Populus RhizosphereAGTSSTRSESMWRSALAVALALGVVAPSAAAQSPSIPQTPGWELVMRCVMCHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPDDKQKLMAYLLQHYRDPNGR*
Ga0075430_10014388623300006846Populus RhizosphereVLAAGLAMTTAVGGAEPAQDTHLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPVGR*
Ga0075433_1001462823300006852Populus RhizosphereMWRNLALAAALGTASAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0075420_10011603733300006853Populus RhizosphereMSHRPRALVLAAGLAMTTAVGGAEPAQDTHLPQTPGWELVLRCLMCHSVEIAVQQRFGPEGWSDTLDRMIQYGAPIPPEDKQRLLSYLLRHFRDPVGR*
Ga0075425_10114804823300006854Populus RhizosphereMWRNLALAAALGTASAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPRGR*
Ga0075434_10001445383300006871Populus RhizosphereMWRSVALAAALGAASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0079217_1002674823300006876Agricultural SoilMSHRPRALVLAAGLALTTAVGGAAPAQDTQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR*
Ga0079215_1010396023300006894Agricultural SoilVLAAGLALTTAVGGAAPAQDTQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR*
Ga0079215_1115094323300006894Agricultural SoilMWRSTVRAIALVLATGAVAFAQEVDIPRTPGWELVMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPTGR*
Ga0079216_1055272523300006918Agricultural SoilMSHRPRAVALAAGLAMTTAVGGAAPAQDTSLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR*
Ga0075419_1087628123300006969Populus RhizosphereMSASSKLVLLAGGLAVLVLTGAAATPAQDAQIPQTPGWELVMRCVMCHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPPPDKDVLMVYLLRHFRDPDGR*
Ga0079218_1001788423300007004Agricultural SoilMSHRPRALVLAAGLAMTTAVGGAAPAQDTHLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR*
Ga0099791_1001579623300007255Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0099794_1002137323300007265Vadose Zone SoilMWRSATLATVLALVAGAAASRPPGAAAQNPPIPQTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPEDKQELLVYLLRHYRDPDGR*
Ga0099794_1005443433300007265Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRDWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0066710_10022559963300009012Grasslands SoilMWRSLALAAALGTAVAAHAQEVEVPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR
Ga0099829_1001303733300009038Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0099829_1045471723300009038Vadose Zone SoilMWRDALLAAALVGLGVAVGLGAPSVRAQPSPIPQTDGWELVMRCVICHSVEIALQQRFGPTGWSDTLDRMIKYGAPIPPEDKAKLMTYLLRHYRDPDGR*
Ga0099830_1015501013300009088Vadose Zone SoilMWRSGSLAAALALGVGAAAVSVPRAGSAQNLPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR*
Ga0099828_1033827423300009089Vadose Zone SoilMWRSGSLAAALALGVGAAAVSVPRAGSAQNSPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLRHFRDPDGR*
Ga0099827_1035408023300009090Vadose Zone SoilMWRSGSLAAALALGVGAAAVSVPRAGSAQNSPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR*
Ga0099827_1076571513300009090Vadose Zone SoilMWRSLWTPAILAFTAGIAVGLRLAAASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLV
Ga0075418_1103040213300009100Populus RhizosphereLRAGLAAGAVGLVVLGAGAAARPAAQEVPIPQTEGWELVIRCAICHSVEIAVQQRFGPRGWSETLDRMIRYGAPIPPEDKARLMVYLLRHYRDPDGP*
Ga0066709_10011647623300009137Grasslands SoilMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0066709_10145318613300009137Grasslands SoilLAAALGTAAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGQQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR*
Ga0105091_1052225323300009146Freshwater SedimentMWRSAVHAAALLLATGVFSTADAQEVPIPQTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPGEKVQLMTYLLRHYRDPNGR*
Ga0114129_1017240323300009147Populus RhizosphereMSASSKLVLLAGGLAVLVLTGAAATPAQDAQIPQTRGWELVMRCVMCHSVEIAVQQRLGPEGWSDTLDRMIKYGAPIPAEDKQRLMTYLLRHFRDPAGR*
Ga0105092_1041866323300009157Freshwater SedimentMWRSAARAAALLLATGVFTSADAQEVPIPQTPGWELVMRCVICHSVEIAVQQRFGPHGWSETLDRMIKYGAPIPPGEKAQLMTYLLRHYRDPNGR*
Ga0105092_1042412923300009157Freshwater SedimentPVVTAPAQEVPIPQTPGWELIMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPDDKTQLMVYLLRHYRDPDGR*
Ga0105092_1063327613300009157Freshwater SedimentVAAVALLVLAAVEGAGPVSAQEIPQTPGWELVMRCVMCHSVEIAVQQRLGPRGWSETLDRMIAYGAPIPPADKETLMAYLLRHFRDPDGR*
Ga0105104_1049828013300009168Freshwater SedimentMWRSAVHAAVLLLATGVFSTADAQEVPIPQTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPGEKVQLMTYLL
Ga0105248_1123502323300009177Switchgrass RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKA
Ga0105249_1182020723300009553Switchgrass RhizosphereMWRSVAAAASLVVLTGVIAAVTSAPAQEVPIPQTPGWELIMRCVMCHSVEIAVQQRLGPQGWSDTLDRMIKYGAPIPPEDKAQLIVYLLRHYSDPNGR*
Ga0105347_119696723300009609SoilMWRSLALAASLALLGASAPAPRFYSLSAQEVPIPRTPGWELVMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPI
Ga0105065_101834823300009803Groundwater SandMWRSLRTAALLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEALLVYLLRHYRDPDGR*
Ga0105081_100707623300009806Groundwater SandMWRSLRTAALLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFSPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0105061_101959723300009807Groundwater SandMWRSLRTAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0105061_110013723300009807Groundwater SandMWRSEVSTAAIALLAVAAAGAGAVPVSAQEIPQTPGWELVIRCVMCHSVEIAVQQRLGPRGWSETLDRMITYGAPIPPADKEQLMAYLLRHFRDPEGR*
Ga0105070_100537323300009815Groundwater SandMWRSLRTAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPGGR*
Ga0105076_100307723300009816Groundwater SandMWRSLRTAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEALLVYLLRHYRDPDGR*
Ga0105087_101136923300009819Groundwater SandLLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0105085_101330223300009820Groundwater SandMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0105064_107211523300009821Groundwater SandMWRSFTLMATLALGGGAAITPLVIASAQEVPIPQTPGWELILRCVICHSVEIAVQQRLGPLAWSETLDRMIKYGAPIPPDDKEKLMVYLLRHYRDSDGR*
Ga0105074_108844923300010029Groundwater SandMWRSLRTAALLPLTAGMVLGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0134088_1010933233300010304Grasslands SoilMWRNTLALAVALTAAVAPVGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0134088_1030296623300010304Grasslands SoilMWRSLALAAALGTAVAAHAQQEVEVPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR*
Ga0134071_1000120123300010336Grasslands SoilMWRNALALAVALTAAVAPVGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0134071_1004212823300010336Grasslands SoilMWRSLRTATLLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR*
Ga0136847_1049173613300010391Freshwater SedimentMWRDTLLAAALVSLGAAVALHAPPARAQPSPIPQTDGWELIMRCVICHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKAKLMTYLL
Ga0134127_1130188213300010399Terrestrial SoilMWRSLALAAALIAASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0134122_1000737893300010400Terrestrial SoilSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR*
Ga0134121_1006415953300010401Terrestrial SoilGVSLVVGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR*
Ga0134121_1031525813300010401Terrestrial SoilLALAMALAGAAAVVPAPAQEVAIPQTPGWELIMRCAICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKAQLMAYLLRHYRDPNGR*
Ga0137389_1000544653300012096Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLAVASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0137388_1022175413300012189Vadose Zone SoilMWRSGSLAAALALGVGAAAVSLPCAGSAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLRHFRDPDGR*
Ga0137374_1002227643300012204Vadose Zone SoilMWRSLRTAAVLALTTGMALGLRFALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLLYLLRHYRDPDGR*
Ga0137380_10002511143300012206Vadose Zone SoilMWRDALLAAALVGLGVAVGLGASSVRAQPSPIPQTEGWELVMRCVICHSVEIAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKAKLMTYLLRHYRDPDGR*
Ga0137381_1022687123300012207Vadose Zone SoilMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0137387_1015322723300012349Vadose Zone SoilMWRSLRTAALLAVTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0137372_1087604213300012350Vadose Zone SoilMALGLRFALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLLY
Ga0137369_1004569633300012355Vadose Zone SoilMWRSLRTAAVLALTTGMALGLRFALASAQNTPIPQTPGWEVVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLLYLLRHYRDPDGR*
Ga0137397_1001574973300012685Vadose Zone SoilMWRSLALAIALAVSAAVIPAPAQEVAIPQTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR*
Ga0137397_1005351423300012685Vadose Zone SoilMWRSLSTAAVLALAAGVAVSLRLALVSAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRLGPRGWSDTLDRMIKYGAPIPREDKEKLLVYLLRHYRDPDGR*
Ga0137394_1058802023300012922Vadose Zone SoilMWRSALGLAVALAATAAPTVAQEAPIPKTAGWELVMRCVICHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0137419_1077943823300012925Vadose Zone SoilMWRSVLLAAALGATASAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR*
Ga0137416_1095699913300012927Vadose Zone SoilMWRSLALAIALAAAAAVIPAPAQEVAIPQTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR*
Ga0137404_1000507833300012929Vadose Zone SoilMWRSLSTAAVLALAAGVAVSLRLALVSAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR*
Ga0137404_1032105223300012929Vadose Zone SoilMWRSALALAVALTAAVAPAGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0137404_1199528013300012929Vadose Zone SoilMWRSLALAIALAAAAAVIPAPAQEVAIPQTPGWELIMRCMICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR*
Ga0137407_1003376533300012930Vadose Zone SoilMWRSLSTVAVLALAAGVAVSLRLALVSAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR*
Ga0137410_1088279323300012944Vadose Zone SoilMWRDALLAAALVGLGVAVGLGAPSVRAQPSPIPQTDGWELVMRCVICHSVEIAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKAKLMTYLLRHYRDPDGR*
Ga0137410_1149262523300012944Vadose Zone SoilMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0134077_1002972523300012972Grasslands SoilMWRSLRTAALLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR*
Ga0134075_1000172113300014154Grasslands SoilMWRNTLALAVALTAAVAPVGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLM
Ga0134075_1000379683300014154Grasslands SoilMWRSVVALAAGLATVAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLERMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR*
Ga0180066_104389123300014873SoilMWRDTLLAAALVSLGAAVTLHAPPARAQPSPIPQTDGWELIMRCVICHSVEIAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKTKLMTYLLRHYRDPEGR*
Ga0180094_107285423300014881SoilMWRSVALVAALALGARAAIVSRARISAQEAPTPQTHGWELVMRCVICHSVEIAVQQRLGPQGWSDTLDRMIKYGAPIPP
Ga0180104_122833813300014884SoilMWRSAARAAALLLATGVAIVFTSADAQEVPIPQTPGWELVMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPGEKTQLMTYLLRHYRDPNGR*
Ga0134089_1015013623300015358Grasslands SoilMWRSLRTAALLALTAGMAFGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR*
Ga0134089_1039238923300015358Grasslands SoilMRAPAILAFTAGIAVGLRLAPASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR*
Ga0134085_1022073313300015359Grasslands SoilMWRSLRAPAILAFTAGIAVGLRLAPASAQNTPIPRTPGRELVVRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYL
Ga0134112_1001992123300017656Grasslands SoilLMWRSLRAPAILAFTAGIAVGLRLALASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR
Ga0184610_100389923300017997Groundwater SedimentMWRSLRSAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0184608_1003163733300018028Groundwater SedimentMWRSLRTAALLAVTAGMALGLRLALASAQNTPIPHTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0184634_1000314023300018031Groundwater SedimentMWRSLRTAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0184634_1010766823300018031Groundwater SedimentMWRDTLLAAALVSLGAAVTLHAPPARAQPSAIPLTDGWELIMRCVICHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKTKLMTYLLRHYRDPDGR
Ga0184634_1021228523300018031Groundwater SedimentMWRSAGIGAVALAFLVAVTAPPAPHRASGQEVPIPKTDGWELVMRCVICHSVEIAVQQRFGPHGWSDTLERMIKYGAPIPPEEKEKLMMYLLRHYRDPDGS
Ga0184637_1000667833300018063Groundwater SedimentMWRSLRSAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0184637_1001901323300018063Groundwater SedimentMWRSAILVAALALGAGAATVPRVRVAAQEVPIPQTPGWELIMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPIPPDDKDTLMAYLLRHYRDPDGR
Ga0184637_1002632943300018063Groundwater SedimentMWRSAVLVTALALGAGAATVPRVRVAAQEVPIPRTPGWELIMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPIPPDDKDT
Ga0184609_1017955623300018076Groundwater SedimentMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0184612_1010201223300018078Groundwater SedimentMWRSAILVAALALGAGAATVPRVRVAAQEVPIPRTPGWELIMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPIPPDDKDTLMAYLLRHYRDPDGR
Ga0184627_1000240553300018079Groundwater SedimentMWRSLRSAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKDTLLVYLLRHYRDPDGR
Ga0184639_1001413363300018082Groundwater SedimentMWRDTLLAAALLGLGAAGALHAPPARAQPSAIPLTDGWELVMRCVICHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKTKLMTYLLRHYRDPDGR
Ga0184639_1006672713300018082Groundwater SedimentMWRDTLLAAALVSLGAAVALHAPPARAQPSPIPQTDGWELIMRCVICHSVEIAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKTKLLTYLLRHYRDPDGR
Ga0066667_1012756923300018433Grasslands SoilMWRSLALAAALGTAAAAHAQEVEVPKTPGWELIMRCVMCHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPAGR
Ga0066669_1049357813300018482Grasslands SoilMWRSLALAAALATASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKAQLIAYLLRHYRDPNGR
Ga0187892_10002513273300019458Bio-OozeMWRRVGIGTAALALLAGAAAAPAPRGAGGQEVPIPQTEGWELVLRCVMCHSVEIAVQQRLGPGGWSDTLDRMIRYGAPIPPEDKEKLMVYLLRHYRDPAGS
Ga0137408_108100723300019789Vadose Zone SoilMWRSLSTAAVLALAAGVAVSLRLALVSAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0179594_1011639733300020170Vadose Zone SoilMRSGSMWRSALALAVALTAAVAPAGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLERMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR
Ga0179594_1015800423300020170Vadose Zone SoilMWRSLSTVAVLALAAGVAVSLRLALVSAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRLGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0196964_1033052523300020202SoilMSRRIRRLALGVALALTTAVPGAPAAQDAQLPQTPGWELVMRCLMCHSVEVAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLMAYLLRHFRDPAGR
Ga0210379_1005425923300021081Groundwater SedimentMWRSVALVAAIALGAGAATVRPARIAAQEVPIPQTQGWELVVRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPDDKDRLMAYLLRHYRDPDGR
Ga0210379_1056221323300021081Groundwater SedimentMWRSLRTAALLALTAGMALGLRLALASAQNAPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0179596_1055155123300021086Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0222623_1001172423300022694Groundwater SedimentMWRSLRTAALLAVTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0207642_1016970213300025899Miscanthus RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMADLLRHYRDPNGR
Ga0207662_1037433613300025918Switchgrass RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR
Ga0207670_1131755813300025936Switchgrass RhizosphereRWMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKARLMAYLLRHYRDPNGR
Ga0207669_1083348013300025937Miscanthus RhizosphereMWRSLALAAALLGVSLVVGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMAY
Ga0207665_1111996713300025939Corn, Switchgrass And Miscanthus RhizosphereSRWMWRNLALAAALGTASAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPDDKAQLMAYLLRHYRDPNGR
Ga0207712_1147512723300025961Switchgrass RhizosphereMWRSVAAAASLVVLTGVIAAVTSAPAQEVPIPQTPGWELIMRCVMCHSVEIAVQQRLGPQGWSDTLDRMIKYGAPIPPEDKAQLIVYLLRHYSDPNGR
Ga0207703_1080467213300026035Switchgrass RhizosphereMWRSLALAAALVAVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMTYLLRHFRDPSGR
Ga0207641_1033043813300026088Switchgrass RhizosphereMWRSLALAAALGAVTAAHAQDVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR
Ga0207648_1034670723300026089Miscanthus RhizosphereMWRSLALAAALLGVSLVFGAGAQEVEIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMITYGAPIPPEDKAQLMTYLLRHFRDPSGR
Ga0209438_121177013300026285Grasslands SoilGTSSTPSRWMWRSVLLAAALGATASAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR
Ga0209235_100606923300026296Grasslands SoilMWRSGSLAAALALGIGAAAVSLPRAGPAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR
Ga0209235_116712313300026296Grasslands SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRDWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0209237_103045243300026297Grasslands SoilMWRSLRAPAILAFTAGIAVGLRLAPASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADGR
Ga0209761_108490823300026313Grasslands SoilMWRSLRTAALLALTVGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0209470_109852633300026324SoilMRSGSMWRNTLALAVALTAAVAPVGAQEAPIPKTPGWELVMRCVICHSVEIAVQQRFGPQGWSDTLDRMIRYGAPIPPEDKQKLMAYLLQHYRDPNGR
Ga0257166_102415733300026358SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0209690_113553213300026524SoilMWRSLRAPAILAFTAGIAVGLRLALASAQNTPIPRTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKEQLLVYLLRHYRDADSR
Ga0209058_100465963300026536SoilMWRSLRPAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR
Ga0209157_103850043300026537SoilMWRSLRPAALLAVTAWMVLGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPAGR
Ga0209157_122304023300026537SoilMWRSVVALAAGLATVAAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPDGR
Ga0209056_1003514013300026538SoilMWRSVVALAAGLATVAAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR
Ga0179593_101371143300026555Vadose Zone SoilMWRSLALAIALAVSAAVIPAPAQEVAIPQTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR
Ga0209869_102084613300027187Groundwater SandAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0209387_115159023300027639Agricultural SoilVLAAGLALTTAVGGAAPAQDTQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR
Ga0209588_106473823300027671Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRDWSDTLDRMIKYGAPIPPEDKQELLVYLLRHYRDPDGR
(restricted) Ga0233416_1003445723300027799SedimentMWRRLAVAALALLAGTVVAPRLRHAGGQEVPIPRTEGWELVMRCVMCHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPDDKRTLMAYLLRHYRDPGGS
Ga0209701_1007374923300027862Vadose Zone SoilALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0209701_1020889323300027862Vadose Zone SoilMWRSGSLAAALALGVGAAAVSLPCAGSAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR
Ga0209814_1006126523300027873Populus RhizosphereMWRSALAVALALGVVAPSAAAQSPSIPQTPGWELVMRCVMCHSVEVAVQQRFGPQGWSDTLDRMIRYGAPIPPDDKQKLMAYLLQHYRDPNGR
Ga0209283_1004008923300027875Vadose Zone SoilMWRSGSLAAALALGVGAAAVSLPCAGSAQNPPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLRHFRDPDGR
Ga0209481_1065722413300027880Populus RhizosphereSSKLVLLAGGLAVLVLTGAAATPAQDAQIPQTPGWELVMRCVMCHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPPPDKDVLMVYLLRHFRDPDGR
Ga0209590_1081421413300027882Vadose Zone SoilMWRSGSLAAALALGVGAAAVSVPRAGSAQNSPLPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKERLLVYLLQHFRDPDGR
Ga0209486_1063174213300027886Agricultural SoilSHRPRAVVLAAGLALTTAVGGAAPAQDTQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLLAYLLRHFRDPAGR
Ga0209488_1015130823300027903Vadose Zone SoilMWRSLRTAALLALTAGMALGLRLALASAQNAPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0207428_1064373323300027907Populus RhizosphereMWRSVALAAALGAASAAHAQDVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPNGR
Ga0209885_101198723300027950Groundwater SandMWRSLRTAALLALTAGMAMGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0209889_101267323300027952Groundwater SandMWRSFTLMATLALGGGAAITPLVIASAQEVPIPQTPGWELILRCVICHSVEIAVQQRLGPLAWSETLDRMIKYGAPIPPDDKEKLMVYLLRHYRDPDGR
Ga0209859_101498213300027954Groundwater SandSSTPSRLMWRSAILVAALALGAGAATVPRGRVAAQEVPIPQTPGWELIMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
(restricted) Ga0233417_1011459223300028043SedimentMWRRLVVTALALLAGAVMAPRPRHVGSQEVPIPRTEGWELVMRCVMCHSVEIAVQQRFGPQGWSDTLDRMIKYGAPIPPADKMTLMAYLLRHYRDPGGS
Ga0268265_1030811123300028380Switchgrass RhizosphereMWRSVAAAASLVVLTGVIAAVTSAPAQEVPIPQTPGWELIMRCVMCHSVEIAVQQRLGPQGWSDTLDRMIKYGAPIPPEDKAQLMTYLLRHYRDPNGR
Ga0137415_1025331813300028536Vadose Zone SoilPSRWMWRSLALAIALAAAAAVIPAPAQEVAIPQTPGWELIMRCVICHSVEIAVQQRFGPQGWSDTLDRMIAYGAPIPPEDKRVLMAYLLRHYRDPAGR
Ga0137415_1036563543300028536Vadose Zone SoilMRYGGLYLAVLNLAVTAGMALGLRLALASAQNTPIPQTPGWELVMRCVMCHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0307312_1042030723300028828SoilRTAALLALTAGMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0307495_1022184223300031199SoilMWRSLRTAALLALTAGMAFGLRLALSSAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPRGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0307408_10005388013300031548RhizosphereSMWRSTVRAIALVLATGAVAFAQEVDIPRTPGWELVMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPGGR
Ga0307408_10005556223300031548RhizosphereMVLVSAALPAAAQETQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIGPDDKQRLMAYLLRHFRDAAGR
Ga0307408_10018814723300031548RhizosphereVSRSTRAVALAGALVATCIVSRAPAAAQDTQLPQTPGWELVMRCLMCHSVEIAVQQRFGPEGWSDTLDRMIKYGAPIPPEDKQRLMGYLLRHFRDPAGR
Ga0307469_1001738823300031720Hardwood Forest SoilMWRSAALAAVLLAAGGAAGACAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPDDKAQLMAYLLRHYRDPNGR
Ga0307469_1210213123300031720Hardwood Forest SoilMWRSLRTAALLALTAGMAVGLRLAFASAQNPPIPQTPGWELVMRCVICHSVEVAVQQRFGPGGWSDTLDRMIKYGAPIPPEDKEKLLVYLLRHYRDPDGR
Ga0307468_10060253523300031740Hardwood Forest SoilMWRSLALAAALGAASAAHAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMTYLLRHYR
Ga0307407_1056213123300031903RhizosphereMWRSTVRAIALVLATGAVAFAQEVDIPRTPGWELVMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPEDKAQLMAYLLRHYRDPGGR
Ga0307412_1066566923300031911RhizosphereMWRSTVRAIALVLATGAVAFAQEVDIPRTPGWELVMRCVICHSVEIAVQQRLGPEGWSDTLDRMIKYGAPIPPEDKQRLMGYLLRHFRDPAGR
Ga0307471_10329533823300032180Hardwood Forest SoilMWRELVATGLAAVGVLAAGSVGAQDAGIPRTPGWELILRCVICHSVEVAVQQRFGPQGWSDTLDRMIKYGAPIPPEDKAQLMESLLRHYRDPEGK
Ga0307472_10044886423300032205Hardwood Forest SoilMWRSAALAAVLLAAGGAAGACAQEVEIPKTPGWELIMRCVICHSVEIAVQQRFGPQGWSETLDRMIKYGAPIPPDDKAQLMAYLLRHYRAPNGR
Ga0247829_1028568343300033550SoilMWRSALLGAAAIAAGALPASAQEIPRTPGWELVMRCVMCHSVEIAVQQRLGPQGWSETLDRMIEYGAPIPPTDREQLLAYLLRHFRDPDGR
Ga0364924_017225_313_6153300033811SedimentMWRSLRTAALLALTAGMALGLRLALASAQNTPIPQTPGWDLVMRCVICHSVEVAVQQRFGPRDWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0364937_010866_746_10483300034113SedimentMWRSLALAASLALLGASAPAPRFYSLSAQEVPIPRTPGWELVMRCVICHSVEIAVQQRLGPRGWSETLDRMIKYGAPIPPEDKAKLMTYLLQHYRDPAGP
Ga0364938_102753_143_3973300034114SedimentMALGLRLTLASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0364940_0088590_236_4903300034164SedimentMALGLRLALASAQNTPIPQTPGWELVMRCVICHSVEVAVQQRFGPKGWSDTLDRMIKYGAPIPPEDKETLLVYLLRHYRDPDGR
Ga0364931_0321253_93_3923300034176SedimentMWRSVALVAALALGAGAATVHPARSAAQEVAIPQTQGWELVMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPDDKDRLMAYLLRHYRDPDGR
Ga0364934_0364712_1_2373300034178SedimentMWRDTLLAAALVSLGAAVALHAPPARAQPSPIPQTDGWELIMRCVICHSVEIAVQQRLGPRGWSDTLDRMIKYGAPIPP
Ga0364923_0180726_306_5603300034690SedimentMWRSLALAASLALLGASAPAPRFYSLSAQEVPIPRTPGWELVMRCVICHSVEIAVQQRLGPQGWSETLDRMIKYGAPIPPEDKAQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.