NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F030707

Metagenome / Metatranscriptome Family F030707

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F030707
Family Type Metagenome / Metatranscriptome
Number of Sequences 184
Average Sequence Length 114 residues
Representative Sequence MPHITTETSRMSWSMRVFVSRNPSRRTDTAVSLRVARALTRRTTASLLGAAGRESYLVAGAVRSLETVTGAAGIRYNAGSGTTLRFDVSVIRSRPVLSRRGVSLGVERGL
Number of Associated Samples 145
Number of Associated Scaffolds 184

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.63 %
% of genes from short scaffolds (< 2000 bps) 1.63 %
Associated GOLD sequencing projects 131
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.370 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(26.087 % of family members)
Environment Ontology (ENVO) Unclassified
(45.652 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.891 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166
1JGI25381J37097_10374811
2JGI25384J37096_100316963
3JGI25384J37096_100661301
4JGI25382J37095_102635281
5JGI25613J43889_101940911
6JGI25382J43887_103772671
7Ga0062589_1004760582
8Ga0066672_109814862
9Ga0066677_101530901
10Ga0066677_103965832
11Ga0066673_101849601
12Ga0066684_101957513
13Ga0066685_106210562
14Ga0070680_1002563762
15Ga0070691_107316161
16Ga0070711_1016967921
17Ga0070705_1001861421
18Ga0070700_1013402851
19Ga0066686_110222851
20Ga0066686_110648962
21Ga0066689_103169931
22Ga0066682_101517591
23Ga0070698_1019600542
24Ga0070697_1017774901
25Ga0066692_108867352
26Ga0066704_101183033
27Ga0066698_102009373
28Ga0066699_111940281
29Ga0066705_100143135
30Ga0066705_100363044
31Ga0066705_103873851
32Ga0066694_102957471
33Ga0066708_110516431
34Ga0066691_103753462
35Ga0066691_109153601
36Ga0066651_102815681
37Ga0066651_107278872
38Ga0066656_105883561
39Ga0066665_100338446
40Ga0066665_101891982
41Ga0066659_1000032516
42Ga0066659_101097983
43Ga0066660_115655041
44Ga0079220_104649232
45Ga0075421_1020043892
46Ga0075433_100189991
47Ga0075436_1001952672
48Ga0079216_110271151
49Ga0079219_103151871
50Ga0075419_111865981
51Ga0099791_102402261
52Ga0099794_101730512
53Ga0066710_1030487922
54Ga0066710_1031704532
55Ga0066710_1037216222
56Ga0099830_100113531
57Ga0099828_120026482
58Ga0099827_101918071
59Ga0066709_1026253691
60Ga0099792_108472631
61Ga0105347_15181461
62Ga0105066_10843572
63Ga0134070_101277363
64Ga0134088_100939352
65Ga0134109_100240671
66Ga0134109_100491171
67Ga0134064_101091752
68Ga0134111_101362952
69Ga0134080_100497911
70Ga0134071_102283582
71Ga0134062_100283711
72Ga0134127_110733721
73Ga0134123_101127693
74Ga0137393_117915792
75Ga0137452_12064451
76Ga0137421_11492971
77Ga0137364_106454391
78Ga0137363_103399561
79Ga0137399_104262202
80Ga0137399_115789831
81Ga0137376_103570152
82Ga0137379_111258432
83Ga0137379_117318331
84Ga0137378_102916601
85Ga0137378_115891421
86Ga0137377_103169622
87Ga0137377_107716912
88Ga0137377_108503102
89Ga0137387_105621791
90Ga0137387_107001472
91Ga0137386_103276322
92Ga0137366_104002472
93Ga0137384_103927681
94Ga0137375_109070831
95Ga0137360_118938751
96Ga0137361_102442772
97Ga0137361_102908481
98Ga0137361_112273401
99Ga0137361_115733092
100Ga0137390_104371042
101Ga0137398_107204902
102Ga0137395_109462511
103Ga0137396_110469671
104Ga0137394_101488613
105Ga0137359_101063581
106Ga0137359_101300791
107Ga0137413_103611762
108Ga0137419_114946881
109Ga0137416_102803872
110Ga0137416_106437442
111Ga0137416_110535271
112Ga0134077_101142462
113Ga0134077_103055602
114Ga0134076_103475711
115Ga0134076_104464071
116Ga0134075_102522621
117Ga0134075_102616312
118Ga0134075_105108871
119Ga0134078_100151973
120Ga0134079_104439631
121Ga0180087_10079742
122Ga0137412_100287696
123Ga0137403_114789721
124Ga0134085_100875052
125Ga0134112_102568982
126Ga0134112_105021972
127Ga0134083_104619032
128Ga0134083_105492331
129Ga0184610_10804752
130Ga0184626_103491542
131Ga0184637_106352461
132Ga0184612_101313491
133Ga0184612_104480211
134Ga0184629_101086611
135Ga0066667_102715681
136Ga0066667_105709081
137Ga0066667_114466822
138Ga0066662_122307431
139Ga0184643_10422881
140Ga0193713_10133561
141Ga0193739_10292001
142Ga0207684_103381972
143Ga0209235_12558412
144Ga0209236_10668991
145Ga0209236_10723561
146Ga0209027_10316733
147Ga0209238_12239821
148Ga0209468_10073101
149Ga0209239_10143211
150Ga0209239_10503142
151Ga0209268_11763202
152Ga0209155_11359971
153Ga0209152_103940891
154Ga0209803_12524741
155Ga0209803_12618811
156Ga0209159_10277551
157Ga0209808_12857851
158Ga0209807_10106725
159Ga0209807_11372181
160Ga0209160_11448541
161Ga0209058_12718361
162Ga0209056_101204543
163Ga0209474_102232582
164Ga0209648_106667322
165Ga0179587_101561211
166Ga0207480_1007991
167Ga0208454_10412092
168Ga0209177_101494932
169Ga0209283_100594014
170Ga0209590_102151532
171Ga0137415_109028222
172Ga0307282_100780253
173Ga0307282_106529562
174Ga0307284_102605441
175Ga0307296_106126211
176Ga0307312_101174141
177Ga0308187_100880571
178Ga0307495_102424561
179Ga0308179_10229042
180Ga0307468_1007281342
181Ga0307471_1039559261
182Ga0214471_106528361
183Ga0214471_107012571
184Ga0364928_0089084_279_716
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 48.55%    Coil/Unstructured: 51.45%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100110MPHITTETSRMSWSMRVFVSRNPSRRTDTAVSLRVARALTRRTTASLLGAAGRESYLVAGAVRSLETVTGAAGIRYNAGSGTTLRFDVSVIRSRPVLSRRGVSLGVERGLSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
98.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Groundwater Sand
Sediment
Populus Rhizosphere
3.3%5.4%26.1%12.5%21.2%12.0%3.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_103748113300002557Grasslands SoilALVARRSSAGVEYRRWNYAPGPVDIVIPHVTTEVARVTWDLRVFFSRNPSQRTDGAFTLRATGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF*
JGI25384J37096_1003169633300002561Grasslands SoilSWTMRVLVSRNPSRRTDAAATLRVARALSRRTTISLLGGGGRESYLVGGAVRSLKTATGVAGMRYNAGSGVTLRFDASVIHSSPILSRGGIAIGVERGL*
JGI25384J37096_1006613013300002561Grasslands SoilLIPHVTTETSRMSWSLRVXVSRNPSRRTDTAVSLRVTRAMTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRLDVSGVRSRPVLSRSGVSLGVEHGL*
JGI25382J37095_1026352813300002562Grasslands SoilPAFTPHNTWEGDVAALVAPRVSAGVGYRRWNYNVGPVDVLMPHITIETPRTSWTIRGFVSRNPSKRTDAAASLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLTGVAGIRYNAGSGTTIRLDVSGVRSRPILTRNGVSLGVERAL*
JGI25613J43889_1019409113300002907Grasslands SoilVALRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSRNPSKRTDAAAYVRLTRAVTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGVAGIRYNASTRTTIRFDVSGVHSRPILSRNGVSLGVERGL*
JGI25382J43887_1037726713300002908Grasslands SoilVARRASLGLAFRRWNYDVGAVDVVMPHITIETPRTSWTMRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL*
Ga0062589_10047605823300004156SoilATRALTRRTAISFLGAAGRESYLVAGAVRSLRTITGSAGIRYNTTGGTTFRIDLTVINSQPVLSRQGIALGVERGL*
Ga0066672_1098148623300005167SoilRATGGLSRRTTVSLVGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF*
Ga0066677_1015309013300005171SoilSRRPMFSPRNSWQADASALVAPRTSAGVEYRRWNYAPGPVDIVIPHVTTEVARVTWDLRVFLSRNPSQRTDGAFTLRSTGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDVSAIRSRPVLSRSGVAIGVERGF*
Ga0066677_1039658323300005171SoilRRTDTAVSLRASRALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGLSLGIERGL*
Ga0066673_1018496013300005175SoilFSRNPSRRTDAAVSLRATRSLSQRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNARSGLTLRVDATVIRSRPILSRRGIALSIEQEL*
Ga0066684_1019575133300005179SoilFTLRATGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRLDASAIHSRPVLSRSGVAIGVERGF*
Ga0066685_1062105623300005180SoilMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0070680_10025637623300005336Corn RhizosphereGYERRNYSPGPVDVVTPHATLEAGAVSWDLRLYLSRNPTRRTDAAFALRAAKPLSRRTSGWLLGGAGRESYVVGGAISALETVTGAAGIRHNAGSGWTVRLDATVIQSRPVLSRRGLAIGLERGF*
Ga0070691_1073161613300005341Corn, Switchgrass And Miscanthus RhizosphereDAATLVGHHASVGLTYRRWNYSEGPVDILMPRFTTETPRMSWTMRLFISRNPSDRTDWAGQVQVARAFSRRVTGSLLGAGGRETYSVAGSLQSLATTTIGLGARYNVGSNMTLRLDASFINSQPTLSRRGIALGLERGF*
Ga0070711_10169679213300005439Corn, Switchgrass And Miscanthus RhizosphereVWLTAEAGTSLRPAFTPHNTWEADAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITFESPRTSWTLRGFVSRNPSQRTDAAAYLRMTRAITRRTTVSLLGGAGRESYLVAGAIQSLKTLSGVAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGLERGL*
Ga0070705_10018614213300005440Corn, Switchgrass And Miscanthus RhizosphereNYAVGPVDVWMPHLTTETTRMSWTMRGFISRNPTRRTDVAVSLRATRAVSRRTTLALLGAAGRESYFVAGTVRSLKTVTGVAGIRYNAGTGMTLRFDASVIHSSPVLSRGGIAIGVERGL
Ga0070700_10134028513300005441Corn, Switchgrass And Miscanthus RhizosphereEADAATLVGHHASVGLTYRRWNYSEGPVDILMPRFTTETPRMSWTMRLFISRNPSDRTDWAGQVQVARAFSRRVTGSLLGAGGRETYSVAGSLQSLATTTIGLGARYNVGSNMTLRLDASFINSQPTLSRRGIALGLERGF*
Ga0066686_1102228513300005446SoilMRVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0066686_1106489623300005446SoilPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL*
Ga0066689_1031699313300005447SoilVFMPKNTWEVDASALVARRASLGLGYRRWNYGVGPVDVLMPHVTLETTRMSWTMRVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0066682_1015175913300005450SoilWEADAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSSNPSKRTDAAAYVRMTRAVTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASSRTTIRFDVSGVHSRPILSRNGVSLGVERGL*
Ga0070698_10196005423300005471Corn, Switchgrass And Miscanthus RhizosphereVVMPHITIESPRTSWTLRGFVSRNPSQRTDAAAYLRMTRAVTRRTTLSLLGGAGRESYLVAGTIQSLKMVSGVAGIRYNASTRTTIRFDVSGVRSRPILTRNGVALGLERGL*
Ga0070697_10177749013300005536Corn, Switchgrass And Miscanthus RhizosphereSAGVGYRRWNYAVGPVDVWMPHLTTETTRMSWTIRGFISRNPTRRTDVAVSLRATRAVSRRTTLALLGAAGRESYFVAGTVRSLKTVTGVAGVRYNAGTGMTLRFDASVIHSSPVLSRSGIAIGVERGL*
Ga0066692_1088673523300005555SoilNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL*
Ga0066704_1011830333300005557SoilGGAGRESYLVGAIVQSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRRGIAIGVERGF*
Ga0066698_1020093733300005558SoilNTWETDASALIAPRVSAGVGYRRWNYAVGSVDIWMPHVAIETNKMFWTMRVFVSRNPSHRTDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL*
Ga0066699_1119402813300005561SoilEYRRWNYAPGPVDIVIPHVTTEVARVTWDLRVFLSRNPSQRTDGAFTLRATGGLSRRTTVSLVGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF*
Ga0066705_1001431353300005569SoilPVDVLMPHFTAETRRMSWSVRVFISRNPSKRTDTAASLRATRALSRRTTVSLLGAAGRESYLVAGIVQSLKTLSGSAGIRYNAAGGTTLRLDVSVIRSRPILSRSGLSIGVEHGL*
Ga0066705_1003630443300005569SoilAALVAPRVSAGVGYRRWNYDVGAVDVVMPHVTIETARTSWTIRGFVSRNPSKRTDAAAYVRMTRAVTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASSRTTIRFDVSGVHSRPILSRNGVSLGVERGL*
Ga0066705_1038738513300005569SoilSSAGVGYRRWNYSVGSVDIVMPQLTTEISRVTWTVRGFVSRNPSRRTDAAVSLGATRPVTRRNALSVLGAAGRESYLVGGVVRSLKTLTGVAGIRWNSAGGSSFRIDVTVIRSRPVLSRYGVSVGMERGL*
Ga0066694_1029574713300005574SoilNPSNRTDAAAYLRMTRAVTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASSRTTIRFDVSGVHSRPILSRNGVSLGVERGL*
Ga0066708_1105164313300005576SoilGYRRWTYGVGPVDIWMPHVTTETSRMSWSMRVFVSRNPSRRTDTAVSLRASRALTRRTTASLLGAAGRESYLVAGAIRSLETATGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGLSLGIERGL*
Ga0066691_1037534623300005586SoilRTDGAFTLRATHGFSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRRGIAIGVERGF*
Ga0066691_1091536013300005586SoilDASALVARRTSAGVEYRRWNYTPGPVDIVIPHVTTEVARVTWDLRVFLSRNPSQRTDGAFTLRATRGFSRRTSAWVLGGAGRESYLVGAIVQSLQTVTGVAGIRYNAANGLTLRLDASVIRSRPVLSRRGVAIGVERGF*
Ga0066651_1028156813300006031SoilSLRATRSLTRRTAVSLLGAAGRESYLVAGAVQSLETLTGVAGIRYNATGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0066651_1072788723300006031SoilFLSRNPSKRTDTAVSLRVARALTRRTTVSLLGAGGRESYLVGGVVRSLDTVTGGAGIRYNATGGTTLRLDVTVIHSRPVLSRSGVSLRVERGL*
Ga0066656_1058835613300006034SoilHITTESNRMSWSLRVFVSRNPSKRTDAAASLRIARAITRRTTLSLLAAGGRESYLVGGVVRSLETVTGGAGMRYNGPGGTTLRLDVVAIRSRPVLSRRGFSISVERGL*
Ga0066665_1003384463300006796SoilYRRWNYVPGPVDIVIPHVTTEAARVTWDLRVFLSRNPSQRTDGAFTLRATRGFSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGIRYNAASGLTLRLDASVIRSRPVLSRRGIAIGVERGF*
Ga0066665_1018919823300006796SoilVLMPHVTLETTRMSWTMRVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0066659_10000325163300006797SoilVSRRTTISLLGAGGRESYLVAGVVQSLKTLSGVAGIRYNAAGGTTLRLDVSVIRSRPILSRSGLSIGVERVL*
Ga0066659_1010979833300006797SoilSKRTDGAFTLRATRGFSRRTSAWLLVGAGRESYLVGAIVQSLQTVTGVAGIRYNAASGLTLRLDASVIRSRPVLSRRGIAIGVERGF*
Ga0066660_1156550413300006800SoilPRNSWQADASALVARRTSAGVEYRRWNYAPGPVDIVIPHVTTEVAGVTWDLRAFLSRNPSQRTDGAFTLRATGGLSRRTTVSLVGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGLERGF*
Ga0079220_1046492323300006806Agricultural SoilPHLTLETNRMSWTMRVFVSRNPSRRTDTAVSLRATRALTRRTTASLLGAAGRESYLVTGAIRSLETVTGVAGIRYNAGSGTTLRFDVSVIRSRPVLSRSGLALGVEHGL*
Ga0075421_10200438923300006845Populus RhizosphereGFGYRRWNYAEGPVDVLMPHFTTETNKMSWTIRVYVSRNPSERWDTAALLRVARAFSRRVTGSLLGAAGRETYLVGGALQSLETATIGLGARYNAGSGMTVRFDATFINSQPVLSRRGIALSLERAL*
Ga0075433_1001899913300006852Populus RhizosphereLRATRALTRRTTASLLGAAGRESYLVTGAIRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGLSVGVERGL*
Ga0075436_10019526723300006914Populus RhizosphereSWSVRVFLSRNPSKRTDAAASLRATRALSRRTTVSLLGAAGRESYLVAGSVQSLKTLSGVAGIRYNAAGGTTLRLDVSVIRSRPILSRGGLAIGIERGL*
Ga0079216_1102711513300006918Agricultural SoilPRVWLTAEAGTSLEPVFSPKNTWEADATALVAGRASLGIGYRRLNYAAGPVDVVIPHLTAETTRMSWSLRVYLSRNPSKRTDVAGALRVARALSRRTTGSLLGGAGRESYLVGSTIRSLETTTIGAGIRYNAGSGMTLRFDATVINSQPVLSRRGVALGLERGF*
Ga0079219_1031518713300006954Agricultural SoilPTARVWLTAEAGTALRPAFTPHNTWEADAAALIAPRVSAGVGYRRWNYDVGAVDIVMPHITFDSPRTSWTMRAFVSRNPSQRTDAAAYVRMTRAITRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGLERGL*
Ga0075419_1118659813300006969Populus RhizosphereAHEPLFIPKNTWEADASALVARRSSLGIGYRRWNYAVGPVDVVMPHFTTETNRMAWSMRVFVSRNPSRRTDTAVSLRVARAFTRRTTGSLLGAAGRESYLVGGSIRSLETVTIGAGIRYNAGSGMTLRVDASVINSQPVLSRRGVAIGLERGL*
Ga0099791_1024022613300007255Vadose Zone SoilPRLWVTAEAGTAHQPDFMPKNTWDADVSALVSQRSSVGLGYRRWNFGAGPVDIVIPHITTEMSRTSLDLRVYLSRNPSRRTDTAVSLRAGHALSRRTSGWVMGGAGRESYLVGGAVQSLKTLTGVAGIRYNAGTGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0099794_1017305123300007265Vadose Zone SoilPGPVDILMPHVTTETSRMSWSMRVFVSRNPSRRTDTAVSLRVTRAMTHRTTVPLLGGAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRVDVSFIRSRPVLSRNGVSLGVEHGL*
Ga0066710_10304879223300009012Grasslands SoilYVPGPVDIVIPHVTTEVARVTWDLRVFLSRNPSKRTDGAFTLRATHGFSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGIRYNAASGLTLRLNASVIRSRPVLSRRGIAIGVERGF
Ga0066710_10317045323300009012Grasslands SoilLIPHVTTETSRMSWSLRVFVSRNPSRRTDTEVSLRVTREMTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRLDVTGVRSRPVLSRNGVSLGVEHGL
Ga0066710_10372162223300009012Grasslands SoilDRAYCCRSPSRRTAAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNARSGLTLRVDATVIRSRPILSRRGIALSIEQEL
Ga0099830_1001135313300009088Vadose Zone SoilPSRRTDTAVSLRVTRAMTRRTTVSLLGGAGRESYLVAGAIRSLETVTGVAGIRYHAGGGTTLRVDVSFIRSRPVLSRNGVSLGVEHGL*
Ga0099828_1200264823300009089Vadose Zone SoilRMSWTMRVFVSRNPSRRTDAAVYLRATRALTRRTTASLLGTVGRESYSVAGAVQSLETVTGVAGIRYNAGGGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0099827_1019180713300009090Vadose Zone SoilWEVDAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSRNPSKRTDAAAYLRMTRALTRRTTISLLGGAGRESYLVGGAIQSLKTLSGVAGIRYNASSRTTIRFDVSGVRSRPVLTRNGVSLGVERGL*
Ga0066709_10262536913300009137Grasslands SoilVRTDAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNARSGLTLRVDATVIRSRPILSRR
Ga0099792_1084726313300009143Vadose Zone SoilNPSRRTDTAVSMRATRSLTRRTAVSLLGATGRESYLVGAAVQSLKTLTGVAGIRYNAGSGMTIRFDVSAIRSRPVLSRRGVSLGVERGL*
Ga0105347_151814613300009609SoilLFIPHNTWEADASALIAQRASLGLGYRRWNYAEGPVDILMPHFTTETSKMSWTMRVFISRNPSERTDLAASLRVARAFSRRTTGSLFGAAGRETYLVGGALQSLETATIGLGARYNAGSGMTLRFDATFINSQPVLSRRGIAIGVERGF*
Ga0105066_108435723300009822Groundwater SandMGYRRLNYAAGPVDVVMPHVTTETSRMSWSLRVYQSRNPSRRTDTAVSLRATRALSRRTTGSLLGAAGRESYLVGGAVRSLETVTGVAGIRYNAGSGMTLRFDATVIRSRPVLSRNGIAIGVERGF*
Ga0134070_1012773633300010301Grasslands SoilWNYAVGPVDIWMPHVATETNKMSWTMRVFVSRNPSHRTDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL*
Ga0134088_1009393523300010304Grasslands SoilTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0134109_1002406713300010320Grasslands SoilPVFTPMNTWETDASALVAARTSLGLGYRRWNYAVGPVDVLMPHLTTESTRMSWSVRAFIARNPSKRTDTAASLRATRSLTRRTTFSVLGAAGRESYLVAGVVRSLETLSGVAGIRYNAAGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0134109_1004911713300010320Grasslands SoilVDIVIPHVTAEATRMSWDLRAYFSRNPSRRTDAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNARSGLTLRVDATVIRSRPILSRRGIALSIEQEL*
Ga0134064_1010917523300010325Grasslands SoilLVAARTSLGLGYRRWNYAVGPVDVLMPHLTTESTRMSWSVRAFIARNPSKRTDTAASLRATRSLTRRTTFSVLGAAGRESYLVAGVVRSLETLSGVAGIRYNAAGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0134111_1013629523300010329Grasslands SoilWETDASALIAPRVSTGVGYRRWNYAVGPVDIWMPHITTETSKMSWTMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0134080_1004979113300010333Grasslands SoilDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL*
Ga0134071_1022835823300010336Grasslands SoilGPVDVLMPHLTTESNRMSWSLRAFISRNPSKRTDTAAALRATRALTRRTTISLLGAAGRESYLVGGAVRSLETLSGVAGIRYNAAAGTTLRFDVSVIRSRPVLSRRGVSISVERGL*
Ga0134062_1002837113300010337Grasslands SoilRPVFTPMNTWETDASALVAARTSLGLGYRRWNYAVGPVDVLMPHLTTESTRMSWSVRAFIARNPSKRTDTAASLRATRSLTRRTTFSVLGAAGRESYLVAGVVRSLETLSGVAGIRYNAAGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0134127_1107337213300010399Terrestrial SoilRNPTRRTDAAFALRAAKPLSRRTSGWLLGGAGRESYVVGGAISALETVTGAAGIRHNAGSGWTVRLDATVIQSRPVLSRRGLAIGLERGF*
Ga0134123_1011276933300010403Terrestrial SoilASVGLTYRRWNYSEGPVDILMPRFTTETPRMSWTMRLFISRNPSDRTDWAGQVQVARAFSRRVTGSLLGAGGRETYSVAGSLQSLATTTIGLGARYNVGSNMTLRLDASFINSQPTLSRRGIALGLERGF*
Ga0137393_1179157923300011271Vadose Zone SoilSVRVFLSRNPSKRTDTAASLRATRALSRRTTVSLLGAAGRESYLVAGIVQSLKTLSGAAGIRYNAARGTTLRLDVSVIRSRPILSRSGLSIGVERGL*
Ga0137452_120644513300011441SoilIPHNTWEADASALIAQRASLGLGYRRWNYAECPVDILLPHFTTETSKMSWTMRVFISRNPSERTDLAASLHVARAFSRRTTGSLFGAAGRETYLVGGALQSLETATIGLGARYNAGSGMTLRFDATFINSQPVLSRRGIAIGVERGF*
Ga0137421_114929713300012039SoilPLFIPHNTWEADASALIAQRASLGLGYRRWNYAEGPVDILMPHFTTETSKMSWTMRVFISRNPSERTDLAASLRVARAFSRRTTGNLFGAAGRETYLVGGALQSLETATIGLGARYNAGNRMTLRFDATFINSQPVLSRRGIAIGVERGF*
Ga0137364_1064543913300012198Vadose Zone SoilWTMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAVRGITLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0137363_1033995613300012202Vadose Zone SoilMRVFVSRNPSRRTDVAVSLRATRALTRRTTASLLGAAGRESYLVTGAVRSLKTVTGAAGIRYNAGSGTTLRFDVSVISSQPVLSRSGVSLGVERGL*
Ga0137399_1042622023300012203Vadose Zone SoilSRNPSRRTDTAVSLRVARALTRRTTASLLGGAGRESYLVAGAVRSLETVTGIAGIRYNAGSGTTLRFDVSVIRSRPVLSRSGVSLGVEHGL*
Ga0137399_1157898313300012203Vadose Zone SoilSRRPAFTPHNTWEVDAAALVAPRVSAGVGYRRWNYDVGAVDAVMPHITVETPRTSWTIRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGVAGIRYNASSRTTIRFDVSGVRSRPILTRNGVSLGVERGL*
Ga0137376_1035701523300012208Vadose Zone SoilKNTWEVDATALVAARSSVGVGYRRWNYGVGPVDVVIPHITTETSRMVWTVRAFISRNPSKRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVAGAVQSLETLTGVAGIRYNATGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0137379_1112584323300012209Vadose Zone SoilYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0137379_1173183313300012209Vadose Zone SoilVAARSSIGLGYRRWNYAVGPVDVLMPHITTETSRMSWSLRVFLSRNPSRRTDTAVSLRAARALTRRTTVSLLGAAGRESYLVGGAVRSLETVTGSAGIRYNAARGMTLRFDATVTRSRPVLSRNGIAIGVERGL*
Ga0137378_1029166013300012210Vadose Zone SoilRRTSAGVEYRRWNYTPGPVDIVIPHVTTEVARVTWDLRVFLSRNPSQRTDGAFTLRATRGLSRRTSAWFLGGAGRESYLVGAVVQSLQTVTGVAGIRYNAANGLTLRLDASVIRSRPVLSRRGVAIGVERGF*
Ga0137378_1158914213300012210Vadose Zone SoilMPHITTETSKMSWTMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0137377_1031696223300012211Vadose Zone SoilWNYAVGPVDVLMPHITTESKRMSWSVRAYISRNPSKRIDTSASLRATRSVTRRTTVSVLGATGRESYLVGGVVRSLETLSGGAGIRYNAPGGTTLRFDVSVIRSRPVLSRSGISLGLERGL*
Ga0137377_1077169123300012211Vadose Zone SoilRATRSLTRRTTVSLLGAAGRESYLVGGAVQSLKTLTGLAGIRYNAGSGMTLRFDVSAIRSRPVLSRRGVSLGVERGL*
Ga0137377_1085031023300012211Vadose Zone SoilVGPVDIWMPHITTETSKMSWTMRVFVSRNPSHRTDGAVSLRVARALSRRTTVSLLGAGGRESYLVGSVVRSLKTATGVAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0137387_1056217913300012349Vadose Zone SoilRTSVGLGYRRWNYAVGPVDVLMPHLATESQRMSWSVRVYVSRNPSKRTDTAASLRATRSLTRRTTVSVLGAAGRESYLVGGAVRSLETLTGVAGIRYNAAGGMTLRFDVSVIRSRPVLSRSGVSISVEHGL*
Ga0137387_1070014723300012349Vadose Zone SoilKNTWEVDASGLVAARSSIGLGYRRWNYAVGPVDILMPHITTETNRMSWSLRVFLSRNPSRRTDTAVSLRSARALTRRTTVSLLGAAGRESYLVGGAVRSLETVTGSAGIRYNAARGMTLRFDATVTRSRPVLSRNGIAIGVERGL*
Ga0137386_1032763223300012351Vadose Zone SoilRMSWSVRAYISRNPSKRIDTSASLRATRSVTRRTTVSVLGATGRESYLVGGVVRSLETLSGGAGIRYNAPGGTTLQFDVSVIRSRPVLSRSGISLGVERGL*
Ga0137366_1040024723300012354Vadose Zone SoilARSSIGLGYRRWNYAVGPVDVLMPHITTETSRMSWSLRVFLSRNPSRRTDTAVSLRAARALTRRTTVSLLGAAGRESYLVGGAVRSLETVTGSAGIRYNAARGMTLRFDATVIRSRPVLSRNGIAIGVERGL*
Ga0137384_1039276813300012357Vadose Zone SoilTSWPLRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGVAGIRYNASSRTTIRFDMSGVRSRPSLTRNGVSLGVERGL*
Ga0137375_1090708313300012360Vadose Zone SoilVGLGYRRWNYAVGPVDVLMPHLTTESQRMSWSVRVYVSRIPSKRTDTAASLRATRSLTRRTTVSVLGAAGRESYLVGGVVRSLETLSGVAGIRYNAAGGTTLRFDVSVIRSRPVLSRSGVSIGVERGL*
Ga0137360_1189387513300012361Vadose Zone SoilVDVMMPHITTETSRMVWTARAFISRNPSKRTDTAVSLLATRSLTRRTAVSLLGAAGRESYLVAGAVQSLKTLTGVAGIRYNAASGTTLRFDINVIRSRPVLSRSGVSLGVERGL*
Ga0137361_1024427723300012362Vadose Zone SoilPRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSRNPSKRTDAAAYVRMTRALTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGVAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGVERGL*
Ga0137361_1029084813300012362Vadose Zone SoilRMTRALTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASSRTTVRFDVSGVRSRPILTRNGVSLGVERGL*
Ga0137361_1122734013300012362Vadose Zone SoilDAAATLRVARALSRRTTISLLGGGGRESYLVGGAVRSLKTATGVAGMRYNAGSGVTLRFDASVIHSRPILSRGGIAIGVERGL*
Ga0137361_1157330923300012362Vadose Zone SoilLGLGYRRWNYGAGPVDVVIPHVTLETNRMSWTMRVSVSRNPSRRTDTAVYLRAARALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRFDISVIRSRPVLSRSGVSLGVERGL*
Ga0137390_1043710423300012363Vadose Zone SoilETRRMSWSVRVFLSRNPSKRTDTAASLRATRALSRRTTVSLLGAAGRESYLVAGIVQSLKTLSGAAGIRYNAARGTTLRLDVSVIRSRPILSRSGLSIGVERGL*
Ga0137398_1072049023300012683Vadose Zone SoilRRASLGLGYRRWNYGAGPVDILLPHVTLETNRMSWTLRVFVSRNPSRRTDTAASLRATRALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRFDISVIRSRPVLSRSGVSLGVERGL*
Ga0137395_1094625113300012917Vadose Zone SoilRRTDTAVYLRAARTLTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0137396_1104696713300012918Vadose Zone SoilASALVARRSSIGLGYRRWNYGVGPVDIVMPHVTVETSRMSWSLRAFVSRNPSRRTDTAIYLRATRALTRRTTASLLGGAGRESYLVAGAVRSLETVTGIAGIRYNAGSGTTLRFDVSVIRSRPVLSRSGVSLGVEHGL*
Ga0137394_1014886133300012922Vadose Zone SoilVRAFVSRNPSRRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVGGAVRSLETLTGIAGIRYNAGSGMTLRFDVSAIRSRPVLSRRGVSLGVERGL*
Ga0137359_1010635813300012923Vadose Zone SoilSRRPAFTPHNTWEVDAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSRNPSKRTDAAAYVRMTRALTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGVAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGVERGL*
Ga0137359_1013007913300012923Vadose Zone SoilRRTDTAVYLRAARALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRFDISVIRSRPVLSRSGVTLGVERGL*
Ga0137413_1036117623300012924Vadose Zone SoilRRWNYGVGPVDVVMPHITTETSRMVWTARAFISRNPSKRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVAGAVRSLETLTGVAGIRYNAASGTTLRFDISVIRSRPVLSRSGVTLGVERGL*
Ga0137419_1149468813300012925Vadose Zone SoilIGLGYRRLNYAAGPVDVLTPHITTETTRMSWSVRAFVSRNPSRRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVGGAVRSLETLTGIAGIRYNAGSGMTLRFDVSAIRSRPVLSRRGVSLGVERGL*
Ga0137416_1028038723300012927Vadose Zone SoilGTSHQPQFSPKNTWEVDASGLVAARSSIGLGYRRLNYAAGPVDVLTPHITTETNRMSWSLRAFLSRNPSRRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVGGAVRSLETLTGIAGIRYNAGSGMTLRFDVSAIRSRPVLSRRGVSLAVERGL*
Ga0137416_1064374423300012927Vadose Zone SoilNRMSWTMRVSVSRNPSRRTDTAVYLRAARALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRIDVSVIHSRPVLSRNGLSLGVERGL*
Ga0137416_1105352713300012927Vadose Zone SoilMRVSVSRNPSRRTDTAVSLRVARALTRRTTASLLGAAGRESYLVAGAVRSLETVTGGAGIRYNAGSGTTLRFDVSVIRSRPVLSRSGVSLGVEHGL*
Ga0134077_1011424623300012972Grasslands SoilRWNYGVGPVDVLMPHVTLETTRMSWTMRVFVSRNPSRRTDTAVYLRATRAMTRRTTASLLGAAGSESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL*
Ga0134077_1030556023300012972Grasslands SoilWEADASALVSQRSSIGVGYRRWNYAAGPVDVVMPHITTESNRMSWSIRVYVSRNPSKRTDTAVSLRVARAVTRRTTLSLLAGGGRESYLVGGVVRSLETLTGGAGMRYNGPGGTTVRLDVVAIRSRPVLSRRGFSVSVERGL*
Ga0134076_1034757113300012976Grasslands SoilQPTFSPKNTWETDASALIAPRVSAGVGYRRCNYAVGPVDIWMPHVATETNKMSWTMRVFVSRNPSHRTDAAVSLRVARALSRRTIASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL*
Ga0134076_1044640713300012976Grasslands SoilLRAFISRNPSKRTDTAAALRATRALTRRTTISLLGAAGRESYLVGGVVRSLETLSGVAGIRYNAATGTTLRFNVSVIRSRPVLSRRGVSISVERGL*
Ga0134075_1025226213300014154Grasslands SoilGYRRWNYAVGPVDIWMPHVATETNKMSWTMRVFVSRNPSHRTDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL*
Ga0134075_1026163123300014154Grasslands SoilVDVVMPHITIETPRTSWTMRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL*
Ga0134075_1051088713300014154Grasslands SoilYRRWNYAAGPVDVGMPHITTESNRMSWSIRVYVSRNPSKRTDTAVSLRVARAVTRRTTLSLLAGGGRESYLVGGVVRSLETLTGGAGMRYNGPGGTTVRLDVVAIRSRPVLSRRGFSVSVERGL*
Ga0134078_1001519733300014157Grasslands SoilRMSWSVRAFIARNPSKRTDTAASLRATRSLTRRTTFSVLGAAGRESYLVAGVVRSLETLSGVAGIRYNAAGGTTLRFDVSVIRSRPVLSRSGVSLGVERGL*
Ga0134079_1044396313300014166Grasslands SoilRNPSQRTDGAFTLRATGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF*
Ga0180087_100797423300014872SoilFIPHNTWEADASALIAQRASLGLGYRRWNYAEGPVDILMPHFTTETSKMSWTMRVFISRNPSERTDLAASLRVARAFSRRTTGNLFGAAGRETYLVGGALQSLETATIGLGARYNAGSGMTLRFDATFINSQPVLSRRGIAIGVERGF*
Ga0137412_1002876963300015242Vadose Zone SoilSIGVGYRRWNYGVGPVDVVMPHITTETSRMVWTARAFISRNPSKRTDTAVSLRATRSLTRRTAVSLLGAAGRESYLVAGAVRSLETLTGVAGIRYNAASGTTLRFDISVIRSRPVLSRSGVTLGVERGL*
Ga0137403_1147897213300015264Vadose Zone SoilPSRRTDTAVSLRATRSLTRRTTVSLLGAAGRESYLVGGAVQSLKTVTGVAGIRYNAGSGMTVRFDVSAIRSRPVLSRRGVSLGVERGL*
Ga0134085_1008750523300015359Grasslands SoilLRVARAVSRRTTVSVLGAGGRESYLVGSVVRSLKTATGVAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL*
Ga0134112_1025689823300017656Grasslands SoilSRALSRRTTLSLLGAGGRESYLVGSVVRSLETATGVAGIRYNAGSGTTLRIDASVIHSSPILSRGGIAIGAERAF
Ga0134112_1050219723300017656Grasslands SoilLMPHLTTESNRMSWSLRGFISRNPSKRTDTAAALRATRALTRRTTISLLGAAGRESYLVGGVVRSLETLSGVAGIRYNAATGTTLRFDVSVIRSRPVLSRSGVSVSVERGL
Ga0134083_1046190323300017659Grasslands SoilRALSRRTTLSLLGAGGRESYLVGSVVRSLETATGVAGIRYNAGSGTTLRIDASVIHSSPILSRGGIAIGAERAF
Ga0134083_1054923313300017659Grasslands SoilAFRRWNYDVGAVDVVMPHITIETPRTSWTMRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL
Ga0184610_108047523300017997Groundwater SedimentVTRRATLGLGYRRWNYAVGPVDVLIPHMTAETGKTAWSLRVFVSRNPSRRTDTAASLRATRALSRRTTGSLLGAAGRESYLVGGTVRSLETITGAAGIRYNAGSGMTLRLDATVIRSRPVLSRRGVAIGVERGL
Ga0184626_1034915423300018053Groundwater SedimentPSRRTDTAASLRATRALSRRTTGSLLGAAGRESYLVGGTVRSLETITGAAGIRYNAGSGMTLRLDATVIRSRPVLSRRGVAIGVERGL
Ga0184637_1063524613300018063Groundwater SedimentADATALVTRRSSVGIGYRRLNYAAGPVDVVMPHVTTETSRMSWSLRAYLSRNPSKRTDTAVSLRVSRALSRRTTGSLLGAAGRESYLIGGAVRSRETVTGAAGIRYNAGSGMTLRFDVTVIRSRPVLSRNGIAIGVERGF
Ga0184612_1013134913300018078Groundwater SedimentLVARRASLGVGYRRLNYAAGAVDVVMPHVTTETSRMSWSLRVYLSRNPSRRTDTAVSLRATRALSRRTTGSLLGAAGRESYLVAGAVRSLETITGTAGIRYNAGSGITLRFDATVIRSRPVLSRNGIAVGVERGF
Ga0184612_1044802113300018078Groundwater SedimentLVARRASLGVGYRRLNYAAGAVDVVMPHVTTETSRMSWSLRVFVSRNPSRRTDTAASLRATRALSRRTTGSLLGAAGRESYLVGGTVRSLETITGAAGIRYNAGSGMTLRLDATVIRSRPVLSRRGVAIGVERGL
Ga0184629_1010866113300018084Groundwater SedimentRWNYGVGPVDVVMPHITTETSKTVWSVRGFVSRNPSKRTDTAVSLRVARAFSRRTTGSLLGAAGRESYLVGGTIRSLETTTVGAGIRYNAGSGMTLRLDATVINSQPVLSRRGVALGVERGL
Ga0066667_1027156813300018433Grasslands SoilGVGYRRWNYAVGPVDIWMPHITTETSKMSWTMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL
Ga0066667_1057090813300018433Grasslands SoilSAGVGYRRWNFAVGPVDIVIPHVTAEATRMSWDLRAYFSRNPSRRTDAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNAPSGLTLRIDATVIRSRPILSRRGIALSIEQEL
Ga0066667_1144668223300018433Grasslands SoilSRRTDTAVSLRASLALTRRTTASLLGGAGRESYLVAGAIRSLETVTGVAGVRYNAGSGTTLRFDVSVIHSRPVLSRSGLSLGIERGL
Ga0066662_1223074313300018468Grasslands SoilVARRASLGLAFRRWNYDVGAVDVVMPHITIETPRTSWTMRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILTRNGLSLGVERGL
Ga0184643_104228813300019255Groundwater SedimentMPHITTETSRMSWSMRVFVSRNPSRRTDTAVSLRVARALTRRTTASLLGAAGRESYLVAGAVRSLETVTGAAGIRYNAGSGTTLRFDVSVIRSRPVLSRRGVSLGVERGL
Ga0193713_101335613300019882SoilRTTASRLGAAGRESYLVAGAVRSLETVTGVAGIRYNAGTGTTLRFDVSVIRSRPVLSRSGVSLGVERGL
Ga0193739_102920013300020003SoilNTWEADATALVARRSSLGIGYRRLNYAAGPVDVVMPHLTTETGKTAWSLRVFLSRNPSRRTDTAVSLRVTRALSRRTTGSLLGAAGRESYLVGGAVRSLETVTGGAGIRHNAGSGMTLRFDVNVIRSKPVLSRNGIAIGVERGF
Ga0207684_1033819723300025910Corn, Switchgrass And Miscanthus RhizosphereRMSWSVRVFLSRNPSKRTDTAASLRATRALSRRTTVSLLGAAGRESYLVAGIVQSLKTLSGAAGIRYNAAGGTTLRLDVSVIRSRPILSRGGLSIGVERGL
Ga0209235_125584123300026296Grasslands SoilSRRTDTAVSLRVTRAMTRRTTVSLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRLDVSGVRSRPVLSRSGVSLGVEHGL
Ga0209236_106689913300026298Grasslands SoilVSLRATRALTRRTTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRLDVSGVRSRPVLSRSGVSLGVEHGL
Ga0209236_107235613300026298Grasslands SoilSRNPSHRTDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL
Ga0209027_103167333300026300Grasslands SoilRVTWDLRVFFSRNPSQRTDGAFTLRATGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF
Ga0209238_122398213300026301Grasslands SoilAEATRMSWDVRAYFSRNPSRRTDAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGVRYNARSGLTLRVDATVIRSRPILSRRGIALSIEQEL
Ga0209468_100731013300026306SoilGYRRWNYAVGPVDIWMPHITTETSKMSWTMRVFVSRNPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL
Ga0209239_101432113300026310Grasslands SoilGGLSRRTTVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRSGVAIGVERGF
Ga0209239_105031423300026310Grasslands SoilASALVARRTSAGVEYRRWNYTPGPVDIVIPHLTTEVARVTWDLRVFLSRNPSKRTDGAFTLRATHGFSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRRGIAIGVERGF
Ga0209268_117632023300026314SoilAVSLRATRSLTRRTAVSLLGAAGRESYLVAGAVQSLETLTGVAGIRYNATGGTTLRFDVSAIRSRPVLSRSGVSLGVERGL
Ga0209155_113599713300026316SoilPSHRTDGAVSLRVTRALSRRTTVSLLGAGGRESYLVGAVVRSLRTATGAAGIRYNAARGMTLRLDATVIHSSPILSRTGVAIGVERGL
Ga0209152_1039408913300026325SoilSIGLGYRRWNYGVGPVDVWTPHVTTETSRMSWSMRVFVSRNPTRRTDTAVSLRASRALTRRVTASLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL
Ga0209803_125247413300026332SoilRQPVFMPKNTWEVDASALVARRASLGLGYRRWNYGVGPVDVLMPHVTLETTRMSWTMRVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL
Ga0209803_126188113300026332SoilVGYRRWNYAVGPVDIWMPHAATETNKTSWTMRVFVSRNPSHRTDAAVSLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVGGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL
Ga0209159_102775513300026343SoilRSLTRRTAVSLLGAAGRESYLVAGAVQSLETLTGVAGIRYNATGGTTLRFDVSAIRSRPVLSRSGVSLGVERGL
Ga0209808_128578513300026523SoilEATRMSWDLRAYFSRNPSRRTDAAVSLRATRSLSLRTSGSLLGSAGRESYLVGAAVRSLETVTGSAGIRYNAPSGLTLRIDATVIRSRPILSRRGIALSIEQEL
Ga0209807_101067253300026530SoilVTWDLRVFFSRNPSQRTDGAFTLRATGGLSRRTVVSLLGGAGRESYLVGSVVRSLQTVTGVAGLRYHAANGLTLRLDASAIRSRPVLSRSGVAIGVERGF
Ga0209807_113721813300026530SoilRWNYDVGAVDVVMPHVTIETARTSWTIRGFVSRNPSKRTDAAAYVRMTRAVTRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASSRTTIRFDVSGVHSRPILSRNGVSLGVERGL
Ga0209160_114485413300026532SoilFTLRATHGFSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGLRYHAANGLTLRFDASAIRSRPVLSRRGIAIGVERGF
Ga0209058_127183613300026536SoilEAGTSRQPTFSPKNTWETDASALIAPRVSAGVGYRRWNYAVGSVDIWMPHVAIETNKMFWTMRVFVSRNPSHRTDAAVLLRVARALSRRTTASLLGAGGRESYLVGAVVRSLTTATGVAGIRYNAARGMTLRFDVTVIHSSPILSRTGVAIGVERGL
Ga0209056_1012045433300026538SoilVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYFVAGGVRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGVSLGVERGL
Ga0209474_1022325823300026550SoilDAAVSLGATRPVTRRHALSVLGAAGRESYLVGGVVRSLKTLTGVAGIRWNSAGGSTFRIDVTVIRSRPVLSRYGVSVGMERGL
Ga0209648_1066673223300026551Grasslands SoilASALVTRRTSAGVEYRRWNYAPGPVDIVIPHVTSEVARVTWDLRVSLSRNPSKRTDGAFTLRATHGLSRRTSAWLLGGAGRESYLVGAIVQSLQTVTGVAGIRYNAANGLTLRLDASVIRSRPVLSRRGVAIGVERGF
Ga0179587_1015612113300026557Vadose Zone SoilVFVSRNPSRRTDTAVYLRATRALTRRTTASLLGAAGRESYLVTGAIRSLETVTGIAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGLSVGVERGL
Ga0207480_10079913300026722SoilLTRRTAISFLGAAGRESYLVAGAVRSLRTITGSAGIRYNTTGGTTFRIDLTVINSQPVLSRQGIALGVERGL
Ga0208454_104120923300027573SoilDASALIAQRASLGLGYRRWNYAEGPVDILMPHFTTETSKMSWTMRVFISRNPSERTDLAASLRVARAFSRRTTGSLFGAAGRETYLVGGALQSLETATIGLGARYNAGSGMTLRFDATFINSQPVLSRRGIAIGVERGF
Ga0209177_1014949323300027775Agricultural SoilRVSAGVGYRRWNYDVGAVDIVMPHITFDSPRTSWTMRAFVSRNPSQRTDAAAYVRMTRAITRRTTVSLLGGAGRESYLVGGTIQSLKTLSGIAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGLERGL
Ga0209283_1005940143300027875Vadose Zone SoilGPVDILMPHVTTETSRMSWSMRVFVSRNPSRRTDTAVSLRVTRAMTHRTTVSLLGGAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRVDVSFIRSRPVLSRNGVSLGVEHGL
Ga0209590_1021515323300027882Vadose Zone SoilVDAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITIETPRTSWTIRGFVSRNPSKRTDAAAYLRMTRALTRRTTVSLLGGAGRESYLVGGAIQSLKTLSGIAGIRYNASSRTTIRFDVSGVRSRPILSRNGVALGVERGL
Ga0137415_1090282223300028536Vadose Zone SoilVFVSRNPSRRTDTAVSLRVTRAMTRRTTVSLLGAAGRESYLVAGAIRSLETVTGVAGIRYNAGGGTTLRLDVTGVRSRPVLSRSGVSLGVEHGL
Ga0307282_1007802533300028784SoilHNTWEVDAAALVAPRASAGVGYRRWNYNVGAVDVVMPHVTIDTPRMSWTIRGFVSRNPSKRTDAAGYLRMARALTRRTTVSLLGGAGRESYLVGGTIQSLKTLTGGAGIRYTASSGTTVRFDVTGIRSRPILTRNGVAVGVERAL
Ga0307282_1065295623300028784SoilTETSRMVWTMRGFISRNPSKRTDTAVSLRASRSLTRRTAVSLLGAAGRESYLVAGAVRSLKTLTGVAGIRYNAASGTTLRFDISVIRSRPVLSRSGVSLGVERGL
Ga0307284_1026054413300028799SoilAFTPHNTWEVDAAALVAPRASAGVGYRRWNYNVGAVDVVMPHVTIDTPRMSWTIRGFVSRNPSKRTDAAGYLRMARALTRRTTVSLLGGAGRESYLVGGTIQSLKTLTGGAGIRYTASSGTTVRFDVTGIRSRPILTRNGVAVGVERAL
Ga0307296_1061262113300028819SoilPAFTPHNTWEVDAAALVAPRASAGMGYRRWNYNVGAVDVVMPHVAIDSPRMSWTIRGFVSRNPSKRTDVAGYLRMTRALTRRTTVSLLGGAGRESYLVGGTIQSLKTVTGAAGIRYTASSGTTLRVDVTGIRSRPILTRNGVAVGVERAL
Ga0307312_1011741413300028828SoilSLRAARALTRRTTASLLGAAGRESYLVAGAVRSLETVTGVAGIRYNAGTGTTLRFDVSVIRSRPVLSRSGVSLGVERGL
Ga0308187_1008805713300031114SoilTSRMVWTMRGFISRNPSRRTDTAVSLRAARSLTRRTAVSLLGAAGRESYLVAGAVRSLETLTGVAGIRYNATGGTTLRFDISVIRSRPVLSRSGVSLGVERGL
Ga0307495_1024245613300031199SoilSWEADASTLIGHRSAVGLTYRRGNYVEGPVDILMPRFTTETSKMSWTMRLFISRNPSDRTDWAGQLQVARAFSRRITGSLLGAGGRETYSVAGSLQSLETATIGLGARYNAGKNLTLRLDATFINSQPVLSRRGIALGLERGF
Ga0308179_102290423300031424SoilSLTRRTAVSLLGAAGRESYLVAGAVRSLETLTGVAGIRYNATGGTTLRFDISVIRSRPVLSRSGVSLGVERGL
Ga0307468_10072813423300031740Hardwood Forest SoilTSLRPAFTPHNTWEADAAALVAPRVSAGVGYRRWNYDVGAVDVVMPHITIESPRTSWTLRGFVSRNPSQRTDAAAYLRMTRAVTRRTTLSLLGGAGRESYLVAGTIQSLKTLSGVAGIRYNASTRTTIRFDVSGVRSRPILTRNGVSLGLERGL
Ga0307471_10395592613300032180Hardwood Forest SoilAVSLRATRALTRRTTASLLGAAGRESYLVTGAIRSLETVTGVAGIRYNAGSGTTLRFDVSVIHSRPVLSRSGLAVGVERGL
Ga0214471_1065283613300033417SoilTGSLLGAAGRESYLVGSTIRSLETATVGAGIRYNAGSGMTLRFDATVIRSRPVLSRSGIAIGVERGL
Ga0214471_1070125713300033417SoilLSRRTTGSLLGASGRESYLVGSTIRSLETTTVGAGIRYNAGSGMTLRFDANVIRSRPVLSRSGIAIGVERGL
Ga0364928_0089084_279_7163300033813SedimentMNTWEADATALVTRRATLGLGYRRWNYAVGPVDVLIPHMTAETGKTAWSLRVFVSRNPSRRTDTAASLRITRALSRRTTGSLLGAAGRESYLVGGAVRSLETVTGGAGIRYNAGSGVTLRFDVSVIRSKPVLSRSGVALGVERGL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.