NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F020074

Metagenome / Metatranscriptome Family F020074

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F020074
Family Type Metagenome / Metatranscriptome
Number of Sequences 226
Average Sequence Length 88 residues
Representative Sequence MSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Number of Associated Samples 178
Number of Associated Scaffolds 226

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 85.84 %
% of genes near scaffold ends (potentially truncated) 32.74 %
% of genes from short scaffolds (< 2000 bps) 84.07 %
Associated GOLD sequencing projects 161
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (86.283 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(29.203 % of family members)
Environment Ontology (ENVO) Unclassified
(30.088 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.903 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146
1OU_02270840
2JGI11823J13286_10079132
3Draft_100015242
4JGI12635J15846_100039722
5JGI12053J15887_103525632
6JGIcombinedJ26739_1000599831
7JGIcombinedJ26739_1007574872
8C688J35102_1207370412
9Ga0062389_1006639143
10Ga0062389_1026654242
11Ga0062386_1007603442
12Ga0062595_1016622212
13Ga0066683_101947012
14Ga0066685_107065901
15Ga0066388_1002290543
16Ga0070668_1016282162
17Ga0070731_100055549
18Ga0066692_108109131
19Ga0066708_104821442
20Ga0066706_103295902
21Ga0070762_108862971
22Ga0070762_109524692
23Ga0070764_100499982
24Ga0066903_1006122823
25Ga0066652_1006690612
26Ga0075364_102961843
27Ga0075018_106729221
28Ga0070765_1000456133
29Ga0070765_1001313962
30Ga0070765_1004270602
31Ga0070765_1004572072
32Ga0070765_1008916572
33Ga0075370_103044551
34Ga0066665_111620561
35Ga0066660_105494571
36Ga0075421_1000146833
37Ga0075421_1017303502
38Ga0073928_102772482
39Ga0073928_104160571
40Ga0075419_100801952
41Ga0099795_103694352
42Ga0099795_104895961
43Ga0099795_105025471
44Ga0066710_1025287852
45Ga0099829_117110541
46Ga0099830_102459812
47Ga0099828_106310692
48Ga0099828_116564071
49Ga0099827_100040773
50Ga0075418_107374812
51Ga0075418_130296442
52Ga0066709_1005914501
53Ga0099792_107515912
54Ga0099792_108293351
55Ga0114958_105287032
56Ga0126374_104363771
57Ga0099796_102766552
58Ga0126306_106192652
59Ga0134125_122400751
60Ga0134122_125811141
61Ga0126361_107649771
62Ga0126350_104055911
63Ga0137392_100737914
64Ga0137392_111954132
65Ga0137391_100253452
66Ga0137391_105091252
67Ga0137393_109688362
68Ga0153954_100050419
69Ga0137389_107467721
70Ga0137389_108918822
71Ga0137389_113635122
72Ga0153922_11020342
73Ga0137388_100951551
74Ga0137388_101470762
75Ga0137388_104148762
76Ga0137364_100894715
77Ga0137364_105455922
78Ga0137365_102035002
79Ga0137363_105212662
80Ga0137363_117807672
81Ga0137399_110263661
82Ga0137362_101011245
83Ga0137380_105682692
84Ga0137381_105812482
85Ga0137376_108443812
86Ga0137378_113481162
87Ga0137377_105491512
88Ga0137377_116799792
89Ga0137369_110146371
90Ga0137371_102521531
91Ga0137371_105112492
92Ga0137368_109520701
93Ga0137360_108456292
94Ga0137390_102439162
95Ga0137390_116361031
96Ga0150984_1219820871
97Ga0137358_106446892
98Ga0137398_110356902
99Ga0137397_105903402
100Ga0137397_106762891
101Ga0137396_108704781
102Ga0137359_104643362
103Ga0137413_108750781
104Ga0137419_111925862
105Ga0137416_102080202
106Ga0137404_117039742
107Ga0137407_107987062
108Ga0137410_113383442
109Ga0164305_112657311
110Ga0164305_120455501
111Ga0163163_122304782
112Ga0182018_105312552
113Ga0137405_14011052
114Ga0137412_101331651
115Ga0137412_101777612
116Ga0137403_104464762
117Ga0134085_105301772
118Ga0132258_117140662
119Ga0132256_1010100422
120Ga0132255_1031402212
121Ga0163161_108790902
122Ga0190266_101248762
123Ga0190266_102136241
124Ga0184610_12076231
125Ga0184605_101710421
126Ga0184608_100219932
127Ga0184620_100511433
128Ga0184620_100547972
129Ga0184619_101009452
130Ga0184635_101178502
131Ga0184609_104449662
132Ga0184625_104246012
133Ga0066667_109594432
134Ga0066667_112565512
135Ga0066667_113818253
136Ga0190269_115443021
137Ga0066662_104944352
138Ga0190270_127526342
139Ga0066669_123802302
140Ga0193701_10599512
141Ga0210399_104114582
142Ga0210381_100926452
143Ga0210406_105631772
144Ga0210406_112590901
145Ga0210400_113625121
146Ga0210400_116754811
147Ga0210388_105104932
148Ga0210393_100590813
149Ga0210385_100808342
150Ga0210389_101143441
151Ga0210389_101479772
152Ga0210387_102527662
153Ga0210383_100951161
154Ga0210383_109838492
155Ga0210394_103526833
156Ga0210392_106172052
157Ga0210392_106679372
158Ga0210398_104498962
159Ga0210402_115615242
160Ga0207668_121315311
161Ga0257160_10743352
162Ga0208997_10081012
163Ga0209213_10633782
164Ga0209332_10033561
165Ga0209735_10390451
166Ga0209115_10219063
167Ga0209528_10209542
168Ga0209009_10856401
169Ga0209772_100346881
170Ga0209448_100564702
171Ga0209139_100475603
172Ga0209180_103650332
173Ga0209579_100015204
174Ga0209283_108667341
175Ga0209169_100225082
176Ga0209590_103231292
177Ga0209275_106610272
178Ga0209275_109228822
179Ga0209624_102129641
180Ga0209624_103851122
181Ga0209624_104627363
182Ga0209488_101021624
183Ga0209488_101621652
184Ga0209488_111449472
185Ga0209006_100574583
186Ga0209006_105446182
187Ga0209006_109614752
188Ga0209382_100723143
189Ga0209526_100758843
190Ga0137415_102686401
191Ga0137415_111190482
192Ga0307285_102489421
193Ga0307307_101457192
194Ga0307317_101046391
195Ga0307318_101368111
196Ga0307280_101007351
197Ga0307280_102837172
198Ga0307306_101760432
199Ga0307323_101031782
200Ga0307287_103366122
201Ga0307503_101431212
202Ga0307292_100174242
203Ga0307292_103401742
204Ga0307302_101199462
205Ga0307296_104765472
206Ga0307310_102091592
207Ga0307314_100976661
208Ga0307286_102603242
209Ga0307304_102600651
210Ga0247827_108296782
211Ga0308309_108443421
212Ga0265746_10445172
213Ga0138296_10206871
214Ga0073997_122061963
215Ga0073996_124088082
216Ga0265313_101423151
217Ga0307476_1000018026
218Ga0307474_108167442
219Ga0307468_1000519023
220Ga0307478_110025252
221Ga0307470_100550443
222Ga0307470_107092861
223Ga0307471_1026349012
224Ga0307472_1006321801
225Ga0247829_106396152
226Ga0370497_0136195_374_607
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 58.47%    β-sheet: 0.00%    Coil/Unstructured: 41.53%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090MSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNYSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
86.3%13.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Freshwater Lake
Bog Forest Soil
Iron-Sulfur Acid Spring
Groundwater Sediment
Watersheds
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Serpentine Soil
Grasslands Soil
Surface Soil
Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Untreated Peat Soil
Palsa
Soil
Tropical Forest Soil
Forest Soil
Soil
Arabidopsis Rhizosphere
Switchgrass Rhizosphere
Populus Endosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Avena Fatua Rhizosphere
Attine Ant Fungus Gardens
Hydrocarbon Resource Environments
Boreal Forest Soil
4.0%11.5%29.2%3.5%3.1%9.7%3.5%8.4%5.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
OU_022708402124908016VAAFALSLSTLVALEKNGTLANDELADIVEQSLATLKTIDVETSMRSEAARRFAFDLLEQLQARLARDRNSRQPDYLNY
JGI11823J13286_100791323300001164Forest SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLRARLARDRSRRQLDYLNY*
Draft_1000152423300001567Hydrocarbon Resource EnvironmentsMSIELDRPGGGVAAFMLSVGTLLALEKNGALAHDELADIVERSLAMLKATDAETSVRSQAAWGTAVDLLEQLHARLTRDRSRRQLDYLNY*
JGI12635J15846_1000397223300001593Forest SoilMSIELDRPGCGWAALMLSVSTLFALRKNGTLADYELTDIVEQSLARLKALDRGEGVRSQATWEAALYLLERLHAHLARDPNLEQVGTTLST*
JGI12053J15887_1035256323300001661Forest SoilMSIELDRPGGGVAAFMLSVSTLVALEKNGALAADELADIVEQSLARLKTIDAETSVRSQDAWRSALDLLEQLQARLARDRSRRQLDYLNY*
JGIcombinedJ26739_10005998313300002245Forest SoilLSVSTLLALKKNGTLADDELADIVEQSLARLKALDPETSVRSQEAWGAALDLLEQLPARFARDRSRKQLEATRST*
JGIcombinedJ26739_10075748723300002245Forest SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSRERVEA*
C688J35102_12073704123300002568SoilSRRKACMSIELDRPGGGVAAFMLSVGTLLALEKNGTLADDELDDIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0062389_10066391433300004092Bog Forest SoilALLLSVSTLHALKKNGTLADDELADIVEQSLSRLKVLNRGEGVRSQAAWETAHYLLEQLHAHLAREPSREQVGATRST*
Ga0062389_10266542423300004092Bog Forest SoilMSIELDRPGMGLAAFTLSVSTLLALKRNGTLADYELTDTVEQALAKLKTLDPGKGVRSQAALEAACYFLEQLHGHLARDPSPGTGGGDANTKKSATAA*
Ga0062386_10076034423300004152Bog Forest SoilMSMELDRPGRGVAAFMLSLSTLVALEKNGTLAADELADIVEQSLATLKAIDAETSVRSQAAWGSAIDLLEQLRARLARDRSRRQPEATRSA*
Ga0062595_10166222123300004479SoilMSIELDRPGGGVAAFALSLSTLVALEKNGTLANDELADIVEQSLATLKTIDVETSMRSEAARRFAFDLLEQLQARLARDRNSRQPDYLNY*
Ga0066683_1019470123300005172SoilMLIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDYLNY*
Ga0066685_1070659013300005180SoilGGGVAAFVLSLSTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDYLNY*
Ga0066388_10022905433300005332Tropical Forest SoilMSIELDRPGGGVAAFMLSLGTLLALEKNGTLADDELTDIVEQSLLRLKTVDAETSVRSQAAWVSALDLLEQLHTRLARDRIRRQLDYLNY*
Ga0070668_10162821623300005347Switchgrass RhizosphereMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRS
Ga0070731_1000555493300005538Surface SoilMSTELDRPGSGVAALLLSVSMLHALKKNGTLADCELADIVEQSLARLKALSRGEGVRSQAAWETALYLLEQLQAHLGRETSRDQVGA*
Ga0066692_1081091313300005555SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLQARLARDRSSRQLDYLNY*
Ga0066708_1048214423300005576SoilMSIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDYLNY*
Ga0066706_1032959023300005598SoilMSIELDRSGGGVAAFVLSLSTLLALEKNGTLADDELAGIVEQSLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARER
Ga0070762_1088629713300005602SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDRGEGVRSQATWEAALLLLEQLHAHLARDPSLEQLGVTRST*
Ga0070762_1095246923300005602SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADDELAEIVEQSLARLKALNRGEGLGSQAAWETALYLLDQLHAHLARGLSREH
Ga0070764_1004999823300005712SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSREQVEA*
Ga0066903_10061228233300005764Tropical Forest SoilMSIDLDRPGSGVAAFMLSVSTLLALGNNGTLANDELAEVVEQSLAKLKTIDSEPSLQGQAAWEKALDLLEQLHARLARDRSRRELDYLNY*
Ga0066652_10066906123300006046SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEESLARLKAIDAETSLQSRAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0075364_1029618433300006051Populus EndosphereMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAARRFALDLLDQLQARLARDRSSRQLDYPNY*
Ga0075018_1067292213300006172WatershedsMSIELDRSGGGVAAFALSFSTLLALEKNGTLADDELADIVEQSLARLKAIDAETSMRSQAAWGSALDLLEQLHARLARDRRRRQHDYLNY*
Ga0070765_10004561333300006176SoilMSIELDRAGSGVAALLLSVSTLHALKKNGMLADYELADIVEQSMARLKALHRGEGVRGQAAWETALHLLEQLHAHLARDPSREQVGA*
Ga0070765_10013139623300006176SoilMSIQFDRPGGGVAAFMLSFNMLLAFKKNGTLADDELADIVEQSLAELKAIDAEASVQSQAARGSALNLLERLRARLACDRGRDSSII*
Ga0070765_10042706023300006176SoilMLIEVHRPGSGVAALLLTVSTLHSLKKNGTLADYELADIVEQSLAKIKTLSRGESVRSQDAWEAALYLLEQLQAHLGRDTSREQVGA*
Ga0070765_10045720723300006176SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYYLLEQLHAHLARDPSREQVGA*
Ga0070765_10089165723300006176SoilMSNELDRSGSGVAALLLSVSTLHALKKNGALADHELADIVEQSLARLKALNRGEGVRGQAAWETALYLLEQLHAHLARDPSRQQVEA*
Ga0075370_1030445513300006353Populus EndosphereMAAFALSLSTLVALEKNGTLANDELADIVEQSLATLKTIDVETSMRSEAARRFAFDLLEQLQARLARDRNSRQ
Ga0066665_1116205613300006796SoilMSIELDRPGRGVAAFMLSLSTLLALKKNGTLADYELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLQARLARNRSRRQLDYLNY*
Ga0066660_1054945713300006800SoilVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKTIDAETSVRSQDAWGSAVDLLEQLHARLARDRSREQLEATRSA*
Ga0075421_10001468333300006845Populus RhizosphereMSIELDRPGGGVAAFMLSVSMLVALEKNGTLAHDELADIVERSLATLNSMDPETSVRSQAAWGSAVDLLEQLRARLARDRSRRQLDYLNY*
Ga0075421_10173035023300006845Populus RhizosphereMSIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSHGAWKTALDLLEQLHARLARDRSRKQFDYLNY*
Ga0073928_1027724823300006893Iron-Sulfur Acid SpringMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELTDIVEQSLARLKALDRGEGVRSQATWEAALYLLEQLHAHLARDPSLEHVGATRST*
Ga0073928_1041605713300006893Iron-Sulfur Acid SpringMSIELDRPGGGMAAFTLSVATLLALKKNGTIADHELADIVEQSLAGLKASDPGEGVRSQAAWKAALYLLEQL
Ga0075419_1008019523300006969Populus RhizosphereMSIELDRPGGGVAAFMLSVSMLVALEKNGTLAHDELADIVERSLATLNSMDPETSVRSQAAWGSAVDLLEQLRARLACDRSRRQLDYLNY*
Ga0099795_1036943523300007788Vadose Zone SoilCMSIELDRPGGGVAAFMLSVSTLVALEKNGTLAADELADIVEQSLARLKTIDAETSVRSQDAWRSALDLLEQLQARLARDRSRKELDYLNY*
Ga0099795_1048959613300007788Vadose Zone SoilMSIELDRPGGGVAAFVLSLGTLLALEKNGTLADDELAGIVEESLARLKAIDAETSVQSQAAWGSAIDLLEQLRARLA
Ga0099795_1050254713300007788Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVERSLATLKTMDTETSVRSQAAWGSAIDLLEQLRARLARDRSRRQLDYLNY*
Ga0066710_10252878523300009012Grasslands SoilGGVAAFVLSLSTLLALEKNGTLADDELAGIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDYLNY
Ga0099829_1171105413300009038Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVEQSLAGLKTIDAETSVPSQAAWRSALDLLEQLRARLARDRNRRQPEATRSA*
Ga0099830_1024598123300009088Vadose Zone SoilMSIELDRPGGGVAAFMLSFSTLLALEENGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0099828_1063106923300009089Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVERSLATLKTMDTETSVRSQAAWGSAIGLLEQLHARLARDRNPRQPEATRSA*
Ga0099828_1165640713300009089Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAPDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLN
Ga0099827_1000407733300009090Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDALADIVEQSLARLKAIDAETSVRSQAAWGSALDLLEQLHARLARDRRRRQLDYLNY*
Ga0075418_1073748123300009100Populus RhizosphereMSIELDRPGGGVAAFMLSVGMLLALEKNGALAHDELADIVEWSLATLKTMDTEMSVRSQAAWGSAVDLLEQLRARLARDRS
Ga0075418_1302964423300009100Populus RhizosphereMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQPQARLARDRSRKEVDYLNY*
Ga0066709_10059145013300009137Grasslands SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHPVSLAIGAGDSSII*
Ga0099792_1075159123300009143Vadose Zone SoilMSIELDRPGSGVAAFVLSLGTLLALEKNGTLADDELADIVEESLTRLKAIDAETSLQSQAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0099792_1082933513300009143Vadose Zone SoilMPIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVEQSLARLKAIDAETSVRSQAAWGSAIDLLEQLRARLARDRSRRQLDYLNY*
Ga0114958_1052870323300009684Freshwater LakeMSIELDRPGCGWAALMLSVSTLFALKKNGTLADHELTDIVEQSLARLKALDKGEGVRSQATWEAALYLLEQLHAHLARDPSREQVGA*
Ga0126374_1043637713300009792Tropical Forest SoilMSIELDRPGGGVAAFMLSLGTLLALEKNGTLADDELTDIVEQSLLRLKTVDAETSVRSQAAWVSALDLLEQLHTRLARDRIRRQ
Ga0099796_1027665523300010159Vadose Zone SoilMSIQFDRPGGGVAAFMLSFSMLLALRKNGTLVDDELADIVEQSLAELKAIDAEASVQSRAAWGSALNLLEVLRARFARDQSRGQLDYLNY*
Ga0126306_1061926523300010166Serpentine SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVEQSLTGLKSIDAETSVPSQAAWRSALDLLEQLRARL
Ga0134125_1224007513300010371Terrestrial SoilMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQDAWGAAVDLLEQLQARLARDRSSRQLDYLNY*
Ga0134122_1258111413300010400Terrestrial SoilRKACMSIELDRPGGGVAAFALSLSTLVALEKNGTLANDELADIVEQSLATLKTIDVETSMRSEAARRFAFDLLEQLQARLARDRNSRQPDYLNY*
Ga0126361_1076497713300010876Boreal Forest SoilVITPEELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVEKSLAGLKALDPEMSVPSQDAWRSAVDLLEQLRARLAR
Ga0126350_1040559113300010880Boreal Forest SoilMSIELDRPGSGMAALVLCVSMLHALKKNGTLADHELADIVEQSLARLKALNRGEGVRSQAAWETALYLLEQLHAHLACDPSREQVGA*
Ga0137392_1007379143300011269Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVERSLATLKTMDTETSVRSQAAWGSAIDLLEQVRARLARDRSRRQPEATRSA*
Ga0137392_1119541323300011269Vadose Zone SoilMSIELDRPGRGVAAFMLSLSTLFALEKNGALTHDELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLQARLARNRSRRQLDYLNY*
Ga0137391_1002534523300011270Vadose Zone SoilMSIQFDRPGGGVAAFMLSFSMLLALRKNGTLVDDELADIVEQSLAELKAIDAEASVQSQAAWGSALNLLEVLRARFARDQSRGQLDYLNY*
Ga0137391_1050912523300011270Vadose Zone SoilMPIELDRPGGGVAAFMLSVGTLLALEKNGTLTHDELADIVEQSLARLKALDPETSVRSQEAWGAALDLLEQLRARLARDRSRKQLEATRST*
Ga0137393_1096883623300011271Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEESLARLKAIDAETSLQSQAAWGSALDLLEQLHVRLARERSRKELDYL
Ga0153954_1000504193300011418Attine Ant Fungus GardensMSIELDRPGHGVAAFMLNLGTLLALEKNGMLTGDELLDIVQQSVAKLKAIDAEPSLRSQAVRGRALDLLEKLYARLPEVLSSQRVK*
Ga0137389_1074677213300012096Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVERSLATLKTMDPETSVRSQAAWGSAIDLLEQVRARLARDRSRRQPEATRSA*
Ga0137389_1089188223300012096Vadose Zone SoilMSIQFDRPGGGVAAFMLSFSMLLTLRKNGTLVDDELADIVEQSLAELKAIDAEASVQSRAAWGSALNLLEVLRARFARDQSRGQLDYLNY*
Ga0137389_1136351223300012096Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0153922_110203423300012181Attine Ant Fungus GardensMLIELDRPGSGVAALLLTVSTLHSLKKNGTLADYELADIVEQSLARLKALSRGEGVRSQAAWETALYLLEHLQAHLGRNTSREEAGA*
Ga0137388_1009515513300012189Vadose Zone SoilMSIELDRPGGGVAAFMLSVDTLLALEKNGTLAHDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0137388_1014707623300012189Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVEQSLAGLKTIDAETSVRSQAAWRSALDLLEQLRARLARDRNRKELDYLNY*
Ga0137388_1041487623300012189Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVERSLATLKTMDPETSVRSQAAWGSAIGLLEQLHARLARDRNPRQPEATRSA*
Ga0137364_1008947153300012198Vadose Zone SoilMSIELDRSGGGVAAFMLSLSTLLALEKNGTLADDELADIVEASLARLKAIDAKTSVQSQAAWRSALDLLEQLHARLARERSRKELDYLNY*
Ga0137364_1054559223300012198Vadose Zone SoilMSIELDRPGRGVAAFMLSLSTLFALEKNGALAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHPVSLAIGAGDSSII*
Ga0137365_1020350023300012201Vadose Zone SoilMSIELDRPGGGVAAFVVSLSTLLALEKNGTLADGELADIVEQSLATLKTLDAEASVRSQAAWGSALDLLEQLHARLARDRSRRQPDYLNY*
Ga0137363_1052126623300012202Vadose Zone SoilVAAFMLSLSTLLALQKNGALTHDELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLQARLARNRSRRQLDYLNY*
Ga0137363_1178076723300012202Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELSDIAEASLARLKAIDAETSLQSQAAWGSALDLLEQLHARLARERSRKQLDYL
Ga0137399_1102636613300012203Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEQSLARLKAIDAETSLQSQAAWGSALDLLEQLHARLARERSRKQLDYLNY*
Ga0137362_1010112453300012205Vadose Zone SoilMSIELDRPGRGVAAFMLSISTLLALEKNGTLVHDELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLQARLARNRSRRQLDYLNY*
Ga0137380_1056826923300012206Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRKELDYLNY*
Ga0137381_1058124823300012207Vadose Zone SoilMSIELDRSGGGVAAFVLSLSTLLALEKNGTLADDELADIVEASLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0137376_1084438123300012208Vadose Zone SoilELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHPVSLAIGAGDSSII*
Ga0137378_1134811623300012210Vadose Zone SoilVAAFMPSVGTLLALDKNGTPANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0137377_1054915123300012211Vadose Zone SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAAQRFALDLLEQL
Ga0137377_1167997923300012211Vadose Zone SoilDRPGRGVAAFMLSLSTLFALEKNGALAADELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLQARLARNRSRRQPEATRSA*
Ga0137369_1101463713300012355Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLAHDELADIVERSLATLKAIDPETSVRSQAAWGAAVDLLEQLHARLARDRSR
Ga0137371_1025215313300012356Vadose Zone SoilIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEASLARLKAIDAKTSVQSQAAWRSALDLLEQLHARLARERSRKELDYLNY*
Ga0137371_1051124923300012356Vadose Zone SoilMSIELDRPGRGVAAFMLSLSTLFALEKNGALAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLQARLARDRSSRQLDYLNY*
Ga0137368_1095207013300012358Vadose Zone SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAAQRFALDLLEQLQARLARDRSSRQLDYLNY*
Ga0137360_1084562923300012361Vadose Zone SoilMSIELDRPGGGVAAFILSISTLLALEKNGTLADDELSDIVEKSLARLKAIDAETSVQSRAAWGAALDLLEQLHARLARERSRKELDYLNY*
Ga0137390_1024391623300012363Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAPDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0137390_1163610313300012363Vadose Zone SoilMSIQFDRPGGGVAAFMLSFSMLLTLRKNGTLVDDELADIVEQSLAELKAIDAEASVQSQAAWGSALNLLEVLRARFARDQSRGQLDYLNY*
Ga0150984_12198208713300012469Avena Fatua RhizosphereMSIALDRPGGGVAAFALSLSTLLALEKNGTLVHDELADIVEQSLAGLKTIDAETSVPSQAAWRSALDLLEQLRARL
Ga0137358_1064468923300012582Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELSDIVEKSLARLKAIDAETSVQSRAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0137398_1103569023300012683Vadose Zone SoilGGGVAAFMLSVGTLLALEKNGTRAHDELADIVERSLATLKTMDTETSVRSQAAWGSAIDLLEQLRARLARDRSRRQLDYLNY*
Ga0137397_1059034023300012685Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLANNELAEIVEQSVARLKAIDPETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY*
Ga0137397_1067628913300012685Vadose Zone SoilMSIELDRPGGGVAAFILSVGTLVALEKNGTLAHDELADIVEQSLARLKAIDAETSVPSQAAWGAAVDLLEQLHARLARDRSRRQLDYLN
Ga0137396_1087047813300012918Vadose Zone SoilMSIELDRPGGGVAAFMLSLSTLVALEKNGTLAADELADIVEQSLATLKTMDPETSVRSQAAWGATVDLLEQLHARLARDRSRRQRD
Ga0137359_1046433623300012923Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELTDIVMESLATLKAIDAETSVRSQAAWGAAVDLLKQLHARLACDRSRRQLDYLNY*
Ga0137413_1087507813300012924Vadose Zone SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAARRFALDLLDQLQARLARDRSSRQLDYLNY*
Ga0137419_1119258623300012925Vadose Zone SoilVSIELDRPGGGVAAFALSLSTLLALEKNGTLADDELSDIVGKSLARLKAIDAETSLQSRAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0137416_1020802023300012927Vadose Zone SoilMSIELNRPGRGVAAFVLSLSTLLALEKNGTLAEDELAGIVEESLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARDRSRRQLDYLNY*
Ga0137404_1170397423300012929Vadose Zone SoilMSIQFDLPGGGVAAFMLSFSMLLALRKNGTLVDDELADIVEQSLAELKAIDAEASVQSQAAWGSALKLLELLRARFARDRSRGQLDYLNY*
Ga0137407_1079870623300012930Vadose Zone SoilMSIELDRPGGGVAAFMLGVGTLLALEKNGTLAHDELADIVERSLATLKTMDTETSVRSQAAWGSAIDLLEQLRTRLARDRSRRQLDYLNY*
Ga0137410_1133834423300012944Vadose Zone SoilMALVVLDRAAQLASDLRSAARPVIGPEAYMSIELDRPGGGVAAFILSISTLLALEKNGALAADELADVVEQSLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0164305_1126573113300012989SoilRMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIAEESLARLKSIDAETSVRSQAAWGAAIDLLEQLHARLARDRSRRQLDYLNY*
Ga0164305_1204555013300012989SoilMSIELDRSGGGLAAFVLSFSTLLALEKNGTLANDELAEIVEQSLAKLMAIEAEPSVQSQAAWKSALDLLEQLRARIAR
Ga0163163_1223047823300014325Switchgrass RhizosphereGSGVAAFMLSLGTLLALEKKGTLADDELVDIVEQSLTRLKTIHAETSLRSQGAWKTALDLLEQLQARLARDRSRKQFDYVNY*
Ga0182018_1053125523300014489PalsaMSIELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHLARDPSREQVG
Ga0137405_140110523300015053Vadose Zone SoilMSIELDRPGGGVAAFMLSLSTLLALEKNGTLADDELSDIVEKSLARLKAIDAETSLQSRAAWGSALDLLEQLHARLSRERSRKELDYLNYELSLRKAVPTS*
Ga0137412_1013316513300015242Vadose Zone SoilAARPVIGPEAYMSIELDRPGGGVAAFVLSLGTLLALEKNGTLADDELAGIVEQSLARLKAIDAETSLQSQAVWGSALDLLEQLHARLARERSRKELDYLNY*
Ga0137412_1017776123300015242Vadose Zone SoilMSIELDRPGSGVAAFMLSLGTLLALEKNGVLAGDQLVDIVQQSLAKLKAIDAEPSLQGQAARGSALDLLEQLHARLARDRDGKSSTT*
Ga0137403_1044647623300015264Vadose Zone SoilMSIELDRPGGGVAAFMLSFSMLLALRKNGTLVDDELADIVEQSLAELKAIDAEASVQSQAAWGSALKLLELLRARFARDRSRGQLDYLNY*
Ga0134085_1053017723300015359Grasslands SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRII*
Ga0132258_1171406623300015371Arabidopsis RhizosphereMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEQSLARLKAIDAETSVQSRAAWGSALDLLEQLHARLARDRSRRQFDYLNY*
Ga0132256_10101004223300015372Arabidopsis RhizosphereMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIAEESLARLKAIDAETSVRTQAAWGAAIDLLEQLHARLARDRSRRQLDYLNY*
Ga0132255_10314022123300015374Arabidopsis RhizosphereMSIELDRPGRGVAAFMLSLSTLFALEKNGALTHDELADIVEQSLARLKAIDAEPSVGSQAAWGAAIDLLEQLHARLARDRNLRQPEATRSA*
Ga0163161_1087909023300017792Switchgrass RhizosphereMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Ga0190266_1012487623300017965SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLADDELADIVEESLARLKTMDAETSVQSQAAWRSAIDLLEQLRARLARDRSRRQLDYLNY
Ga0190266_1021362413300017965SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAARRFALDLLEQLQARLARDRSSRQLDYLNY
Ga0184610_120762313300017997Groundwater SedimentMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDGAPWLSTRLCR
Ga0184605_1017104213300018027Groundwater SedimentMSIELDRPGGGVAAFMLSFGTLLALEKNGTLANDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Ga0184608_1002199323300018028Groundwater SedimentMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNY
Ga0184620_1005114333300018051Groundwater SedimentASMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNY
Ga0184620_1005479723300018051Groundwater SedimentMSIELDRPGGGVAAFMLSVGTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLNARLARDRSRRQLDYLNY
Ga0184619_1010094523300018061Groundwater SedimentMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAARRFALDLLEQLQARLACDRSSRQLDYLNY
Ga0184635_1011785023300018072Groundwater SedimentMSIELDRPGGGVAAFMLSVGTLLALEKNGALADDEAAEIVEQTLTRLKTMDAETSVQSQAAWRSAIDLLEQLRARLARDRSRRQLDYLNY
Ga0184609_1044496623300018076Groundwater SedimentMSIELDRPGGGVGAFRLSISTLVALEKNGTLAHDELADIVERSLATLKTMDTETSVRSQAAWGAAVDLLEQLNARLARDRSRRQLDYLNY
Ga0184625_1042460123300018081Groundwater SedimentMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDHLNY
Ga0066667_1095944323300018433Grasslands SoilMSIELDRPGGGVAALMLSVGTLLALEKNGTLSNDQVADIVEQSLARLKTIDAETGVRSQAVWSSALDLLEQLRARLAYERSRKQL
Ga0066667_1125655123300018433Grasslands SoilMSIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDYLNY
Ga0066667_1138182533300018433Grasslands SoilMSIALDRLGGGVAAFVLSLSTLLALEKNGTLADDELAGIVEQSLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARDRS
Ga0190269_1154430213300018465SoilGVAALMLSVGTLLALEKNGTLAHDELADIVERSLATLKTLDTETSVRSQAAWGSAIDLLEQLRARLARDQSRRQHDYLNY
Ga0066662_1049443523300018468Grasslands SoilMSIELDRPGGGVAALMLSVGTLLALEKNGTLSNDQVADIVEQSLARLKTIDAETGVRSQAVWSSALDLLEQLRARLAYERSRKQLDYLNY
Ga0190270_1275263423300018469SoilMSIELDRPGGGVAAFMLSVSMLVALEKNGTLADDELAEIVEQTLTRLKTMDAETSVQSQAAWGSAIDLLEQVRARLARDRSRRQLDYLNY
Ga0066669_1238023023300018482Grasslands SoilMSIELDRPGSGVAAFMLSLGTLLALEKNGTLADDELADIVEQSLARLKAIHAETSLRSQGAWKTALDLLEQLHARLARDRRRKEFDY
Ga0193701_105995123300019875SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYL
Ga0210399_1041145823300020581SoilMSIQFDRPGGGVAAFMLSFNMLLAFKKNGTLADDELADIVEQSLAELKAIDAEASVQSQAARGSALNLLERLRARLACDRGRDSSII
Ga0210381_1009264523300021078Groundwater SedimentMSIELDRPGGGVAAFMLSVGTLVALEKNGTLAADELADIVEQSLARLKAIDAETSARSQAAWGAAVDLLEQLHARLARERSRRQLDYLNY
Ga0210406_1056317723300021168SoilMSIELDRSGGGVAAFVLSFSTLLALEKNGTLADDELAEIVEQSLAKLKAIDAEPSVQSQAVWGSALDLLGQLRARLARDRR
Ga0210406_1125909013300021168SoilMSIQFDRPGGGVAVFMLSFSMLLALKKNGTLVDDELADIVEQSLAELKAIDAEASVQSQAAWGSALNLLQLLRARFARDRSRGHLDYLNY
Ga0210400_1136251213300021170SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDRGEGVRSQATWEAALLLLEQLHAHLARDPSLEQLGVTRST
Ga0210400_1167548113300021170SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLASEPSR
Ga0210388_1051049323300021181SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSREQVEA
Ga0210393_1005908133300021401SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLAGEPSREQVGATRSTQSSQSAGRVEGQG
Ga0210385_1008083423300021402SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLASEPSREQVGATRSTQSSQSAGRVEGQG
Ga0210389_1011434413300021404SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDKGEGVRSQATWEAALFLLEQLHAHLARDPSLEQLGVTRST
Ga0210389_1014797723300021404SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLASEPSREQVGATRSTQSSQSAGRVEGQG
Ga0210387_1025276623300021405SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLAGEPSRE
Ga0210383_1009511613300021407SoilIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLAGEPSREQVGATRSTQSSQSAGRVEGQG
Ga0210383_1098384923300021407SoilMSIELDRAGSGVAALLLSVSTLHALKKNGMLADYELADIVEQSMARLKALHRGEGVRGQAAWETALHLLEQLHAHLARDPSREQVGA
Ga0210394_1035268333300021420SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDRGEGVRSQATWEAALLLLEQLHTHLARDPSPEQLGVTRST
Ga0210392_1061720523300021475SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSREQVEA
Ga0210392_1066793723300021475SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDKGEGVRSQATWEAALYLLEQLHAHLARDPSLEQLGVTRST
Ga0210398_1044989623300021477SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLAGEPSREQVGATRSTQSSQSAGRVEGQG
Ga0210402_1156152423300021478SoilMSIQFDRPGGGVAAFMLSFNMLLAFKKNGTLADDELADIVEQSLAELKAIDAEASVQSQAARGSALILLERLRARLASDRGPDSSII
Ga0207668_1213153113300025972Switchgrass RhizosphereMSIELDRPGGGVAAFMLSVGTLLALEKNGTLADDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLQARLARDRNSRQPD
Ga0257160_107433523300026489SoilMSIELDRPGRGVAAFVLSLSTLLALEKNGTLAADELADIVEQSLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARDRSRKELDYLNY
Ga0208997_100810123300027181Forest SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLRARLARDRSRRQLDYLNY
Ga0209213_106337823300027383Forest SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLANDELADIVEESLVRLKAIDAETSVRSQAAWGAAVDLLEQLRARLARDRSRRQLDYLNY
Ga0209332_100335613300027439Forest SoilMSIELDRPGCGWAALMLSVSTLFALRKNGTLADYELTDIVEQSLARLKALDRGEGVRSQATWEAALYLLERLHAHLARDPNLEQVGTTLST
Ga0209735_103904513300027562Forest SoilMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELTDIVEQSLARLKALDKGEGVRSQATQEAALYLLERLHAHLARDPNLEQVGPTPSTGSSRSAARVAGQG
Ga0209115_102190633300027567Forest SoilMSNELDRSGSGVAALLLSVSTLHALKKNGALADHELADIVEQSLARLKALNRGEGVRGQAAWETALYLLEQLHAHLARDPSRQQVEA
Ga0209528_102095423300027610Forest SoilMSIELDRPGRGVAAFMLSVSTLLALKKNGTLADDELADIVEQSLARLKALDPETSVRSQEAWGAALDLLEQLPARFARDRSRKQLEATRST
Ga0209009_108564013300027667Forest SoilMSIELDRPGCGWAALMLSVSTLFALRKNGTLADYELTDIVEQSLARLKALDRGEGVRSQATWEAALYLLERLHAHLARDPNL
Ga0209772_1003468813300027768Bog Forest SoilMSIEIDRPGSGVAAFTLSVSTLLALKKNGTLADYELTDIVEQSLARLKALDRGEGVRSQAAREAALYLLEQLHAHLARDPSLEHVGATRST
Ga0209448_1005647023300027783Bog Forest SoilMSIELDRPGSGVAALSLSVSTLHALKMNGMHADYELADIVEQSLARLKALDQGESLRSQDAWQTALYLLEQLHAHLARDPSRERVGTTRST
Ga0209139_1004756033300027795Bog Forest SoilMSIELDRPGSGVAAFTLSLSTLLALKKNGTLADYELADIVEQSLARLKALDRGEGVRSQATWEAALYLLEQLHAHLARDPSLEHVGATRST
Ga0209180_1036503323300027846Vadose Zone SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLVHDELADIVEQSLAGLKTIDAETSVPSQAAWRSALDLLEQLRARLARDRNRRQPEATRSA
Ga0209579_1000152043300027869Surface SoilMSTELDRPGSGVAALLLSVSMLHALKKNGTLADCELADIVEQSLARLKALSRGEGVRSQAAWETALYLLEQLQAHLGRETSRDQVGA
Ga0209283_1086673413300027875Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLN
Ga0209169_1002250823300027879SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYYLLEQLHAHLARDPSREQVGA
Ga0209590_1032312923300027882Vadose Zone SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Ga0209275_1066102723300027884SoilMSMELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALTRGEGVRSQAAWETAYHLLEQLHAHLASEPSREQVGATRSTQ
Ga0209275_1092288223300027884SoilMSIELDRPGCGWAALMLSVSTLFTLKKNGTLADYELTEIVEQSLARLKALDKGEGVRSQATWEAALYLLEQ
Ga0209624_1021296413300027895Forest SoilMSMELDRPGTGVAAFTLSVSTLLALKKNGTLADYELADIVEQSLARLKALDQGESLRSQDAWEAALYLLEQLHAHLARDLSREQVGATRST
Ga0209624_1038511223300027895Forest SoilMSMELDRPGSGVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFAR
Ga0209624_1046273633300027895Forest SoilELDRPGSGVAALLLSVSTLHALRKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSREQVEA
Ga0209488_1010216243300027903Vadose Zone SoilCMSIELDRPGGGVAAFMLSVSTLVALEKNGTLAGDELAGIVEESLARLKAIDAETSLQSRAAWGSALDLLEQLHARLARERSRKELDYLNY
Ga0209488_1016216523300027903Vadose Zone SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEESLTRLKAIDAETSLQSRAAWGSALDLLEQLHARLARERSRKQLDYLNY
Ga0209488_1114494723300027903Vadose Zone SoilPGGGVAGFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSQAARRFALDLLEQLQARLARDRSSRQLDYLNY
Ga0209006_1005745833300027908Forest SoilMLIELDRPGSGVAALLLSVSTLHTLKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWETAYHLLEQLHAHLASEPSRKQVGATRSTQSSQSAGRVEGQG
Ga0209006_1054461823300027908Forest SoilLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSRERVEA
Ga0209006_1096147523300027908Forest SoilMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELTDIVEQSLARLRALDPGEGVRSQATWEAALHLLEQLHAHLARDPSLEHVGATRST
Ga0209382_1007231433300027909Populus RhizosphereMSIELDRPGGGVAAFMLSVSMLVALEKNGTLAHDELADIVERSLATLNSMDPETSVRSQAAWGSAVDLLEQLRARLACDRSRRQLDYLNY
Ga0209526_1007588433300028047Forest SoilMSIELDRSGCGVAAFVLSFSTLLALEKNGTLANDELAEIVEQSLAKLKAIDAEPSVQSQAAWKSALDLLEQLRARIARDRN
Ga0137415_1026864013300028536Vadose Zone SoilMSIELNRPGRGVAAFVLSLSTLLALEKNGTLAEDELAGIVEESLARLKAIDAETSVQSQAAWGSALDLLEQLHARLARDRSRRQLDYLNY
Ga0137415_1111904823300028536Vadose Zone SoilMSIELDRPGGGVAAFMLSVSTLVALEKNGTLAADELADIVEQSLARLKTIDAETSVRSQDAWRSALDFLGQLQARL
Ga0307285_1024894213300028712SoilRRMACMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVGLLEQLHARLARDRSRRQLDYLNY
Ga0307307_1014571923300028718SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNFELRPSE
Ga0307317_1010463913300028720SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLAADELADIVEQSLARLKSIDAETSVRSQAAWGAAVGLLEQLHARLARDRSRRQLDYLNY
Ga0307318_1013681113300028744SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDY
Ga0307280_1010073513300028768SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEQSLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Ga0307280_1028371723300028768SoilMSIELDRPGGGVAAFMLSFSTLIALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNY
Ga0307306_1017604323300028782SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNF
Ga0307323_1010317823300028787SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARD
Ga0307287_1033661223300028796SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDLSSR
Ga0307503_1014312123300028802SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEESLARLKAIDAETSLQSRAAWGSALDLLEQLHARLARERSRKELDYLNY
Ga0307292_1001742423300028811SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYPNY
Ga0307292_1034017423300028811SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLAADELADIVEQSLAAIDAETSVRSQAAWGAAVDLLEQLH
Ga0307302_1011994623300028814SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLHARLARDRSRRQLDYLNY
Ga0307296_1047654723300028819SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLQARLARDRSSRQLDYLNY
Ga0307310_1020915923300028824SoilMSIELDRPGGGVAAFMLSVGTLVALEKNGTLAADELADIVEQSLARLKAIDAETSARSQAAWGAAVDLLEQLHAR
Ga0307314_1009766613300028872SoilVAAFMLSVGTLVALEKNGTLAADELADIVEQSLARLKSIDAETSVRSQAAWGAAVGLLEQLHARLARDRSRRQLDYLNY
Ga0307286_1026032423300028876SoilSSMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRSSRQLDYLNF
Ga0307304_1026006513300028885SoilRKARMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWGAAVDLLEQLQARLARDRSSRQLDYLNY
Ga0247827_1082967823300028889SoilMSIELDRPGGGVAAFVLSVGTLVALEKNGTLAAGELADIVEGSLARLKAIDAETSVRSQAAWGAAIDLLEQLHARLARDRSRRQLDYLNY
Ga0308309_1084434213300028906SoilMLIEVHRPGSGVAALLLTVSTLHSLKKNGTLADYELADIVEQSLAKIKTLSRGESVRSQDAWEAALYLLEQLQAHLGRDTSREQVGA
Ga0265746_104451723300030815SoilVAALLLSVSTLHALKKNGTLADYELADIVEQSLARLKALNRGEGVRSQAAWESALYLLEQLHAHFARDPSREQVEA
Ga0138296_102068713300030923SoilMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELTDIVEQSLARLKALDKGEGVRSQATWEAALYLLEQLHAHLARDPSLEQVGVTRST
Ga0073997_1220619633300030997SoilMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELTDIVEQSLARLKALDKGEGVRSQATWEAALYLLERLHAHLARDPSREQVGATPSTGSSRSAGRVAGQG
Ga0073996_1240880823300030998SoilMSIELDRPGGGVAAFMLSVSTLVALQKNGTLAADELADIVEQSLARLKTIDAATSVRSQAAWGSAVDLLGQLHARLARDRTR
Ga0265313_1014231513300031595RhizosphereMSIELDRPGCGWAALMLSVSTLFALKKNGTIADYELTEIVEQSLARLKALDRGEGVRSQATWEAALHLLEQLHAHLARDPSLEQLGVTRST
Ga0307476_10000180263300031715Hardwood Forest SoilMLIELDRPGSGVAALLLTVSTLHSFKKNGTLADYELADIVEQSLARLKALDRGEGVRSQDAWEAALYLLEQLQAHLGRDTSREQVGA
Ga0307474_1081674423300031718Hardwood Forest SoilMSIELDRPGCGWAALMLSVSTLFALKKNGTLADYELNDIVEQSLARLKALDKGEGVRSQATWEAALYLLEQLHAHLARDPSLEQVRVTRST
Ga0307468_10005190233300031740Hardwood Forest SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIVEESLARLKAIDAETSVRSQAAWRAAVDLLEQLQARLARDRSSR
Ga0307478_1100252523300031823Hardwood Forest SoilLDRPGSGVAALLLTVSTLHSLKKNGTLADCELADIVEQSLARLKALSRGEGVRSQDAWEAALYLLEQLQAHLGRDTSREQVGA
Ga0307470_1005504433300032174Hardwood Forest SoilMSIELDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIAEESLARLKAIDAETSVRSQAAWGAAIDLLEQLHARLARDRSRRQLDYLNY
Ga0307470_1070928613300032174Hardwood Forest SoilMSIELDRPGGGVAAFMLSISTLVALEKNGTLAADELADIVEQSLATLKTIDVETSVRSQAARRFALDLLDQLQARLARDRSSRQLDYLNY
Ga0307471_10263490123300032180Hardwood Forest SoilMSIELDRPGGGVAAFVLSLSTLLALEKNGTLADDELADIVEQSLARLKAIDAETSMRSQAAWGSALDLLEQLHARLARDRR
Ga0307472_10063218013300032205Hardwood Forest SoilIEVDRPGGGVAAFMLSVGTLLALEKNGTLANDELADIAEESLARLKAIDAETSVRSQAAWGAAIDLLEQLHARLARDRSQRQLDYLNY
Ga0247829_1063961523300033550SoilMSIELDRPGGGVAAFMLSLSTLLALKKNGTLADDELADIVEQSLATLKTIDVETSVRSGAARRFALDLLEQLQARLARDRRSRQLDYLND
Ga0370497_0136195_374_6073300034965Untreated Peat SoilMSIELDRPGRRVAAFMLSGGTLLALEKNSTLAHHELADIVEQSLSGLKTIDAMSLQSQDAWRSALYLLEQPRALSRAI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.