NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F033202



Overview

Basic Information
Family ID: F033202
Family Type: Metagenome
Number of Sequences: 178
Average Sequence Length: 95 residues
Representative Sequence: KPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY
Number of Associated Samples: 153
Number of Associated Scaffolds: 178

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 3.95 %
% of genes near scaffold ends (potentially truncated): 91.01 %
% of genes from short scaffolds (< 2000 bps): 88.76 %
Associated GOLD sequencing projects: 141
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.39

Hidden Markov Model: [HMM logo rendered by Skylign]

Most Common Taxonomy
Group: Bacteria (98.315 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (17.977 % of family members)
Environment Ontology (ENVO): Unclassified (33.708 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (46.067 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.
[Interactive MSAViewer panel: rows 1 to 178 labeled by protein ID, ruler marked up to alignment column 154. The protein IDs are the ones listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 42.02%, β-sheet 0.00%, Coil/Unstructured 57.98%
[Feature Viewer plot of the representative sequence, axis marked from 10 to 90. Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score]

Predicted 3D Structure


Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.39

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
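The two confidence scales mentioned above can be written down as a small Python sketch. It uses only the numbers stated on this page: the pTM cutoff of 0.7 from the note and the four pLDDT legend bins. The function names are hypothetical, and the handling of non-integer pLDDT values (for example, 50.5 falling into the 51-70 bin) is an assumption, since the page lists integer bounds only.

```python
# Cutoff taken from the page's note: pTM < 0.7 is a low confidence model.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7

# pLDDT legend bins as listed on the page, stored as (upper bound, label).
# Assumption: a value between two listed integer bounds goes to the higher bin.
PLDDT_BIN_UPPER_BOUNDS = [
    (50, "0-50"),
    (70, "51-70"),
    (90, "71-90"),
    (100, "91-100"),
]


def is_low_confidence_model(ptm: float) -> bool:
    """Return True when the pTM-score is below the 0.7 cutoff."""
    return ptm < PTM_LOW_CONFIDENCE_CUTOFF


def plddt_bin(plddt: float) -> str:
    """Return the legend bin label for a per-residue pLDDT value."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0..100, got {plddt}")
    for upper, label in PLDDT_BIN_UPPER_BOUNDS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: bounds cover 0..100")


if __name__ == "__main__":
    # pTM-score reported for family F033202 on this page.
    print(is_low_confidence_model(0.39))
    print(plddt_bin(64.0))
```

With the pTM-score of 0.39 reported for this family, the first call flags the model as low confidence, which matches the note above.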



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart: distribution of family members by NCBI taxonomy level. Labels shown: All Organisms, Unclassified. Value shown: 98.3%]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart: distribution of family members across habitat types. Habitat labels shown: Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Agricultural Soil; Hardwood Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Corn Rhizosphere; Arabidopsis Rhizosphere; Switchgrass Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Rhizosphere Soil. Slice percentages shown (unlabeled): 4.5%, 18.0%, 16.9%, 4.5%, 12.9%, 6.2%, 5.1%]



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1009047021 | 3300000955 | Soil | AWMEKNLQYPTEKYLPMGGLPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAADLVGRLQQWGFAYTDRAARIQYSVDSSSRKK*
JGI1027J12803_1025443561 | 3300000955 | Soil | SDVHRALTASLLAAWMDKNLQYPVAKYLPMSGLPASAYTTPSEYRDISGGKVWEAAQQFREAGVAPELVRRLQQWGSAFNERAARLQYAGPPSSPKK*
JGIcombinedJ26739_1005916271 | 3300002245 | Forest Soil | VEVRRALTASLLAAWMEKNLQYPTEKYLPMGGLPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAADLVGRLQQWGFVYTDRAARIQYSVNSSSRKK*
JGIcombinedJ26739_1008393362 | 3300002245 | Forest Soil | SPVWAPLFQPLPIEVHRALTASMLAAWMDKNLQYPMTKHLPTGLIRRPYTQPHNYGNISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY*
JGI25382J43887_104972581 | 3300002908 | Grasslands Soil | MDKNLQYPVAKYLPMGGSPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAAELVRRLQQWGIAYADRAARLQYAGNSSSRKM*
Ga0062595_1011303451 | 3300004479 | Soil | PDNLKQGWRPDDNIDPRIMISPVWAPIFDPLPVEVHQALTTSLLAAWMDKNLQYPITKHLPTGLRRRAYTQPGTYGDITGGKVWDSAKQFREAGVPPEVVKQLQEWGIAYTDYAARIQY*
Ga0062592_1020055941 | 3300004480 | Soil | DNIKQGWRPNDNIDPRIMISPVWAPTFKPLPAEVHQALATSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0066672_106490402 | 3300005167 | Soil | SPVWEPIFKPLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIVYSDRAARLQYAGNSASSKR*
Ga0066673_100130463 | 3300005175 | Soil | MFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY*
Ga0066690_106984502 | 3300005177 | Soil | PLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0066684_100273944 | 3300005179 | Soil | MISPVWTPMFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY*
Ga0066675_111384602 | 3300005187 | Soil | PGEVHQALTTALLGAWMDKNLQYPIEKYLPMGLRSPAYAPPHNYGDISGGKVWEASQQFRNAGVPPELARRLQQWGLAFAERAARLQY*
Ga0066388_1073708862 | 3300005332 | Tropical Forest Soil | GAWMDKNLQYSIATYLPIGLRGNAYAPPASYGGISGGRVWEAARQFRDAGVPGELVQRLEQWGILFADRAARIQY*
Ga0070680_1019026461 | 3300005336 | Corn Rhizosphere | NIKQGWRPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0070710_101827631 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | ISAAWAPIFEPLPSEVHRALTTSLLAAWMDKNLQYPVEKYLPLGLPPRIYTPNSKYRDISGGKVWETEKQFREAGVAAELIFRLQQWGILFTDRAARVQYK*
Ga0070705_1000786861 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | ITKYLPMGGLPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAAELVGRLQQWGIAYTDRAARLQYGGKSSSRKM*
Ga0070708_1000481346 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | WMDKNAQYPIEKYLPMGTPRHAYAAPHSYGEITGGKVWEASQQFRDAGVAPELLRRLQQWGVTFNDRAARIQY*
Ga0066689_101614511 | 3300005447 | Soil | QYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0066682_102380392 | 3300005450 | Soil | WEPIFKPLPVELRRALTASLLSAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0066681_109222282 | 3300005451 | Soil | KPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY*
Ga0066687_100503003 | 3300005454 | Soil | MISPVWTPMFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKRLQQWGIAYTDRAARIQY*
Ga0066687_102996151 | 3300005454 | Soil | VELRRALTASLLGAWMDKNLQYPVAKYLPMGGLPARPYAPPQAYGDISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0066687_106738691 | 3300005454 | Soil | DFSQGWRPDQNIDPRIMISPVWAPLFKPLPTEVRRALTGSLLAAWMDKSLQYPMAKYLPIGLLHQRYTPPRAYGDITGGKVWEAAQQFRDAGVQAELVQRLQQWGMAFTDRAARLQY*
Ga0070681_120416621 | 3300005458 | Corn Rhizosphere | PNDNIDPRIMISPVWAPTFKALPAEVHQALTTAMLAAWMEKNLEYPITRHLPVGVPRRPYTQPGYYGDISGGNVWQSAKQFRDAGVPAALVQSLQQWGAAYADRAARIQY*
Ga0070697_10000769211 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | PVEVHQALTISLLAAWMDKNLQYPIAKHLPIGLPRRSYTQPHAYGGISGGKVWEAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY*
Ga0068853_1006103901 | 3300005539 | Corn Rhizosphere | AAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDIGGGNVWQSAKEFRDAGVPATLVKTLQRWGAAYADRAARIQY*
Ga0070695_1016831591 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | PIFKPLPIEVHRALTASLLAAWMDKNLQYPIPKYLPMGASRGAYVTPHDFGDISGGKVWESSQQFRDAGVPPDLVRRLQQWGIAFNDRAARVQY*
Ga0070696_1003401291 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | NIKQGWRPNDNIDPRIMISPVWAPTFKPLPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0066661_105845991 | 3300005554 | Soil | SPVWEPIFKPLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0066692_103398011 | 3300005555 | Soil | WRPEQNIDPRILISPVWEPIFKPLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIVYSDRAARLQYAGNSASSKR*
Ga0066704_103430902 | 3300005557 | Soil | PLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIVYSDRAARLQYAGNSASSKR*
Ga0066670_102577323 | 3300005560 | Soil | MISPVWTPMFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFREAGVAPELVKHLQQWGIAYTDRAARIQY*
Ga0066705_101504801 | 3300005569 | Soil | MFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYT
Ga0066654_100619973 | 3300005587 | Soil | MFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKGWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY*
Ga0068864_1011391021 | 3300005618 | Switchgrass Rhizosphere | KYLPMGGLPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAAELVGRLQQWGIAYTDRAARLQYGGKSSSRKM*
Ga0066903_1006671151 | 3300005764 | Tropical Forest Soil | TVHRALTASLLAAWLDKNLQYPIARYLPVSGIPLAAYTTPRSYGDISGGKVWEAAQQFREAGVSATLVARLQEWGTMYADRAARIQYGGISRH*
Ga0068858_1024135701 | 3300005842 | Switchgrass Rhizosphere | VWAPTFKALPAEVHQALTTSMLGAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0070717_110825391 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | NIDPRIMISPVWAPIFKPLPIEVHQALTASLLAAWMDKNLQYPITKHLPTGLPRRAYAQTGAYGEVSGGKVWEAARQFRDAGVAAELVQRLQQWGIAYTDRAARLQY*
Ga0066651_107390351 | 3300006031 | Soil | NLQYSITKYLPVSGVPPTSYTAPHTYGYISGGRVWEAAQQFREAGVSAELVARLQEWGIQYTDRAARIQYDGISRH*
Ga0075028_1005828321 | 3300006050 | Watersheds | PNDNIDPRIMISPVWAPMFKPLPIEVQQALTTSLLAAWMDKNLQYPITKHLPVGVPKRPYTQPSAYGDISGGKVWESASQFRDAGVAPELVKHLQQWGISYADRAARIQY*
Ga0075017_1008905231 | 3300006059 | Watersheds | FKPLPIEVHQALTASLLAAWMDKNLQYPIAQYLPLGLPPQNYTAPHKYGDVSGGKVWESARQFRDAGVSAELVQRLQRWGITFTDRAARLQYH*
Ga0075019_106049702 | 3300006086 | Watersheds | GPDDLRQGWRPDNNIDPRIMIDPAWAPIFKPLPIEVHQALTASLLAAWMDKNLQYPIAQYLPLGLPPQNYTAPHKYGDVSGGKVWESARQFRDAGVSAELVQRLQRWGITFTDRAARLQYH*
Ga0075018_102367783 | 3300006172 | Watersheds | SPVWAPIFKPLPVEVHQALTASLLAAWMEKNQQYPITKHLPTGLPRRAYAPPGTYGDISGGKVWESAKQFREAGVPAELVQRLQQWGIAYSDRAARLQY*
Ga0068871_1021043072 | 3300006358 | Miscanthus Rhizosphere | HQALTTSMLAAWMEKNLEYPITKHLPVGVPRRPYSQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0079222_102333092 | 3300006755 | Agricultural Soil | MDKNLQYPVAKYLPMGGSPARPYAPPRAYGDISGGNVWEAAQQFRDAGVAAELVRRLQQWGIAYADRAARLQYAGNSSSRKM*
Ga0066665_104420862 | 3300006796 | Soil | PIFKPLSIEVHQALTASLLSAWMDKNLQYPIAKHLPFGLPRHTYTATGAYGDISGGKVWEAAQQFRDAGIAAELVRRLEQWGIAFTDRAARLQY*
Ga0066660_112239482 | 3300006800 | Soil | AAWMDKNLQYPVAKYLPMGGSPARPYAPPQVYGDISGGNVWEAAQQFRDAGVAAELVRRLQQWGIAYADRAARLQYAGNSSSRKM*
Ga0079220_111061002 | 3300006806 | Agricultural Soil | WRPDQNADPRIMISPVWEPFFKPLPSEVHRALTASLLAAWMDKNQQYPIARYLPMGATKNAYDAPRTYGEITGGRVWEAAKQFRDAGVAPELVQRLQQWGISYHERAVRIQY*
Ga0075425_1025423002 | 3300006854 | Populus Rhizosphere | LPIELHQALTTSLLAAWMDKNLQYPITRHLPVGVPRRPYSQPSSYGDISGGKVWESASQFRDAGVAAELVKRLQQWGIAYSDRAARLQY*
Ga0075434_1013873862 | 3300006871 | Populus Rhizosphere | FKPLPIELHQALTTSLLAAWMDKNLQYPITRHLPVGVPRRPYSQPSSYGDISGGKVWESASQFRDAGVAAELVKRLQQWGIAYSDRAARLQY*
Ga0075435_1010238651 | 3300007076 | Populus Rhizosphere | MDKNLQYPITKHLPTGLPRRAYTQPGAYGDVSGGKVWEAARQFRDAGVAAELVQRLQQWGIAYSDRAARLQY*
Ga0099794_105142271 | 3300007265 | Vadose Zone Soil | AAWMDKNLQYPITKHLPVGVPKRPYSQPGSYGNISGGKVWESAREFRDAGVSADLIQHLQQWGVAYNDRAARIQY*
Ga0066710_1012128191 | 3300009012 | Grasslands Soil | HLGPDDLKQGWRPDDNIDPRIMISPVWAPLFKPLPVEVHQALTTSLLAAWMDKNLQYPVAKHLPIGLPRRSYTQPRAYGGISGGKVWDAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY
Ga0066710_1044452161 | 3300009012 | Grasslands Soil | TRKQCADLWPLQQGWRPDDNIDPRIMISPVWTPMFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPVLVKHLQQWGIAYTARAARIQY
Ga0099830_115730591 | 3300009088 | Vadose Zone Soil | HRALTASLLAAWMDKNLQYPIAKYLPMGGLPARPYIPPAAYGDISGGNAWEAAQQFRDTGVAAELVGRLQQWGIAYTDRAARIQYSGNSSSRKM*
Ga0099830_118562491 | 3300009088 | Vadose Zone Soil | TAALLAAWMDKNLQYPIEKYLPMGLRSHPYAAPHEYGDITGGKVWEASQQFRNAGVSPELVRRLQQWGIAFTDRAARLQY*
Ga0066709_1001705921 | 3300009137 | Grasslands Soil | SPVWAPIFKPLSIEVHQALTASLLSAWMDKNLQYPIAKHLPFGLPRHTYTATGAYGDISGGKVWEAAQQFRDAGIAAELVRRLEQWGIAFTDRAARLQY*
Ga0066709_1033920681 | 3300009137 | Grasslands Soil | LRRALTASLLSAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGDISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKR*
Ga0105241_113868061 | 3300009174 | Corn Rhizosphere | KALQSSDPRLGPDNIKQGWRPNDNIDPRIMISPVWAPTFKALPVEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0105242_109652561 | 3300009176 | Miscanthus Rhizosphere | PRLGPDNIKQGWRPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLEYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0105248_106720421 | 3300009177 | Switchgrass Rhizosphere | RPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0105237_107578451 | 3300009545 | Corn Rhizosphere | PNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0126370_123911161 | 3300010358 | Tropical Forest Soil | WRPDQNVDPRIMISAAWAPVFKILPIEVHQALTAALLTAWMDKNLQYPIERHLPMGVHSNAYAHALPHDYANISGGKVWEASQQFRNAGVPPELVRRLQQWGLAYADRAARIQY*
Ga0126377_102357621 | 3300010362 | Tropical Forest Soil | NLQYPVAKYLPMSGLPASAYTTPSEYRDISGGKVWEAAQQFREAGVAPELVTRLQQWGIAFNERAARLQYAGPPSSPKK*
Ga0105239_132580501 | 3300010375 | Corn Rhizosphere | TTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0134126_127982161 | 3300010396 | Terrestrial Soil | LKQGWRPDDNVDPRILISPVWEPVFKPLPAEVHRALTTSLLAAWMDKNLQYPMARHLPAGLLRRPYTQPRNYGDISGGKVWEASRQFRDAGVPPELVQRLQQWGLAYADRAARIQY*
Ga0126383_120633591 | 3300010398 | Tropical Forest Soil | NVDPRIMISPIWAPLFKPLPTTVHRALTASLLAAWLDKNLQYPIARYLPVSGIPLAAYTTPRSYGDISGGKVWEAAQQFREAGVSATLVARLQEWGTMYADRAARIQYGGISRH*
Ga0137389_110371252 | 3300012096 | Vadose Zone Soil | WRPEQNIDPRIMISPVWEPIFKPLPVELRRALTNSLLAAWMDKNLQYPTAKYLPMGGLPARPYAPPQAYGDISGGDVWEAAQQFRDAGVPAELVGRLQQWGIAYTDRAARLQYTGKSSSRKK*
Ga0137382_109123591 | 3300012200 | Vadose Zone Soil | LHRALTESLLAAWMDKNLQYPIVKHLLMGVHRQAYATPQAYGDISGGKVWEAAQQFRDVGVSPELVRRLQQWGIAFTDRAARLQY*
Ga0137363_102281631 | 3300012202 | Vadose Zone Soil | DLRRALTNSLLAAWMDKNLQYPTEKYLPMGGLPARPYAPPAAYGEISGGKVWEAAQQFRAAGVAPELVGRLQQWGIAYTDRAARIQYSDNSSSRKK*
Ga0137399_103887662 | 3300012203 | Vadose Zone Soil | AAWMDKNLQYPIEKYLPMGLRSHPYAAPHEYGDITGGKVWEASQQFRNAGVSPELVRRLQQWGIAFTDRAARLQY*
Ga0137362_100343121 | 3300012205 | Vadose Zone Soil | NLKQGWRPDDNIDPRIMISPVWAPIFKPLPVEVRQALTASLLAAWMDKNLQYPVTKHLPTGLPRRAYTQPGAYGVISGGKVWEAAKQFRDAGVSAELVQRLQQWGIAYTDRAARLQY*
Ga0137380_108351291 | 3300012206 | Vadose Zone Soil | ALTDAFLAAWMDKNLQYPIEKHLPMGLRSPAYARPHDYTDISGGKVWEASQQFRNAGVSPELVRRLQQWGVAFTDRAARIQY*
Ga0137380_114704271 | 3300012206 | Vadose Zone Soil | EVHQALTTALLGAWMDKNLQYPIEKYLPMGLRSPAYAPPHNYGDISGGKVWEASQQFRNAGVPPELARRLQQWGLAFAERAARLQY*
Ga0137377_100992194 | 3300012211 | Vadose Zone Soil | MDKNLQYPIAKYLPFGLPRHAYTPSGTYGDISGGKVWEAAQQFREAGVPAELVQRLQQQWGIAFTDRAARLQY*
Ga0137370_103416532 | 3300012285 | Vadose Zone Soil | TASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQAYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0137387_112059051 | 3300012349 | Vadose Zone Soil | RPNDNIDPRIMISPVWAPIFKPLPVEIHQALTTSLLAAWMDKNLEYPITKHLPTGLPRRAYTQPGAYGDISGGKVWEAARQFRDAGVPAELVQHLQQWGVAYSDRAARLQY*
Ga0137386_101407522 | 3300012351 | Vadose Zone Soil | VHQALTASLLSAWMDKNLQYPIAKHLPFGLPRHTYTATGAYGDISGGKVWEAAQQFRDAGIAAELVRRLEQWGIAFTDRAARLQY*
Ga0137385_109456992 | 3300012359 | Vadose Zone Soil | MDKNLQYPIAKYLPFGLPRHAYTPSGTYGDISGGKVWEAAQQFREAGVPAELVQRLQQWGIAFTDRAARLQY*
Ga0137360_103372303 | 3300012361 | Vadose Zone Soil | GPDNLKQGWRPDDNIDPRIMISPVWAPIFKPLPVEVRQALTASLLAAWMDKNLQYPVTKHLPTGLPRRAYTQPGAYGVISGGKVWEAAKQFRDAGVSAELVQRLQQWGIAYTDRAARLKY
Ga0137358_100646143 | 3300012582 | Vadose Zone Soil | MDKNLQYPIAKYLPMGGLPARPYTPPAAYGDISGGNAWEAAQQFRDAGVAAELVGRLQQWGIAYTDRAARIQYSGNSSSRKM*
Ga0137397_101532891 | 3300012685 | Vadose Zone Soil | LTNSLLAAWMDKNLQYPTEKYLPMGGLPARPYAPPAAYGDISGGKVWEAAQQFRDAGVRADLVGRLQQWGIAYTDRAARIQYSDNSSSRKK*
Ga0137419_110595601 | 3300012925 | Vadose Zone Soil | SPVWEPIFKPLPVELRRALTNSLLAAWMDKNLQYPTAKYLPMGGLPARPYAPPQAYGDISGGDVWDAAQQFRDAGVPAELVGRLQQWGIAYTDRAARLQYAGKSSSRKM*
Ga0137419_112236402 | 3300012925 | Vadose Zone Soil | IMISPVWVPIFKPLPAEVRRALTASLLAAWMDKNLQYPVARYLPLGLPPQAYSPPRIYGAISGGKVWEAAGHFRNAGVSGELVQSLQQWGIAFTDRAERLQYH*
Ga0137419_115938991 | 3300012925 | Vadose Zone Soil | DKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWEAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY*
Ga0137407_107842252 | 3300012930 | Vadose Zone Soil | ALTTSLLAAWMDKNLQYPITKHLPVGVPRRPYTQPSSYGDISGGKVWESASQFRDAGVAAELVKRLQQWGIAYSDRAARLQY*
Ga0137407_115490792 | 3300012930 | Vadose Zone Soil | AAWMDKNLQYPTEKYLPMGGFPARPYAPPAAYGEISGGNVWEAAQQFRDAGVAANLVGRLQQWGIAYTDRAARIQYNAKSSSRKM*
Ga0137410_112357131 | 3300012944 | Vadose Zone Soil | VFKPLPVELNRALTNSLLAAWMDKNLQYPTEKYLPMGGLPARPYAPPAAYGDISGGKVWEAAQQFRDAGVRADLVGRLQQWGIAYTDRAARIQYSDNSSSRKK*
Ga0137410_119165291 | 3300012944 | Vadose Zone Soil | SLLAAWMDKNLQYPITKHLPTGVPRRAYTQPGIYGDISGGKVWEAAKQFRDAGVAAELVQQLQQWGIAYSDRAARLQY*
Ga0164299_109187202 | 3300012958 | Soil | DPRIMVSPVWEPIFNPLPLELHRALTTSLLTAWMDKNLQYPIPKYLPMGGLPARPYSPPQAYGDISGGNVWEAAQQFRDAGVASELVGRLKQWGIAYTDRAARLQYGGKSSSRKM*
Ga0164301_100628614 | 3300012960 | Soil | MDKNLQYPITKYLPMGGLPARPYSPPQAYGDISGGNVWEAAQQFRDAGVASELVGRLKQWGIAYTDRAARLQYGGKSSSRKM*
Ga0164301_110942371 | 3300012960 | Soil | QGWRPNDNIDPRIMISPVWAPTFKPLPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0134087_100739582 | 3300012977 | Grasslands Soil | VWEPIFKPLPVELRRALTASLLGAWMDKNLQYPVAKYLPMGGLPARPYAPPQAYGDISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK*
Ga0157373_103543993 | 3300013100 | Corn Rhizosphere | DNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGDVWQSAKEFRDAGVPSALVKTLQQWGAAYADRAARIQY*
Ga0157371_101360394 | 3300013102 | Corn Rhizosphere | AWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0157378_106937081 | 3300013297 | Miscanthus Rhizosphere | NLQYPITKHLPVGVPKRPYTQPTAYGEISGGKVWESASQFRDAGVAPELVKRLQQWGLAYSDRAARIQY*
Ga0157378_111295631 | 3300013297 | Miscanthus Rhizosphere | PVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0163162_114477531 | 3300013306 | Switchgrass Rhizosphere | RPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY*
Ga0157372_129315492 | 3300013307 | Corn Rhizosphere | LPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRCPYTQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY*
Ga0134078_101292881 | 3300014157 | Grasslands Soil | LPGEVHQALTTALLGAWMDKNLQYPIEKYLPMGLRSPAYAPPHNYGDISGGKVWEASQQFRNAGVPPELARRLQQWGLAFAERAARLQY*
Ga0163163_124368971 | 3300014325 | Switchgrass Rhizosphere | PRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPSALVKTLQQWGAAYADRAARIQY*
Ga0137418_100801972 | 3300015241 | Vadose Zone Soil | MDKNLQYPIEKHLPMGLRSPAYALPHNFSDITGGKVWEASPQFRNAGVSPELVRRLQQWGVAFTDRAARIQY*
Ga0137409_106526691 | 3300015245 | Vadose Zone Soil | LKEGWRPDENIDPRIMISPVWAPIFKPLPVEVHQALTISLLAAWMDKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWEAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY*
Ga0132257_1024126971 | 3300015373 | Arabidopsis Rhizosphere | TASLLTAWMDKNLQYPITKYLPMGGLPTRPYSPPQAYGDISGGNVWEAAQQFRDAGVASELVERLKQWGIAYTDRAARLQYGGKSSSRKM*
Ga0182035_107397552 | 3300016341 | Soil | AWLDKNLQYPISEHLPFGLLGHAYMPARTYGDISGGRVWDSAEQFRHAGVPDDLVERLQEWGIAYNDRAARLQY
Ga0182038_120433041 | 3300016445 | Soil | IMVSPVWAPIFTPLATDVRRALAASLLAAWMDKNLQYPVSKYLPMSGLPASAYTTPSEYRDISGGKVWEASQQFREAGVAPELVWRLQQWGIAFNERAARLQYAGPPSISKK
Ga0187818_101139762 | 3300017823 | Freshwater Sediment | AWMDKNLQYSIGKYLPLGLNQQAYAPRRAYGEISGGNVWEAAKQFRDAGVPDEVVERLLDWGSTFTDRAARIHY
Ga0187818_101225291 | 3300017823 | Freshwater Sediment | KPLPIEVRHALTASLLAAWMDKNLQYPIAKYLPLGLPPQAYTSPRTYGEISGGKVWEAAQQFRDARVGVEQVQHLLEWGMAFTDRAARLQYH
Ga0187817_108662751 | 3300017955 | Freshwater Sediment | QNVDPRIMVSPDWAPIFKPLPIEIHKAITGSLLAAWMDKNLQYPISEYLPVGPPRDAYTPRRTYGDITGGKVWEAAKQFQKAGVPDEDVERLLQWGTAFTDRAARVQYR
Ga0187810_101628061 | 3300018012 | Freshwater Sediment | PRIMISPVWAPIFKPLPIEVRHALTASLLAAWMDKNLQYPIAKYLPLGLPPQAYTSPRTYGEISGGKVWEAAQQFRDARVGAEQVQHLLEWGMAFTDRAARLQYH
Ga0066655_102812302 | 3300018431 | Grasslands Soil | MISPVWTPMFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY
Ga0193747_10288621 | 3300019885 | Soil | MISPVWAPIFRPLPVEVHQALTTSLLAAWMDKNLEYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWEAAKQFRDAGVPAELVQRLQQWGNAYADRAARLQY
Ga0193729_11474452 | 3300019887 | Soil | TTSLLAAWMDKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWDAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY
Ga0193735_11267772 | 3300020006 | Soil | FKPLPVEVHQALTTSLLAAWMDKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWDAAKQFRDAGVPAELVQRLQQWGIAYADRAARLQY
Ga0179592_100010801 | 3300020199 | Vadose Zone Soil | WRPDQNVDPRIMVSPAWAPVFKPLPIEARRALTEAFLAAWMDKNLQYPIEKHLPMGLRSPAYALPHNFSDITGGKVWEASPQFRNAGVSPELVRRLQQWGVAFTDRAARIQY
Ga0210403_108600351 | 3300020580 | Soil | ALTESMLSAWMDKNLQYPMAKHLPMGVHRQAYTTPQAYEDISGGKVWESAQQFRDAGVSPDLVRRLQQWGLAFTDRAARLQY
Ga0210399_101675211 | 3300020581 | Soil | ALTASMLAAWMDKNLQYPMTKHLPTGLIRRPYIQPHNYGNISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY
Ga0210399_114586841 | 3300020581 | Soil | WAPTFKPFSVEVHQALTASLLAAWMDKNLQYPLAKYLPFGVLPHSYTPPRTYGHISGGKVWEAAQQFRSAGVDAKLVRRLQQWGMAFTDRAGRLQYERYGLGTKGQVSSRERKGT
Ga0210395_103055201 | 3300020582 | Soil | LVALRSPWLDKNLRYPIPEYLPLSAGHTYTPLNSYGDISGGKAWEAAQQFRNLGVAAELVQHLQQWGFAFTDRAARIQYGH
Ga0210401_104419232 | 3300020583 | Soil | LKQGWRPDENIDPRIMISPVWAPMFQPLPIEVHRALTASMLAAWMDKNLQYPMTKHLPTGLIRRPYTQPHNYGNISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY
Ga0210404_102148313 | 3300021088 | Soil | SLLAAWLDKNMQYPIEKFLPLGLPPRAYTPGVAYGDISGGKVWESAQQFRDATVAPDLIRRLQQWGTAYADRAARIQYN
Ga0210404_107203891 | 3300021088 | Soil | SLLAAWMDKNLQYPLTKHLPIGVPKRPYTQPSAYGDISGGKVWEAARQFRDAGVAAELVQRLQQWGIAYSDRAARLQY
Ga0210405_107255951 | 3300021171 | Soil | LPREVHRALIASLLAAWMDKNLRYSIAEYLPIGMSPRRYYTAPTSYGEVSGGKVWEAAERFRAAGVSDDLIDRLLQWGIAFTDRAERLQYH
Ga0210408_104984692 | 3300021178 | Soil | WMDKNLQYPMTKHLPTGLIRRPYIQPHNYGNISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY
Ga0210396_116340611 | 3300021180 | Soil | TPIFKPLPFEVHQALATSLLAAWMDKNLQYPIRKYLPIGAIRQGYTPSRSYGDITGGNVWGAARQFREAGVAAEVIERLEQWGVVFTDRAARVQY
Ga0210393_110971141 | 3300021401 | Soil | IEVHQALTASMLAAWMDKNLQYPMTKHLPTGLIRRPYIQPHNYGNISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY
Ga0210384_101510381 | 3300021432 | Soil | ISPAWAPLSKPLPPELHRALTESLLAAWMDKNLQYPMAKHLPMGVHRQAYAAPQGYGDITGGNVWESAKQFRDAGVSPDLVRRLQQWGLAFADRAARLQY
Ga0210384_107691132 | 3300021432 | Soil | MDKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGRVWDAAKQFRDAGVPAELVQRLQQWGIAYADRAALQY
Ga0210402_100774641 | 3300021478 | Soil | WRPDQNIDPRIMISSAWAPMFKPLPIEVHQVVTASLLAAWMDKNLQYPIAKYLPIGLSPHAYAPNRTYGDISGGKAWEAAQQFRDAGVAAEIVQRLQQWGIAFTDRAGRIRYE
Ga0210402_101211104 | 3300021478 | Soil | TELMLSGWMDKNLQYPMAKHLPMGVHRQAYTTPQAYEDISGGKVWESAQQFRDAGVSPDLVRRLQQWGLAFTDRAARLQY
Ga0210402_112521741 | 3300021478 | Soil | SLLSAWMDKNLQYPITKHLPVGVPKRPYTQPSSYGDISGGKVWESAREFRDAGVSTDLVQHLQQWGVAYNDRAARIQY
Ga0210410_102037952 | 3300021479 | Soil | NLQYPMAKHLPMGVHRQAYAAPQAYGEITGGNVWESAKQFRDAGVSPDLVRRLQQWGLAFTDRAARLQY
Ga0247695_10065541 | 3300024179 | Soil | WMDKNAQYPIEKYLPMGTPRHAYTAPHSYGEITGGKVWEASQQFRDAGVAPELVRRLQQWGATFNDRAARIQY
Ga0247665_10435762 | 3300024219 | Soil | VWAPIFQPLPTEVHRALTASLLAAWMDKNAQYPIEKYLPMGTPRHAYAAPHSYGEITGGKVWEASQQFRDAGVAPELLRRLQQWGVTFNDRAARIQY
Ga0247666_10327951 | 3300024323 | Soil | PEQNVDPRIMISPVWAPLLKPLPIEVHRALTASLLAAWMDKNAEYPIEKYLPMGAPRHAYAAPRTYGEISGGKVWEAEQQFREAGVAPELVRRLQRWGVAFNDRAARIQY
Ga0137417_12293761 | 3300024330 | Vadose Zone Soil | VHQALTAVTAALLAAWMDKNLQYPIERHLPMGLRSSAHAYALPHDYADISGGKVWEASQQFRNAGVSPELVRRLQQWGLAYTDRAARIQY
Ga0207692_101470382 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | PDDNIDPRIMISAAWAPIFEPLPSEVHRALTTSLLAAWMDKNLQYPVEKYLPLGLPPRIYTPNSKYRDISGGKVWETEKQFREAGVAAELIFRLQQWGILFTDRAARVQYK
Ga0207671_102327461 | 3300025914 | Corn Rhizosphere | PRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY
Ga0207646_105559092 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | LTASLLSAWMDKNLQYPIAKYLPFGLPRHAYTPSGTYGNISGGKVWEAAQQFREAGVPAELVQRLQQWGIAFTDRAARLQY
Ga0207679_106846582 | 3300025945 | Corn Rhizosphere | WRPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY
Ga0207639_109589821 | 3300026041 | Corn Rhizosphere | MLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPSALVKTLQQWGAAYADRAARIQY
Ga0207641_122125961 | 3300026088 | Switchgrass Rhizosphere | DNIKQGWRPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY
Ga0209761_10516874 | 3300026313 | Grasslands Soil | MISPVWEPIFKPLPIELRRALTASLLAAWMDKNLQYPVAKYLPMGGSPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAAELVRRLQQW
Ga0209268_10179254 | 3300026314 | Soil | ILISPVWEPIFKPLPVELRRALTASLLSAWMDKNLQYPVAKYLPMGGLPARPYAPPQAYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK
Ga0209155_10360342 | 3300026316 | Soil | MFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRAARIQY
Ga0209154_12979542 | 3300026317 | Soil | KNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIVYSDRAARLQYAGNSASSKR
Ga0209647_12098232 | 3300026319 | Grasslands Soil | AAVLAAWMDKNLQYPIERHLPMGLRSNPHAYALPHDYADISGGKVWEASQQFRNAGVSPELVRRLQQWGLAYTDRAARIQY
Ga0209470_10000511 | 3300026324 | Soil | LPMGGLPARPYAPPQAYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK
Ga0209375_11151731 | 3300026329 | Soil | NIDPRILVSPVWEPIFKPLPVELRRALTASLLSAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKK
Ga0209473_12656981 | 3300026330 | Soil | TASLLAAWMDKNLQYPVAKYLPMGGSPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAAELVRRLQQWGIAYADRAARLQYAGNSSSRKM
Ga0209158_11234962 | 3300026333 | Soil | LAAWMDKNLQYPIERHLPMGLRSNPHAYALPHDYADISGGKVWEASQQFRNAGVSPELVRRLQQWGLAYTDRAARIQY
Ga0257178_10532691 | 3300026446 | Soil | NLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWEAAKQFRDAGVPAELVQRLQRWGIAYADRAARLQY
Ga0209378_10860731 | 3300026528 | Soil | RALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQAYGDISGGSVWEAAQQFRDAGVPAELARRLQQWGIAYSDRAARLQYAGNSASSKR
Ga0209807_12489291 | 3300026530 | Soil | MFKPLPTEVHQALTVSLLAAWMDKNLQYPITKHLPTGLPRRAYTQPGSYGDISGGKVWEAAKQFRDAGVAPELVKHLQQWGIAYTDRA
Ga0209577_106334801 | 3300026552 | Soil | IFKPLPVELRRALTASLLAAWMDKNLQYPVAKYLPMGGLPARPYAPPQGYGEISGGSVWEAAQQFRDAGVPAELARRLQQWGIVYSDRAARLQYAGNSASSKR
Ga0179587_102815861 | 3300026557 | Vadose Zone Soil | PLPRDVHQAFTASLLAAWMDKNSQYSIAQFLPLGLPPQVYTAPSAYGEITEGKVWEAAQQFRNAGVAAELVQRLQEWGIAFTDRAGRLQYH
Ga0209076_11772511 | 3300027643 | Vadose Zone Soil | GPDDLKEGWRPDENIDPRIMISPVWAPIFKPLPVEVHQALTISLLAAWMDKNLQYPIAKHLPIGLPRRSYTQPRAYGGISGGKVWEAAKQFRDAGVPAELVQRLQRWGIAYADRAARLQY
Ga0209118_10378992 | 3300027674 | Forest Soil | RPEQNIDPRIMISPVWEPIFKPLPVELRRALTNSLLAAWMDKNLQYPTAKYLPMGGLPSRPYAPPQAYGDISGGDVWEAAQQFRDAGVPAELVGRLQQWGIAYTDRAARLQYAAKSSSRK
Ga0209488_112163821 | 3300027903 | Vadose Zone Soil | IMVSPAWAPVFKPLPIEARRALTDAFLAAWMDKNLQYPIEKHLPMGLRSPAYALPRDYTDISGGKVWEASQQFRNAGVSPELVRRLQQWGVAFTDRAARIQY
Ga0209526_103313542 | 3300028047 | Forest Soil | EKYLPMGGLPARPYAPPQAYGDISGGNVWEAAQQFRDAGVAADLVGRLQQWGFVYTDRAARIQYSVNSSSRKK
Ga0268266_114381231 | 3300028379 | Switchgrass Rhizosphere | FKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPSALVKTLQQWGAAYADRAARIQY
Ga0307469_123122301 | 3300031720 | Hardwood Forest Soil | AAWLDMIRLYPTEKYLPMGGLPARPYAPPAAYGEISGGKVWEAAQQFRAAGVAPELVGRLQQWGIAYTDRAARIQYSDNSSSRKK
Ga0307477_105652761 | 3300031753 | Hardwood Forest Soil | NLQYPMTKHLPTGLIRRPYTQPHNYGDISGGKVWEAAKQFRDAGVAPELVQRLQQWGIAYTDRAARIQY
Ga0307475_100617892 | 3300031754 | Hardwood Forest Soil | MISPVWARIFNPLPPELRRALTASLLAAWLDKNLQYSIAVYLPVPSLVNSYSPSRNYGDISGGKVWEAAQRFREAGVAAELVARLQEWGIKYIDRAARIQYDRVSRH
Ga0307475_101254824 | 3300031754 | Hardwood Forest Soil | AWLDKNLQYPIADYLPLSAGHTYVPPNSYADISGGKAWESAQQFRNAGVGAELVQRLQQWGFVFTDRAARIQYGH
Ga0307475_101599981 | 3300031754 | Hardwood Forest Soil | ALTVSLLAAWMDKNQQYSIAKYLPVVGAAARAYTPTRSYGNITGGKVWEAARQFRDSGVPDQLVARLQDWGIRYIDRAARIQYDGISRR
Ga0310917_104673352 | 3300031833 | Soil | LATDVRRALAASLLAAWMDKNLQYPVSKYLPMSGLPASAYTTPSEYRDISGGKVWEASQQFREAGVAPELVWRLQQWGIAFNERAARLQYAGPPSISKK
Ga0307479_113510492 | 3300031962 | Hardwood Forest Soil | AWLDKNLQYPIAEYLPLSAGHTYFPPNSYADISGGKAWESAQQFRNAGVGAELVQRLQQWGFVFTDRAARIQYGH
Ga0318533_108117451 | 3300032059 | Soil | QGWRPDENVDPRIMVSPVWAPIFTPLATDVRRALAASLLAAWMDKNLQYPVSKYLPMSGLPASAYTTPSEYRDISGGKVWEASQQFREAGVAPELVWRLQQWGIAFNERAARLQYAGPPSISKK
Ga0307470_113216812 | 3300032174 | Hardwood Forest Soil | VHRALTTSLLAAWMDKNLQYPVEKYLPLGLPPRIYTPNSKYRDISGGKVWETAKQFREAGVAAELIFRLQQWGILFTDRAARVQYK
Ga0307471_1001554012 | 3300032180 | Hardwood Forest Soil | MVSPVWAPIFKPLPIEVHQALTAALLAAWMDKNLQYSIEKYLPMGLRSHPYAAPHEYGDITGGKVWEAAQQFRNAGVSPEIVRRLQQWGIAFTDRAARLQY
Ga0307471_1009958501 | 3300032180 | Hardwood Forest Soil | LLGAWMDKNLQYSIPKYLPLGLSRQSYTPTRAYGDISGGQVWEAAKQFRDTGVPAELVRRLQQWGIAFTDRAARIQY
Ga0307471_1010307011 | 3300032180 | Hardwood Forest Soil | PEQNIDPRIMISPVWEPIFKPLPAELRRALTNSMLAAWMDKNLQYTIEKYLPMGGLPARAYAPPAAYGDISGGNVWEAAQQFRDAGVSPELVRRLQQWGLAYTDRAARIQYSGNTSSRKK
Ga0307471_1012643282 | 3300032180 | Hardwood Forest Soil | KPLPAEVHQALATSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNVWQSAKEFRDAGVPATLVKSLQQWGAAYADRAARIQY
Ga0335080_113708101 | 3300032828 | Soil | DPRILISPVWAPFFKPLPVEVHQALTASMLAAWMDKNLEYPITRHLPVGVPRRPYTQPGYYGDISGGNVWQSARQFREAGVPADLIQHLQQWGIAYADRAARIQY
Ga0310810_102551194 | 3300033412 | Soil | VWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY
Ga0310811_106935702 | 3300033475 | Soil | PAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY
Ga0373958_0219233_133_504 | 3300034819 | Rhizosphere Soil | PRLGPDNIKQGWRPNDNIDPRIMISPVWAPTFKALPAEVHQALTTSMLAAWMEKNLQYPITKHLPVGVPRRPYTQPGYYGDISGGNAWQSAKEFRDAGVPPALVKTLQQWGAAYADRAARIQY




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice