NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F021749

Metagenome Family F021749

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F021749
Family Type Metagenome
Number of Sequences 217
Average Sequence Length 97 residues
Representative Sequence PGSVPFTDSVAAYSVALSPGTYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVAHFDSLRPATDFVTCTAQ
Number of Associated Samples 156
Number of Associated Scaffolds 217

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.46 %
% of genes near scaffold ends (potentially truncated) 98.62 %
% of genes from short scaffolds (< 2000 bps) 85.71 %
Associated GOLD sequencing projects 146
AlphaFold2 3D model prediction Yes
3D model pTM-score0.30

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.511 % of family members)
Environment Ontology (ENVO) Unclassified
(41.014 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(48.387 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142
1JGI25383J37093_100357071
2JGI25383J37093_100733331
3JGI25384J37096_101064062
4JGI25384J37096_101355332
5JGI25382J37095_102252962
6JGI25382J43887_101597131
7JGI25382J43887_103566322
8JGI25382J43887_104205311
9JGI25386J43895_101869581
10soilH2_101845772
11Ga0063356_1013666271
12Ga0066674_103876771
13Ga0066683_104945241
14Ga0066676_101079091
15Ga0068869_1004741922
16Ga0070680_1019708052
17Ga0070689_1013440422
18Ga0070687_1007986221
19Ga0070701_102699451
20Ga0070705_1000397783
21Ga0070705_1018937912
22Ga0066681_102095852
23Ga0066687_101528032
24Ga0070706_1016212361
25Ga0070707_1005194881
26Ga0070699_1000252241
27Ga0070695_1005466271
28Ga0070695_1009787981
29Ga0070696_1000340551
30Ga0070696_1003034031
31Ga0066695_100243314
32Ga0066695_104428732
33Ga0066692_106309701
34Ga0066707_101893492
35Ga0066698_101233271
36Ga0066699_103155871
37Ga0066693_103630031
38Ga0066705_103637982
39Ga0066691_100744811
40Ga0070702_1000442002
41Ga0075277_10010233
42Ga0066651_102033361
43Ga0066656_100736761
44Ga0066656_101654582
45Ga0066656_101703492
46Ga0066652_1002975362
47Ga0075417_107134871
48Ga0066653_100575351
49Ga0066659_100418263
50Ga0066659_103055582
51Ga0075421_1000737361
52Ga0075421_1015683002
53Ga0075421_1026691682
54Ga0075431_1006602222
55Ga0075431_1013874421
56Ga0075431_1016345051
57Ga0075433_101361152
58Ga0075433_103171362
59Ga0075420_1013277152
60Ga0075425_1001264033
61Ga0075434_1012731302
62Ga0075429_1002599381
63Ga0075426_104087481
64Ga0075424_1000096449
65Ga0075436_1008870492
66Ga0075436_1009539361
67Ga0099791_106162742
68Ga0099795_102253191
69Ga0066710_1022331691
70Ga0066710_1022341601
71Ga0066710_1037029302
72Ga0099830_104223352
73Ga0075418_101567771
74Ga0066709_1000792245
75Ga0066709_1008610061
76Ga0099792_102296641
77Ga0075423_120965471
78Ga0134070_100731672
79Ga0134070_104029281
80Ga0134088_102297701
81Ga0134088_103765852
82Ga0134109_101559591
83Ga0134109_102729692
84Ga0134086_102070581
85Ga0134064_100845832
86Ga0134065_100438222
87Ga0134065_101408972
88Ga0134111_101648851
89Ga0134111_105251991
90Ga0134080_101665872
91Ga0134080_102197831
92Ga0134080_105871312
93Ga0134080_106282892
94Ga0134071_105138472
95Ga0134071_105251272
96Ga0134128_131552892
97Ga0134127_100135552
98Ga0134127_118229672
99Ga0134122_113855242
100Ga0137313_10028441
101Ga0137429_10079011
102Ga0137421_10270912
103Ga0137461_11259702
104Ga0137389_100302755
105Ga0137383_111222832
106Ga0137382_102692982
107Ga0137365_105512271
108Ga0137399_101087981
109Ga0137399_106169241
110Ga0137380_108132501
111Ga0137376_100156111
112Ga0137376_113162942
113Ga0137377_103594312
114Ga0137377_103772151
115Ga0137459_10827291
116Ga0137370_103524322
117Ga0137370_105564812
118Ga0137386_101199391
119Ga0137386_101772541
120Ga0137386_106545831
121Ga0137384_105065952
122Ga0137368_104650622
123Ga0137375_100585595
124Ga0137375_100590605
125Ga0137375_102156572
126Ga0137361_110141861
127Ga0137396_108955212
128Ga0137396_112918952
129Ga0137359_111821391
130Ga0137413_101646292
131Ga0137419_103608421
132Ga0137419_117763852
133Ga0137410_102421772
134Ga0137410_118430272
135Ga0137410_118430292
136Ga0126375_117043111
137Ga0134077_100835281
138Ga0134077_101474961
139Ga0134077_103238252
140Ga0134076_100771852
141Ga0134076_104108442
142Ga0134076_104285441
143Ga0134078_100471412
144Ga0134078_103106962
145Ga0180087_10039301
146Ga0180066_10974971
147Ga0134069_11871511
148Ga0134069_13518542
149Ga0134112_102846732
150Ga0134112_104860752
151Ga0134074_10273881
152Ga0134083_100639272
153Ga0134083_102778451
154Ga0184610_10841072
155Ga0184621_102789262
156Ga0184637_100353881
157Ga0184635_103178981
158Ga0184640_100622332
159Ga0184640_101789802
160Ga0184612_100712782
161Ga0066667_100614642
162Ga0066667_116975481
163Ga0066662_124812772
164Ga0066669_115673142
165Ga0179594_102452681
166Ga0210382_101451642
167Ga0210382_105152921
168Ga0210379_103298521
169Ga0210379_104627762
170Ga0222621_10250761
171Ga0137417_11741951
172Ga0137417_13992323
173Ga0209342_110349232
174Ga0207684_104700711
175Ga0207646_102583501
176Ga0209438_10167153
177Ga0209237_11308944
178Ga0209236_10620342
179Ga0209236_10775621
180Ga0209238_12114432
181Ga0209761_10727892
182Ga0209471_12758112
183Ga0209131_10494662
184Ga0209470_12467281
185Ga0209801_10751652
186Ga0209473_13309902
187Ga0209803_11704812
188Ga0209158_11982031
189Ga0209058_10235171
190Ga0209157_10480222
191Ga0209161_103775071
192Ga0209814_102772921
193Ga0209382_106772411
194Ga0209382_106798261
195Ga0209382_109800151
196Ga0209885_10037712
197Ga0209853_11625472
198Ga0307313_101670212
199Ga0307317_103251961
200Ga0307320_102145611
201Ga0307282_105393642
202Ga0307290_100335922
203Ga0307302_101575982
204Ga0307278_102582051
205Ga0307308_100717741
206Ga0307308_101400512
207Ga0307308_103250052
208Ga0307498_103317822
209Ga0307469_100941331
210Ga0310900_107656182
211Ga0326597_104561382
212Ga0307470_110946172
213Ga0307471_1033439882
214Ga0214471_103849352
215Ga0364928_0008495_1497_1859
216Ga0364928_0032932_1_339
217Ga0364931_0215280_3_377
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 5.69%    β-sheet: 13.82%    Coil/Unstructured: 80.49%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090PGSVPFTDSVAAYSVALSPGTYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVAHFDSLRPATDFVTCTAQSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.30
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains




 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Groundwater Sediment
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Sugarcane Root And Bulk Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Soil
Rice Paddy Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Soil
Groundwater Sand
Sediment
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Populus Rhizosphere
3.2%3.2%5.5%17.5%15.2%14.3%11.5%6.0%10.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1003570713300002560Grasslands SoilAAAYSMTLDPGTYHWVVAVWKKVGMLHLSPADTALLRVAGYYRNPADTSQPGVVTVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ*
JGI25383J37093_1007333313300002560Grasslands SoilRPVIPGSVPFSDSIAPYSVPLSPGAYQWVLAVWKKPGTLTLTPADTQYLRVAGYYRDPVDSTQPGVVTVPSGGGAAPGDIDFVVNFDSLRPATDFVTCTAQ*
JGI25384J37096_1010640623300002561Grasslands SoilLSPGAYXWVLAVWXKPGXLTLTPAXTQXLRVAGYYRDPVDSTQPGVVTVPSGGGAAPGDIDFVVNFDSLRPATDFVTCTAQ*
JGI25384J37096_1013553323300002561Grasslands SoilPGPLTLTPADTQYLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVAHFDSLRPATDFVTCTAQ*
JGI25382J37095_1022529623300002562Grasslands SoilDLINNRRPVIPGSVPFTDSIAAYSVALSPGSYEWVLAVWKKPGPLTLTPADTQYLRVAGYYRSPTDSTRPGVVTVPSAAAAGDINFVVNFDSLRPATDFVTCTAQ*
JGI25382J43887_1015971313300002908Grasslands SoilANTENVFIAAYPSFPQTCTDLINNRRPVIPGSVPFTDSIAAYSVALSPGSYEWVLAVWKKPGPLTLTPADTQYLRVAGYYRSPTDSTRPGVVTVPSAAAAGDINFVVNFDSLRPATDFVTCTAQ*
JGI25382J43887_1035663223300002908Grasslands SoilPANTENVFIAAYLTFPQTCTDLIANRQPLIPGSVPYTDSLALYSVPLSPGTYQWVLAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVNFVVDFDRLRPATDFVTCTGP*
JGI25382J43887_1042053113300002908Grasslands SoilWVVAVWKKVGMLHLSPADTALLRVAGYYRNPADTSQPGVVTVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ*
JGI25386J43895_1018695813300002912Grasslands SoilAPYSVPLSAATYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGSVTVPNGASTGDVDFLVDFDRLRPATDFVTCTAQ*
soilH2_1018457723300003324Sugarcane Root And Bulk SoilLRFQGTVPDSTDNVFIAAYASFPTTCQELIDNRRPFLPGSVPYTDSVAAYSVPLDPGPYQWVLAVWKKIGTLTLSPADTALLRVAGYYRDAVDTTLAGVITVPSGGSIGDVDFIVDFDHLRPATDFVSCQ*
Ga0063356_10136662713300004463Arabidopsis Thaliana RhizosphereLPPGSYQWVLAVWKKVGSLTLSPADTALLRVAGYYRNAVDTTTAGIVTVPAGASAGDVDFAVDFDNLRPATDFVTCTL*
Ga0066674_1038767713300005166SoilPLPAARYQWVLAVWKKLGALTLSARDTALLRVAGYYRDPADSTQPGAVTVTNGAAVGDVDLKVNFDSLRPATDFVTCTAQ*
Ga0066683_1049452413300005172SoilRQPVIPGSVPYTDSLTLYSVPLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSTLPGNVTVPPGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0066676_1010790913300005186SoilVWKKLGALTISPQDTALLRVAGYYRNPADSTQRGVVSVSNGPAAGDIDFVVDFDSLRPATDFVTCTAQ*
Ga0068869_10047419223300005334Miscanthus RhizosphereINNRRPVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCPP*
Ga0070680_10197080523300005336Corn RhizosphereFPQTCNDLIFNRQPFIPSSVPYADSVSLYSIELLPDTYEWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADTTQPGPVTVPNGSVADSVDFKVDFDNLRPATDFVTCTLR*
Ga0070689_10134404223300005340Switchgrass RhizosphereSTDNVFIAAYVNFPQTCTDLINNRRPVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCPP*
Ga0070687_10079862213300005343Switchgrass RhizosphereTCTDLINNRRPVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCTL*
Ga0070701_1026994513300005438Corn, Switchgrass And Miscanthus RhizospherePGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCTL*
Ga0070705_10003977833300005440Corn, Switchgrass And Miscanthus RhizosphereAYGVTLPPRTYAWVLAVWKKVGQLTLTPADTALLRVAGYYRNPADTTQPGSVIVPNGGAAANINFRVEFDSLRPATDFVSCTAQ*
Ga0070705_10189379123300005440Corn, Switchgrass And Miscanthus RhizosphereDNVFVAAYASFPQTCNDLIFNRQPFIPSSVPYSDSVSLYSIELLPDTYEWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADTTQPGTVTVPDGSVADSVDFKVDFDNLRPATDFVTCTLR*
Ga0066681_1020958523300005451SoilELPPDTYHWVVAVWKKQGSLTLTPADTALLRVAGYYRDPADTTQPGVVTVSNGPAPGDINFVVDFDNLRPATDFVSCTAQ*
Ga0066687_1015280323300005454SoilLDPGTYHWIVAVWKKVGMLQLSAADTALLRVAGYYRNPADSTQPGVVTVPQGAAAGDVDFVANFDSLRPATDFVRCTAQ*
Ga0070706_10162123613300005467Corn, Switchgrass And Miscanthus RhizosphereVLAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQPGSVIVPNGGAAANINFRVEFDSLRPATDFVSCTAQ*
Ga0070707_10051948813300005468Corn, Switchgrass And Miscanthus RhizosphereNRQPPIPGSVPFTDSLALYSVPLSSGTYHWILAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVDFVVDFDRLRPATDFVTCTGP*
Ga0070699_10002522413300005518Corn, Switchgrass And Miscanthus RhizospherePDSTDNVFVAAYASFPQTCNDLIFNRQPFIPSSVPYTDSVSLYSIELLPDTYEWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADTTQPGTVTVPNGSVADSVDFKVDFDNLRPATDFVTCTLR*
Ga0070695_10054662713300005545Corn, Switchgrass And Miscanthus RhizosphereELTPGPYHWVLAVWKKTGNLTLTAADTALLRVAGYYRDPADSTQVGMVTVPNGAAAGDIDFRVNFDSLRPATDFVTCTAQ*
Ga0070695_10097879813300005545Corn, Switchgrass And Miscanthus RhizospherePDSTDNVFVAAYASFPQTCSDLIFNRQPFIPSSVPYADSVTLYSMAVLPGHYEWVLAVWKKLGQLTLSARDTALLRVAGYYRSPADSTLPGAVTVPSGGVADSVDFKVNFDSLRPATDFVTCP*
Ga0070696_10003405513300005546Corn, Switchgrass And Miscanthus RhizosphereTCSDLIFNRQPFIPSSVPYADSVTLYSMAVLPGHYEWVLAVWKKLGQLTLSARDTALLRVAGYYRSPADSTLPGAVTVPSGGVADSVDFKVNFDSLRSATDFVTCP*
Ga0070696_10030340313300005546Corn, Switchgrass And Miscanthus RhizospherePVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCTL*
Ga0066695_1002433143300005553SoilEWVLAVWKKPGTLTLTPADTQYLRVAGYYRSPADSTQRGSVAVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0066695_1044287323300005553SoilAAAYSVALDPATYRWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPADTTQPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAQ*
Ga0066692_1063097013300005555SoilGTYHWVVAVWKKVGTLHLSAVDTALLRVAGYYRNPADTSQPGVVTVLTGAAAGDVNFVANLDSLRPATDFVTCIAQ*
Ga0066707_1018934923300005556SoilYHWVVAVWKKVGTLHLSAVDTALLRVAGYYRNPADTSQPGVVTVLTGAAAGDVNFVANLDSLRPATDFVTCIAQ*
Ga0066698_1012332713300005558SoilHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP*
Ga0066699_1031558713300005561SoilVAVWKKVGMLQLSAADTALLRVAGYYRNPADSTQPGVVTVPQGAAAGDVDFVANFDSLRPATDFVRCTAQ*
Ga0066693_1036300313300005566SoilPPDTYEWVLAVWKKVGNLTLSANDTTLLRVAGYYRNPADTTQPGSVTVPTGSVTDSVDFKIDFDNLRPATDFVSCTVR*
Ga0066705_1036379823300005569SoilYLTFPQTCTDLIANRQPLIPGSVPYTDSLALYSVPLSPGTYQWVLAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVNFVVDFDRLRPATDFVTCTGP*
Ga0066691_1007448113300005586SoilRRPVIPGSVPYTDSAAAYSMTLDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPADTSQPGIVTVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ*
Ga0070702_10004420023300005615Corn, Switchgrass And Miscanthus RhizosphereRPVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCTL*
Ga0075277_100102333300005895Rice Paddy SoilVPYTDSIADYSVELPPDTYHWVLAVWKKIGTLTLTPADTALLRVAGYYRSASDSTQPGIVTVTSGPAAGDIDFRVEFDSLRPAIDFVTCTAR*
Ga0066651_1020333613300006031SoilYTDSAAAYSVELPPDTYHWVVAVWKKVGSLTLTAQDTVLLRVGGYYRDPADTTRPGIVTVPNGPAPGDIDILVDFDNLRPATDFVTCTAQ*
Ga0066656_1007367613300006034SoilSVTYTDSAAAYSVALDPATYHWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPADTTQPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAQ*
Ga0066656_1016545823300006034SoilSIAPYSVSLSPGAYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGVVTVPSGAATGDIDFVVNFDSLRPATDFVTCTAR*
Ga0066656_1017034923300006034SoilPYTDSAAAYSMALDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPTDTSQPGVVTVPTGAAAGDVNFVAHLDSLRPATDFVTCTLQ*
Ga0066652_10029753623300006046SoilLIPGSVPYTDSVADYSVELPPDTYHWVVAVWKKQGSLTLTPADTALLRVAGYYRDPADTTQPGVVTVSNGPAPGDINFVVDFDNLRPATDFVSCTAQ*
Ga0075417_1071348713300006049Populus RhizosphereSVPFNDSLSLYSVALSPGTYHWVLAVWKKPGALTLTPADTQFLRVGGYYRSPADPAQPGTVTVPNGAAAGDVDFVVNFDSLRPATDFVTCTGP*
Ga0066653_1005753513300006791SoilGTIPANTENVFIAAYVAFPLTCNELIANRQPLIPGSVPYTDSMTLYSVPLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSTLPGNVTVPPGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0066659_1004182633300006797SoilDPGAYHWVLAVWKKPGTLTLSPADTQYLRVAGYYRDPADSTQRGVATVPSGSAVSNIDFVVNFDSLRPATDFVTCTAQ*
Ga0066659_1030555823300006797SoilIHFRGSVPDSTDNVFVAAYLPFPQTCNALINHRRPVIPASVPYTDSAAAYSMALDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPADTSQPGVVTVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ*
Ga0075421_10007373613300006845Populus RhizosphereSTDNVFVATYANFPQTCNDLIFNRQPFIPGSVPYTDSLVLYSVAVPPDRYEWVLAVWKKIGTLTLTPQDTTLLRVAGYYRDPADSTLPGAITVPSGGATADVDFRVNFDSLRPATDFVTCTAR*
Ga0075421_10156830023300006845Populus RhizosphereSTDNVFVATYANFPQTCNDLIFNRQPFIPGSVPYTDSLVLYSVAVPADRYEWVLAVWKKIGTLTLTPQDTTLLRVAGYYRDPADSTLPGAVTVPSGGATADVDFRVNFDSLRPATDFVSCTAR*
Ga0075421_10266916823300006845Populus RhizospherePYADSVTLYSVTLLADQYEWVLAVWKKVGNLTLTAQDTVLLRVAGYYRDPADTTQPGAVTVPMGGAADSVDFKVDFDNMRPATDFVTCTLR*
Ga0075431_10066022223300006847Populus RhizosphereIPGSVSYTDSVADYSVALQPRTYAWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADSTQPGVVTVPDGSAAADIDFRVEFDSLRPATDFVTCTAR*
Ga0075431_10138744213300006847Populus RhizosphereVPADRYEWVLAVWKKIGTLTLTPQDTTLLRVAGYYRDPADSTLPGAITVPSGGATADVDFRVNFDSLRPATDFVTCTAR*
Ga0075431_10163450513300006847Populus RhizosphereVSYTDSVADYSVALQPRTYAWVLAVWKKTGTLTLTPADTALLRVAGYYRNPADSTLPGVVTVPDGSAAANIDFRVEFDSLRPATDFVTCTAR*
Ga0075433_1013611523300006852Populus RhizosphereGRLQFRGTIPDSTDNIFIAAYLTFPQTCTDLILNRRPVRPGSVPFASASAAYGAPLDPGDYHWVLAVWKKRGQLTLSAADTTLLRVAGYYRNPADTTQPGVVTVPSGARADSINFVVNFDNLRPATDFVTCTAQ*
Ga0075433_1031713623300006852Populus RhizosphereYSVALSPGTYHWVLAVWKKPGALTLTPADTQFLRVGGYYRSPADPAQPGTVTVPSGAAAGDVDFVVNFDSLRPATDFVTCTGP*
Ga0075420_10132771523300006853Populus RhizosphereNVFVAAYASFPLTCTELIANRQPFIPGSVSYTDSVADYSVALQPRTYAWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADSTQPGVVTVPDGSAASDIDFRVEFDSLRPATDFVTCTAR
Ga0075425_10012640333300006854Populus RhizospherePANTDNVFVAAYANFPQTCNDLIFNRQPFIPSAVPYTDSVAVYGIPLPAGHYDWVLAVWKKVGTLTLTAADTALLRVAGYYRDPTDSTQPGAVTVPSGGAIGGVDFKVNFDSLRPATDFVTCP*
Ga0075434_10127313023300006871Populus RhizosphereLAVWKKPGALTLTPADTQFLRVGGYYRSPADPAQPGTVTVPNGAAAGDVDFVVNFDSLRPATDFVTCTGP*
Ga0075429_10025993813300006880Populus RhizosphereAYTSFPQTCTQLILNRRPFIPSSVPYTDSVSLYSIELLPDTYEWVLAVWKKVGTLTLTANDTTLLRVAGYFRSPADSTQPGSVTVPNGGVADSVDFRVNFDSLRPATDFVTCTLR*
Ga0075426_1040874813300006903Populus RhizosphereSVADYSVALAPDTYHWVVAVWKKVGSLTLTANDTTLLRVAGYYRDPADTTQPGVVTVPNGSAPGDINFLVDFDKLRPATDFVTCTAQ*
Ga0075424_10000964493300006904Populus RhizosphereVAAYATFPQTCNDLIANRRPLIPSSVPYTDSVADYSVALAPDTYHWVVAVWKKVGSLTLTANDTTLLRVAGYYRDPADTTQPGVVTVPNGSAPGDINFLVDFDKLRPATDFVTCTAQ*
Ga0075436_10088704923300006914Populus RhizosphereTFPQTCNDLIQNRRPVIPGSVPYTDSIAGYSVELTPGPYHWVLAVWKKTGNLTLTAADTALLRVAGYYRDPADSTQPGIVTVPNGAAAGDIDFRVNFDSLRPATDFVTCTAQ*
Ga0075436_10095393613300006914Populus RhizosphereLIANRRPLIPSSVPYTDSVADYSVALAPDTYHWVVAVWKKVGSLTLTANDTTLLRVAGYYRDPADTTQPGVVTVPNGSAPGDINFLVDFDKLRPATDFVTCTAQ*
Ga0099791_1061627423300007255Vadose Zone SoilVALPPRTYAWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADSTQPGSVTVPNGGAAANINFRVEFDSLRPATDFVTCSAQ*
Ga0099795_1022531913300007788Vadose Zone SoilPGPYEWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPIDSLQRGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCVP*
Ga0066710_10223316913300009012Grasslands SoilYTDSAAAYSVPLSSGTYQWVLAVWKKVGTLTLTPSDTALLRVAGYYRSPVDSTQPGVVTVPSGAAAGDINFVANFDSLRPATDFVTCTAQ
Ga0066710_10223416013300009012Grasslands SoilYTDSAAAYSVPLSSGTYQWVLAVWKKVGTLTLTPSDTALLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVANFDSLRPATDFVTCTAQ
Ga0066710_10370293023300009012Grasslands SoilFFPGSVPYTDSVAAYSVGLLPGRYEWVVAVWKKVGTLTATAADTALLRVAGYYRNPGDTSQAGVVAVANGAAAGNIDIVIEFDKLRPATDFVTCGP
Ga0099830_1042233523300009088Vadose Zone SoilAYLTFPQTCTDLIGNRQPVIPGSVPFTDSLALYSVPLSSGTYHWVLAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVDFVVDFDRLRPATDFVTCTGP*
Ga0075418_1015677713300009100Populus RhizosphereIAAYATFPQSCTDLINNRRPLIPGSVPYTDSVADYSVPLDPGTYHWLVAVWKKTGTLTLSPADTALLRVAGYYRNPADTTQPGVVTVPNGAAAGGLDFTADFDNLRPATDFVTCS*
Ga0066709_10007922453300009137Grasslands SoilINNRRPFIPGSVPYTDSAAAYSVPLSSGTYQWVLAVWKKVGTLTLTPSDTALLRVAGYYRSPVDSTQPGVVTVPSGAAAGDINFVANFDSLRPATDFVTCTAQ*
Ga0066709_10086100613300009137Grasslands SoilPLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSTLPGNVTVPPGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0099792_1022966413300009143Vadose Zone SoilVWKKPGTLTLTPADTQYLRVAGYYRSPIDSTLPGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0075423_1209654713300009162Populus RhizosphereKPFIPGSVPYTGPSAVYSVPLDPGQYRWVLAVWKKQGTLTLSASDTALLRVAGYYRNPADTTQPGVVTVPPGAVADSINFGVDFGNLRPATDFVTCVAQ*
Ga0134070_1007316723300010301Grasslands SoilAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQAGVVTVPRGGAAGSIDFVADFDHLRLATDFVACTAR*
Ga0134070_1040292813300010301Grasslands SoilAYSVPLDPGAYHWVLAVWKKPGTLTLSPADTQYLRVAGYYRDPADSTQRGVATVPSGSAVSNIDFVVNFDSLRPATDFVTCTAQ*
Ga0134088_1022977013300010304Grasslands SoilDNVFVAAYATFPQTCNELIANRQPVIPGSVPYTDSAAAYSVVLPPRAYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQAGVVTVPSGGAAGSIDFVADFDHLRLATDFVTCTAR*
Ga0134088_1037658523300010304Grasslands SoilPGTLTLTPADTQYLRVAGYYRNPADSTLPGRVTVPPGASVGDVDFVAAFDHLRPATDFVTCTAP*
Ga0134109_1015595913300010320Grasslands SoilQGTIPDNTDDVFIAAYATFPQTCTDLIANRRPLIPGSVPYTDSVADYSVELPPDTYHWVVAVWKKQGSLTLTPADTALLRVAGYYRDPADTTQSGVVTVSNGPAPGDINFVVDFDNLRPATDFVSCTAQ*
Ga0134109_1027296923300010320Grasslands SoilAVWKKPGTLTLTPADTQYLRVAGYYRSPADSTQRGSVAVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0134086_1020705813300010323Grasslands SoilVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPPGASAGDVDFVVDFDNLRPATDFVTCTGP*
Ga0134064_1008458323300010325Grasslands SoilPGSVPYTDSVADYSVELPPDTYHWVVAVWKKQGSLTLTPADTALLRVAGYYRDPADTTQPGVVTVSNGPAPGDINFVVDFDNLRPATDFVSCTAQ*
Ga0134065_1004382223300010326Grasslands SoilFPQSCADLINNRRPLIPGSVPYTDSAAAYSVELLPDTYHWVVAVWKKVGSLTLTAQDTVLLRVGGYYRDPADTTRPGAGPFGTVTMPGRVDILVDFDNLRPATDFVTCTAQ*
Ga0134065_1014089723300010326Grasslands SoilLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSMLPGNVTVPPGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0134111_1016488513300010329Grasslands SoilDSTDNVFVAAYATFPQTCNDLIANRQPVIPGSVPYTDSAAAYSVVLPPRAYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTRAGVVTVPSGGAAGSIDFVADFDHLRLATDFVTCTAR*
Ga0134111_1052519913300010329Grasslands SoilINNRRPVIPGSVPYTDSAAAYSMALDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPTDTSQPGVVTVPTGAAAGDVNFVAHLDSLRPATDFVTCTLQ*
Ga0134080_1016658723300010333Grasslands SoilFRGTIPDSTDNVFVAAYATFPQTCNDLIANRQPVIPGSVPYTDSAAAYSVVLPPRPYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQAGVVTVPRGGAAGSIDFVADFDHLRLATDFVACTAR*
Ga0134080_1021978313300010333Grasslands SoilVPYTDSAAAYSMALDPGTYHWVVAVWKKVGALHVSPADTALLRVAGYYRNPTDTSQPGVVTVPTGAAAGDVNFVAHLDSLRPATDFVTCTLQ*
Ga0134080_1058713123300010333Grasslands SoilYSIPLPTGHYDWVLAVWKKVGNLTLTAADTALLRVAGYYRDPADSTQRGAVTVPSGGAAGGVDFKVNFDSLRPATDFVTCTVR*
Ga0134080_1062828923300010333Grasslands SoilNRQPFIPSSVPYTVSATTYGVPLQPGAYQWLLAVWKKVGNLTLSNADTALLRVAGYYRNPADTTQPGVVTVPAGAFANNVDFVVDFDRLEPATDFVMCTAQ*
Ga0134071_1051384723300010336Grasslands SoilENVFIAAYLAVPQTCTELINNRRPVIPGSVPFTDSLAAYSVPLDPGAYHWVLAVWKKPGTLTLSPADTQYLRVAGYYRDPADSTQRGVATVPSGSAVSNIDFVVNFDSLRPATDFVTCTAR*
Ga0134071_1052512723300010336Grasslands SoilPQTCNDLINNRRPALPGSVPYTDSAAAYSMALDPGTYHWVVAVWKKVGALHVSPADTALLRVAGSYRNPADTSQPGIVTVPNRAAAGDINFVANLDSLRPATDFVTCTAQ*
Ga0134128_1315528923300010373Terrestrial SoilVWKKVGNLTLTAADTALLRVAGYYRNAADTTQPGTVTVPNGSVADSVDFKVDFDNLRPATDFVTCTLR*
Ga0134127_1001355523300010399Terrestrial SoilMAVLPGHYEWVLAVWKKLGQLTLSARDTALLRVAGYYRSPADSTLPGAVTVPSGGVADSVDFKVNFDSLRSATDFVTCP*
Ga0134127_1182296723300010399Terrestrial SoilGTVPDSTDNVFVAAYAAFPQTCNELITNRQPFIPGSVSYTDSVAAYGVTLPPRTYAWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQPGSVIVPNGGAAANINFRVEFDSLRPATDFVSCTAQ*
Ga0134122_1138552423300010400Terrestrial SoilYHWVLAVWKKRGLLTLSAADTALLRVAGYYRNPADTTQPGVVTVPSGARADSINFVVNFDNLRPATDFVTCTAQ*
Ga0137313_100284413300011403SoilNNRRPVIPGSVPYTDSVADYSVPLDPGTYHWLVAVWKKIGPLTLSPADTVLLRVAGYYRNPADTTQPGVVTVPNGAAAGGLDFTADFDNLRPATDFVTCP*
Ga0137429_100790113300011437SoilGSVPYTDSIADYSVPLDPGTYHWLVAVWKKIGALTLSPADTVLLRVAGYYRNPADTTQPGVVTVPNGAAAGGLDFTADFDNLRPATDFVTCP*
Ga0137421_102709123300012039SoilLVAVWKKVGALTLSPADTALLRVAGYYRNPADTTQPGAITVPAGAAAGGVDFTADFDNLRPATDFVTCS*
Ga0137461_112597023300012040SoilPYEWVLAVWKKIGNLTLTAQDTTLLRVAGYYRDPADSTQPGAVTVPSGGAIGDVDFRVNFDSLRPATDFVTCTAR*
Ga0137389_1003027553300012096Vadose Zone SoilPGSVPFTDSVAAYSVALSPGTYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVAHFDSLRPATDFVTCTAQ*
Ga0137383_1112228323300012199Vadose Zone SoilVVPGSVPYSDSAAAYSVAIDPGTYHWIVAVWKKLGSLTISPQDTALLRVAGYYRNPADSTQRGVVSVSNGPAAGDIDFVVDFDSLRPATDFVTCTAQ*
Ga0137382_1026929823300012200Vadose Zone SoilWVLAVWKKPGPLTLTPADTQYLRVAGYYRKPSDSTLPGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0137365_1055122713300012201Vadose Zone SoilPIPGSVTYTDSAALYSVGLDPATYHWVLAVWKKVGNLTLTLQDTALLRVAGYYRDPTDTTQPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAQ*
Ga0137399_1010879813300012203Vadose Zone SoilSVPLSPGTYDWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTRPGVVTVPSGAAAGDVDFVVNFDSLRPATDFVTCTAQ*
Ga0137399_1061692413300012203Vadose Zone SoilNVFIAAYVTFPQTCTDLINNRRPVRPGSVPFTDSIAAYSVPLSPGAYEWVLAVWKKPGTLTLTPADTQYLRVAGYYRRPSDSTLPGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0137380_1081325013300012206Vadose Zone SoilNNRKPPIPGSVTYTDSAAAYSVVLDPATYHWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPTDTTQPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAQ*
Ga0137376_1001561113300012208Vadose Zone SoilPLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSTLPGNVTVPAGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0137376_1131629423300012208Vadose Zone SoilKKRGQLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARADSINFVVDFDHLRPATDFVTCTAQ*
Ga0137377_1035943123300012211Vadose Zone SoilWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGVVTVPSGGGAAPGDIDFLVNFDSLRPATDFVTCTAQ*
Ga0137377_1037721513300012211Vadose Zone SoilRRPVIPGSVPFTDSIAPYSVPLSPGAYQWVLAVWKKPGTLTLTPADTQYLRVAGYYRDPVDSTQPGVVTVPSGGGAAPGDIDFVVNFDSLRPATDFVTCTAQ*
Ga0137459_108272913300012228SoilVFIAAYATFPQTCTDLINNRRPLIPGSVPYTDSVAAYSVALDPGTYQWVVAVWKKVGALTLSPADTALLRVAGYYRNPADTTQPGAVTVPAGAAAGDVDFTADFDNLRPATDFVTCS*
Ga0137370_1035243223300012285Vadose Zone SoilAYLTLPQTCNDLINNRRPVVPGSVPYTDSAAAYSVAIDPGTYHWIVAVWKKLGALTISPQDTALLRVAGYYRNPADSTQRGVVSVANGPAAGDIDFVVDFDSLRPATDFVICTAQ*
Ga0137370_1055648123300012285Vadose Zone SoilGVYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRKPSDSTLPGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0137386_1011993913300012351Vadose Zone SoilAAYATFPQTCNDLIFSRQPFIPSSVPYTDSVVPYSMPLPAGHYDWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADSTQPGGVTVPSGGAAGDVDFKVNFDSLRPATDFVTCTVR*
Ga0137386_1017725413300012351Vadose Zone SoilTCYDLINNRRPVIPGSVPYTDSAAAYSMALDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPADTSQPGIVIVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ*
Ga0137386_1065458313300012351Vadose Zone SoilDLIANRQPVIPGSVPFTDSLTLYSVPLSPGNYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP*
Ga0137384_1050659523300012357Vadose Zone SoilGRLQFRGAVPDSTDNVFIAAYLTFPRTCTELIANRQPFIPSSVPYTVSATTYGVALQPGAYQWLLAVWKKVGNLTLSNADTALLRVAGYYRNPADTTQPGVVTVPAGAFANNVDFVVDFDRLEPATDFVMCTAQ*
Ga0137368_1046506223300012358Vadose Zone SoilDSVTTYSIPLPAAHYEWVLAVWKKVGALTLSARDTALLRVAGYYRDPADSTQRGAVTVTTGPAVGDVDLKVNFDSLRPATDFVTCTAQ*
Ga0137375_1005855953300012360Vadose Zone SoilPPIPGSVTYTDSAAPYSVVLDPATYHWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPADSTQPGIVTIPTGGTVGNIDFLVDFDNLRPATDFVTCTAP*
Ga0137375_1005906053300012360Vadose Zone SoilPPIPGSVTYTDSAAPYSVVLDPATYHWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPADTTLPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAP*
Ga0137375_1021565723300012360Vadose Zone SoilRQPFIPSSVPYTDSLSLYSIELPPDTYEWVLAVWKKLGDLMLTANDTTLLRVAGYYRSPADSTQPGSVTVPTGSVADSIDFKVDFDNLRPATDFVSCTAR*
Ga0137361_1101418613300012362Vadose Zone SoilLDPGTYHWVVAVWKKVGTLHVSPADTALLRVAGYYRNPADTSQPGIVIVPNGPAAGDINFVANLDSLRPATDFVTCTAQ*
Ga0137396_1089552123300012918Vadose Zone SoilLAVWKKVGILTLSPQDTALLRVAGYYRNPADSTQFGIVTIPAGGTVGNIDFRIDFDSLRPATDFVICTAQ*
Ga0137396_1129189523300012918Vadose Zone SoilVPANTENVFIAAYLAFPQTCTDLINNRRPVIPGSVPFTDSIAAYSVPLPPAAYHWVLAVWKKPGTLTLTPADTQYLRVAGYFRNPADSTQPGVVTVPGGAAAGDINFVVNFDSLRPATDFVTCTAL*
Ga0137359_1118213913300012923Vadose Zone SoilGTYHWVVAVWKKLGTLTISPQDTALLRVAGYYRNPVDSTQRGVVPISSGPAAGGIDFVVNFDSLRPATDFVTCTAQ*
Ga0137413_1016462923300012924Vadose Zone SoilSPGAYEWVLAVWKKPGTLTLTPADTQYLRVAGYYRRPSDSTLPGSVTVPSGAAAGDIDFVVNFESLRPATDFVTCAP*
Ga0137419_1036084213300012925Vadose Zone SoilLSAGFLHWRLADWKKPGPLPLPPAGTQYRGGAGYYRSPPPGASSQPGVVTVPSGAAAGDIDFVVNFDNLRPATDFVTCTAQ*
Ga0137419_1177638523300012925Vadose Zone SoilYSVPLSPAAYHWVVAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGVVTVPSGAAAGDVNFVVNFDSLRPATDFVTCTAQ*
Ga0137410_1024217723300012944Vadose Zone SoilPQTCTDLINNRRPVRPGSVPFTDSIAAYSVPLSPGAYEWVRAVWKKPGTLTLTPADTQYLRVAGYYRKPSDSTLPGSVTVPSGAAAGDIDFVVNFDSLRPATDFVTCAP*
Ga0137410_1184302723300012944Vadose Zone SoilDNVFLAAYVTFPQNCNDLINNRKPPIPGSVTYTDSAAAYSVALDPGTYHWVLAVWKKVGVLTLSPQDTVLLRVAGYYRNPADSTQFGIVTIPAGGTVGNIDFRVDFDSLRPATDFVTCTAQ*
Ga0137410_1184302923300012944Vadose Zone SoilDNVFLAAYVTFPQNCNDLINNRKPPIPGSVTYTDSAAAYSVPLDPGTYHWVLAVWKKVGILTLSPQDTALLRVAGYYRNPADSTQFGIVTIPAGGTVGNIDFRVDFDSLRPATDFVTCTAQ*
Ga0126375_1170431113300012948Tropical Forest SoilEWLVAVWKKVGSLTLTPADTALLRVAGYYRNPADTTQPGVVTVPNGGAAGNIDFFADFDQLRPATDFVTCTAR*
Ga0134077_1008352813300012972Grasslands SoilRGTIPDSTDNVFVAAYATFPQTCDELIANRQPVIPGSVPYTDSAAAYSVVLPPRAYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTRAGVVTVPSGGAAGSIDFVADFDHLRLATDFVTCTAR*
Ga0134077_1014749613300012972Grasslands SoilSIAPYSVPLSPGAYQWVLAVWKKPGTLTLTPADTQYLRVAGYYRDPVDSTQPGVVTVPSGGGAAPGDIDFVVNFDSLRPATDFVTCTAQ*
Ga0134077_1032382523300012972Grasslands SoilVPYTDSAAAYSMALDPGTYHWVVAVWKKVGTLHLSPADTALLRVAGYYRNPTDTSQPGVVTVPTGAAAGDVNFVAHLDSLRPATDFVTCTLQ*
Ga0134076_1007718523300012976Grasslands SoilPGTLTLTPADTQYLRVAGYYRNPADSTLPGRVTVPPGASVGDVDFVADFDHLRPATDFVTCTAP*
Ga0134076_1041084423300012976Grasslands SoilWVLAVWKKVGNLTLTAQDTTLLRVAGYYRDPTDSTQPGAVTVPSGGAAGGVDFRVNFDSLRPATDFVTCTLR*
Ga0134076_1042854413300012976Grasslands SoilLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP*
Ga0134078_1004714123300014157Grasslands SoilTLRFRGTIPANTENVFIAAYVAFPLTCNELIANRQPLIPGSVRYTDSLTLYSVPLSPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRNPADSTLPGNVTVPLGASAGDVDFAVDFDNLRPATDFVTCTGP*
Ga0134078_1031069623300014157Grasslands SoilWKKQGSLTLTPADTALLRVAGYYRDPADTTQPGVVTVSNGPAPGDINFVVDFDNLRPATDFVSCTAQ*
Ga0180087_100393013300014872SoilRGPIPDSTDNVFIAAYATFPQTCTDLINNRRPVIPGSVPYTDSLAAYSVPLDPGTYPWLVAVWKKVGALTLSPADTALLRVAGYYRNPADTTQPGAVTVPAGAAAGDVDFTADFDNLRPATDFVTCS*
Ga0180066_109749713300014873SoilVAPYSVPLPPDPYEWVLAVWKKTGNLTLTAQDTTLLRVAGYYRDPADSTQPGAVTVPSGGAIGDVDFRVNFDSLRPATDFVTCTAR*
Ga0134069_118715113300017654Grasslands SoilSVVLPPRAYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQAGVVTVPSGGAAGSIDFVADFDHLRLATDFVTCTAR
Ga0134069_135185423300017654Grasslands SoilWVVAVWKKVGSLTLTAQDTVLLRVGGYYRDPADTTRPGIVTVPNGPAPGDIDILVDFDNLRPATDFVTCTAQ
Ga0134112_1028467323300017656Grasslands SoilRPVIPGSVPFTDSIAPYSVPLSPGAYQWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGVVTVPSGAATGDIDFVVNFDSLRPATDFVTCTAR
Ga0134112_1048607523300017656Grasslands SoilPGSVRYTDSLTLYSVPLSPGTYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP
Ga0134074_102738813300017657Grasslands SoilHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPPGASAGDVDFVVDFDNLRPATDFVTCTGP
Ga0134083_1006392723300017659Grasslands SoilPLIPGSVPYTDSLTLYSVPLSPGTYHWVLAIWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGRVTVPPGASVGDVDFVADFDHLRPATDFVTCTAP
Ga0134083_1027784513300017659Grasslands SoilAYVAFPLTCNELIANRQPLIPGSVRYTDSLTLYSVPLSPGTYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP
Ga0184610_108410723300017997Groundwater SedimentPSSVPYTDSVAPYSVELSPATYQWVLAVWKKVGNLTLTPADTALLRVAGYFRDPADTTQPGVVTVPTGAAAGDIDFVVEFDNLRPATDFVTCTAQ
Ga0184621_1027892623300018054Groundwater SedimentNNRKPPIPGSVPFTDSIAAYSVALLPGVYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRSRTDSTQPDSVAVPSGAAAGDIDFVVNFDSLRPATDFVTCAP
Ga0184637_1003538813300018063Groundwater SedimentTDNVFVAAYATFPQTCNELILNRQPFIPGSVLYTDSVAPYSLSLLPGTYQWVLAVWKRVGTLTLTPADTALLRVAGYYRNAADTTQAGVVTVPSGGSAGGVDFVVDFDHLRPATDFVTCTAQ
Ga0184635_1031789813300018072Groundwater SedimentCNDLIFNRQPFIPSSVPYTDSVSLYSIELLPDTYEWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADTTQPGTVTVPNGSVADSVDFKVDFDSLRPATDFVTCTVR
Ga0184640_1006223323300018074Groundwater SedimentYATFPQTCNELIANRQPFIPGSVLYTDSVAPYSLSLLPGTYQWVLAVWKRVGTLTLTPADTALLRVAGYYRNAADTTQAGVVTVPSGGSAGGVDFVVDFDHLRPATDFVTCTAQ
Ga0184640_1017898023300018074Groundwater SedimentDNVFVATYATFPQTCNDLIFNRQPFIPSSVPYTDSVVPYSVAVPPDRYEWVLAVWKKIGNLTLTAQDTTLLRVAGYYRDPADSTQPGAVTVPSGGAIGDVDFRVDFDSLRPATDFVACTA
Ga0184612_1007127823300018078Groundwater SedimentAAYVSFPQTCNDLIFNRQPFIPSSVPYTDSVALYSIDLLPDTYEWVLAVWKKVGNLTLTANDTTLLRVAGYYRSPADSTLPGAVTVPSGGVADSVDFRVNFDSLMPATDFVTCTAR
Ga0066667_1006146423300018433Grasslands SoilSSGTYQWVLAVWKKVGTLTLTPSDTALLRVAGYYRSPVDSTQPGVVTVPSGAAAGDINFVANFDSLRPATDFVTCTAQ
Ga0066667_1169754813300018433Grasslands SoilSVPYTDTAAAYGLALDPGTYHWVVAVWKKLGNLTVSPQDTALLRVAGYFRSPADSTQPGTVTVPSGAAAGDIDFVANFDSLRPATDFVTCTAQ
Ga0066662_1248127723300018468Grasslands SoilTDSLTLYSVPLSPGNYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPPGASAGDVDFVVDFDNLRPATDFVTCTGP
Ga0066669_1156731423300018482Grasslands SoilRFRGTIPDSTDNVFVAAYLSVPQTCTDLINNRRPVIPGSVPYTDSIAAYSVALSAGVYHWVLAVWKKTGTLTLTPADTALLRVAGYYRDPADSTQRGVVTVPNGAAAGDIDFVVNFDSLRPATDFVRCTAP
Ga0179594_1024526813300020170Vadose Zone SoilTVPDSTDNVFMAAYVTFPQTCDDLINNRRPFIPGSVPYTDSVARYSVELDPGTYHWVLAVWKKDGQLTLTPADTAILRVAGFYRVVTDTTQPGILTIPSGGTVGGIDFVVDFDHLRPATDFVTCTGP
Ga0210382_1014516423300021080Groundwater SedimentYTSFPQTCNQLILNRRPFIPSSVPYTDSVALYSIELLPDTYEWVLAVWKKVGDLTLTAQDTALLRVAGYYRNPADTTQPGVVAVPNGNVADSVDFSVDFDNLKPPTDFVTCTLR
Ga0210382_1051529213300021080Groundwater SedimentWKKPGPLTLTAADTQYLRVAGYYRDPADSSQYGVATVPSGSATRDIDFVVNFDSLRPATDFVTCAP
Ga0210379_1032985213300021081Groundwater SedimentTFPQSCTELINNRQPFIPGSVSYTDSVAAYSVELSPGGYEWVLAVWKKVGNLTLSPADTALLRVAGYYRNATDTTQPGAVTVPSGASAGDIDFVVDFDHLHPATDFVTCTAQ
Ga0210379_1046277623300021081Groundwater SedimentWVLAVWKKVGNLTLTPADTALLRVAGYYRHPRDSTLPGVVIVPNGAAAGAIDFVADFDNLRPATDFVTCTAP
Ga0222621_102507613300021510Groundwater SedimentAAYTSFPQTCNQLILNRRPFIPSSVPYTDSVALYSIELLPDTYEWVLAVWKKVGDLTLTAQDTALLRVAGYYRNPADTTQPGVVAVPNGNVADSVDFSVDFDNLKPPTDFVTCTLR
Ga0137417_117419513300024330Vadose Zone SoilNNRRPVIPGSGSVPYRDSAAAYSVALSPGTYQWVLAVWKKIGTLTLSPSDTALLRVAGYYRSPTDSTQRGVVTVPSGPAVRDIDFVVNFDSLRPATDFVTCTAQ
Ga0137417_139923233300024330Vadose Zone SoilVPLDPGTYHWVLAVWKKVGILTLSPQDTALLRVAGYYRNPADSTQFGIVTIPAGGTVGNIDFRVDFDSLRPATDFVTCTAQ
Ga0209342_1103492323300025326SoilLIFGRQPFIPSSVPYTDSVTPYSIPLPANNYEWVLAVWKKVGTLTLSAQDTAILRVAGYYRDPVDSTRRGPVTVPTGAAVAGVDFRVNFDSLRPATDFVTCAAR
Ga0207684_1047007113300025910Corn, Switchgrass And Miscanthus RhizosphereQTCSDLIFNRQPFIPSSVPYADSVTLYSVAVPPAHYEWVLAVWKKRGQLTLSAADTTLLRVAGYYRDPADTTQPGVVTVPTGAKADSINFVVNFDNLRPATDFVTCTAQ
Ga0207646_1025835013300025922Corn, Switchgrass And Miscanthus RhizosphereVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVDFVVDFDRLRPATDFVTCTGP
Ga0209438_101671533300026285Grasslands SoilRQPFIPSSVPYTDSVTLYSITLLPDQYEWVLAVWKKVGQLQLNAADTALLRVAGYYRSPADTSLPGTVIVPSGGVADSVDFRVNFDSMKPATDFVTCTAR
Ga0209237_113089443300026297Grasslands SoilVPFTDSIAPYSVPLSPGAYQWVLAVWKKPGTLTLTPADTQYLRVAGYYRDPVDSTQPGVVTVPSGGGAAPGDIDFVVNFDSLRPATDFVTCTAQ
Ga0209236_106203423300026298Grasslands SoilGTVPANTENVFIAAYLSFPQTCTDLIANRQPLIPGSVPYTDSLAPYSVPLSAATYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGSVTVPNGASTGDVDFLVDFDRLRPATDFVTCTAQ
Ga0209236_107756213300026298Grasslands SoilRFRGTIPDSTDNVFIAAYVTFPRTCNDLINNRRPFLPGSVPYTDSAAAYSVPLSSGTYQWVLAVWKKVGTLTLTPSDTALLRVAGYYRSPVDSTQPGVVTVPSGAAAGDINFVANFDSLRPATDFVTCTAQ
Ga0209238_121144323300026301Grasslands SoilSTDNVFVAAYLTFPQTCNDLINNRRPVIPGSVPYTDSAAAYSMTLDPGTYHWVVAVWKKVGMLHLSPADTALLRVAGYYRNPADTSQPGVVTVPTGAAAGDVNFVANLDSLRPATDFVTCTAQ
Ga0209761_107278923300026313Grasslands SoilLDPGTYHWVLAVWKKPGPLTLTPADTQYLRVAGYYRSPADSTQPGVVTVPSGAAAGDINFVAHFDSLRPATDFVTCTAQ
Ga0209471_127581123300026318SoilVPYTDSAAAYSMTLDPGTYHWIVAVWKKVGMLQLSAADTALLRVAGYYRNPADTTQPGVVTVPQGAAAGDVDFVANFDSLRPATDFVRCTAQ
Ga0209131_104946623300026320Grasslands SoilGSVPYTDSAAAYSVALDPGTYHWIVAVWKKLGTLTISPQDTALLRVAGYYRDPADSTQPGMVTVLNGAAPGDIDFRVNFDSLRPATDFVTCTAQ
Ga0209470_124672813300026324SoilRFRGTIPDSTDNVFIAAYVTFPQTCNDLINNRKPPIPGSVTYTDSAAAYSVALDPATYHWVLAVWKKVGNLTLTPQDTALLRVAGYYRDPADTTQPGIVTIPAGGTVGNIDFRVEFDSLRPATDFVTCTAQ
Ga0209801_107516523300026326SoilPQTCTDLIANRQPLIPGSVPYTDSLALYSVPLSPGTYQWVLAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVNFVVDFDRLRPATDFVTCTGP
Ga0209473_133099023300026330SoilPYTDSAAAYSMTLDPGTYHWIVAVWKKVGMLQLSAGDTALLRVAGYYRNPADSTQPGVVTVPQGAAAGDVDFVANFDSLRPATDFVRCTAQ
Ga0209803_117048123300026332SoilTFPQTCTDLIANRQPLIPGSVPYTDSLALYSVPLSPGTYQWVLAVWKKPGTLTLTPADTQYLRVGGYYRNPADSTQPGSVTVPNGASAGDVNFVVDFDRLRPATDFVTCTGP
Ga0209158_119820313300026333SoilGTIHFRGTIPNSTDNVFVAAYATFPQTCDELIANRQPVIPGSVPYTDSAAAYSVVLPPRAYEWLVAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQAGVVTVPSGGAAGSIDFVADFDHLRLATDFVTCTAR
Ga0209058_102351713300026536SoilVPLSPGNYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTLPGIVTVPSGASAGDVDFVVDFDNLRPATDFVTCTGP
Ga0209157_104802223300026537SoilANTESVFIAAYLAFPQTCTDLINNRRPVIPGSVPFTDSIAPYSVSLSPGAYHWVLAVWKKPGTLTLTPADTQYLRVAGYYRNPADSTQPGVVTVPSGAATGDIDFVVNFDSLRPATDFVTCTAR
Ga0209161_1037750713300026548SoilPGAYQWLLAVWKKVGNLTLSNADTALLRVAGYYRNPADTTQPGVVTVPAGAFANNVDFVVDFDRLEPATDFVMCTAQ
Ga0209814_1027729213300027873Populus RhizosphereRYSVALSPGTYHWVLAVWKKPGALTLTPADTQFLRVGGYYRSPADPAQPGTVTVPNGAAAGDVDFVVNFDSLRPATDFVTCTGP
Ga0209382_1067724113300027909Populus RhizosphereDQYAWVLAVWKKLGNLTLTAQDTTLLRVAGYYRNPADTTQPGTVTVPNGGVADSVDFKVNFDSLKPATDFVTCP
Ga0209382_1067982613300027909Populus RhizosphereDSLVVYSVAVPADRYEWVLAVWKKIGTLTLTPQDTTLLRVAGYYRDPADSTLPGAVTVPSGGATADVDFRVNFDSLRPATDFVSCTAR
Ga0209382_1098001513300027909Populus RhizosphereVCGTIRFSGTVPDSTDNVFVAAYTSFPQTCTQLILNRRPFIPSSVPYTDSVSLYSIDLLPDTYEWVLAVWKKVGTLTLTANDTTLLRVAGYFRSPADSTQPGSVTVPNGGVADSVDFRVNFDSLRPATDFVTCTLR
Ga0209885_100377123300027950Groundwater SandKKIGVLTLSPADTALLRVAGYYRHPRDSTVPGVVTVPTGASVGDINFAVDFDSLRPATDFVTCQ
Ga0209853_116254723300027961Groundwater SandATFPQNCNELIANRQPFIPSPVPYTDSVAAYSVPLLPGQYEWVLAVWKKDGTLTLTPADTALLRVAGYYRNPADTTQPGVVTVPNGAAAGGIDFIVDFDYLRPATDFVTCTAL
Ga0307313_1016702123300028715SoilGTYQWVLAVWKKVGTLTLSPADTALLRVAGYYRSLTDSTQPGLVTVPTGPAVRDIDFVVNFDSLRPATDFVTCTAQ
Ga0307317_1032519613300028720SoilQTCNDLIFNRQPFIPSSVPYADSVSLYSIELLPDTYEWVLAVWKKVGNLTLTAADTALLRVAGYYRNPADTTQPGTVTVPNGSVADSVDFKVDFDSLRPATDFVTCTVR
Ga0307320_1021456113300028771SoilTGAFAAYGVPLDPGAYHWVLAVWKKRGPLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARVDSINFVVDFDHLRPATDFVTCTAQ
Ga0307282_1053936423300028784SoilAYLTFPQTCSDLILNRRPVFPGSVPYTGASAAYGVPLDPGDYHWVLAVWKKRGQLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARVDSINFVVDFDHLRPATDFVTCTAQ
Ga0307290_1003359223300028791SoilVAVWKKVGDLTLTAQDTTLLRVAGYYRSPADSMQPGVVTVPTGGVVDSVDFRVNFAGLRPATDFVTCTAR
Ga0307302_1015759823300028814SoilVYPGSVPYTGAFAAYGVPLDPGAYHWVLAVWKKRGPLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARVDSINFVVDFDHLRPATDFVTCTAQ
Ga0307278_1025820513300028878SoilMAAYLTFPQTCSDLILNRRPVYPGSVPYTGAFAAYGVPLDPGAYHWVLAVWKKRGQLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARVDSINFVVDFDHLRPATDFVTCTAQ
Ga0307308_1007177413300028884SoilYQWVLAVWKKVGTLTLSPADTALLRVAGYYRSLTDSTQPGLVTVPTGPAVRDIDFVVNFDSLRPATDFVTCTAQ
Ga0307308_1014005123300028884SoilAVWKKRGPLTLSAADTALLRVAGYYRNLADTTQPGVVTVPSGARVDSINFVVDFDHLRPATDFVTCTAQ
Ga0307308_1032500523300028884SoilIPGSVPYTDSTAAYSVPLPSGTYQWVIAVWKKIGTLTLTPSDTALLRVAGYYRSPTDSTQRGTVTVPSGAATGDIDFVVNFDSLRPATDFVTCTAL
Ga0307498_1033178223300031170SoilNRRPLIPGSVPYTDSAAAYSVELAPDTYHWVVAVWKKVGSLTLTAQDTVLLRVGGYYRDPADTTRPGIVTVPNGPAPGDIDILVDFDNLRPATDFVTCTAQ
Ga0307469_1009413313300031720Hardwood Forest SoilVTLPPRTYAWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADTTQPGSVIVPNGGAAANINFRVEFDSLRPATDFVSCTAQ
Ga0310900_1076561823300031908SoilLVAVWKKEGPLTLSPADTALLRVAGYYRNPADTTQPGVFTVPNGAAAGDLDFTADFDHLRPATDFVTCTL
Ga0326597_1045613823300031965SoilVAAWKKVGTLTLTPADTALLREAGFYRDPADTSQPGAVTVSAGADGIDFVVDLDDLHPITDYLTCAVR
Ga0307470_1109461723300032174Hardwood Forest SoilFIAAYANFPQTCADLINNRRPVIPGSVPYTDSIADYSVALDPGTYHWLVAVWKKEGSLTLSPADTALLRVAGYYRNPADTTQPGVLTVPTGAAAGDLDFTADFDHLRPATDFVTCTAQ
Ga0307471_10334398823300032180Hardwood Forest SoilFIPASVPYTDAAAEYSVVLPPGAYEWVLAVWKKVGTLTLTPADTALLRVAGYYRNPADSTQSGVVTVPTGGAAANIDFRAEFDSLRPATDFVTCTAQ
Ga0214471_1038493523300033417SoilVPYTDSVAPYSVELSPGTYEWVVAVWKKVGNLTLSPADTALLRVAGYYRAFIDTMSPGQVTVPTGAYAGDIDFFVDFDNLRPATDFVTCT
Ga0364928_0008495_1497_18593300033813SedimentNVFIAAYATFPQTCNDLISNRRPFIPSSVPYTDSVAAYSVELSPGTYEWVVAVWKKIGALTLSPADTALLRVAGYYRHPRDSTLPGIVTVPNGASAGDIDFVVNFDSLRSATDFVTCTAQ
Ga0364928_0032932_1_3393300033813SedimentSFPQTCGDLIFNRQPFIPSSVPYTDSITLYSIPLPPSGYEWVLAVWKKVGTLTLSAQDTALLRVAGYYRDPADSTLPGAVTVQNGGAIGGVDFKVNFDSLLPATDFVTCTAR
Ga0364931_0215280_3_3773300034176SedimentDSTDNVFVAAYVSFPQTCNDLIFNRQPFIPSSVPYTDSVALYSIDLLPDTYEWVLAVWKKVGNLTLSANDTTLLRVAGYYRSPADSTLPGAVTVPSGGVADSVDFRVNFDSLMPATDFVTCTAR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.