NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F049171

Metagenome Family F049171

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F049171
Family Type Metagenome
Number of Sequences 147
Average Sequence Length 128 residues
Representative Sequence MTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIE
Number of Associated Samples 120
Number of Associated Scaffolds 147

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 78.91 %
% of genes near scaffold ends (potentially truncated) 99.32 %
% of genes from short scaffolds (< 2000 bps) 82.99 %
Associated GOLD sequencing projects 114
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(23.129 % of family members)
Environment Ontology (ENVO) Unclassified
(42.857 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.939 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176.178.180.182.184.186.188.190.192.194.196.198.200
1GPIPI_01681430
2GPIPI_01613190
3KansclcFeb2_10610660
4INPhiseqgaiiFebDRAFT_1019006712
5INPhiseqgaiiFebDRAFT_1019007812
6JGI1027J12803_1060661181
7JGI10220J13317_107504571
8JGI24035J26624_10112102
9JGIcombinedJ43975_100039163
10JGIcombinedJ43975_100099641
11Ga0062593_1000881291
12Ga0063356_1030020142
13Ga0062591_1022146222
14Ga0066674_100697362
15Ga0066690_107762272
16Ga0066675_100793741
17Ga0066675_112559292
18Ga0066388_1036131431
19Ga0070671_1010388112
20Ga0070667_1021407251
21Ga0070705_1010018352
22Ga0066682_100910122
23Ga0066682_101803102
24Ga0066681_101417722
25Ga0066681_101478902
26Ga0070707_1010276051
27Ga0070707_1018756082
28Ga0070672_1014841811
29Ga0066701_101259972
30Ga0066661_100048301
31Ga0066661_103464442
32Ga0066707_109782012
33Ga0066707_109950812
34Ga0066705_109297562
35Ga0066654_101016612
36Ga0066706_100840971
37Ga0066905_1020458761
38Ga0068861_1007902211
39Ga0068860_1024311712
40Ga0070716_1007710441
41Ga0066653_102082611
42Ga0066665_102109182
43Ga0066659_100709652
44Ga0066660_101213792
45Ga0075424_1011903961
46Ga0099795_101182861
47Ga0105243_106440601
48Ga0105249_106603431
49Ga0134088_100353203
50Ga0134088_107182261
51Ga0134084_100123731
52Ga0134111_100239942
53Ga0134080_100464021
54Ga0126376_112192302
55Ga0126378_131555192
56Ga0134125_125980212
57Ga0126381_1014954741
58Ga0137392_109999951
59Ga0137383_102973152
60Ga0137383_105495382
61Ga0137382_109772901
62Ga0137382_110219581
63Ga0137365_100212591
64Ga0137365_110114341
65Ga0137365_112502741
66Ga0137374_111293111
67Ga0137376_100189091
68Ga0137376_113507531
69Ga0137377_102073711
70Ga0137377_106950112
71Ga0137370_107515132
72Ga0137367_103256511
73Ga0137366_108023621
74Ga0137366_110883702
75Ga0137371_111765222
76Ga0137360_114569461
77Ga0137373_102643341
78Ga0137394_106384261
79Ga0137404_100834904
80Ga0137404_103705152
81Ga0137404_118484411
82Ga0137407_102124552
83Ga0134077_102751102
84Ga0134110_105634001
85Ga0134075_100330251
86Ga0134078_100818812
87Ga0134078_103206121
88Ga0137409_111170642
89Ga0134073_102018072
90Ga0134089_101146002
91Ga0134085_100666942
92Ga0132256_1005373462
93Ga0134069_13781681
94Ga0134112_102519282
95Ga0134083_101549861
96Ga0163161_105855631
97Ga0184604_100002595
98Ga0184605_104995761
99Ga0184608_103584262
100Ga0184620_100522691
101Ga0184620_102989222
102Ga0184621_102393121
103Ga0184619_100252581
104Ga0137408_12799656
105Ga0137408_12861391
106Ga0193715_10189702
107Ga0193723_10131483
108Ga0193723_10898161
109Ga0193713_10414431
110Ga0193725_10484442
111Ga0193747_10623342
112Ga0193727_10486531
113Ga0193729_11523131
114Ga0193731_10889791
115Ga0193730_10981561
116Ga0193735_10467681
117Ga0193732_10706271
118Ga0193721_10170662
119Ga0193721_11480281
120Ga0193726_11997982
121Ga0193724_10042551
122Ga0210382_105344651
123Ga0193719_104777051
124Ga0193709_10131872
125Ga0224452_11449272
126Ga0224452_12049741
127Ga0222622_100864101
128Ga0222622_104216242
129Ga0193714_10289761
130Ga0247799_10340491
131Ga0207681_100392473
132Ga0209055_10178891
133Ga0209686_11089312
134Ga0209470_12489682
135Ga0209375_10415151
136Ga0257156_10426081
137Ga0209058_11272792
138Ga0209376_10566151
139Ga0209156_100589071
140Ga0209156_104679312
141Ga0209648_102105301
142Ga0209388_11909921
143Ga0307282_103076071
144Ga0307287_100849381
145Ga0307312_107667482
146Ga0307308_102334942
147Ga0306918_108032252
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 21.66%    β-sheet: 16.56%    Coil/Unstructured: 61.78%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100110120MTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIESequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Soil
Soil
Soil
Grasslands Soil
Soil
Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Arabidopsis Rhizosphere
4.8%15.6%20.4%10.9%23.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_016814302088090014SoilMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIXWSLLEKDPNRFYEQYEVKAIWVMNLADKKKVGAIEDTGGYIRPGSHRT
GPIPI_016131902088090014SoilMKPCAYILGIGCVLIAARFVGDRLIGAEPSAQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDLNRFYEQYDVKAIWVINLVDKKKIGAIGD
KansclcFeb2_106106602124908045SoilVTPRAYIFGIACLLIAAGSGNDRLAAAAEPSVQPEPPWIEIGSEKLVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGS
INPhiseqgaiiFebDRAFT_10190067123300000364SoilMKPCAYILGIGCVLIAARFVGDRLIGAEPSAQPXPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDLNRFYEQYDVKAIWVINLVDKKKIGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQ
INPhiseqgaiiFebDRAFT_10190078123300000364SoilMEACLAHFRAAEFHCVSSLPYLSSEAMTPRAYILGIACMLIAARFGNDRLTAAEXSVXXEPPWIEIGXEKAXIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRT
JGI1027J12803_10606611813300000955SoilVIMAAEFAHDKLIAADPSPQPEPRWIEIGSDRAVIARDSKSADGRNALAWTVDSSEPVDWSLLETDPNRFYEQYDVKAIWVINLADKKKVGAVGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQWK
JGI10220J13317_1075045713300001139SoilMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIEDTGGYIRPG
JGI24035J26624_101121023300002126Corn, Switchgrass And Miscanthus RhizosphereMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVIDLADKKKVGAIGDTGGYIRPGSHRTLSVAWGPIENGRRLGADRERKTVCPGCLSVEMGHRYFASSGRGPR*
JGIcombinedJ43975_1000391633300002899SoilMIPRAYILGIGCMLIAGKFASGQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDANHFYEQYDLKEIWVINIPEKKKLGAVADKGGYVRPGSHQTVSVAWGPIENGRRF
JGIcombinedJ43975_1000996413300002899SoilMIPRAYILGIGCMLIAGKFASGQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDANHFYEQYDLKEIWVINIPEKKKLGAVADKGGYVRPGSHQTVSVAWGPIENGRRFA
Ga0062593_10008812913300004114SoilMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGA
Ga0063356_10300201423300004463Arabidopsis Thaliana RhizosphereMKPCAYILGIGCVLIAARFVGDRLIAEPSAQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSVLEKDLNRFYEQYDVKAIWVINLVDKKKIGAIGDTGGYVRPGSHRTLSVAWGPIE
Ga0062591_10221462223300004643SoilMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAI
Ga0066674_1006973623300005166SoilVLTYFEAAEFHCVALSRNYHAETMTLRAYILGIACVLIGARFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALAWTVASNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGDTGGYVRPGSHRTLSIAWGPVENGKRFALAA
Ga0066690_1077622723300005177SoilMNERACAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSDPVDWSLLEKDPNHFYEQYDVKEIWIVNIPDKKKVGSVADQGGYVRPGSHRTLSIAWGPIENGRRFALA
Ga0066675_1007937413300005187SoilMWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKK
Ga0066675_1125592923300005187SoilMKPCGYTLAIGYMLIAARLADDRLIAAEPPVQPEPRWIEIGSEKLIIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGD
Ga0066388_10361314313300005332Tropical Forest SoilMNPSSYVFGSGCILIAGLLATDQLPAEDPSPQPEQRWIEIGSEKAVIARDSKSADGHNALAWTIDSSEPIDWSLLEKDANRFYEQYDVKEIWVVNLPDKKKIGMLADKGGYVRPGSHRTL
Ga0070671_10103881123300005355Switchgrass RhizosphereMTLRAYIFGIGCMLIAARFGNDRLNAAEPSVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTGGYVRPGSHRTLSVAWGPVENGRRFALAAYQWKWGTD
Ga0070667_10214072513300005367Switchgrass RhizosphereMTPRAYIFGIACMLIAARFGNDRLTAAEPSVQPESPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQ
Ga0070705_10100183523300005440Corn, Switchgrass And Miscanthus RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTGGYVRPGSHRT
Ga0066682_1009101223300005450SoilMWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGIGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALGWTVDSNEPVDWSLLEKDPNRFYEQYDAKAIWAINLADKKKVG
Ga0066682_1018031023300005450SoilMNERACAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSSDGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKEIWIVNIPDKKKVGSVADQG
Ga0066681_1014177223300005451SoilMNERACAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSDKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKEIWIVNIPDKKKVGSVADQGGYVRPGSHRTLSIAWGPIENGRRFALAAYEWKWGTDTLLLLD
Ga0066681_1014789023300005451SoilMWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGDKGGYVR
Ga0070707_10102760513300005468Corn, Switchgrass And Miscanthus RhizosphereMTLRAYTFGIACMLIAARFGNDRLLAAAEPSVQPESPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSH
Ga0070707_10187560823300005468Corn, Switchgrass And Miscanthus RhizosphereMKRCTYILGVACALVAAEFVCDKLISAESSPPAEPRWIEIGSERAAIVRDSKSADGRNALAWTIDSSEPIDWSLLEKDVEHFYEQYDVKEIWVVNLLDKKKIGTVGDKGGYVRPGSHRTLSVAWGP
Ga0070672_10148418113300005543Miscanthus RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTGGYVRPGSHRTLSVAWGPVEN
Ga0066701_1012599723300005552SoilVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGD
Ga0066661_1000483013300005554SoilVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEIWVVNLSDKKKIGAVADKGGYVRPGSHRTLSVAWGPLSDNGRRF
Ga0066661_1034644423300005554SoilMNECAFAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKEIWIVNIPDKKKVGSVADQGGYVRP
Ga0066707_1097820123300005556SoilVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEIWVVNLSDKKKIGAVADKGGYVRPGSHRTLSVTWGPLSDNGRRFALAAYQWKWGTDTLLLLDV
Ga0066707_1099508123300005556SoilVLVAAEFVCAKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQWKWGTDTLLLL
Ga0066705_1092975623300005569SoilMKALACLFGFGCIFVVAQFLDAPLRAAEPASSSEPRWIDIGSEKAVIVRDSMSADGRNALAWTVDSTDPVDWSLLEKDVDKFYEQYEVKEIWIINLSDKKKVGSVGDKGGYVRPGSHRTL
Ga0066654_1010166123300005587SoilVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEIWVVNLSDKKKIGAVADKGGYVRPGSHRTLSVTWGPLSDNGRRFALAAYQWKWGTDTLLLLDVGP
Ga0066706_1008409713300005598SoilMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGDKGGYVRPGSHRTLSVAWSPVENGRRFALAAYQWKWGTDTLLLLD
Ga0066905_10204587613300005713Tropical Forest SoilMNNRRAGFITAAILLNVLASDKVAGAESSGEPRWIEIGSERAVIARDSKSADGRNALAWTVESTEPVDWSLLEKDPDHFYEQYEVKEIWVVNLADKNKIGTVGDKGGYVRPGSHRTLSIAWGPIADNGRRFALAAYQWKWG
Ga0068861_10079022113300005719Switchgrass RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKV
Ga0068860_10243117123300005843Switchgrass RhizosphereMTPRAYIFGIACMLIAARFGNDTLTAASESAVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGSVGDTG
Ga0070716_10077104413300006173Corn, Switchgrass And Miscanthus RhizosphereMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRF
Ga0066653_1020826113300006791SoilMNERACAFGIGCILIVAQLSGDRIAAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKEIWIVNIPDKKKVGSVADQGGYVRPGSHRTLSIAWGPIENGRRFALAAY
Ga0066665_1021091823300006796SoilMTLRVYIFGIAWVLIAARFGNDRFAAAAEPSVQPDRPWVEIGLEKAVIARDSKSADGRNALAWTIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLTD
Ga0066659_1007096523300006797SoilMILRVYIFGIAWVLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLLLL
Ga0066660_1012137923300006800SoilMNERAFAFGIGCILIVAQLSGDRIVAADSSAQSGPRWIDIGSEKVVIARDSMSADGRNALAWTVDSSEPVDWSLLEKDPNHFYEQYDVKELWIVNIPDKKKVGSVADQGGYVRPGSHRTLSIAWGPIENGRRFALA
Ga0075424_10119039613300006904Populus RhizosphereMTPRAYILGIACVLIAARSGNDSLAAAAEPSVQPEPRWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFHEQYEVKAIWVINLVDKKKVGAIGDTGGYVRPGSHRTLSIAWGPIENGRR
Ga0099795_1011828613300007788Vadose Zone SoilVDVRSALPHPQRNSIVLAFRLTYHQEAMKPCACTLAIGYILIAALLAGERLIAAEPSVQPELRWIEIGSEKLIIARDSKSADGRNALAWAVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKK
Ga0105243_1064406013300009148Miscanthus RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDRNEPIDWSLLEKDPDRFYEQYDVKAIWVINLADKKKVGAVGDTGGYVRPGSH
Ga0105249_1066034313300009553Switchgrass RhizosphereMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVIDLADKKKVGAIGDT
Ga0134088_1003532033300010304Grasslands SoilMTLRAYTFGIACMLIAARFGNHRLAAAAEPSVQPDRPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGA
Ga0134088_1071822613300010304Grasslands SoilMTLRAYILGIACVLIAAPFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALAWTVASNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGDTGGYVRPGSHRTLSIAWGPVENGKRFALAAYQWKWG
Ga0134084_1001237313300010322Grasslands SoilMTLRAYTFGIACMLIAARFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGDTGGYVRPGSHRTLSIAWGPVENGKRFAL
Ga0134111_1002399423300010329Grasslands SoilMTLRAYTFGIACMLIAARFGNDRLAAATEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLTDQKKVGTVGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQW
Ga0134080_1004640213300010333Grasslands SoilMTLRAYTFGIACMLIAARFGNDRLAAATEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLTDQKKVGTVGDTGGYVRPGSHRTLSV
Ga0126376_1121923023300010359Tropical Forest SoilMNNRRAGFITAAILLNVLASDKVAGAESSGEPRWIEIGSERAVIARDSKSADGRDALAWTVDSAEPIDWSLLDKDPERFYEQYDVKEILVVNLADKKKIGTVGDKGGYVRPGSHRTL
Ga0126378_1315551923300010361Tropical Forest SoilVNPGAYIFGLGWMLIAARSGADPLIAAEPSVQPEPRWIEIGSEKAVVARDSKSADGRDALAWTVDSAEPIDWSLLDKDPERFYEQYDVKEIWVVNLADKKKI
Ga0134125_1259802123300010371Terrestrial SoilVLALRLTYPEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIARDSKSADGRNALAWTVDRNEPIDWSLLEKDPDRFYEQYDVKAIWVINLADKKKVGAVGDTGGYVRPGSHRTLSVAWGPVENGRRFALAAYQWKW
Ga0126381_10149547413300010376Tropical Forest SoilMTARAYIFGVACVLIAARLGNDKLAAAAEPSPQSESSWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVVDLADKTKVGTIGDTGGYVRPGSHRTLSVAW
Ga0137392_1099999513300011269Vadose Zone SoilMLVAAEFVCDKVISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALVWTIDSSEPIDWSLLEKDVEHFYGQYDVKEIWVVNLSDKKKIGTVGDKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQWK
Ga0137383_1029731523300012199Vadose Zone SoilMTLRAYTFGIACMLIAARFGNKRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWIIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGY
Ga0137383_1054953823300012199Vadose Zone SoilVTPRAYIFGIACLLIAAGSGNDGLAAAAEPSVQPEPPWIEIGSEKLVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGALGDTGGYVRPGSHRTLSVAWGPMENGRRFALAAYQWKWGTD
Ga0137382_1097729013300012200Vadose Zone SoilMTRRAYIFGIACMLITARFGNDTLTAAAESSVRPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWIINLADKKKVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQWKWGTDTLLLLDV
Ga0137382_1102195813300012200Vadose Zone SoilVKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQWKW
Ga0137365_1002125913300012201Vadose Zone SoilMEACEPIPQPNSIVLAPCPTYHPEAVTPRAYIFGIACLLIAPGSGNDGLAAAAEPSVQPEPPWIEIGSEKLAIARDSKSADGRNALAWTVDSNEPEDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGDYVRPGSHRTLSVAWGPMENGRRFA
Ga0137365_1101143413300012201Vadose Zone SoilMTLRVYIFGIAWVLIAARFGNDRLAAAAEPSVQPDRPWVEIGPEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVA
Ga0137365_1125027413300012201Vadose Zone SoilMTLRAYTFGIACMLIAARFGNKRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWIIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSH
Ga0137374_1112931113300012204Vadose Zone SoilMSPRAYIFGIACMLIAARFGNYTLTAAAESSVQQEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSSESIDWSLLKKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTD
Ga0137376_1001890913300012208Vadose Zone SoilVKPRTYLLGIACVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQWKWGTD
Ga0137376_1135075313300012208Vadose Zone SoilMKPCAYTLAIGYMLIAARLSGERLIAAEPSVQPEPRWIEIGSEKLIIARDSKSADGRNALAWAVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDT
Ga0137377_1020737113300012211Vadose Zone SoilMTLRAYTFGIACMLIAARFGNKRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWIIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHR
Ga0137377_1069501123300012211Vadose Zone SoilVTPRAYIFGIACLLIAPGSGNDGLAAAAEPSVQPEPPWIEIGSEKLAIARDSKSADGRNALAWTVDSNEPEDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHR
Ga0137370_1075151323300012285Vadose Zone SoilVLAFRLTYHQEAMKPCAYTLAIGYMLIAARLADDRLIAAEPSVQPEPRWIEIGSDKLIIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYV
Ga0137367_1032565113300012353Vadose Zone SoilMTLRAYTFGIACMLIAARFGSDRLAAAAESSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDT
Ga0137366_1080236213300012354Vadose Zone SoilMTPRAYIFGIACLLIAARFGNDRLAAATEPSVQPELPWIEIGSEKAVIARDSKSADGRNALAWTVDSDEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVG
Ga0137366_1108837023300012354Vadose Zone SoilMTLRAYTFGIACMLIAARFGNKRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWIIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVR
Ga0137371_1117652223300012356Vadose Zone SoilMTPRAYILGIGCMLIAGEFASDQLIAAEPSPQREPGWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWLLLERDPNRFYEQYDVKEIWVINVPDKKKVGAVGDKGGYVRPGSHRTLSVAWGPIE
Ga0137360_1145694613300012361Vadose Zone SoilMLIAGEFASHQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSGDGRNALAWTVDSSEPVDWSLLEKDADHFYEQYEVKEIWVINIPEKKKLGAVADKGGYVRPGSHQTVS
Ga0137373_1026433413300012532Vadose Zone SoilMTLRAYTFGIACMLIATRFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGG
Ga0137394_1063842613300012922Vadose Zone SoilMTLRAYTFGIACMLIAARFGNDGLAAAAETSVQPDRPWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDVDRFYEQYDLKAIWVINLTDKKKLGTVGDKGGYVRPGSHRTLSVAWGPVENGRRFA
Ga0137404_1008349043300012929Vadose Zone SoilMLVAGQFAANKLAAAETSAPSEPRWIEIDSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADRFYEQYDLRAIWVINLPDKKKVGTVGDKGGYVRPGSHRTLSVAWG
Ga0137404_1037051523300012929Vadose Zone SoilVDVRTALPHPQRNSIVLAFRFTYHQEAMKPCACTLAIGYMLIAARLAGDRLIAAEPSVQPEPRWIEIGSEKLIIARDSKSGDGRNALAWAVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRSGSHRTLSVAWAPIENGRRFALAAYQWK
Ga0137404_1184844113300012929Vadose Zone SoilMAARFGNDRLLTAAEPSVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYIRPGSHRTLS
Ga0137407_1021245523300012930Vadose Zone SoilMAARFGNDRLLTAAEPSVQPEPPWIEIGPEKAVIARDSKSADGRNALVWTIDSNEPVDWSLLEKDPNRFYEQYEVKAIWIINLADKKKVGAIGDTGGYIRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLLLLDV
Ga0134077_1027511023300012972Grasslands SoilMWQRVTAFPQRNCIVLASRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGDKGGYVRPGSHRTL
Ga0134110_1056340013300012975Grasslands SoilMEFHCAAARVLLSLEAMKTRAYLFGLGCIFVVAQFANERLLAAQSEPRSIDIGSEKAVIVRDSMSADGRNALAWTVDSSEPVDWSLLEKDVDKFYEQYEVKEIWIVNLSDHRKIGTVGDKGGYTRPGSHRTLSV
Ga0134075_1003302513300014154Grasslands SoilMWQRVTAFPQRNCIVLASRTAYDQEAMKLCAYILGIGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIGRDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGDKGGYVRPGSHRTLSVAWSPVENGRR
Ga0134078_1008188123300014157Grasslands SoilMTLRAYILGIACVLIAAPFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALAWTVASNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGDTGGYVR
Ga0134078_1032061213300014157Grasslands SoilMTLRVYIFGIAWVLIAARFGNDRFAAAAEPSVQPDRPWVEIGLEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLTDQKKVGTVGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLLLLDVGQD
Ga0137409_1111706423300015245Vadose Zone SoilALFPNLAAMPVVPEPEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWAGRGTHCV
Ga0134073_1020180723300015356Grasslands SoilMTLRAYTFGIACMLIAARFGGNRLNAAEPSVQPEPRWIEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTVGDKGGYVR
Ga0134089_1011460023300015358Grasslands SoilMTLRVYIFGIAWVLIAARFGNDRFAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVASNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGD
Ga0134085_1006669423300015359Grasslands SoilMWQRVTAFPQRNCIVLPSRTAYDQEAMKLCAYILGMGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGTV
Ga0132256_10053734623300015372Arabidopsis RhizosphereVLALRLTYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIARDSKSADGRNALAWTVDRNEPIDWSLLEKDPDRFYEQYDVKAIWVINLADKK
Ga0134069_137816813300017654Grasslands SoilKLWWEMWQRVTAFPQRNCIVLASRTAYDQEAMKLCAYILGIGCMLIAARFGGNRLNAAEPSVQPEPRWIEIGSEKVVIARDSKSADGRNALGWTVDSNEPVDWSLLEKDPNRFYEQYDAKAIWAINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQ
Ga0134112_1025192823300017656Grasslands SoilMTLRAYIFPIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLS
Ga0134083_1015498613300017659Grasslands SoilVLASRTAYDQEAMKLCAYILGIGCMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVG
Ga0163161_1058556313300017792Switchgrass RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTG
Ga0184604_1000025953300018000Groundwater SedimentMTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPTRFYEQYDVKAIWVINLADKKKVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAA
Ga0184605_1049957613300018027Groundwater SedimentMTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKTKVGAIGDTGGYVRPGSHRTLSVAWGPIEN
Ga0184608_1035842623300018028Groundwater SedimentMTRRAYIFGIACMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWLLLEKDPSRFYEQYEVKAIWVINLADKKKVGSVGD
Ga0184620_1005226913300018051Groundwater SedimentMKPCAYTLAVGYMLIAVRLAGDGLIVAEPSIQPEQRWIEIGSEKAVIVRDSKSADGRNALAWTVDSTEPVDWSLLEKDPNRFYEQYDVKAIWVINLTDKKKVGAVGDTGGYVRPGSHQTLSVAWGPIENGRRFALAAYQWKWGTDTLLL
Ga0184620_1029892223300018051Groundwater SedimentVLALRLTYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIVRDSKSADGRNALAWTVDSNKPIDWSLLEKDPDRFYEQYDVKAIWVINLADKKKVGAVGDTGGYVRPGSHRTLSVAW
Ga0184621_1023931213300018054Groundwater SedimentMKPCAYVLGIGCMLIAGDFASDQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALVWTVDSIEPVDWSLLDKDPNGFYEKYEVKAIWVINLADKKKVGAVGDTGGYLRPGSHRTLSVAWGPVENGRRFALAAYQWKWGTDTILLL
Ga0184619_1002525813300018061Groundwater SedimentMTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPTRFYEQYDVKAIWVINLADKKKVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQWKWGTDTLLL
Ga0137408_127996563300019789Vadose Zone SoilMTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIE
Ga0137408_128613913300019789Vadose Zone SoilMTLRAYTFGIACMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGVGAIGDTGGYVRPGSHRTL
Ga0193715_101897023300019878SoilMEAHEPISAAEFHCVGSLAYLSSEAMTPRAYIFGIACVLIAAGFGNDRLAAAAEPVQPEPPWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGSIGDTGGYVRPGSHRT
Ga0193723_101314833300019879SoilMKPCAYTLAVGYMLIAVRLAGDGLIAAEPSIQPEQRWIEIGSEKAVIVRDSKSADGRNALAWTVDSTEPVDWSLLEKDPNRFYEQYDVKAIWVINLTDKKKVGAVGDTGGYVRPGSHQTLSVAWGPIENGRRFALAAY
Ga0193723_108981613300019879SoilMLVAGQFAANKLAATETSAQSEPRWIQIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDSDHFYEQYEVKEIWVLNIPDKKKVGMVGDKGGYVRPGSHRTLSVAWGP
Ga0193713_104144313300019882SoilMTSPQRNSIVLALRLSYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEARWIEIGSEKAVIARDSKSADGRNILAWTVDSNEPVDWSLLEKDPDRFYERYDVKAIWVINLADKKKVGAVGD
Ga0193725_104844423300019883SoilMTVRAYTFGIVCMLIAARFGNDRLIAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWLLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYRWKWG
Ga0193747_106233423300019885SoilMTLRAYTFGIACMLIGARFGNDRPAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGG
Ga0193727_104865313300019886SoilMLIAGRFAGDRLAAAAEPSAQPEPRWIEIGSEKVVIARDSKSADGRNALGWTVDSNQPVDWLLLEKDPNRFYEQYDLKAIWVINLPDKKKVGTVGDRGGYVRPGSHRTLSVAWG
Ga0193729_115231313300019887SoilMTLRVYIFGIAWMLIAARFGNDRLAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYIRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLLL
Ga0193731_108897913300020001SoilMKPCAYTLAVGYMLIAVRLAGDGLIAAEPSIQPEQRWIEIGSEKAVIVRDSKSADGRNALAWTVDSTEPVDWSLLEKDPNRFYEQYDVKAIWVINLTDKKKVGAVGDTGGYVRPGSHQTLSVAWGPIENGRRFALAAYQWKWGTDTLL
Ga0193730_109815613300020002SoilMTSPQRNSIVLALRLSYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEPRWIEIGSEKAVIARDSKSADGRNILAWTVDSNEPVDWSLLEKDPDRFYERYDVKAIWVINLADKKKVGAVGDTGGYVRPGSHRTL
Ga0193735_104676813300020006SoilMTRRAYIFGIACMLIAARFGNDTLTAAAESSVQPDPPWIEIGPEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEIKAIWVINLADKKKVGAIGDTGGYIRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLL
Ga0193732_107062713300020012SoilMTLRACIFGIACILIAARFGNDALTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKRVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQWKW
Ga0193721_101706623300020018SoilMKRRTYVLGICCVLIAARFAGDRPLAAEPSAQPEPRWIEIGSEKLVIARDSKSADGRNALAWTIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTGGYVRPGSHRTLSIAWGPLEN
Ga0193721_114802813300020018SoilMTLRACIFGIACILIAARFGNDALTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTIDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKRVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQWKW
Ga0193726_119979823300020021SoilMTSPQRNSIVLALRLSYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEPRWIEIGSEKAVIARDSKSADGRNILAWTVDSNEPVDWSLLEKDPDRFYERYDVKAIWVINLADKKKVGAVGDTGGYVRPGSHRTLS
Ga0193724_100425513300020062SoilMTSPQRNSIVLALRLSYHEKAMKPCAYILGIGCMLVSARFGGDRLIAAELSAQPEPRWIEIGSEKAVIARDSKSADGRNILAWTVDSNEPVDWSLLEKDPDRFYERYDVKAIWVINLADKKKVGAVGDTGG
Ga0210382_1053446513300021080Groundwater SedimentMTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPTRFYEQYEVKAIWVINLADKKKVGAVGDEGGYVRPGSHRTLSVAWGPIENGRRF
Ga0193719_1047770513300021344SoilMTLRVYIFGIAWVLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEIKAIWVINLADKKKVGAIGDTGGYI
Ga0193709_101318723300021411SoilMTVRAYTFGIVCMLIAARFGNDRLIAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWLLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYRWKWGT
Ga0224452_114492723300022534Groundwater SedimentMTPRAYIFCVACVLMAARFGNDRLAAAAEPSVQPEPPWIEIGSEKAVIARDSKSADGRNALAWIVDSNESVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRT
Ga0224452_120497413300022534Groundwater SedimentMLVAGQFAANKLAAAETSAPSAPRWIEIDSEKVVIARDSKSADGRNALAWTVDSSESIDWSLLEKDADRFYEQYDLKAIWVLNLADKKKVGTIGDKGGYVRPGSHRTLSVAWGPVENGRRFALAAYQWK
Ga0222622_1008641013300022756Groundwater SedimentMNPYAYIFGIGCMLIAARFGGDRLTAAEPSAQPEPRWIEIGSEKVIIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADRFYEQYDLKAIWVINLADKKKVGTVGDRGGYVRPGSHRTLSVAWGPVENGRRF
Ga0222622_1042162423300022756Groundwater SedimentMKRSSYILGIGCTLIAACFGGDKLLAAEPSPQPEPHWIEIGSERAVIARDSKSADGRNALAWTIDSSEPVDWSLLEKDPNRFYEQYDVKAIWVISLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPVENGRRFALA
Ga0193714_102897613300023058SoilMTLRACIFGIACILIAARFGNDALTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTIDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKRVGSVGDTGGYVRPGSHQTLSIAWGPIENGRRFALAAYQWK
Ga0247799_103404913300023072SoilMTLRAYIFGIGCMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAAIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIEDTGGY
Ga0207681_1003924733300025923Switchgrass RhizosphereMKLPAYIFGLGCVLVGGQFAGNKLVAAETSTSSEPRWIEIGLEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLGDKKKVGAIGDTGAYVRPGSHRTLSVAWGPVENGRRFALAAYQWKWGTDTLLLLD
Ga0209055_101788913300026309SoilVKPRTYLLGIACVLVAAEFVCDKLISAESSPPGEPRWIEIGSERAVIVRDSKSADDRNALAWTIDSSEPIDWTLLEKDVEHFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSHRTLSVAWGPLSDNGRR
Ga0209686_110893123300026315SoilVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEQDVERFYEQYDVKEIWVVNLSDKKKIGAVADKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQ
Ga0209470_124896823300026324SoilMTLRAYIFGIACMLIAARFGNDRLTAAEPLVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLIDKKKVGTIGDTGGYVRPGSHRTLSVAWGPIENGRRFALAAYQWKWGTDTLLLLDVGQ
Ga0209375_104151513300026329SoilMTLRAYILGIACVLIGARFGNDRLAAAAEPSAQPESRWIEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLVDKKKVGTIGDTGG
Ga0257156_104260813300026498SoilVLVAAEFVCDKLISAESSPPAEPRWIEIGSERAVIVRDSKSADGRNALAWTIDSSEPIDWPLLEKDVERFYEQYDVKEIWVVNLSDKKKIGTVADKGGYVRPGSHRTLSVAWGPLSDNGRRFALAAYQWKWGT
Ga0209058_112727923300026536SoilMTLRAYTFGIACMLIAARFGNHRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTIDSSEPVDWSLLEKDPNRFYEQYEVKAIWVINLTDQKKVGTVGDTGGYVRPGSHRTLSVAWGPIEN
Ga0209376_105661513300026540SoilMTLRAYTFGIACMLIAARFGNDRLAAATEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYDVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTLSVAWGPIEN
Ga0209156_1005890713300026547SoilMLIAARFGGNRLNAAEPSVQPEPRWVEIGSEKVVIARDSKSADGRNALAWTVDSSEPIDWSLLEKDADHFYEQYDVKEIWVVNIPDKKKVGMVGDKGGYVRPGSHR
Ga0209156_1046793123300026547SoilMKPCGYTLAIGYMLIAARLADDRLIAAEPPVQPEPRWIEIGSEKLIIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYVRPGSHRTL
Ga0209648_1021053013300026551Grasslands SoilMTPRAYILGIGCTLIAGGFASDQLIAAEPSPQREPGWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDPNRFYEQYDVKEIWVINVPDKKK
Ga0209388_119099213300027655Vadose Zone SoilMKPCAYVLGIGCMLIAGEFASDQLIAAEPSPQPGPRWIEIGSEKAVIARDSKSADGRNALAWTVDSSEPVDWSLLEKDADHFYEQYELKEIWVINILEKKKLGAVADKGGYVRPGSHRTVSVAWGPIENGRRF
Ga0307282_1030760713300028784SoilMTRGAYIFGIACMLITARFGNDTLTAAAESSVQPEPLWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPTRFYEQYDVKAIWVINLSDKKKVGS
Ga0307287_1008493813300028796SoilMTRGAYIFGIACMLIAARFGNDTLTAAAESSVQPEPPWIEIGPEKAVIARDSKSADGRNALAWTVDSNEPVDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVG
Ga0307312_1076674823300028828SoilMTPRAYIFGIACMLIAARFGNDTLTAAVESSVQPEPPWIEIGPEKAVVARDSKSADGRNALAWTVDSSESIDWSLLKKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYV
Ga0307308_1023349423300028884SoilMTLRVYIFGIAWMLIAARFGNDRLAAAAEPSVQPDRPWVEIGSEKAVIARDSKSADGRNALAWTVDSSESIDWSLLEKDPNRFYEQYEVKAIWVINLADKKKVGAIGDTGGYIRPGSHRTLSVAWGPIENGRRFALAAYQWK
Ga0306918_1080322523300031744SoilMKSSPCALGFGCMLIAGQFASDRLVAEEPSPQPEPRWIEIGSEKAVIARDSKSADGLNTLAWTVDGSEPIDWPLLEKDANHFYEQYDVKEIWVVNLPDKKKIGTVSDKGGYV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.