NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F048877

Metagenome / Metatranscriptome Family F048877

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F048877
Family Type Metagenome / Metatranscriptome
Number of Sequences 147
Average Sequence Length 141 residues
Representative Sequence MEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA
Number of Associated Samples 119
Number of Associated Scaffolds 147

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.70 %
% of genes near scaffold ends (potentially truncated) 41.50 %
% of genes from short scaffolds (< 2000 bps) 71.43 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score0.85

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (95.918 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(25.170 % of family members)
Environment Ontology (ENVO) Unclassified
(31.293 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(50.340 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176.178.180.182.184.186.188.190.192.194.196.198.200.202.204.206.208.210.212.214
1INPhiseqgaiiFebDRAFT_1005847321
2JGI12053J15887_104899051
3JGI25382J37095_100388562
4JGI25382J37095_101635231
5JGI25382J43887_104470041
6JGI25405J52794_101539911
7Ga0062593_1000235851
8Ga0062589_1000688162
9Ga0063356_1054293331
10Ga0062592_1000558601
11Ga0066674_100229061
12Ga0066672_107405841
13Ga0066673_102695131
14Ga0066679_104150962
15Ga0066676_104919201
16Ga0066675_112428151
17Ga0065705_107275732
18Ga0070690_1016696691
19Ga0066388_1006408333
20Ga0066388_1068721051
21Ga0066388_1077118512
22Ga0070689_1007706452
23Ga0070703_104295741
24Ga0070708_1000230433
25Ga0070708_1001469813
26Ga0066686_102686082
27Ga0070681_112556092
28Ga0070707_1000155874
29Ga0070679_1019554571
30Ga0066701_107715091
31Ga0066661_101587552
32Ga0066702_100726462
33Ga0068857_1024185482
34Ga0066706_107197281
35Ga0066905_1000216872
36Ga0066903_1025370282
37Ga0066903_1064652582
38Ga0068858_1019708092
39Ga0068862_1013623761
40Ga0066658_107483961
41Ga0066665_103900301
42Ga0066660_104509982
43Ga0075428_1000449901
44Ga0075428_1006695022
45Ga0075431_1011751851
46Ga0075433_100620614
47Ga0075433_118782051
48Ga0075420_1000237995
49Ga0075425_1000332933
50Ga0075434_1001234062
51Ga0075429_1017854601
52Ga0075426_100172423
53Ga0075436_1000431603
54Ga0075436_1006634682
55Ga0099791_101074861
56Ga0066710_1000875724
57Ga0066710_1001624311
58Ga0066710_1004238712
59Ga0066710_1004910873
60Ga0066710_1016714971
61Ga0099827_106580131
62Ga0066709_1001447014
63Ga0066709_1017212192
64Ga0114129_102061365
65Ga0105241_122663741
66Ga0105056_10400371
67Ga0126380_102661142
68Ga0126321_11828012
69Ga0134082_105554881
70Ga0134088_106851041
71Ga0126376_106207381
72Ga0126376_107060881
73Ga0126377_101775502
74Ga0126379_127672511
75Ga0134124_100408125
76Ga0134123_101102681
77Ga0134123_117856151
78Ga0137388_108355472
79Ga0137364_104530982
80Ga0137383_104444382
81Ga0137383_108424951
82Ga0137399_101457712
83Ga0137399_109217582
84Ga0137380_100366466
85Ga0137380_111690931
86Ga0137377_114911782
87Ga0137387_105656671
88Ga0137387_111405401
89Ga0137386_108574351
90Ga0137386_109337851
91Ga0137386_112156142
92Ga0137369_110790922
93Ga0137375_100709221
94Ga0137360_109986511
95Ga0137390_115981011
96Ga0137397_103692291
97Ga0137397_105937912
98Ga0157282_103133261
99Ga0137396_103913552
100Ga0137394_103257532
101Ga0137394_109063632
102Ga0137419_102341461
103Ga0137419_106201822
104Ga0137416_117676091
105Ga0137404_114886512
106Ga0137407_112021982
107Ga0137407_115989312
108Ga0157380_123993422
109Ga0157377_109468711
110Ga0137418_1000468815
111Ga0137418_101183791
112Ga0182038_108374582
113Ga0184609_101427631
114Ga0184609_101883381
115Ga0184612_106221301
116Ga0066667_100357052
117Ga0137408_12287211
118Ga0193725_10539931
119Ga0210378_102707951
120Ga0207653_104504832
121Ga0207684_100077335
122Ga0207707_101120802
123Ga0207646_1000590017
124Ga0207670_107554871
125Ga0207667_121316971
126Ga0207674_110924501
127Ga0209438_10134401
128Ga0209234_10102833
129Ga0209237_10663341
130Ga0209236_12882091
131Ga0209055_12157191
132Ga0209471_10269964
133Ga0209470_12044252
134Ga0209377_11504772
135Ga0209157_12108292
136Ga0209056_103874812
137Ga0209648_100970072
138Ga0209689_10244062
139Ga0209590_102686482
140Ga0209590_106790701
141Ga0137415_100361981
142Ga0247823_110625541
143Ga0247822_109590482
144Ga0308189_103744032
145Ga0308187_104271541
146Ga0310888_101323371
147Ga0310890_107142042
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 70.83%    β-sheet: 0.00%    Coil/Unstructured: 29.17%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

20406080100120140MEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAAExtracel.Cytopl.Extracel.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.85
Powered by PDBe Molstar

Structural matches with SCOPe domains



 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
95.9%4.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Switchgrass Rhizosphere
Soil
Soil
Grasslands Soil
Soil
Soil
Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Groundwater Sand
Corn Rhizosphere
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Populus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
25.2%3.4%14.3%10.9%4.1%4.8%8.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10058473213300000364SoilMNDHSMRRWIWASVIVQFLGYVVDAVWHGLVNPGVEPTTVGEMTRHLGTVHLPLYIGAASMLASTSGALLCQMRRSSTGITPPIAFAGAVLSAGAEAWHAYSHLRLDTHSAPVAGTLSVIGFVVVVIGMSVSSWERRRRAARTTRAPRAA*
JGI12053J15887_1048990513300001661Forest SoilMWEDRSLSKWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVXAVLSASAEAWHAYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
JGI25382J37095_1003885623300002562Grasslands SoilMQDQSARTWVWASLILQFLGYVLDAVWHGLLNPGVEPTTVEEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSAPGIALPIAFAAAVLSAGAEASHAYSHLRLDTHSAPVAGTLSLIGFLVVVLAMALSSGGRRRRAGDTTKE*
JGI25382J37095_1016352313300002562Grasslands SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
JGI25382J43887_1044700413300002908Grasslands SoilARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
JGI25405J52794_1015399113300003911Tabebuia Heterophylla RhizosphereMEDRSARRWIWVSLILQALGYVFDAIWHGLLHPGVEPTTVGDMVRHLSTVHLPLYLGAVSVLVSTSRALLRQIRRAAIGLALPIAVAGAVLSTAAEAWHASAHLRLDTHSAPMAGSLSVIGFLVVV
Ga0062593_10002358513300004114SoilMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0062589_10006881623300004156SoilMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEAWHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0063356_10542933313300004463Arabidopsis Thaliana RhizosphereMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRR
Ga0062592_10005586013300004480SoilRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0066674_1002290613300005166SoilVLDAVWHGLLNPGIEPTTTGQMITHLSTVHLPLYIGAASVVVSTSKALLRQISRFAPGITLPIAFAGAVLSAAAETWHAYSHLRLETQSAPVAGSLSFIGFLVVVIAISRSGWARRHRATDTPNERRDT*
Ga0066672_1074058413300005167SoilWHGLLNPGTEPTTTGQMISHLSTVHLPLYIGAASVVVSTSKALLRQIRRFAPGITLPIAFAGAVLSAAAEAWHAYSHLRLETESAPVAGSLSFIGFLVVVIAISRSGWARRHRATDTPNERRDT*
Ga0066673_1026951313300005175SoilMMEDRSARTWIWASLILQFFGYVFDAVWHGLLSPGVEPTTVGEMVRHLGTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDE
Ga0066679_1041509623300005176SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDER
Ga0066676_1049192013300005186SoilRALTVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL*
Ga0066675_1124281513300005187SoilMEDRSARRWIWVSLVLQALGYVYDAIWHGLLRPGVEPPTVGDMVRHLGTVHLALYLGAASVLVSTSRALLRQIKRAAIGRALPMPIAVAGAVLSTAAEAWHAYAHLHLDTHSAPLAGILSVIGFLVVVITMSLSRGDRR
Ga0065705_1072757323300005294Switchgrass RhizosphereMEDRSARRWIWVSLVLQALGYVFDAIWHGLLRPGVEPTTVGDMVRHLSTVHLPLYLGAVSVLVSTSRALLRQIRRAAIGLALPIAVAGAVLSTAAEAWHASAHLRLDTHSAPMAGTLSVIGFLVVVITMSLS
Ga0070690_10166966913300005330Switchgrass RhizosphereMGDPSARTWIWTSIILQFFGYAFDAVWHGLLKPGVEPTTVGEMLRHLGTVHLPLYIGAGSVLVSTFIALVRQARRSTIDRALAVAVAGAVLSASAEAWHAYSHLRLDTHSAPVAGILSVIGFLVVVIAMSRSRRRRGRRVAKHDERHAASAR*
Ga0066388_10064083333300005332Tropical Forest SoilMGVDDARESARRWVWTSLVLQFFGYLFEAVWHALMRPGAEPTTAPEIVRHLATVHLLLYLGAAGILVSTSCVLLHRIRSSATGVAWPIALVGAALSAAAEAWHAYSHLRLDTHSAPIAGVMSVVGFLVVAAATSLSSGHDGRRTTDSVSDRRAA*
Ga0066388_10687210513300005332Tropical Forest SoilMADRSARRWVWASLILQALGYVFDAIWHGLLHPGVEPTTMSEMVHHLGMVHLPLYIGAASVLVSTSRALLRQIRRSATGLTLPIAVAGAVLTTAAEAWHAYAHLRLDTHSAPMAGTL
Ga0066388_10771185123300005332Tropical Forest SoilMEERSARAWIWASLLLQLLGYVYDAAWHGLLEPGVEPQTVAEMAWHLGTVHLPLYVGAASVLVSTSSALLRRVGPAPFGVALPVAVAGAWLSAGAEAWHAVSHLRLDTHTAPIAGTLS
Ga0070689_10077064523300005340Switchgrass RhizosphereLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0070703_1042957413300005406Corn, Switchgrass And Miscanthus RhizosphereGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0070708_10002304333300005445Corn, Switchgrass And Miscanthus RhizosphereMDRLAEDRAARRWIWIALTVQCVGYAFDVVWHGLLNPGVEPKTVDEMLRHLGTVHPPLYLGAASVLVATANALVRQIRRSTAGGALPIAFAGAVLSTAAEAWHASFHLRLDTHSAPIAGILSVVGFLVVVIALSLSSGRWRYSRPS*
Ga0070708_10014698133300005445Corn, Switchgrass And Miscanthus RhizosphereMQDRLAQTWIWASLTLQFLGYVLDATWHGLLNRGVEPHTVGEMARHLSTVHLPLYLGAVSVLIATSRVLLRQVKRSATGIALPIAFGGAVLSVAAEAWHAYSHLRLDTHSAPVAGTLSSPPARGCS*
Ga0066686_1026860823300005446SoilVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL*
Ga0070681_1125560923300005458Corn RhizosphereMNDRSARAWIWASLLLQGVGYVYDAVWHGWLNPGLEPQTTDAMARHLGSVHLPLYIGALSVLVATFTVLVDQARRSSTGTALPIAFAGAVLSAGAEAWHAASHMKLDTHSAPIAGVLSVVGFFVAVVATYCWGRRSA
Ga0070707_10001558743300005468Corn, Switchgrass And Miscanthus RhizosphereVQIVGYIIDAFWHGLLRPGVEPTTLSDMARHLKTVHLVLYVGAAGVLVTTAVALLRQIRRSAATLALGIAFGGAMLSTAAEAWHAYSHLTLDTSHAPIAGLLSGVGFVVVVAAMALSGWRRRRGL*
Ga0070679_10195545713300005530Corn RhizosphereVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0066701_1077150913300005552SoilSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
Ga0066661_1015875523300005554SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA*
Ga0066702_1007264623300005575SoilVAGRLPARTWIWASLILKFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA*
Ga0068857_10241854823300005577Corn RhizosphereSLLLQGVGYVYDAVWHGWLNPGLEPQTTEAMARHLGSVHPPLYFGALSVLVATFTVLVDQARRSGAGIELPIAFAGAVLSAGAEAWHAASHMRLDTHSAPIAGILSVVGFFVVLVATYCWGRSSARRDAAATSERHRAA*
Ga0066706_1071972813300005598SoilMQDVSARTWVWASVSLQFLGYVLDAVWHGLLNPGVEPTTVEEMVRHLGTVHLPLYIGAASVPVSTSRALLRQIRRSAPGIALPIAFAGAVLSAGAEAWHAYSHLRLDMHSAPVAGTLSLLGFLVVVIAMALSSGARRRRTGDTPNEQRVA*
Ga0066905_10002168723300005713Tropical Forest SoilMGASARTWIWISLVVQFLGYLIDVVWHGLLRPGVEPTTVGEMVRHLITVHLPLYIGALSLVVSTSRALLEQITRSRPGLALPIAFAGAVVSVTAEAWHAYSHLRLDTHSAPVAGTLSAIGFLVVVVAMSLSARTRRRHVADDTRGQGAA*
Ga0066903_10253702823300005764Tropical Forest SoilMEDRSARTWVWAALLLQALGYGCDALWHGLLNPGVEPTTRSAMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGIALPIAVAGAMLSTAAEAWHAYSHLRLDTHSAPIAGVLSVIGFFVVVIAMALSRGGRRRAADTTNARDAV*
Ga0066903_10646525823300005764Tropical Forest SoilMQGWSARKWVWASLMLQFFGYLFDAGWHALMRPGAEPTTVPEMVRHLATVHLPLYIGAASVLGSTSWAFLHRIRGSAPSIALPIAVVGAALSAAAEGWHAYSHLRLDTHSAPIAGV
Ga0068858_10197080923300005842Switchgrass RhizosphereWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0068862_10136237613300005844Switchgrass RhizosphereMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPV
Ga0066658_1074839613300006794SoilVRLAVVDPLTKRHSDAYIKVIMQDQSARTWIWASLILQFLGYVLDAVWHGLLNPGVEPRTVEEMARHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGIALPIAFAGAVLSAGAEAWHAYSHLRLDTHSAPVAGTLSLIGFLVVVIAMSLSNRARRR
Ga0066665_1039003013300006796SoilVLDAVWHGLLNPGIEPTTTGQMITHLSTVHLPLYIGAASVVVSTSKALLRQIRRFAPAITLPVAFAGAVLSAAAEAWHAYSHLRLETESAPVAGSLSFIGFLVVVIAISRSGWARRHRATDTPNERRDT*
Ga0066660_1045099823300006800SoilGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA*
Ga0075428_10004499013300006844Populus RhizosphereMHSDGTVKIEPERARTWVAGSLVLQFLGYVYDALWHGVLHPGNEPTTRAEMVRHLSTVHLPLYMGAACVLISTGLALIGQIRRSATGVALPIAFAGAVLSAASEAWHASSHLELDTHNAPTAGVLSVIGFLVVVVTMSLARGIGGPRSA*
Ga0075428_10066950223300006844Populus RhizosphereMKDRSARAWIWTSICLQLAGYVLDAAWHGLLNPGKEPQTLGQMIQHLATVHFLLYIGAASVLVSTSGALVRRIGRSLIGIALPVAVVGAWLSAGAEGWHAYSHLRLDTHSAPIAGILSFLGFLVVVLAMASSRWARRHRPAATSSRPRVASPAAGDIRRSSPRS*
Ga0075431_10117518513300006847Populus RhizosphereMAEARSLRTWVWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYFGAASVFVSTGLALLRHVRRSTHGGGDLLLAFVGAVLSAGAEAWHAYSHLRLDTHHAPVAGTLSAVGFLVVVATMWRSGRG*
Ga0075433_1006206143300006852Populus RhizosphereMTTHNAQVDLGPVIVQFLGYVVDAVWHGLVNPGVEPTTVGEIARHLGTVHLPLYITGITLRIAFAGAVLSAGAEAWHAYSHLRLDTHNAPLAGTLSVLGFIVIVIGMSVSSWERRRRAARTTEQRRAA*
Ga0075433_1187820513300006852Populus RhizosphereEVIMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTVGDMVRHLGTVHLLLYLGAACVLVSTSRALLRQIRRAATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGILSIIGFLVVVITMSLSRGDRRRATDPTNAGGAASS*
Ga0075420_10002379953300006853Populus RhizosphereMQSDGTVKIEPERARTWVAGSLVLQFLGYVYDALWHGVLHPGNEPTTRAEMVRHLSTVHLPLYMGAACVLISTGLALIGQIRRSATGLALPIAFAGAVLSAASEAWHASSHLELDTHNAPTAGVLSVIGFLVVVVTMSLARGIGGPRSA*
Ga0075425_10003329333300006854Populus RhizosphereMQGRSARTWIWASLILQFLGYVLDATWHGLLNPGVEPQTPSEMARHLSTVHLPLYIGAASVLVSTARALLRQVRRSATGIGLPIAFAGAVLSATAEAWHAYSHLRLDTHSAPVAGILSVIGFLVVVIAMSLSSWFHRGRAEETADERRAA*
Ga0075434_10012340623300006871Populus RhizosphereMTTHNAQVDLGPVIVQFLGYVVDAVWHGLVNPGVEPTTVGEIARHLGTVHLPLYIRTGITLRIAFAGAVLSAGAEAWHAYSHLRLDTHNAPLAGTLSVLGFIVIVIGMSVSSWERRRRAARTTEQRRAA*
Ga0075429_10178546013300006880Populus RhizosphereARGDERRDPSLAGHMHSDGTVKIEPERARTWVAGSLVLQFLGYVYDALWHGVLHPGNEPTTRAEMVRHLSTVHLPLYMGAACVLISTGLALIGQIRRSATGVALPIAFAGAVLSAASEAWHASSHLELDTHNAPTAGVLSVIGFLVVVVTMSLARGIGGPRSA*
Ga0075426_1001724233300006903Populus RhizosphereMTTHNAQVDLGPVIVQFLGYVVDAVWHGLVNPGVEPTTVGEIARHLGTVHLPLYIRTGITLRIAFAAAVLSAGAEAWHAYSHLRLDTHNAPLAGTLSVLGFIVIVIGMSVSSWERRRRAARTTEQRRAA*
Ga0075436_10004316033300006914Populus RhizosphereMTTHNAQVDLGPVIVQFLGYVVDAVWHGLVNPGVEPTTVGEIARHLGTVHLPLYITGITLRIAFAAAVLSAGAEAWHAYSHLRLDTHNAPLAGTLSVLGFIVIVIGMSVSSWERRRRAARTTEQRRAA*
Ga0075436_10066346823300006914Populus RhizosphereMQGRSARTWIWASLILQFLGYVLDATWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVSTARALLRQVRRSATGIGLPIAFAGAVLSATAEAWHAYSHLRLDTHLAPIAGALSVIGFFVVVIAMAMSSGRWRRRTVDTTNERHAA*
Ga0099791_1010748613300007255Vadose Zone SoilMEARSARRRVWVALILQALGYVFDAIWHGLLHPGVEPTTMSAMVRHLGTVHLPLYIGAASVIVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATDTPNALHGIIRFVLQANC*
Ga0066710_10008757243300009012Grasslands SoilMQDVSARTWVWASVSLQFLGYVLDAVWHGLLNPGVEPTTVEEMVRHLGTVHLPLYIGAASVPVSTSRALLRQIRRSAPGIALPIAFAGAVLSAGAEAWHAYSHLRLDMHSAPVAGTLSLLGFLVVVIAMALSSGARRRRTGDTPNEQRVA
Ga0066710_10016243113300009012Grasslands SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA
Ga0066710_10042387123300009012Grasslands SoilMQDRGARTWIWASLILQFLGYVLDVTWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA
Ga0066710_10049108733300009012Grasslands SoilVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGILISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL
Ga0066710_10167149713300009012Grasslands SoilQALGYVFDAIWHGLLRPGVEPTTVGDMVRHLGTVHLALYLGAASVLVSTSRALLRHIRRAASGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGILSVIGFLVVVITMSLSRGDRRRATETTNAGGAA
Ga0099827_1065801313300009090Vadose Zone SoilMEDRSVRTWIWASLILQTLGYGFDAVWHGLLNPGVEPTTMREMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSETGIALPIAVAGAMLSTAAEVWHAYSHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGDRRRTAETTNARDAG*
Ga0066709_10014470143300009137Grasslands SoilLTVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPSTTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL*
Ga0066709_10172121923300009137Grasslands SoilMEDRSARRWIWVSLVLQALGYVFDAIWHGLLRPGVELATVGDMVRHLGTVHLPLYLGAASVLVSTSRALWRQIRWSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGILSVIGFLVVVITMSLSRGDRRRATETTNAGGAA*
Ga0114129_1020613653300009147Populus RhizosphereMHSDGTVKIEPERARTWVAGSLVLQFLGYVYDALWHGVFHPGNEPTTRAEMVRHLSTVHLPLYMGAACVLISTGLALIGQIRRSATGVALPIAFAGAVLSAVSEAWHASSHLELETHNAPTAGVLSVIGFLVVVVTMSLARGIGGPRSA*
Ga0105241_1226637413300009174Corn RhizosphereMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQR
Ga0105056_104003713300009801Groundwater SandMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVPLPLYIGAASVLVSTSRALLRHIRRSAPGFALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATDTTNAGAPHNRSVRPSMLEAEV*
Ga0126380_1026611423300010043Tropical Forest SoilMEDRSARRWIWVSLILQALGYVFDAIWHGLLHPGVEPTTVGDMVRHLSTVHLPLYLGAVSVLVSTSRALLRQIRRATIGLTLPIAVAGAVLSTAAEAWHASAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRAAATTNAEGAT*
Ga0126321_118280123300010145SoilMEDRSARRWVWGSLILQALGYVFDAIWHGLLHPGVEPTTVEDMLRHLGTVHLPLYLGATSVLVSTSRALLRQIRRSAISLALPIAVAGAVLSTAAEAWHAHAHLRLDTHSAPMAGTLSIIGFLVVVIMMSLSRGDRRRVAATTNVEGTA*
Ga0134082_1055548813300010303Grasslands SoilVSLILQALGYVFDAIWHGLLHPGVEPTTMSDIVRHLGTVHLPLYIGAASVLVSTSRALWRQIRWSATGLALPIAVAGAVLSTAAEVWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATETTNAEGAA*
Ga0134088_1068510413300010304Grasslands SoilIVQFLGYLIDVAWHGLLSPGVEPATTDDSMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL*
Ga0126376_1062073813300010359Tropical Forest SoilMEDRSARRWVWASLILQALGYGFDAGWHGLLHPGVEPMTVGDMVRHLGTVHLPLYIGAMSVLVSTSRALLHQIRRSATDLALPIAVTGAVLSTAAEAWHAYAHLRLDTHSAPMAGALSAIGFLVVIIAMALSRGGQRRAAAQDRP*
Ga0126376_1070608813300010359Tropical Forest SoilMMEDRSARRWIWVSLILQALGYVFDAVWHGLLRPGVEPTTVGDMVRHLSTVHLPLYLGTVSVLVSTSRALLRQIKRAAIGLALPVAVAGAVLSTAAEAWHAYAHLRLDTHSAPLAGTLSVIGFLVVVITMSLSREAQRRAATTTNAEGTA*
Ga0126377_1017755023300010362Tropical Forest SoilMGASARTWIWISLVVQFLGYLIDLMWHGLLRPGVEPTTVGEMVRHLITVHLPLYIGALSLVVSTSRALLEQIIRSRPGIALPIAFGGAMVSVTAEAWHAYSHLRLDTHSAPVAGALSAIGFLVVVFAMSLSAGTRRRRVADDTRGQRAA*
Ga0126379_1276725113300010366Tropical Forest SoilIAGEAIMEDRSARRWVWVSLILQAFGYVFDAIWHGLLHPGVEPKTMGDMVRHLGTVHLPLYLGAASVLVSTSRALLRQIRRAAIGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGILSVIGFLVVVITMSLSRGDRRRATETTNAGGAA*
Ga0134124_1004081253300010397Terrestrial SoilLSPLEVTMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0134123_1011026813300010403Terrestrial SoilMNDRSARAWIWASLLLQGVGYVYDAVWHGWLNPGLEPQTTEAMARHLGSVHLPLYFGALSVLVATFTVLVDQARRSSAGIALPIAFAGAVLSAGAEAWHAASHMRLDTHSAPSAGILSVVGFFVVVVATYCWGRSGARHDAEATS
Ga0134123_1178561513300010403Terrestrial SoilTWIWTSIILQFFGYAFDAVWHGLLKPGVEPTTVGEMLRHLGTVHLPLYIGAGSVLVSTFIALVRQARRSTIDRALAVAVAGAVLSASAEAWHAYSHLRLDTHSAPVAGILSVIGFLVVVIAMSRSRRRRGRRVAKHDERHAASAR*
Ga0137388_1083554723300012189Vadose Zone SoilMGRLAEDRGALRWIWIALGVQCVGYVLDVVWHGLLNPGVEPKTVDEMLRHLGTVHLPLYLGAASVLVATANALVRRIRRSTAGVALPIAFAGAVLSTAAEAWHASFHLRLDTHSAPIAGILSVVGFLVVVIALSLSSGRWRYSRPS*
Ga0137364_1045309823300012198Vadose Zone SoilGRRSSCSFSGDVLDAVWHGLLNPGIEPTTTGQMITHLSTVHLPLYIGAASVVVSTSKALLRQISRFAPGITLPIAFAGAVLSAAAETWHAYSHLRLETQSAPVAGSLSFIGFLVVVIAISRSGWARRHRATDTPNERRDT*
Ga0137383_1044443823300012199Vadose Zone SoilMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVHLPLYIGAASVLVSTSRALLRRIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRTTDPTNAGSAA*
Ga0137383_1084249513300012199Vadose Zone SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRTATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVV
Ga0137399_1014577123300012203Vadose Zone SoilMQDRSARRWIWASLILQFLGYVLDATWHGLLNPGVEPKTTGEMARHLSTVHLPLYIGAASVLVATSRALLRWVKRSAAGIALPIAFAGAVLSATAEAWHAYSHLRLDTHSAPVAGTLSVVGFLVVVIAMLLSSVRRRRAADTTNA*
Ga0137399_1092175823300012203Vadose Zone SoilITHEVTMEDQSIRRWIWASISLQAVGYLVDIVWHGLLRPGVEPSTVGDMARHLGTVHLPLYIGSVSVLVSVSGALLRQIRRAAPGVALPMAFGGAALSAGAEAWHASSHLHLDTRHAPIAGTMSGVGFHAVVVAIALSSWARRERRAT*
Ga0137380_1003664663300012206Vadose Zone SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
Ga0137380_1116909313300012206Vadose Zone SoilMWEDRSPSKWIWASIILQAVGYAIDVVWHGLLNPGVEAATVRDMARHLGMVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137377_1149117823300012211Vadose Zone SoilMEDRSARTWIWTSLVLQFLGYVVDAVWHGLLSPGVEPTTVGEMARHLRTVHLPLYIGATSVLVSTSRALLRQTRRSVIGIAMPVAFAGAVLSAGAEAWHAYSHLRLDTHSAPIAGTLSVIGFFMVVIAMSLSRGRGRRGTVDATNE
Ga0137387_1056566713300012349Vadose Zone SoilMGDRLARRWVWASLILQALGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
Ga0137387_1114054013300012349Vadose Zone SoilERGWAYEVSRMWEDRSLSKWIWASIILQVVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLHHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137386_1085743513300012351Vadose Zone SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMVLSRGGQRRAA*
Ga0137386_1093378513300012351Vadose Zone SoilMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSDMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSAPGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGILSVIGFLVVVITMSLSRGDRRRATDTTNAGGAASS*
Ga0137386_1121561423300012351Vadose Zone SoilVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLVSTGLALLRHVRRSTHGGGALLLAFVGAVSSASAEAWHAYSHLRLDTHHAPVAGTLSAVGFLVVAIAMWRSGRG*
Ga0137369_1107909223300012355Vadose Zone SoilMEDRPARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSDMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGIL
Ga0137375_1007092213300012360Vadose Zone SoilMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSDMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATDTTNAGGAASS*
Ga0137360_1099865113300012361Vadose Zone SoilACIAGEVIMEERSAQRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTISDMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATDTPNALHGIIRFVLQANC*
Ga0137390_1159810113300012363Vadose Zone SoilMEDRSVRTWIWASLLLQTLGYGFDAVWHGLLNPGVEPTTMRDMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA*
Ga0137397_1036922913300012685Vadose Zone SoilMWEDRSLSKWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137397_1059379123300012685Vadose Zone SoilMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSDMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATETTNAGGAT*
Ga0157282_1031332613300012904SoilLSPLEVTMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVV
Ga0137396_1039135523300012918Vadose Zone SoilMQDRSARRWIWASLILQFLGYVLDATWHGLLNPGVEPKTTGEMARHLSTVHLPLYIGAASVLVATSRALLRWVKRSAAGIALPIAFAGAVLSATAEAWHAYSHLRLDTHSAPVTGTLSVVGFLVVVIAMLLSSVRRRRAADTTNA*
Ga0137394_1032575323300012922Vadose Zone SoilMQDRSARTWIWASLILQLLGYVVDATWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVSTSRALLRQVRRSASGIALPIAFAGAVLSATAEAWHAYSHLRLDTHSAPVAGILSVIGFLVVVIAMSLSSWFHRGCAEETTDERRAA*
Ga0137394_1090636323300012922Vadose Zone SoilMWEDRSLSKWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTGLALLRHVRHWAHGAAHLLLAFVGAVLSASAEAWHAYSHLHLDTRHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137419_1023414613300012925Vadose Zone SoilMMEDRSARTWIWASLILQFFGYVFDAVWHGLLSPGVEPTTVGEMVRHLGTVHLPLYIGAASVLVSTSRALLRQARRSAIGIAMTVAFAGAVLSAAAETWHAYSHLRLDTHTAPIAGGLSVVGFLMVVTAMSLRRA
Ga0137419_1062018223300012925Vadose Zone SoilMWEDRSLSKWIWASIVLQALGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137416_1176760913300012927Vadose Zone SoilIILQALGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137404_1148865123300012929Vadose Zone SoilAACIAVEVIMEDRSARRWVWVALMLQALGYVFDAIWHGLLRPGVEPTTMNEMVRHLGTVHLPLYIGAASVLVSTSRALLRHIRRSAPGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATETTNAGGAT*
Ga0137407_1120219823300012930Vadose Zone SoilEASGGKDSAAASEVVMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA*
Ga0137407_1159893123300012930Vadose Zone SoilEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDQRRATDTTNAGGAA*
Ga0157380_1239934223300014326Switchgrass RhizosphereDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEAWHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA*
Ga0157377_1094687113300014745Miscanthus RhizosphereMNDRSARAWIWASLLLQCVGYVYDAVWHGWLNPGLEPQTTDAMARHLGSVHLPLYIGALSVLVATFTVLVDQARRSSAGIALPIAFAGAVVSAGAEAWHAASHMRLDTHSAPSAGILSVVGFFVVVVATYCWGRRGARHDAEATSERHRAA*
Ga0137418_10004688153300015241Vadose Zone SoilMWEDRSLSKWIWASIILQALGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG*
Ga0137418_1011837913300015241Vadose Zone SoilMMEDRSARTWIWASLILQFFGYVFDAVWHGLLSPGVEPTTVGEMVRHLGTVHLPLYIGAASVLVSTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRCAEATTDE
Ga0182038_1083745823300016445SoilMQTSEVSMEDRSARTWVWAALLLQAFGYGFDAVWHGLLHPGVEPTTRSEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAMLSTAAEAWHASSHLRLDTHSAPLAGVLSVIGFFGVVIAMALSRRGRRRAADTTNARDAV
Ga0184609_1014276313300018076Groundwater SedimentMSGRSPRTWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLVSTGLALLRQVRRSTHGGGALLLAFVGAMLSASAEAWHAYSHLRLDTHHAPVAGTLSAVGFLVVVVTMWRSGRG
Ga0184609_1018833813300018076Groundwater SedimentMEDRSARRWVWVSLILQALGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPIAGVLSVLGFLV
Ga0184612_1062213013300018078Groundwater SedimentMLDQSVRTWVWASLILQSLGYVFDAVWHGLLNPGVEPQTVGEMARHLSTVHMPLYIGVASVLVSTFTALVRSIGRSPTGIALPIACAGCVLSAGAEAWHAYSHLRLDTHSAPVAGTLSFVGFRVVVT
Ga0066667_1003570523300018433Grasslands SoilVLDAVWHGLLNPGIEPTTTGQMITHLSTVHLPLYIGAASVVVSTSKALLRQISRFAPGITLPIAFAGAVLSAAAETWHAYSHLRLETQSAPVAGSLSFIGFLVVVIAISRSGWARRHRATDTPNERRDT
Ga0137408_122872113300019789Vadose Zone SoilMWEDRSLSKWIWASIVLQALGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVAAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG
Ga0193725_105399313300019883SoilRSLSKWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVGAVLSASAEAWHAYSHLHLDTHHAPVGGTLSAVGFLVVVVAMWRSRRG
Ga0210378_1027079513300021073Groundwater SedimentKWIWASIILQAVGYAIDVVWHGLLNPGVEPATVRDMARHLGTVHLPLYIGAASVLVSTGLALLRQVRRSTHGGGALLLAFVGAMLSASAEAWHAYSHLRLDTHHAPVAGTLSAVGFLVVVVTMWRSGRG
Ga0207653_1045048323300025885Corn, Switchgrass And Miscanthus RhizosphereGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA
Ga0207684_1000773353300025910Corn, Switchgrass And Miscanthus RhizosphereMDRLAEDRAARRWIWIALTVQCVGYAFDVVWHGLLNPGVEPKTVDEMLRHLGTVHPPLYLGAASVLVATANALVRQIRRSTAGGALPIAFAGAVLSTAAEAWHASFHLRLDTHSAPIAGILSVVGFLVVVIALSLSSGRWRYSRPS
Ga0207707_1011208023300025912Corn RhizosphereMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA
Ga0207646_10005900173300025922Corn, Switchgrass And Miscanthus RhizosphereVQIVGYIIDAFWHGLLRPGVEPTTLSDMARHLKTVHLVLYVGAAGVLVTTAVALLRQIRRSAATLALGIAFGGAMLSTAAEAWHAYSHLTLDTSHAPIAGLLSGVGFVVVVAAMALSGWRRRRGL
Ga0207670_1075548713300025936Switchgrass RhizosphereHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA
Ga0207667_1213169713300025949Corn RhizosphereMQGRSARTWVWASLLLQGGGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA
Ga0207674_1109245013300026116Corn RhizosphereARAWIWASLLLQGVGYVYDAVWHGWLNPGLEPQTTEAMARHLGSVHPPLYFGALSVLVATFTVLVDQARRSGAGIELPIAFAGAVLSAGAEAWHAASHMRLDTHSAPIAGILSVVGFFVVLVATYCWGRSSARRDAAATSERHRAA
Ga0209438_101344013300026285Grasslands SoilVGYAIDVVWHGLLNPGVEAATVRDMARHLGMVHLPLYIGAASVLISTSLALLRHVRHWAHGAAHLLLAFVGAVLSASAEAWHGYSHLHLDTHHAPVAGTLSAVGFLVVVVAMWRSRRG
Ga0209234_101028333300026295Grasslands SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERRAA
Ga0209237_106633413300026297Grasslands SoilMQDQSARTWVWASLILQFLGYVLDAVWHGLLNPGVEPTTVEEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIRRSAPGIALPIAFAAAVLSAGAEASHAYSHLRLDTHSAPVAGTLSLIGFLVVVLAMALSSGGRRRRAGDTTKE
Ga0209236_128820913300026298Grasslands SoilMEDRSARTWVWASLILQAFGYGFDAVWHGLLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSATGSALPVAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLS
Ga0209055_121571913300026309SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVAIAI
Ga0209471_102699643300026318SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDERCAA
Ga0209470_120442523300026324SoilRALTVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL
Ga0209377_115047723300026334SoilVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISNATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL
Ga0209157_121082923300026537SoilVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL
Ga0209056_1038748123300026538SoilLTVRRQSARTWIWAAVIVQFLGYLIDVAWHGLLSPGVEPATTGDMMRHLATVHLPLYVGAAGVLISTATALLQSIRRSSTGIALPVAFIGAVVASGAEAWHAYAHLHLDTHSAPAAGILSVIGFVVVVIAMFLRRLAL
Ga0209648_1009700723300026551Grasslands SoilMNDHSMRRWIWVSVIVQFLGYVVDAVWHGLLNPGVEPTTVGEMARHLGTVHLPLYIGAASVLVSTSGALLRQMRRSSTGIALPIAFAGVVLSAGAEAWHAYSHLRLDTHSAPVAGTLSVLGFVVVVIGMSVSSWERRRRAARTTEQRRAA
Ga0209689_102440623300027748SoilMQDRGARTWIWASLILQFLGYVLDVIWHGLLNPGVEPQTASEMARHLSTVHLPLYIGAASVLVTTSRALLRQVRRSASGIALPIAVAGAVLSATAEAWHASSHLRLDTHSAPVAGILSVIGFLVVVIAISLSSWFHRRGAEATTDEPRAA
Ga0209590_1026864823300027882Vadose Zone SoilYGFDAVWHGRLHPGVEPTTMGAMVRHLGTVHLPLYIGAVSVLVSTSRALLRQIRRSAMGSALPIAVAGAMLSTAAEAWHAYAHLRLDTHSAPIAGVLSVIGFLVVVIAMALSRGGQRRAA
Ga0209590_1067907013300027882Vadose Zone SoilLARCPRGGVARVEHTVRSGLLFQRGDAAVEAEEVMREDRKARTWIWASLILQFLGYVFDAGWHGLLSPGVEPTTVGEMVRHLGTVHLPLYIGAASVLASTSRALLRQARRLAIGFVMPIAFGGAALSAAAEAWHAYSHLHLDTHSAPIAGTLSVIGFFVVVVAMSLSSARWRRRTVDTTNERGAA
Ga0137415_1003619813300028536Vadose Zone SoilMQDRSARRWIWASLILQFLGYVLDATWHGLLNPGVEPKTTGEMARHLSTVHLPLYIGAASVLVATSRALLRWVKRSAAGIALPIAFAGAVLSATAEAWHAYSHLRLDTHSAPVAGTLSVVGFLVVVIAMLLSSVRRRRAADTTNA
Ga0247823_1106255413300028590SoilMKDRSARAWIWTSICLQLAGYVLDAAWHGLLNPGKEPQTLGQMIQHLATVHFLLYIGAASVLVSTSGALVRRIGRSLIGIALPVAVVGAWLSAGAEGWHAYSHLRLDTHSAPIAGILSFLGFLVVVLAMASSRWARRHRPAATSSRPRVASPAAGDIRRSSPRS
Ga0247822_1095904823300028592SoilMKDRSARAWIWTSICLQLAGYVLDAAWHGLLNPGKEPQTLGQMIQHLATVHFLLYIGAASVLVSTSGALVRRIGRSLIGIALPVAVVGAWLSAGAEGWHAYSHLRLDTHSAPIAGILSFLGFLVVVLAMASSRWARRHRPAATSSRPRVASAPLAVTRLTTVVTSV
Ga0308189_1037440323300031058SoilMEDQSARTWVWASLILQALGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVHLPLYIGAASVLVSTSRALLRQIGRSATGLALPIAVAGAVLSTAAEAWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSREDRRRATDTTNA
Ga0308187_1042715413300031114SoilMEDRSARRWVWVSLILQDLGYVFDAIWHGLLHPGVEPTTMSEMVRHLGTVHLPLYIGAASVLVSTSRALLRYIWRSAPGLALPIAVAGAVLSTAAEVWHAYAHLRLDTHSAPMAGTLSVIGFLVVVITMSLSRGDRRRATDTTNAGSAA
Ga0310888_1013233713300031538SoilVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA
Ga0310890_1071420423300032075SoilVTMQGRSARTWVWASLLLQGVGYLYDAVWHGLLNPGLEPTTREAMARHLATVHLPLYIGAASVLLSTLTLLVREVRRSAAGIALPIAFAGSVLSAAAEARHASSHLRLDTHSAPVAGILSLTGFLVVVIAMSLSGRRRQRRAADTSERRRVA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.