NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F038375

Metagenome Family F038375

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F038375
Family Type Metagenome
Number of Sequences 166
Average Sequence Length 132 residues
Representative Sequence MKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Number of Associated Samples 135
Number of Associated Scaffolds 166

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 69.28 %
% of genes near scaffold ends (potentially truncated) 38.55 %
% of genes from short scaffolds (< 2000 bps) 77.11 %
Associated GOLD sequencing projects 122
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.398 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(26.506 % of family members)
Environment Ontology (ENVO) Unclassified
(45.181 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(62.048 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176
1GPIPI_02167280
2GPIPI_01377190
3GPIPI_01494340
4F14TC_1002951421
5KanNP_Total_noBrdU_T14TCDRAFT_10329191
6AP72_2010_repI_A001DRAFT_10215961
7JGI12627J18819_103561771
8JGIcombinedJ43975_100023872
9JGIcombinedJ43975_100170022
10JGIcombinedJ43975_100643891
11JGI25405J52794_100077433
12Ga0062589_1015056381
13Ga0066397_101539781
14Ga0063356_1010988271
15Ga0066674_100071173
16Ga0066683_100168222
17Ga0066683_102220563
18Ga0066673_102762261
19Ga0066688_105937451
20Ga0066684_103618042
21Ga0066684_108015171
22Ga0066678_102144671
23Ga0066676_103661561
24Ga0066675_101672041
25Ga0065712_101857421
26Ga0065715_105168052
27Ga0066686_105144912
28Ga0066682_107941012
29Ga0066681_103209021
30Ga0066681_104746911
31Ga0066697_100691882
32Ga0066697_102577713
33Ga0066707_101475881
34Ga0066704_101733642
35Ga0066698_100077063
36Ga0066698_105284913
37Ga0066705_106368551
38Ga0066694_100675002
39Ga0066905_1002811001
40Ga0066903_1028985722
41Ga0081455_100483782
42Ga0081455_100520882
43Ga0066656_100457441
44Ga0066652_1000073207
45Ga0066652_1007958971
46Ga0070716_1002647432
47Ga0066665_101500612
48Ga0066665_108776061
49Ga0066659_101270231
50Ga0066660_116362671
51Ga0075433_113468631
52Ga0075434_1005020222
53Ga0075424_1016223392
54Ga0099791_100624562
55Ga0066710_1006445871
56Ga0066710_1035391302
57Ga0066709_1003443092
58Ga0126380_100021273
59Ga0134070_100304011
60Ga0134082_100374712
61Ga0134067_100563342
62Ga0134084_100535882
63Ga0134086_101542603
64Ga0134086_101885332
65Ga0134086_104372501
66Ga0134064_100392542
67Ga0134065_100873522
68Ga0134063_100750592
69Ga0134063_103257923
70Ga0134062_102677821
71Ga0126376_100300582
72Ga0126378_130626792
73Ga0126377_100093166
74Ga0134066_101397581
75Ga0134066_101560051
76Ga0134126_123988621
77Ga0126383_100911053
78Ga0138514_1000196384
79Ga0137364_101295082
80Ga0137364_101812721
81Ga0137365_101354191
82Ga0137365_104642992
83Ga0137363_103105793
84Ga0137362_104363242
85Ga0137377_100319824
86Ga0137370_100983471
87Ga0137367_108631862
88Ga0137366_100124015
89Ga0137366_109375721
90Ga0137369_103272092
91Ga0137360_102917822
92Ga0137361_100470073
93Ga0137359_101984801
94Ga0137359_117199541
95Ga0137404_100308013
96Ga0137407_105948342
97Ga0126375_100226955
98Ga0134110_100708042
99Ga0134081_100661841
100Ga0134078_100411362
101Ga0134078_102276761
102Ga0134079_101762212
103Ga0137403_109418982
104Ga0134072_100091452
105Ga0134089_100050503
106Ga0134085_104671121
107Ga0134112_100474232
108Ga0134083_100693302
109Ga0184604_101819112
110Ga0184605_100102283
111Ga0184608_100271712
112Ga0184621_100208382
113Ga0184619_100807561
114Ga0184618_100199142
115Ga0184618_100531862
116Ga0184635_100112762
117Ga0184609_100920322
118Ga0184625_100044432
119Ga0066655_107288202
120Ga0066655_109277412
121Ga0066667_100516642
122Ga0066662_101176483
123Ga0066669_105829162
124Ga0066669_116714251
125Ga0137408_10472052
126Ga0193701_10859791
127Ga0193715_10309591
128Ga0193725_10216101
129Ga0193747_10188152
130Ga0193718_11016622
131Ga0193731_10563682
132Ga0179594_101718102
133Ga0210378_103627462
134Ga0193750_10318572
135Ga0222622_101112572
136Ga0222622_107589572
137Ga0207665_103613092
138Ga0209239_10580892
139Ga0209470_10404722
140Ga0209470_10622132
141Ga0209470_11660201
142Ga0209375_10869872
143Ga0209473_11110021
144Ga0209267_11749652
145Ga0209057_10300182
146Ga0209808_10171252
147Ga0209058_10355393
148Ga0209058_11379791
149Ga0209157_12699741
150Ga0209056_100087314
151Ga0209156_102200692
152Ga0208707_1026411
153Ga0208475_10220011
154Ga0209973_10147881
155Ga0210002_11073581
156Ga0307301_103054381
157Ga0307280_102335982
158Ga0307302_102204122
159Ga0307278_100873322
160Ga0310813_110872812
161Ga0307471_1023743521
162Ga0310812_104362502
163Ga0310810_100172886
164Ga0310810_100423912
165Ga0310810_100934723
166Ga0310811_106844302
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 19.12%    β-sheet: 32.35%    Coil/Unstructured: 48.53%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

20406080100120MKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAKSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Soil
Soil
Grasslands Soil
Forest Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Miscanthus Rhizosphere
Tabebuia Heterophylla Rhizosphere
Arabidopsis Thaliana Rhizosphere
Populus Rhizosphere
6.0%10.2%13.3%3.6%14.5%26.5%6.0%3.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_021672802088090014SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
GPIPI_013771902088090014SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAR
GPIPI_014943402088090014SoilMKGPIMKPILILAAALVIALSDQAFARLGQAEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSITESYARVDKRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK
F14TC_10029514213300000559SoilSDQAFARLSQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATASCETVAGRPTLLIRAKQLEAISCQELTYFSR*
KanNP_Total_noBrdU_T14TCDRAFT_103291913300000596SoilQAFARLSQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATASCETVAGRPTLLIRAKQLEAISCQELTYFSR*
AP72_2010_repI_A001DRAFT_102159613300000893Forest SoilMKRILILAATFVIVLSGLAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDSRRLSEKELSIFLQGNSADKEWKKDPRKLAWERSDHHANAWCEMIAGRPTLLIRAK*
JGI12627J18819_1035617713300001867Forest SoilMKGIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGDYLAMVQFFNGHSVTEAYARLDRHKLSEKELSVFLQGNSAGKEWKKDPRKLAWER
JGIcombinedJ43975_1000238723300002899SoilMKGIVILAAALVIALSDQAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGDYLAMVQFLKGHSVTESYARVDRHKLSQKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
JGIcombinedJ43975_1001700223300002899SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSITESYARVDKRKLSEKELSIFLQGNSAGKEWIKDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
JGIcombinedJ43975_1006438913300002899SoilMKIVILAAALIIALSDQAFARLGQTENQVNPLFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
JGI25405J52794_1000774333300003911Tabebuia Heterophylla RhizosphereMKPIVIXAAALIIALSDQAFARLGQTEDQVXALFGKPIDPGXPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHANAWCETLAGRPTLLICAK*
Ga0062589_10150563813300004156SoilSKGSIMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFFKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0066397_1015397813300004281Tropical Forest SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDNDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADRHKLSEKELSIFLQGNSAGKEWKKDPRKL
Ga0063356_10109882713300004463Arabidopsis Thaliana RhizosphereARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRAK*
Ga0066674_1000711733300005166SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0066683_1001682223300005172SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066683_1022205633300005172SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKSIDPGKPDSDGITTKMYKNPTGEYLAVVQFLKGRSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0066673_1027622613300005175SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKTDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKRDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0066688_1059374513300005178SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066684_1036180423300005179SoilMKHIVILAAVLVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0066684_1080151713300005179SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHH
Ga0066678_1021446713300005181SoilYQNQRTMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066676_1036615613300005186SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGKFAWERSDHHASAWCETIAGRPTLLIRARY*
Ga0066675_1016720413300005187SoilMKGSIMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATA
Ga0065712_1018574213300005290Miscanthus RhizosphereMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFFKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETMAGRPTLLIHAK*
Ga0065715_1051680523300005293Miscanthus RhizosphereALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKNPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0066686_1051449123300005446SoilMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066682_1079410123300005450SoilTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066681_1032090213300005451SoilHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKTDSDGITTNMYKNPTGEYLAVVQFLKGRSITESYARVDSRKLSEKELSIFLQGNSAGKEWKRDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0066681_1047469113300005451SoilHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK*
Ga0066697_1006918823300005540SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK*
Ga0066697_1025777133300005540SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKSTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHA
Ga0066707_1014758813300005556SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATA
Ga0066704_1017336423300005557SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRGK*
Ga0066698_1000770633300005558SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGRFAWERSDHHASAWCETIAGRPTLLIRARY*
Ga0066698_1052849133300005558SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATA
Ga0066705_1063685513300005569SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0066694_1006750023300005574SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0066905_10028110013300005713Tropical Forest SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRGK*
Ga0066903_10289857223300005764Tropical Forest SoilMKPIVILAAALIIALSDEAFARLGQTEDQVNALFGKPIDAGKPDSDGITTNMYKNPTREYLAVVQFLQGHSVTESYARVDRHEFSEKELSVFLQGNSAGKEWKKDPRKLAWERSDHHASAWCEMLAGRPTLLIRAK*
Ga0081455_1004837823300005937Tabebuia Heterophylla RhizosphereMKPIVILAAALIIALSDQAFARLGQTEDQVSALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHANAWCETLAGRPTLLICAK*
Ga0081455_1005208823300005937Tabebuia Heterophylla RhizosphereMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
Ga0066656_1004574413300006034SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAW
Ga0066652_10000732073300006046SoilMKLIAILAAALVIASSDHAFARLGQTEDQVDALFGKPIDPGKPDSDGITTNMYKNRTGEYLAMVQFVKGHSITESYARVDSRNLSKKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0066652_10079589713300006046SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKTDSDGITTNMYKNPTGEYLAVVQFLKGRSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKL
Ga0070716_10026474323300006173Corn, Switchgrass And Miscanthus RhizosphereMKTMRCLSSLCLVLTFTLNGHVIARLGKTEDEVSALFGKPIDPGTPDSNGVTTNMYRNPTGVYLAVVQFLNGHSIAETYARVDNHKFSEKELSIFLQGNSGGKEWKKDPRKVAWERSDHHAKAWCETLAGKPTLLIQLK*
Ga0066665_1015006123300006796SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKGLSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCE
Ga0066665_1087760613300006796SoilMKLIAILAATLVIASSNHTFARLGQTEDQVSALFGKSIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSVTESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0066659_1012702313300006797SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLIRAK*
Ga0066660_1163626713300006800SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHRATAWCETLSGRPTLLIQARY*
Ga0075433_1134686313300006852Populus RhizosphereLVIRIWKKSKPCSLPKPSHHERSIMKPIVILAAAFIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
Ga0075434_10050202223300006871Populus RhizosphereMKPILILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
Ga0075424_10162233923300006904Populus RhizosphereLPKPSHHERSIMKPIVILAAAFIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLAGRPTLLIRAK*
Ga0099791_1006245623300007255Vadose Zone SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0066710_10064458713300009012Grasslands SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAAVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGRFAWERSDHHASAWCETIAGRPTLLIRARY
Ga0066710_10353913023300009012Grasslands SoilLVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK
Ga0066709_10034430923300009137Grasslands SoilMKGSIMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHH
Ga0126380_1000212733300010043Tropical Forest SoilMKPIAILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRAK*
Ga0134070_1003040113300010301Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134082_1003747123300010303Grasslands SoilMKHIVILAAVLVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134067_1005633423300010321Grasslands SoilMKGSIMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134084_1005358823300010322Grasslands SoilMKGSIMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKSIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134086_1015426033300010323Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKL
Ga0134086_1018853323300010323Grasslands SoilMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVSALFGKSIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSITESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETMAGRPTLLIRAK*
Ga0134086_1043725013300010323Grasslands SoilIALSDQAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK*
Ga0134064_1003925423300010325Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETMAGRPTLLIRAK*
Ga0134065_1008735223300010326Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134063_1007505923300010335Grasslands SoilMKGSIMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK*
Ga0134063_1032579233300010335Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKL
Ga0134062_1026778213300010337Grasslands SoilAALVIASSDHAFARLGQTEDQVSALFGKSIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0126376_1003005823300010359Tropical Forest SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRAK*
Ga0126378_1306267923300010361Tropical Forest SoilMKRILILAAAFITALSGQAFARLGQTEDQVNGLFGRPIDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHANAWCEM
Ga0126377_1000931663300010362Tropical Forest SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADRHKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRAK*
Ga0134066_1013975813300010364Grasslands SoilEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134066_1015600513300010364Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLKW
Ga0134126_1239886213300010396Terrestrial SoilQNRRTLKGSIMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKNPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0126383_1009110533300010398Tropical Forest SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADRHKLSEKELSIFLQGNSAGKEWKKGPRKLAWERSDHHASAWCETVAGRPTLLIRGK*
Ga0138514_10001963843300011003SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWER
Ga0137364_1012950823300012198Vadose Zone SoilMTMGRQATVKLILILAAALITLGSEAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHRASAWCETLARRPTLLIRAK*
Ga0137364_1018127213300012198Vadose Zone SoilMKHIVILAAALVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLIRAK*
Ga0137365_1013541913300012201Vadose Zone SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0137365_1046429923300012201Vadose Zone SoilMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSITEWYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHH
Ga0137363_1031057933300012202Vadose Zone SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKPAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0137362_1043632423300012205Vadose Zone SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0137377_1003198243300012211Vadose Zone SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLA
Ga0137370_1009834713300012285Vadose Zone SoilMKHIVILAAALVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLI
Ga0137367_1086318623300012353Vadose Zone SoilMKHIVILAATLVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRA
Ga0137366_1001240153300012354Vadose Zone SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCET
Ga0137366_1093757213300012354Vadose Zone SoilILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK*
Ga0137369_1032720923300012355Vadose Zone SoilMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVSALFGKSIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSITESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAT
Ga0137360_1029178223300012361Vadose Zone SoilMKHIVILAAALVIALSDQALARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0137361_1004700733300012362Vadose Zone SoilMKHIVILAAALVIALSDQALARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKPAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0137359_1019848013300012923Vadose Zone SoilMKTMKCLPVLCLALTFTLSGQTIARLGNTEDEVSALFGKPVDPGTPGRNGLTTNMYRNRTNEYLAAVEFLNGHSIAESYARVDNHKLSEKELSIFLQGNSGGKEWKKDPRKLAWERSDHHAKAWCETLAGRPTLLIQLK*
Ga0137359_1171995413300012923Vadose Zone SoilLALTFTLSGQIIARLGNTEDEVSALFGKPVDPGTPGSNGLTTNMYRNRTNEYLAVVQFLNGHSIAESYARVDNHKFSEKELSIFLQGNSGGKEWKKDPRKLAWERSDHHAKAWCETLAGRPTLLIQLK*
Ga0137404_1003080133300012929Vadose Zone SoilMKGPIMKPILILAAALVIAVSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSITESYARVDKRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETMAGRPTLLIRAK*
Ga0137407_1059483423300012930Vadose Zone SoilMKGPIMKPILILAAALVIAVSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSITESYARVDKRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCE
Ga0126375_1002269553300012948Tropical Forest SoilAATFVIVLSGLAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERIDHHASAWCEMIAGRPTLLIRAK*
Ga0134110_1007080423300012975Grasslands SoilMKGSIMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134081_1006618413300014150Grasslands SoilVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLIRAK*
Ga0134078_1004113623300014157Grasslands SoilMKGQIMKLIAILAAALVIASSDHAFARLGQTEDQVSALFGKSIDPGKPDSDGITTKMYKNPTGEYLAVVQFLKGRSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0134078_1022767613300014157Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0134079_1017622123300014166Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGRPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK*
Ga0137403_1094189823300015264Vadose Zone SoilMKRILILAAALVIALSYHAFARLGQTEDQVNVLFGKPVDPGKPDSDGITTNMYKNPTREYIAAVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGKFAWERSDHHASAWCETIAGRPTLLIRARY*
Ga0134072_1000914523300015357Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYALVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRAK*
Ga0134089_1000505033300015358Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETMAGRPTLLIRAK*
Ga0134085_1046711213300015359Grasslands SoilTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK*
Ga0134112_1004742323300017656Grasslands SoilVILAAALVIALSDQAFARLGQTEEQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRAK
Ga0134083_1006933023300017659Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLIRAK
Ga0184604_1018191123300018000Groundwater SedimentARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184605_1001022833300018027Groundwater SedimentMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0184608_1002717123300018028Groundwater SedimentMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184621_1002083823300018054Groundwater SedimentMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184619_1008075613300018061Groundwater SedimentMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKRDPRKLAWERSDHRATAWCETMAGRPTLLIRAK
Ga0184618_1001991423300018071Groundwater SedimentRLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKRDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184618_1005318623300018071Groundwater SedimentMKHVLILATTIMIAWGSQAVARLGNTEEQVNAVLGKPTDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGRSVAEGYSRVDRHKLSEKELSIFLEGNSAGNKWEKKPGKKAAWIRSDHRAHASYETVSGYPTLMVQAHY
Ga0184635_1001127623300018072Groundwater SedimentMKPIVILAAALTIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184609_1009203223300018076Groundwater SedimentMKPIVILAAALTIALSDQAFARLGQTEDQVNALFGNPTDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0184625_1000444323300018081Groundwater SedimentMKPIVILAAALTIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0066655_1072882023300018431Grasslands SoilMKHIVILAAALVIAFSDQAFARLGQTEDQVSALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK
Ga0066655_1092774123300018431Grasslands SoilAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK
Ga0066667_1005166423300018433Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK
Ga0066662_1011764833300018468Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKPAWERSDHHATAWCETMAGRPTLLIRAK
Ga0066669_1058291623300018482Grasslands SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK
Ga0066669_1167142513300018482Grasslands SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKTDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCET
Ga0137408_104720523300019789Vadose Zone SoilMKHIVILAAPLVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0193701_108597913300019875SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGNPTDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWC
Ga0193715_103095913300019878SoilMKHVLILAAALLIALSGQAFARLGNTEAQVSALFGKPVDSGKPDSNGVTTNMYKNPTGEYLAVVQFLRGHSVAEVYSRVDSRRLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLSGRPTLLIRAR
Ga0193725_102161013300019883SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKRDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0193747_101881523300019885SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0193718_110166223300019999SoilMKHVLILAAALVIALSGQAFARLGNTEAQVSALFGKPVDSGKPDSNGVTTNMYKNPTGEYLAVVQFLRGHSVAEVYSRVDSRRLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLSGRPTLLIRAR
Ga0193731_105636823300020001SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0179594_1017181023300020170Vadose Zone SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0210378_1036274623300021073Groundwater SedimentMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLL
Ga0193750_103185723300021413SoilMKHVLILAAALVIALSGQAFARLGNTEAQVSALFGKPVDPGKPDSNGVTTNMYKNPTGEYLAVVQFLRGHSVAEVYSRVDSRRLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLSGRPTLLIRAR
Ga0222622_1011125723300022756Groundwater SedimentMKHIVILAAALTIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFFKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0222622_1075895723300022756Groundwater SedimentLTMKHILMLAATIVIALSGQAFARLGQTEEQVSALFGKPIEADKPDKEGVTTNTYKNPTGEYIALVQFQKGHSIAEVYSRADGGKLSEKEMSIFLQGNSGGKEWIKDPHKLAWERSDHRAKAWYETLSGRPTLLIQAK
Ga0207665_1036130923300025939Corn, Switchgrass And Miscanthus RhizosphereMKTMRCLSSLCLVLTFTLNGHVIARLGNTEDEVSALFGKPIDPGTPDSNGVTTNMYRNPTGVYLAVVQFLNGHSIAETYARVDNHKFSEKELSIFLQGNSGGKEWKKDPRKVAWERSDHHAKAWCETLAGKPTLLIQLK
Ga0209239_105808923300026310Grasslands SoilMKHIVTLTAVLVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVTGRPTLLIRAK
Ga0209470_104047223300026324SoilMKLIAILAAALVIASSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGRSITESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETMAGRPTLLIRAK
Ga0209470_106221323300026324SoilMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0209470_116602013300026324SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGRFAWERSDHHASAWCETIAGRPTLLIRAR
Ga0209375_108698723300026329SoilMKHIVILAAALVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRGK
Ga0209473_111100213300026330SoilLSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK
Ga0209267_117496523300026331SoilSDQAFARLGQTEDQVNALFGKSTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRG
Ga0209057_103001823300026342SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGKFAWERSDHHASAWCETIAGRPTLLIRARY
Ga0209808_101712523300026523SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWCETVAGRPTLLIRAK
Ga0209058_103553933300026536SoilMKRILILAAALVIALSYHAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGEYIAVVQFLKGHSITESYARVDRRKLSEKELSIFLQGNSAGKEWKKDPGGRFAWERSDHHASAWCETIAGRPTLLIRARY
Ga0209058_113797913300026536SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVA
Ga0209157_126997413300026537SoilMKHIVILAAALVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKFSEKELSIFLQGNSAGKEWKKDPRKLKWERSDHHATAWC
Ga0209056_1000873143300026538SoilMKGSIMKHIVILAAALVIALSNQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATALCETVAGRPTLLIRAK
Ga0209156_1022006923300026547SoilMKHIVILAAALVIAFSDHAFARLGQTEDQVNALFGKPTDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKFSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0208707_10264113300026699SoilKGIVILAAALVIALSDQAFARLGQTEDQVNALFGKPVDPGKPDSDGITTNMYKNPTGDYLAMVQFLKGHSVTESYARVDRHKLSQKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0208475_102200113300027018SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSITESYARVDKRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHA
Ga0209973_101478813300027252Arabidopsis Thaliana RhizosphereMKMKYVLILAATITIALSSQAVARLGNTEDQVKAVLGKPTDPGKPDGDSITTNMYKNPTGEYIAVVQFLKGHSVAEASSRTDRRELSERELSIFLEGNSGGNKWEKKPGKKAAWIRSDHRAHAWYETVSGRPTLMVQAHY
Ga0210002_110735813300027617Arabidopsis Thaliana RhizosphereEDQVKAVLGKPTDPGKPDGDSITTNMYKNPTGEYIAVVQFLKGHSVAEASSRTDRRELSERELSIFLEGNSGGNKWEKKPGKKAAWIRSDHRAHAWYETVSGRPTLMVQAHY
Ga0307301_1030543813300028719SoilMKHIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLMGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0307280_1023359823300028768SoilIALSGQAFARLGNTEAQVSALFGKPVDSGKPDSNGVTTNMYKNPTGEYLAVVQFLRGHSVAEVYSRVDSRRLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETLSGRPTLLIRAR
Ga0307302_1022041223300028814SoilMKPILILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSITESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETMAGRPTLLIRAK
Ga0307278_1008733223300028878SoilMKRIVILAAALVIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKNPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0310813_1108728123300031716SoilMKYIVILAAALVIALSDQAFARLGQTEAQVNALFGKPIDPGKPDSDGITTNMYKNPTGEYLAVVQFLKGHSVTESYARADSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETVAGRPTLLIRAK
Ga0307471_10237435213300032180Hardwood Forest SoilMKRILLLIAILCVLFSGPIFARLGQTEDEVSALFGKSIDQGTPDNNGVTTNMYRNPTGEYIAVVQFLKGRSISESYARVDSRKLSEKELSVFLQGNSAGKEWKKDPHKLAWERSDHHANAWCETLAGRPTLLITLK
Ga0310812_1043625023300032421SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHRATAWCETLSG
Ga0310810_1001728863300033412SoilMKDPSILCSILIFTLCGNAIALIGQTEDEVSALLGKPIDPGKPDGDGVTTNMYKNPGGEYLALVQFARGHSIAESYARVDSHTLSEKELSAFLQGNRGGKEWKKDPHKLAWERSDHRARAWCETLSGRPTLLIQLK
Ga0310810_1004239123300033412SoilMKHIVILAAALVIALSDQAFARLGQTEAQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHASAWCETVAGRPTLLIRAK
Ga0310810_1009347233300033412SoilMKPIVILAAALIIALSDQAFARLGQTEDQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESHARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHATAWCETVAGRPTLLIRAK
Ga0310811_1068443023300033475SoilMKHIVILAAALVIALSDQAFARLGQTEAQVNALFGKPIDPGKPDSDGITTNMYKNPTREYLAVVQFLKGHSVTESYARVDSRKLSEKELSIFLQGNSAGKEWKKDPRKLAWERSDHHAAAWCETVAGRPTLLIRAK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.