NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F054744



Overview

Basic Information
Family ID: F054744
Family Type: Metagenome
Number of Sequences: 139
Average Sequence Length: 158 residues
Representative Sequence: TFKWLVMAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLAPFAPDSTEVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLACDAASAHKPCAGSDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIARVPRTASLRVSEGLGVEIVR
Number of Associated Samples: 114
Number of Associated Scaffolds: 139

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 97.84 %
% of genes from short scaffolds (< 2000 bp): 91.37 %
Associated GOLD sequencing projects: 108
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.73

Hidden Markov Model (sequence logo rendered with Skylign)

Most Common Taxonomy
Group: Bacteria (99.281 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (35.252 % of family members)
Environment Ontology (ENVO): Unclassified (55.396 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (72.662 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (139 sequences; alignment columns numbered up to 228, displayed with MSAViewer).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 25.56%, β-sheet: 24.44%, Coil/Unstructured: 50.00%
Feature Viewer tracks along the representative sequence: Sequence, α-helices, β-strands, Coil, SS Conf. score, Signal Peptide

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.73

Structural matches with SCOPe domains




Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart of family members by NCBI taxonomy level (labels: All Organisms, Unclassified; labelled slice: 99.3%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Chart of associated habitat types.
Legend entries: Groundwater Sediment; Soil; Vadose Zone Soil; Terrestrial Soil; Grasslands Soil; Switchgrass Rhizosphere; Permafrost; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Populus Rhizosphere
Labelled slices: 12.2%, 10.8%, 10.1%, 35.3%, 10.8%, 3.6%, 5.0%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_10952867413300000956SoilWIVAGVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFSPESTQVVTAYDGVLVNRYLQGPPVLRYDPVAAPDFTLSLACDAVPHKPCADRDVDVVLWDDLLRAEGPGWQEVHMPHGGRLRIARVPRSASLRVSEGLGVEIVR*
JGI25385J37094_1021712713300002558Grasslands SoilGEYGYVFSMLPGXSVIAARGAIALAKGLRRPRSLRWLVAGVALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPVPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRA
JGI25384J37096_1018850913300002561Grasslands SoilLFVHVGEYGYVFSMLPGVSVIAARGAVALAKGLRRPRSLRWLVAGVALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPVPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSXGLGVAIIR*
JGI25382J43887_1041164213300002908Grasslands SoilAMLAARGAIAFAKALRMPRTFKWIVMAVVLGDAAIFLLSDTPISARDTARHDRGIDEKVLYLSSFSPQTTFVVSAYDALLAENFLQRLPGGLPLLEYDPANPDFTKPLSCGAAPPTMPCAEPSVDVVLWDDLLRAEGPGWQEVRMPHGARLRIAHVPRSASLRVSEGLGVEIVR*
Ga0062591_10154824913300004643SoilLWRTAFMILWTFAPLPFYIFVHIGEYGYIFSMLPGLVIVAARGSIALAKGLRMPRTFRWLVATVVLGNAAIFLLSDTPISARDVARHDRGIDEKMAYLASFPSESTQVVTAFDDVLVNHYLHGPPVLRYDPVADPAFTLPLSCEAAAPRKPCGGSEVDVVLWDDLLRGVGPGWQEVRLPHGGHLRIAHVARSASLRVSEGLAVEIVR*
Ga0066674_1005398813300005166SoilWIVAATVLGNATIFLFSDTPISARDIARHDRGIDEKLAYLATLAPESTEVVTAYDGVLVDHYVPSTPVFRYDPAATPDFTLALACGKVTQHRPCADTDVDVVLWDDLLRAEGPGWLEVRMPHGARLRIAYLPRSASLRVSEGLGVEIVR*
Ga0066677_1024814013300005171SoilARGAIALAKGVRMPRTFRWIVAAAVLGNAAIFLFSDTPISARDIARHDRGIDEKLAYLATLAPESTEVVAAYDGVLVDHYVRSAAVFRYDPATTPDFTLPLACGRVTQHRPCADSDVDVVLWDDLLRAEGSDWLEVRMPHGARLRIAYLPRSASLRVSGGLGVEIVR*
Ga0066673_1000470463300005175SoilSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSRSLRVSEGLGVEIVP*
Ga0066679_1001155953300005176SoilSDTPISARDIARQDRGIDDKLRYLAAFSPQTTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR*
Ga0066679_1024128423300005176SoilIHVGEYGYIFSTLPGLAIVAARGSIALAKGMRMPRTFKWIVMAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVP*
Ga0066679_1038174323300005176SoilLLFIWTLAPLPFYVFVHVGEYGYVFSMLPGLAIIAARGAIALAKGLRRPRTFRWIVAGVVLANAAIYLLSDTPLSARDISRHDRGIDEKTALLGSYAPATTLVVSAYDSVLAENYLARLPGGLPLLEYDPANPDFTKPLACAAFPATMPCSGEALDVVLWDDLLRAEGTGWTERVLPHGARLRIAHVPRTSSLRVREGLGVEIVR*
Ga0066690_1053411013300005177SoilIFLLSDTPISARDIARQDHGIDEKLAYLSTFRPQVTQVVSGFDSVLVQNYLGGRLPAIEYDPANADFTIPLSCDHAPPHMPCAGTTVDVVLWDDLLRAEGGGWQEVRMAHGGRLRIQNAARTASLRVSQGLGVEIVH*
Ga0066684_1014722313300005179SoilFYVFVHVGEYGYIFSMLPGLAIIAARGAIALAKGLRRPRTFRWIVAGVVLANAAIYLLSDTPLSARDISRHDRGIDEKTALLGSYAPATTLVVSAYDSVLAENYLARLPGGLPLLEYDPANPDFTKPLACAAFPATMPCSGEALDVVLWDDLLRAEGTGWTERVLPHGARLRIAHVPRTSSLRVREGLGVEIVR*
Ga0066684_1055377223300005179SoilGPDSTLVVTAYDDVLVSYYVHGPAVLRYDPAATPTFTEPLACDASSRKPCAGTDLDVVLWDDLLRAEGPGWQEVRMPHGAHLRIAHVPRSASLRVSESLAVEIVR*
Ga0066684_1095413813300005179SoilLATFKPQTTEIVSGYDAVLSEYYLEQLPHGIPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVP*
Ga0066685_1061507813300005180SoilAPLPFYVFVHVGEYGYVFSMLPGLAIIAARGAIALAKGLRRPRTFRWIVAGVVLANAAIYLLSDTPLSARDISRHDRGIDEKTALLGSYAPATTLVVSAYDSVLAENYLARLPGGLPLLEYDPANPDFTKPLACAALPATMPCSGEALDVVLWDDLLRAEGTGWTERVLPHGARLRIAHVPRTSSLRVREGLGVEIVR*
Ga0066685_1073557723300005180SoilLAKGMRMPRTFRWIVAAAVLGNATIFLFSDTPISARDIARHDRGIDEKLAYLATLAPESTEVVTAYDGVLVDHYVPSTPVFRYDPAATPDFTLALACGKVTQHRPCADTDVDVVLWDDLLRAEGPGWLEVRMPHGARLRIAYLPRSASLRVSEGLGVEIVR*
Ga0066676_1069066323300005186SoilAARGAIALAKGMRMPRTFRWIVAAAVLGNATIFLFSDTPISARDIARHDRGIDEKLAYLATLAPESTEVVTAYDGVLVDHYVPSTPVFRYDPAATPDFTLALACGKVTQHRPCADTDVDVVLWDDLLRAEGPGWLEVRMPHGARLRIAYLPRSASLRVSEGLGVEIVR*
Ga0066675_1040855913300005187SoilEYGYIFSLLPGLTILAARGAIAFAKGIRMPRTFKWIVAAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSRSLRVSEGLGVEIVP*
Ga0065705_1109362513300005294Switchgrass RhizosphereMPRTFRWLVATVVLGNAAIFLLSDTPISARDVARHDRGIDEKMAYLASFPSESTQVVTAFDDVLVNHYLHGPPVLRYDPVADPAFTQPLSCEAAAPRKPCGGAEVDVVLWDDLLRGVGPGWQEVRLPHGGHLRIAHVARSASLRVSEGLAVEIVR*
Ga0070671_10102550013300005355Switchgrass RhizosphereARGSIALAKGLRMPRTFKWLVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKMAYLASFADDSVAVVTAYDDVLVNHYLRGPPVLQYDPVAMPAFTQSLACDAIPSRKACAGTDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRLAHVPRSAALRVSEGLGVEIVR*
Ga0070694_10105489613300005444Corn, Switchgrass And Miscanthus RhizosphereHDRGIDEKVAYLASFSPDSTEVVAAYDVVLVDHYLHGPPVLHYDPVARPEFVLPLSCDAPAAPLPCAGTDVDVVLWDDLLRAEGPGWQEVRMPHGARLRIARAPRSGSLRVSEGLGVAIVR*
Ga0070708_10210262713300005445Corn, Switchgrass And Miscanthus RhizosphereGEYGYIFSMLPGLVIIAARGSIALAKGLRMPRTFKWLVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKMAYLASFADDSVAVVTAYDDVLVNHYLRGPPVLQYDPVAMPAFTQPLACDAIPSRKACAGTDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSAALR
Ga0066686_1022887113300005446SoilEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR*
Ga0066682_1020007313300005450SoilAKGLRMPRTLRWMVAAAVLGNAAMFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVR*
Ga0066682_1081063313300005450SoilHVGEYGYIFSMLPGLAIIAARGAIALAKGLRMPRTFKWIVAAVVLGNAAIFVLSDTPISARDIARHDRGIDEKVGALAPFPPDSTLVVTAYDDVLVNRYLHGPAVLRYDPVATANFTAPLSCDPPAPQRPCAGTDVDVVLWDDLLRAEGPGWQEVRLPHGGHLRIAHVPRSASLRVSEGLGVEIVR*
Ga0070707_10040611613300005468Corn, Switchgrass And Miscanthus RhizosphereEYGYIFSMLPGLAILASRGAIALAKGMRMPRTFKWIVMAVVLGNAAIYLLSDTPLSARETARHDRGIDEKVAYLASFSPDSTEVVTAYDEVLVDHYLHGPPVFRYDPVARPDFALPLSCDALAPAGPCAGMDVDVILWDDLLRDEGPGWREVRMPHGARLRIARAPRSASLRVSEGLGVEIVR*
Ga0070697_10123600723300005536Corn, Switchgrass And Miscanthus RhizosphereHDRGIDEKVAYLASFAPGSTEVVTAYDDVLVNHYLHGPAVVRYDPAATPAFTASLSCEAAPSPKPCAGSDVDVVLWDDLLRAEGSGWQEVRMPHGGRLRIAHVARSASLRVSEGLGVEIVR*
Ga0070697_10148425913300005536Corn, Switchgrass And Miscanthus RhizosphereAKGMRMPRTFKWIVMAVVLGNAAIYLLSDTPLSARETARHDRGIDEKVAYLASFSPDSTEVVTAYDEVLVDHYLHGPPVFRYDPVARPDFALPLSCDAPAPAGPCAGTDVDVILWDDLLRAEGPGWREVRMPHGARLRIARAPRSASLRVSEGLGVEIVR*
Ga0070697_10199242313300005536Corn, Switchgrass And Miscanthus RhizosphereVVLGNAAIFLLSDTPISARDIARHDRGIDEKMAYLASFANDSVAVVTAYDDVLVNHYLRGPPVLQYDPVAMPAFTQSLACDAIPSRKACAGTDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSAALRVSEGLGVEIAR*
Ga0066701_1024163423300005552SoilIIAARGAVAFAKGIRRPRLFPWIVGAVALANAAIFLLSDTPISARDIARQDHGIDEKLAYLSTFRPQATQVVSGFDSVLVQNYLGGRLPAIEYDPANADFTIPLSCDHAPPHMPCAGTTVDVVLWDDLLRAEGGGWQEVRMAHGGRLRIQNAARTASLRVSQGLGVEIVH*
Ga0066701_1055153113300005552SoilRDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVR*
Ga0066661_1060320123300005554SoilGLTILAARGAIAFAKGIRMPRTFKWIVMAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVP*
Ga0066692_1073137823300005555SoilIARQDRGIDEKLAYLATFSPQTTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPADPAFTLPLSCAKAPPHMPCAEPEVDVVLWDDLLRAEGPGWQEVRMAHGGRLRIARVPRSAALRVSDGLGVEIVR*
Ga0066704_1010023733300005557SoilAKGLRMPRTFKWIVAAVVLANAAIFLLSDTPISARDIARQDRGIDEKLAYLAVFSPQTTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR*
Ga0066703_1019736823300005568SoilAAAVLGNAAIFLLSDTPLSARDIARHDRGIDEKIAYLASFAPEATLVVTAYDDALVNHYLHGPPVLRYDPVATPAFTEPLSCGAAAAREPCGGTDVDVVLWDDLLRAEGPGWQEVRMAHGARLRIAHVPRSMSLRVREGLGVEILR*
Ga0066703_1020150913300005568SoilLRMPRTFKWIVAAVVLANAAIFLLSDTPISARDIARQDRGIDEKLAYLATFSPQTTQIVSAYDAVLVGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR*
Ga0066705_1010428333300005569SoilVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVP*
Ga0066691_1063304123300005586SoilNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPVPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR*
Ga0066654_1084819813300005587SoilISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSRSLRVSEGLGVEIVP*
Ga0066696_1060121423300006032SoilEKVAALAPFGPDSTLVVTAYDDVLVSYYVHGPAVLRYDPAATPTFTEPLACDASSRKPCAGTDLDVVLWDDLLRAEGPGWQEVRMPHGAHLRIAHVPRSASLRVSEGLAVEIVR*
Ga0066696_1093053423300006032SoilDTPISARDIARHDRGLDEKVAALAPFPPDSTLVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLGCDTASSHKPCAGTDVAVVLWDDLLRAEGPGWQEVLMPHGGRLRIARVPRSASLRVSEGLGVEIVR*
Ga0066656_1094773523300006034SoilTPISARDIARHDRGIDEKLAYLATLAPESTEVVTAYDGVLVDHYVPSTPVFRYDPAATPDFTLALACGKVTQHRPCADTDVDVVLWDDLLRAEGPGWLEVRMPHGARLRIAYLPRSASLRVSEGLGVEIVR*
Ga0066652_10097902213300006046SoilYGYVFSMLPGLSVIAARGAIALAKGLRRPRSLRWLVAGVALGNAAIFLLSDAPISARDISRHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPMPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR*
Ga0066653_1008110823300006791SoilDEKLAYLATLAPESTEVVTAYDGVLVDHYVPSTPVFRYDPAATPNFTLALACGKVTQHRPCADTDVDVVLWDDLLRAEGPGWLEVRMPHGARLRIAYLPRSASLRVSEGLGVEIVR*
Ga0066658_1030282313300006794SoilAATVLGNATIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR*
Ga0066660_1010278043300006800SoilRMPRTFKWIVATVVLANAAIFLLSDTPISARDVARQDRGIDEKLAYLAAFSPETTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR*
Ga0066660_1103095323300006800SoilIFSMLPGLAIIAARGAIALAKGLRRPRTFRWIVAGVVLANAAIYLLSDTPLSARDISRHDRGIDEKTALLGSYAPATTLVVSAYDSVLAENYLARLPGGLPLLEYDPANPDFTKPLACAAFPATMPCSGEALDVVLWDDLLRAEGTGWTERVLPHGARLRIAHVPRTSSLRVREGLGVEIVR*
Ga0075425_10145582523300006854Populus RhizosphereELRDRWRTAFMILWTFAPLPFYVFVHVGEYGYVFSMLPGLAILAARGAIALAKGLRMPRTFRWIVAAVALGNASIFLLTDTPISARDIARHDRGIDEKIAYLESFSPQTTQIVSAYDAVLVGYYLERLPHDVPPLLGYDPANPGFTMPLACGRHAQTTPCADTSVDVVLWDDLLRAEGPGWQEVRMPHGSRLRIAHVPRSSSLRVSEGLGVEIVR*
Ga0066710_10079586313300009012Grasslands SoilPGLAILSARGAIALAKGLRMPRTFKWIVAAVVLANAAIFLLSDTPISARDIARQDRGIDEKLAYLATFSPQTTQIVSAYDAVLVGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR
Ga0066710_10187420913300009012Grasslands SoilGLRMPRTFKWIVMAVAIANAAIFLLSDSPISARDIARHDRGLDEKVAYLSSFPPQTTFVVSAYDGVLAENHLQRLPGGLPLLEYDPANPDFTKPLSCDAAPQTMPCSGEAVDVVLWDDLLRAEGPGWREVRMPHGARLRIARAPRSASLRVSDGLGVEIVR
Ga0066710_10225422023300009012Grasslands SoilLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLERLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSGSLRVSEGLGVEIVP
Ga0066710_10245151513300009012Grasslands SoilEYGYIFSMLPGLAIIAARGAIALAKGLRMPRTFKWIVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVGALAPFPPDSPLVVTAYDDVLVNRYLHGPAVLRYDPIATSDFTLSLACDIAPARKPCAGNDVSVVLWDDLLRAEGPGWEEVRMPHGARLRIAHVPRSAWLRVSEGLGVEILR
Ga0066710_10413711813300009012Grasslands SoilMLAARGSIALAKGLRMPRTFKSIVAVVVLANAAVFHLTDTPISARDIARHDRGVDEKAAYLKATLAPDATLVLTAYDAVLVEHYLPRGYITFAYDPASTPALTRSLGCDRTPPPCGGPEVEVVLWDDILRAVGDGWQEVRMPHGARLRIARVPRSATLRVSGGLGVEIVR
Ga0099827_1030211513300009090Vadose Zone SoilTFKWIVMAVVLGNTAIYLLSDTPISAGDIARHDRGIDEKVAYLAAFSPQTTFVVSAYDALLAETYLGRLPGGLPLLEYDPANPAFTKPLSCGAAPPTMPCSGETVDVVLWDDLLRAEGPGWQEVRMAHGARLRIAHVARSASLRVSEGLGVEIVR*
Ga0114129_1279399713300009147Populus RhizosphereTIFLLSDTPISARDIARHDRGIDEKIAYLASFAPDRTEVVTAYDDVLVNHYLHGPAVLRYDPVAMPAFTAALACDAAPSHTPCAGTDVDVVLWDDLLRAEGTGWEEVRMPHGARLRIARVPRSASLRVSVSPAVEIVR*
Ga0075423_1285370723300009162Populus RhizosphereFAPDRTEVVTAYDDVLVNHYLHGPAVLRYDPVAMPAFTAALACDAAPSHTPCAGTDVDVVLWDDLLRAEGTGWEEVRMPHGARLRIAHVPRSASLRVSEGLGVEIVR*
Ga0134070_1035547513300010301Grasslands SoilLAKGLRMPRTLRWMVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVR*
Ga0134088_1060180913300010304Grasslands SoilPGLAIVAARGAIALAKGMRMPRTFKWIVMAVVLGNAAIYLLSDTPISAGDIARHDRGIDEKVAYLAALSPQTTFVVSAYDALLAETYLGRLPGVLPLLEYDPANSDFTKPLSCGAAPPTMPCSGETVDVVLWDDLLRAEGPGWQEVRMAHGARLRIAHVARSASLRVSEGLAVGIVR*
Ga0134109_1039546413300010320Grasslands SoilTFKWLVMAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLAPFAPDSTEVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLACDAASAHKPCAGSDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIARVPRTASLRVSEGLGVEIVR*
Ga0134067_1028610413300010321Grasslands SoilAILAARGAIALAKGMRMPRTFKWIVAATVLGNATIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR*
Ga0134111_1051294513300010329Grasslands SoilNAAFFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVP*
Ga0134063_1013177013300010335Grasslands SoilARGAIALAKGMRMPRTFKWIVAATVLGNATIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYLHTAPAYRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVAIVR*
Ga0134071_1051867013300010336Grasslands SoilFVHVGEYGYIFSMLPGLVIIAARGAIALAKGLRMPRTLRWMVAAAVLGNAAMFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCANTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVR*
Ga0134126_1172794613300010396Terrestrial SoilAARGSIALAKGLRMPRTFKWLVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKMAYLASFADDSIAVVTAYDDVLVNHYLRGPPVLQYDPVAMPAFTQPLACDAIPSRKACAGTDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSAALRVSEGLGVEIAR*
Ga0137388_1034962123300012189Vadose Zone SoilGEYGYIFSMLPGLAILAARGAIALAKGMRMPRTFKWIVATVVLGNAFIYLLSDTPLSARDIARHDRGIDEKVAYLSTLAPESTQVVTAYDGVLVDHYLHGPPVFRYDPAATSEFTLPLSCAKARPPGPCAEAQVDVVLWDDLLRAEGPGWLEVRMPHGARLRIARLPRSASLRVSEGLGVEIVR*
Ga0137382_1084688313300012200Vadose Zone SoilSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPAFTLPLSCDTPPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVPRSASLRVSEGLGVQIVR*
Ga0137399_1075943323300012203Vadose Zone SoilKGLRMPRTFKWIVAVAVLGNAAIFLLTDTPISARDIVRHDRGTDEKVAYLSSFSPQTTLVVSGYDAVLAENYLQRLPGGLPLLEFDPASPDFTKPLSCGAAPPTMPCSGETVDVVLWDDLLRAEGESWQDVRVPHGARLRIARVPRSASLRVSDGLRVEIVR*
Ga0137362_1128992013300012205Vadose Zone SoilFSMLPGLVIIAARGAIALAKGLRMPRTFRWIVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPRKPCADRNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVR*
Ga0137376_1029408723300012208Vadose Zone SoilAPLPFYVFVHVGEYGYIFSMLPGLVIIAARGAIALAKGLRMPRTFRWIVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVPRSASLRVSEGLGVEIVR*
Ga0137377_1090796423300012211Vadose Zone SoilMRMPRTFKWIVAAAVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTEVVTAYDGVLVDHYLRNTPVFRYDPAATSDFMLPLACDKATQHRPCADTDVDVVLWDDLLRAEGPGWQEVRMPHGARLRIARLPRAASLRVSGGLGVEIVR*
Ga0137370_1095035013300012285Vadose Zone SoilALAPFPPDSTLVVTAYDDVLVNRYLHGPAVLRYDPIATSDFTLSLACDIAPARKPCAGNDVSVVLWDDLLRAEGPGWEEVRMPHGARLRIAHVPRSAWLRVSEGLGVEILR*
Ga0137370_1099766713300012285Vadose Zone SoilRVELRDRIRTAFIALWLLTPLAFYVFVHVGEYGYIFSMLPGLAILAARGAIALAKGMRMPRTFKWIVMAAVVGNAAIYLFSDTPISARDIARHDRGIDEKIAALAPFRSDSTLVVTAYDDVLVNYYLHGPAVLRYDPAARPAFTEPLACDPSSPRMPCAGTEVDVVLWDDL
Ga0137385_1058958023300012359Vadose Zone SoilYGYVFSMLPGLAILASRGTIALAKGMRMPRTFKWIVMAVVLGNAAIYLLSDTPLSARDIARHDRGIDEKVAYLASFSPNSIEVVAAYDEVLVDHYLHGPPVLRYDPVASPELVLPLSCDAPAPPRPCAGTDVDVVLWDDLLRAEGPGWREVRMPHGARLRIARAPRSGSLRVSEGLGVEIVR*
Ga0137360_1190561413300012361Vadose Zone SoilLAKGMRIPRTFKWIVMTVVLGNTAIYLLSDTPISAGDIARHDRGIDEKVAYLASFSPQTTLVVSAYDALLAQTYLGRLPGGLPLLEYDPANPDFTKPLSCGGAPPTIPCSGETVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSASLRVSEGLGVEILR*
Ga0137396_1022883913300012918Vadose Zone SoilKWIVMAVVLGNAAIFLLSDTPISARDIARHDRAIDEKVAHLGSFSPEATLVVTAYDEVLVTHYLHGPPVLRYDPAATPNFTVPLACDAAPPRTPCAGNDVDVVLWDDLLRAEGSGWQEVRMSHGSRLRIAHVPRSASLRVSEGLGVEIVR*
Ga0137394_1140344413300012922Vadose Zone SoilAPLPFYVFVHVGEYGYIFSMLPGLAIVAARGAIALAKGLRMPRTFRWIVAGAVLGNAAIFLLSNTPLSARDIARQDRGIDEKVAALAPFPQNSTLVVTAYDDVLVNRYLHGPAVLRYDPVATPNFTAPLSCVAPAAQKPCAGTDVDVVLWDDLLRAEGPGWQEVRLPHGGRLRIAHVPRSASLRV
Ga0137419_1091961113300012925Vadose Zone SoilKGLRMPRTFKWIVMAVVVGNAAIYLLSDTPISARDVARHDRGIDEKVAYLASFAPDTTQVVAAYDAVLAENYLQRLPGGLPLLGYDPANPGFTVPLSCAGAPEHAHCTGTDVDVVLWDDLLRAEGPGWQEIRMPHGARLRIAHVARSASLRVSDGLGVEIVR*
Ga0137407_1056799723300012930Vadose Zone SoilARGAIALAKGLRMPRTFKWIVAVVVLSNAAIFLFSDTPISARDIVRHDRGIDEKVAYLASFSPESTQVVTAYDDVLVNRYLRGPPVLRYDPVATPEFTVSLACDAAPPRKPCSATDVDVVLWDDLLRAEGPGWQEVRMPHGARLRIAHVPRSASLRVSEGLGVEIAR*
Ga0134110_1020827813300012975Grasslands SoilDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACTKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR
Ga0134087_1000245113300012977Grasslands SoilHVGEYGYIFSLLPGLTILAARGAIAFAKGIRMPRTFKWIVAAVVLGNASIYIVSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSRSLRVSEGLGVEIVP*
Ga0120172_111482813300013765PermafrostALAQHPPRAARHDRGIDEKMAYLASFAPDSTQVVTAYDDVLVNHYLHGPAVLRYDPVASPEFVMPLACDAAPAHKPCADTDVDVVLWDDLLRAVGPGWQEVRMPHGGRLRIAHVARSASLRVSEGLGVEIVR*
Ga0120125_105545713300014056PermafrostMTVLSSEARRIELRDRWRTTFIALWLLTPLAFYLLIHVGEYGYIFSMLPGLAILGARGTIALAKGMRMPRTFKWIVMAVVVGNAAIFLLSDTPISARDVARHDHGIDEKVAYLTSLPPESMLVLTAYDEVLVNHYLHGPPVLQYDPAATPAFTQPLACDASPSRRPCVDTSVDVVL
Ga0134075_1015099923300014154Grasslands SoilAPLPFYVFVHVGEYGYIFSMLPGLAMLAARGSIALAKGLRMPRTFKSIVAVVVLANAAVFLLTDTPISARDIARHDRGVDEKAAYLKATLAPDATLVLTAYDAVLVEHYLPRGYITFAYDPASTPALTRSLGCDRTPPPCGGAEVEVILWDDILRAIGDGWQEVRMPHGARLRIARAPRSATLRVSEGLGVEIVR*
Ga0134075_1028673323300014154Grasslands SoilGLRRPRSLRWLVAGVALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLAYDPAGHPGFTRPLACAASPPPVPCSGETVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR*
Ga0134078_1028492523300014157Grasslands SoilAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSGSLRVSEGLGVEIVP*
Ga0134079_1057033313300014166Grasslands SoilLFIWTLAPLPFYVFVHVGEYGYVFSMLPGLAIIAARGSIALAKGLRRPRTFRWIVAGVVLANAAIYLLSDTPLSARDISRHDRGIDEKTALLGSYAPATTLVVSAYDSVLAENYLARLPGSLPLLEYDPANPDFTKPLACAAFPATMPCSGEALDVVLWDDLLRAEGTGWTERVLPHGARLRIAH
Ga0120104_109664513300014829PermafrostVDARRVELRDRWRTAFMVLWTFAPIPFYVFVHIGEYGYIFSMLPGLVIIAARGSIALAKGLRMPRTFKWLVAGVVLGNAAIFLLSDTPISARDISRHDRGIDEKMAYLASFADDSTAVVTAYDDVLVNHYLHGPPVLQYDPIAMPSFTQPLACEAIPSRKACAGTDVAVVLWDDLLRAEGPGWQEVRMPHGGRLR
Ga0134112_1018196813300017656Grasslands SoilLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPMPCSGDTVDVVLWDDTLSPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR
Ga0184605_1032047813300018027Groundwater SedimentNAAIFLFSDTPISARDVARHDRGIDEKAAYLASFAPESTQVVTAYDDVLVNYYLHGPPVLRYDPVATPEFTQPLSCEAAAPRKPCAGADVDVVLWDDLLRAEGPGWQEVRLPHGGHLRIAHVARSASLRVSEGLGVEFVR
Ga0184608_1003806953300018028Groundwater SedimentNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFPRDSTQVVTAFDDVLVNNYLHGPAVLRYDPIANGAFVASLACDAVASYKPCANTDVDVVLWDDLLRAVGPDWQEIRMPHGGRLRIAHVSRSASLRVSEGLGVEIVR
Ga0066667_1004887413300018433Grasslands SoilRSLRWLVAGVALGNAAIFLLSDAPISARDISRHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPMPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR
Ga0066662_1029627023300018468Grasslands SoilGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHSTPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR
Ga0066669_1206196713300018482Grasslands SoilIFSMLPGLAIVAARGSIALAKGMRMPRTFKWLVMAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLAPFAPDSTEVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLACDAASAHKPCAGSDVAVVLWDDLLRAEGPGWQEVLMPHGGRLRIARVPRSASLRVSEGLGVEIV
Ga0193747_103867423300019885SoilIFLLSDTPISARDIARHDLGIDEKVAYLASFAPESMLVVTAYDDVLVNHYLHGPPVLQYDPAATPEFARPLVCDTSSSRRPCAGTDVDVVLWDDLLRAEGQGWQEVRMPHGARLRIAHAPRSASLRVSEGLGVDIVR
Ga0193719_1007599023300021344SoilSIYLLSDTPLSARDVARHDRGIDEKVAALAPFPPESTLVVTAYDDVLVNYYLHGPAVLRYDPIATPAFTEPLACDAASSHKPCSGTDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVGRSASLRVSEGLGVEIVR
Ga0224452_102461213300022534Groundwater SedimentYGYVFSMLPGLAIVAARGAIALAKGLRMPRTFKWIVAAVVLGNAAIFLLSDTPLSARDIARHDRGIDEKAAAIAPFPPDSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTQPLACNAASGHRPCSDKDVDVVLWDDLLRAEGQGWQEVRMPHGTRLRIAHVPRSASLRVSEGLGVEIVR
Ga0222623_1028501923300022694Groundwater SedimentYIFVHVGEYGYIFSMLPGLVIIAARGAIALAKGLRMPRTFKWIVAGVVLSNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFPRDSTQVVTAFDDVLVNNYLHGPAVLRYDPIANGAFVASLACDAVASYKPCANTDVDVVLWDDLLRAVGPDWQEIRMPHGGRLRIAHVSRSASLRVSEGLGVEIVR
Ga0207646_1181903913300025922Corn, Switchgrass And Miscanthus RhizosphereGAIALAKGVRMPRTFKWIVAAAVLANAAIFLLSDTPISARDIARHDRGIDEKIAYLASFAPDRTQVVTAYDDVLVNHYLRGPAVLRYDPVAMPAFTAALACDAAPSHPPCAGTDVDVVLWDDLLRAEGTGWEEVRMPHGARLRIAHVPRSSSLRVSEGLGVEIVR
Ga0209234_116233723300026295Grasslands SoilKGMRMPRTFKWIVAAAVLGNAAIFLLSDTPLSARDIARHDRGIDEKIAYLASFAPEATLVVTAYDDALVNHYLHGPPVLRYDPVATPAFTEPLSCGAAVAREPCGGTDVDVVVWDDLLRAEGPGWQEVRMAHGARLRIAHVPRSMSLRVREGLGVEILR
Ga0209027_129389423300026300Grasslands SoilAAIFLLSDTPISARDIARHDRGLDEKVAALAPFPPDSTLVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLGCDTASSHKPCAGTDVAVVLWDDLLRAEGPGWQEMRMPHGARLRIARVTRSASLRVSEGLGVEIVR
Ga0209239_114584213300026310Grasslands SoilRTALMVLWTFAPLPFYVFVHVGEYGYIFSVLPGLAIIAARGAIALAKGLRMPRTFKWIVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVGALAPFPPDSTLVVTAYDDVLVNRYLHGPAVLRYDPIATSDFTLSLACDIAPARKPCAGNDVDVILWDDLLRAEGPGWEEVRMPHGARLRIAHVPRSASLRVSEGLGVEILR
Ga0209155_1000102373300026316SoilIHVGEYGYIFSLLPGLTILAARGAIAFAKGIRMPRTFKWIVAAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSGSLRVSEGLGVEIVP
Ga0209154_132075113300026317SoilIHVGEYGYIFSLLPGLTILAARGAIAFAKGIRMPRTFKWIVMAVVLGNASIYLLSDTPISARDIARHDRGIDEKLAYLATFKPQTTEIVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSR
Ga0209471_125877613300026318SoilRVELRDHWRAALVILWTFAPLPFYLFVHVGEYGYIFSMLPGLAILSARGAIALAKGLRMPRTFKWIVAAVVLANAAVFLLSDTPISARYIARQDRGIDDKLRYLAAFSPQTTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIA
Ga0209131_137318613300026320Grasslands SoilDTPISARDVARHDRGIDEKVAYLSSFAPETTQVVTAYDDVLINHYLHGPAVLQYDPIASSEFTTPLSCDAAAPRKPCAGADVDVVLWDDLLRAVGPGWQEVRMPHGARLRIAHVARSASLRVSEGLGVEIVR
Ga0209152_1010328313300026325SoilVAATVLGNATIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR
Ga0209802_120860313300026328SoilGEYGYVFSMLPGVSVIAARGAIALAKGLRRPRSLRWLVAGVALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPVPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR
Ga0209267_128905113300026331SoilARGAIALAKGLRMPRTFKWIVAAVVLANAAIFLLSDTPISARDIARQDRGVDEKLAYLATFSPQTTQIVSAYDAVLAGYYLERLPHGLPPLLGYDPANPGFTLPLSCANAPPHMPCADPDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIARAPRSAALRVSDGLGVEIVR
Ga0209804_100537913300026335SoilGAVALANAAIFLLSDTPISARDIARQDHGIDEKLAYLSTFRPQATQVVSGFDSVLVQNYLGGRLPAIEYDPANADFTIPLSCDHAPPHMPCAGTTVDVVLWDDLLRAEGGGWQEVRMAHGGRLRIQNAARTASLRVSQGLGVEIVH
Ga0209159_101060113300026343SoilPISARDIARHDRGIDEKLAFLATFKPQTTEIVSGYDAVLAGYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAQAGPHMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSGSLRVSEGLGVEIVP
Ga0209159_104431013300026343SoilAARGAIALAKGMRMPRTFKWIVAATVLGNATIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTQVVTAYDGVLVDHYVHTAPVFRYDPAASSDFTLPLACGKVTQHRPCADTDVEVVLWDDLLRAEGAGWQEVRMPHGARLRIARVPRSASLRVSEGLGVEIVR
Ga0209159_120465013300026343SoilLPFYLFVHVGEYGYVFSMLPGLSVIAARGAIALAKGLRRPRALRWLVAGVALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPMPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR
Ga0209808_120228913300026523SoilFSMLPGLAIVAARGSIALAKGMRMPRTFKWLVMAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLAPFAPDSTEVVTAYDDVLVSYYLHGPAVLRYDPVATPAFTEPLACDAASAHKPCAGSDVAVVLWDDLLRAEGPGWQEVLMPHGGRLRIARVPRSASLRVSEGLGVEIVR
Ga0209690_112988523300026524SoilTFAPLPFYLFVHVGEYGYVFSMLPGLSIIAARGAIALAKGLRRPRSLRWLVAGAALGNAAIFLLSDAPISARDIARHDHGIDEKIAYLSTFAPETTSVVTAYDTLLVEHYLKGLPVLPYDPAGHPGFTRPLACAASPPPVPCSGDTVDVVLWDDTLRPEGPGWQEVPMPHGARLRIARVPRASSLRVSEGLGVAIIR
Ga0209378_115879723300026528SoilPLPFYVFVHVGEYGYIFSMLPGLVIIAARGAIALAKGLRMPRTLRWIVAAVVLGNAAIFVLSDTPISARDIARHDRGIDEKIAYLASFAPESTQVVTAYDGVLVNHYLQGPPVLRYDPVETPEFTLPLSCDTAPPHKPCADTNVDVVLWDDLLRAEGPGWQEVRMVHGARLRIAHVARSASLRVSEGLGVEIVH
Ga0209056_1005308913300026538SoilSMLPGLALLAARGAIALAKGMRMPRTFKWIVAAAVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFASESTEVVTAYDGVLVDRYLRNTPVFRYDPAATSDFMLPLACDKATQHRPCADTDVDVVLWDDLLRAEGPGWQEVRMPHGARLRIARLPRAASLRVSDGLGVEIVR
Ga0209056_1042166413300026538SoilPISARDIARHDRGIDEKLAYLATFKPQTTEVVSGYDAVLAEYYLEQLPHGLPPLLSYDPVGNPGFTLPLSCAHAGPRMPCAESDVDVVLWDDLLRAEGSGWQEVRMPHGSRLRIAHVPRSASLRVSEGLGVEIVP
Ga0209805_106753433300026542SoilYVFSMLPGLAILAARGAIALAKGMRMPRTFKWIVAAVVLGNAAIFLFSDTPISARDIARRDRGIDEKLAYLATLAPESTEIVAAYDGVLVDHYVRSAAVFRYDPATTPDFTLPLACGRVTQHRPCADSDVDVVLWDDLLRAEGSDWLEVRMPHGARLRIAYLPRSASLRVSGGLGVEIVR
Ga0307313_1003905313300028715SoilARHDRGIDEKVAALAPFPADSTLVVTAYDDVLVNYYLHGPAVLRYDPIATPAFTEPLACDAASSHKPCSGTDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307307_1011498123300028718SoilMAVVIGNASIYLLSDTPLSARDIARHDRGIDEKVAALAPFPPESTLVVTAYDDVLVNHYLHGPAVVRYDPAETPAFVQPLACNANQSRKPCVGADVDVVLWDDLLRAEGPGWQEVRMPHGARLRIAHVPRSASLRVSEGLGVEILR
Ga0307282_1016272523300028784SoilAPFPADSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTEPLACDAALSHKPCAGTDVALVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307290_1010339013300028791SoilRGAIALAKGTRMPRTFKWIVMAVVIGNASIYLLSDTPLSARDIARHDRGIDEKVAALAPFPPESTLVVTAYDDVLVNHYLHGPAVVRYDPAETPAFVQPLACNANQSRKPCVGADVDVVLWDDLLRAEGPGWQEVRMPHGARLRIAHVPRSASLRVSEGLGVEILR
Ga0307290_1037074113300028791SoilAATLISVDARRIELRDRRRTAFMFLWTFAPVPFYVFVHVGEYGYVFSMLPGLAIVAARGAIALAKGLRMPRTFKWIVAAAVLGNAAIFLLSDTPLSARDIARHDRGIDEKAAAIAPFPPDSTVVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTQPLSCNAASGHRPCSDKDVD
Ga0307504_1035245113300028792SoilLATFTPDSTEVVTAYDDVLVNHYLHGPAVLRYDPTATPAFTAPLSCEGVPSPKPCAGSDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVARSASLRVSEGLGVEVVR
Ga0307299_1009523513300028793SoilTAIAPFPPDSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTQPLACNAASGHRPCSDKDVDVVLWDDLLRAEGPGWQEVRMPHGTRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307284_1011577423300028799SoilFPADSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTEPLACDAASSHKPCSGTDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307305_1017770913300028807SoilLLSDTPVSARDIARHDRGIDEKVTAIAPFPPDSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTQPLACNAASGHRPCSDKDVDVVLWDDLLRAEGQGWQEVRMPHGTRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307305_1025708013300028807SoilVATLAPFPADSTLVVTAYDDVLVNYYLHGPAVLRYDPAATPAFTEPLACDAASSHKPCSGTDVAVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVPRSASLRVSEGLGVEIVR
Ga0307292_1014201823300028811SoilGYIFSMLPGLVIIAARGSIALAKGLRMPRTFRWLVGAVVLGNAAIFLLSDTPISARDIARHDRGIDEKMTYLASFADESTAVVTAYDDVLVNHYLRGPPVLQYDPVAMPSFTQPLACGAITSRKACAGADVDVVLWDDLLRAEGPGWQEVRMPHGGRLRIAHVSRSAALRVSEGLGVEIV
Ga0307292_1037535123300028811SoilSMLPGLVIIAARGAIALAKGLRMPRTFKWIVAGVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAYLASFPRDSTQVVTAFDDVLVNNYLHGPAVLRYDPIANGAFVASLACDAVASYKPCANTDVDVVLWDDLLRAVGPDWQEIRMPHGGRLRIAHVSRSASLRVSEGLGVEIVR
Ga0307310_1030679023300028824SoilPRTFKWLVAAVVLGNAAIFLLSDTPISARDIARHDRGIDEKMAYLASFVDDSTAVLTAYDDVLVNHYLRGPPVLQYDPVAMPAFTQSLECDAIPSRKACAGTDVDVVLWDDLLRAEGPGWQEVRMPHGGRLRLAHVPRSAALRVSEGLGVEIVR
Ga0307312_1006463643300028828SoilLAKGMRMPRTFKWIVMAVVLGNAAIFLLSDTPISARDIARHDRGIDEKVAALAPFPADSTLVVTAYDDVLVNYYLHGPAVLRYDPIATPAFTEPLACDAASSHKPCSGTDVAVVLWDDLLRAEGSGWQEVRMPHGGRLRIAHVGRSASLRVSEGLGVEIVR
Ga0307277_1013796513300028881SoilAIFLLSDTPISARDIARHDRGIDEKIAALAPFAPDSTLVVTAYDDVLVNYYLHGPAVLRYDPVATPAFTERLSCDAGSSHKACAGTDVDVVLWDDLLRAEGPGWREVRMPHGARLRIAHVPRSASLRVSDGLGVEIVR
Ga0307469_1054357613300031720Hardwood Forest SoilSDTPISARDIARHDRGIDEKVAYLATFTPGSTEVVTAYDDVLVNHYLHGPAVLRYDPAATPAFTAPLSCESAASAKPCAGSDVDVVLWDDLLRAEGSGWQEVRMPHGGRLRIAHVARSASLRVSEGLGVEIVR
Ga0307473_1119262913300031820Hardwood Forest SoilDRGIDEKVAYLATFTPGSTEVVTAYDDVLVNHYLHGPAVLRYDPAATPAFTAPLSCESAASAKPCAGSDVDVVLWDDLLRAEGSGWQEVRMPHGGRLRIAHVARSASLRVSEGLGVEIVR
Ga0307471_10103258013300032180Hardwood Forest SoilLLSDTPISARDVARHDRGIDEKMAYLASFAPDATQIVTAYDDVLVNHYVHGPAVLQYDPVGMPEFIQPLACDASPSRGPCAGTDVDVVLWDDLLRAEGPGWQEVRMPHGARLRIAHVPRSASLRVSEGLGVEIVR
Ga0307472_10021086323300032205Hardwood Forest SoilALWTLAPLPFYVFVHVGEYGYVFSMVPGLAIVAARGAIALAKGIRRPRTFRWIVAGVVLGNAAIYLLSDTPISARDIARHDRGIDEKAAYVRAKLKPATTFVVTAYDAVLVDRYLGSEYPTIAYDPVAFPDYRQALACASQAACDGTEVDVVLWDDLLRAKGAGWAEVQMPHGGRLRITHVPRSDSLRVSEGLGVEIAP
Ga0307472_10232937213300032205Hardwood Forest SoilLRDRWRTAFMIVWTFAPLPFYVFVHIGEYGYIFSMLPGLVIIAARGSIALAKGLRMPRTFRWLVAAVVLGNAGIFLLSDTPISARDIARHDRGIDEKVAYLATFTPGSTEVVTAYDDVLVNHYLHGPAVLRYDPAATPAFTAPLSCESAASAKPCAGSDVDVVLWDDLLRAEGSGWQEVR
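
Each row above runs the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) together without delimiters. The following is a minimal Python sketch for splitting such a row back into fields. It rests on assumptions inferred from the rows themselves rather than on any documented NMPFamsDB format: the sample taxon ID is a 10-digit number beginning with `3300`, the habitat is the only field containing lowercase letters, and the sequence is written in uppercase with an optional trailing `*`. The names `FamilyRow`, `ROW_PATTERN`, and `split_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    ends_with_star: bool  # True if the row's sequence carried a trailing "*"


# Assumed layout: <protein ID><3300 + 6 digits><habitat><UPPERCASE sequence>[*]
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"               # shortest prefix before the taxon ID
    r"(?P<taxon_id>3300\d{6})"             # assumption: 10-digit ID starting with 3300
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"  # habitat text, ends in a lowercase letter
    r"(?P<sequence>[A-Z]+)(?P<star>\*?)$"  # residues, optional trailing "*"
)


def split_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row; return None if it does not fit the assumed layout."""
    m = ROW_PATTERN.match(line.strip())
    if m is None:
        return None
    return FamilyRow(
        protein_id=m["protein_id"],
        taxon_id=m["taxon_id"],
        habitat=m["habitat"],
        sequence=m["sequence"],
        ends_with_star=(m["star"] == "*"),
    )


# One row copied from the table above, wrapped only for readability.
example = (
    "Ga0066684_1055377223300005179Soil"
    "GPDSTLVVTAYDDVLVSYYVHGPAVLRYDPAATPTFTEPLACDASSRKPCAGTDLDVVLWDD"
    "LLRAEGPGWQEVRMPHGAHLRIAHVPRSASLRVSESLAVEIVR*"
)
row = split_row(example)
print(row)
```

A protein ID that itself contains `3300` followed by six digits and an uppercase letter would be split too early, so results are worth checking against the known list of protein IDs before use.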




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice