NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F071768


Overview

Basic Information
Family ID: F071768
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 122
Average Sequence Length: 199 residues
Representative Sequence: MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWREN
Number of Associated Samples: 95
Number of Associated Scaffolds: 122

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 67.77 %
% of genes near scaffold ends (potentially truncated): 98.36 %
% of genes from short scaffolds (< 2000 bps): 93.44 %
Associated GOLD sequencing projects: 91
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.79


Most Common Taxonomy
Group: Bacteria (67.213 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (23.770 % of family members)
Environment Ontology (ENVO): Unclassified (20.492 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (68.033 % of family members)
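
The "% of family members" figures in this Overview can be turned back into approximate member counts. The following is a minimal Python sketch, assuming each percentage is taken over all 122 sequences of the family; that denominator is an assumption, not something the page states. The dictionary labels and the helper name `members_from_percent` are illustrative only.

```python
# Number of sequences in family F071768, as listed under Basic Information.
FAMILY_SIZE = 122

# Percentages of family members quoted in the Overview.
# The short labels are illustrative names chosen for this sketch.
OVERVIEW_PERCENTAGES = {
    "Bacteria (NCBI taxonomy)": 67.213,
    "GOLD ecosystem: Soil": 23.770,
    "ENVO: Unclassified": 20.492,
    "EMPO: Soil (non-saline)": 68.033,
}


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a percentage of family members into a whole-number count.

    Assumes the percentage was computed over `family_size` sequences.
    """
    return round(percent / 100 * family_size)


member_counts = {
    label: members_from_percent(pct)
    for label, pct in OVERVIEW_PERCENTAGES.items()
}

if __name__ == "__main__":
    for label, count in member_counts.items():
        print(f"{label}: about {count} of {FAMILY_SIZE} members")
```

Under that assumption the four percentages map to whole numbers of sequences, which is consistent with 122 being the denominator.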




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
ID Label
1 ICChiseqgaiiDRAFT_06414321
2 JGI10214J12806_106542841
3 JGI10216J12902_1050385681
4 JGI10216J12902_1157207481
5 F14TB_1019389861
6 C688J35102_1185416831
7 C688J35102_1187932911
8 C688J35102_1200262241
9 C688J35102_1206397632
10 Ga0055455_100103982
11 Ga0055455_100220962
12 Ga0063454_1011587681
13 Ga0062593_1024508531
14 Ga0063455_1003348522
15 Ga0063455_1005951371
16 Ga0063455_1012508261
17 Ga0062589_1000397381
18 Ga0062595_1010562271
19 Ga0062595_1011892091
20 Ga0062595_1013235231
21 Ga0062592_1008213931
22 Ga0062592_1009126272
23 Ga0062592_1011710501
24 Ga0062591_1005518462
25 Ga0066812_10091511
26 Ga0066814_100291052
27 Ga0066388_1005020242
28 Ga0066388_1060968941
29 Ga0070741_100286578
30 Ga0081455_101894981
31 Ga0081455_105562672
32 Ga0081455_107915451
33 Ga0081540_12964631
34 Ga0075365_106455901
35 Ga0074056_111332451
36 Ga0074054_120156991
37 Ga0074048_100479361
38 Ga0075431_1021734881
39 Ga0079219_107168801
40 Ga0066710_1035291721
41 Ga0105245_109185092
42 Ga0105242_112121411
43 Ga0126307_100342921
44 Ga0126305_103439001
45 Ga0126305_106241301
46 Ga0126304_101794351
47 Ga0126304_105106381
48 Ga0126315_104822561
49 Ga0126309_111966391
50 Ga0126308_107640722
51 Ga0126308_107657401
52 Ga0126308_109434911
53 Ga0126308_110021771
54 Ga0126314_105624701
55 Ga0126311_100478341
56 Ga0126311_102394531
57 Ga0126306_109341391
58 Ga0126376_115090392
59 Ga0126377_127753941
60 Ga0134127_120429641
61 Ga0134122_119272831
62 Ga0137365_110588241
63 Ga0137374_106985541
64 Ga0150985_1017024892
65 Ga0150985_1168446561
66 Ga0137369_104118511
67 Ga0137368_101420141
68 Ga0150984_1170004071
69 Ga0157288_102804501
70 Ga0157301_101990611
71 Ga0164303_112827691
72 Ga0164299_101911431
73 Ga0164301_113131241
74 Ga0164308_102244701
75 Ga0164304_114096291
76 Ga0164306_102429771
77 Ga0164305_101123552
78 Ga0173483_100564102
79 Ga0173480_103217622
80 Ga0173478_108049821
81 Ga0137412_105806192
82 Ga0132256_1015304352
83 Ga0132255_1009709611
84 Ga0184624_104243531
85 Ga0190270_127766091
86 Ga0066669_119843361
87 Ga0222622_105080641
88 Ga0222622_114423241
89 Ga0247789_11075531
90 Ga0179589_105348871
91 Ga0210142_10031653
92 Ga0207687_107531421
93 Ga0209177_103206691
94 Ga0307313_101957371
95 Ga0307311_100949242
96 Ga0307298_101314112
97 Ga0307307_101976111
98 Ga0307317_100785621
99 Ga0307297_101474591
100 Ga0307323_100814912
101 Ga0307299_102802921
102 Ga0307287_103456371
103 Ga0307305_101668782
104 Ga0307296_100393923
105 Ga0307296_101087821
106 Ga0307296_105596961
107 Ga0307310_103891981
108 Ga0307310_107248341
109 Ga0307304_100792661
110 Ga0247827_105168431
111 Ga0247826_112226572
112 Ga0307469_118871801
113 Ga0307468_1003630181
114 Ga0308176_107706181
115 Ga0308176_123169271
116 Ga0268251_102993161
117 Ga0307470_116899481
118 Ga0307471_1016941601
119 Ga0326730_11045921
120 Ga0247830_103010691
121 Ga0247830_108857691
122 Ga0326723_0219372_143_844

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 69.95%, β-sheet: 0.00%, Coil/Unstructured: 30.05%
[Feature Viewer: tracks plotted along the representative sequence (axis ticks 20 to 180): Sequence, α-helices, β-strands, Coil, SS Conf. score, TM segments, Topol. domains. Three topological domains are labelled Cytopl.]

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.79
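
The four pLDDT ranges in the legend above can be applied to per-residue scores with a small lookup. This is a minimal Python sketch under stated assumptions: the function names `plddt_bin` and `bin_counts` are hypothetical, the example scores are made up for illustration and are not taken from the F071768 model, and a fractional score that falls between two listed ranges (for example 50.5) is assigned to the higher range.

```python
from collections import Counter
from typing import Dict, Iterable

# Upper bounds and labels for the four ranges shown in the legend.
# Assumption: each upper bound is inclusive, so 50.0 falls in "0-50"
# and 50.5 falls in "51-70".
PLDDT_BINS = [
    (50.0, "0-50"),
    (70.0, "51-70"),
    (90.0, "71-90"),
    (100.0, "91-100"),
]


def plddt_bin(score: float) -> str:
    """Return the legend range that a single pLDDT score falls into."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    for upper, label in PLDDT_BINS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score was validated above")


def bin_counts(scores: Iterable[float]) -> Dict[str, int]:
    """Count how many residues fall into each legend range."""
    return dict(Counter(plddt_bin(s) for s in scores))


if __name__ == "__main__":
    # Hypothetical per-residue scores, for illustration only.
    example_scores = [42.0, 88.1, 95.3, 90.0]
    print(bin_counts(example_scores))
```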

Structural matches with SCOPe domains




Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart: NCBI taxonomy distribution. Legend: All Organisms, Unclassified. Slice labels: 67.2%, 32.8%.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart: associated habitat types. Legend entries (repeats collapsed): Natural And Restored Wetlands, Groundwater Sediment, Soil, Vadose Zone Soil, Terrestrial Soil, Tropical Forest Soil, Serpentine Soil, Surface Soil, Agricultural Soil, Grasslands Soil, Hardwood Forest Soil, Peat Soil, Arabidopsis Rhizosphere, Avena Fatua Rhizosphere, Tabebuia Heterophylla Rhizosphere, Populus Endosphere, Populus Rhizosphere, Miscanthus Rhizosphere, Agave. Slice labels: 23.8%, 4.1%, 4.9%, 12.3%, 7.4%, 3.3%, 3.3%, 4.9%, 7.4%, 3.3%.]



Associated Samples


Geographical Distribution




Family Sequences
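
Each row of the sequence table in this section runs its four fields together with no separators: Protein ID, Sample Taxon ID, Habitat, Sequence. The following is a minimal Python sketch for splitting such a row. It rests on assumptions drawn only from the rows visible here: the Sample Taxon ID is a 10-digit number beginning with 3300, the habitat is written in title case and ends in a lowercase letter, and the sequence is an unbroken run of uppercase letters (optionally with `*`). The names `FamilyRow`, `ROW_PATTERN` and `parse_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    sample_taxon_id: str
    habitat: str
    sequence: str


# Assumed layout: <protein id><3300 + 6 digits><Title Case Habitat><SEQUENCE>
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"              # shortest prefix before the taxon ID
    r"(?P<sample_taxon_id>3300\d{6})"     # 10-digit ID starting with 3300
    r"(?P<habitat>[A-Z][A-Za-z ]*[a-z])"  # title-case words, ends in lowercase
    r"(?P<sequence>[A-Z*]+)$"             # amino-acid letters, optional '*'
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into its four fields, or return None."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return FamilyRow(**match.groupdict())


# One row copied from the table (protein Ga0126376_115090392).
EXAMPLE_ROW = (
    "Ga0126376_1150903923300010359Tropical Forest Soil"
    "MNELLRPDDTRDALRKLGGLLVGLAALMIYIRKGPFLSTNDEQWAAFPIFLVLVIPAVYLYGSILTRPQTGELRPWQAVHNVFGLIFIPLALGQFVDVIGGTPTAALNVFWIAGATAVAAFYAGARAGVRVQFLLGSLAVIVSWTALWD"
)

if __name__ == "__main__":
    row = parse_row(EXAMPLE_ROW)
    if row is not None:
        print(row.protein_id, row.sample_taxon_id, row.habitat, len(row.sequence))
```

Because the habitat must be followed by uppercase letters only, the split between habitat and sequence falls at the last lowercase letter of the row; rows that do not fit the assumed layout return `None` rather than a guessed split.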

Protein ID Sample Taxon ID Habitat Sequence
ICChiseqgaiiDRAFT_064143213300000033SoilMNDLFTPDDTRDALRKLGALLIGLAALMIYIRKGPFLASNPNQWAAFPMLLVLAIPAVFLYGSILTVPQTGELRPWQLVHNVFGLIFIPLALGEFIDVIGGTPTASLNVFWIAAATAALAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGIGAHWGIYRGLLGILAIGLLAGALYLWRNNPGGDDVAASVTAPA
JGI10214J12806_1065428413300000891SoilMNEILRPDDTGDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLG
JGI10216J12902_10503856813300000956SoilMNDLLKPDDTRDALRNIGGLLFGLAALMIYIRKGPFLTVNPSQWAAFPMFLVLAIPAVYLYGGVTTRPQTGELRPWHVVHSVFGLLFVPLALLQFVDMVGGNPNAPLNIFWTFAATAGLAFYAGAVRGVRVQLLLGSLALIVSWTALWDKILSGGIGAHWGIYRGLLGILAIGLLAGALYVWRTNPGGDEVATSATAPAG
JGI10216J12902_11572074813300000956SoilMNDLFTPDDTRDALRKLGALLIGLAALMIYIRKGPFLASNPNQWAAFPMLLVLAIPAVFLYGSILTVPQTGELRPWQLVHNVFGLIFIPLALGEFIDVIGGTPTASLNVFWIAAATAALAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGIGAHWGIYRGLLGILAIGLLAGALYLWRNNPGGDDVAASVTAPAGDLGLWKASELVT
F14TB_10193898613300001431SoilMNERFTPDDTRDALREIGGLLFGLAAAMIYIRKGPFFAANPEQWAAFPMFLVVAIPAVYLYGGILTKPQTGRLRPWQAVHSVFGLIFVALALRQFVDLIGGTPSADLNTFWIFGLTAALAFYAGFAAGVRVQILLGAIAVIVSWTALWNELLPDEGITAHWGVYRGLLGILSIALLAAALYVWRTNPGGDEVADSATEPAGDLGLWK
C688J35102_11854168313300002568SoilMNDFFRPDDTRDELRKIGGLLIGLAAAMIYLRKGPLVAGNPSQWAAFPMLLVLAIPAVYLYGSILTRPQTGELRPWQAVHNVFGLIFVPLALGQFVDVIGGTPTASLNIFWIAAATAALAFYAGARAGVRVQFLLGSIAMIVSWTALWNKIL
C688J35102_11879329113300002568SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLATNDQQWAAFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAQFVEVIGGNSNAPLNLFWIFAATAALAFYAGVRAGVRVQFLLGSIAAIISWTALWDKILSGGIG
C688J35102_12002622413300002568SoilMNDLFRPDDTRDALRKLGGLLFGLGALMIYVRKGPFLTVNDSQWAEFPIFLVLAIPAVYLYGGAITSRPRTGELRTWQVVHSVLGLLFVALALLQFVDIIGGNPNAPLNLFWVFAATAALAFYAGTVLGVRVQLLLGSIALIISWTALWEKLLSGGIGAHWGVYRGLLGLLAIGLLAGGLYVWRNNP
C688J35102_12063976323300002568SoilMNDLFKPDDTRDALRKIGGLLFGLGALMIYIRKGPFLSLNQDQWASFPIFLVLVIPAVYLYGAILTKPRTGGLRTWQAVHSVFGLLFAFLALAQFVDVIGGSPNASLNVFWTAALTAALAFYAGSNGVRVQFLLGSIFVIVSWTALWDKFLSGGVNAHWGVYRGLLGLLA
Ga0055455_1001039823300003990Natural And Restored WetlandsMNDLFEPDDTRDSLRKIGGLLIGLAAAMIYIRKGPFLTVNPDQWAAFPMFLVLAIPAVYLYVYGGILTRPQTGELRTWQVVHSVFGLIFVPFALLQFVDVIGGDPNAQLNLFWVFGVTAALAFYAGAVTGVRVQLLFGSILLMVSWTALWDKILSGGIGAHWGVYRGLLGLMAIGLLAGALYAWRANPGGDEVSGTPTAPTGDLGLWKASELLTGAGIAAV
Ga0055455_1002209623300003990Natural And Restored WetlandsMNDLFKADDTRDALRKVGGLLLGLGAAMIYIRKGPFLSVNPSQWASFPMFLVLAIPAVYLYGGILTKPRTGQLRPWQAVHSVFGLLFAFLALEQFVDMIGGNPNAPLNVFWIALATAGLAFYAGIVAGVRVQLLLGSIALIISWTALWDKLLSGGIGAHWGVYRGLLGLLAIGLLAGGLRLWRDNPGGDEVAASATAPSGDL
Ga0063454_10115876813300004081SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLATNDQQWAAFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAQFVDLIGGNANAQLNLFWFFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVY
Ga0062593_10245085313300004114SoilPMLLVLAIPAVFLYGSILTVPQTGELRPWQLVHNVFGLVFIPLALGEFIDVIGGTPTASLNVFWIAAATAALAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGIGAHWGIYRGLLGILAIGLLAGALYLWRNNPGGDDVAASVTAPAGDLGLWKASELVTGAGIAAVIGCGLGITAIGNLNPLSGSTPPIE
Ga0063455_10033485223300004153SoilMNDLFKPDDTRDTLRKIGGLLFGLGALMIYMRKGGIFPSQNQDKWAAFPLFVVVALPAAYLYAALLTTPRTGQLRPWQVVHSVFGLLLVPVALRELVIVVGGSPGAPLNTFWIFSVTAALAFYAGAVVGVRVQLLLGSIASIVAWSALWDALLADGIGAHFGVWRGLLGLFSIFLLAGAL
Ga0063455_10059513713300004153SoilLLGLGALMIYIRKGPFLTVNDNQWASFPIFLVLAIPAVYLYGSILTRPRTGELRAWQVVHSVFGLIFVPLALLQFLDMIGGNPNASLNLFWVFLVTAGLAFYAGAVVGVRVQLLLGSITLIIAWTGLWDELLSGGIAAHWGVYRGLLGLIAIGLLAGGLYLWRNNPGGDEVAATATGPSGDLGLWRASELLTGAGIAAVIGCALGITALGNLNPLGTGTPPIETTNAWDILLLLVSLGL
Ga0063455_10125082613300004153SoilMNDLFKPDDTRDELRKIGGILLGLAAAMIFIRKGNGPGHWAEFPVFLLLALPAAYLYGAVFTLPRTGELRPWQGVHSVFGLVFVPLALLQFVDMIGGDPGASLNVFWIFGVTAALAFYAGLVIGVRVQLLLGSIAVIVSWSALWNKFLSDGIGAHYGIYRGLLGLLAIALLAGGLYIWR
Ga0062589_10003973813300004156SoilMNEILRPDDTGDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLGLWKASELLTGAGIAAVIATGLGIASYTKLFAP
Ga0062595_10105622713300004479SoilETMNDLFMPDDTRDALRKLGGLLIGLAALMIYIRKGPFLAGNPNQWAAFPMLLVLAIPAVFLYGSLLTVPQTGELRPWQVAHNVFGLVFIPLALGEFIDVIGGTPTASLNVFWIAAATAAFAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGVGAHWGIYRGLLGILSIGLLAGALYLWRNNPGGDNLAASATAPAGDLGLWKASELVTGAGIAAVIACGLGITAIGNL
Ga0062595_10118920913300004479SoilAAAMIYIRKGPFLSTNNQQWAAFPMFLVLAIPAVYLYGSILTRPQTGELRTWQAVDNVFGLIFVPLALGQFVDVIGGTPTAALNVFWIAGATAAAAFYAGARAGVRVQFLLGSIAVIVAWTALWDKILSGGIGAHWGIYRGLLGILAIGLLAAALYVWRNNPGGDDIGASATAPSGDLGLWKASELVTGAGIAAVIGCALGITAIGNLNPLSGSTPPIQTSNFWD
Ga0062595_10132352313300004479SoilGLLIGLAALMIYIRKGPLLATNNQQWAAFPIFLVLAIPTVYLYGSILTRPQTGELRPWQAVHNVLGLIFVPFALGQFVDVIGGTPTASLNVFWIFALTAALGFYAGARAGVRVQFLLGSIAVIVSWTALWDKILSGGISAHWGIYRGLLGILAIGLLAAALYVWRNNPGGDDVGASATAPSGDLGLWKASELVTGAGIAAVIACALGITAIGNLNP
Ga0062592_10082139313300004480SoilMNEILRPDDTGDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLGLWKASELLTGAGIAAVIATGLGIASYTKLFAPLGATNVAPI
Ga0062592_10091262723300004480SoilMNELLRPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLPTNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNVFGLIFIPLALGQFVDVIGGTPTADLNVFWIAAATAAAAFYAGARAGVRVQFLLGSIAVIVSWTALWNKILS
Ga0062592_10117105013300004480SoilMNELLRPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLSTNNQQWAAFPMFLVLAIPAVYLYGSILTRPQTGELRTWQAVDNVFGLIFVPLALGQFVDVIGGTPTAALNVFWIAGATAAAAFYAGARAGVRVQFLLGSIAVIVAWTALWDKILSGGIGAHWGIYRGLLGILAIGLLAAALYVW
Ga0062591_10055184623300004643SoilMNDDLWRPDDTRDALRKLGGLLLGLGALMIYIRKGPFLGVNPHQWASFPMFLVLAIPAVYLYGGILSRRQTGELRPWQAVHSVFGLILVPLALLQFVDMIGGNPNADLNIFWAFAATAGLAFYAGIVAGVRVQLLLGSIALIVSWTALWNKILSGGIGAHWGIYRGLLGILAIGLLAGALYLWRTNPGGDEVAETATAPSGDLGLWKASELLTGAGIAAVIACS
Ga0066812_100915113300005105SoilMNEILRPDDTRDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAFTAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGIVAIGLLAGALYVW
Ga0066814_1002910523300005162SoilMNEILRPDDTRDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAFTAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGIVAIGLLAGA
Ga0066388_10050202423300005332Tropical Forest SoilMNELLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPLLSTNNQQWAAFPILLVLAIPAVYLYGSILARPQTGELRPWQAVHNVFGLVFVPLALGQFVDVIGGTPTASLNVFWIVAVTAALAFYAGARAGVRVQFLLGSIAVIVSSTALWNKILS
Ga0066388_10609689413300005332Tropical Forest SoilMNDRLRPDDTRDSLRKIGGLLFGLGALMIFLRKGSGPGSEQWADFPMFLVLAIPAVVLYGSVFTKPQTGELRVWQAVYSVFGLIFVPLALLKFIDMIGGSTSDLNTFWVFGVTAALAFYAGAVIGVRVQLLLGSIALIISWSALWDKLLSDGITHNWGVYRGLLGLLAIGLLAGAL
Ga0070741_1002865783300005529Surface SoilMNEPLAPDDTRDTLREIGGILIGLAAAMIFIRKNTGPGHWAAFPMFLNLAIPAVLLYGAVLTRPQTGALRPWQVVHSVFGLIFVPLALLQLIDVLGGTPGTSLNIFWIFGVTAALAAYAGAVIGVRVQLLLASIAAIISWSALWNKLLSSGINSHFGVYRGLLGILAIGLLAAGLYVWRTNPGGDVVAETATAPSGDLGLWKASELLTGAGIAAVI
Ga0081455_1018949813300005937Tabebuia Heterophylla RhizosphereMNDLFKPDDTRDALRKIGGLLLGLGAAMIYIRKGPFLTVNPSQWADFPMFLVLAIPAVYLYGSILTRPRTGELRAWQVVHSVFGLLFVPLALAQFVDMIGGNPNASLNVFWIFLVTAGLAIYAGAVIGVRVQLLLGSISLIIAWTGLWNELLSGGITAHWGVYRGLLGLMAIGLLAGGLYLWRTNPGTEVAATATGPSGDLGLWKASELLTGAGIAAVIGCALGITALGNLNPLGTGTPPIETTNAWDVLLLLVSLGLVAIADAPRETAAAAVR
Ga0081455_1055626723300005937Tabebuia Heterophylla RhizosphereMNELLQPDDTRDALRKIGGILVGLAAAMIYIRKGPFLGVNPHQWAAFPIFLVVALPAVYLYGSILTRSQTGELRPWQAVHNVFGLIFVPFALAQFVDMIGGNPNAQLNLFWIFGVTAALAFYAGSRAGVRVQFLLGSIAVIISWTALWDKILSDGIGAHWGIYRGLLGILA
Ga0081455_1079154513300005937Tabebuia Heterophylla RhizosphereMNDVLAPDDTRDALRKIGGLLLGLAALMIYIRKGPFISGNPSQWASFPMLLVLAIPAVYLYGSILSRPQTGELRPWQAVDNVFGLVFVPLALGQFVQVIGGTPTASLNIFWITAVTAGFAFYAGARAAVRVQFLLGSIAVIVSWTALWNKILSGGIGANWGVYRGL
Ga0081540_129646313300005983Tabebuia Heterophylla RhizosphereMNELLRPDDTRDALRKLGGLLIGLAALMIYIRKGPLLPTNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNVFGLIFIPLALGEFVDVIGGTPTADLNVFWIAAATAGAAFYAGGRAGVRVQFLLGSIAVIVSWTALWNKIL
Ga0075365_1064559013300006038Populus EndosphereMNDLFRPDDTRDELRKIGGILLGLAAAMIFIRKGNGPGQWAAFPIFLLLALPAAYLYGALLTVPRTGELRPWQAVHSVFGLIFVPLALLQFVDLIGGDPGASLNVFWIFGVTAGLAFYAGLVVGVRVQLLLGSIAVIVSWSSLWNKLLSDGIGHNYGTYRGLLGLLAIGLLAAGLY
Ga0074056_1113324513300006574SoilMTDLFKPDDTRDALRKLGGLLLGLGALMIYIRKGPFLTVNDNQWAAFPIFLVLAIPAAYLYGSILTRPRTGGLRAWQVVHSVFGVLLVPLALLQFVDLIGGNPSAALNLFWVFLVTAGLAFYAGGVVGVRVQILLGSIALIIAWTALWDELLSGGISAHWGVYRGLLGLIAIGLLAGGLYLWRTNPGGDEVAATATGPSGDLG
Ga0074054_1201569913300006579SoilMNDLFKPDDTRDALRKIGGLLFGLGAAMIYIRKGPFLSVNPSQWASFPMFLVLAIPAVYLYGSILTRPRTGELRSWQLVHSVFGLIFAVLALSQFVHLIGGNPNAPLNLFWIFLVVAGLAFYVGAVVGVRVQLLLGGIALIISWTGLWDKLLSGGIGAHWGVYRGLLGLTFRF*
Ga0074048_1004793613300006581SoilMNEILRPDDTRDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAFTAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGIVAIGLLAGALYVWRTNPGGAKVAASATRPDGDLGLWKASELLTGAGIAAVIATGLGIASYTKLFAPLGATNVAPIQTSNLWDTLLLLVSL
Ga0075431_10217348813300006847Populus RhizosphereLSVNPSQWASFPMLLVLAIPAVYLYGGSILTRPRTGELRSWQTVHNILGLVFVPLALGQFVDVIGGTPTASLNVFWIFAVTAALAFFAGARAGVRVQFLLGSIAVIVSWTALWNKILSNGINANWGVYRGLLGILAIGLLAGALYVWRTNPGGDEIAGTATAPSGDLGL
Ga0079219_1071688013300006954Agricultural SoilMNDRFTPDDTRDLLREIGGLLLGLAAAMIYIRKGPFLSVNPDQWASFPMFLVVAIPALYLYGGILTRPQTGRLRPWQVVHSVFGLIFVALALRQFVDMIGGNPSADLNTFWIFGTVAALAFYGGFAVGVRVQILLGAIAVIVAWTALWNDFLPDAGITAHWGTYRGLLGLLSIGLLAGALYAWRTNPGGEEFADTAAKPAGDLGIWKASELLTGAGIAAVIA
Ga0066710_10352917213300009012Grasslands SoilMNDSFKPDDTRDALRKVGGLLLGLAAAMIYIRKGPFIALNPSQWAAFPMFLVLAIPAVYLYGGILARPQTGELRPWMAVHSVFGLIFVPFALLQFVDMIGGSPNAQLNLFWAFLATAGLAFYAGTVAGVRVQLLLGSIALIFSWTALWDKILSGGIGAHWGVYRGLLVLLAIFLLAGALYMWRNNPGGDA
Ga0105245_1091850923300009098Miscanthus RhizosphereMNSFEPDDTRDALRKIGGILLALAALMIYIRKGPFIPLNPSQWASFPMFLVLAIPAAYLYGSILARPQTGELRPWMVVHSVLGLLFVPIAIGQFIDMIGGTPGAPLNLFWIFLVTAGMAFYAGTVMGVRVQILLGCLALILSWSALWDKILSGGIGSHWGIYRGLLGILAIGLLAGALYVWRTNPGGDEVAGTATGPAGDLGLWKASELLTGAGIAAVIACALGITALGNLNPLSGTTPPIQTSNFWDIMLLLVSLG
Ga0105242_1121214113300009176Miscanthus RhizosphereLWRPDDTRDALRKLGGLLLGLGALMIYIRKGPFLSVNPHQWASFPMFLVLAIPAVYLYGGILSRRQTGELRPWQAVHSVFGLLLVPLALLQFVDMIGGNPNANLNIFWAFAATAGLAFYAGILAGVRVQLLLGSIALIVSWTALWNKILSGGIGAHWGIYRGLLGILAIGLLAGALYLWRTNPGGDEVAETATAPSGDLGLWKASELLTGAGIAAVVACSLGIASIGNLNPLGTGTPPIQTTNFWDILLLIVSLGLV
Ga0126307_1003429213300009789Serpentine SoilMNELLQPDDTRDALRKLGGLLIGLAALMIYIRKGPILGTNNEQWAAFPMLLVLALPAIYLYGSVLTRPQTGELRPWQAVHNVFGLIFVPLALGQFVDVIGGTPTAPLNVFWITAAAAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSDGITAHWGVYRGLLGILAIGLLAAGLYVWRNNPGGDDVGASATAPSGDLGLWKASELLTGAGIAAVIGCALGITALGNLNPLSGSTPPIETSNFWDIMLLLVS
Ga0126305_1034390013300010036Serpentine SoilMNELLQPDDTRDALRKLGGLLIGLAALMMYIRKGPILGTNNEQWAAFPMLLVLALPAIYLYGSVLTRPQTGELRPWQAVHNVFGLIFVPLALGQFVDVIGGTPTAPLNVFWITAAAAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSDGITAHWGVYRGLLGILAIGLLAAGLYVWRNNPGGDDVGASATAPSGDLGLWKASELLTGAGIAAVIGCAL
Ga0126305_1062413013300010036Serpentine SoilMNERFTPDDTRDALREIGGLLFGLAAAMIYIRKGPFFAANPQQWDAFPMFLVVAIPAVYLYGGILTKPQTDRLRPWQAVHSVFGLIFVALALRQFVDLIGGTPSADLNTFWIFGLTAALAFYVGFAAGVRVQILLGAIAVIVSWTALWNELLPDEGITAHWGVYRGLLGILSIALLAAALYVWRTNPGGDEVADSATEPAGDL
Ga0126304_1017943513300010037Serpentine SoilMNDRFTPDDTRDLLREVGGLLLGLAAAMIYIRKGPFISVNPDQWAAFPMFLVLAIPAAYLYGGILTRPQTGRLRPWQAVHSVFGLILVAFALGQFVDLIGGNPNAELNIFWIFGVTAALAFYAGFVAGVRVQILLGSIAVIISWTALWNKFLPDE
Ga0126304_1051063813300010037Serpentine SoilMNEFLRPDDTRDELRKIGGLLIGLAAAMIYIRKGPILATNTEQWAAFPMFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNAFGLIFVPLALGQFVDVIGGTPTAPLNVFWITAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSDGITAHWGVYRGLLGILAIGLLAAALYLWRNNPGGDDVGASATAPAGDLGLWKASELVTGAGIAAVIACALGITALGNLNPLSG
Ga0126315_1048225613300010038Serpentine SoilRDALRTAGGILFGLAFVMILIRKGSGPGHWAKFPLFLDVAIPAATLYGLGVFTKDKTGGLRVWQAVYAVFGLLLVPLVLLQFVDLVGGSPGTSMNTFWIFGVTAALAFYAGIVAGVRFQLLLGSIAAIISWSALWDALLGGGIGENWGVYRGLLGLIAIALLAGGLFLWRTNKGGDEGAATATYPGGEIGLWKASELLTGAGLAAVLACSLGGLVTFFVVSVAQAFGPTEALRIQTPIETTNAWDIMLLVVALGLVWLGSLIG
Ga0126309_1119663913300010039Serpentine SoilMNDLFKPDDTRDALRTAGGILFGLAFVMILIRKGSGPGHWAKFPLFLDVAIPAATLYGLGVFTKGKTGGLRVWQAVYAVFGLLLVPLVLLQFVDLVGGSPGTSMNTFWIFGVTAALAFYAGIVAGVRFQLLLGSIAAIISWSALWDALLGGGIGENWGVYRGLLGLIAIAL
Ga0126308_1076407223300010040Serpentine SoilMNDLFRPDDTRDTLRKIGGLLIALAAAMIYIRKGPAISVNPSQWAAFPMFLVVAIPAVYLYGSILTRPQTGELRSWQTVHSVLGLIFIPFALAQFVDLIGGSPNAALNLFWIFGFTAAFAFYAGAVAGVRVMFLLGSIGAIVSWSAFWDEILS
Ga0126308_1076574013300010040Serpentine SoilPAFAINPSQWAAFPIFLVLAIPAAYLYGGILTRRQTGELRTWQAVHSVLGLLFVPAALSQFVDVIGGNPTASLNVWWIFLVTAGLAFYVGTVIGVRVQLLLGSIALIIAWSALWDALLSGGIGAHWGIYRGLLGMISIGLLAGGLYLWRTNRGGDELAASATTPGGDLGLWKASELLTAAGIAAVIACGLGITALGNINPLGTGTPPIETTNVWDMLLLVVS
Ga0126308_1094349113300010040Serpentine SoilMNEFLRPDDTRDELRKIGGLLIGLAAAMIYIRKGPILATNTEQWAAFPMFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNAFGLIFVPLALGQFVDVIGGTPTAPLNVFWITAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSDGITAHWGVYRGLLGILAIGL
Ga0126308_1100217713300010040Serpentine SoilGPFISVNPDQWAAFPMFLVLAIPAAYLYGGILTRPQTGRLRPWQAVHSVFGLILVAFALGQFVDLIGGNPNAELNIFWIFGVTAALAFYAGFVAGVRVQILLGSIAVIISWTALWNKFLPDEGITAHWGVYRGLLGLLSIGLLAAALYVWRTNPGGDEVADSATEPAGDLGLWKASELLTGARIAALIACSLGI
Ga0126314_1056247013300010042Serpentine SoilMNDLFKPDDTRDALRTAGGILFGLAFVMILIRKGSGPGHWAKFPLFLDVAIPAATLYGLGVFTKDKTGGLRVWQAVYAVFGLLLVPLVLLQFVDLVGGSPGTSMNTFWIFGVTGALAFYAGIVAGVRFQLLLGSIAAIISWSALWDALLGGGIGENWGVYRGLLGLIAIAL
Ga0126311_1004783413300010045Serpentine SoilMNDRFTPDDTRDLLREIGGLLLGLAAAMIYIRKGPFISVNPDQWAAFPMFLMLAIPAAYLYGGILTRPQTGRLRPWQAVHSVFGLILVALALGQFVDLIGGDPSAELNIFWIFGVTAALAFYAGFVAGIRVQILLGSIAVIISWTALWNKFLPDEGITAHWGVYRGLLGLLSIGLLAAALYVWRTNPGGDEV
Ga0126311_1023945313300010045Serpentine SoilMSDLFKPDDTRDALRKLGGLLFALGAAMIYIRKGPFFAANPSQWAAFPMFVVLAIPATYLYGGILTRRQTGELRTWQAVHSVLGLLFVPAALSQFVDMIGGSPAASLNVCWIFLVTAGLAFYAGTVIGVRIQLLLGSIALIIAWTALWDELLSGGIGAHWGIYRGLLGMIAIGLLAGGLYLWRTNRGGDELAASATTPGGDLGLWKASELLTAAGIAA
Ga0126306_1093413913300010166Serpentine SoilMNDLYRPDVTRDTLRKLGGILLGLAALMIFIRKGPFVSVNPDQWAAFPMFLVLAIPAVYLYGAIFTRPQTGGLRVWQVVHSTLGLVFVPLALLQFIDLIGGTPGTSLNTFWVFGVTAALAFYAGAIAGVRVQLLLGSIALVVSWSALWNRILSDGIGANYGVYRGLLGILAI
Ga0126376_1150903923300010359Tropical Forest SoilMNELLRPDDTRDALRKLGGLLVGLAALMIYIRKGPFLSTNDEQWAAFPIFLVLVIPAVYLYGSILTRPQTGELRPWQAVHNVFGLIFIPLALGQFVDVIGGTPTAALNVFWIAGATAVAAFYAGARAGVRVQFLLGSLAVIVSWTALWD
Ga0126377_1277539413300010362Tropical Forest SoilMNELLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPLLSTNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGKLRPWQAVHNVFGLVLVPLAMGQFVDVIGGTPTASLNVFWIAAVTAALAFYAGARAGVRVQFLLGSIAVIVSWTALWNKILSGGIGAHWGVYRGLLGILAIALLAGAL
Ga0134127_1204296413300010399Terrestrial SoilFRALGGLLIGLAALMIFIRKGWFFPINPSQWASFPLFLVFALPALFLYGSVFTRDRTGELRVWQVVLCSFGYVFVPLALLQFVDVIGGTRGAALNIFWIFGFTAALGFYVGSVRGVRVQLLLASIAAIISWSALWDKLLSNGIGQHYGVYRGVLGVLGIALLAGALYVWRENPGGEEIASSTTAPSGDLGLWKASELLTGAGIAAVLACGLGIAGF
Ga0134122_1192728313300010400Terrestrial SoilMNDDLWRPDDTRDALRKLGGLLLGLGALMIYIRKGPFLGVNPHQWASFPMFLVLAIPAVYLYGGILNRRQTGELRPWQAVHSVFGLILVPLALLQFVDMIGGNPNADLNIFWAFAATAGLAFYAGIVAGVRVQLLLGSIALIVSWTALWNKILSGGIGAHWGIYRGLLGILAIGLLA
Ga0137365_1105882413300012201Vadose Zone SoilMIYIRKGPFLSVNPSQWASFPMFLVVAIPAVYLYGGIMTRPQTGELRPWQAVHSVFGLIFVPFALAQFIHLLSGNPNAPLNVFWIFAVTAALAFYAGTVAGVRVQLLLGSIAAIVSWTALWDKILSGGITAHWGVYRGLLGIAALALLAAALYMWRTNPGGDEVAGTATAPSGDLGLWKASELFTGAGIAAVIAC
Ga0137374_1069855413300012204Vadose Zone SoilPDDTRDALRKIGGLLIGLAALMIYIRKGPFLSVNQDQWASFPIFLVVAIPAVYLYGGILTRPQTGELRTWQAVHSVFGLIFVPFALAQFVDLVGGNPNASLNVFWIFGVTAALAFYAGAVAGVRVQLLLGSIAVIVFWTALWDKILSDGITAHWGVYRGLLGIVAIGLLAAALYVWRTNPGGDEVAGSATAPSGDLGLWKASELFTGAGIAAVIACALGVTAVGNLNPLGTETPPIETTNLWDILLLLVSLGL
Ga0150985_10170248923300012212Avena Fatua RhizosphereMIFIRKGNGPGHWAEFPVFLLLALPAAYLYGAVFTLPRTGELRPWQGVHTVFGLVFVPLALLQFVDMIGGDPGASLNVFWIFGVTAALAFYAGLVIGVRVQLLLGSIAVIVSWSALWNKFLSDGIGAHYGIYRGLLGLLAIALLAGGLYIWRTNPGGDDVAATATGPEGDLGLWKASELLTGAGIAAVLACSLGGLATLLVTSISPSVGTFVTPVHTSNFWDV
Ga0150985_11684465613300012212Avena Fatua RhizosphereMNELLRPDDTRDALRKIGGLLIALAAAMIYIRKGPFLSTNAEQWAAFPMFLVLAIPAVYLYGGIFSRPHTGELRPWQAVHNVLGLIFVPFALAQFVELIGGSSSAQLNVFWIFALTAALAFYAGARAGVRVQFLLGSIAVIISWSALWDRILSDGIGANW
Ga0137369_1041185113300012355Vadose Zone SoilMNDLLKPDDTRDALRKIGGLLFGLGALMIYIRKGPFVNVNPDQWAEFPIFLVLVIPAVVLYGSVLARPQTGELRPWQLVYSVFGLVFVFLALEQLVDVIGGNPNAQLNLFWISLATAGLAFYAGVVLGVRVQLLLGSILLILSWTALWDKLLSDGITANWGVYRGLLGILAIGLLAGGLYLWRSNPGSDE
Ga0137368_1014201413300012358Vadose Zone SoilMNDLLKPDDTRDALRKIGGLLIGLAALMIYIRKGPFLSVNQDQWASFPIFLVVAIPAVYLYGGILTRPQTGELRTWQAVHSVFGLIFVPFALAQFVDLVGGNPNASLNVFWIFGVTAALAFYAGAVAGVRVQLLLGSIAVIVSWTALWDKILSDGITAHWGVYRGLLGIVAIGLLAAALYVWRTNPGGDEVAGSATAPSGDLGLWKASELFTGAGIAAVIACALGVTAVGNLNPLGTGTPPIETTNLWDILLLL
Ga0150984_11700040713300012469Avena Fatua RhizosphereMNELLRPDDTRDALRKIGGLLIALAAAMIYIRKGPFLSTNAEQWAAFPMFLVLAIPAVYLYGGIFSRPQTGELRPWQAVHSVLGLIFVPLALAQFVELIGGSSSAQLNVFWIFALTAALAFYAGSRAGVRVQFLLGSIAVIISWSALWDKILSDG
Ga0157288_1028045013300012901SoilDTGDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGSNPGGTKVAASVTRP
Ga0157301_1019906113300012911SoilFGLAALMIYIRKGPFLASNPNQWAAFPMLLVLAIPAVFLYGSILTVPQTGELRPWQVVHNVFGLVFIPLALGEFIDVIGGTPTASLNVFWIAAATAALAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGVGAHWGIYRGLLGILSIGLLAGALYLWRNNPGGDDVAASVTAPAGDLGLWKATELVTGAGIAAVIGCGLGITAIGNLNPLSGSTPPIETS
Ga0164303_1128276913300012957SoilATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPSGDLGLWKASELITGAGIAAVIGCVL
Ga0164299_1019114313300012958SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGIYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPAGDLGLWKASELITGAGIAAVIGCALGITAIGNLNPLSGSTPPIQTSNFWDVMLLLVSL
Ga0164301_1131312413300012960SoilALRKIGGLLIGLAAAMIYSRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGIYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPAGDLGLWKA
Ga0164308_1022447013300012985SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGIYRGLLGILAIGLLAAALYVW
Ga0164304_1140962913300012986SoilDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLGVAIPAVYLYGSIFTRPQAGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGIYRGLLGILAIGLLAAALYVWRENPGGDDVGASATG
Ga0164306_1024297713300012988SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPAGDLGLWKASELITGAGIAAVIGCALGITAI
Ga0164305_1011235523300012989SoilMNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGQLRPWQGVHSVVGLIFVPFALAEFVNLIGGTPSAPLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGIYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPAGDLGLWKASELVTGAGIA
Ga0173483_1005641023300015077SoilMNEILRPDDTGDALRKIGGLLVGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLGLWKASELLTG
Ga0173480_1032176223300015200SoilMNEILRPDDTGDALRKIGGLLLGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPEKGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGIVAIGLLAGALYVWRTNPGGAKVAASATRPDGDLGLWKASELLTGAGIAAVIATGLGIASYTKLFAPLGATNVAPIQ
Ga0173478_1080498213300015201SoilAFPLFLVVAAPVVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLGLWKASELLTGAGIA
Ga0137412_1058061923300015242Vadose Zone SoilMNDLFKPDDTRDALRKLGGLLFGLGALMIYIRKGPLLTVNPDQWASFPIFLVLAIPAVYLYGGAITSRPRTGELRPWQLVHSVFGLLFVALALLQFVDLIGGNPNASLNLFWVFLATAGLAFYAGAVLGVRVQLLLGSIALIVSWTALWDELVSGGITTHWGVYRGLLGLLAIGLLAG
Ga0132256_10153043523300015372Arabidopsis RhizosphereMNDDVWRPDDTRDALRKVGGLLLGLAALMIYIRKGPFLSVNPSQWASFPMFLVLAIPAVYLYGGILSRRQTGELRPWQAVHSVFGLILVPLALLQFVDMIGGNPNANLNIFWAFGATAGLAFYAGIVAGVRVQLLLGSIALIVSWTALWNEILSGGIGAHWGIYRGLLGILAIGLLAGALYLWRTNPGGDEVAE
Ga0132255_10097096113300015374Arabidopsis RhizosphereMNEVLRPDDTRDALRELGGLLIGLAATMIYIRKGPLLSTNTHQWSAFPMFLVLAIPAGYLYWSILTRRQTGELRPWQAVHNVFGVILIPLALGEFVDVIGGTPTAGLNVFWISAVTAAAAFYAGARAGVRVQFLVGSIAVIVSWTALWDQILSDGVGAHWGIYRGLLGILSIGLLAAALYVWRTNPGGNDIGASATSPAGDLGL
Ga0184624_1042435313300018073Groundwater SedimentFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNVLGLILVPFALGQFVDVIGGAPTASLNVFWTFAVTAALAFYAGARAGVRVQFLLGSIAVIVSWTALWDKILSGGISAHWGIYRGLLGILAIGLLAAALYVWRNNPGGDDVGASATAPSGDLGLWKASELVTGAGIAAVIGCALGITAIGNLNPLSGSTPP
Ga0190270_1277660913300018469SoilMNDLFTPDDTRDMLRKLGGILLGLAALMIFIRKGPFVSVNPDQWAAFPIFLVLAIPAVYLYGAIFTRPRTGELRVWQVVHSTLGLVFVPLALLQFIDLIGGTPGTSLNTFWVFGVTTVLAFYAGAIAGVRVQLLLGSIALIVSWSALWNKILSDGIGANYGVYRGLLGILAIVLLAGALYMWR
Ga0066669_1198433613300018482Grasslands SoilDPLSPDDSRDLLREIGGLLIGLAALMIFIRKGPFISINPHQWAAFPMFVAMAIPAAYLYGSLMMRPRTGELRVWQVVHSVFAIIFIPLALGRFIHLLGGTATADLNTFWIFGVTAVLALYAGLAGVRVQLLLGGIALIISWSALWDKLLSGGIGAHYGVWRGLLGIMAIVLLVGGLYMWR
Ga0222622_1050806413300022756Groundwater SedimentMNEYLRPDDTRDSLREIGGLLIALAAAMIYIRKGPFLTVNPDQWAAFPMFLVVAIPAGYLYGGIFTRPQTGELRPWQAVHSVIGLIFVPFALAQFVDLIGGNPNAPLNVFWIFAVTAGLAFYAGSRAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGVYRGLLGILAIGLLAGALYVWRTNPGGDDVGASATAPSGDLGLWKASELVTGAGIAAVIACSLGITAIGNLNPLSGSTPPIQTSNFW
Ga0222622_1144232413300022756Groundwater SedimentTRDALRKIGGLLFGLGALMIYIRKGPFLTVNPSQWASFPMFLVLAIPAAYLYGGAVTSRPQTGELRPWQVLHSVFGLLFVALALLQFIDMVGGNPNASLNLFWVFLVTAGVAFYAGTVLGVRVQLLLGSIALIISWTALWDKLLSDGIGAHWGVYRGLLGLLAIGLLAG
Ga0247789_110755313300023266SoilGLLLGLAAAMIFIRKGGIFPSANRDRWAAFPLFLVVAAPAVYLYSGLLAGPERGELRVWQSVHSVFALLLVPIALREFVVVLGGSPGASLNTFWIFAITAGLAFYTATRFAVRVQLLLGSIAVIVSWTALWDKILSGGVTAHWGIYRGLLGILAIGLLAGALYVWGTNPGGTKVAASATRPDGDLG
Ga0179589_1053488713300024288Vadose Zone SoilMNDLFKPDDTRDALRKLGGLLFGLGALMIYIRKGPLLTVNPDQWASFPIFLVVAIPAVYLYGGAITSRPRTGELRPWQLVHSVFGLLFVALALLQFVDLIGGNPNASLNLFWVFLATAGLAFYAGAVLGVRVQLLLGSIALIVSWTALWDELV
Ga0210142_100316533300025552Natural And Restored WetlandsMNDLFEPDDTRDSLRKIGGLLIGLAAAMIYIRKGPFLTVNPDQWAAFPMFLVLAIPAVYLYVYGGILTRPQTGELRTWQVVHSVFGLIFVPFALLQFVDVIGGDPNAQLNLFWVFGVTAALAFYAGAVTGVRVQLLFGSILLMVSWTALWDKILSGGIGAHWGVYRGLLGLMAIGLLAGALYVWRA
Ga0207687_1075314213300025927Miscanthus RhizosphereMNSFEPDDTRDALRKIGGILLALAALMIYIRKGPFIPLNPSQWASFPMFLVLAIPAAYLYGSILARPQTGELRPWMVVHSVLGLLFVPIAIGQFIDMIGGTPGAPLNLFWIFLVTAGMAFYAGTVMGVRVQILLGCLALILSWSALWDKILSGGIGSHWGIYRGLLGILAIGLLAGALYVWRTNPGGDEVAGTATGPAGDLGL
Ga0209177_103206691 3300027775 Agricultural Soil RTMNDRFTPDDTRDLLREIGGLLLGLAAAMIYIRKGPFLSVNPDQWASFPMFLVVAIPALYLYGGILTRPQTGRLRPWQVVHSVFGLIFVALALRQFVDMIGGNPSADLNTFWIFGTVAALAFYGGFAVGVRVQILLGAIAVIVAWTALWNDFLPDAGITAHWGTYRGLLGLLSIGLLAGALYAWRTNPGGEEFADTA
Ga0307313_101957371 3300028715 Soil MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAGTAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHLGVYRG
Ga0307311_100949242 3300028716 Soil MNDLFKPDDTRDELRKIGGILLGLAAAMIFIRKGNGPGHWAEFPIFLLLALPAAYLYGAVLTLPRTGELRPWQAVHSVFGLVFVPLALLQFVDMIGGDPGASLNVFWIFGVTAALAFYAGLVVGVRVQLLLGSIAVIVSWSALWNKFLSDGIGANYGVYRGLLGLLAIALLAGGLYVWRTNPGGDDVAASATGPEGDLGLWKASELLTGAGIAAVLACSLGGLATLLVSS
Ga0307298_101314112 3300028717 Soil MNDLFKPDDTRDALRKIGGLLFGLGALMIYIRKGPFLTVNPSQWASFPMFLVLAIPAAYLYGGAVTSRPQTGELRPWQVLHSVFGLLFVALALLQFIDMVGGNPNASLNLFWVFLVTAGVAFYAGTVLGVRVQLLLGSIALIISWTALWDKLLSDGIGAHWGVYRGLLGLLAIGLLA
Ga0307307_101976111 3300028718 Soil MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWREN
Ga0307317_100785621 3300028720 Soil MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPSGDLGLWKASELITGAGIAAVIGCALGITAIGNLNPLSGSTPPIQTSNFWDVMLLLVSLGLVA
Ga0307297_101474591 3300028754 Soil MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAGTAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPSGDLGLWKASELVTGAGIAAVIGCALGISAIGNLNPLTGST
Ga0307323_100814912 3300028787 Soil MNDLFEPDDTRDSLRKIGGLLIGLAAAMIYIRKGPFLTVNPDQWAAFPMFLVLAIPAVYLYGGVLTRPQTGELRTWQVVHSVFGLIFVPFALLQFVDMIGGDPNAQLNLFWVFAVTAGLAFYAGAVAGVRVQLLFGSILLIISWTALWDKILSGGIGAHWGVYRGLLGLMAIGLLAGALYLWRTNPGGDEVAGTATAPTGDLGL
Ga0307299_102802921 3300028793 Soil IGLAAAMIYIRKGPFLTVNPDQWAAFPMFLVLAIPAVYLYGGVLTRPQTGELRTWQVVHSVFGLIFVPFALLQFVDMIGGDPNAQLNLFWVFAVTAGLAFYAGAVAGVRVQLLFGSILLIISWTALWDKILSGGIGAHWGVYRGLLGLMAIGLLAGALYLWRTNPGGDEVAGTATAPTGDLGLWKASELLTGAGIAGVIACSLGITAL
Ga0307287_103456371 3300028796 Soil LLIALAAAMIYIRKGPFLTVNPDQWAAFPMFLVVAIPAGYLYGGIFTRPQTGELRPWQAVHSVIGLIFVPFALAQFVDLIGGNPNAPLNVFWIFAVTAGLAFYAGSRAGVRVQFLLGSIAVIISWTALWDKILSGGIAAHWGVYRGLLGILAIGLLAGALYVWRTNPGGDDVGASATAPSGDLGLW
Ga0307305_101668782 3300028807 Soil MNDLFKPDDTRDTLRKIGGLLFGLGALMIYIRKGPFISLNQDQWASFPIFVVLAIPAVYLYGAILTRPQTGELRPWQAVHSVFGLVFAFLALAQFVDMIGGSPNAPLNVFWISAVTAGLAFYAGSIGVRVQFLLGSIFVIVSWTALWDKFLSDGITAHWG
Ga0307296_100393923 3300028819 Soil MNDLFKPDDTRDALRKIGGLLFGLGALMIYIRKGPFLTVNPSQWASFPMFLVLAIPAAYLYGGAVTSRPQTGELRPWQVLHSVFGLLFVALALLQFIDMVGGNPNASLNLFWVFLVTAGVAFYAGTVLGVRVQLLLGSIALIISWTALWDKLLSDGIGAHWGVYRGLLGLLAIGLLAGGLYLWRNNPGGDEVAASATWPAGDLGLWKAS
Ga0307296_101087821 3300028819 Soil MNDLFKPDDTRDELRKIGGILLGLAAAMIFIRKGNGPGHWAEFPIFLLLALPAAYLYGAVLTLPRTGELRPWQAVHSVFGLVFVPLALLQFVDMIGGDPGASLNVFWIFGVTAALAFYAGLVVGVRVQLLLGSIAVIVSWSALWNKFLSDGIGANYGVYRGLLGLLAIALLAGGLYVWRTNPGG
Ga0307296_105596961 3300028819 Soil MNEFLQPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLG
Ga0307310_103891981 3300028824 Soil IGLAAAMIYIRKGPFLTANDQQWATFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVVGLIFVPFALAEFVDLIGGTPSAQLNLFWIFAGTAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKILSGGIGAHWGVYRGLLGILAIGLLAAALYVWRENPGGDDVGASATGPSGDLGLWKASELITGAGIAAVIGCALGITAIGNLNPLSGNTPPIQTSNFWDI
Ga0307310_107248341 3300028824 Soil DDTRDSLRKIGGLLIGLAAAMIYIRKGPFLTVNPDQWAAFPMFLVLAIPAVYLYGGILTRPQTGELRTWQVVHSVFGLIFVPFALLQFVDMIGGDPNAQLNLFWVFGVTAGLAFYAGAVAGVRVQLLFGSILLIISWTALWDKILSGGIAAHWGVYRGLLGILAIGLLAG
Ga0307304_100792661 3300028885 Soil MNDLFKPDDTRDELRKIGGILLGLAAAMIFIRKGNGPGHWAEFPIFLLLALPAAYLYGAVLTLPRTGELRPWQAVHSVFGLVFVPLALLQFVDMIGGDPGASLNVFWIFGVTAALAFYAGLVVGVRVQLLLGSIAVIVSWSALWNKFLSDGIGANYGVYRGLLGLLAIALLAGGLYVWR
Ga0247827_105168431 3300028889 Soil MNDLFMPDDTRDALRKLGGLLIGLAALMIYIRKGPFLAGNPSQWAAFPMLLVLAIPAVFLYGSVLTVPQTGELRPWQVAHNGFGLVFIPLTLGEFIDVIGGTPTASLNVFWIGAVTAAFAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGVGAHWGIYRGLLGILSIGLLAGALYLWRNNPGGHNL
Ga0247826_112226572 3300030336 Soil MNDVLRPDDTRDALRKVGGLLIGLAALMIYIRKGPFLATNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHNVLGLIFIPFALGQFVDVIGGTPTASLNLFWIFAVTAALGFYAGARAGVRVQFLLGSIAVIVSWTALWDKILSGGISAH
Ga0307469_118871801 3300031720 Hardwood Forest Soil MNDVLRPDDTRDALRKVGGLLIGLAALMIYIRKGPFLATNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHSVLGLIFVPLALGQFVDVIGGTPTASLNVFWTFAVTAALAFYAGARAGVRVQFLLGSIAVIVSWTALWNKILSGGISAHWGIYRGLLGILAIG
Ga0307468_1003630181 3300031740 Hardwood Forest Soil MNDVLRPDDTRDALRKVGGLLIGLAALMIYIRKGPFLATNNQQWAAFPIFLVLAIPAVYLYGSILTRPQTGELRPWQAVHSVLGLIFVPLALGQFVDVIGGTPTASLNVFWTFAVTAALAFYAGARAGVRVQFLLGSIAVIVSWTALWNKILSGGISAHWGIYRGLLGILAIGLLAGALYVWRNNPGGDDVGASATAPSGDLGLWKASELVTGAG
Ga0308176_107706181 3300031996 Soil MNDLFRPDDTRDALRKLGGLLFGLGALMIYVRKGPFLTVNDSQWAEFPIFLVLAIPAVYLYGGAITSRPRTGELRTWQVVHSVFGLLFVALALLQFVDIIGGNPNAPLNLFWVFAATAALAFYAGTVLGVRVQLLLGSIALIISWTALWEKLLSGGIGAHWGVYRGLLGLLAIGLLAGGLYLWRNNPGGDEVAASATAPSGDLGLWKASELLTGAGIAAVIGCALGITALGNLNPLSSTPPIETTNVWDI
Ga0308176_123169271 3300031996 Soil MNELLRPDDTRDTLREIGGLLIGLAAAMIYIRKGPFISTNDEQWAAFPMFLVVAIPAVYLYGSIFTRPQTGELRPWQAVHSVFGLIFVPFALAQFVDVIGGNPTAPLNVFWTFAATAALAFFAGARAGVRVQFLLGSIAVIISWTALWNKILSGGIGTHWGIYRGLLGLLAIGLLAAGLYVWRNN
Ga0268251_102993161 3300032159 Agave LRPDDTRDALRKIGGLLLGLAAAMIYIRKGPFLATINQQWAAFPMLLVLAIPAVYLYGSILTRPQTGELRPWQAVHNIFGLIFVPLALGQFVDVIGGTPTAQLNVFWITAATAALAFYAGARAGVRVQFLLGSIAVIISWTALWDKLLSDGIGAHWGVYRGLLGILAIGLLAGALYVWRTNPGGDDVGASATAPSGDLGLWKASELVTGAGIAAVIACALGI
Ga0307470_116899481 3300032174 Hardwood Forest Soil RPMNEYLRPDDTRDSLREIGGLLIALAAAMIYIRKGPFLTVNPDQWATFPMFLVVAIPAVYLYGGIFTRPQTGELRPWQAVHSVVGLIFVPFALAQFVDLIGGNPNAPLNVFWIFAVTAGLAFYAGSRAGVRVQFLLGSIAVIISWTALWDKILSGGISAHWGVYRGLLGILAIGLL
Ga0307471_1016941601 3300032180 Hardwood Forest Soil MNELLRPDDTRDALRKIGGLLIGLAAAMIYIRKGPFLSTNNQQWAAFPMFLVLAIPAVYLYGSILTRPQTGELRTWQAVDNVFGLIFIPLALGQFVDVIGGTPTAALNVFWIAGATAAAAFYAGARQGVRVQFLLGSIAVIVAWTALWDKILSGGIGAHWGIYRGLLGILAIGLLAAALYVWRNNPGGDDIASSATAPSGDLGLWKASELVTGAGIAAV
Ga0326730_11045921 3300033500 Peat Soil RKLGGLLLGLGALMIYIRKGPFLTVNPSQWASFPIFLVLAIPAAYLYGGAITSRPQTGELRPWQVVHSVFGLVFVALALLQFVDVIGGNPNAPLNLFWVFLVTAGLAFYAGTVLGVRVQLLLGSIALIVSWTALWDKLLSGGIGAHWGIYRGLLGLLAIGLLAGGRYLWRNNPGG
Ga0247830_103010691 3300033551 Soil MNDALRPDDTRDALRKVGGLLIGLAALMIYIRKGPLLATNNQQWAAFPIFLVVAIPAVYLYGSILTRPQTGELRPWQAVHNVLGLIFVPFALGQLVDVIGGTPTASLNVFWIFAVTAALGFYAGARAGVRVQFLLGSIAVIVSWTALWDKILSGGIGAHWGIYRGLLGILAIGLLAAALYVWRNNPGGDDVGASATAPSGDLGLWKASELVTGAGIAAVIGCALGITAIGNLNPLSGSTPPIQTSNFWDIMLLLVSLGLVAIGSQI
Ga0247830_108857691 3300033551 Soil RDALRKLGGLLIGLAALMIYIRKGPFLAGNPNQWAAFPMLLVLAIPAVFLYGSLLAVPQTGELRPWQVAHNAFGLVFIPLALGEFIDVIGGTPTASLNVFWIAAATAAFAFYAGSRAGVRVQFLLGSIALIVSWTALWNKILDNGVGAHWGIYRGLLGILSIGLLAGALYLWRNNPGGDNLAASATAPAGDLGLWKASELVTGAGIAAVIACGLGITAIGNLNPLSGSTPPIETSNL
Ga0326723_0219372_143_844 3300034090 Peat Soil MNDLFKPDDSRDALRKIGGLLLGLGAAMIYIRKGPFLTVNPSQWASFPMFLVLAIPAVYLYGGILTKPQTGELRPWQLVHSVFGLIFVPLALSQFVDMIGGNPNAALNVFWILLVTAGLAFYAGIVAGVRVQLLLGSIALIISWTALWDKLLSGGVGAHWGVYRGLLGLIAIGLLAGGLYLWRNNPGGDEVAASATAPSGDLGLWKASELLTGAGIAAVIACSLGITALGNLNP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice