NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F092785

Metagenome / Metatranscriptome Family F092785

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F092785
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 249 residues
Representative Sequence MEAHWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGAGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKK
Number of Associated Samples 93
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(18.692 % of family members)
Environment Ontology (ENVO) Unclassified
(24.299 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.336 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 44.56%    β-sheet: 6.46%    Coil/Unstructured: 48.98%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 107 Family Scaffolds
PF02649GCHY-1 1.87
PF00873ACR_tran 1.87
PF01261AP_endonuc_2 1.87
PF00106adh_short 0.93
PF00486Trans_reg_C 0.93
PF06415iPGM_N 0.93
PF11897DUF3417 0.93
PF10546P63C 0.93
PF10987DUF2806 0.93
PF13533Biotin_lipoyl_2 0.93
PF02653BPD_transp_2 0.93
PF13649Methyltransf_25 0.93
PF01553Acyltransferase 0.93
PF01676Metalloenzyme 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 107 Family Scaffolds
COG1469GTP cyclohydrolase FolE2Coenzyme transport and metabolism [H] 1.87
COG0696Phosphoglycerate mutase (BPG-independent), AlkP superfamilyCarbohydrate transport and metabolism [G] 0.93


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil18.69%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil15.89%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere14.02%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil7.48%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.61%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil4.67%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.74%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.80%
Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil1.87%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.87%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.87%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.93%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.93%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.93%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.93%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.93%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300002906Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cmEnvironmentalOpen in IMG/M
3300002910Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cmEnvironmentalOpen in IMG/M
3300003368Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006354Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010341Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300016371Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016422Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021358Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R3Host-AssociatedOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021439Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R03EnvironmentalOpen in IMG/M
3300021444Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R02EnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025905Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026356Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-AEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027502Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027824Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032044Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f20EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_1011404933300001661Forest SoilMEAHWYKARECYRVWIPARLSENGKRCRRFFATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGDLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVV
JGI12627J18819_1005221833300001867Forest SoilMEALWYKSRRCWRVIVPSRLSETGKRCRRFFQTKGEAEKFILEIKRRGSVQLADLSIEEIHVLGVIRQSQKYAPALLLEAWQRFESDGLRDGKLTVQELSEKFLARQIAERRSAQTIADDRWRLKTFSRVVGKSRVAVIKRSDILGYLEGIPPGTNRRSHYKALRKMWRWAFDLGHVEYDPMAKLRPLDTWGVNNEILSFELFQRFLRVIQALEPPREGVEPTARYKRLLPYFVLGGFQGLRTCEMVRERADYPVVEWRDFLWKKNLLVVRDEVAKQTRARDRLRYV
JGI25614J43888_1006119113300002906Grasslands SoilMESNWYEPRKCYQVIVPARLSEKGKRCRRFFATKTEAEKFIFETKRQGSVQLAELAVEEKHVLGVIRQSERYEPALLLEAWQRFEKEGIGEAGNLTVQELAEKFLARQKAEGRSARTVIDDRWRLNALTKAIGHLRAGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRLLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVA
JGI25615J43890_102842413300002910Grasslands SoilEKEQHTGRVWKRTGIRQGNVIGFGYLQDCQRNGKRCRRFFATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQAEKYEPTLLLEAWRRFEKEGTDENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNTMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQALEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIKLL
JGI26340J50214_1005654713300003368Bog Forest SoilVEAKWYKARNCYRVWVPARLSEKGKDCRRFFETKEQAEKFIFEAKRSGSVELAELAVEEKHVLGVIRQSQNYEPRLLLEAWQRFQSQGMGNGSNLTVQQLCEKFLTRQIAERRSVQTLADDRWRLNAFSRAVGHARAAAVKRSDILRYLEGIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMNRLKPLDSWGVNNEVLSVELFQRFLRVTQGLEAPREGVNVSGKYRRLLPYFVLGGLQGLRTCEMVRERADYPVIEWRDFK
Ga0062386_10065829223300004152Bog Forest SoilVEAKWYKARKCYRVWVPARLSEKGRECRRFFETKEQAEKFIFETKRSGSVEIAELAVEEKHILGVIRQSQKYEPGLLLEAWQRFEREEIGEDRNLTVLELAEKFLARQKAQGRSTRTLIDDRSRLKAMTTALGHFRAGAVKRADILRYMEEIAPGTNRRSHHKTLRKLWRWAHDLGHVGNDPMAKLKPLDEWGVNNEVISPELFQRLLRVVQGLEGPRDGLEETQKYRGLLSYLVLG
Ga0062386_10084336913300004152Bog Forest SoilNCYRVWVPARLSEKGKDCRRFFETKEQAEKFIFEAKRSGSVELAELAVEEKHVLGVIRQSQNYEPRLLLEAWQRFQSQGMGNGSNLTVQQLCEKFLTRQIAERRSVQTLADDRWRLNAFSRAVGHARAAAVKRSDILRYLEGIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMNRLKPLDSWGVNNEVLSVELFQRFLRVTQGLEAPREGVNVSGKYRRLLPYFVLGGLQGLRTCEMVRERADYPVIEWRD
Ga0062386_10085463913300004152Bog Forest SoilEPRKCYRVWIPVRLSESGKRYRRFFETKEQAQKFIFETKRNGSIELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGSGNAADLTVQELCEKFFARQIAERRSAQTLADDRWRLNAFSREVGQARAAGVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFERFLRVTRGLEAPREGVEASGKYNPLLPYFVLGGLQGLRTCEMVRERADYPV
Ga0066388_10114804423300005332Tropical Forest SoilVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFILETKRSGSVELAELAVEEKHILGVIRQSEKYEPALLLEAWRRFESEGIGNGSHLTVQQLCENFFGRQMAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAMPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQAPRDGLESTAKYKPLLAVFCPRGSPRVTNLRNGERTRRLSGH*
Ga0068868_10083751713300005338Miscanthus RhizosphereKSNIVALVEPHWYEPRKCYRVWIPVRLSESGRRYRRFFETKEQAQKFIFETKRSGSIELADLAVEEKHVLGVIRQSERYEPKLLLEAWRRFESGGSGNGSNLTVKQLCENFFARQVAERRSPQTLGDDRWRLNAFSRLMGQAKVAAMKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVENDPMARLKPLDAWGVNNEVLSVELFQRFLRITQGMETPRNGVEVVTKYKPLLPYFVLGGLQGLRTCEMVRERADYPVVEWRDFLWNKELLV
Ga0070713_10074726713300005436Corn, Switchgrass And Miscanthus RhizosphereLAKKTWKNSNKQQHSGGVEAKWYKSRKCYRVWIPARLSENGKACRRFFNTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFRRLLRVARGLEAPREGSKVTEKYKG
Ga0070710_1020331133300005437Corn, Switchgrass And Miscanthus RhizosphereMEAHWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILHTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGADENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFSAGRARAGRAKRRTKGDSEIQ
Ga0070706_10025945813300005467Corn, Switchgrass And Miscanthus RhizosphereKWYKARNCYRVWVPARLSQNGKDCRRFFETKEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRSPRTLIDDRSRLNAMTKVIGHIRAGAVKRADLLRYMEGIAPGTNRRSHYKTLKKLWHWAHDLGHVANDPMAKLKPLDEWGVNNEVLSPELFQRFLRVVQGLEGPREGVGATQQYQGLLAYFVLGGLQGLRTCEMVRERAKDPVVQWRDFLLEWIHFPPKCLQASISKHRM*
Ga0070707_10036796013300005468Corn, Switchgrass And Miscanthus RhizosphereMEAHWYKARECYRVWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQGFLRVVPGLEEPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRY
Ga0070698_10067408313300005471Corn, Switchgrass And Miscanthus RhizosphereVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSTRTVLDDRWRLNAMTRALGHLRVGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFQRLLRVARGLEAPREGLKVTEKYKGLVPYFVLGGLQGL
Ga0070699_10014674333300005518Corn, Switchgrass And Miscanthus RhizosphereVEAKWYKARNCYRVWVPARLSQNGKDCRRFFETKEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRSPRTLIDDRSRLNAMTKVIGHIRAGAVKRADLLRYMEGIAPGTNRRSHYKTLKKLWHWAHDLGHVANDPMAKLKPLDEWGVNNEVLSPELFQRFLRVVQGLEGPREGVGATQQYQGLLAYFVLGGLQGLRTCEMVRERAKDPVVQWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIELLKPLATTGAVIPVAD
Ga0070697_10024491933300005536Corn, Switchgrass And Miscanthus RhizosphereAHWYKARECYRVWIPARLSENGKRCRRFFATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLERWAMQSQP*
Ga0070697_10038913613300005536Corn, Switchgrass And Miscanthus RhizosphereVEAKWYKARKCSRVWVPARLSQNGKHCRRFFETKEQAEKFIFETKRSGGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRSPRTLIDDRSRLNAMTKVIGHIRAGAVKRADLLRYMEGIAPGTNRRSHYKTLKKLWHWAHDLGHVANDPMAKLKPLDEWGVNNEVLSPELFQRFLRVVQGLEGPREGVGATQQYQGLLAYFVLGGLQGLRTCEMVRERAKDPVVQWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIELLKPLATTGAVIPVADSTFYCL
Ga0070697_10163760213300005536Corn, Switchgrass And Miscanthus RhizosphereELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGNLTVQELAEKFLVRQRAERRSARTVLDDRWRLNAMTRALGHFRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVATDPMGTLRPLDNWGVNNEVLSTELFQRLLRVARGLEAPREGLKVTEKYKGLVPYFV*
Ga0070715_1045903213300006163Corn, Switchgrass And Miscanthus RhizosphereTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGADENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARD
Ga0075021_1087600813300006354WatershedsSGKRYRRFFETKEQAQRFIFETKRSGSIELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGSGNGADLTVQQLCEKFFARQIAERRSAQTLADDRWRLNAFSREVGQARAAAVKRSEILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFMRVTQGLE
Ga0099793_1012247313300007258Vadose Zone SoilMEAHWYKARECYRVWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAQLFQRFLRVVQGLQGPREGLKATQKYRGLLVYFVLGG
Ga0126373_1034225013300010048Tropical Forest SoilMEARWYKSRNCYRVWVPARLSENRKNCRRFFETKEQAEKFILETKRNGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEISDDRNLTVQELAEKFLARQKAQGRSIRTIIDDRSRLNAMTKVLGQTRAGAVKRAHILRYVEGIPPGTNRRSHYKTLRKLWRWAHDLGHVQNDPMAKLKPLDEWGVNNEVLSPELFQRFLRVIQGLESPREGLEPTIQFKGLLAFFV
Ga0074045_1057656313300010341Bog Forest SoilKEQAQKFIFETKRNGSIELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGSGNGANLTVQQLCEKFFARQVAECRSAQTLADDRWRLNAFSRDVGQARAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMNRLKPLDSWGVNNEVLSVELFQRFLRVTQGLEAPREGVNVSGKYIRLLPYFVLGGLQGLRTCEMVRERADYPVIEWRDFKWEKNLIVVR
Ga0126372_1126554323300010360Tropical Forest SoilLFETKEQAQKFIFETKRSGSVELAELAVEEKHVLGVIRQSEKYEPRLLLEAWRRFESEGTGNGSNFTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAATVKRSEILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLEGPRDGVESSAKYKPLLPYFILGGLQ
Ga0126378_1049313213300010361Tropical Forest SoilNLEKEQQRATESNILAEVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFILETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGIGNGSNLTVQQLCENFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAMPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQAPRAVWSQPPNINPSWPYFVLGGSPRVTNLRNGERTRRLSGH*
Ga0126379_1312520813300010366Tropical Forest SoilQKFIFDTKRSGSVELAELAVEEKHVLGVVRQSEKYEPRLLLEAWRRFESEGTGNGSNFTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLEGPRDGV
Ga0126381_10017380933300010376Tropical Forest SoilMEAHWYEPRECYRVWIPARLSENGKRHRRFFATKTEAQKFIFQIKRQGSVQLAELGIDEKHVLGVIRQSGKYKPRLLLEAWQRFEREETGDGTSLTVQELAEKFLARQKSEGRSARTVIDDRWRLNALAKTMGHLRAGAVKRADMLRYLEGIPPGTNRRSHHKTVRKLWRWAHDLDHVQNDPMAKLKPLDQWGVNNEVLSPELFQRFLRVAQGLEAPREGAEVTEKYKRLLPYFVLGGLQGLRTCEIIRERVGDPVIEWPDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATVKVLKPLVGDGPVMPMASKAFYSMRQ*
Ga0126383_1115574113300010398Tropical Forest SoilVEAHWYEPRECYRVWVPARLSENRKRYRRFFATKEQAQKFILETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGIGNGSNLTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQAPRDGLESTAKYR
Ga0134121_1128189613300010401Terrestrial SoilRKCYRVWVPVRLSESGKRYRRFFETKEQAQKFIFETKRSGSVELADLAVEEKHVLGVIRQSEKYEPKFLLEAWRRFESEGSGNGSNFTVQQLCENFFARQLAERRSPQTLGDDRWRLNAFSRVMGQAKAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVENDPMARVKPLDAWGVNNEVLSVELFQRFLRITQGMEAPRVGVEVGAKYKPLLPYFVLGGLQGLRTCEMVRERAD
Ga0137383_1083715913300012199Vadose Zone SoilFSIRKRKPKKFILDTKRKGLVDWAELAVEEKHVLGVIRQSQKYGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLEHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRFARGLEAPREGDIHGLRSFRRR*
Ga0137363_1120934813300012202Vadose Zone SoilENGKACRRFFSAKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYAPGLLLEAWRRFEEEGDGEVRKLTVQELAEKFLARQRAERRSTRTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEGLSTELFQRLLRGGRGLEAPREGLKVTEKYKGLVPYF
Ga0137399_1129598113300012203Vadose Zone SoilEQQFILETKRKGLVDWAELAVEEKQVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLGHVAKDPMGTLRPLDTWGVNNEVLSTELFQRLLRVARGLEAPREGLKVTEKYKGLVPYFVLGGLQ
Ga0137362_1006179413300012205Vadose Zone SoilMGRLAGDFSIRKRKPKKFILDTKRKGLVDWAELAVEEKHVLGVIRQSQKYEPGLFLEAWRRFEEEGDGEVGKFQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRFARGLEAPREGLKVTEKYKGLVPYFVLGGLQGLRTCEIIKEHGNDPVIEWRDFLWKKKLVVVR
Ga0137360_1036662323300012361Vadose Zone SoilMEAHWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGAGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKK
Ga0137361_1012369123300012362Vadose Zone SoilVEAKWYKSRKCYRVWIPARLSENGKACRRFFNTKTEAEKVHFRHEAKGLGGLGGTSSGGKACVGGIRQSQKYGLLLEAWRRFEEEGDGEIGNLTVHELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLEHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRFARGLEAPREGDIHGLRSFRRR*
Ga0137361_1049794913300012362Vadose Zone SoilMEAHWYKARECYRVWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKEMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDTWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIKLLKPLAT
Ga0137398_1032277013300012683Vadose Zone SoilMSENGRACRRFFSTKTEAEKFILETKRKGLVDWAELAVEEKHVLGVIRQSEKYAPGLLLEAWRRFEEEGDGEVRKLTVQELAEKFLSRQRAERRSTRTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLGHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRVAQGLEAPRQGLKVTEKYKGLAPYFVLGGLQGLRTCEMIKEQGKDPVIEWRDFLWKKNLV
Ga0137359_10006479123300012923Vadose Zone SoilVRGQKGRIGQKTRKNSNRQQHSGGMEAKWYQSRKCYRVWIPARMSENGKACRRFFSTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSTRTVLDDRWRLNAMARALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFQRLLRVTQGLEAPRAGLKVTEKYKGLVPYFVLGGLQRTCEIIKEDGNDPVIEWRDFLWKKKLVVVRDGKKWFNLPEKTLLEGSAKVAQKRKTRSSVSMTNL*
Ga0137419_1003964753300012925Vadose Zone SoilLQTKRQGSVQLAELGVEEKHVLGVIRQAEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIKLLKPLATTGAMIPVADSTFYGMRQELC
Ga0137416_1005446613300012927Vadose Zone SoilRCRRFFATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGDLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIQLLKPLATAGATIPVADSTFYALA*
Ga0137416_1147489313300012927Vadose Zone SoilVDWAELAVEEKHVLGVIRQSEKYAPGLLLEAWRRFEEEGDGEVRKLTVQELAEKFLARQRAERRSTRTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLGHVAKDPMGTLRPLDTWGVNNEVLSTELFQRLLRVIQGLEAPRGGLKVTEKYKGLVPYFVLGGLQG
Ga0137404_1125871513300012929Vadose Zone SoilENGKACRRFFTTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFHLEHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRFVRGLEAPREGDIHGLRSFRRR*
Ga0126369_1020071123300012971Tropical Forest SoilLSQNGKECRRFFETNEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPKLLLEAWQRFEKEEIGEDRNLTVQELAEKFLARQKAQGRAPRTLIDDRSRLSSMIKVMGHIRAAAVKRSDILRYMEGIAPGTNRRSHYKTLKKLWRWAHDLGHVASDPMAKLKHMDEWGVNNEVLSPELFQRFLRVIQGLESPREGLEPTMQFKELLSFFVLGGLQGLRTCEMIRERAKDPVIEWRDFLWKKNLIGCATQSQSKPDHVTDLGMYPLNQSQSNC*
Ga0157378_1008275253300013297Miscanthus RhizosphereLDSSLKSKAKKRWNGQENLEKEQQRATYWALVEPHWYEPRKCYRVWVPVRLSESGKRYRRFFETKEQAQKFIFETKRSGSVELADLAVEEKHVLGVIRQSEKYEPKFLLEAWRRFESEGSGNGSNFTVQQLCENFFARQLAERRSPQTLGDDRWRLNAFSRVMGQAKAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVENDPMARVQPLDAWGVNNEVLSVELFQRFLRITQGMEAPRVGVEVGAK
Ga0137418_1030799313300015241Vadose Zone SoilMEAHWYKARECYRVWIPARLSENGKRCRRFFVTKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIQLLKPLATAGATIPVADSTFYALA*
Ga0182036_1026622623300016270SoilMLTAVEAHWYEPRKCYRVWVPARLSQNGKRYRRFFATKEQAQKFIFETKRSGSVEIAELAVEEKHVLGVIRQSEKYEPTLLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRLLRVTQGLEAPRGGVESSAKYKPLLPYLSWADSRAYEPAKWSENASIIRSLSGAIFCGTSNCW
Ga0182041_1041685513300016294SoilVEAKWYKARKCYRVWVPARLSQNGKECRRFFETKEQAEKFILETKRSGSVELAELAVEEKHILGVIRQSQKYEPKLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRAPRTLIDDRSRLTSMIKVMGHIRAAAVKRSDILRYMEGIAPGTNRRSHYKALKKLWRWAHDLGHVVSDPMAKLKPMDEWGVNNEVLSTDLFQRFLRVVQGFEGPREGLEAIQQYKALLPYFVLGGLQGLRTCEMVRERAKDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIKLLEPLAETGMVIPIADSTFYGLRQKL
Ga0182033_1072190313300016319SoilVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFILETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGIANGSNLTVQQLCENFFCRQMTERRSPQTLSDDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAMPPGINRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQASRDGLESTAKY
Ga0182034_1050496913300016371SoilMVSMEPNWYAARDCYQVIVPARLSENGRRRRRFFATKAEAEKFILETKRRGSVELTELAAEEKHVLGVIRQSERYEPGLLLQAWQRFEKEGIFENGNLTVEELAEKFLSRQIAERRSARTVLDDRWRLNALTNALGHLRACAVKPAEILRYLEGLPPGTNRRSHYKTLRKLWRWAFSLGHIETDPMAKLKPLDPWGVNKEVITPELFQRFLRVVQGLQGPREGLPPTEKYKGLVAYFVLGGLQGLRTCEMIKERATIPW
Ga0182034_1087978313300016371SoilVPARLSENGKRYRRFFATKELAQKFIFETKRSGSVEVAELAVEKKHVLGVIRQSEKYEPTLLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFKRLLRVTQGLEAPRGGVESSAKYKPLLPYFVLGGLQGLRTCEMVRERFDYPVIEWRDFLWNKQLLVV
Ga0182040_1151848813300016387SoilEKHVLGVIRQSEKYEPTLLLEAWRRFESEGTGNGSNFTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDTWGVNNEVLSVELFQRFLRVTQGLQAPRDGLESTAKYRPLLAYFVLGGLQGLRTCEM
Ga0182039_1029649023300016422SoilVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFIFETKRSGSVELAELAVEEKHVLGVIRQSEKNEPALLLEAWRRFESEGIGNGSNLTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAMPPGINRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFHRFLRVTQGL
Ga0182038_1027596113300016445SoilVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFIFETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGIANGSNLTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLEAPRDGVESSTKYKPLLPYFILGGLQGLRTCEMVRERADYPVVEWRD
Ga0182038_1214971713300016445SoilVDLSVEEMHVLGVIPQSQKYTPALLLQTWQRFENEGVHNGNLTVQQLCEKFLARQKAEGRSAQTLMDDRWRLNAFCRVLGSGRSGAVKRSDVLGYLESIGPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDSWGVNNEVLNVELFRRFLRVAQGLEAPREGT
Ga0179592_1016229423300020199Vadose Zone SoilMSENGKACRRFFNTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVRRADILGHMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEILSTELFQRLLRVTQGLEAPREGLKVTDKYEGLVPYFVLGGLQGLRTCEIIKEHGNDPVI
Ga0210395_1082220913300020582SoilVEAKWYKARNCYRVWVPARLSEKRRDCRRFFETKEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRYEKEEIGDDRNLTVQELAEKFLARQKAQGRATRTLIDDRSRLNAMTKVIGHTRAGAVKRADILRYMEGIAPGTNRRSHYKTLKKLWRWAHDLGHVVNDPMAKLKPLDEWGINNEVLSPELFQRFIRVVQGL
Ga0210400_1069151413300021170SoilMEAYWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGAGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGP
Ga0210405_1042834013300021171SoilMSENGKACRRFFSTKTEAEKFILETKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLTTELFQRLLRVTQGLEAPREGLKVTEKYEGLVPYLVLGGLQGLRTCEIIKEHGNDPVIEWRDFLWKKKLVVV
Ga0210408_1034034813300021178SoilSGSVELAELAVEEKHILGVIRQSQKYEPSLLLEAWQRYEKEEIGDDRNLTVQELAEKFLARQKAQGRSTRTLIDDRSRLNAMCKVTGDIRAGGVKRADILRYMEGIAPGTNRRSHYKSLKKLWRWAYDLGYLLNDPMAKLKPLDEWGVNNEVLTPELFQRFLRVVQGVEGPREGLEATQQYKGLLAYFVLGGLQGPRTCEMVRERAKDPVIQWRDFLWKKQLIVVRDEVAKQTRARDRLR
Ga0210388_1074726813300021181SoilELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGFGNGANLTVQLLCEKFFARQIAERRSAQTLADDRWRLNAFSREMGQARAAAVKRSNILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLAHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTRGLEGPREGVEVSVKYKPLLSYFVLGGLQGLRTCEMVRERADYPVVEWRDFLWNKKLLVVRDEVAKQTRARDRLRYVPLEPVSVELLRSLTGDGPVIQVARRAFQGLRRELCKEMRIRWPE
Ga0213873_1006015613300021358RhizosphereMEALWYKPRERWQVIVPTRLSETGKRCRRFFATKAEAERFILEIKRRGSVQSADLSVEEMHVLGLIRHSKKYAPGLLLEAWRRFESEGIGDDGRLTVQELTEKFLARQVAERRSSRTLADDRWRLNAFGRAFGHVNATGVKRTHILQYLEKIPPGTNRRSHYKTLKKLWRWALDLGHIEHDPMLRLKPLDAWGANKEVLGPELFQRVLRVAQGLESPRDGLEPVLRYKRLVPYFVLGGLQGLRTCEIVREHGADPVIEWSDILWKKKLVVVRDE
Ga0210393_1096662113300021401SoilATESNILLLVEAHWYEPRKCYRVWIPVRLAESGKRYRRFFETKEQAQKFIFETKRNGSIEFADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGPSNGANLTVQRLCEKFFARQKAERRSVQTLGDDRWRLNAFSREVGQARAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLAHVEHDPMARLKPLDDWGVNNEVLSVELFQHFLRVTRGLEGPREGV
Ga0210385_1017662913300021402SoilVRLSESGKRYRRFFETKEQAQKFIFETKRNGSIELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGFGNGANLTVQLLCEKFFARQIAERRSAQTLADDRWRLNAFSREMGQARAAAVKRSNILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLAHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTRGLEGPREGVEVSVKYKPLLSYFVLGGLQGLRTCEMVRERADYPVVEWRDFLWNKKLLVVRDEVAKQTRARDRLRYVPLEPVSVELL
Ga0210387_1051870113300021405SoilVEAKWYKARNCYRVWVPARLSEKGKDCRRFFETKEQAEKFIFEAKRSGSVELAELAVEEKHVLGVIRQSQNYEPRLLLEAWQRFQSERVGNEPNLTVEQLCEKFLKRQIAERRSVQTLADDRWRLNAFSRALGHARVPAVKRSDVLRYLEAIPPGTNRRSHYKTLRKLWRWAFDLGYVEHDPMNRLKPLDTWGVNNEILSVELFQRFLRVTQGVEAPRQGVGVSGKYKRLLPYFVLGGLQGLRTCEMVRERADYPVIEWRDFKWEKNLIVVRDEVA
Ga0213879_1006132213300021439Bulk SoilMEARWYKSRNCYRVWVPARLSENGKNCRRFFETKEQAEKFILETKRNGSVELAELAVEEKHILGVIRHSEKYEPRLLLEAWQHFEKEEIGDDRNLTVQELAEKFLARRKAQGRSTRTIIDDRSRLNAMTKVLGQMRAGAVKRADILRYVEGIAPGTNRRSHYKTLRKLWRWAHDLGHVENDPMAKLKPLDEWGVNNEVLSPELFQRFLRVIQGLESPREGVEPTIQFRGLL
Ga0213878_1016870313300021444Bulk SoilMEARWYKSRNCYRVWVPARLSENRENCRRFFETKEQAEKFILETKRNGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTLQELAEKFLARQKAQGRSTRTIIDDRSRLNAMTKVLGQNRAGAVKRADILRYVEGIAPGTNRRSHYKTLRKLRRWAHDLGHVENDPMAKLKPLDEWGVNNEVLSPKLFQRFLRIIQGLEGPREGLEPTMQFKGLLSFFVL
Ga0210390_1020276723300021474SoilRYRRFFETKEQAQKFIFETKRNGSIEFADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGPSNGANLTVQRLCEKFFARQKAERRSVQTLGDDRWRLNAFSREVGQARAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLAHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTRGLEGPREGVDVSGKYKSLLPYFVLGGLQGLRTCEMVRERSDYPVVAWRDFLWNKKLLNYWSCATKWLSKQGRVTG
Ga0126371_1145140513300021560Tropical Forest SoilFFETKEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPKLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRAPRTLIDDRSRLTSMIKFMGHIRAAAVKRSDILRYMEGIAPGTNRRSHYKTLKKLWRWAHDLGHVASDPIAKLKPMDEWGVNNEVLSPDLFQRFLRVVQGFEGPREGLDATEQYKGLLPYFVLGGLQGLRTCEMVKERAKDPVIEWRDFLWKKRLIVVRDEVAKQTRARDKLRYVPLEPATIQLLEPL
Ga0207653_1045526513300025885Corn, Switchgrass And Miscanthus RhizosphereEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRSPRTLIDDRSRLNAMTKVIGHIRAGAVKRADLLRYMEGIAPGTNRRSHYKTLKKLWHWAHDLGHVANDPMAKLKPLDEWGVNNEVLSPELFQRFLRVVQGLEGPREGVGAT
Ga0207685_1031807213300025905Corn, Switchgrass And Miscanthus RhizosphereFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGADENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKTMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDRLRYVPLEPLTIK
Ga0207684_1080096313300025910Corn, Switchgrass And Miscanthus RhizosphereKWYKARNCYRVWVPARLSQNGKDCRRFFETKEQAEKFIFETKRSGSVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVQELAEKFLARQKAQGRSPRTLIDDRSRLNAMTKVIGHIRAGAVKRADLLRYMEGIAPGTNRRSHYKTLKKLWHWAHDLGHVANDPMAKLKPLDEWGVNNEVLSPELFQRFLRVVQGLEGPREGVGATQQYQGLLAYFVLGGLQGLRTCEMVRERAKDPVVQWRDFLLEWIH
Ga0207693_1053428913300025915Corn, Switchgrass And Miscanthus RhizosphereEKEQQRATYWALVEPHWYEPRKCYRVWVPVRLSESGKRYRRFFETKEQAQKFIFETKRSGSVELADLAVEEKHVLGVIRQSEKYEPKFLLEAWRRFESEGSGNGSNFTVQQLCENFFARQLAERRSPQTLGDDRWRLNAFSRGMGQAKAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFMRVTQGLEAPREGVQASGKYKPLLPYFVLGGLQGLRTCEMVRERADYPVIEWRDFKWKKKLIVVRDEVAKQTRARDKLRYVPLEA
Ga0207700_1081437813300025928Corn, Switchgrass And Miscanthus RhizosphereMEAHWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILHTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGADENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEEPREGLKATQKYKGLLVYFVVGGLQ
Ga0209240_100056853300026304Grasslands SoilLSENGKRCRRFFATKAEAETFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGAGENGNLSVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQALEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLR
Ga0209647_1000579363300026319Grasslands SoilMSENGKACRRFFSTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRLAFHLGHVAKDPMGTLRPLDNWGVNNEVLSTELFQRLLRVTQGLEAPRGGLKVTEKYKGLVPYFVLGGLQGLRTCEIIKEHGNDPVIEWRDFLWKKKLVVVRDEVAKQTRARDRLRYVPLEAATVKIL
Ga0209647_100408413300026319Grasslands SoilMEAHWYKARECYRVWIPARLSENGKRRRRFFATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSKKYEPTLLLEAWRRFEKEGTGENGNLTVQELAQKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGAREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRY
Ga0257150_101516413300026356SoilVLGVIRQSEKYEPGLLLEAWRRFEKEGDGEAGNLTVQELAEKFLLRQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVRRADILGHMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEILSTELFQRLLRVTQGLEAPREGLKVTEKYKGLVSYLVLGGLQGLRTCEIIKE
Ga0257161_100395933300026508SoilVRGQKGRLGQKNREKQQHCGGVESKWYKSRKCYRVWIPARLSENEKACRRFFNTKTEAEKFILDTKRKGLVDWAELAVAEKHVLGVIRQSEKYEPDLLLEAWRRFEEEGDGEVGKLTVQELAEKFLVRQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVRRADILGHMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEILSTELFQRLLRVTQGLEAPREGLKVTEKYKGLVSYLVLGGLQGLRTCEIIKEHGNDPVIEWRDFLWKKKLVVVRDEVAKQTRA
Ga0257161_104238713300026508SoilATKTEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQAEKYEPTLLLEAWRRFEKEGTGENGDLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRSRDKLRYVPLEPATIQLLKPLATAGATIPVADSTFYGMRQELCKEMRIRWPENC
Ga0209648_1004736213300026551Grasslands SoilMEAHWYKARECYRIWIPARLSENGKRCRRFFATKAEAEKFILQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGAGENGNLSVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSPELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDK
Ga0209622_107702813300027502Forest SoilDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLTTELFQRLLRVTQGLEAPREGLKVTEKYEGLVPYLVLGGLQGLRT
Ga0208989_1007412313300027738Forest SoilVDWAELAVAEKHVLGVIRQSEKYEPDLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALDHLRVGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFQRLLRVTQGLEAPREGLKVTEKYEGLVPYLVLGGLQGLRTCEIIKEHGNDPVIEWRDFLWKKKLVVVRDEVAKQTRARERVSENFVCGSQREIKDGSDQEEG
Ga0209040_1003697643300027824Bog Forest SoilVRLSESGKRYRRFFETKEQAQKFIFETKRNGSIELADLAVEEKHVLGVIRQSQKYEPRLLLEAWRRFESEGSGNAADLTVQELCEKFFARQIAERRSAQTLADDRWRLNAFSREVGQARAAGVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFERFLRVTRGLEAPREGVEASGKYNPLLPYFVLGGLQGLRTCEMVRERADYPVIEWRDFKWKKKLI
Ga0209465_1048963213300027874Tropical Forest SoilFFETKEQAQKFIFETKRRGSVELAELAVEEKHILGVIRQSEKYEPALLLEAWRRFESEGIGNGSNLTVQQLCENFFGRQMAERRSPQTLADDRWRLNAFSRGMGQSRAAAVKRSDILGYLEAMPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFHRFLRVTQGLQAPRDGLESTTKYRPLLAY
Ga0137415_1007779043300028536Vadose Zone SoilLQTKRQGSVQLAELGVEEKHVLGVIRQSEKYEPTLLLEAWRRFEKEGTGENGDLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKAMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDAWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKYKGLLVYFVLGGLQGLRTCEMIRERANDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIQLLKPLATAGATIPVADSTFYALA
Ga0073994_1234971713300030991SoilKTEAEKFILETKRKGLVDWAELAVEEKHVLGVIRQSEKYAPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLVRQRAERRSARTVLDDRWRLNALTRALGHLRVGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFQRLLRVARGLEAPREGSKVTEKYKGLVPYFVLGGLQGLRTCEIIKEHGNDPVIEWR
Ga0073994_1237782413300030991SoilKNMCSGVIRQSEKYEPTLLLEAWRRFEKEGTGENGNLTVQELAEKFVARQKAEGRSARTVIDDRWRLNAMTKEMGHLRVGAVKRADILRYMEGIAPGTNRRSHHKTLRKLWRWAHDLGHVENDPMAKLKPLDTWGVNNEVLSAELFQRFLRVVQGLEGPREGLKATQKY
Ga0170824_11461449113300031231Forest SoilRKCYRVWIPVRLSESGKRYRRFFETKEQAQKFIFDTKRNGSIELADLAVEEKHILGVIRQSQKYEPRLLLEAWRRFESEGSGSGANLTVQQLCEKFFARQKAERRSAQTLADDRWRLNAFSREMGQARAAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMVRLKPLDARGINNEVLSVELFQRFLRVTRGLEGPREGVGVSGKYEPLLPYFVLGGFQGLRTCEMV
Ga0170824_12858916413300031231Forest SoilVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIVEDRNLTVQELVEKFVSRQKAQGRSARTIIDDRSRLKAMTNVMGQIRAGAVKRADILRYMEGIAPGTNRRSHHKTLRKFWRWAHDLGHVGNDPMAKLKPLDEWGVNNEILSPALFQRFLRVVQGLEGPMAGLEGTQKYKRLMPYLVLGGLQGLRTCEMIRERTKYPVIEWRDFLWKKGLIVVRDEVAKQTRARDKLR
Ga0310915_1070985013300031573SoilTAVEAHWYEPRKCYRVWVPARLSQNGKRYRRFFATKEQAQKFIFETKRSGSVEIAELAVEEKHVLGVIRQSEKYEPTLLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGKSRAATVQRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQAPRDGLESTAKYRPLLAYF
Ga0307474_1029834023300031718Hardwood Forest SoilMSENGKACRRFFSTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLTTELFQRLLRVTQGLEAPREGLKVTEKYEGLVPYLVLGGLQG
Ga0307477_1056271913300031753Hardwood Forest SoilFFSTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYDPGLLLEAWRRFEKEGDGEVGKLTVQELAEKFLERQRAERRSARTVLDDRWRLNAMTRALGHLRAGAIKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLSTELFQRLLRVAQGLEAPREGSKVTEKYKGLVPYFVLGGLQGLRTCEIIKEHSNDPVIEWRDFLWKKKLVVVRDEVAKQTRARERLR
Ga0318509_1046339013300031768SoilVEAHWYEPRKCYRVWVPARLSENRKRYRRFFATKEQAQKFIFETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGIANGSNLTVQQLCENFFGRQMAERRSPQTLADDRWRLNAFSRGMGQSRAAVVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRFLRVTQGLQA
Ga0307478_1042680613300031823Hardwood Forest SoilMSENGKACRRFFSTKTEAEKFILDTKRKGLVDWAELAVEEKHVLGVIRQSEKYEPGLLLEAWRRFEEEGDGEVGKLTVQELAEKFLARQRAERRSARTVLDDRWRLNAMTRALGHLRAGAVKRADILGYMEGIPPGTNRRSHYKTLRKLWRWAFDLGHVANDPMGTLRPLDNWGVNNEVLTTELFQRLLRVTQGLEAPREGLKVTEKYEGLVPYLVLGGLQGLR
Ga0306925_1043958613300031890SoilVEAHWYEPRECYRVWVPARLSENRKRYRRFFATKEQAQKFILETKRSGSVELAELAVEEKHVLGVIRQSEKYEPALLLEAWRRFESEGRGNGSNLTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRLLRVTQGLEAPRGGVESSAKYKPLLPYFVLGGLQGLRTCEMVRERVDYPVIEWRDFLWNKQLLVVRDEVAKQTRARDKLRYVPLKLASVRLLRPIAG
Ga0306923_1056973513300031910SoilMLTAVEAHWYEPRKCYRVWVPARLSQNGKRYRRFFATKEQAQKFIFETKRSGSVEIAELAVEEKHVLGVIRQSEKYEPTLLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRLLRVTQGLEAPRGGVESSAKYKPLLPYFVLGGLQGLRTCEMVRERVDYPVIEWRDFLWNKQLLVVRDEVAKQTRARDKLRYVPLELA
Ga0310909_1076503013300031947SoilTESNMLTAVEAHWYEPRKCYRVWVPARLSQNGKRYRRFFATKEQAQKFIFETKRSGSVEIAELAVEEKHVLGVIRQSEKYEPTLLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRLLRVTQGLEAPRGGVESSAKYKPLLPYFVLGGLQGLRTCEMVRERVDYPVI
Ga0306926_1139458513300031954SoilMEAHWNKPRQCYRVWIPARLSENRKWCRRFFATKAEAEKFIFEIKRRGSVQLADLSIEEMHVLGVIRQSQKYTPASLLQAWQRFESEGVHNGNLTVQQLCEKFFARQKAEGRSAQTLMDDRWRLNAFCRVLGPGRSGAVKRSDVLGYLEGIGPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDPWGVNNEVLSVELFQSLLRVALGLEAPRQGLERSERFKGLVPYLVLGAFQGLRTCEMVRESADCPVIEWRDFIWKKKLLV
Ga0306926_1181360213300031954SoilVEAKWYKARKCYRVWVPARLSQNGKECRRFFETKEQAEKFIFETKRSGCVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVLELAEKFLARQKAQGRAPRTLIDDRSRLTSMIKAMGHMRAGAVKRPDILRYLEGIAPGTNRRSHYKTLKKLWRWAHDLGQVASDPMAKLKPMDDWGVNNEVLSPDLFQRFLRVVQALEGPREGVEATERYKGL
Ga0306922_1177497613300032001SoilLVDLSVEEMHVLGVIPQSQKYTPALLLQTWQRFENEGVHNGNLTVQQLCEKFLARQKAEGRSAQTLMDDRWRLNAFCRVLGSGRSGAVKRSDVLGYLESIGPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDPWGVNNEVLSVELFQSLLRVALGLEAPRQGLERSERFKGLVPYLVLGAFQGLRTCEMVRESADC
Ga0318558_1036450313300032044SoilLLEAWRRFESEGIGNGSNFTIQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSHAMGQSRGAAVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDAWGVNNEVLSVELFQRLLRVTQGLEAPRGGVESSAKYKPLLPYLSWADSRAYEPAKWSENASIIRSLSGAIFCGTSNCW
Ga0318533_1041026513300032059SoilVEAKWYKARNCYRVWVPARLSEKGKDCRRFFETKEQAEKFIFETKRNGSVELAELAVEEKHVLGVIRQSEKYEPRLLMEAWRRFESEETGNGSNFTVQQLCEKFFGRQIAERRSPQTLADDRWRLNAFSRGMGQSRAAVVKRSDILGYLEAIPPGTNRRSHYKTLRKLWRWAFNLGHVEHDPMARLKPLDTWGVNNEVLSVELFQRFLRVTQGLEAPRDGVESSAKYKPLLAYFVLGGLQGLRTCEMMRERADYPVIEWRDFLWNKELLVVRDEVAKQTRARDKLR
Ga0306924_1173602613300032076SoilAEKFILEIKRRGSVQLVDLSVEEMHVLGVIPQSQKYTPALLLQTWQRFENEGVHNGNLTVQQLCEKFLARQKAEGRSAQTLMDDRWRLNAFCRVLGSGRSGAVKRSDVLGYLESIGPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDPWGVNNEVLSVELFQSLLRVALRLEAPRQGLERSERFKGLVPYLVLGAFQGLRTCEMVRESADC
Ga0306920_10063449743300032261SoilVEAKWYKARKCYRVWVPARLSQNGKQCRRFFETKEQAEKFIFETKRSGCVELAELAVEEKHILGVIRQSQKYEPRLLLEAWQRFEKEEIGDDRNLTVLELAEKFLARQKAQGRAPRTLIDDRSRLTSMIKAMGHMRAGAVKRPDILRYLEGIAPGTNRRSHYKTLKKLWRWAHDLGQVASDPMAKLKPMDDWGVNNEVLSPDLFQRFLRVVQALEGPREGVEATERYKGLLPYFVLGGLQGLRTCEMVKERAADPVIEW
Ga0306920_10191939013300032261SoilMEARWYKPRQCYRVWIPARLSENRKWCRRFFATKAEAEKFIFEIKRRGSVQLADLSIEEMHVLGVIRQSQKYTPASLLQAWQRFESEGVHNGNLTVQQLCEKFFARQKAEGRSAQTLMDDRWRLNAFCRVLGPGRSGAVKRSDVLGYLEGIGPGTNRRSHYKTLRKLWRWAFDLGHVEHDPMARLKPLDPWGVNNEVLSVELFQSLLRVALGLEAPRQGLERSERFKGLVPYLVLGAFQGLRTCEMVRE
Ga0310914_1087693813300033289SoilLLLEAWQRFEKEEIGDDRNLTVLELAEKFLARQKAQGRAPRTLIDDRSRLTSMIKAMGHMRAGAVKRPDILRYLEGIAPGTNRRSHYKTLKKLWRWAHDLGHVVSDPMAKLKPMDEWGVNNEVLSTDLFQRFLRVVQGFEGPREGLEAIQQYKALLPYFVLGGLQGLRTCEMVRERAKDPVIEWRDFLWKKKLIVVRDEVAKQTRARDKLRYVPLEPATIKLLEPLAETGMVIPIADSTFYGLRQKLCKEMRVRWPENCLRNSYA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.