NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F106122

Overview

Basic Information
Family ID F106122
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 117 residues
Representative Sequence VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPK
Number of Associated Samples 77
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 71
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
[Figure: Hidden Markov Model logo, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil
(19.000 % of family members)
Environment Ontology (ENVO) Unclassified
(22.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(70.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification Globular
Signal Peptide No
Secondary Structure distribution α-helix: 18.92%, β-sheet: 0.00%, Coil/Unstructured: 81.08%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
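
The cutoff in the note above can be written as a small check. This is an illustrative sketch only: the function and constant names are hypothetical and not part of NMPFamsDB, the 0.7 cutoff is taken from the note, and the pLDDT ranges are taken from the legend on this page. How fractional pLDDT values between two ranges are binned is an assumption.

```python
# Illustrative sketch; all names here are hypothetical.
PTM_CUTOFF = 0.7  # from the note: a model with pTM < 0.7 is low confidence

# pLDDT ranges as listed in the legend on this page.
PLDDT_BINS = [(0, 50), (51, 70), (71, 90), (91, 100)]


def is_low_confidence(ptm):
    """Return True when the pTM-score falls below the cutoff."""
    return ptm < PTM_CUTOFF


def plddt_bin(score):
    """Return the (low, high) legend range that contains a pLDDT score.

    Assumption: a fractional score between two ranges (e.g. 50.5)
    is assigned to the higher range.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie in [0, 100]")
    for low, high in PLDDT_BINS:
        if score <= high:
            return (low, high)


if __name__ == "__main__":
    # pTM-score reported for family F106122 on this page.
    print(is_low_confidence(0.37))
```

With the pTM-score of 0.37 reported for this family, the check agrees with the "Low Quality Model" label.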



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF03061 | 4HBT | 34.00
PF02036 | SCP2 | 13.00
PF14864 | Alkyl_sulf_C | 11.00
PF00440 | TetR_N | 2.00
PF00150 | Cellulase | 1.00
PF00480 | ROK | 1.00
PF12706 | Lactamase_B_2 | 1.00
PF01026 | TatD_DNase | 1.00
PF05362 | Lon_C | 1.00
PF03176 | MMPL | 1.00
PF12680 | SnoaL_2 | 1.00
PF00753 | Lactamase_B | 1.00
PF00528 | BPD_transp_1 | 1.00
PF08818 | DUF1801 | 1.00
PF13231 | PMT_2 | 1.00
PF05899 | Cupin_3 | 1.00
PF09594 | GT87 | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG1940 | Sugar kinase of the NBD/HSP70 family, may contain an N-terminal HTH domain | Transcription [K] | 2.00
COG0466 | ATP-dependent Lon protease, bacterial type | Posttranslational modification, protein turnover, chaperones [O] | 1.00
COG1033 | Predicted exporter protein, RND superfamily | General function prediction only [R] | 1.00
COG1067 | Predicted ATP-dependent protease | Posttranslational modification, protein turnover, chaperones [O] | 1.00
COG1750 | Predicted archaeal serine protease, S18 family | General function prediction only [R] | 1.00
COG2409 | Predicted lipid transporter YdfJ, MMPL/SSD domain, RND superfamily | General function prediction only [R] | 1.00
COG2730 | Aryl-phospho-beta-D-glucosidase BglC, GH1 family | Carbohydrate transport and metabolism [G] | 1.00
COG3480 | Predicted secreted protein YlbL, contains PDZ domain | Signal transduction mechanisms [T] | 1.00
COG3934 | Endo-1,4-beta-mannosidase | Carbohydrate transport and metabolism [G] | 1.00
COG4430 | Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 family | Function unknown [S] | 1.00
COG5646 | Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis) | Posttranslational modification, protein turnover, chaperones [O] | 1.00
COG5649 | Uncharacterized conserved protein, DUF1801 domain | Function unknown [S] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 100.00 %


Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 19.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 17.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 9.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 7.00%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 5.00%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 4.00%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 4.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.00%
Agave | Host-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave | 3.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.00%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.00%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 2.00%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.00%
Terrestrial | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial | 1.00%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.00%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.00%
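
The habitat paths above are hierarchical, so the distribution can be summed at any level of the hierarchy. The following is a minimal sketch of that roll-up, not the site's own implementation: the function name `aggregate` is hypothetical, and only four rows from the table are included as sample data.

```python
from collections import defaultdict

# A few rows copied from the habitat table on this page:
# (ecosystem path, % of family members). Not the full table.
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil", 19.0),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 17.0),
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere", 5.0),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 1.0),
]


def aggregate(rows, depth):
    """Sum percentages over the first `depth` levels of each path."""
    totals = defaultdict(float)
    for path, pct in rows:
        levels = [part.strip() for part in path.split("→")]
        key = " → ".join(levels[:depth])
        totals[key] += pct
    return dict(totals)


if __name__ == "__main__":
    for key, pct in aggregate(ROWS, depth=1).items():
        print(f"{key}: {pct:.2f}%")
```

Raising `depth` keeps more of the path in each key, so `depth=2` separates "Environmental → Terrestrial" from "Environmental → Aquatic".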

Associated Samples

Taxon OID | Sample Name | Habitat Type
2140918013 | Soil microbial communities from Great Prairies - Iowa soil (MSU Assemblies) | Environmental
2228664021 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
2228664022 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000890 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300001431 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly | Environmental
3300004081 | Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2) | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005562 | Agave microbial communities from Guanajuato, Mexico - As.Ma.e | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300005983 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S2T1R1 | Host-Associated
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006051 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-4 | Host-Associated
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300009789 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot28 | Environmental
3300009840 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105A | Environmental
3300010036 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot26 | Environmental
3300010037 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25 | Environmental
3300010038 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot106 | Environmental
3300010039 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56 | Environmental
3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental
3300010041 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104A | Environmental
3300010166 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27 | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300012021 | Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - T1 | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018032 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MG | Environmental
3300018073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1 | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental
3300019767 | Populus adjacent soil microbial communities from riparian zone of Oak Creek, Arizona, USA - 239 T | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300025935 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes) | Host-Associated
3300026089 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes) | Host-Associated
3300028707 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_148 | Environmental
3300028708 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_152 | Environmental
3300028721 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_355 | Environmental
3300028755 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_356 | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300028875 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143 | Environmental
3300028881 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116 | Environmental
3300028889 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300032159 | Agave microbial communities from Guanajuato, Mexico - As.Ma.e (v2) | Host-Associated
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033550 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Iowa-Corn-GraphCirc_00364230 | 2140918013 | Soil | VGFLRRKDERAIPEPGTPEFEQAVQGSAIHDSQSVSMGEPGWSEPFSSETIDLRDTNKQDEVEEVLREHGIDPDKKGQTIDASKFPGLRKALLSVLVRQVPNAGGIDGGISTRKQSPPEPSDE
ICCgaii200_07411852 | 2228664021 | Soil | VGLFGFLKKNDERAMPEPGTPEFEQAVTGSAIPDDQSVSMGEPGWTAPGSSQTLDLRGTGKREEIQQVLREHGIDLDQKGQTVDASQVPGLREALLKVLFRGVPNSDGIGGGVSAPKNPPPSEPGD
INPgaii200_05392661 | 2228664022 | Soil | VGFLRRKDERAIPKPGTPEFDQAVQGSAIPDSRSVSMGEPGWSDPSSAETIDLRGTGTADQVEEVLREHGIDPDKKGQTIDASKFPGLRKALFSVLVRQVPNADGIGGGVSTPKRQPPEPDR
INPhiseqgaiiFebDRAFT_1053045652 | 3300000364 | Soil | VGLFGFLKKNDERAMPEPGTPEFEATVAGSAIPDSQSVSMGEQGWTQPGGSQTLDLRGTGKREEIQGVLREHGIDPDQKGQTIDASKVPGLRTALIKAIFRGVPNSGGIGGGISTPKKAPEDSAD*
JGI11643J12802_100899561 | 3300000890 | Soil | VGFLRRKDERAIPEPGTPEFEQAVQGSAIHDSQSVSMGEPGWSEPFSSETIDLRDTDKQDEVEEVLREHGIDPDKKGQTIDASKFPGLRKALLSVLVRQVPNAGGIDGGISTRKQRPPEPSDE*
JGI10216J12902_1033727572 | 3300000956 | Soil | VGLFRKKDERAMPDPGSPEFEAMVEGSAIPDSQSVSMGEEGWTDPGSSQTLDLRGTGKRDQVKEVLREHGIDPEKKGQTIDASQVPGLREALLKALFRGVPNSGGIGRGIPKKRD*
JGI10216J12902_1068673233 | 3300000956 | Soil | MGLFRRKDESAIPESGTPEFEEAVQGTAIPDEQSVSMGEAGWTKPGSQTIDLRETGKRDEVREVLRDHGIDPDQKGQTIDASKVPGLRGALLRVLAGQVPNAEGIGGGIPTPKKKPDE*
F14TB_1033470461 | 3300001431 | Soil | PRIRRTIRGPDLDRHHRPRYSRAVGFLRRKDEQAIPAPGTPEFEEAVQGSAIHDSQSVSMGEPGWTEPSSSETIDLRGSDKQDEIEAVLREHGIDPDKKGQTIDASKFPGLRKALISVLVRQVPNADGIGGGISTRKQPHPEPSDE*
Ga0063454_1007080612 | 3300004081 | Soil | MPEPGTPEFEAAVAGTAMPDSQSVSMGESGWANPDASQTLDLSGTDKRDEVREVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP*
Ga0062593_1020886952 | 3300004114 | Soil | MGLFDFLRKNDRRAMPEPGTQEFEEMVAGTAIPDARSVSMGERGWTQPGSSQTLDLRGTGKREEIKEVLREHGIDPDQKGQTIDASQVPGLREALLKVLFQPRR*
Ga0062589_1009525162 | 3300004156 | Soil | MGLFDFLRKNDRRAMPEPGTQEFEEMVAGTAIPDARSVSMGERGWTQPGSSQTLDLRGTGKREEIKEVLREHGIDPDQKGQAIDASQVPGLREALLKVLF
Ga0062590_1001114761 | 3300004157 | Soil | VGFLRRKDERAIPEPGTPEFEEAVQGSAIHDSQSVSMGEPGWTEPFSSETIDLRGTDKQDEVEEVLREHGIDPDKKGQTIDASKFPGLRKALLSVLVRQVPNAGGIDGGISTRKQPPPEP
Ga0062590_1025906061 | 3300004157 | Soil | FGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGDGVSTPKKPPPQDPDE*
Ga0062595_1001244072 | 3300004479 | Soil | MGLFDFLRKNDRRAMPEPGTQEFEEMVAGTAIPDARSVSMGERGWTQPGSSQTLDLRGTGKREEIKEVLREHGIDPDQKGQTIDASQVPGLREALLKALFRPRR*
Ga0062595_1025952261 | 3300004479 | Soil | RYSRAVGFLRRKDERAIPKPGTPEFDQAVQGSAIPDSRSVSMGEPGWSDPSSAETIDLRGMGTADQVEEVLREHGIDPDKKGQTIDASKFPGLRKALFSVLVRQVPNADGIGGGVSTPKRQPPEPDRE*
Ga0062591_1004527331 | 3300004643 | Soil | VGFLRRKDERAIPEPGTPEFEEAVQGSAIHDSQSVSMGEPGWTEPFSSETIDLRGTDKQDEVEEVLREHGIDPDKNGQTIDASKFPGLRKALLSVLVRQVPNAGGIDGGISTRKQPPPEPSDE*
Ga0062594_1001691592 | 3300005093 | Soil | MGLFRRKDERAIPRPGTPEFDQAVQGSALPGSSSGRSVSMGESGWSEPGKAQVEASSETVDLRDSGKRDEVEEVLREHGIDPDQKGQTIDASEVPGLRGALLKVLFGRVPNTDGIGGGISTPKTPPPEDSDG*
Ga0062594_1004355481 | 3300005093 | Soil | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGIGDGVSTPKKPPPQDPDGELLDGLRPR
Ga0066388_1022351501 | 3300005332 | Tropical Forest Soil | MMGLFGFLRKKDERAMPEPGTPEFDAAVAASAIPDDQSVSMGEAGWTKPGSSATLDLRGTGKRDEVKEVLREHGIDPDQQGQTIDASQVPGLRQALLRVLFRGVPNSGGIGDGIPGPKKRGR*
Ga0066388_1045838271 | 3300005332 | Tropical Forest Soil | VIGLFRRKDERAIPKPGTPEFDAAVQGTAIPDSQSISMGEVGWASTEPGESQNVDLRGSGKRHEVEEVLREHGIDPDKKGQTVDADSVPGLKQDLLRVLFGGRMGRTGADGIAGGVSRRKQDPPDQA*
Ga0066388_1048913721 | 3300005332 | Tropical Forest Soil | MGLFDFLRKNDQRAMPEPGTPEFEEMVAGSAIPDALSVSMGEPGWTRPGWSQTLDLRGTGKREEIKEVLREHGIDPDQKGQTIDASQVPGLREALLKALFQPRR*
Ga0066388_1049060672 | 3300005332 | Tropical Forest Soil | MGLFGFRRKNDERAMPEPGTHEFEAMVEGSALPDSRNVSMGEPGWTSPGGSQTIDLGGSGKQDQVEEVLREHGIDPDKKGQTIDASKVPGLRAALLRALFRTVPNSDGIGGGISAPKKKQDE*
Ga0066388_1073192912 | 3300005332 | Tropical Forest Soil | LRKNDERAMPEPGTPEFERVVADTAIPDSQSVSMGEEGWAQPGSSQTLDLRGTGQSEEVKKVLREHGIDPDKQGQTVDASQVPGLREALLKALFRG*
Ga0070694_1007519911 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGV
Ga0068867_1019650561 | 3300005459 | Miscanthus Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE*
Ga0070696_1018577061 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | DRSPDWRYSRPVGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQALDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE*
Ga0058697_108053902 | 3300005562 | Agave | PGTPEFDAAVAGSAIPDAQSVSMGEPGWTQPGSSQTLDLRGTGKREEIKQVLREHGIDPDRTGQTIDASTVPGLRQSLIKALFGGVPNSGGIGEGISARKAPRPEPDGE*
Ga0066905_1010883561 | 3300005713 | Tropical Forest Soil | MMALFGFLKKNDERAMPEPGTPEFDAAVAGSAIPDAQSVSMGETGWTQPGSSDAFDLRGTGKQEEIQEVLREHGIDPDQRGQTIDASQVPGLRQALLRVLFKGVPNSGGIGGGISAPKRPPEESPDEPGE*
Ga0066905_1014704912 | 3300005713 | Tropical Forest Soil | MPEPGTTEFEQAVAGTAIPDDRSVSMGEAGWTRPGSSQTVDLSGTGKRDEIESVLREHGIDPEEKGQTIDASEVPGLREALLKALLRGVPNSGGIGKGIPKNRD*
Ga0081455_100068291 | 3300005937 | Tabebuia Heterophylla Rhizosphere | VGLFGFLKKNDERAMPEPGTPEFEEAVAGTAIPDSQSVAMGEDGWTQPGTSQTLDLRDTGKRDEVKEVLREHGIDPDRKGQTIDASKVPGLREALIKALFRGVPNSGGIGGGVSAPRKAPEDSAD*
Ga0081540_100147323 | 3300005983 | Tabebuia Heterophylla Rhizosphere | VGLFGFLRKNDQRAMPEPGTPEFDELVTGSAIPDAQSVSMGEEGWVPPGSSETLDLRGSGKRDEIKRVLREHGIDPDREGQTIDASEVPGLREALLKALFRPPR*
Ga0066652_1010555622 | 3300006046 | Soil | VGFFGFMKRKDERAIPEPGSPEFEAAVQGSALPDEQSVSMGESGWTNPERSETVDLRDTGKRHEVEEVLREHGIDPDKKGQTIDASKVPGLRSALLRVLAGQVPNTDGIAGGIPAPKKAPPKDPGD*
Ga0075364_111374131 | 3300006051 | Populus Endosphere | MGLFGFLRGKDDRAMPEPGTPEFEAAVSGSALPGSVDMGETGWTKPGEAGASQTVDLRGTGARENVEKALREHGIDPDQEGQTVDASKVPGLQEAILGALG
Ga0105245_102023062 | 3300009098 | Miscanthus Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQALDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE*
Ga0105249_128213301 | 3300009553 | Switchgrass Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVG
Ga0126307_1000149617 | 3300009789 | Serpentine Soil | RAVGLFGFLKKNDERAMPEPGTPEFEEAVAGTAIPDSQSVSMGEEGWTMPGPSQTVDLRGTGKREEIQEVLREHGIDPDRKGQTIDASQVPGLRAALLKALFRGVPNSGGIGEGIPKNRD
Ga0126307_100127836 | 3300009789 | Serpentine Soil | MGLFGFLRKDERAIPEPGTPEFDAMVQGSALPDAQSVSMGESGWTAPGRQTVDLRGTGKREEIKEVLREYGIDPEQQGQTIDASKVPGLREALLKALFGQVPNSGGFGDGIPKKRDQPPAGRFTVDIDR*
Ga0126307_100482915 | 3300009789 | Serpentine Soil | MGLFGFLRKDDERAMPEPGTPEFEAAVAGTAIPDAQSVSMGEAGWTQPGASQTLDLRGTGKREEIQQVLREHGIDPDQKGQTVDASTVPGLREALLKALFGGVPNSGGIGAGIPQPKSPPSDPAE*
Ga0126307_108097611 | 3300009789 | Serpentine Soil | MMGLFRRQDERAMPEPGTPEFEAMVAGSALPDSQSVSMGESGWTSPSRQTIDLRRTEARPEVERVLREHGIDPEKKGQVIDASKVPGLRQALLRVLAGRIPSDGGFGDGVSKK*
Ga0126313_100535483 | 3300009840 | Serpentine Soil | LFRRKDERAMPEPGSAEFEAMVAGSALSDSQSVSMGESGWKSTSRQSIDLRGTGARQDVERVLREHGIDPEKQGQVIDASEVPGLRQALLRVLAGRIPNAGGFGDGIPPKKRDA*
Ga0126313_102620192 | 3300009840 | Serpentine Soil | MGLFGFLRRKDEDAIPKPGTPEFEAAVEGSAIPDSQSVSMGEPGWADPSKQTIDLRGSGHRQEVEQVLREHGIDPDRKGQTIDASEVPGLREALLKVLFRVPNSGGIGKGIPSPKRDSE*
Ga0126313_108630902 | 3300009840 | Serpentine Soil | MGLFRRKDERAIPEPGSPEFEAAVQGSALPDSQSVSMGEEGWTSPGGTIDLRGTGKREEIAEVLREHAIDPEQKGQTIDASKVPGLREALLGVLFRGVPNSGGFGGGIPANKKAPPDE*
Ga0126305_103526001 | 3300010036 | Serpentine Soil | MGLFGFLRKDDERAMPEPGTPEFEAAVAGTAIPDAQSVSMGEAGWTQPGASQTLDLRGTGKREEIQQVLREHGIDPDQKGQTVDASTVPGLREALLKA
Ga0126304_107045782 | 3300010037 | Serpentine Soil | MGLFGFLRKDERAIPEPGTPEFDAMVEGSALPDAQSVSMGESGWTAPGRQTVDLRGTGKREEIKEVLREYGIDPEQQGQTIDASKVPGLREALLKALFGQVPNSGGFGDGIPKKRDQPPAGRFTVDIDR*
Ga0126315_103531912 | 3300010038 | Serpentine Soil | LFRRKDERAMPEPGSAEFEAMVAGSALSDSQSVSMGESGWKSTSRQSIDLRGTGARQDVERVLREHGIDPEKQGQVIDASEVPGLRQALLRVLA
Ga0126309_100391951 | 3300010039 | Serpentine Soil | TPEFEAMVEGSAIPDSQSVSMGEEGWTDPSRQTVDLRDSGKREEIQQVLREHGIDPEQKGQTIDASKVPGLREALINVLLGRVPNTDGIAGGISHERDR*
Ga0126309_101254752 | 3300010039 | Serpentine Soil | VGLFRRKDERAMPEPGTPEFEAMVEGSAIPDSQSVSMGEEGWTDPSRQTIDLRGSGKREEIQQVLREHGIDPEQKGQTIDASKVPGLREALIKVLVGRVPNTDGIAGGISRTNRED*
Ga0126309_102277802 | 3300010039 | Serpentine Soil | MGLFRRKDERAMPDPGTPEFDAMVQGSALPDSQSVSMGDAGWTQPGGSQTIDLRETGKRDQVKQVLREHGIDPDQKGQTIDASEVPGLRGALMKVLLGQVPNSDGIGGGISPPNQAGPTDPEE*
Ga0126308_101854182 | 3300010040 | Serpentine Soil | VGFLRRKDERAIPAPGTPEFEEAVQGSAIHDSQSVSMGEPGWTEPSSSETIDLRGTDKQDEIEQVLREHGIDPDKKGQTIDASKFPGLRKALLSVLVRQVPNADGIGGGISTRKQRPPEPSDE*
Ga0126308_102695832 | 3300010040 | Serpentine Soil | VGLFGFLKKNDERAMPEPGTPEFDAAVAGSAIPDSQSVSMGEPGWTQPGSSQTLDLRGTGKREEIKEVLWEHGIDPEQKGQTIDASQVPGLRAALLKALFRGVPNSGGIGEGIPKNRD*
Ga0126312_100020319 | 3300010041 | Serpentine Soil | MGLFGFLRRKDEDAIPKPGTPEFEAAVQGSAIPDSQSVSMGEPGWADPSKQTIDLRGSGHRQEVEQVLREHGIDPDRKGQTIDASEVPGLREALLKVLFRVPNSGGIGKGIPSPKRDPE*
Ga0126312_100829703 | 3300010041 | Serpentine Soil | VGLFRRKDERAMPEPGTPEFEAMVEGSAIPDSQSVSMGEEGWTDPSRQTIDLRGSGKREEIQQVLREHGIDPEQKGQTIDASKVPGLREALVKVLVGRVPNTDGIAGGISRTNRED*
Ga0126306_102621712 | 3300010166 | Serpentine Soil | MMGLFRRQDERAMPEPGTPEFEAMVAGSALPDSQSVSMGESGWTSPSRQTIDLRGTGARPEVERVLREHGIDPEKKGQVIDASKVPGLRQALLRVLAGRIPSDGGFGEGVSKK*
Ga0126306_102772912 | 3300010166 | Serpentine Soil | VGLFGFLKKNDERAMPEPGTPEFEEAVAGTAIPDSQSVSMGEEGWTMPGPSQTVDLRGTGKREEIQEVLREHGIDPDRKGQTIDASQVPGLRAALLKALFRGVPNSGGIGEGIPKNRD*
Ga0134065_102440671 | 3300010326 | Grasslands Soil | VGFFGFMKRKDERAIPEPGSPEFEAAVQGSALPDEQSVSMGESGWTNPERSETVDLRDTGKRHEVEEVLREHGIDPDKKGQTIDASKVPGLRSALLRVLAGQV
Ga0126377_119593842 | 3300010362 | Tropical Forest Soil | GFLRKKDERAMPEPGTPEFDAAVQASAIPDDQSVSMGEAGWTEPGSSERLDLRGTAKRDEVKEVLREHGIDPDRKGQTVDASQVPGLRQALLRVLFRGVPNSGGIGEGIPEKKRD*
Ga0134125_112785361 | 3300010371 | Terrestrial Soil | SRPVGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE*
Ga0134128_104795663 | 3300010373 | Terrestrial Soil | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGIGDGVSTPKKPPPQDPDG*
Ga0105239_104574413 | 3300010375 | Corn Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSPPKNPPPQDPDE*
Ga0134126_103333823 | 3300010396 | Terrestrial Soil | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLQGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKP
Ga0134122_105985141 | 3300010400 | Terrestrial Soil | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLQGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE*
Ga0120192_100577062 | 3300012021 | Terrestrial | MGLFGFLRRKDEKAIPNPGTPEFEAMVEGSALPDSQSVSMGEEGWSRPDAKQTIDLRGTGAREEVEQVLREHGIDPEKKGQTIDASKVPGLREALLRVLARGVPNSGGFGRGIPKKKGE*
Ga0164300_100710891 | 3300012951 | Soil | MGIFDFLRRKHERAIPEPGSPEFEAAVQGSALPDSRSVSMGEEGWTKPGETVELSETGKNEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIPAKRKSSSDG*
Ga0164299_101003152 | 3300012958 | Soil | VRIFGFLRRKDERAIPEPGSPEFEAAVHGSALPDSRSVSMGEEGWTKPGETVELSETGKDEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIPAKRKSSSDG*
Ga0164301_102379352 | 3300012960 | Soil | MGIFDFLRRKHERAIPEPGSPEFEAAVQGSALPDSRSVSMGEEGWTKPGETVELSETGTNEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIPAKRKSSSDG*
Ga0164302_100114134 | 3300012961 | Soil | VRIFGFLRRKDERAIPEPGSPEFEAAVQGSALPDSRSVSMGEEGWTKPGETVELSETGKNEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIPAKRKSSSDG*
Ga0164308_101488622 | 3300012985 | Soil | VRIFGFLRRKHEQAIPEPGSPEFEAAVQGSALPDSRSVSMGEEGWTKPGETVELSETGKNEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIAAKRKSPSDE*
Ga0164305_108674232 | 3300012989 | Soil | AIPEPGSPEFEAAVQGSALPDSRSVSMGEEGWTKPGETVELSETGKNEEIQQVLREHGIDPDRKGQTIDASKVPGLREALIKVLVGGVPNSGGIGSGIPAKRKSSSDG*
Ga0157378_124016541 | 3300013297 | Miscanthus Rhizosphere | MGLFRRKDERAIPRPGTPEFDQAVQGSALPGSSSGRSVSMGESGWSEPGKAQVEASSETGDLRDSGKRDEVEEVLREHGIDPDQKGQTIDASEVPGLRGALLKVLFGRVPNTDGIGGGISTPKTPPPEDSDG*
Ga0157372_120754101 | 3300013307 | Corn Rhizosphere | MGLFRRKDERAIPRPGTPEFDQAVQGSALPGSSSGRSVSMGESGWSEPGKAQVEASSETVDLRDSGKRDEVEEVLREHGIDPDQKGQTIDASE
Ga0132258_100278467 | 3300015371 | Arabidopsis Rhizosphere | VGLFGFLRKNDQRAMPEPGTPEFDAMVAGSAVPDSQSVSMGEEGWTQPGSSQSLDLRGTGKREEIQQVLREHGIDPDQKGQTIDASQVPGLREALLKALFRVPNSGGIAKGIPEKRDPDAS*
Ga0132258_107238932 | 3300015371 | Arabidopsis Rhizosphere | VGLFRRKDERAIPDPGTPEYEAMVEGSALPDSRSVSMGEPGWTAPDAEKLGVPDTGKREQIEEVLREHGIDPEKQGQVIDASKVPGLRGALLRVLTGRVPNTDGIAGGVSSKSPD*
Ga0132258_109685042 | 3300015371 | Arabidopsis Rhizosphere | VGLFDFLRKKDEQAMPEPGSPEFEALVQSSALPDSQSVSMGEEGWIAPGSSQTLDLRGTDQSEQIKEVLRAHGIDPDQQGQTIDASQVPGLREALLKALFMPKQD*
Ga0132258_113560001 | 3300015371 | Arabidopsis Rhizosphere | AVALFRRKDERAMPDPGSREFEEAVEGSALPESEAGRTVAMGETGWTEPGSARTIDLRDSGKREEINQVLREHGIDPDEKGQTIDASTVPGLRGALLNVLFGRIPNSDGIGGGISAPRKRDD*
Ga0132255_1001138451 | 3300015374 | Arabidopsis Rhizosphere | VGLFGFLRKNDQRAMPEPGTPEFDAMVAGSAVPDSQSVSMGEEGWTQPGSSQSLDLRGTGKREEIQQVLREHGIDPDQKGQTIDASQVPGLREALLKALLRVPNSGGIAKGIP
Ga0187788_101750842 | 3300018032 | Tropical Peatland | GRKDERAIPEPGSPEFEAAVQGSALPSSQGGRSVSMGESGWTNPNQVLDLRGTGAREEVVKALREHGVDPDKKGQTINASTIPGLREDILRALGQAGLRIPNAGGITDAEFEARKRKLLG
Ga0184624_101259002 | 3300018073 | Groundwater Sediment | VGLFGFLRGNDERAMPEPGTPEFDAAVDGTAIPDSQSVSMGEPGWTAPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASQVPGLRQALINALFRGVPNSGGIGGGIPSPKKAPPSDSDD
Ga0190270_100360812 | 3300018469 | Soil | VGFLRRKDERAIPAPGTPEFEEAVQGSAIHDSQSVSMGEPGWTEPSSSETIDLRGTDKQDEIEEVLREHGIDPDKKGQTIDASKFPGLRKALLSVLVRQVPNADGIGGGISTRKQPPPEPSDE
Ga0173479_105459081 | 3300019362 | Soil | VGLFGFLRKKDERAIPEPGTPEFEAAVQGSALPDSTSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGG
Ga0190267_100512891 | 3300019767 | Soil | VGLLDFLRWNDERAMPEPGTPEFERAVEGTAIPDAQSVSMGEPGWTQAGSSQTLDLRGTGKRDQIQQVLREHGIDPDEKGQTIDASQVPGLRSALLKALFGGVPNSGGIGGGISAPKQPPPKPDDR
Ga0222622_107022972 | 3300022756 | Groundwater Sediment | VGLFGFLRKSDERAMPEPGTPEFDSAVEGTAIRDAQSVSMGEPGWTKPGSPQTVDLRGSGKREELQEVLRQHGIDPDQKGQTIDASTVPGLRKALLGVLMGQVPNSGGIDAGISTPKQPPPKPDAE
Ga0207687_100888012 | 3300025927 | Miscanthus Rhizosphere | MGLFRRKDERAIPRPGTPEFDQAVQGSALPGSSSGRSVSMGESGWSEPGKAQVEASSETVDLRDSGKRDEVEEVLREHGIDPDQKGQTIDASEVPGLRGALLKVLFGRVPNTDGIGGGISTPKTPPPEDSDG
Ga0207686_109539272 | 3300025934 | Miscanthus Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPK
Ga0207709_113920771 | 3300025935 | Miscanthus Rhizosphere | PGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE
Ga0207648_117632641 | 3300026089 | Miscanthus Rhizosphere | VGLFGFLRRKDERAIPEPGTPEFEAAVQGSALPDSSSGQSVSMGEPGWTEPGSSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLRAAILRVLVGGVPNSGGFGGGVSTPKKPPPQDPDE
Ga0307291_10845502 | 3300028707 | Soil | MPEPGTSEFEAAVAGTAMPDSQSVSMGESGWTNPDASQTLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGI
Ga0307295_101254691 | 3300028708 | Soil | MPEPGTPEFEAAVAGTAMPDSQSVSMGESGWTNPDASQTLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP
Ga0307315_101892131 | 3300028721 | Soil | RAMPEPGTPEFEAAVAGTALPDSQSVSMGESGWTKPDASQTLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP
Ga0307316_101784262 | 3300028755 | Soil | MPEPGTPEFEAAVAGTALPDSQSVSMGESGWTKPDASQTLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP
Ga0307282_101989441 | 3300028784 | Soil | MPEPGTSEFEATVAGTAIPDSESVSMGESGWTNPDASQKLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP
Ga0247825_105908912 | 3300028812 | Soil | VERYSRFVAFFGFLRGKDERAMPEPGTPEFDAAVDGTAIADSQSVSMGEPGWTQPGSSQTVDLRGTGKREEIQEVLREHGIDPDQKGQTIDASQVPGLRQALIKALFRGVPNSGGISGGVSAPKNPPPSEPGD
Ga0307289_103213951 | 3300028875 | Soil | AVGLFGFLRTNDERAMPEPGTPEFEAAVAGTAMPDSQSVSMGESGWTNPDASQTLDLSGTGKRDEVKEVLREHGIDPDEKGQTIDASRVPGLREALLGALFGGVPNSGGIGEGIPKRKP
Ga0307277_102868362 | 3300028881 | Soil | MGLFGFLRKNDERAMPEPGTPEFEAAIEGSAIRDSQSVSMGEPGWTEPGSSQTVDLRGSGKREEIEDVLREHGIDPDQKGQTVDASKVPGLRAAILRVLAGSVPNSGGVDGGI
Ga0247827_107115081 | 3300028889 | Soil | NDERAMPEPGTPEFEAVVAGTAIPDSQSVSMGEAGWTEPGSSQTVDLRDTGKREEIQEVLREHGIDPDQKGQTIDASQVPGLRQALIKALFRGVPNSGGIGGGVSAPKNPPASEPGD
Ga0299907_106093581 | 3300030006 | Soil | MGLFGFLRRGDEDAIPKPGTPEFEAAVRGSAIPDSQSVSMGETGWTSTSASNTQIERPSQTIDLRGTGQREEVERVLREHGIDPERKGETIDASSVPG
Ga0307468_1002139702 | 3300031740 | Hardwood Forest Soil | MPEPGTPEFEAVVAGTAIPDSQSVSMGEPGWTEPGSSQTVDLRGTGKREEIQEVLREHGVDPDQKGQTIDASQVPGLRQALIKALFGGVPNAGGIAGGIPSPKKAPPSDSDD
Ga0268251_101651062 | 3300032159 | Agave | PEFEAMVQGSAIPDSQSVSMGEPGWTQPGDSQTLDLRGTGKREEIQEVLREHGIDPDQKGQTIDASKVPGLREALIKALFRGVPNSGGISGGVSAPKKSEE
Ga0268251_102217942 | 3300032159 | Agave | PGTPEFDAAVAGSAIPDAQSVSMGEPGWTQPGSSQTLDLRGTGKREEIKQVLREHGIDPDRTGQTIDASTVPGLRQSLIKALFGGVPNSGGIGEGISARKAPRPEPDGE
Ga0307472_1023406222 | 3300032205 | Hardwood Forest Soil | VGLFGFLRKRDERAMPEPAPPVFDCAVDGTAIRDAQSVSMGEPGWAKPGSSQTVDLRGSGKREEIQEVLRQHGIDPDQKGQTIDASTVPGLRKALLGVLMGQVPNA
Ga0247829_100815653 | 3300033550 | Soil | VERYSRFVAFFGFLRGKDERAMPEPGTPEFDAAVDGTAIADSQSVSMGEPGWTQPGSSQTVDLRGTGKREEIQEVLREHGIDPDQKGQTIDASQVPGLRQALMKALFRGVPNSGGIGDGIPSPKKAPPADSDD
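
Many sequences in the table above end in `*`. Assuming that character marks a translation stop rather than a residue (a common convention, but not stated on this page), residue lengths could be computed as sketched below. The helper names are hypothetical, and the example uses toy sequences rather than the family members themselves.

```python
def residue_length(seq):
    """Length of a protein sequence, ignoring trailing '*' characters.

    Assumption: '*' marks a translation stop and is not a residue.
    """
    return len(seq.rstrip("*"))


def mean_length(seqs):
    """Arithmetic mean of the residue lengths of a non-empty list of sequences."""
    lengths = [residue_length(s) for s in seqs]
    return sum(lengths) / len(lengths)


if __name__ == "__main__":
    # Toy sequences for illustration only; not taken from the family.
    toy = ["MPEP*", "VGLFGF"]
    print([residue_length(s) for s in toy], mean_length(toy))
```

Whether NMPFamsDB computes its reported "Average Sequence Length" this way is not stated on the page.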




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice