NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105738

Metagenome / Metatranscriptome Family F105738

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105738
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 195 residues
Representative Sequence VNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVAMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Number of Associated Samples 94
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(19.000 % of family members)
Environment Ontology (ENVO) Unclassified
(28.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 9.85%    β-sheet: 45.81%    Coil/Unstructured: 44.33%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF03401TctC 3.00
PF01594AI-2E_transport 3.00
PF05977MFS_3 3.00
PF00857Isochorismatase 2.00
PF03741TerC 2.00
PF09084NMT1 1.00
PF01381HTH_3 1.00
PF12706Lactamase_B_2 1.00
PF01979Amidohydro_1 1.00
PF02452PemK_toxin 1.00
PF09656PGPGW 1.00
PF14338Mrr_N 1.00
PF01196Ribosomal_L17 1.00
PF01717Meth_synt_2 1.00
PF05494MlaC 1.00
PF05768Glrx-like 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0628Predicted PurR-regulated permease PerMGeneral function prediction only [R] 3.00
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 3.00
COG3181Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctCEnergy production and conversion [C] 3.00
COG0861Tellurite resistance membrane protein TerCInorganic ion transport and metabolism [P] 2.00
COG1335Nicotinamidase-related amidaseCoenzyme transport and metabolism [H] 2.00
COG1535Isochorismate hydrolaseSecondary metabolites biosynthesis, transport and catabolism [Q] 2.00
COG0203Ribosomal protein L17Translation, ribosomal structure and biogenesis [J] 1.00
COG0620Methionine synthase II (cobalamin-independent)Amino acid transport and metabolism [E] 1.00
COG0695GlutaredoxinPosttranslational modification, protein turnover, chaperones [O] 1.00
COG0715ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic componentInorganic ion transport and metabolism [P] 1.00
COG2337mRNA-degrading endonuclease MazF, toxin component of the MazEF toxin-antitoxin moduleDefense mechanisms [V] 1.00
COG2854Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 1.00
COG3118Chaperedoxin CnoX, contains thioredoxin-like and TPR-like domains, YbbN/TrxSC familyPosttranslational modification, protein turnover, chaperones [O] 1.00
COG4521ABC-type taurine transport system, periplasmic componentInorganic ion transport and metabolism [P] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300009147|Ga0114129_10000332All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria55123Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil19.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere18.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil13.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil12.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.00%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment5.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere3.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere2.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2162886013Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000953Soil microbial communities from Great Prairies - Kansas Corn soilEnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010102Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010103Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010109Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010119Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010130Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010141Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012379Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012393Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012397Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012402Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012406Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300019233Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019255Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m2EnvironmentalOpen in IMG/M
3300020010Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1s2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300030830Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_368 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030903Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031092Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
SwBSRL2_0009.000076902162886013Switchgrass RhizosphereMERLFARAVALLTLGTLWIGVPGCMENIALIGRPTIEEGQDDLVGAVERVDLSTRRLYLRPNRSDRRVVALSADAQVLDRGREYPVARLKPGDIVAMQIKRDSRGDPYADLLRIQQNSSSQSWRDAPGAAPRIETLAGRVESVNRRDDSFELDDRAGAPVSVRLSEYVRESDRERFRTLRAGVRVRIEGQFTTRDRFEMLSFLNDEDS
JGI11643J11755_1164063113300000787SoilEVERVDIAARRLYLRANKSARRVVALSADAQVFDRGREYPITRLKPGDVVATQIKRDSRGEPYADLLRIQENAAXQSSRGVPGAXPRIETLAGTVESVNRRDNSFELDDRSGPPVVVRLSEYIRDSDRERFRTLRAGSRVRIEGKFTARDRFEMLSFLNTDDSY*
JGI11615J12901_1090904813300000953SoilRSDLVGKVDRVDLASRRLYLRPQGSDRRVVGFSADAQVLDRGREYPMARLKAGDVVAMQMKRDARGEPYADLIRIQQPAGAQSRGEVPGSAPRIQTLAGTVQSVNRGDNSFALDDRPGRLVSVLLSDYVRDSDRGRFRDLRPGDHVRIEGKFTDGDRFELLSFLNDDEEY*
Ga0063356_10080500913300004463Arabidopsis Thaliana RhizosphereEVERVDLSGRRIYLRSDKSDRRSVALSADAQVFDRGREYPVARLKPGDVVAMQIKRDSRGEPYADLLRIQENTAGQSLRDVPGAAPRIETLAGRVESVNRRDNSFELDDRSGPAVVVRLSEYVRDSDRERFRTLRAGARVRIEGKFTARDRFEMLSFLNTEDFY*
Ga0062592_10025361023300004480SoilMRPVKLRSRRARLFTNSRRAQLIAQAAALVALGTLWIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDIAARRLYLRANKSARRVVALSADAQVFDRGREYPITRLKPGDVVATQIKRDSRGEPYADLLRIQENAAGQSSRGVPGAAPRIETLAGTVESVNRRDNSFELDDRSGPPVVVRLSEYIRDSDRERFRTLRAGSRVRIEGKFTARDRFEMLSFLNTDDSY*
Ga0066685_1037452623300005180SoilMLLISVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0066676_1007833713300005186SoilVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0065707_1000591483300005295Switchgrass RhizosphereMERLFARAVALLTLGTLWIGVPGCMENIALIGRPTIEEGQDDLVGAVERVDLSTRRLYLRPNRSDRRVVALSADAQVLDRGREYPVARLKPGDIVAMQIKRDSRGDPYADLLRIQQNSSSQNWRDAPGAAPRIETLAGRVESVNRRDDSFELDDRAGAPVSVRLSEYVRESDRERFRTLRAGVRVRIEGQFTTRDRFEMLSFLNDEDS*
Ga0066686_1008676533300005446SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0066697_1026234333300005540SoilDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPLLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSAPRIETLAGRVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0066695_1056612023300005553SoilMLLISIPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRD
Ga0066692_1050455423300005555SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPDLVRLSEYVRESDRDRFRTLR
Ga0066704_1015579233300005557SoilMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVLSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEILSFLNDDS*
Ga0066698_1014630243300005558SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRQIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDVVAMQVKRNLRGEWYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRRDNSFELDDQSGPLVSVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0066651_1036332613300006031SoilIPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIEALAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0066652_10037067313300006046SoilMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0075417_1005368843300006049Populus RhizosphereVKLVDRAAALLALGMILTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS
Ga0070716_10067923523300006173Corn, Switchgrass And Miscanthus RhizosphereMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0066665_1049226033300006796SoilMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0075430_10141830913300006846Populus RhizosphereALGTLWIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDIAARRLYLRANKSARRVVALSADAQVFDRGREYPITRLKPGDVVATQIKRDSRGEPYADLLRIQENAPSQSSRGVPGAVPRIETLAGTVESVNRRDNSFELDDRSRPPVVVRLSEYIRDSDRERFRTLRAGARVRIEGKFTARDRFEMLS
Ga0075433_1021911423300006852Populus RhizosphereVKLVDRAAALLALGMLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRGDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS*
Ga0075425_10020574433300006854Populus RhizosphereVKIIERATSLLALGMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYSLGRLKPGDVVAMQVKRDSRGDLYADLIRIQESPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0075434_10242794213300006871Populus RhizosphereRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDD
Ga0075426_1016758123300006903Populus RhizosphereMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDLYADLIRIQESPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFVNDDS*
Ga0075426_1047244923300006903Populus RhizosphereVVGEVDRVDLSSRRLYLRPNSSDRRVVAFSADAQVLYRGNEYPMARLKPGDVVAMQMKRDARGDSYADLIRIQENAGSRIKEEVVSSASRIQTLAGRVQSVNRRDNSFELDNQPGQFVSVLLSENVRDSDKDRFRTLQAGDHVRIEGKFTERDRFELLSFLNDDSY*
Ga0075424_10162568813300006904Populus RhizosphereVKLVDRAAALLALGMILTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRGDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLN
Ga0075436_10007268723300006914Populus RhizosphereMEPVRSAVLVLALGMFLTGIPGCMENIALIGRPTIEEGQSDVVGEVDRVDLSSRRLYLRPNSSDRRVVAFSADAQVLYRGNEYPMARLKPGDVVAMQMKRDARGDSYADLIRIQENAGSRIKEEVVSSASRIQTLAGRVQSVNRRDNSFELDNQPGQFVSVLLSENVRDSDKDRFRTLQAGDHVRIEGKFTERDRFELLSFLNDDSY*
Ga0075419_1087979313300006969Populus RhizosphereCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS*
Ga0066710_10006532733300009012Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0111539_1217395713300009094Populus RhizosphereDIVGEVDRVDLSSRRIYLRPNSGDRRVVAFSADAQVLSRGREYPMARLKPGDLVAMQMKRDSRGDSYADLIRIQEIAGSRNEGDVVSSGPRIQTLAGQVQSVNRRDNSFELDNRPGQLVSVLLSQNVRESDKDRFRTLRAADHVRIEGKFTERDRFEL*
Ga0075418_1003336813300009100Populus RhizosphereVKLVDRAAALLALGMLLTGIPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKGDRRVVAFSTDAQVLYGGREYPVARIKSGDVVAMQIKRDPRGNLYTDLIRLQENPASQSRRDVPSSAPRIETLAGVVESVNRRDNSFELDDESGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0066709_10044586013300009137Grasslands SoilVKIIERATSLLALGMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLARLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRESDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0066709_10104760713300009137Grasslands SoilRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPTSQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0114129_1000033223300009147Populus RhizosphereMVRAASLVALLMVLIGVPGCMENIALIGRPTIEEGQDDVVGEVERVELSARRIYLRPNKSDRSVVAFSTDAQVLYRGREYPVARLKPGDVVAMQVKRNSRGDSYADLIRIQENSTVPSSASRIETLTGRVESVNRRDNSFELDDQSGPLVSVLLSEYVRDSDRERFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0114129_1005599743300009147Populus RhizosphereVKLVDRAAALLALGMLLTGIPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKGDRRVVAFSTTAQVLYGGREYPVARIKSGDVVAMQIKRDPRGNLYTDLIRLQENTASRSRREVPSSAPRIETLAGVVESVNRRDNSFELDEESGPPVLVRLSEYVRESDRDRFRSLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0114129_1067379123300009147Populus RhizosphereVKLVDRAAALLALGMILTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMRVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS*
Ga0105092_1008915433300009157Freshwater SedimentLRRARLFTSSRRAQLIARAAALLALGTLWIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYFRSDKSDRRSVALSADAQVLERGREYPITRLKPGDVVAMQIKRDSRGEPYADLLRIQENSARQSLRGVPGAAPRIETLAGRVESVNRRDNSFELDDRSGPPVVVRLSEYVRDSDRELFRALRAGAQVRIEGKFTARDRFEMLSFLNTEDFY*
Ga0075423_1001199173300009162Populus RhizosphereVKLVDRAAALLALGMLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS*
Ga0127453_108879923300010102Grasslands SoilVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPTSQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLQAGAHVRIEGKF
Ga0127500_106102213300010103Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGK
Ga0127497_104753623300010109Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRVYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDR
Ga0127452_116047123300010119Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAQVLYGGREYPVARLKPGDVVAMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0127493_104206223300010130Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKF
Ga0127499_102515613300010141Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYRVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGVRVRIEGKFTARDRFEMLSFLND
Ga0134088_1013443533300010304Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPLVSVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFL
Ga0134127_1067786923300010399Terrestrial SoilMAKLVAQGAALVMLGTLWIGVPGCMENIALIGRPTIEEGRDDFVGEVDRVDVSGRRLYFRPNRDARRVVALSADARVLDRGREYPVAELRPGDVVAMQIRRDARGDPYADLIRIQESQTRRDVPAPRIETLAGRVGIINRRDNSFELDDRAGAPVSVLLSEYVRDSDRDRFRSLRAGEHVRIEGKFITRDRFELLSFLNNEDS*
Ga0137382_1000693843300012200Vadose Zone SoilVKIIERATSLLALGMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0137399_1020306023300012203Vadose Zone SoilMRRAVLVLALGMLLTGIPGCMENIALIGRPTIEEGRNDVVGEVDRIDLSSRRIYLLPNSGDRRVVVFSTDAQVLDRGREYPIGRLKPGDVVAMQMKRDSRGDSYADLIRIQEVAGSRNEGDVVSSGPRIQTLAGRVQSVNRRDNSFELDNGPGQLVSVLLSQNARESDKDRFRTLRAGDHVRIEGKFTDRDRFELLSFLNDDSY*
Ga0137374_1003725553300012204Vadose Zone SoilVKLVDRAAALLALGMLLIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDVVAMQVRRDSRGDSYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRRDNSFELDDQSGPPVSVRLSEYVRESDRDRFRTLRAGAHVRVEGKFTARDRFEMLSFLNDDS*
Ga0137362_1009144023300012205Vadose Zone SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSGRQLYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGEVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0137379_1041211433300012209Vadose Zone SoilVKLVDRAAALLALGMLLIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVALSADAPVLYRGREYPVARLEPGDVVAMQVRRDSRGDSYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0137387_1010772413300012349Vadose Zone SoilVKIIERATSLLALGMLLIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQLMALKC*
Ga0137369_1010796733300012355Vadose Zone SoilMLWIGVPGCMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVALSADAQVLDRGREYPVARLKPGDLVAMQVKRDSRGESYADLIRIQENSASQSRRDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVLVRLSDYIRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0137384_1039389013300012357Vadose Zone SoilVKIIERATSLLALGMLLISIPGCMENIALIGRPTIEEGKDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDVVAMQVRRDSRGDSYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESVNRRDNSFELDDQSGPPVSVRLSEYVRESDRDRFRTLRAGAHVRVEGKFTARDRFEMLSFLNDDS*
Ga0134058_114049713300012379Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPTSQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFT
Ga0134052_105711523300012393Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLND
Ga0134056_129799423300012397Grasslands SoilPTIEEGQDDVVGEVERVDLASRQIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0134059_124970613300012402Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRVYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0134053_139852423300012406Grasslands SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLEPGDVVAMQVKRNLRGEWYADLIRLQENPASQSRRDVPSSVPRTETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLND
Ga0150984_11926930123300012469Avena Fatua RhizosphereVKLIRRAAALLTLGTLVVGVSGCMENIALIGRPTIEEGQSDVVGEVERVDLSERRIYLRPNESDRRVVALSTDAQALYRGREYPVTRLKPGDVVAMQVKRDPYGESYADLIRIQENPASQSRGDVPSSAPRIETLAGRVESVDRRDNSFELDDQSGRPVSVLLSEYVRDSDKNRFRTLRAGDNVRIEGKFTARDRFEMLSFLNDDS*
Ga0137397_1006387853300012685Vadose Zone SoilVKLIRRAAALLTLGTLVVGVSCMENIALIGRPTIEEGQSDVVGEVERVDLSERRIYLRPNESDRRVVALSTDAQVLYRGREYPVTRSKPGDVVAMQIKRDPRGESYADLIRIQENPASHSRGDVPSSAPRIETLAGRVESVDRRDNSFELDQSGRPVSVLLSEYVRDSDKHRFRTLRAGDNVRIEGKFTARDRFEMLSFLNDDS*
Ga0137397_1024290623300012685Vadose Zone SoilMGIPGCMENIALIGRPTIEEGRNDVVGEVDRIDLSSRRIYLLPNSGDRRVVVFSTDAQVLDRGREYPIGRLKPGDVVAMQMKRDSRGDSYADLIRIQEVAGSRNEGDVVSSGPRIQTLAGRVQSVNRRDNSFELDNGPGQLVSVLLSQNARESDKDRFRTLRAGDHVRIEGKFTDRDRFELLSFLNDDSY*
Ga0137396_1046633423300012918Vadose Zone SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQR
Ga0137394_1012664623300012922Vadose Zone SoilVKLIRRAAALLTLGTLVVGVSGCMENIALIGRPTIEEGQSDVVGEVERVDLSERRIYLRPNESDRRVVALSTDAQVLYRGREYPVTRSKPGDVVAMQIKRDPRGESYADLIRIQENPASQSRGDVPSSAPRIETLAGRVESVDRRDNSFELDQSGRPVSVLLSEYVRDSDKNRFRTLRAGDNVRIEGKFTARDRFEMLSFLNDDS*
Ga0137359_1031306413300012923Vadose Zone SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRTETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDR
Ga0137413_1024098413300012924Vadose Zone SoilIEEGRNDVVGEVDRIDLSSRRIYLLPNSGDRRVVVFSTDAQVLDRGREYPIGRLKPGDVVAMQMKRDSRGDSYADLIRIQEVAGSRNEGDVVSSGPRIQTLAGRVQSVNRRDNSFELDNGPGQLVSVLLSQNARESDKDRFRTLRAGDPVRIEGKFTERDRFELLSFLNDDSY*
Ga0137416_1005985733300012927Vadose Zone SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGEVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0137404_1017168743300012929Vadose Zone SoilVKLVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVAMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGSPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0137412_1018073313300015242Vadose Zone SoilIALIGRPTIEEGRNDVVGEVDRIDLSSRRIYLLPNSGDRRVVVFSTDAQVLDRGREYPIGRLKPGDVVAMQMKRDSRGDSYADLIRIQEVAGSRNEGDVVSSGPRIQTLAGRVQSVNRRDNSFELDNGPGQLVSVLLSQNARESDKDRFRTLRAGDPVRIEGKFTERDRFELLSFLNDDSY*
Ga0137403_1023130923300015264Vadose Zone SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGEVVAMQVKRDSRGDSYADLIRIQASPASQRRGDVSSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0137403_1024409023300015264Vadose Zone SoilVNRAAALLALGTLLTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVAMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS*
Ga0132258_1204911713300015371Arabidopsis RhizosphereVKIIERATSLLSLGMLLISIPGCMENIALIGRPTIEEGQNDVVGEVERVDVSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS*
Ga0132256_10280917913300015372Arabidopsis RhizosphereLQEPALIGRPTIEEGQNDVVGEVDRVDLSSRRIYLRPNSGDRRVVAFSADAQVLSRGREYPMARLKPGDLVAMQMKRDSHGDSYADLIRIQEIAGSRNEGDVVSSGPRIQTLAGQVQSVNRRVNSFELDNRPGLVSVLLSQNVRESDKDRFRTLR
Ga0132257_10079141123300015373Arabidopsis RhizosphereLQEPALIGRPTIEEGQNDVVGEVDREDLSSRRIYLCPNSGDRRVVAFSADAQVLSRGREYPMARLKPGDLVAMQMKRDSRGDSYADLIRIQEIAGSRNEGDIVSSGPQIQTLAGQVQSVNRRVNSFELDNRPGLVSVLLSQNVRESDKDRFRTLRAGNHVRIEGKFTERDRFELLSFLNDDS*
Ga0132255_10295005723300015374Arabidopsis RhizosphereLQEPALIGRPTIEEGQNDVVGEFDRVDLSSRRIYLRPNSGDRRVVAFSADAQVLSRGREYPMARLKPGDLVAMQMKRDSHGDSYADLIRIQEIAGSRNEGDVVSSGPRIQTLAGQVQSVNRRVNSFELDNRPGLVSVLLSQNVRESDKDRFRTLQAGDHVRIEGKFTKRYRFELLSFLNDDS*
Ga0184626_1001306933300018053Groundwater SedimentMRLVNLRSRRARSSTSSRRAQLIARAAALLALGTLWIGVPGCMENIALIGRPSIAEGWDDVVGEVERVDLSARRLYLHPNKSDRRVVTLSADAQVLDRGREYPVARLKAGDVVAMQVKRDSRGESYVDLIRIQENSASQRRGDVPSSAPRIERLAGTVESINRRDNSFELDDQSGPPVSVLLSEYVRDSDRDRFRTLRAGARVRIEGKFTARDRFEMLSFLNDDS
Ga0184623_1033428113300018056Groundwater SedimentLLIGVPGCMENIALIGRPTIAEGQDDVVGEVERVDLSMRRIYLRPNKSDRRVVPFSTDAQVLYRGREYPVTRLEPGDVVAMQVKRDSRGDSYADLIRVQEDPRSQSRGDLPSSAPRIQTLAGRVESVNRRDNSFELDDRSGPSVSVLLSEYARDSDRDRFRALRAGAHVRIEGKFTARDRFEMLSFLNDDPS
Ga0184619_1019060213300018061Groundwater SedimentMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVALSADAQVLDRGREYPVAQLKPGDVVAMQVKRDSRGESYADLIRIRENSATQSRRDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSAYVRDSDRDRFRTLRAGAH
Ga0184632_1002143153300018075Groundwater SedimentMRLAKRRSKRARFSTSRVVKFIGRAAALLALGTLWIGVPGCMENIALIGRPSIAEDHDDVVGEVERVDLSGRRIFLRPNKSDRRVVALSADAQVLDRGREYPVARLKPGDVVAMQLKRDSRGEPYADLLRIQENSASQRRGDIPSAAPRIETLAGTVASINRRDNSFELDDRSGPPVSVLLSEYVRDSDRDRFRNLRAGARVRIEGKFTARDRFEMLSFLNDDS
Ga0184609_1006579543300018076Groundwater SedimentMRLVKRRSRKARFFTSSRRAKLIARAASLLALGMLWIGVPGCMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVAFSADAQVLDRGREYPVARLKPGVVVAMQVKRDSRGESYADLIRIQENSASQSRRDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTSRDRFELLSFLNDDS
Ga0184627_1004853013300018079Groundwater SedimentMDRAASLLALGMLLIGVPGCMENIALIGRPTIAEGQDDVVGEVERVDLSMRRIYLRPNKSDRRVVPFSTDAQVLYRGREYPVTRLEPGDVVAMQVKRDSRGDSYADLIRVQEDPRSQSRGDLPSSAPRIQTLAGRVESVNRRDNSFELDDRSGPSVSVLLSEYARDSDRDRFRALRAGAHVRIEGKFTARDRFEMLSFLNDDPS
Ga0184645_106469223300019233Groundwater SedimentMENIALIGRASIAEGQDDVVGEVERVDLSARRLYLRPNKGDRRVVALSADAQVLDRGREYPVARLKPGDVVAMQVKRDSRGESYADLIRIQEDSASQRRGDVPSAAARIERLAGTVESINRRDNSFELDDRSGPVLVLLSEYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0184643_138322523300019255Groundwater SedimentMKSTRIAKWKNWKPTIGTNEYLVGNMRLVKRRSRKARFSTSSRQAKLIARAGALLALGMLWIGVPGCMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVALSADAQVLDRGREYPVAQLKPGDVVAMQVKRDSRGESYADLIRIQENSATQSRRDVPSSASRIQTLSGRVESVNPRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTGRDRFEMLSFLNDDS
Ga0184646_155501313300019259Groundwater SedimentMRLAKRRSKRARFSTSRVVKFIGRAAALLALGTLWIGVPGCMENIALIGRPSIAEDHDDVVGEVERVDLSGRRIFLRPNKSDRRVVALSADAQVLDRGREYPVARLKPGDVVAMQLKRDSHGEPYADLLRIQENSASQRRGDIPSAAPRIETLAGTVASINRRDNSFELDDRSGPPVSVLLSEYVRDSDRDRFRNLRAGARVRIEGKFTARDRFEMLSFLNDDS
Ga0193715_105501113300019878SoilMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSGRWIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS
Ga0193707_1000006373300019881SoilMENIALIGRPTIEEGQNDVVGEVERVDLSGRWIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS
Ga0193735_100922613300020006SoilVKIIERATSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSGRWIYLRPNKSDRRVVALSLDAQVLDRGREYPLGRLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS
Ga0193749_103766313300020010SoilMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSGRWIYLRPNKSDRRVVALSLDAQVLDRGREYPLARLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS
Ga0210378_1003459913300021073Groundwater SedimentMENIALIGRPSIAEGWDDVVGEVERVDLSARRLYLHPNKSDRRVVTLSADAQVLDRGREYPVARLKAGDVVAMQVKRDSRGESYADLIRIQENPTSQSRGDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVLLSEYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0222625_159214713300022195Groundwater SedimentLVKRRSRKARFFTSSRRAKLIARAASLLALGMLWLGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLPARRLYLRPNKGDRRVVAFSADAQGLDRGREYPVAQLKPGDVVAMQVKRDSRGESYADLIRIQENSATQSRRDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0209438_101644123300026285Grasslands SoilMRLVKRRSRKARFFTSSRRVKLIRRAAALLTLGTLVVGVSGCMENIALIGRPTIEEGQSDVVGEVERVDLSERRIYLRPNESDRRVVALSTDAQVLYRGREYPVTRSKPGDVVAMQIKRDPRGESYADLIRIQENPASQSRGDVPSSAPRIETLAGRVESVDRRDNSFELDQSGRPVSVLLSEYVRDSDKHRFRTLRAGDNVRIEGKFTARDRFEMLSFLNDDS
Ga0209761_124964913300026313Grasslands SoilPGCMENIALIGRPTIEEGQDDVVGEVERVDLASRRIYLRPNKSDRRVVAFSTDAPVLYRGREYPVARLKPGDVVVMQVKRDSRGDLYADLIRVQENPASQSRRDVPSSVPRIETLAGVVESVNRRDNSFELDDQSGPPVLVRLSEYVRESDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0209814_1009294233300027873Populus RhizosphereVKLVDRAAALLALGMILTGVPGCMENIALIGRPTIEEGQDDVVGEVERVDLSSRRIYLRPNKSDRRVVAFSADAPVLYRGREYPVARLEPGDFVAMQVRRDSHGDLYADLIRLQENPASQSRRDVPSSAPRIETLAGRVESINRQDNSFELDDQSGPPVLVRLSEYVR
Ga0209382_1067097613300027909Populus RhizosphereMRPVKLRSRRARLFTNSRRAQLIAQAAALVALGTLWIGVPGCMENIALIGRPTIEEGQDDVVGEVERVDIAARRLYLRANKSARRVVALSADAQVFDRGREYPITRLKPGDVVATQIKRDSRGEPYADLLRIQENAAGQSSRGVPGAAPRIETLAGTVESVNRRDNSFELDDRSGPPVVVRLSEYIRDSDRERFRTLRAGSR
Ga0307320_1026320513300028771SoilMLWIGVPGCMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVAFSADAQVLDRGREYPVAQLNPGDVVAMQVKRDSRGESYADLIRIQENPTSQSRGDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTL
Ga0308205_101032513300030830SoilMLWIGVPGCMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVALSVDAQVLDRGREYPVAQLNPGDVVAMQVKRDSRGESYADLIRIQENPTSQSRGDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0308206_102756613300030903SoilMENIALIGRPTIEEGRDDVVGEVERVDLSARRLYLRPNKGDRRVVAFSADAQVLDRGREYPVAQLNPGDVVAMQVKRDSRGESYADLIRIQENPTSQSRGDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDDS
Ga0308204_1006385623300031092SoilMLWIGVPGCMENIALIGRPTIEEGRNDVVGEVERVDLSARRLYLRPNKGDRRVVAFSADAQVLDRGREYPVAQLKPGDVVAMQVKRDSRGESYADLIRIQENSATQSRRDVPSSASRIQTLAGRVESVNRRDNSFELDDQSGPPVSVRLSDYVRDSDRDRFRTLRAGAHVRIEGKFTARDRFEMLSFLNDEP
Ga0308194_1013126123300031421SoilSLLALGMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSGRRIYLRPNKSDRRVVALSLDAQVLDRGREYPLERLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRIEGKFTQRDRFEMLSFLNDDS
Ga0307473_1058106913300031820Hardwood Forest SoilMLLISVPGCMENIALIGRPTIEEGQNDVVGEVERVDLSERRLYLRPNKSDRRVVALSLDAQVLDRGREYPLARLKPGDVVAMQVKRDSRGDSYADLIRIQESPASQRRGDVPSSVPRIETLAGRVVSVNRRDNSFELDDQSGPPVSVRLSEYVRDSDRDRLRTLRAGDHVRI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.