NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F074528

Metagenome Family F074528

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F074528
Family Type Metagenome
Number of Sequences 119
Average Sequence Length 214 residues
Representative Sequence PVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEEGREAIIAEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVTVQREARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPISTTPTGRLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD
Number of Associated Samples 83
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score0.84

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.160 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(33.613 % of family members)
Environment Ontology (ENVO) Unclassified
(42.017 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(46.218 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 12.16%    β-sheet: 39.64%    Coil/Unstructured: 48.20%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.84
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
b.70.1.0: automated matchesd6zcwa_6zcw0.53731
b.70.1.0: automated matchesd4cvba_4cvb0.53589
b.69.13.1: Oligoxyloglucan reducing end-specific cellobiohydrolased1sqja11sqj0.52806
b.70.1.1: Quinoprotein alcohol dehydrogenase-liked2ad6a_2ad60.52165
b.70.3.1: DPP6 N-terminal domain-liked1xfda11xfd0.51725


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 119 Family Scaffolds
PF01396zf-C4_Topoisom 37.82
PF00106adh_short 10.08
PF01751Toprim 5.88
PF13561adh_short_C2 5.04
PF01134GIDA 3.36
PF06267DUF1028 2.52
PF08241Methyltransf_11 0.84
PF01040UbiA 0.84
PF13428TPR_14 0.84
PF02899Phage_int_SAM_1 0.84
PF00326Peptidase_S9 0.84
PF02481DNA_processg_A 0.84
PF07676PD40 0.84
PF13469Sulfotransfer_3 0.84

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 119 Family Scaffolds
COG3342Uncharacterized conserved protein, Ntn-hydrolase superfamilyGeneral function prediction only [R] 2.52
COG0758Predicted Rossmann fold nucleotide-binding protein DprA/Smf involved in DNA uptakeReplication, recombination and repair [L] 1.68
COG4973Site-specific recombinase XerCReplication, recombination and repair [L] 0.84
COG4974Site-specific recombinase XerDReplication, recombination and repair [L] 0.84


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.16 %
All OrganismsrootAll Organisms0.84 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300012685|Ga0137397_10000024All Organisms → cellular organisms → Bacteria91263Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil33.61%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil20.17%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil13.45%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.40%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.72%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.20%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.36%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.52%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment2.52%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.68%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.84%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.84%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.84%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.84%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25383J37093_1010723913300002560Grasslands SoilRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
JGI25382J43887_1029094813300002908Grasslands SoilAAAGAGVKYGDAVPGVIRPSDLERIKRVLHLDRFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKTVCEAGGFCPDPVGFVGSRVRIVLLQDKDVAPLLMIDQEVRGAGQELVDLRAVGVAGEVLGWSGRAETANGHVGLSLTPIIGAADGGAGAGLDPPLLFRWNPKRDRFQFYDCVAGDDGTTRCDFLDEIGG*
Ga0066683_1005921733300005172SoilVTRRPAVVLSLALVLMTAHASGRAAAGVKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAAVWLGAEEGREAIVSEPLSDGTLKKVKDVCDAGGFCPDPVGFVASRVRIVLLRNGDVETVVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPEASDGHIALSLTPITENEGGEVAARLDPPLLVRFNPKTGHFQVFDCVARDGGEADCDFIDEPGD*
Ga0066683_1013589423300005172SoilDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD*
Ga0066680_1037550723300005174SoilVRRALLLIVLAALRPAAAGAGVKYGDAVPGVIRPSDLERIKRVLHLDRFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKTVCEAGGFCPDPVGFVGSRVRIVLLQDKDVAPLLMIDQEVRGAGQELVDLRAVGVAGEVLGWSGRAETANGHVGLSLTPIIGAADGGAGAGLDPPLLFRWNPKRDRFQFYDCVAGDDGTTRCDFLDEIGG*
Ga0066685_1006213213300005180SoilMRPADPAPENPIEHWMGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVTVQREARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0066685_1018051013300005180SoilPAIRSAGPPRCHRLHDLEAVVRRAVAVIGGLAVALVGAAAPDRAAAGVRYGDAVPGVIHPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLHDETLDKVRKACDSGGFCPDPVGFVATRVLIVVLRDKDVETMVTVQKDARGGRGRLVDLRGLGVEGEILGWNGRADASEGHVALSLTPITGTANGGAGAGLDPPLLIRFNPKTERFQLYDCMAGDDGGTICDFREEPGD*
Ga0066676_1003290523300005186SoilPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD*
Ga0066676_1028437813300005186SoilKYGDAVPGVVRPSDLERIKSVLHLDRFSVIAAVRLGAEEGREAVIAEPLPDDILAKIKAVCEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRAAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG*
Ga0070680_10148385413300005336Corn RhizosphereVLAATAADPGVKYGDAVPAVIRPSDLQRIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKGACEAGGFCPDPVGFVAARVRIVVLDGPDIVTIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALALTPITVSGDGPAGAGLDPPLLVRFNPDRGRFQAYDCLRTEDDTVRCDFRD
Ga0070692_1000137963300005345Corn, Switchgrass And Miscanthus RhizosphereMRARAALIAVPILTVLASAAADPGVKYGDAVPAVIRPSDLERIERVLHLARFSVIAAVRLGAEEGREAVIAEPLSDEALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDILPVVTIEKEAQCRGRRLVDLQRLGLEGDILGWNGRAEAANGHVALALTPISEIDGGRAAAGLDPPVLIRWNPDRNRFQSYDCVRAEDDTVRCGFQDEP*
Ga0070703_1026787613300005406Corn, Switchgrass And Miscanthus RhizosphereMRHRAALVAIPVLVFLAATAADPGVKYGDAVPAVIRPSDLQRIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVTIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALALTPITVSGDGPAGAGLDPPLLVRFNPDRGRFQAYDCLR
Ga0066686_1004561433300005446SoilMSNGGGATSYRSRLGRALAWLIILAEFPTGAIGAGVKYGDAVPGVIRPSDLERIKSVLHLERFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKAACEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRAAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG*
Ga0066686_1006695923300005446SoilMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0066686_1012898923300005446SoilVRRAVAVIGGLAVALVGAAAPDRAAAGVRYGDAVPGVIRPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLHDETLDKVRKACDSGGFCPDPVGFVATRVLIVVLRDKDVETMVTVQKDARGGRGRLVDLRGLGVEGEILGWNGRADASEGHVALSLTPITGTANGGAGAGLDPPLLIRFNPKTERLQLYDCMAGDDGGTICDFREEPGD*
Ga0066686_1014234213300005446SoilLIPTPLAPDIERMSVGTSASRDRSWRTLLLIALALLAALGSPAARAGVIYGDALPGVIRPSDLERIKRVLHLDRFSVIAAVRLGAEAGREVVIAEPLPEEALGKIKTACDAGGFCPDPVGFVGARVSVVLLQDKDVVPVVTIEKEIRGAGRPLVDLREIGVAGGILGWSGRAEAADGHVALALTPVTESIEGRVAAGADPPILIRWNPRRDRFQFFDCDAAEDGSTHCDFKDAPGD*
Ga0066682_1004896423300005450SoilMSNGGGATSYRSRLGRALAWLIILAEFPTGAIGAGVKYGDAVPGVIRPFDLERIKSVLHLDRFSVIAAVRLGAEEGREAVIAEPLPDEALAKIKAACEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRGAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG*
Ga0066682_1038651513300005450SoilIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQREARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD*
Ga0066681_1012099813300005451SoilMQAAGLKVLILVILLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPLGFVASRVRIVLLRDGEVETVVVVQERVLGAHGRLVELRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVARDGGEADCDFIDEPGD*
Ga0070707_10044958723300005468Corn, Switchgrass And Miscanthus RhizosphereMRLRAALVAIPILTVLASTALQPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLSDAALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDVVSIVSIEKEAQCRGRRLIDLQRLGLEGEILGWNGRAEAANGHVALGLTPITETGDGRTGAGLEPPLLVRFNPDRNRFQAFDCVRTEDDTVRCDFQDEPGD*
Ga0070697_100000075533300005536Corn, Switchgrass And Miscanthus RhizosphereMNGGTSASRDHAWPRLALFSLAVLAALGSSAARTGVIYGDAVPGVIRPSDLDRIRRVLHLDRFSVIAAVRLGAEDGREVVIAEPLPDEVLKKVAGSCEAGGFCPDPVGFIGARVRIVLLRDKDVVPVVTIDKEIRGGGRLLVDLRDLGVAGEILGWNGRAEAANGHVALAVTPIAGTIEGRTVAGAEPPLLIRWNERRERFQFFDCAAAEDGSTRCDFRDEAGG*
Ga0070697_10006519823300005536Corn, Switchgrass And Miscanthus RhizosphereVTRRPAVFLSLALVLMTAHASGRAVAGVKYGDAVAGIVRPSDLERIKRVLRLDRFSIIAAVWLGAEEGREAIVSEPLSDGTLKKVKDVCDAGGFCPDPVGFVASRVRIVLLRNGDVETMVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPEASDGHIALNLTPITENEGGEVAARLDPPLLVRFNPKTGHFQVFDCVAGDGGEAECDFIEELGD*
Ga0070697_10185632513300005536Corn, Switchgrass And Miscanthus RhizosphereILTVLASTALQPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLSDAALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDVVSIVSIEKEAQCRGRRLVDLEGLGLEGEILGWNGRAEAANGHVALGLTPITETGDGRTGAGLEPPLLVRFNPDRSRF
Ga0070704_10062801113300005549Corn, Switchgrass And Miscanthus RhizosphereMSHRAALVAIPVLAVLAVTAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKATCEAGGFCPDPVGFVAARVRIVVLDGPDILPVATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALALTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQAYDCLRTEDDTVRCDFRDEPGD*
Ga0066695_1025262513300005553SoilMQAAGLKVLILVILLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPLGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAELDPPLLVRFNPKRGRFQVFDCVAGDGGEADCDFIDEPGD*
Ga0066692_1071007713300005555SoilRREQARVLITAPAAPIYSPVTRRAAVVLSLALVLMTAHASGRAAAGVKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQEQVLGARGGLVDLRGLGVEGEILGWNGRPDSSGGHIALSLTPITENGGGEIAAGLDPPLLVRFNP
Ga0066692_1082846113300005555SoilAGVKYGDAVPGVIRPSDLERIKRVLHLDRFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKTVCEAGGFCPDPVGFVGSRVRIVLLQDKDVAPLLMIDQEVRGAGQELVDLRAVGVAGEVLGWSGRAETANGHVGLSLTPIIGAADGGAGAGLDPPLLFRWNPKRDRFQFYDCVAGDDGTTRCDFLDEI
Ga0066698_1001901123300005558SoilMRPADPAPENPIERWTGGIQNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0066698_1045316123300005558SoilRSRLGRALAWLIILAEFPTGAIGAGVKYGDAVPGVIRPSDLERIKSVLHLERFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKAACEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRAAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG*
Ga0068862_10014847513300005844Switchgrass RhizosphereMRLRAALVAIPILTVLASTALQPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLSDAALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDVVSIVSIEKEAQCRGRRLIDLQRLGLEGEILGWNGRAEAANGHVALGLTPITET
Ga0066656_1001615723300006034SoilMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0075421_10002150393300006845Populus RhizosphereMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQAFDCLRTEDDTVRCDFRDEPGD*
Ga0075431_10006337223300006847Populus RhizosphereMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQAYDCLRAEDDTVRCDFRDEPGD*
Ga0075436_10006615523300006914Populus RhizosphereVSRPAALVLSVAIAGLCAGVLTGVSAGVKYGDAVPGIIRPSDLERIKLALHLDRFSVIAAVRLGVDEGREVVLSEPLDDGTLKRVKDACDAGGFCPDPVGFVASRVRIVLLLDKDVDPIVAIDKEARGPRGRMVDLGVLGVAGEILGWNGRPDASDGHVALSLTPIVRNEKGETAPGLDPPLLVRFNPKHGRFQVFDCVAGDGGEAVCSFVEEPGG*
Ga0099791_1018678113300007255Vadose Zone SoilVRRALPIFAGLAVVAVLAHGPVAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREAIIAEPLSDETLKRVKDACEAGGFCPDPVGFVASRVRIVLLQGKDVETVVTVQREPRGARGRLVDLRGLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAPGLNPPLLLRFNVERDRFQVFDCVAGDDGGAVCHFLEEPGD*
Ga0066710_10001553513300009012Grasslands SoilMRPMLVPLVVLAALGSSAARAGVIYGDAVPGVIRPSDLDRIRRVLHLDRFSVIAAVRLGAEDGREVVIAEPLPEEVLKRVAGSCEAGGVCPDPVGFICARVRIVLLQDKDIVPVVTIEKEIRGGGRLLVDLRDLGVPGEILGWSGRAEAASGHVALAVTPISGTIEGRIVTGAEPPLLIRWNTQRERFQFFDCAAAEDGSTRCDFRDEAGG
Ga0066710_10007437643300009012Grasslands SoilMSVGTSASRDRSWPTLLLIALALLAALGSPAARAGVIYGDALPGVIRPSDLERIKRVLHLDRFSVIAAVRLGAEAGREVVIAEPLPEEALGKIKTACDAGGFCPDPVGFVGARVSVVLLQDKDVVPVVTIEREIRGAGRPLVDLREIGVAGGVLGWSGRAEAADGHVALALTPVTETIEGRVAAGADPPILIRWNPRRDRFQFFDCVAAEDGSTRCDFKDAPGD
Ga0066710_10019356323300009012Grasslands SoilVRRAVAVIGGLAVALVGAAAPDRAAAGVRYGDAVPGVIRPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLHDETLDKVRKACDSGGFCPDPVGFVATRVLIVVLRDKDVETMVTVQKDARGGRGRLVDLRGLGVEGEILGWNGRADASEGHVALSLTPIAGTANGGAGAGLDPPLLIRFNPKAERFQLYDCTAGDDGGTICDFREEPGD
Ga0066710_10126928613300009012Grasslands SoilMRSVSGTAGLRVDSGPRGPDIGHMSNGGGATSNRSRLGRALAWLIILATLSTGTISAGVKYGDAVPGVIRPYDLERIKSVLHLDRFSVIAAVRLGAEDGREAVISEPLPDETLANIKSACEGGGFCPDPVGFVGSRIRIVLLQDKDVLPLVTIEQEVRGAGQRLVDLRELGVDGQILGWNGRADAANGHVALSLTPITARADGRAAAGLDIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEV
Ga0066710_10131016513300009012Grasslands SoilMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCV
Ga0066710_10228744313300009012Grasslands SoilPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEEGREAIIAEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVTVQREARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPISTTPTGRLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD
Ga0066710_10428251113300009012Grasslands SoilPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCV
Ga0099827_1004815113300009090Vadose Zone SoilMRTRASDRATAGRAFVTRRPAVALSLALVLMTAQASGPAAAGVKYGDAVPGGVRPSDLERIKRALRLDRFSMLAAVRLGAKEGREAIVAEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGEVSPGLDPPLLIRFNPKRGRFQVFDCVARDGGEADCDFIDEPGD*
Ga0066709_10014830033300009137Grasslands SoilMPIADPAPGGPRPSGRRSAVLLLLGTVLLAVQSPAGVIAGPSYGDAVPAVVRPSDLERIKRALRLDRFSIIAAVRLGVEEGREAIVAEPLSDETLKKVKDACDAGGFCPDPVGFVASRVRIVLLQDKDVETAIVVQNEARGARGRLVDLRGLGINGEILGWNGRPDSSGGHVALSLTPIAESAGGEVAAGLDPPLLIRFNPKAGRFQVYDCVAGDAGGADCDFIEEPGD*
Ga0066709_10039393423300009137Grasslands SoilMQAAGLKVLILVILLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPLGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVAGDGGEADCDFIDEPGD*
Ga0066709_10098540923300009137Grasslands SoilMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQGEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD*
Ga0114129_1001482593300009147Populus RhizosphereMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQAYDCLRTEDDTVRCDFRDEPGD*
Ga0134088_1003254513300010304Grasslands SoilVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQGEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0134088_1013137013300010304Grasslands SoilVPGVVRPADIETIKRVLDLQRFSIIAAVRLGAEEGREVVLAEPLSDETLKRVREGCEAGGYCPDPVGFVASRVLIVLLADKGVETIVSVQKEVLGGRRRLADLRALGLEGEILGWNGRAEASAGHVALSLTPVAGRDGAYGPGLEPPLLIRFNPDRDRFQVYDCVAANGGEPICDFRDGPED*
Ga0134111_1011538113300010329Grasslands SoilVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRD
Ga0126377_1146386113300010362Tropical Forest SoilMRPTPLNRGSRGGLILLALAIAGPAIAGVKYGDAVPGIIHPADIERIKRVLDLQRFSIIAAVRLGAEEGREAVIAEPLPDETLRRVKEGCEAGGFCPDPVGFVASRVRIALLIDKGVETLVTLQGEVRGEHGRLVDLRDLGVEGEVLGWNGRADAADGHVALSLTPIVRRGDGGIASGLEPPLLIRFNPEESRFQVYDCVAGD
Ga0134127_1000454593300010399Terrestrial SoilMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPIATIEKQALSRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQVYDCLRTEDDTVRCDFRDEPGD*
Ga0134122_1132516523300010400Terrestrial SoilMRLRAALVAIPILTVLAATAADPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLTDEALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDVVPIVSIEKEAQCRGRRLVDLQRLGIEGDILGWNGRAEAANGHVALGLTPITESGDGRTGAGLEPPLLLRFNPDRNRFQAYDCVR
Ga0134121_1010526723300010401Terrestrial SoilMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPVATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQVFDCVAGDGGEAVCSFVEEPGG*
Ga0137388_1021258323300012189Vadose Zone SoilLVIGRRGLARVLITAPAAPIYSPVTRRAAVVLSLALVLMTAHASGRAAAGVKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQSQVLGARARLVDLRDLGVEGEILGWNGRPDASDGHIALSLTPITENESGEVAAGLDPPLLLRFNPKRGRFQVFDCVAGDGGEADCDFIDEPGD*
Ga0137383_1001376523300012199Vadose Zone SoilMDGMNTGVRAPLFGSRRRRALLSLVVPVALWPAGVGAGVKYGDAVPGVIRPSDLERIKRVLHVDRFSVIAAVRLGAEEGREVVIAEPLPDETLAKIMSVCESGGFCPDPVGFVGSRVRIVLLQDRDVVPRVTIEQEARGSGQRLVDLRALGIAGEILGWNGRAETADGHVGLSLTPIVEAGGRAGAGLDPPLLIRWNPKRDRFQFFDCVAGKDGTTRCDFQEEVGE*
Ga0137383_1022183823300012199Vadose Zone SoilMPPGLEPLIVIAMLGSAAAATAGVKYGDAVPGVIRPSDLERIKSVLHLDRFSVIAAVRLGAEDGREAVIAEPLPNETLARVKSACEGGGFCPDPLGFVGVRIRIVRLQDKDVLPLVTIEQEVRGAGQRLVDLRELGVDGQILGWNGRADTANDHVALSLTPMTATADGRAGAGLDPPLLIRWNP
Ga0137363_1001537233300012202Vadose Zone SoilMTAQASGPAAAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVAEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGEVSPGLDPPLLIRFNPKRGRFQVFDCVARDGGEGDCDFIDEPGD*
Ga0137399_1030260423300012203Vadose Zone SoilMSSETGVSSFGPRLQAALLSLIVLAGLEPAAVGAGVKYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGVEEGREVVIAEPLPDEALAKVKTACEAGGFCPDPVGFVGARIRIVLLHDKDVVPLVTAEHEVRAAGGRLVDLGALGVTGEVLGWNGRAEAANGHVGLGLTPITGTAGGGAGAGLDLPLLIRWNPKRDRFQFFDCVVGDDGTTRCDFQDEVGG*
Ga0137399_1042734913300012203Vadose Zone SoilVAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREVIIAEPLSDETLKRVKDACEAGGFCPDPVGFVASRVHIVLLQDKDVETVVTVQTEPRGARGRLVDLRDLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAAGLNPPLLLRYNVKASRFQVFDCVAGNAGGTICDFLEEPGD*
Ga0137362_1031114513300012205Vadose Zone SoilMTAHASGRAAAGVKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAAVWLGAEEGREAIVSEPLSDGTLKKVKDVCDAGGFCPDPVGFVASRVRIVLLRNGDVETVVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPEASDGHIALSLTPITENGSGEVSPGLDPPLLVRFNPKRGRFQVFDCVARDGGEADCDFIDEPGD*
Ga0137380_1003524523300012206Vadose Zone SoilMQAGLKVLILVVLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGEVSPGLDPPLLVRFNPKRGRFQVFDCVARNGGEADCDFIDEPGD*
Ga0137380_1021724013300012206Vadose Zone SoilVALWPAGAGAGVKYGDAVPGVIRPSDLERIKRVLHVDRFSVIAAVRLGAEEGREAVIAEPLPDETLAKIMSVCESGGFCPDPVGFVGSRVRIVLLQDRDVVPRVTIEQEARGSGQRLVDLRALGIAGEILGWNGRAETADGHVGLSLTPIVEAGGRAGAGLDPPLLIRWNPKRDRFQFFDCVAGKDGARRCDFREEVGE*
Ga0137381_1028558433300012207Vadose Zone SoilVRRALTVVLGLAVAALCAVTPCRVSAGVKYGDAVPGVIHPSDLERIKRVLRLDRFSIIAAVWLGAEEGREAIIAEPLSDGTLRKVKDACEAGGFCPDPVGFVASRVRIVLLQDRDVETVVMVQKEARGARGRLIDLRGLGVDGEILGWSGSAGASDGHVALSLTPVTETEDGKVGAGLDPPILIRFNP
Ga0137379_1075203223300012209Vadose Zone SoilMQAGLKVLILVVLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVAEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIAMSLTPITENGSGEVSPGLDPPLLVRFNPKRGRFQVFDCVARDGGEADCDFIDEPGD*
Ga0137377_1152796013300012211Vadose Zone SoilGVIRPSDLERIKSVLHLDRFSVIAAVRLGAEDGREAVIAEPLPNETLARVKSACEGGGFCPDPLGFVGVRIRIVRLQDKDVLPLVTIEQEVRGAGQRLVDLRELGVDGQILGWNGRADTANDHVALSLTPMTATADGRAGAGLDPPLLIRWNPKRDRFQFLDCVAGDDGTTRCDFQDEVGG*
Ga0137387_1034552413300012349Vadose Zone SoilMALLLLATQAPGRAAAGVRYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEDGREAIVAEPLPDGTLKKVKDACDGGGFCPDPVGFVASRVRIVLLRNGDVETMVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPEASDGHIALSLTPITENEGGDVAARLDPPLLVRFNPKTGHFQVFDCVAGDGGEANCDFIEEPGD*
Ga0137367_1098897813300012353Vadose Zone SoilVIRASDLERIKRILHLDRFSVIAAVRLGAEEGREAIVAEPLPDETLKKLKTACEGGGFCPDPVGFVASRVRIVRLLDGDVATVIVVQTEARGAHGRLADLRQMGVAREILGWNGRADASDGHVALSLTPLTGTEEGRVGAALDPPLLIRFNPKRDRFQVYDCIEGDDGGAVCAFKDETGD
Ga0137384_1025096823300012357Vadose Zone SoilVALWPAGAGAGVKYGDAVPGVIRPSDLERIKRVLHVDRFSVIAAVRLGAEEGREAVIAEPLPDETLAKIMSVCESGGFCPDPVGFVGSRVRIVLLQDRDVVPRVTIEQEARGSGQRLVDLRALGIAGEILGWNGRAETADGHVGLSLTPIVEAGGRAGAGLDPPLLIRWNPKRDRFQFFDCVAGKDGTTRCDFQEEVGE*
Ga0137360_1092544513300012361Vadose Zone SoilMWPSEASGPAAAGVKYGDAVPGVVRPSDLERIKRVLRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRNRDVETVVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGEVSPGLDPPLLIRFNPKRGRFQVFDCVARDGGEGDCDFIDEPGD*
Ga0137361_1010443923300012362Vadose Zone SoilMTAHASGRAAAGVKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAAVWLGAEEGREAIVSEPLSDGTLKKVKDVCDAGGFCPDPVGFVASRVRIVLLRNGDVETVVVVQAQVLGARGRLVDLRGLGVEGEILGWNGRPEASDGHIALSLTPITENEGAEVAARLDPPLLVRFNPKTGHFQVFDCVARDGGEADCDFIDEPGD*
Ga0137397_1000002423300012685Vadose Zone SoilMQAGLKVLILVVLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLEKIKRALRLDRFSIIAAVRLGAEEGREVIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLQDKDVQTLVVLKDQVLGTGGKLVRLRDLGVEGEILGWNGRPDASDGHIALSLTPITENEGGVVAASLDPPLLIRFNPEAGRFQVFDCVAGDGGEPDCAFLGELGD*
Ga0137396_1002101843300012918Vadose Zone SoilMAIRDALAATHEGTSPMVSVTRSGEKTAPAGALAQGPRWAGVDSGPRRPDIASMSSETGVSSFGPRLQAALLSLIVLAGLEPAAVGAGVKYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEEGREVVIAEPLPDEALAKVKTACEAGGFCPDPVGFVGARIRIVLLHDKDVVPLVTAEHEVRAAGGRLVDLRALGVTGEILGWNGRAEAANGHVALGLTPITGTAGGGAGAALDLPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFQDEVGG*
Ga0137396_1009566113300012918Vadose Zone SoilVAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREAIIAEPLSDETLKKVKDACQAGGFCPDPVGFVASRVRIVLLRDKDVETVVTVQREPRGARGRLVDLRGLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAAGLNPPLLLRYNVKASRFQVFDCVAGNAGGTICDFLEEPGD*
Ga0137396_1023086513300012918Vadose Zone SoilMQAAGIKVLILVVLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLADGTLKKVKDACDAGGVCPDPVGFVASRVRIVLLRDADVETVVVVQERVLGARGQLVDLRGLSVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVARDGGKADCDFIDEPGD*
Ga0137394_1005460923300012922Vadose Zone SoilVRRALPIFAGLAVVAVLAHGPVAAGVKHGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREAIIAEPLSDETLKKVKDACEAGGFCPDPVGFVASRVRIVLLRDKDVETVVTVQREPRGARGRLVDLRGLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAPGLNPPLLLRFNVERDRFQVFDCVAGDDGGTVCDFLEEPGD*
Ga0137394_1075443123300012922Vadose Zone SoilMPGLAAGHRAVIRTIVGHCPTRRLFNIVLLGVQIVLLPLQTIGPAVAGAKYGDAVPGIIRPSDLERIKRVLHLDRFSVIAAVRLGAEEGREAIVTEPLTDETLKMVKDACDAGGFCPDPVGFVASRVRIVLLRDQDIETRIVVQTEARGARGRLVDLRALGVEGEILGWNGRPDVSDGHLALSLTPITPMDGGSVAGGL
Ga0137416_1024490823300012927Vadose Zone SoilVRRALPVFAGLAVVAVLAHGLAAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREVIIAEPLSDETLKRVKDACEAGGFCPDPVGFVASRVHIVLLQDKDVETVVTVQTEPRGARGRLVDLRDLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAAGLNPPLLLRYNVKASRFQVFDCVAGNAGGTICDFLEEPGD*
Ga0137416_1070762223300012927Vadose Zone SoilMQIADPEPGVRAPAGRRPAALLLLGTVLLAVQTPSWTAGAKYGDAVAGVVRPSDLERIKRVLRLDRFSIIAVVRLGVEEGREAIVAEPLSEETLKKVKDGCDAGGFCPDPVGFVASRVRIVLLRDKDLETLVVVQKEALGGHGRLVDLPGLGVQGEIFGWNGRPDASDGHVALSLTPITSIGNGELAAGLDPPLLIRFNAKAGRFQVYDCVAGDGGGANCDFSEEPGD*
Ga0137404_1013935023300012929Vadose Zone SoilMQAGLKVLILVVLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQERVLGARGQLVDLRGLSVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVARDGGKADCDFIDEPGD*
Ga0137407_1013688613300012930Vadose Zone SoilVPDPAIRSAGPPRCHRLHDLEAAVRRAVAVIGGLAVALVGAAAPDRAAAGVRYGDAVPGVIRPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLQDETLDKVRKACDAGGFCPDPVGFVATRVLIVVLRDKDVETMVTVRKEARGGRGRLVDLRGLGVEGEILGWNGRADASDGHVALSLTPITGTADGGAGAGLDLPLLIRFNPKTDRFQFYDCMAGDGGGTICDFREEPGD*
Ga0137407_1042094023300012930Vadose Zone SoilMQAGLKVLILVVLLLIVSAQVGVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQERVLGARGQLVDLRGLSVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVARDGGKADCDFIDEPGD*
Ga0137410_1001265823300012944Vadose Zone SoilVTRRPAVALSLALVLMTAQASGRAAAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQEQVLGVRGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVAGDGGEADCGFIDEPGD*
Ga0137410_1091485813300012944Vadose Zone SoilDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLTDEALAKLKTACEAGGFCPDPVGFVAARVQIVVLDGPDILPVVTIETEARCRGRRLVDLEHLGLVGKILGWNGRAEAANGHVALGLTPITETSDGRTGAGLEPPLLVRFNPDRNRFQAYDCVRTEDDAVRCDFQDEP*
Ga0134077_1007467823300012972Grasslands SoilPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0134076_1000818833300012976Grasslands SoilVIRRPAVILLLVTALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLNPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD*
Ga0134076_1013390113300012976Grasslands SoilVPGVVRPADIETIKRVLDLQRFSIIAAVRLGAEEGREVVLAEPLSDETLKRVREGCEAGGYCPDPVGFVASRVLIVLLADKGVETIVSVQKEVLGGRRRLADLRALGLEGEILGWNGRAEASAGHVALSLTPVAGRDGAYGPGLEPPLLIRFNPDRDRFQVYDCVAANGGEPICDFRNGPED*
Ga0134075_1003836113300014154Grasslands SoilGGHYIFRQDAGGIRMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEEGREAIIAEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVTVRREARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD*
Ga0134075_1026079813300014154Grasslands SoilGMSNGGGATSYRSRLGRALAWLIILAEFPTGAIGAGVKYGDAVPGVIRPSDLERIKSVLHLERFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKAACEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRAAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG*
Ga0137418_1012495523300015241Vadose Zone SoilGTRMQAGLKVLILVVLLLIVSAQVGVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLADGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDADVETVVVVQERVLGARGQLVDLRGLSVAGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFQVFDCVARDGGEANCDFIDEPGD*
Ga0137409_1025016523300015245Vadose Zone SoilMRLRAALVAIPILCVLASTAVEPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLTDEALAKLKTACEAGGFCPDPVGFVAARVQIVVLDGPDILPVVTIETEARCRGRRLVDLEHLGLVGKILGWNGRAEAANGHVALGLTPITETSDGRTGAGLEPPLLVRFNPDRNRFQAYDCVRTEDDAVRCDFQDEP*
Ga0137409_1105431313300015245Vadose Zone SoilVLLLIVSAQVSVSGRATAGVKYGDVVPGVVRPSDLEKIKRALRLDRFSIIAAVRLGAEEGREVIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIGLLQDKDVQTLVVLKDQVLGTGGKLVRLRDLGVEGEILGWNGRPDASDGHIALSLTPITENEGGVVAASLDPPLLIRFNPEAGRFQVFDCVAGDGGEPDCAFLGELGD*
Ga0137409_1154620513300015245Vadose Zone SoilIAAVRLGAEDGREIVLAEPLADEVLAQVKDGCEAGGFCPDPVGFVASRVSIVRLQGPDVQTLVTVDRTVRGSRGVMADLARIGVEGEIVGWNGRPDASDGHVALSLTPVTRSVGSRLGPGLDPPLLIRFNPRSDRFQVYDCVAGDGGQPECDFLEEPGG*
Ga0134112_1003497113300017656Grasslands SoilMRPADPAPENPIEHWMGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLVTALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGYGGVAICKFEDDAGD
Ga0134083_1024232513300017659Grasslands SoilVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEAGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLPPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAG
Ga0184623_1019668723300018056Groundwater SedimentMSVRAALVTIPILTVLASTAVDPGVKYGDAVPAVIRPSDLERIERVLHLARFSVIAAVRLGAEEGREAVIAEPLSDEALSKVKAACEAGGFCPDPVGFVAARVRIVVLEGTDVVPVVTIEKEARCRGRRLVDLQGLGLAGDILGWNGRAEAANGHVALSLTPIAEIDAEAERLIGQIPELTEADRGDTLVVLRRQYLQRQL
Ga0184612_1050820113300018078Groundwater SedimentAAASRAAAGVRHGDAVPGVIHPSDLERIKRILNLERFSIIAAVRLGATDGREAIIAEPLPDETLKKVKEVCEAGGFCPDPVGFVASRVRVVLLQDKDVETVVTVEKEARGARGRLVDLRVLGVEGDLLGWNGRAEASDGHVALSLTPVTGRPDGTIRPGLDPPLLIRFNPDQDRFQVYDCVAGDGGEAMCDFKEE
Ga0066667_1091502123300018433Grasslands SoilLLLIVSAQVSVSGRATAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPLGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITESGSGEVAAGLDPPLLVRFNPKRGRFQVFDCVAGDGGEADCDFIDEPGD
Ga0137408_134126013300019789Vadose Zone SoilLEAAVRRAVAVIVGGLAVALVGAAAPDRAAAGVRYGDAVPGVIRPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLQDETLDKVRKACDAGGFCPDPVGFVATRVLIVVLRDKDVETMVTVRKEARGGRGRLVDLRGLGVEGEILGWNGRADASDGHVALSLTPITGTADGGAGAGLDLPLLIRFNPKTDRFQFYDCMAGDGGGTICDFREEPGD
Ga0137408_134126223300019789Vadose Zone SoilPDPAIRSAGPPRCHRLHDLEAAVRRAVAVIGGLAVALVGAAAPDRAAAGVRYGDAVPGVIRPSDLERIKRVLNIERFSIIAAVRLGAEEGREVIVAEPLQDETLDKVRKACDAGGFCPDPVGFVATRVLIVVLRDKDVETMVTVRKEARGGRGRLVDLRGLGVEGEILGWNGRADASDGHVALSLTPITGTADGGAGAGLDLPLLIRFNPKTDRFQFYDCMAGDGGGTICDFREEPGD
Ga0209109_1009015513300025160SoilWEEGLPLKMPSGLSGPILCGVILVGGVVWFPAAGGVKYGDAVPGIIRPSDLERIKRALHLERFSVIAAVRLGAEEGREAVIAEPLPDEALAKVKAACEAGGFCPDPVGFVGARVRIVLLDGADVVPVVTIEKEARGPGKRLVDLGGLGLAGDILGWNGRAEAANGHVALSLTPITETAGGQAGAGLDPPLLIRWNPKRDRFQAYDCVLEEDDTTRCDFRDEPPD
Ga0207646_1051375223300025922Corn, Switchgrass And Miscanthus RhizosphereMRLRAALVAIPILTVLASTALQPGVKYGDAVPAVIRPSDLERIQRVLHLARFSVIAAVRLGAEEGREAVIAEPLSDAALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDVVSIVSIEKEAQCRGRRLIDLQRLGLEGEILGWNGRAEAANGHVALGLTPITETGDGRTGAGLEPPLLVRFNPDRNRFQAFDCVRTEDDTVRCDFQDEPGD
Ga0209235_102292523300026296Grasslands SoilMRPADPAPENPIEHWMGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD
Ga0209237_119432913300026297Grasslands SoilMRPADPAPENPIEHWMGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPI
Ga0209761_100708023300026313Grasslands SoilMRPADPAPENPIEHWMGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD
Ga0209470_100970643300026324SoilMRPADPAPENPIERWTGGIRNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPVPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIITEPLSDETLTKVKDACEAGGFCPDPVGFVASHVLIVLLRDEDVETVVMVQSEARGPRGRLIDLRDLGVEGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVVGDGGVAICEFEDEAGD
Ga0209058_100785243300026536SoilMRPADPAPENPIERWTGGIQNAFREAGCRPRPGLHVGDARVIRRPAVILLLATALLTALAPGPVAAGVRYGDAVPGVIRPSDLERIKRVLRLDRFSVIAAVRLGAEDGREAIIAEPLSDETLTKVKDACEGGGFCPDPVGFVASRVLIVLLRDEDVETVVMVQREARGPHGRLIDLRDLGVDGEILGWNGRADAADGHVALSLTPITTTPTGKLGAGLDPPLLIRFNSKRDRFQVYDCVAGDGGVAICEFEDEAGD
Ga0209157_102726733300026537SoilMSNGGGATSYRSRLGRALAWLIILAEFPTGAIGAGVKYGDAVPGVIRPSDLERIKSVLHLERFSVIAAVRLGAEEGREAVIAEPLPDETLAKIKAACEAGGFCPDPVGFVGSRIRIVLLQDKDVLPLVMIEQEVRAAGQRLVDLRELGVDGPILGWNGRADAADGHVALSLTPITATADGRAAAGLHIPLLIRWNPKRDRFQFFDCVAGDDGTTRCDFRDEVGG
Ga0209056_1026374913300026538SoilMQAGSGALRDGCAPVSRRRALALSMALVLLATQAPGRAAAGVKYGDAVPGVVRPSDLERIKRALRLDRFSIIAAVRLGAEEGREAIVSEPLPDGTLKKVKDACDAGGFCPDPVGFVASRVRIVLLRDGEVETVVVVQERVLGARGRLVDLRGLGVEGEILGWNGRPDASDGHIALSLTPITENGSGDVAAGLDPPLLVRFNPKRGRFLVFDCVAGDGGEADCDFIDEPGD
Ga0209388_109971413300027655Vadose Zone SoilMVMGPFGVLIPRGGGPILVQDRACRRHHEVDPRHPGLHRDGTTPVRRALPIFAGLAVVAVLAHGPVAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREAIIAEPLSDETLKRVKDACEAGGFCPDPVGFVASRVRIVLLQGKDVETVVTVQREPRGARGRLVDLRGLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAPGLNPPLLLRFNVERDRFQVFDCVAGDDGGAVCHFLEEPGD
Ga0209382_1007023923300027909Populus RhizosphereMSHRAALVAIPVLVVLAATAADPGVKYGDAVPAVIRPSDLERIERSLHLSRFSVIAAVRLGAEEGREAVIAEPLSDAALARVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPIATIEKQALCRGRRLVDLEGLGLEGEILGWNGRAEGANGHVALSLTPITVSGDGPAVAGLDPPLLVRFNPDRGRFQAFDCLRTEDDTVRCDFRDEPGD
Ga0137415_1013025323300028536Vadose Zone SoilVRRALPVFAGLAVVAVLAHGLAAAGVKYGDAVPGVIHPSDFERIKLTLRLDRFSIIAAVRLGAAEGREVIIAEPLSDETLKRVKDACEAGGFCPDPVGFVASRVHIVLLQDKDVETVVTVQTEPRGARGRLVDLRDLGVEGEILGWNGRAEASDGHVALSLTPITAAGEGGVAAGLNPPLLLRYNVKASRFQVFDCVAGNAGGTICDFLEEPGD
Ga0307473_1000362123300031820Hardwood Forest SoilVEVRKKRRDRGHPEIVLQARGAPLNRGSRPASRALVLLALAIAGPAGAGVKYGDAVPGIVRPADIETIKRVLDLQRFSIIAAVRLGAEEGREVVLAEPLSDETLKKVREGCEAGGYCPDPVGFVASRVLIVLLADKGVETIVSVQKEVLGGRRRLADLRALGLEGEILAWNGRAEASAGHVALSLTPVAGRDGAYGPGLEPPLLIRFNPDRDRFQVYDCVAANGGEPICDFRDGPED
Ga0307471_10021105623300032180Hardwood Forest SoilMSARAALVAIPILTVLASTAADPGVKYGDAVPAVIRPSDLERIERVLHLARFSVIAAVRLGAEEGREAVIAEPLSDGALAKVKAACEAGGFCPDPVGFVAARVRIVVLDGPDIVPVVTIEKEAQCRGRRLVDLQGLGLEGVILGWNGRAEAANGHVALALTPISEIDGGRA
Ga0307471_10028613423300032180Hardwood Forest SoilVRRALPIFAGLAVVAVLALGLAAAGVKYGDAVPGVIHPSDFERIKLALRLERFSIIAAVRLGAAEGREAIIAEPLSEETLKKVKDACEAGGFCPDPVGFVASRVRVVLLQDKDVETVVTVQGDPRGARGRLVDLRDLGVAGEVLGWNGRADASDGHVALSLTPITAGGEGRVAAGLNPPLLLRFNVKRDRFQVFDCVAGDDGGAICDFLEEPGD
Ga0307471_10031359723300032180Hardwood Forest SoilVRQAVGVLAGLAISVLASVAQGPATAGVKYGDAVPGVIRPADIERIKRVLDLERFSIIAAVRLGAEEGREVVLAEPLPDETLKKVRDVCEAGGFCPDPVGFVASRVLILLLADKGVETVVTVGKEVLWARGRLVDLRALGVEGQILGWNGRADASEGHVALSLTPVTGREQEAIGAGLEPPLLIRFNPDRDRFQVYDCVAGDDGGPICDFKEGPED
Ga0364930_0281305_14_5593300033814SedimentMNVRALVLSSLFLIALAPAAIHPGVKYGDAVPGLIRPSDLERIKRVLHLERFSVIAAVRLGAEEGREAVIAEPLPDEVLAKVKSACEAGGFCPDPVGFVGARVRIVLLDGVDVVPVVTIEKEARGPGRRLVDLVGLGVAGDILGWNGRADAANGHVALSLTPITETAGGPAGAGLDPPLLIR
Ga0364940_0016281_158_7993300034164SedimentMNVRALVLSSLFLIALAPAAIHPGVKYGDAVPGLIRPSDLERIKRVLNLERFSVIAAVRLGAEEGREAVIAEPLPDEVLAKVKSACEAGGFCPDPVGFVGARVRIVLLDGADVVPVVTIEKEARGPGKRLVDLGGLGLAGDILGWNGRAEAANGHVALSLTPITETAGGPAGAGLDPPLLIRWNPKRDRFQAYDCVLAEDDTTRCDFQEEPPD
Ga0364943_0086605_234_8843300034354SedimentVRQAGTFLAGLAVSVLGAVAPGPATAGVKYGDAVPGVIRPADIERIKRVLNLERFSIIAAVRLGAEEGREVVLAEPLSDETLKKVRDVCEAGGFCPDPVGFVASRVLIVLLADTGVETVVTVGKEARGARGRLADLRALGVEVEILGWNGRADASGGHVALSLTPVTSREQGTIGAGLEPPLLIRFNPDRDRFQVYDCVEGDDGEPICDFKEGPED


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.