NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F094289

Overview

Basic Information
Family ID F094289
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 162 residues
Representative Sequence SIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVRRSTSSGTSD
Number of Associated Samples 90
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.89 %
% of genes from short scaffolds (< 2000 bps) 1.89 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.32

Most Common Taxonomy
Group Unclassified (98.113 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (41.509 % of family members)
Environment Ontology (ENVO) Unclassified (40.566 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (48.113 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 6.18%, β-sheet: 25.28%, Coil/Unstructured: 68.54%

Predicted 3D Structure

pTM-score: 0.32

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
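The cutoff in the note above can be expressed as a one-line check. This is only a minimal sketch: the 0.7 threshold and the 0.32 score come from this page, while the function name `is_low_confidence` and the constant `PTM_CUTOFF` are hypothetical and are not part of any NMPFamsDB tooling.

```python
# Threshold quoted in the note above (pTM < 0.7 means low confidence).
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F094289 on this page.
print(is_low_confidence(0.32))
```

With the score reported here, the check returns `True`, which is consistent with the family being listed as a low quality model.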


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 106 Family Scaffolds
PF13620 | CarboxypepD_reg | 1.89
PF00069 | Pkinase | 1.89
PF05534 | HicB | 0.94
PF00034 | Cytochrom_C | 0.94
PF03575 | Peptidase_S51 | 0.94
PF01039 | Carboxyl_trans | 0.94
PF01850 | PIN | 0.94
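The percent frequencies above can be converted back to approximate scaffold counts. The sketch below assumes each percentage was computed as count / 106 and rounded to two decimals; that is an inference from the column header, not something the database states. The helper name `scaffold_count` is hypothetical.

```python
def scaffold_count(percent: float, total: int = 106) -> int:
    """Convert a percent frequency to an approximate scaffold count.

    Assumes percent = count / total * 100, rounded to two decimals.
    """
    return round(percent / 100 * total)


# A few of the Pfam percent frequencies listed above.
pfam_percent = {"PF13620": 1.89, "PF00069": 1.89, "PF05534": 0.94}
pfam_counts = {pfam_id: scaffold_count(p) for pfam_id, p in pfam_percent.items()}
print(pfam_counts)
```

Under that assumption, 1.89 % corresponds to 2 of the 106 family scaffolds and 0.94 % to a single scaffold, so each neighboring domain here is seen on only one or two scaffolds.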

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 7.55
COG0777 | Acetyl-CoA carboxylase beta subunit | Lipid transport and metabolism [I] | 0.94
COG0825 | Acetyl-CoA carboxylase alpha subunit | Lipid transport and metabolism [I] | 0.94
COG1598 | Antitoxin component HicB of the HicAB toxin-antitoxin system | Defense mechanisms [V] | 0.94
COG4226 | Predicted nuclease of the RNAse H fold, HicB family | General function prediction only [R] | 0.94
COG4799 | Acetyl-CoA carboxylase, carboxyltransferase component | Lipid transport and metabolism [I] | 0.94


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 98.11 %
All Organisms | root | All Organisms | 1.89 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300016294|Ga0182041_10404736 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1162
3300016371|Ga0182034_10675042 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 877



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 41.51%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 18.87%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.49%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.55%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 7.55%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 5.66%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.77%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.77%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.89%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.94%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300004631 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF234 (Metagenome Metatranscriptome) | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental
3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012391 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021151 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021307 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes) | Environmental
3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental
3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental
3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027045 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 40 (SPAdes) | Environmental
3300027376 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M1 (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027703 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 81 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300030730 | Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 (Metagenome Metatranscriptome) | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031833 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178 | Environmental
3300031890 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2) | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032035 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12627J18819_104582831 | 3300001867 | Forest Soil | GQKVRLHPTGAPTGTPPNVMITVDQVQLEPSEVTGTVTATDSSSTPPTFTLGNLPSFFTNAGIMSIQVDVLSNTQFETEEDQMMSGLSSLKTGDMVSVRGLLFNTMTTPTMVAEKVVKRSMSSGTSD*
Ga0058899_106502771 | 3300004631 | Forest Soil | VDTSGFMFPSGLNFLSAADLMVGQEVRLHPTGLPTGTPPNLMVTVDQVQLEPTFVTGTITAVNASSNPQTFTLGSLPSYFTNAGITSIQVDILATTRFETEEDQTFSGLSSVKSGDMVSVRGPLFKTMTMPTMAAEKVVKRSGTGD*
Ga0066672_104313171 | 3300005167 | Soil | LLAFEVKFFQPPNQMSFAGSVTSVSTSSFQIVLFDEEFFGGGDEMGSFSMGVPLTINLAPQATFSIDTSSFMLPSGLNFASIADLMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFALGSLPSFFTNAGITSIQVDVLATTQFKTEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0066688_104371541 | 3300005178 | Soil | LHPTGVPTGTPPNLMITVDQVQLEPSFVTGTVTAVNIGSNPQTFTLGSLPSFFTNAGITSIQVDVFATTQFETEEDQTLSGLGSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTGD*
Ga0066678_110007841 | 3300005181 | Soil | FQIVLFDEEFFGGGDEMGSFSMGVPLTINLAPQATFSIDTSSFMLPSGLNFASIADLMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFALGSLPSFFTNAGITSIQVDVLATTQFKTEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSM
Ga0066671_103430092 | 3300005184 | Soil | FAAADLMVGQKVRLHPTGAPAGMPPNVMITVDQVQLEPSDVTATITAINAGSNPQTFTLGTLPMFFQNAGIMSIQVDVLSNTQFETEEDQMMSGLSSFKTGDIVSVRGLLFNTMTTPTMVAEKVVGRSMSSGTSD*
Ga0066681_103946631 | 3300005451 | Soil | PSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0066687_101452571 | 3300005454 | Soil | LHPTGAPTGTPPNLMVTVDQVQLEPSFLTGTVTAVNTSSNPQTFTLGSLPPFFTNAGITSLQVDVLATTRFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTGD*
Ga0066697_100274571 | 3300005540 | Soil | QATFSIDTSGFMFPSGLNFASTADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSLKPGDVVSVRGPLFNTMTMPTMAAEKVVKRSMSSGTSD*
Ga0066695_100897024 | 3300005553 | Soil | GLNFASTADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSLKPGDVVSVRGPLFNTMTMPTMAAEKVVKRSMSSGTSD*
Ga0066699_100247276 | 3300005561 | Soil | TFAAADLMVGQKVRLHPTGAPAGMPPNVMITVDQVQLEPSDVTATITAINAGSNPQTFTLGTLPMFFQNAGIMSIQVDVLSNTQFETEEDQMMSGLSSFKTGDIVSVRGLLFNTMTTPTMVAEKVVGRSMSSGTSD*
Ga0066703_108834451 | 3300005568 | Soil | LMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFKTEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0066708_108101571 | 3300005576 | Soil | NFASAADLMVGQEVRLHPTGAPTGTPPNLMVSVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTTPAMAAERVVKRSMSSGSGH*
Ga0066691_101324601 | 3300005586 | Soil | PSGLNFASIADLMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0066706_111898481 | 3300005598 | Soil | EMGSFSMGSPLTINLATPTAFSVDTGGFMLPSGLSFMSPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSNTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMGAEKVVNRSMSSGTDD*
Ga0066652_1004011201 | 3300006046 | Soil | FSCLQTGQIVKVDAKMKPDGSLLAFEVKFFQPPNQMSFAGTVTSVNTVNTVNMGASSFQIVLFDEESFGGGDEMGSFSMGAPLTINLAPQATFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0070716_1006455922 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MVGQEVRLHSTGPTTGTPPNLMVTVDQAQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLATTRFETEEDQTFSGLGSLKSGDVVSVRGPLFKTMTMPTIAAEKVVKRSTSSGTSD*
Ga0066710_1013138831 | 3300009012 | Grasslands Soil | STSSFQIVLFDEEFFGGGDEMGSFSMGVPLTINLAPQATFSIDTSSFMLPSGLNFASIADLMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFKTEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD
Ga0099829_116710591 | 3300009038 | Vadose Zone Soil | ITLAPQATFSVDTSGFMFPSGLNFASAGDLMVGQEVRLHPTGPPTGTPPNLMVAVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLAATQFETEEDQSLSGLGSLKSGDVVSVRGPLFKTMTMPSMAAEKVVKRSGSSD*
Ga0099792_104948251 | 3300009143 | Vadose Zone Soil | DEEFFGSGGDMGSFSMGAPVTITLAPQATFSVDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGTVTAVNTSNNPQTFTLGSLPSFFTNVGITSIQVDVLATTQFETEEDQTFSGLSSFKSGDMVSVRGPLFKTMTMPSMAAEKVVKRSGSSD*
Ga0134082_100551861 | 3300010303 | Grasslands Soil | SGFMLPSGLNFASTTDLMVGQEVRLHPTGVPTGTPPNLMITVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0134067_101493952 | 3300010321 | Grasslands Soil | ATPTAFSVDTGGFMLPSGLSFMSPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSDTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMGAEKVVNRSMSSGTDD*
Ga0134064_101665032 | 3300010325 | Grasslands Soil | MSPADLMAGQKVRLHPTGLPSRMPPNVPVTVAQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSDTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMVAEKVVNRSMSSGTDD*
Ga0134080_103449001 | 3300010333 | Grasslands Soil | LHPTGLPSGMPPNVTVTVDQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSNTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMVAEKVVNRSMSSGTDD*
Ga0134063_103799551 | 3300010335 | Grasslands Soil | VTSVNTVNTVNMGASSFQIVLFDEESFGGGDEMGSFSMGAPLTINLAPQATFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVSVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0150983_133758781 | 3300011120 | Forest Soil | TPPNLMVTVDQVQLEPSFVTGTITAFNTSSNPQTFTLGGLPSYFTNAGIMSIQVDVLATTQFETEEDQTFSGLGSLKSGDVVSVRGPLFKTMTMPTIAAEKVVKRSTSSGTSD*
Ga0150983_143283331 | 3300011120 | Forest Soil | FYQPPNQMSFAGTISSVSSGGSSFQLVLFDDESFGGDEMSSFSISAPLTVNLAPMAAFSIDTGSFTFPPGLTFASPADLSAGQEVRVHPTGAPIGTPPNLMVTVDQIELEPSYVTGTVTAVSTGSSPQTFTLGSLSSFFTNAGIASIRVDVLSTTMFETEEDQTFAGLSSLNPGDMVSVRGPLFKTMTMPTMVAEKVVKHSTTSGDD*
Ga0137392_114024901 | 3300011269 | Vadose Zone Soil | FMFPSGLNFASAGDLMVGQEVRLHPTGPPTGTPPNLMVAVDQVQLEPSFVTGTITAFNTSSNPQTFTLGTLPSYFTNAGIMSIQVDVLAATQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVKRSGTGD*
Ga0137393_100915235 | 3300011271 | Vadose Zone Soil | FMFPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVAVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLAATQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPSMAAEKVVKRSGSSD*
Ga0137389_102081021 | 3300012096 | Vadose Zone Soil | GDMGSFSMGAPVTVTPTMMATFSIDTSGFMFPSGLNFASAGDLMVGQEVRLHPTGPPTGSPPNLMVAVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLAATQFETEEDQSLSGLGSLKSGDVVSVRGPLFKTMTMPSMAAEKVVKRSGSSD*
Ga0137388_102941171 | 3300012189 | Vadose Zone Soil | PTGTPPNLMVAVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLAATQFETEEDQSLSGLGSLKSGDVVSVRGPLFKTMTMPSMAAEKVVKRSGSSD*
Ga0137364_114030901 | 3300012198 | Vadose Zone Soil | EMGSFSMGVPLTINLAPQATFSIDTRSFMLPSGLNFTSIADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTVVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVRRSMSSGTSD*
Ga0137383_106852271 | 3300012199 | Vadose Zone Soil | PSFTIATDNNTQFDFGTSCSTADFSCLKIGQIVEVEAKMRPDGSLLAFEVKLFQPPNQMSFGGTITSVSSGGSSFQIVLFDEEWFAGSEMGSFSMGAPITINLATPTAFSIDSGGFMLPSGLSFATPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSDITGTITAINTASNPQTFTLGMLPAFFQNAKIMSIQVDVLSNTQFETEEDRMVSGLSSFKTGDTVSVRGLLFNTMTTPTMVAEKVV
Ga0137382_113167801 | 3300012200 | Vadose Zone Soil | APTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0137399_105001541 | 3300012203 | Vadose Zone Soil | NEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD*
Ga0137399_111037741 | 3300012203 | Vadose Zone Soil | GDMGSFSMGAPVTVTPTMMATFSIDTSGFMFPSDLKFASAADLMVGQEVRLHPTGPPTGTPPNLIVTVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLATTQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVNRSMSSGTSD*
Ga0137399_117106691 | 3300012203 | Vadose Zone Soil | NEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPVTITLAPQATFSVDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMATVDQVQLEPSFVTGTITGFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDGLATTQFETEEDQTLSGLGSLKSGDVVS
Ga0137377_101928812 | 3300012211 | Vadose Zone Soil | DAKMKPDGSLLAFEVKFFQPPNQMSFAGSVTSVSTSSFQIVLFDEEFFGGGDEMGSFSMGVPLTINLAPQATFSIDTSSFMLPSGLNFASIADLMVGQEVRLHPTAAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFKTEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVLKRSMSSGTSD*
Ga0137370_101227561 | 3300012285 | Vadose Zone Soil | MSFSGTITSVGSGGGSFQIVLFNEEDFSSGEMGNFSMGASVTINLATGATFSIDTGGFMLPSGLNFASANDRMVRLHPTGPPTGTPPNLMITGTVDQVQLEPFHVTATITAINTGGNPQTFTLGTLPPFFTDAGIISIQVDVLSNTQFETEEDQMVSGLSSFQPETRFRLADYCLTP*
Ga0137387_102294092 | 3300012349 | Vadose Zone Soil | SIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVRRSTSSGTSD*
Ga0137387_109754882 | 3300012349 | Vadose Zone Soil | ADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDVVSVRGPLFNTMTMPTMAAEKVVNRSMSSGTSD*
Ga0137371_105325531 | 3300012356 | Vadose Zone Soil | AAPTAFSVDTGGFMLPSGLSFMSPADLMAGQKVRLHPTGLPSGMPPNVTLTVDQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSNTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMVAEKVVNRSMSSGTDD*
Ga0137371_114336891 | 3300012356 | Vadose Zone Soil | PTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0137385_102316371 | 3300012359 | Vadose Zone Soil | TAFSIDSGGFMLPSGLSFATPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSDITGTITAINTASNPQTFTLGMLPAFFQNAKIMSIQVDVLSNTQFETEEDRMVSGLSSFKTGDTVSVRGLLFNTMTTPTMVAEKVVNRSMSFGTSD*
Ga0137361_111647401 | 3300012362 | Vadose Zone Soil | ADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSSVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKAGDMVSIRGPLFNTMTMATMAAEKVVKRSMSSGTSD*
Ga0137390_110524853 | 3300012363 | Vadose Zone Soil | TGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGTLPSFLTNAGITSIQVDVLTSTQFETEEDQTFSGLSSLKSGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0134035_10394371 | 3300012391 | Grasslands Soil | MIRAQATFSIDTSGFMFPSGLNFASTADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSLKPGDVVSVRGPLFNTMTMPTMAAEKVVKRSMSSGTSD*
Ga0137395_100060905 | 3300012917 | Vadose Zone Soil | MFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRLTGTVTVVNTGGNPQTLTLGNLPSLFTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDIVSVRGPLFNTMTMPTMVAEKVVKRSMSSGTSD*
Ga0137396_106663951 | 3300012918 | Vadose Zone Soil | EVRLHPTGAPTGTPPNLMATVDQVQLEPSFVTGTVTAVNTSNNPQTFTLGSLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKSGDMVSVGGPFFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0137396_108953921 | 3300012918 | Vadose Zone Soil | GDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD*
Ga0137359_109149341 | 3300012923 | Vadose Zone Soil | SFSMGAPLTINLAPQATFSIDTSGFMLPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEASFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0137359_111692951 | 3300012923 | Vadose Zone Soil | SMGVPLTINLAPQATFSIDTRSFMLPSGLNFASIADLMVGQEVRLHPTGAPTGMPPNLMVTVDQVQLEPSFLTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0137404_107420131 | 3300012929 | Vadose Zone Soil | ADLMVGQEVRLHPTGPPTGTPPNLMVTVDQAQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD*
Ga0137404_112374611 | 3300012929 | Vadose Zone Soil | TRSFMLPSGLNFASIADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFLTGSVTAVNTSSNPQSFTLSSLPSFFTNAGITSIQVDVLATTQFATEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0137407_119221381 | 3300012930 | Vadose Zone Soil | MGSFSMGAPLTINLAPQATFSIDTSGFMLPSGLNFASTADLMVGQEVRLHPTGAPTGAPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD*
Ga0134078_101756121 | 3300014157 | Grasslands Soil | NFSCLQTGQIVKVDAKMKPDGSLLAFEVKFFQPPNQMSFAGTVTSVNTVNTVNMGASSFQIVLFDEESFGGGDEMGSFSMGAPLTINLAPQATFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVSVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFNTMTMPTMAAEKVVKRSTSSGTSD*
Ga0134079_104198361 | 3300014166 | Grasslands Soil | LTINLAPQATFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGAPTGTPPNLMVSVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFFTNAGITSIQVDVLATTQLETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD*
Ga0137420_10301011 | 3300015054 | Vadose Zone Soil | FAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD*
Ga0137420_12510702 | 3300015054 | Vadose Zone Soil | TSSFMFPSGLNFASIADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSSVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKAGDMVSVRGPLFNTMTMATMAAEKVVKRSMSSGTSD*
Ga0137420_13815702 | 3300015054 | Vadose Zone Soil | FDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD*
Ga0137420_14165501 | 3300015054 | Vadose Zone Soil | RDRYSHDDGDFLSIDTSGFMFPSDLKFASAADLMVGQEVRLHPTGPPTGTPPNLIVTVDQVQLEPSFVTGTITGFNTSSNPQTLTLGSLPSYFTNAGIMSIQVDVLATTQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVKRSMSPGTSD*
Ga0134072_104112211 | 3300015357 | Grasslands Soil | VGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQTFSGLSSLKPGDVVSVRGPLFNTMTMPTMAAEKVVKRSMSSGTSD*
Ga0182036_106552502 | 3300016270 | Soil | NFASATDLLVGQKVRLHPTGAPTGTPPNLMISVDQVQLEPSYVTATITGINTGGNPQTFALGTLPSLFTNAGIMSIQVDVLSNTQFETEEDQMVSGLGSFKTGDIVSVRGLLFNTMTTPTMVAEKVVSRSTSSGSED
Ga0182041_104047362 | 3300016294 | Soil | DTGGFMLPSGLNFASAADLMVGQKVRLHPTGAPTGTPPNLMITVDQVQLEPSYVTATITAINTGGNPQTFTLGTLPPWFANAGITSLQVDVLSTTQFETEEDQMVSGLSAFKTGDTVSVRGLLFNTMTTPTMVAEKIASRSASSGNED
Ga0182035_103128861 | 3300016341 | Soil | IDTGGFMLPSGLNFASATDLMVGQKVRLHPTGAPTGAAPNLMITVDQVQLEPSYVTATITAVNTGGNPQTFTLGTLPSFFTNVGIMSIQVDLLPTTQLETEEDQMVSGLSSLKTGDTVSVRGLLFNTMTTPTMVAEKVVSRSAPSGSED
Ga0182034_106750421 | 3300016371 | Soil | QKVRLHPTGLPTGTPPNLMITVDQVQLEPSSVTATIAAINSGGNPPTLTLGTLPSFFTKAGITSIQVDVLSTTQFETEEDQMVSGLSAFHTGDTVSVRGLLFNTMTTPTMVAEKVVNRSMSSGSED
Ga0066669_107590941 | 3300018482 | Grasslands Soil | PPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD
Ga0179592_104052881 | 3300020199 | Vadose Zone Soil | AADLMVGQEVRLHPTGPPTATPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD
Ga0179596_104137581 | 3300021086 | Vadose Zone Soil | ISTDSNSHFDFGTSCSAANFSCLQTGQIVKVDAKMKPDGSLLAFEVKFFQPPNQMSFAGSVTSVNTGASSFQIVLFDEESFGSGDGMGSFSMGVPLTINLAPQATFSIDTRSFMLPSGLNFASIADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFLTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDMVS
Ga0179584_13145101 | 3300021151 | Vadose Zone Soil | GSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQAQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD
Ga0210400_107634031 | 3300021170 | Soil | FFGGGGDMGSFLMGAPVTITLAPMATFSVDTSGFMFPSGLNFASAADLMVGQEVRFHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAFNTSTNPQTFTLGSLPSFFTNVGIMLIQVDVLATTQFETEEDQTLSGLGSFKSGDVVSVRGPLFKTMTMPTMAAEKVVKRSGTGD
Ga0179585_10507921 | 3300021307 | Vadose Zone Soil | ITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPVTITLAPQATFSVDTSGFMFPSGLNFASAADLMIGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGTVIAVNTSNNPQTFTLGSLPSFFTNAGITSIRVDVLATTQFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMSPMAAEKVVKPPCPPAPATRSKSSDVA
Ga0137417_12741041 | 3300024330 | Vadose Zone Soil | QTMHTFTIQSGMNGPSFTIATDTNTQFDFGTSCSGENFSCLQKGQTVKVDAKMKPDGSLLAFEVKFFQPPNEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD
Ga0137417_141398813300024330Vadose Zone SoilDFGTSCSGENFSCLQKGQTVKVDAKMKPDGSLLAFEVKFFQPPNEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD
Ga0209238_112796713300026301Grasslands SoilSIDASGFMLPSGLNFASTTDLMVGQEVRLHPTGVPTGTPPNLMITVDQVQLEPSFVTGTVTAVNIGSNPQTFTLGSLPSFFTNAGITSIQVDVFATTQFETEEDQTLSGLGSFKPGDMVSVRGPLFKTMTTPAMAAERVVKRSMSSGSGH
Ga0209687_114257313300026322SoilLHPTGAPTGTPPNLMVTVDQVQLEPSFLTGTVTAVNTSSNPQTFTLGSLPPFFTNAGITSLQVDVLATTRFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTGD
Ga0209804_123046513300026335SoilTTSGFQIVLFDEESFGEMGSFSMGAPLTVNLASQATFSIDTSGFMLPSGLNFASTADLMVGQELRLHPTGAPTGTPPNLMVTVDQVQLEPSFLTGTVTAVNTSSNPQTFTLGSLPSFFTNAGITSLQVDVLATTRFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTGD
Ga0209159_113086913300026343SoilSGLNFASTADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSRVTGTVTAVNTGSNPQTFTLGNLPSFFTNAGITSIQVDVLATTQFETEEDQTFSGLSSLKPGDVVSVRGPLFNTMTMPTMAAEKVVKRSMSSGTSD
Ga0257172_109634713300026482SoilGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMSTMAAEKVVKRSMSSGTSD
Ga0209808_117902913300026523SoilDEESFGGGDEMGSFSMGAPLTINLASQATFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTVAPTGTPPNLMVSVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFNTMTMPTMAAEKVVKRSTSSGTSD
Ga0209161_1048866913300026548SoilEMGSFSMGSPLTINLATPTAFSVDTGGFMLPSGLSFMSPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSPITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSNTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMATPTMGAEKVVNRSMSSGTDD
Ga0209648_1038650313300026551Grasslands SoilSFSMGAPVTITLAPQATFSVDTSGFMFPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTVTAVNTSNSPQTFTLGSLPSFFTNAGIMSIQVDVLATTQFETEEDQTLSGLSSFKSGDIVSVRGPLFKTMTLPTMAAEKVVKRSGTSD
Ga0209577_1085450013300026552SoilADLMVGQEVRLHPTGAPTGTPPNLMVTVDQVQLEPSFVTGSVTAVNTSSNPQSFTLGSLPSFLTNAGITSIQVDVLATTQFETEEDQSFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSTSSGTSD
Ga0179587_1028219213300026557Vadose Zone SoilDEEFFGSGGDMGSFSMGAPVTVTPTMMATFSIDTSGFMFPSDLKFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAFNTSSNPQTFTLGTLPSYFTNAGIMSIQVDVLAATQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPSMAAEKVVKRSGSSD
Ga0179587_1094647413300026557Vadose Zone SoilPTGPPTATPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD
Ga0207726_105909213300027045Tropical Forest SoilIPLTINLATQTTFSIDTDGSTLPSGLNFAAPTDLMVGQEVRLHPNGLPSGTLSNPVITVDQVQLGSARVTGTISAVNAGGNPQTFTLANLPLSFTKAGISSIQVDVLSTTQLETEEDQMVSGLSALNMGDTVSVRGLLFNTMTTPTMVAEKVLQH
Ga0209004_102908913300027376Forest SoilMGAPLTVTRAADLMAGQKVRLHPTGAPSGTPPNLMITVDQVQLEPSYVTGTITAVNTSGNPQTLTLGMLPSFFTNAGISSIQTDVLSNTQLETEEDQMMAGLSSFKTGDMVSVRGLLFNTMTTPTMVAEKIVSRSTSSGGSED
Ga0209076_114338813300027643Vadose Zone SoilTFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTATPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTQGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD
Ga0209117_111773713300027645Forest SoilASTADLMLGQEVRLHPTGVPTGTLPSLVVTVDQVQLEPSFITGTVTAVNTGSNPQTFILGSLPSFFTSAGITSIQVDVLAATQFEKEEDQTFSGLSSLKPGDMVSVRGPLFKTATMPTMAAEKVVKRPVSSGTGD
Ga0207862_106001423300027703Tropical Forest SoilQEVRLHPNGLPSGTLSNPVITVDQVQLGSARVTGTISAVNAGGNPQTFTLANLPLSFTKAGISSIQVDVLSTTQLETEEDQMVSGLSALNMGDTVSVRGLLFNTMTTPTMVAEKVLQH
Ga0209488_1043809013300027903Vadose Zone SoilVDAKMKPDGSLLAFEVKFFQPPNEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPVTITLAPQATFSVDTSGFMFPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTVTAVNTSNNPQTFTLGSLPSFFTNVGITSIQVDVLATTQFETEEDQTFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVVKRSMSSGTSD
Ga0137415_1093451813300028536Vadose Zone SoilFSCLQKGQTVKVDAKMKPDGSLLAFEVKFFQPPNEMSFAGAITSVDTGSFKIVLFDEEFFGGGGDMGSFSMGAPLTINLAPQATFSIDTSGFMLLSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAINTSSNPQSFTLGSLPSFFTNAGITSIQVDVLSTTQFETEEDQMFSGLSSFKPGDMVSVRGPLFKTMTMPTMAAEKVV
Ga0307482_123943213300030730Hardwood Forest SoilIVLFDEEAFGSSEMGSFSMGIPLTINLATSTTFSIDMGGFMLPSGLTFAAAADLMVGQKVRLHPSGAPAGMPPNVMITVDQVQLEPSDVTAIITAINTGSNPQTFTLGTLPMFFQNAGIMSIQVDVLSNTQFETEEDQMMSGLSSFKTGDIVSVRGLLFNTMTTPTMVAEKIVGRSMSSGTDD
Ga0307474_1097631523300031718Hardwood Forest SoilVSTSSFQIVLFDEESFGGGDAMGSLSTGASLTINLASQTTFSIDTSGFMFPSGLNFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLATTQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVNRSMSSGTSD
Ga0307477_1063670213300031753Hardwood Forest SoilSGAGTFQIVLFDEEWFGSDEMGSFSMGAPLTVTLASQATFSIDSDGFMIPSGLNFASAADLMAGQKVRLHPTGAPSGTPPNLMITVDQVQLEPSYVTGTITALNISGNPQTLTLGMLPSFFTNAGITSIQVDVLSNTQFETEEDQMMSGLNSFKTGDTVSVRGLLFNTMTTPTMVGEKVVSRSTSSGSTD
Ga0307475_1019585013300031754Hardwood Forest SoilFEVKFFQPPNQMSFAGTVTSVNSGAGTFQIVLFDEEWFGSDEMGSFSMGAPLTVTLASQATFSIDSDGFMIPSGLNFASAADLMAGQKVRLHPTGAPSGTPPNLMITVDQVQLEPSYVTGTITALNISGNPQTLTLGMLPSFFTNAGITSIQVDVLSNTQFETEEDQMMSGLNSFKTGDTVSVRGLLFNTMTTPTMVGEKVVSRSTSSGSTD
Ga0310917_1048611513300031833SoilSDEMGSFSMGAPVTINLATGATFSVDTGVFTLPSGLNFASSTDLMVGQKVRLHPMGAPTGTPPNLMITVDQVQLEPSYVTATITAINTGGNPQTFTLGTLPSFFTNAGIMSIQVDVLSKTPFETEEDQMVSGLSSFKTGDTVSVRGPLFNTMTTPTIVAEKVVSRSTSSGSED
Ga0306925_1072517813300031890SoilFGSDDMGSFSMGAPVTINLATGATFSVDTGGFTLPSGLNFASSTDLMVGQKVRLHPMGAPTGTPPNLMITVDQVQLEPSYVTATITAINTGGNPQTFTLGTLPSFFTNAGIMSIQVDVLSKTPFETEEDQMVSGLSSFKTGDTVSVRGPLFNTMTTPTIVAEKVVSRSTSSGSED
Ga0306923_1100929413300031910SoilVGQKVRLHPMGAPTGTPPNLMITVDQVQLEPSYVTATITAINTGGNPQTFTLGTLPSFFTNAGIMSIQVDVLSKTPFETEEDQMVSGLSSFKTGDTVSVRGPLFNTMTTPTIVAEKVVSRSTSSGSED
Ga0306923_1222956713300031910SoilDLLVGQKVRLHPTGAPTGTPPNLMISVDQVQLEPSYVTATITGINTGGNPQTFALGTLPSLFTNAGIMSIQVDVLSNTQFETEEDQMVSGLGSFKTGDIVSVRGLLFNTMTTPTMVAEKVVSRSTSSGSED
Ga0307479_1006487413300031962Hardwood Forest SoilGLSFMSPADLMAGQKVRLHPTGLPSGMPPNVTVTVDQVQLEPSDITGTITAINTGSNPETFTLGNLPAFFQNAKIMSIQVDVLSNTRFETEEDHMVSGLSSFKIGDTVSVRGLLFNTMTTPTMVAEKVVNRSMSSGTSD
Ga0307479_1074051913300031962Hardwood Forest SoilNFSCLQTGQTVRVEAKMQTDGSLLAFEVKFFQPPNQMSFAGTVTSVNSGAGTFQIVLFDEEWFGSDEMGSFSMGAPLTVTLASQATFSIDSDGFMIPSGLNFASAADLMAGQKVRLHPTGAPSGTPPNLMITVDQVQLEPSYVTGTITALNISGNPQTLTLGMLPSFFTNAGITSIQVDVLSNTQFETEEDQMMSGLSSFKTGDTVSVRGLLFNTMTTPTMVGEKVVSRSTSSDSTD
Ga0310911_1013471113300032035SoilNLAAGATFSIDTGGFMLPSGLNFASATDLMVGQKVRLHPTGAPTGAAPNLMITVDQVQLEPSYVTATITAVNTGGNPQTFTLGTLPSFFTNVGIMSIQVDLLPTTQLETEEDQMVSGLSSLKTGDTVSVRGLLFNTMTTPTMVAEKVVSRSVPSGSED
Ga0307471_10164553013300032180Hardwood Forest SoilSGFMFPSDLKFASAADLMVGQEVRLHPTGPPTGTPPNLMVTVDQVQLEPSFVTGTITAFNTSSNPQTFTLGSLPSYFTNAGIMSIQVDVLATTQFETEEDQTLSGLGSLKSGDVVSVRGPLFKTMTMPTMAAEKVVNRSMSSGTSD
Ga0307471_10239888413300032180Hardwood Forest SoilKMQTDGSLLAFEVKFFQPPNQMSFAGTVTSVNSGAGTFQIVLFDEEWFGSDGMGSFSMGSPLTVTLASRATFSIDSAGFVIPSGLNFASAADLMAGQKVRLHPTGAPSGTPPNLMITVDQVQLEPSYVTGTITASNISGNPQTLTLGMLPSSFTNAGITSIQVDVLSNTQFETEEDQMMSGLNSFKTGDTVSVRGLLFNTMTTPTMVGEKVVSRSTSSGST
Ga0306920_10290815313300032261SoilTGATFSVDTGVFTLPSGLNFASSTDLMVGQKVRLHPTGAPTGTPPNLMITVDQVQLEPSYVTASITAINTGGNPQTFTLGTLPSFFTNAGIMSIQVDVLSNTQFETEEDQMVSGLSSFKTGDTVSVRGPLFNTRTTPTMVAEKVVSRSASSGSED




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice