NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097110



Overview

Basic Information
Family ID: F097110
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 104
Average Sequence Length: 109 residues
Representative Sequence: VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISAVLDTLEYAK
Number of Associated Samples: 85
Number of Associated Scaffolds: 104

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 85.71 %
% of genes near scaffold ends (potentially truncated): 5.77 %
% of genes from short scaffolds (< 2000 bps): 6.73 %
Associated GOLD sequencing projects: 81
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[Figure: HMM logo, rendered with Skylign]

Most Common Taxonomy
Group: Unclassified (93.269 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (24.038 % of family members)
Environment Ontology (ENVO): Unclassified (50.962 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (62.500 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 7.86%, β-sheet: 30.71%, Coil/Unstructured: 61.43%

Predicted 3D Structure

pTM-score: 0.41
Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
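
The rule in the note above can be written as a one-line check. This is only an illustrative sketch: the function name and constant are hypothetical and not part of NMPFamsDB; the 0.7 cutoff and the 0.41 score are the values quoted on this page.

```python
# Hypothetical helper; only the 0.7 cutoff and the 0.41 score come from this page.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score is strictly below the cutoff."""
    return ptm_score < cutoff


# Family F097110 reports a pTM-score of 0.41, so it is flagged as low confidence.
f097110_is_low = is_low_confidence(0.41)
print(f097110_is_low)
```

A strict comparison is used because the note states the condition as pTM < 0.7, so a score of exactly 0.7 would not be flagged.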



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF05114 | DUF692 | 53.85
PF00248 | Aldo_ket_red | 7.69
PF00005 | ABC_tran | 0.96
PF03960 | ArsC | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG3220 | Uncharacterized conserved protein, UPF0276 family | Function unknown [S] | 53.85
COG1393 | Arsenate reductase or related protein, glutaredoxin family | Inorganic ion transport and metabolism [P] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 93.27 %
All Organisms | root | All Organisms | 6.73 %
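
As a quick consistency check, the distribution percentages can be converted back to member counts. The sketch below assumes the percentages are taken over the 104 family sequences reported in the Overview; the helper names are illustrative and not part of NMPFamsDB.

```python
# Assumption: percentages are computed over the 104 sequences in the family.
FAMILY_SIZE = 104


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to a whole number of family members."""
    return round(percent / 100 * total)


def percent_of_family(count: int, total: int = FAMILY_SIZE) -> float:
    """Express a member count as a percentage, rounded to two decimals."""
    return round(100 * count / total, 2)


unclassified = members_from_percent(93.27)  # "Unclassified" row
classified = members_from_percent(6.73)     # "All Organisms" row

print(unclassified, classified, unclassified + classified)
print(percent_of_family(unclassified), percent_of_family(classified))
```

Under that assumption the two rows account for every member of the family, and converting the counts back reproduces the reported percentages.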


Associated Scaffolds


Scaffold | Taxonomy | Length
3300006794|Ga0066658_10897647 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 509
3300006804|Ga0079221_11784729 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 503
3300009012|Ga0066710_103325722 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 613
3300011269|Ga0137392_10556295 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 953
3300012202|Ga0137363_11337432 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 605
3300026297|Ga0209237_1249381 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 545
3300027846|Ga0209180_10807242 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 502
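
Each scaffold identifier above has two parts joined by a `|` character. The numeric prefix appears to be the Taxon OID used in the Associated Samples table (for example, 3300006794 is listed there), and the remainder is the scaffold name. A minimal sketch for splitting such identifiers follows; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names, and the interpretation of the prefix is an assumption based on this page alone.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str       # assumed to match "Taxon OID" in the samples table
    scaffold_name: str   # e.g. "Ga0066658_10897647"


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two components."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300006794|Ga0066658_10897647")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the Taxon OID as a string avoids any accidental numeric formatting when it is later matched against sample records.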




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 24.04%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.12%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 14.42%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 12.50%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 8.65%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 6.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.88%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.88%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.92%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.92%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019255 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome) | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_102181071 | 3300002558 | Grasslands Soil | VKALALSVLALLSTDVVVPAGTRIPIRFVQRVTSGRDTVGTRVLVQTMGALVQDSCVLVPPYVRAKGRIVVSKGGGRFGRHGRLGLRFDSLE
JGI25384J37096_100239681 | 3300002561 | Grasslands Soil | MKAIALVLLPLLLTDVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVRPGRWAAVSAVLDTLE
JGI25382J37095_100396933 | 3300002562 | Grasslands Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWV
JGI25388J43891_10428512 | 3300002909 | Grasslands Soil | VKAAVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLVFDSLEVRPGRWVPMSGVLDTLEYAKPNALSDSGLVSSGKTSVVGVG
JGI25390J43892_101618582 | 3300002911 | Grasslands Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRS
Ga0066674_105196061 | 3300005166 | Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGVLDTLEYAKPGALTDSGLVSSG
Ga0066677_107274102 | 3300005171 | Soil | VKTIVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALVQDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWLAMSGVLDT
Ga0066677_108439461 | 3300005171 | Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTTGALVRDSCVLVPPYVRAKGRIVVSKGGGRFGRHGRLGLRFDSLEVRAGHWVAISGVLDTLEY
Ga0066680_100963261 | 3300005174 | Soil | MKVIAPTLLALWLGSTVIPAGTRIPIRFVQRITSGKDTVGTPVLVQTMGALVRDSCVVVPPYTRAKGRIVVSKGGGRFGRHGRLGLRFDSLEVR
Ga0066680_105340652 | 3300005174 | Soil | VKTLFAVLLAVLAGDTVIPVGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGRWVPIAGLLDSLEYTKPAF
Ga0066679_110059021 | 3300005176 | Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTTGALVRDSCVLVPPYVRAKGRIVVSKGGGRFGRHGRLGLRFDSLEVRAGHWVAISGVL
Ga0066690_101873613 | 3300005177 | Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISGILDTLEY
Ga0070703_104126171 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | VKILFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVIVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEV
Ga0070705_1009811261 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MRAIALALLPLLLSDVVLPTGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVMRDSCIIVPPYLRAKGHVVVSKGGGRFGRHGRLGLR
Ga0070694_1000151786 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGQWVPIAGLLDTLEYTKPAFLTDSGLV
Ga0070708_1015885712 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VKAIALTLLTLLSADVVVPAGTHIPIRFVQRITSGRDTVGTEVLVQTMGALVQDSCVLVPPYVRAKGRIVISKGGGRFGRHGRLGFTFDSLEVRPSRWV
Ga0066686_101491741 | 3300005446 | Soil | MKVIAPTLLALWLGSTVIPAGTRIPIRFVQRITSGKDTVGTPVLVQTMGALVRDSCVVVPPYTRAKGRIVVSKGGGRFGRHGRL
Ga0070697_1007177031 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MRAIALALLALLSSDLVIPAGTRIPIRFVQRVTSGRDTVGTEVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRARHWV
Ga0070697_1010557801 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVERVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGHVVVSKGGGRFGRHGKLGLRFDSLEVGRGRWVPIAGLLDTLEYTKPAFLTD
Ga0070696_1011349911 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVERVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGQWVPIAGLLDTLEYTKPAF
Ga0066661_100727461 | 3300005554 | Soil | VKAAVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLVFDSLEVRPGRWVPMSGVLDTLEYAKPNALSDSGLVSSGR
Ga0066700_103673711 | 3300005559 | Soil | VKTIVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALVQDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWLAMSGVLDTLEYAK
Ga0066700_106970492 | 3300005559 | Soil | MRALAPTLLALLVGGSVIPAGTRIPIRFVQHITSGKDTVGTPVLVQTMGALVVQESCVVVPPYTRAKGRIVVSKGGGRFGRHGRLGLRFDSLE
Ga0066670_105337861 | 3300005560 | Soil | VKAIALLVLLLQVEVVVPAGTRIPIRFVQRITSGRDTVGSEVLVQTVGAVVQDSCTIIPPYLRAKGRIVLSKGGGRFGRHGQLGLEFDSLEIRRGRWVPISG
Ga0066693_102505902 | 3300005566 | Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRVVVSQGGGRFGRHGRLGLRFDSLEVRSGHWVPIAGVLDTLEYAKPGAVTDSGLVSSGRTSVGGL
Ga0066694_101174823 | 3300005574 | Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDS
Ga0079222_100138734 | 3300006755 | Agricultural Soil | VRATLALLLVALLGRTVIPAGTHIPIRFVQRVTSGKDSVGTAVLVQTMGALVSDSCVVVPPYLRAKGHVVVSKGGGRFGRHGQLGLRFDSLEVRPGRWVEIAGLLDTLE
Ga0079222_105008152 | 3300006755 | Agricultural Soil | VKILFAALLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGRWVRIAGLLDSLEYTKPAFLTDSG
Ga0066653_104580492 | 3300006791 | Soil | MKAIALALPLLLTDVVVPVGTHIPIRFVQRVTSGRDTVGTPVLVQTMGAVVRDSCVILPPYLRAKGHLVVSKGGGRFGRHGRLGLSFDSLEI
Ga0066658_108976472 | 3300006794 | Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGVLDTLEYAKPGALTESGLVSSGKTSVGGVGRKLV
Ga0066659_112002641 | 3300006797 | Soil | VRATVTLLLALVGGDVLIAAGTRIPIRFLESITSGRDTVGTPVLVQTMGALALDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGHWLAMSGVLDTL
Ga0066659_118718251 | 3300006797 | Soil | VRAVVALLLAAGLGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGALVSDSCVVVPPYLRAMGHVVVSKGGGRFGRHGKLGLRF
Ga0079221_117847292 | 3300006804 | Agricultural Soil | MKAIALAFLALLSSDLVIPAGTRIPIRFVQRITSGRDTVGTQVLAQTMGALVQDSCVLVPPYVRVKGRIVFSKGGGRFGRHGRLGLAFDSLEVRPSRWVAISGVLDTLEYAKPGAVTDSGLVSSGKT
Ga0075425_1008654052 | 3300006854 | Populus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGQWVRIAGLLDTLEYTKPAFLTDSGLVSSGK
Ga0075424_1018489322 | 3300006904 | Populus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGQWVRIA
Ga0075436_1012674682 | 3300006914 | Populus Rhizosphere | VKAIALTLLTLLSADVVVPAGTRIPIRFVQRITSGRDTVGTEVLVQTMGALVQDSCVLVPAYVRAKGRIVISKGGGRFGRHGRLGFTFDSLEVRPSRWVAISGVLDTLEYAKPGAVTDSG
Ga0066710_1003303961 | 3300009012 | Grasslands Soil | MKAIALPLLPLLLADVVLPAGTHIPIRFVQRVTSGRDTVGAPVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVRPGRWAAVSAVLDTL
Ga0066710_1033257222 | 3300009012 | Grasslands Soil | MKAIALVLLPLLLADVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTIGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVRPGHWAAVSAVLDTLEYAPRGGLADSGLVSSGKTSIVGVGKKLV
Ga0099828_108680872 | 3300009089 | Vadose Zone Soil | VKAIALALLTLLSADVVVPAGTRIPIRFVQRITSGRDTVGAGVLVQTMGALVQDSCVLVPPYVRAKGRIVISKGGGRFGRHGRLGLAFDSLEVRPAHWVAISGVLDTLEYAKPGAVTD
Ga0099827_104894111 | 3300009090 | Vadose Zone Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISAVLDTLEYAKPGALTDSGLVSSGKTS
Ga0099827_116328421 | 3300009090 | Vadose Zone Soil | MKAIALALLALLSSDLVIPAGTRIPIRFVQRVTSGRDTVGTEVLVQTMGALVQDSCVVVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFESLEVRAGQWVPISAVLDTLEYAK
Ga0066709_1000266049 | 3300009137 | Grasslands Soil | VKAIVALLLALVGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALVRDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWLAMSGVLDTLEYAKPGALSDSGLV
Ga0134082_100379233 | 3300010303 | Grasslands Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISGILDTLEYAKPGAL
Ga0134088_105581892 | 3300010304 | Grasslands Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISAVLDTLEYAK
Ga0134086_103968861 | 3300010323 | Grasslands Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGVLDTLE
Ga0134063_103673692 | 3300010335 | Grasslands Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRF
Ga0134063_104354341 | 3300010335 | Grasslands Soil | VKAIAIALLALLSSDSVIPAGTRIPIRFVQRVTSGRDTAGTRVLVQTMGALVQDSCILVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDS
Ga0134063_105151102 | 3300010335 | Grasslands Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRVVVSQGGGRFGRHGRLGLRFDSLEVRSGHWVPIAGVLDTLEYAKPGA
Ga0134066_103829432 | 3300010364 | Grasslands Soil | VKAAVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLVFDSLEVRPGRWVPMSGVLDTLEYAK
Ga0137392_105562951 | 3300011269 | Vadose Zone Soil | MRAIALVLLPLMLSDVVLPTSTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLHAKGHVVVSKGGGRFGRHGRLGLHFDSLEIRPGRWAAISAVLDTLEYAPRGGLVDSGLVSSGKTSIVGWAGSSCRPASRR
Ga0137389_105561611 | 3300012096 | Vadose Zone Soil | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPGRWVAISAVLDTLEYAPRGGL
Ga0137383_107509392 | 3300012199 | Vadose Zone Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLE
Ga0137383_109214582 | 3300012199 | Vadose Zone Soil | VKAIGLALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALLQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLE
Ga0137363_113374322 | 3300012202 | Vadose Zone Soil | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTTVLVQTMGAVVRDSCVIVPPYVRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPGRWAAISAVLDTLEYAPRGGLADSGLV*
Ga0137380_112773441 | 3300012206 | Vadose Zone Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVRVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHW
Ga0137376_100053165 | 3300012208 | Vadose Zone Soil | MKAISLVLVPLLLTDVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTIGAVVRESCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVRPGRWVAVSAVLDTLE*
Ga0137376_100619575 | 3300012208 | Vadose Zone Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRVTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSG
Ga0137376_110958182 | 3300012208 | Vadose Zone Soil | VKTIVALLLALAGGDVVIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALVRDSCVVVPPYTRAKGRIVVSKGGGRFGRHGRLGLRFDSL
Ga0137372_100453501 | 3300012350 | Vadose Zone Soil | MKSIALVLLPLLLTDVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTMGAVVRDSCVIVRPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVSPGRWASVAAVLDTLEYAPRGGL
Ga0137360_103088403 | 3300012361 | Vadose Zone Soil | VKAIALALLTLLSADVVVPAGTRIPIRFVQRITSGRDTVGAGVLVQTMGALVQDSCVLVPPYVRAKGRIVISKGGGRFGRHGRLGLAFDSLEVRPAHWVAISGVLDTLEYAKPGAVTDSGLVS
Ga0137360_104452562 | 3300012361 | Vadose Zone Soil | VKAIAFALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRWGRRGRLGLRFDSLEVRAGHWVPISALLDTLEYAKPGALT
Ga0137395_110417061 | 3300012917 | Vadose Zone Soil | MKAIALALLALLSSDLVIPAGTRIPIRFVQRVTSGRDTVGTEVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFESLEVRAGQWVPISAVLDTLEY
Ga0137396_109406661 | 3300012918 | Vadose Zone Soil | MKAIALALLALLSSDLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALVQDSCVLVPPYVRAKGRIVISRGGGRFGRHGRLGLTFDSLEVRPSRWVAISGVLDT
Ga0137419_105228601 | 3300012925 | Vadose Zone Soil | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTTVLVQTMGAVVRDSCVIVPPYVRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPGRWAAISAVLDTLEYAPRGGLVDSGLVSSGKMSIA
Ga0134087_106452911 | 3300012977 | Grasslands Soil | VKAIALLVLLLQVEVVVPAGTRIPIRFVQRITSGRDTVGSEVLVQTVGAVVQDSCTIIPPYLRAKGRIVLSKGGGRFGRHGQLGLEFDSLEIRRGRW
Ga0134078_103546901 | 3300014157 | Grasslands Soil | MKAIALVLLPLLLTDVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTIGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEI
Ga0134079_104316561 | 3300014166 | Grasslands Soil | VKTAIALLLALAGGDVLIAAGTHIPIRFLEPIISGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWLAMSGVLDT
Ga0137420_12742222 | 3300015054 | Vadose Zone Soil | MRAIALVLLPLLLADVVLPAGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPGRWAAISAVLDTLEYAPR
Ga0134073_100541572 | 3300015356 | Grasslands Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGVLDTLEYAKPGALTDSGLVSSGK
Ga0134072_100817732 | 3300015357 | Grasslands Soil | VKAAVALFLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGL
Ga0134072_101386812 | 3300015357 | Grasslands Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMRALVQDSCVLVPPYVRSKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISGVLDTLEYAKPGALTDSGLVSSGKTSVGGVG
Ga0134072_101979132 | 3300015357 | Grasslands Soil | VKTIVALLLALAGGDVVIAAGTHIPIRFLEPITSGRDTIGTPVLVQTMGALVRDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWLAMSGVLDTLEYAKPG
Ga0134069_13496201 | 3300017654 | Grasslands Soil | MKAIALALPLLLTDVVVPVGTHIPIRFVQRVTSGRDTVGTPVLVQTMGAVVRDSCVILPPYLRAKGHVVVSKGGGRFGRHGRLGLSFDSLEIRPGRWAAVSAMLDTLEYAPRGGLA
Ga0066667_105783791 | 3300018433 | Grasslands Soil | VKAAVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLVFDSLEVRPGRWVPMSGVLDTLEYAKPNALSDSGLVSSG
Ga0066667_108859061 | 3300018433 | Grasslands Soil | VKPLALALLTLLSADVVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRVVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVPIAGVLDTLEYAKPGAVTDSGLVSSGRTSVGGLGK
Ga0066669_100622584 | 3300018482 | Grasslands Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGVLDTLEYAKPGALT
Ga0184643_13695021 | 3300019255 | Groundwater Sediment | VKALAVALIALLSSDLAIPAGTHIPIRFVQRITSGRDTVGTPVLVQTMGALVRDSCVVVLPYMRAKGRIVVSKRGGRFGRHGRLGLRFDSLEVRSGRWVA
Ga0179594_102750832 | 3300020170 | Vadose Zone Soil | MKAIALVLLPLLLTDVVLPAGTHIPIRFVQRVTSGRDTVGTPVLVQTIGAVMRESCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEVRPGRWVAVSAVLDTLEYAPRG
Ga0215015_104515539 | 3300021046 | Soil | VKIIPAAVLGLLLADVAIPAGTHLPIRFLQPITSGRDTVGTRVLVQTMGAWVQDTCIVLPPYLRAKGRIVVSKGGGGFGRHGKLGLRFDSLEVRPGQWAAIAGVLDTLEYAKAGLV
Ga0210382_105092122 | 3300021080 | Groundwater Sediment | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPG
Ga0137417_10288021 | 3300024330 | Vadose Zone Soil | VRAIALTLLTLLSADVVVPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVISKGGRRFGRHGQLGLTFDSLEV
Ga0207646_113002142 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVERVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGHVVVSKGGGRFGRHGKLGLRFDSLEVRRGQWVPIAGLLDTLEYTKPAFLTDSGLVSSGKTGVVGVGKKLVP
Ga0207700_104185422 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGRWVPIAGLLDSL
Ga0209237_12493811 | 3300026297 | Grasslands Soil | MRALAPTLLALLVGGSVIPAGTRIPIRFVQRITSGKDTVGTPVLVQTMGALVVQESCVVVPPYTRAKGRIVVSKGGGRFGRHGRLGLRFDSLEVRPGRWATISGVLDTLEYAKPGTLTDSGLVSSGKTSLVGVGK
Ga0209238_10832643 | 3300026301 | Grasslands Soil | VKAAVALLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLELRPGRWVAMSGVLD
Ga0209801_13207112 | 3300026326 | Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLIPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISGVLDTLEYAKPGALTDSGLVSSGKTS
Ga0209375_10544633 | 3300026329 | Soil | VKSLSLALLTLLSADLVIPAGTRIPIRFVQRITSGRDTVGTGVLVQTMGALVQDSCVLVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFDSLEVRSGHWVAISGV
Ga0209375_12189131 | 3300026329 | Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPIS
Ga0209803_11964461 | 3300026332 | Soil | VKPLALALLALLSSDPVIPAGTRIPIRFVQRVTSGRDTVGTRVLVQTMGALVQDSCVLVLPYVRAKGRIVVSKGGGRFGRHGRLGLRFDSL
Ga0209158_11735501 | 3300026333 | Soil | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFD
Ga0209808_10948553 | 3300026523 | Soil | VKAAIALLLALAGGDVLIAAGTHIPIRFLEPIISGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGRWIAMSGVLDTLEY
Ga0209160_12506541 | 3300026532 | Soil | VRIIAVALLALVPTDTVIPAGTHIPIRFVQRVTSGKDTVGTQVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGLLG
Ga0209058_13587171 | 3300026536 | Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLIPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGHWVPISGVLDTLEYAKP
Ga0209056_103993092 | 3300026538 | Soil | VKAIALALLALLSADLVIPAGTRIPIRFVQRITSGRDTVGTQVLVQTMGALMQDSCVLVPPYVRAKGRIVVSRGGGRWGRHGRLGLRFDSLEVRAGDWVPISGVLDTLKYAKPGALTDSGVVSSGKTSVGGVGRK
Ga0209178_13667321 | 3300027725 | Agricultural Soil | VKTLFAVLLAVLAGDTVIPAGTHIPIRFVQRVTSGKDTVGTAVLVQTMGAVVSDSCVVVPPYLRAMGRVVVSKGGGRFGRHGKLGLRFDSLEVRRGRWVRIA
Ga0209073_102349961 | 3300027765 | Agricultural Soil | MKATVAVLLALAGGDVLIAAGTHIPIRFLEPITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGHWVAMRGVLDTL
Ga0209074_100143403 | 3300027787 | Agricultural Soil | VRATLALLLVALLGRTVIPAGTHIPIRFVQRVTSGKDSVGTAVLVQTMGALVSDSCVVVPPYLRAKGHVVVSKGGGRFGRHGQLGLRFDSLEVRPGRWVEIAGLLDTLEYTKP
Ga0209074_102885412 | 3300027787 | Agricultural Soil | VRAALALLLAVVAGDTVIPAGTHIPIRFVERVTSGKDTVGTPVLVQTMGAVVNDSCVVVPPYLRAMGHVVVSKGGGRFGRHGKLGLRFDSLE
Ga0209180_108072421 | 3300027846 | Vadose Zone Soil | VKAIALALLTLLSADVVVPAGTRIPIRFVQRITSGRDTVGAGVLVQTMGALVQDSCVLVPPYVRAKGRIVISKGGGRFGRHGRLGLAFDSLEVRPAHWVAISGVLDTLEYAKPGAVTDSGLVSS
Ga0209590_109185352 | 3300027882 | Vadose Zone Soil | MKAIALALLALLSSDLVIPAGTRIPIRFVQRVTSGRDTVGTEVLVQTMGALVQDSCVVVPPYVRAKGRIVVSRGGGRFGRHGRLGLRFESLEVRAGQWVPISAVLDT
Ga0307504_104775981 | 3300028792 | Soil | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIR
Ga0307278_100786633 | 3300028878 | Soil | VKALAVALIALLSSDLAIPAGTHIPIRFVQRITSGRDTVGTPVLVQTMGALVRDSCVVVLPYMRAKGRIVVSKRGGRFGRHGRLGLRFDSLEVRSGRWVAMSGLLDTLEYAGPRALGDSGLVSSGKTSGVG
Ga0307479_115666982 | 3300031962 | Hardwood Forest Soil | VRAAVALLLALASGDVLMPAGTHIPIRFLERITSGRDTVGTPVLVQTMGALARDSCVVVPPYLRAKGRVVVSKGGGRFGRHGKLGLAFDSLEVRPGHWLAMSGVLDT
Ga0307472_1007485511 | 3300032205 | Hardwood Forest Soil | MRAIALVLLPLLLSDVVLPAGTHIPIRFVQRVTSGRDTVGTAVLVQTMGAVVRDSCVIVPPYLRAKGHVVVSKGGGRFGRHGRLGLRFDSLEIRPGRWAAISAV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice