NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F014884

Metagenome / Metatranscriptome Family F014884

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F014884
Family Type Metagenome / Metatranscriptome
Number of Sequences 259
Average Sequence Length 98 residues
Representative Sequence TAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA
Number of Associated Samples 198
Number of Associated Scaffolds 259

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 175
AlphaFold2 3D model prediction Yes
3D model pTM-score0.74

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.614 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(23.552 % of family members)
Environment Ontology (ENVO) Unclassified
(34.749 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(43.629 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 29.92%    β-sheet: 18.11%    Coil/Unstructured: 51.97%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.74
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
d.58.4.0: automated matchesd4dpoa14dpo0.68545
d.58.4.17: SOR-liked6m35a_6m350.66045
d.58.4.23: Marine metagenome family DABB3d2pgca12pgc0.66023
d.58.4.0: automated matchesd3qmqa13qmq0.65538
d.58.4.11: PA3566-liked1y0ha_1y0h0.63394


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 259 Family Scaffolds
PF001982-oxoacid_dh 32.43
PF02811PHP 19.69
PF00578AhpC-TSA 5.41
PF02817E3_binding 5.02
PF00106adh_short 2.70
PF13561adh_short_C2 1.93
PF03745DUF309 1.16
PF03544TonB_C 0.77
PF02082Rrf2 0.39
PF07994NAD_binding_5 0.39
PF02780Transketolase_C 0.39
PF07676PD40 0.39
PF00830Ribosomal_L28 0.39
PF08534Redoxin 0.39
PF00474SSF 0.39
PF00581Rhodanese 0.39
PF00676E1_dh 0.39
PF03118RNA_pol_A_CTD 0.39

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 259 Family Scaffolds
COG0508Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) componentEnergy production and conversion [C] 37.45
COG1547Predicted metal-dependent hydrolaseFunction unknown [S] 1.16
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 0.77
COG0202DNA-directed RNA polymerase, alpha subunit/40 kD subunitTranscription [K] 0.39
COG0227Ribosomal protein L28Translation, ribosomal structure and biogenesis [J] 0.39
COG05672-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymesEnergy production and conversion [C] 0.39
COG0640DNA-binding transcriptional regulator, ArsR familyTranscription [K] 0.39
COG1071TPP-dependent pyruvate or acetoin dehydrogenase subunit alphaEnergy production and conversion [C] 0.39
COG1260Myo-inositol-1-phosphate synthaseLipid transport and metabolism [I] 0.39
COG1414DNA-binding transcriptional regulator, IclR familyTranscription [K] 0.39
COG1725DNA-binding transcriptional regulator YhcF, GntR familyTranscription [K] 0.39
COG1959DNA-binding transcriptional regulator, IscR familyTranscription [K] 0.39
COG2186DNA-binding transcriptional regulator, FadR familyTranscription [K] 0.39
COG2188DNA-binding transcriptional regulator, GntR familyTranscription [K] 0.39
COG2378Predicted DNA-binding transcriptional regulator YobV, contains HTH and WYL domainsTranscription [K] 0.39
COG2524Predicted transcriptional regulator, contains C-terminal CBS domainsTranscription [K] 0.39


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.61 %
All OrganismsrootAll Organisms0.39 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300026301|Ga0209238_1000118All Organisms → cellular organisms → Bacteria25231Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil23.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil16.60%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil11.20%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.95%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.02%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands4.25%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere4.25%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.47%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil2.32%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil2.32%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand2.32%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.93%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.93%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.93%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.16%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.16%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.16%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.77%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.77%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.77%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.39%
SedimentEnvironmental → Aquatic → Marine → Sediment → Unclassified → Sediment0.39%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.39%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.39%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.39%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.39%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.39%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze0.39%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.39%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.39%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.39%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.39%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004006Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragA_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004013Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2EnvironmentalOpen in IMG/M
3300004022Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005183Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300009678Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100EnvironmentalOpen in IMG/M
3300009808Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50EnvironmentalOpen in IMG/M
3300009813Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20EnvironmentalOpen in IMG/M
3300009814Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60EnvironmentalOpen in IMG/M
3300010141Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010896Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011437Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2EnvironmentalOpen in IMG/M
3300011443Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012379Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012388Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_24_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012396Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012410Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014324Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015257Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231_16_10DEnvironmentalOpen in IMG/M
3300015357Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017966Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300022226Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_13EnvironmentalOpen in IMG/M
3300024246Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025319Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025521Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025549Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025556Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025567Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027273Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027324Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25381J37097_100541933300002557Grasslands SoilVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
JGI25385J37094_1001418173300002558Grasslands SoilVTMPKVLTASRVRVPAQAETEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIRLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA*
JGI25385J37094_1003458133300002558Grasslands SoilMSKVLTVARVRVPPANEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESLTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAAAEA*
JGI25383J37093_1002469633300002560Grasslands SoilMPKVLTASRVRVPAQAETEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIRLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA*
JGI25383J37093_1019319823300002560Grasslands SoilVPSQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
JGI25382J37095_1014444413300002562Grasslands SoilMPKVLTAARIRVPAANEPDYLATLRELCQFAEARGQRIWLFRNAKDPQLFLEFSESPTEMSHRAQASRLPEEIKLERRLQGLATYAPDAWELWSEVSLPAPAGA*
JGI25382J43887_1017701713300002908Grasslands SoilMPKVLTAARIRVPAANEPDYLATLRELCQFAEARGQRIWLFRNAKDPQLFLEFSESPTEMSHRAQASRLPEEIKLERRLQGLATYAPDAWELWSEVSLPA
JGI25389J43894_100744733300002916Grasslands SoilMPKVLTAXRVRVPPANESDYLATLRELCQFAEARGQRIWLYRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAWELWTEVSFAAPAEA*
JGI25616J43925_1000770993300002917Grasslands SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA*
Ga0055438_1003925413300003995Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLW
Ga0055453_1030918813300004006Natural And Restored WetlandsMPKVLTASRVRVAAANEADYVATLRELCQFADARGQRIWLFRNARDPRLFIEFSESLTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVSLGAEAEA*
Ga0055437_1007074613300004009Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWTEVPLGAATEA*
Ga0055465_1005038833300004013Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWNEVPLGAATEA*
Ga0055465_1028082323300004013Natural And Restored WetlandsMAKVLTASRVRVPAHNEAEYLATLRELCQFADARGQRIWVFRHAADPRLSLEFSESPTEMSHRAQASRLPEEIKLERKLQTLVTYAPDAWDLWSEVSVAARPA*
Ga0055432_1004251523300004022Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLREFTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWTEVPLGAATEA*
Ga0063356_10416922423300004463Arabidopsis Thaliana RhizosphereMKPPRVLTASRVRVPATSEADYLAALRELCQFAEARGQRIWVFRSARDPQLFIEFSESATEMSHRAQASRLPEELKIEKQLQHLATYAPDAWELWTEVPLVED*
Ga0062594_10240100413300005093SoilVRVPAQHETEYVNTLRELSQFAEARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQTIATYAPDAWELWSEVSVWPVAR*
Ga0066674_1019605923300005166SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0066672_1030427233300005167SoilEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYASDAWELWTEVSLAAGAEA*
Ga0066677_1001642923300005171SoilMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0066683_1030445723300005172SoilMPKVLTVARVRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEGPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0066680_1074762513300005174SoilRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPTEA*
Ga0066680_1083774713300005174SoilRIAAPNEADYLATLRELCQFADARSQRIWLFRNAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALGTYAPDAWELWTEVPLTAAAEA*
Ga0066673_1014848813300005175SoilPTNEADYLATLRELCQFAEARGQRIWIFRNAKDAQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPSEA*
Ga0066673_1085410013300005175SoilTACRVRVPAHAEAEYVATLRELAGFASARGQRIWLFRNARDPRLFLEFSESATEMSHRAQASRLPEELKLEKKLQSLATYAPDAWDLWTDVALAAEAEA*
Ga0066688_1046811423300005178SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVS
Ga0066688_1068844713300005178SoilYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
Ga0066685_1000996613300005180SoilARVRVPTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0066678_1046802913300005181SoilSQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0068993_1025881423300005183Natural And Restored WetlandsLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWTEVPLGAATEA*
Ga0066676_10008442103300005186SoilARVRVPTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKQLQALATYAPDAWELWSEVSVWPAERSQ*
Ga0070692_1056696413300005345Corn, Switchgrass And Miscanthus RhizosphereMAKVLTASRVRVPAHNEAEYLATLRELSQFADARGQRIWVFRHGSDPRLFLEFSESPTEMSHRAQASRLPEELKLERKLQQLVTYA
Ga0070694_10016572913300005444Corn, Switchgrass And Miscanthus RhizosphereVRVPSQNETEYVNTLRELSQFAEARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
Ga0070694_10050425613300005444Corn, Switchgrass And Miscanthus RhizosphereMAKVLTASRVRVPSNNEAEYLATLRELCQYAEARGQRIWVFRHAGDPRLFIEFSESPTEMSHRAQASRLPEELKLERKLQQLVTYAPDAWDLWNEVSVAAKPA*
Ga0070708_100000479233300005445Corn, Switchgrass And Miscanthus RhizosphereMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLERRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0070708_10026930513300005445Corn, Switchgrass And Miscanthus RhizosphereTAARVRVAAQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAIATYAPDAWELWSEVPVATASQA*
Ga0066689_10002911123300005447SoilDYLATLRELCQFAEARGQRIWIFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWTEVSLTADSEA*
Ga0070707_10030601023300005468Corn, Switchgrass And Miscanthus RhizosphereMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0070699_10040661533300005518Corn, Switchgrass And Miscanthus RhizosphereMAKVLTASRVRVPSNNEAEYLATLRELCQYAEARGQRIWVFRHAGDPRLFIEYSESPTEMSHRAQASRLPEELKLERKLQQLVTYAPDAWDLWNEVSVAAKPA*
Ga0070741_1134921823300005529Surface SoilVTPSRQPRVLTASRVRVPPTSEADYFAILRQLSQFAEARGQRIWVFRSAKDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLEQLATYAPDAWELWTEVPMGGGEDR*
Ga0066697_1015253533300005540SoilPTNEADYLATLRELCQFAEARGQRIWIFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGQYAPDAWELWTEVSLTAGSEA*
Ga0066701_1001112613300005552SoilVPPTHEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESATEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWTEVSLAAPAEA*
Ga0066695_1002193013300005553SoilASRVRVPAASEADYLATLRELCQYAEARGQRIWVFRHAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLETHLQSLVTYAPDAWELWTEVSLAAPTEA*
Ga0066695_1053684133300005553SoilATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA*
Ga0066661_1006492063300005554SoilATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYASDAWELWTEVSLAAGAEA*
Ga0066661_1006506263300005554SoilATLRELCQFAEARGQRIWIFRNAKDAQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPSEA*
Ga0066661_1018466623300005554SoilMPKVLTASRVRVPAPAEPEYFATLKELTQFAEARGQRISVFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVALPALTGEGA*
Ga0066661_1020366733300005554SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPHLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0066692_1090817323300005555SoilRVRVPAQHETEYVNTLRELSQFAEARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEITLERRLQSLATYAPDAWELWSEVSVWPVAP*
Ga0066704_1004467523300005557SoilMPKVLTASRVRVPAQAETEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAPAEA*
Ga0066704_1007540963300005557SoilIAAPNEADYLATLRELCQFADARSQRIWLFRNAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALGTYAPDAWELWTEVPLTAAAEA*
Ga0066700_1111027613300005559SoilQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAIATYAPDAWELWSEVPVATASQA*
Ga0066699_1047976133300005561SoilRVRVAPQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAVATYAPDAWELWSEVPVAAASQT*
Ga0066699_1070360833300005561SoilVPPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWSEVSLAAPTEA*
Ga0066694_1001754373300005574SoilRVPPTHEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESATEMSHRAQASRLPEEIKLERRLQSLGTYASDAWELWTEVSLAAPAEA*
Ga0066708_1064975623300005576SoilMPKVLTVARVRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPQLFTEFSEGPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPAEA*
Ga0066706_1003630913300005598SoilRVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0070702_10162327213300005615Corn, Switchgrass And Miscanthus RhizosphereQNEAEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
Ga0074470_1009775233300005836Sediment (Intertidal)MPKVLTASRVRVAAANEADYLATLRELCQFAEARGQRIWLFRNARDPHLFIEFSESLTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVSIGAEAEA*
Ga0066651_1012395613300006031SoilRVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQGLGTYAADAWELWTEVSLAAEAEA*
Ga0066651_1019304213300006031SoilRVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQGLGTYAADAWELWTEVSLAAPTEA*
Ga0066656_1004624413300006034SoilTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA*
Ga0075417_1015297423300006049Populus RhizosphereMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPTAAMEP*
Ga0070716_10005558013300006173Corn, Switchgrass And Miscanthus RhizosphereEADYLATLRELCQFAEARGQRIWIFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWTQVSLEAGSEA*
Ga0079222_1000117033300006755Agricultural SoilMPKILTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAWELWTEVSLAAAAEA*
Ga0079222_1220746313300006755Agricultural SoilRVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
Ga0079221_10000024433300006804Agricultural SoilMPRHLTASRVRVSKTNEPDYLAALRELAQFAEARGGRIWVYRNAKDPQLFIDFTESATEMSHRAQASRLPEETKLEKKLQGLATYAPDAWELWTEVGLAAGAEA*
Ga0079221_1059930713300006804Agricultural SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLVAGAE
Ga0075428_10143762113300006844Populus RhizospherePAANEAEYLATLKELCQFADARGQRIWLFRHATDPRLFIEFSESRTEMSHRAHASRLPEEIKLEKKLQGLVTYAPDAWELWSEVTLATAAEA*
Ga0075421_10084981323300006845Populus RhizosphereMPKVLTASRVRVPVANEPEYFATLRELTQFAEARGQRIWLFRHAADPQLFLEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWNEVPLGAATEA*
Ga0075421_10170024133300006845Populus RhizosphereLKELCQFADARGQRIWLFRHATDPRLFIEFSESRTEMSHRAHASRLPEEIKLEKKLQGLVTYAPDAWELWSEVTLATAAEA*
Ga0075421_10198055723300006845Populus RhizosphereMPKVLTASRVRVPAANEADYFATLRELCQFAEARGQRIWLFRNARDPRLFIEFSESQTEMSHRAQASRLPEETKLEKRLQSLVTYAPDAWELWSEVSLAAPAEA*
Ga0075425_10089860133300006854Populus RhizosphereRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0075425_10169986713300006854Populus RhizosphereETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
Ga0075434_10189885013300006871Populus RhizosphereTAARVRVPTQNETEYVNTLRELSQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0075436_10051687423300006914Populus RhizosphereMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0079219_1003512813300006954Agricultural SoilLATLRELCQFAEARGQRIWIFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGQYAPDAWELWTEVSLTAGREA*
Ga0079219_1011683923300006954Agricultural SoilMPKILTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAW
Ga0079219_1044075813300006954Agricultural SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGT
Ga0099794_1080348313300007265Vadose Zone SoilTAARVRIAASNEADYLATLRELCQFAEARSQRIWLFRNAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQARGTYAPDAWELWTEVPLTAAAEA*
Ga0066710_10150930313300009012Grasslands SoilPKVLTVARVRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEGPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA
Ga0066710_10190683113300009012Grasslands SoilEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA
Ga0066710_10195145623300009012Grasslands SoilMPKVLTESRVRVPAPNEADYLAVLRELQQSADARGQRIWLFRHAKDPRLFIECSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA
Ga0099829_1005314733300009038Vadose Zone SoilMAKVLTASRVRVAAQAEAEYLATLRELGEFADARGQKIWLYRNAKDPRLLIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAEAEA*
Ga0099829_1022841923300009038Vadose Zone SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAATAEA*
Ga0099829_1113483713300009038Vadose Zone SoilMPKVLTASRVRVPAPNEADYLATLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEETQLEKKLKQLGTYAPDAWEL
Ga0099830_1033676833300009088Vadose Zone SoilMPKVLTASRVRVPAQAETEYLATLRELCQFADARGQKIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAATEA*
Ga0099828_1037573623300009089Vadose Zone SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA*
Ga0099827_1166629113300009090Vadose Zone SoilACRVRVPAHAEAEYVATLRELAGFASARGQRIWLFRNARDPRLFLEFSESATEMSHRAQASRLPEELKLEKQLQSLATYAPDAWDLWTDVALAAEAEA*
Ga0066709_10042781723300009137Grasslands SoilVLTASRVRVPAASEADYLATLRELCQYAEARGQRIWVFRHAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLETHLQSLVTYAPDAWELWTEVSLAAPTEA*
Ga0066709_10263451713300009137Grasslands SoilEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA*
Ga0066709_10276586913300009137Grasslands SoilHESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA*
Ga0114129_1029985823300009147Populus RhizosphereMTMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPTAAMEP*
Ga0105347_116962423300009609SoilMPKVLTASRVRVPANNEREYLATLRELCQYADARGQRIWVFRHASDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQALVTYAPDAWDLWTEVSVSAPREA*
Ga0105252_1007438023300009678SoilMPKVLTASRVRVPANNEREYLATLRELCQYADARGQRIWVFRHASDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQALVTYAPDAWDLWTEVSVNAPREA*
Ga0105071_109422523300009808Groundwater SandMAKVLTASRVRVPAHNEAEYFATLRELCQFAEARGQRIWVFRHASDPRLFIEFSESPTEMSHRAQASRLPEELKLERKLQQLVTY
Ga0105057_107607823300009813Groundwater SandMPKVLTVARVRVPPAHEADYLAALRELCQFADARGQRIWLFRNAKDSHLFTEFSESPTEMSHRAQASRLPEEIKLEKRLQGLGIYAPDAWELWTEVSLAAPTEA*
Ga0105082_105072923300009814Groundwater SandMPKVLTVARVRVPPAHEADYLAALRELCQFADARGQRIWLFRNAKDSQLFTEFSESPTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPTEA*
Ga0127499_125389123300010141Grasslands SoilVRVPPANESDYLATLRELCQFAEARGQRIWLYRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAWELWTEVSFAAPAEA*
Ga0134070_1033175723300010301Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSL
Ga0134088_1050357813300010304Grasslands SoilTACRVRVPAHAEAEYLATLRELAGFASARGQRIWLFRNARDPRLFLEFSESATEMSHRAQASRLPEELKLEKKLQTLATYAPDAWDLWTDVAIAAEAEG*
Ga0134086_1029641123300010323Grasslands SoilTAARVRVPTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0134111_1056549913300010329Grasslands SoilARVRVPPANESDYLATLRELCQFAEARGQRIWLYRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQGLGTYAPYAWELWTEVSFAAPAEA*
Ga0134080_1005053823300010333Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAHSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0126377_1245694523300010362Tropical Forest SoilMTMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNSKDPQLFIEFSESLTEMSHRAQASRLPEETKLEKKLQGLVTYAPDAWELWSEVPLPAAAVEP*
Ga0134066_1004773633300010364Grasslands SoilYLATLRELCQFAEARGQRIWIFRNAKDAQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPSEA*
Ga0136847_1150313363300010391Freshwater SedimentMPKVLTASRVRIAAANEAEYLGTLRELCQFAEARGQRIWLFRSAKDPRLFIEFSESATEMSHRAQASRLPEETKLEKRLQALATFAPDAWDLWSEVSLATPSAA*
Ga0136847_1195198113300010391Freshwater SedimentMPKVLTASRVRVAAANEADYFATLRELGQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETRLEKRLQSLVTYAPDAWELW
Ga0138111_109240013300010896Grasslands SoilLTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
Ga0137391_1023805513300011270Vadose Zone SoilMPKVLTASRVRVPAQAEAEYLATLRELCQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAATAEA*
Ga0137391_1086138623300011270Vadose Zone SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA*
Ga0137391_1118088813300011270Vadose Zone SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAE
Ga0137393_1006788263300011271Vadose Zone SoilMPKVLTASRVRVPAPNEADYLAVLRELRQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA*
Ga0137429_100506313300011437SoilTQNEAEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA*
Ga0137457_121960823300011443SoilMAKVLTASRVRVPANNEREYLATLRELCQYADARGQRIWVFRHASDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQALVTYAPDAWDLWTEVSVNAPREA*
Ga0137389_1178891523300012096Vadose Zone SoilMPKVLTASRVRVPAQAEVEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAATAEA*
Ga0137388_1059223123300012189Vadose Zone SoilMPKVLTASRVRVPAQAELEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAPAEA*
Ga0137388_1138739623300012189Vadose Zone SoilPTNEADYVATLRELCQFAEARGQRIWLYRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAADAWELWTEVSLPAVADV*
Ga0137364_1069686233300012198Vadose Zone SoilAARVRVAPTNEADYLATLRELCQFAEARGQRIWIFRNAKDAQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPSEA*
Ga0137383_1003339623300012199Vadose Zone SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTGA*
Ga0137383_1049235033300012199Vadose Zone SoilDYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYASDAWELWTEVSLAAGAEA*
Ga0137382_1023799633300012200Vadose Zone SoilVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPAEA*
Ga0137365_1042373123300012201Vadose Zone SoilMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTGA*
Ga0137399_1027572313300012203Vadose Zone SoilMPKVLTASRVRVPAQAESEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA*
Ga0137374_1070394913300012204Vadose Zone SoilAARVRVPTQNETEYVNTLRELSQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0137380_1017780523300012206Vadose Zone SoilMPKVLTASRVRVPAPNEADYLATLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPHGAGAEA*
Ga0137381_1021246423300012207Vadose Zone SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLSTYAPDAWELWSEVSLGAGAEA*
Ga0137381_1032721323300012207Vadose Zone SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGTYAPDAWELWTEVSLAAPTGA*
Ga0137381_1168316213300012207Vadose Zone SoilEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPTEA*
Ga0137376_1045790933300012208Vadose Zone SoilAASEADYLATLRELCQYADARGQRIWLFRSAQDPQLFIEFSESPTEMSHRAQASRLPEEIKLEKHLQSLVTYAPDAWDLWTEVSLEAPTEA*
Ga0137379_1118289023300012209Vadose Zone SoilVLTASRVRVPAASEADYLATLRELCQYAEARGQRIWVFRHAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLETHLQSLVTYAPDAWELWTEVSLGAPTEA*
Ga0137378_1008508623300012210Vadose Zone SoilMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLCAYAPDAWELWTEVSLAAPTEA*
Ga0137378_1059999213300012210Vadose Zone SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVSLG
Ga0137378_1092313613300012210Vadose Zone SoilNVADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESATEMSHRAQASRLPEEIKLERRLQSLGTYASDAWELWTEVSLAAPAEA*
Ga0137378_1158257313300012210Vadose Zone SoilDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0137377_1039745513300012211Vadose Zone SoilLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESLTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAAAEA*
Ga0137377_1058745733300012211Vadose Zone SoilMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPHLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA*
Ga0137387_1002125363300012349Vadose Zone SoilVLTASRVRVPAASEADYLATLRELCQYAEARGQRIWVFRHAKDPQLFIEFSESPTEMSHRAQASRLPEEITLETHLQSLVTYAPDAWELWTEVSLAAPTEA*
Ga0137387_1022746633300012349Vadose Zone SoilEADYLATLRELCQFAEARSQRIWLFRNAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALGTYAPDAWELWTEVPLTAAAEA*
Ga0137387_1065072933300012349Vadose Zone SoilYKADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSVGTYAPDAWELWSEVSLATPTEA*
Ga0137387_1099282913300012349Vadose Zone SoilEADYLATLRELCQFAEARSQRIWLFRNAKDPQLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAVGTYAPDAWELWTEVPLTAAAEA*
Ga0137367_1037165923300012353Vadose Zone SoilMTMPKVLTASRVRVAAANEADYLATLRELAQYADARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPTAAMEP*
Ga0137366_1081928723300012354Vadose Zone SoilPNVLTVARVLVAPTHEADYLAPLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLCAYAPDAWELWTEVSLAAPTEA*
Ga0137371_1085352113300012356Vadose Zone SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEVPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAP
Ga0137371_1138814223300012356Vadose Zone SoilRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPAEA*
Ga0137371_1145058913300012356Vadose Zone SoilMPKVLTVARVRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEGPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAW
Ga0137384_1123507513300012357Vadose Zone SoilNVADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
Ga0134058_105419123300012379Grasslands SoilVRVPPANEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEVKLEKRLQSLGTYAPDAWELWTDVSLASAAEA*
Ga0134031_108680223300012388Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0134057_109070923300012396Grasslands SoilMSKVLTVARVRVPPANESDYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
Ga0134060_122998613300012410Grasslands SoilPANEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEVKLEKRLQSLGTYAPDAWELWTDVSLAAAAEA*
Ga0137373_1006394713300012532Vadose Zone SoilATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEELKLEKQLQSLGTYASDAWELWTEVSLAAPTEA*
Ga0137397_10007850133300012685Vadose Zone SoilMPKVFTAARVRVPTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0137396_1016958013300012918Vadose Zone SoilVPTQNETEYVNTLRELSKFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0137396_1063674823300012918Vadose Zone SoilMPKVLTASRVRVPAQAESEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAPAEA*
Ga0137396_1124174213300012918Vadose Zone SoilTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0137394_10021994103300012922Vadose Zone SoilVMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLRAGAEA*
Ga0137419_1060771623300012925Vadose Zone SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTERSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0137416_1185141513300012927Vadose Zone SoilMPKVLAAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA*
Ga0137404_1201630223300012929Vadose Zone SoilVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA*
Ga0137410_1002677113300012944Vadose Zone SoilVRVPSQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEITLEKRLQALATYAPDAWELWSEVPVATTSQA*
Ga0134087_1055557723300012977Grasslands SoilVPPTNEADYLATLRELCQFADARGQRIWLFCNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSFAAPTEA*
Ga0134081_1004160613300014150Grasslands SoilMPKVLTVARVRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEGPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPAEA*
Ga0134075_1020577523300014154Grasslands SoilMSKVLTVARVRVPPANESDYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWTEVPIAAATEA*
Ga0075352_105784323300014324Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWTEVPLGAATEV*
Ga0137418_1071195113300015241Vadose Zone SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAG
Ga0180067_112578223300015257SoilARVRVPANNEREYLATLRELCQYADARGQRIWVFRHASDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQALVTYAPDAWDLWTEVSVSAPREA*
Ga0134072_1016749513300015357Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRDAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPAEA*
Ga0134089_1007466233300015358Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLGAPTEA*
Ga0134112_1047136423300017656Grasslands SoilMSKVLTVARVRVPPANESDYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWTEVPIAAATEA
Ga0187776_1128129423300017966Tropical PeatlandMTTPRVLTASRVRVPATSEVDYFATLRELRRFAEARGQKIWLFRNAKDPRLFIEFSESRSEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVAATSEA
Ga0184610_111113113300017997Groundwater SedimentMPKVLTASRVRVPAANEADYFATLRELCQFADARGQRIWLFRNARDPRLFIEFSESQTEMSHRAQASRLPEETKLEKRLQSLVTYAPDA
Ga0187766_1070096913300018058Tropical PeatlandRVPATSEADYLAVLRQLAQFAVARGQRIWVFRSARDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLQTIATYAPDAWELWSEVPLALPEEPEPEL
Ga0187765_1024970523300018060Tropical PeatlandVTTPRVLTASRVRVPATSEVDYFATLRELRRYAEARGQKIWVFRSAADPHLFIEFSESPTEMSHRAQASRLPEELKLEKHLQHLVTYAPDAWDLWTEVPIEAPTDT
Ga0184637_1018700223300018063Groundwater SedimentMPKVLTASRVRVPANNEAEYLATLRELCQYAEARGQRIWVFRHAGDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQSLVTYAPDAWDLWSEVSVAAPRQA
Ga0184640_1052576413300018074Groundwater SedimentETEYVSVLRELRQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKRLQSLATYAPDAWELWSEVPVTATTQA
Ga0066655_1011851223300018431Grasslands SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA
Ga0066655_1048715813300018431Grasslands SoilRVPPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQSLGTYAPDAWELWSEVSLAAPTEA
Ga0066667_1037308113300018433Grasslands SoilVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQGLGTYAADAWELWTEVSLAAGAEA
Ga0066667_1175047323300018433Grasslands SoilPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPTEA
Ga0066662_1008714423300018468Grasslands SoilMPKVLTASRVRVPAQAETEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIRLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA
Ga0066662_1101549623300018468Grasslands SoilMPKVLTAARIRVPAANEPDYLATLRELCQFAEARGQRIWLFRNAKDPQLFLEFSESPTEMSHRAQASRLPEEIKLERRLQGLATYAPDAWELWSEVSLPAPAGA
Ga0187894_1006307133300019360Microbial Mat On RocksMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKRLQSLVTYAPDAWELWSEVAIPSAAMEA
Ga0187892_10000673623300019458Bio-OozeMPKVLTASRVRVAAANEADYLATLRDLAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPSAAMET
Ga0187893_10001134663300019487Microbial Mat On RocksMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPSAAMET
Ga0137408_148509433300019789Vadose Zone SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA
Ga0193715_106892613300019878SoilLTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLAAPAEA
Ga0215015_1108922723300021046SoilVPPTNEADYLATLRELCQFADARGQRIWLYRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYATDAWELWTEVSLPAVADV
Ga0210404_1023459623300021088SoilMPKVLTASRVRVPAPNESDYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA
Ga0224512_1005275323300022226SedimentMPKVLTASRVRVAAANEAEYVATLRELTQFAEARGQRIWLFRHATDAQLFLEFSESRSEMSHRAQASRLPEEIKLEKKLQTLATYAPDAWELWTEVSVAATET
Ga0247680_100303923300024246SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLGAGAEA
Ga0209108_1043932213300025165SoilPAPNETEYVSTLRELCQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKRLQSLATYAPDAWELWSEVSLTAPSQA
Ga0209520_1075689113300025319SoilAHEADYLAALRELCQFADARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLATPTEA
Ga0209640_1044108623300025324SoilMPKVLTASRVRVAAANEADYLATLRELVQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKRLQGLVTYAPDAWELWSEVALPAAAMEA
Ga0209341_1041478533300025325SoilEAEYLSALRELGQFAEARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKRLQTLATYAPDAWELWSEVSLASPSPA
Ga0210083_100743423300025521Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWNEVPLGAATEA
Ga0210094_101693323300025549Natural And Restored WetlandsMPKVLTASRVRVPAANEPEYLATLRELTQFAEARGQRIWLFRHAADPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWTEVPLGAATEA
Ga0210120_112828923300025556Natural And Restored WetlandsMPKVLTASRVRVAAANEADYVATLRELCQFADARGQRIWLFRNARDPRLFIEFSESLTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVSLGAEAEA
Ga0210076_112260923300025567Natural And Restored WetlandsMAKVLTASRVRVPAHNEAEYLATLRELCQFADARGQRIWVFRHAADPRLSLEFSESPTEMSHRAQASRLPEEIKLERKLQTLVTYAPDAWDLWSEVSVAARPA
Ga0207653_1005822713300025885Corn, Switchgrass And Miscanthus RhizosphereADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA
Ga0207684_1014180723300025910Corn, Switchgrass And Miscanthus RhizosphereMPKVLTVARVRVAPTNEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLERRLQSLGAYAPDAWELWTEVSLAAPTEA
Ga0207687_1182369913300025927Miscanthus RhizosphereQNEAEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA
Ga0209234_120445513300026295Grasslands SoilLTAARVRVAPQDETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAVATYAPDAWELWSEVPVATASQT
Ga0209235_129045713300026296Grasslands SoilRVRVPAQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA
Ga0209236_1000891263300026298Grasslands SoilMSKVLTVARVRVPPANEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESLTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAAAEA
Ga0209238_1000118253300026301Grasslands SoilMPKVLTASRVRVPPANESDYLATLRELCQFAEARGQRIWLYRNAKDPQLFTEFSESPTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAWELWTEVSFAAPAEA
Ga0209238_117956123300026301Grasslands SoilHAEAEYVATLRELAGFASARGQRIWLFRNARDPRLFLEFSESATEMSHRAQASRLPEELKLEKKLQSLATYAPDAWDLWTDVALAAEAEA
Ga0209239_100677113300026310Grasslands SoilARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQGLGTYAADAWELWTEVSLAAGAEA
Ga0209471_100477313300026318SoilLTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLAAPTEA
Ga0209472_117315833300026323SoilLTAARVRVPTQNETEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA
Ga0209470_113981133300026324SoilRVPPTSEADYLATLRELCQFAEARGQRIWLFRNTKDPHLFTEFSEGPTEMSHRAQASRLPEELKLEKRLQSLGAYAPDAWELWTEVSLAAPTEA
Ga0209152_1050246313300026325SoilPTHEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLTADSEA
Ga0209802_113628113300026328SoilEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPTEA
Ga0209803_127751813300026332SoilRVPSQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA
Ga0209690_104898233300026524SoilMPKVLTVARVRVAPTNEADYFATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESPTEMSHRAQSSRLPEELKLEKRLQSLGAYAPDAWELWTFLND
Ga0209378_1007871123300026528SoilAARVRVPPTHEADYLATLRELCQFAEARGQRIWLFRNAKDPHLFTEFSESATEMSHRAQASRLPEEIKLERRLQSLGTYASDAWELWTEVSLAAPAEA
Ga0209378_113474133300026528SoilAARVRVPPTHEADYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESATEMSHRAQASRLPEEIKLERRLQSLGTYASDAWELWTEVSLAAPAEA
Ga0209160_104961923300026532SoilMPKVLTASRVRVPAQAETEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAPAEA
Ga0209056_1060661313300026538SoilQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQALATYAPDAWELWSEVPVATTSQA
Ga0209156_1040669513300026547SoilAPQNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLERRLQAVATYAPDAWELWSEVPVATASQT
Ga0209648_1003131853300026551Grasslands SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA
Ga0209648_1013448333300026551Grasslands SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAATAEA
Ga0209886_100585223300027273Groundwater SandMAKVLTASRVRVPAHNEAEYFATLRELCQFAEARGQRIWVFRHAGDPRLFLEFSESPTEMSHRAQASRLPEELKLERKLQSLVTYAPDAWDLWNEVSVAARTA
Ga0209845_103154323300027324Groundwater SandMPKVLTVARVRVPPAHEADYLAALRELCQFADARGQRIWLFRNAKDSQLFTEFSESPTEMSHRAQASRLPEEIKLEKRLQSLGTYAPDAWELWTEVSLAAPSEA
Ga0209178_110655723300027725Agricultural SoilMPKILTAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLERRLQGLGTYAPDAWELWTEVSLAAAAEA
Ga0209177_1035510513300027775Agricultural SoilPTQNETEYVNTLRELSQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA
(restricted) Ga0233416_1026735123300027799SedimentMPKVLTASRVRVSAANETEYLATLRELIQFAEARGQRIWLFRHASDPQLFIEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWNEVPLGAATE
Ga0209180_1016087223300027846Vadose Zone SoilMAKVLTASRVRVAAQAEAEYLATLRELGEFADARGQKIWLYRNAKDPRLLIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAEAEA
Ga0209180_1022573423300027846Vadose Zone SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAATAEA
Ga0209814_1001070423300027873Populus RhizosphereMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPTAAMEP
Ga0209283_1015602323300027875Vadose Zone SoilMPKVLTASRVRVPGQAEAEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA
Ga0209283_1043998213300027875Vadose Zone SoilASRVRVPAQAETEYLATLRELCQFADARGQKIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAATEA
Ga0209488_1110407913300027903Vadose Zone SoilMPKVLTASRVRVPAPNEADYLAVLRELQQFADARGQRIWLFRHAKDPRLFIEFSEGPTAMSHRAQASRLPEEIQLEKKLKQLGTYAPDAWELWSEVPLAAGAEA
Ga0209382_1058393023300027909Populus RhizosphereMPKVLTASRVRVPVANEPEYFATLRELTQFAEARGQRIWLFRHAADPQLFLEFSESRSEMSHRAQASRLPEEIKLEKKLQSLVTYAPDAWDLWNEVPLGAATEA
Ga0209382_1168223723300027909Populus RhizosphereMPKVLTASRVRVPAANEADYFATLRELCQFAEARGQRIWLFRNARDPRLFIEFSESQTEMSHRAQASRLPEETKLEKRLQSLVTYAPDAWELWSEVSLAAPAEA
Ga0209853_110440823300027961Groundwater SandMPKVLTVARVRVPPAHEADYLAALRELCQFADARGQRIWLFRNAKDSQLFTEFSESPTEMSHRAQASRLPEELKLERKLQSLVTYAPDAWDLWNEVSVAARTA
Ga0137415_1080125223300028536Vadose Zone SoilMPKVLAAARVRVPPTNEADYLATLRELCQFADARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQSLGTYASDAWELWTEVSLATPTEA
Ga0307281_1027466313300028803SoilADYLTTLRELTQFADARGQRIWVFRSAKDPRLFLECSESPTEMSHRAQASRLPEEIQLEKRLQSLVTYAPDAWELWTEVSLATPAGA
Ga0307278_1016447813300028878SoilEYVNTLRELSQFAEARGQRIWLYRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQALATYAPDAWELWSEVPVATTSQA
Ga0307505_1022406023300031455SoilMPNKVLTASRVRVPANNEAEYLATLRELCQFAEARGQRIWVFRHAGDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQSLVTYAPDAWDLWTEVSFAAPRQV
Ga0247727_1082472823300031576BiofilmMPKVLTASRVRIAAANEAEYLGTLRELCQFAEARGQRIWLFRSAKDPRLFIEFSESATEMSHRAQASRLPEETKLEKRLQALATFAPDAWDLWSEVSLATPSEA
Ga0214473_1006880123300031949SoilMPKVLTASRVRVAAANEADYLATLRELAQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEETKLEKKLQSLVTYAPDAWELWSEVALPAAAMEA
Ga0214473_1100559533300031949SoilEYLSTLRELGQFADARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIQLEKRLQSLATYAPDAWELWSDVSLTAQSQA
Ga0307479_1019017033300031962Hardwood Forest SoilMPKVLTASRVRVPAQSEAEYLAILRELCQFADARGQKIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEEIKLEKRLQGLVTYAPDAWDLWTEVSLAAAAEA
Ga0307479_1174942823300031962Hardwood Forest SoilMTRVLTVSRVRVPAASEADYVATLRALGEFAVARGQRLWLFRSAQDPGLFIEFSESPTEMSHRAQASRLPEELKLEAHLQHTVTYAPDAWELWTEVPL
Ga0326597_1049366833300031965SoilMPKVLTASRVRVPAANEPEYFATLRELTQFAEARGQRIWLFRHAADPQLFLEFSESRSEMSHRAQASRLPEEITLEKKLQSLVTYAPDAWDLWNEVPLGAATEA
Ga0307471_10058616013300032180Hardwood Forest SoilMTRVLTVSRVRVPAASEADYVATLRALGEFAVARGQRLWLFRSAQDPGLFIEFSESPTEMSHRAQASRLPEELKLEAHLQHTVTYAPDAW
Ga0307471_10120443233300032180Hardwood Forest SoilDYLATLRELCQFAEARGQRIWLFRNAKDPQLFTEFSESQTEMSHRAQASRLPEEIKLEKRLQGLGTYAPDAWELWTEVSLGAAAEA
Ga0307471_10425582923300032180Hardwood Forest SoilMAKVLTASRVRVAARAEAEYLATLRELGQFADARGQKIWLYRNAKDPRLLIEFSESPTEMSHRAQASRLPEENKLEKRLQGLVTYAPDAWDLWTEVSLAAEAEA
Ga0335085_10002051343300032770SoilMSAPNALPRALTASRVRVPPTGEADYLAVLRQLAQFAVARGQRIWVFRSARDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLQTIATYAPDAWELWTEVPLAIPDEPEQEL
Ga0335085_10008174123300032770SoilVSTRDSLPRVLTASRVRVPPTSEADYLAVLRQLAQFAVARGQRIWVFRSAKDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLQKIATYAPDAWELWTEVPLALPEEPEPEL
Ga0335079_1094081433300032783SoilDYLTTLRELSRFAEARGQRIWVFRSAADPQLFIEFSESATEMSHRAQASRLPEEVKLERRLQGIATYAPDAWDLWTEVPLPARSET
Ga0335080_1013140323300032828SoilVSTRDSLPRVLTASRVRVPPTSEADYLAVLRQLAQFAVARGQRIWVFRSAKDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLQKIATYAPDAWELWTEVPVALPEEPEPEL
Ga0335070_1013956733300032829SoilVTTPRVLTASRVRVPATSEVDYFATLRDLRRYAEARGQKIWVFRSAADPQLFIEFSESPTEMSHRAQASRLPEELKLEKHLQSLVTYAPDAWDLWTEVPIEAPTET
Ga0335083_1027349123300032954SoilMSAPNALPRALTASRVRVPPTGEADYLAVLRQLAQFAVARGQRIWVFRSARDPQLFIEFSESPTEMSHRAQASRLPEELKLERQLQTIATYAPDAWELWTEVPLATPDEPEQEL
Ga0214472_1165026123300033407SoilARVRVAAPNETEYVNTLRELSQFAEARGQRIWLFRNAKDPRLFIEFSESPTEMSHRAQASRLPEELTLEKRLQRLATYAPDAWELWSEVPVATTSQA
Ga0214471_1011515923300033417SoilMAKVLAASRVRVPAPNESEYIATLRELIQFAEARGQRIWVFRHAKDPRLFIEFSESATEMSHRAQASRLPEEIKLERKLQTLATYAPDAWDLWTEVPLTTPTEA
Ga0214471_1128767723300033417SoilMAKVLAASRVRVPAPNESEYIATLRELIQFAEARGQRIWVFRHAKDPRLFIEFSESATEMSHRAQASRLPEEIKLERKLQTLATY
Ga0364934_0369501_280_5433300034178SedimentMPKVLTASRVRVPANNEAEYLATLRELCQYAEARGQRIWVFRHAGDPRLFIEFSESPTEMSHRAQASRLPEEIKLERKLQSLVTYAPD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.