NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F035426



Overview

Basic Information
Family ID F035426
Family Type Metagenome / Metatranscriptome
Number of Sequences 172
Average Sequence Length 159 residues
Representative Sequence MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTA
Number of Associated Samples 169
Number of Associated Scaffolds 172

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 50.00 %
% of genes near scaffold ends (potentially truncated) 1.16 %
% of genes from short scaffolds (< 2000 bps) 1.16 %
Associated GOLD sequencing projects 164
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Unclassified (98.837 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil (9.884 % of family members)
Environment Ontology (ENVO) Unclassified (34.884 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (29.651 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution:
α-helix: 32.30%
β-sheet: 26.09%
Coil/Unstructured: 41.61%
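The secondary structure percentages can be converted back into approximate residue counts. The sketch below assumes the prediction was made on the representative sequence listed under Basic Information (the page does not state this explicitly) and that each percentage is a residue count divided by the sequence length. The variable names are illustrative only.

```python
# Representative sequence copied from the Basic Information section.
REPRESENTATIVE = (
    "MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAP"
    "LVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILN"
    "IDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYR"
    "HKHDVTLIRTA"
)

# Percentages as reported in the secondary structure distribution.
percentages = {"alpha_helix": 32.30, "beta_sheet": 26.09, "coil": 41.61}

length = len(REPRESENTATIVE)

# Assumption: percentage = residues in that state / sequence length * 100.
counts = {name: round(pct / 100 * length) for name, pct in percentages.items()}

for name, n in counts.items():
    print(f"{name}: ~{n} of {length} residues")
```

If the rounded counts add up to the sequence length, that supports the assumption that the percentages refer to this sequence rather than to a family-wide average.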

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 172 Family Scaffolds
PF00165 | HTH_AraC | 1.16
PF13489 | Methyltransf_23 | 0.58
PF07676 | PD40 | 0.58
PF12146 | Hydrolase_4 | 0.58
PF02195 | ParBc | 0.58
PF09966 | DUF2200 | 0.58
PF12833 | HTH_18 | 0.58
PF12872 | OST-HTH | 0.58
PF03050 | DDE_Tnp_IS66 | 0.58

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 172 Family Scaffolds
COG3436 | Transposase | Mobilome: prophages, transposons [X] | 0.58
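The frequencies in the Pfam and COG tables are percentages of the 172 family scaffolds, so each value corresponds to a small whole number of scaffolds. The following is a minimal sketch of that conversion, assuming the reported figures are counts divided by 172 and rounded to two decimals. The function name is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SCAFFOLDS = 172  # "Number of Associated Scaffolds" from the Overview


def scaffolds_from_percent(percent, total=FAMILY_SCAFFOLDS):
    """Return the nearest whole number of scaffolds for a % frequency."""
    return round(percent / 100 * total)


# Two of the frequencies that occur in the neighborhood tables.
for label, pct in [("PF00165 HTH_AraC", 1.16), ("COG3436 Transposase", 0.58)]:
    print(label, "->", scaffolds_from_percent(pct), "scaffold(s)")
```

Under this assumption, every neighboring domain listed here was observed on only one or two scaffolds, so the neighborhood evidence should be read cautiously.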



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 98.84 %
All Organisms | root | All Organisms | 1.16 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300009177|Ga0105248_12416590 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 599
3300009792|Ga0126374_11419561 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 566
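Each scaffold identifier has two parts separated by a vertical bar. The numeric prefixes (3300009177 and 3300009792) also appear as Taxon OIDs in the Associated Samples table, which suggests the prefix names the source sample; this reading is an inference from the page, not a documented format. A minimal sketch for splitting the identifier, with a hypothetical helper name:

```python
def split_scaffold_id(scaffold_id):
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts.

    Assumption: the text before the first '|' is the IMG/M Taxon OID
    and the remainder is the scaffold name within that dataset.
    """
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    return taxon_oid, scaffold_name


for sid in ["3300009177|Ga0105248_12416590", "3300009792|Ga0126374_11419561"]:
    oid, name = split_scaffold_id(sid)
    print(oid, name)
```

Using `str.partition` rather than `split` keeps the helper safe if a scaffold name ever contained a further `|`, since only the first separator is consumed.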

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 9.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.72%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 4.07%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.07%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.49%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 3.49%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 3.49%
Wetland | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland | 2.33%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 2.33%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.33%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.33%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 2.33%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.33%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 2.33%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 2.91%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.74%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.74%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.74%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.74%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.74%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.16%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.16%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.16%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.16%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.16%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.16%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.16%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 1.16%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.16%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 1.16%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 0.58%
Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Microbial Mat | 0.58%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.58%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 0.58%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.58%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.58%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.58%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.58%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Agricultural Soil | 0.58%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.58%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.58%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.58%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.58%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.58%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.58%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.58%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.58%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.58%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.58%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.58%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.58%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.58%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 0.58%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.58%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.58%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.58%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.58%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.58%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.58%
Agave | Host-Associated → Plants → Phyllosphere → Phylloplane/Leaf Surface → Unclassified → Agave | 0.58%
Switchgrass, Maize And Mischanthus Litter | Engineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter | 0.58%
Rice-Straw Enriched Compost | Engineered → Solid Waste → Grass → Composting → Bioreactor → Rice-Straw Enriched Compost | 0.58%
Activated Sludge | Engineered → Wastewater → Nutrient Removal → Biological Phosphorus Removal → Bioreactor → Activated Sludge | 0.58%
Ionic Liquid And High Solid Enriched | Engineered → Lab Enrichment → Defined Media → Unclassified → Unclassified → Ionic Liquid And High Solid Enriched | 0.58%
Anaerobic Digester Digestate | Engineered → Bioreactor → Anaerobic → Unclassified → Unclassified → Anaerobic Digester Digestate | 0.58%
Sediment Slurry | Engineered → Bioremediation → Metal → Unclassified → Unclassified → Sediment Slurry | 0.58%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2170459019Litter degradation MG4EngineeredOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002124Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3EnvironmentalOpen in IMG/M
3300003331Ionic liquid and high solid enriched microbial communities from the Joint BioEnergy Institute, USA - AR20-2-R (Metagenome Metatranscriptome, Counting Only)EngineeredOpen in IMG/M
3300003373Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1Host-AssociatedOpen in IMG/M
3300003993Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D2EnvironmentalOpen in IMG/M
3300004024Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300005281Thermophilic microbial communities from the Joint Bioenergy Institute, California, USA of rice/straw/compost enrichment - eDNA_2EngineeredOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005329Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaGEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005335Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaGHost-AssociatedOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005364Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaGHost-AssociatedOpen in IMG/M
3300005457Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaGHost-AssociatedOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005890Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006178Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-2Host-AssociatedOpen in IMG/M
3300006237Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2)Host-AssociatedOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300009011Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-4 metaGHost-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009081Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm May2015EnvironmentalOpen in IMG/M
3300009095Agricultural soil microbial communities from Utah to study Nitrogen management - Steer compost 2015EnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009111Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1EnvironmentalOpen in IMG/M
3300009131Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Open_0915_D1EnvironmentalOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009168Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015EnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300009597Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299EnvironmentalOpen in IMG/M
3300009691Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2EnvironmentalOpen in IMG/M
3300009789Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot28EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010037Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25EnvironmentalOpen in IMG/M
3300010041Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot104AEnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010044Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot60EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011402Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT830_2EnvironmentalOpen in IMG/M
3300011416Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT551_2EnvironmentalOpen in IMG/M
3300011427Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT418_2EnvironmentalOpen in IMG/M
3300011430Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT600_2EnvironmentalOpen in IMG/M
3300011431Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012038Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT800_2EnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012514Unplanted soil (control) microbial communities from North Carolina - M.Soil.1.old.130510EnvironmentalOpen in IMG/M
3300012670Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT293_2EnvironmentalOpen in IMG/M
3300012893Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S059-202B-1EnvironmentalOpen in IMG/M
3300012899Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S058-202B-2EnvironmentalOpen in IMG/M
3300012902Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S169-409C-1EnvironmentalOpen in IMG/M
3300012903Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S134-311R-1EnvironmentalOpen in IMG/M
3300012907Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1EnvironmentalOpen in IMG/M
3300012941Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013100Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaGHost-AssociatedOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014256Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_TuleB_D2EnvironmentalOpen in IMG/M
3300014300Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D1EnvironmentalOpen in IMG/M
3300014321Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D1EnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014487Bulk soil microbial communities from Mexico - Magueyal (Ma) metaGEnvironmentalOpen in IMG/M
3300014832Activated sludge bacterial and viral communities from EBPR bioreactors in Brisbane, Australia - M90108EngineeredOpen in IMG/M
3300014872Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT790_16_10DEnvironmentalOpen in IMG/M
3300014874Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_2_16_10DEnvironmentalOpen in IMG/M
3300014875Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_1_16_10DEnvironmentalOpen in IMG/M
3300014967Freshwater microbial mat microbial communities from Canadian High Arctic Lake 9K, Kuujjuarapik, Canada - Sample L9KaEnvironmentalOpen in IMG/M
3300015249Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT293A_16_10DEnvironmentalOpen in IMG/M
3300015258Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1DaEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018432Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 TEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019884Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2s2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025920Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025932Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025936Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025959Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026071Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_TuleA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026090Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026111Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberrySE_CattailB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026827Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G09A4-11 (SPAdes)EnvironmentalOpen in IMG/M
3300027027Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027041Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027068Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027332Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027637Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027665Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027675Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027695Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027723Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027761Agave microbial communities from Guanajuato, Mexico - As.Sf.rz (SPAdes)Host-AssociatedOpen in IMG/M
3300027792Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 1-3cm May2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027871Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Open_0915_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300027877Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027948Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027956Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm May2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027964Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeqEnvironmentalOpen in IMG/M
3300027979Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028589Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day1EnvironmentalOpen in IMG/M
3300029799Metagenomes from anaerobic digester of solid waste, Toronto, Canada. Combined Assembly of Gp0238878, Gp0238879, Gp0242100, Gp0242119EngineeredOpen in IMG/M
3300029987I_Fen_E3 coassemblyEnvironmentalOpen in IMG/M
3300030000I_Fen_N3 coassemblyEnvironmentalOpen in IMG/M
3300030002II_Fen_N1 coassemblyEnvironmentalOpen in IMG/M
3300031184Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031232Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3EnvironmentalOpen in IMG/M
3300031521III_Fen_E2 coassemblyEnvironmentalOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031726Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_1EnvironmentalOpen in IMG/M
3300031847Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4EnvironmentalOpen in IMG/M
3300031903Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-1Host-AssociatedOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032003Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1EnvironmentalOpen in IMG/M
3300032004Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-3Host-AssociatedOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300034071Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME17Oct2008D10-rr0110EnvironmentalOpen in IMG/M
3300034177Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M
3300034660Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R2 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034661Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034692Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - A1A0.3EngineeredOpen in IMG/M
3300034818Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_3Host-AssociatedOpen in IMG/M
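
The sample rows above have lost their column separators. The following is a minimal Python sketch for splitting one such row back into fields. It assumes a layout inferred only from the rows shown here, not from any NMPFamsDB specification: a 10-digit taxon ID, an optional "(restricted)" flag, the sample name, one of three ecosystem categories, and the trailing "Open in IMG/M" link text. The function name `parse_sample_row` is hypothetical.

```python
import re

# Assumed row layout (inferred from the listing, not documented anywhere):
# <10-digit taxon ID>[ (restricted)]<sample name><category>Open in IMG/M
SAMPLE_ROW = re.compile(
    r"^(?P<taxon_id>\d{10})"
    r"(?P<restricted> \(restricted\))?"
    r"(?P<name>.+)"
    r"(?P<category>Environmental|Host-Associated|Engineered)"
    r"Open in IMG/M$"
)


def parse_sample_row(line):
    """Split one flattened sample row into a dict, or return None if it does not match."""
    match = SAMPLE_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "taxon_id": match.group("taxon_id"),
        "restricted": match.group("restricted") is not None,
        "name": match.group("name").strip(),
        "category": match.group("category"),
    }
```

Rows that use a category outside the three listed in the pattern return `None` rather than a guess, so unexpected rows can be inspected by hand.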


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
4MG_047903002170459019Switchgrass, Maize And Miscanthus LitterMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQIISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLRQIVAHRGYGEAIDCYQEFICKTAYRHKHDVTLIRTAGHLLHRSHGQIRIADL
JGI1027J11758_1304505113300000789SoilMLHQPXYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYV
C687J26631_1004823413300002124SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0006572J49612_106897113300003331Ionic Liquid And High Solid EnrichedMPLGRRWQQIDVNCFWVLEQDRESYNREVYLPDAYIEAIINVGAPLMLESGYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWVVKPILNIDPDPSTARVIALDADWQRFAEYLAQIVAHRGYGEAIACLQEYVCKTAYRHKHDLTFIRTAGRLLLRSRGQIRM
JGI25407J50210_1017690313300003373Tabebuia Heterophylla RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDXTLI
Ga0055468_1003580013300003993Natural And Restored WetlandsMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0055436_1025694713300004024Natural And Restored WetlandsVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIYCYQEYVCKAAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQCHLSSSQLERQFKHYT
Ga0055500_1015458313300004062Natural And Restored WetlandsMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMA
Ga0065720_113953713300005281Rice-Straw Enriched CompostMPLGRRWQQIDVNCFWVLEQDRESYNREVYLPDAYIEAIINVGAPLMLESGYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWVVKPILNIDPDPSTARVIALDADWQRFAEYLAQIVAHRGYGEAIACLQEYVCKTAYRHKHDLTF
Ga0065704_1036744523300005289Switchgrass RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTL
Ga0065705_1116709813300005294Switchgrass RhizosphereESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLAAQIYLSSSQFDRQFKHYTAISPKAYARIVRFGSLQAALLVNPSI
Ga0065707_1002920443300005295Switchgrass RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYR
Ga0070683_10224549413300005329Corn RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYVEVIINVGAPLMLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMKLYPWAVKPILNIDADPSTVHVIGLDADWQRFVDDLTQIVAHRGYGEAIACYQEYVCKMAYRHKHDVMLIRTAGHLLHHSHGQIRMADLAAQSYLSS
Ga0066388_10643248723300005332Tropical Forest SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRG
Ga0070666_1048968223300005335Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGDAIDCYQEFVCKTAYRHKHDITLIRTAGQLLHRSHGQIR
Ga0070674_10119403513300005356Miscanthus RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVLINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAG
Ga0070673_10179776313300005364Switchgrass RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDITLIRTAGHLL
Ga0070662_10191250913300005457Corn RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYG
Ga0070707_10143609113300005468Corn, Switchgrass And Miscanthus RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGEAI
Ga0070686_10072392313300005544Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDITLIRTAGHLLHRSHGQIRMADLAAQ
Ga0070664_10102897023300005564Corn RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0068860_10066056233300005843Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYR
Ga0075285_103235713300005890Rice Paddy SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAINCYQEYVCKTTYRHKHDVTLIRTAGHLLHRSHGQIRMADLAAQSYLSS
Ga0070717_1103423213300006028Corn, Switchgrass And Miscanthus RhizosphereMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAVHLLHCSHGQIRMADLAAQ
Ga0075029_10130838113300006052WatershedsMLHQLKIAAVPMLAADVNCFWTLEQDQGSYNNEVFLPDSYTEVIINVGASPLLETENGLIEIPRAFVNPLQNKPLRFRTTGYCQMISMQLYPWSPKPILNIDADPSTVHIIGLDSEWQRFADYLTQVVAHRGYEEAINCYQEYVCTI
Ga0070716_10091106823300006173Corn, Switchgrass And Miscanthus RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYVEVVINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRQKHDLMLIRTAGYLLHRSHGQIR
Ga0075367_1063319813300006178Populus EndosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILTIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLYRSHGQIRMTDLAAQSHLSSSQ
Ga0097621_10146225613300006237Miscanthus RhizosphereMFHQPIYAPVPMLAADVNCFWALEQDQESYNREVNLPDAYIEVMINVGAPLLLESEHGMFELPRAFVNPLQNKPLRIRAAGLCQIISMKLYPWAVKPILNIEANPSSVHVIGLDAGWQRFADDLTRIVAHQGYAEAIDCYQEYVCNTAYRHKHDLMLIRTAGHLLQRA
Ga0075430_10078043513300006846Populus RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRG
Ga0079217_1111627013300006876Agricultural SoilMLNQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGY
Ga0075426_1084576823300006903Populus RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESFNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0105251_1035652623300009011Switchgrass RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSAVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRLKHDITLIRTAGHLLHRSHGQIRIAD
Ga0066710_10117262923300009012Grasslands SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDSYIEVIINVGAPLVLESEHGKLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIPMTDLATQSHLSSSQLERQFKHYTAISPKAYARIVRFGSLQASLLVNPSD
Ga0105098_1000685713300009081Freshwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHMIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIR
Ga0079224_10381662713300009095Agricultural SoilMRHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYVEVVINVGAPLMLESAYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIACYQEFVCKTAYRHKHDLTLIRTAGHLLHRS
Ga0105245_1241545513300009098Miscanthus RhizosphereMLHQPTYAPVPRLAADVNCFWALEQDQASYNREAHLPDAYIEVIINVGAPLVLERDHGMLELPRAFVNPLQNKPLRIRAAGCCQMISMQLYPWAVKPILNIEADPSRVHVIGLDADWQRFADDLTRIVAHRGYEEAVDCYQEYVCKTAYRHKHDITLIRTAGHL
Ga0115026_1129726213300009111WetlandMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRG
Ga0115027_1030674113300009131WetlandMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDVDPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIECYQEYVCKTA
Ga0105243_1271645513300009148Miscanthus RhizosphereAPMPMLAADVNCFWVLEQEQEAINNEEFLPDSYIEVMITAGAPLLLETTSGLVELPRAFMNPIQNKPLRLRATGYCQAISMKLYPWAVKPILNIDADPSNMHIIGLDANWQRFADDLALIVAHRGYAEAIDCFQDYVCKIAYRSKHDVTPIRTAGHLLRRSQGQIRMTDLAAQSYLSS
Ga0075423_1261021313300009162Populus RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHD
Ga0105104_1089697813300009168Freshwater SedimentMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRVAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFAEYLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHL
Ga0105242_1184773913300009176Miscanthus RhizosphereMAHQPKIAPVPLLAADVNCFWTLEQDQNTYNNEEFLPHFYAEVVIVTGAPLLLETENGMVELPRAFVNPIQNKPLRFRAVGYCQTLAMKLYPWALKPILNIDADPSNVHVIGLDSQWQHFANDLTQIVIHQGYEEAIHCFQDYICKIAYRGKHDIVPIRTAGHMLRRSQGQICMTDLAAQSYLSASQFERR
Ga0105248_1241659013300009177Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPVLNIEADPSTVHVIGLDADWQRFAEYLAQIVAHR
Ga0105237_1171034713300009545Corn RhizosphereMLHQPIYAPVPILAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAIKPILNIEADPSDVHVIGLDADWQRFADDLTRIVAHRGYEEAVDCYQEYVCKRAYRHKHDITLIRTAGNLLQSS
Ga0105238_1207643623300009551Corn RhizosphereMLGDVHLPDAYIEVIINVGAPLVLERDHGMLELPRAFVNPLQNKPLRIRAAGCCQMISMQLQPWAVKPILNIEADPSTVHVIGLDASWQRFAADLTGIVAHRGYEEAIGCYQEYVCKTAYRHKHDVMLIRTAGHLLQRSHGQIR
Ga0105259_112483513300009597SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVVINVGAPLMLESEHGMLGLPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLI
Ga0114944_127227323300009691Thermal SpringsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQHKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKH
Ga0126307_1146930413300009789Serpentine SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNRKVYLPDAYIEVIINVGAPLVLESEHGMFELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAV
Ga0126374_1141956113300009792Tropical Forest SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVLINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIEADPSDVHVIGLDADWQRFAEYLTQVVAHRGYGE
Ga0126304_1063773413300010037Serpentine SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGTLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRM
Ga0126312_1079596713300010041Serpentine SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQVYPWAVKPILNIDADRSTVHVIGLDADWQRFADELTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDLTLIRT
Ga0126380_1076813123300010043Tropical Forest SoilMAHQPKIAPVPKLAADVNCFWTLEQDQDTYNNEEFLPHFYAELVIVTGAPLLLETENGMVELPRAFVNPIQNKPLRFRAVGYCQTLAMKLYPWALKPILNIDADPSNVHVIGLDSEWQHFANDLTRIVIHQGYEEAINCFQDYICKIAYRGKHDVVPIRTAGHMLRRSQGQICMTDLVAQSYLSASQFERRFKQYTGVSPKT
Ga0126310_1069583513300010044Serpentine SoilMLAADVNCFWALEQDQESYNREVYLPDAYVEVVINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADALTQIVAHRGYGDAIDCYQEFVCKTAYRHKHNL
Ga0126372_1147875613300010360Tropical Forest SoilMLAADVNCFWALEQDRESYNREVYLPDAYIEVMINVGAPLMLEREYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAVDCYQEFVCKTAYRHKHDVMLIRTAGH
Ga0126372_1308487313300010360Tropical Forest SoilQASYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQHKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFAYDLTQIVARRGYGEAIDCYQEYVCKTAYRHKHDITLIRTAGQLLHRSHGQIRMTDLAAQS*
Ga0126377_1074841313300010362Tropical Forest SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQKISMQLYPWAVKPILNIDAHPSTVHVIGLDADWQRFADDLTQIVAHRGYAEAIDCYQEFV
Ga0105239_1111238113300010375Corn RhizosphereMLAADVNCFWALEQDQKSYNREVNLPDAYIEVIINVGAPLLLESEHGMLELPRAFVNPLQNKPLRIRAVGLCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFAEYLAQNVAHRGYGEAVDCYQEYI
Ga0134124_1209439013300010397Terrestrial SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0134127_1106595313300010399Terrestrial SoilMFHQPKFAPVPKLAADVNCFWALEQDQESYNREDFLPDSFAEVVINVGAPLMLETERGLLELPRAFVNPIQNKPLRFRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYAEAIDCYQEYVCKTAYRHKHDLML
Ga0134122_1230563413300010400Terrestrial SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGY
Ga0134121_1093044913300010401Terrestrial SoilMLAADVNCFWALEQDQESYNRQEFVPDSYIEVVINVGAPLMLETESGLLALPRAFVNPIQNKPLRFRAAGFCQMISMKLYPWAVKPILNIDADPSTVHVIGLDAEWQHFAHDLMQMVAHHGYAEAIDRYQEYVSEIAYQHKHDVMPIRIAGHLLRRSQGQIRMTDLAAQSHLSASQLERQFKRYAAISPKAYARI
Ga0137356_101865613300011402SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYWHKHDITLIRTAGLLLHRSHGQIR
Ga0137422_108755823300011416SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVAAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYR
Ga0137448_108581213300011427SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIQADPSTVHVIGLDADWQRFADDLTQIVAHRGYEEAIDCYQEYVCKTAYRHKHDLMLIRTAGHLLH
Ga0137423_107101913300011430SoilMLNQPTYAPVPMLAADVNCFWALEQDQESQNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDLMLIRSAGHLLHRSHGQIRMTDLAAQSYLSSS*
Ga0137438_107280323300011431SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCIKSMSVKLLIGISTISR*
Ga0137445_108902713300012035SoilMLHQPTYAPVPVLAADVNCFWALDQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHD
Ga0137431_117476023300012038SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQHFADDLTQIVAHRGYGE
Ga0137379_1116992723300012209Vadose Zone SoilMLAADVNCFWALEQDKESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIATHRGYGEAIDCYQEFVCKT
Ga0150985_11894971413300012212Avena Fatua RhizosphereMLHQPIYAPVPRLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLLLESEYGMLELPRAFVNPLQKKPLRIRATGFCQMISMQLYPLAAKPILNSDADPSTVHVISLDADWQRFANDLTQIVAHRGYGEA
Ga0137434_101389713300012225SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYSWAVKPILNIDPDPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEF
Ga0150984_10957800613300012469Avena Fatua RhizosphereMIHQPKIIPDPMLAADVHCFWSLEQDQDSYNLEAFLPDSFTEVVINVGAPLLLETESGLLDIPRAFVNPLQDKPLRFRTTGYCQMISMKLYPWALKPILNIDADPSTVHIIGLDSEWQRFADYLTQVVAHHGYEEAISCYQEYVCNIAYKGKHD
Ga0150984_11277768913300012469Avena Fatua RhizosphereMLHQPIYAPVPLLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLETERGMLELPRAFVNPLQNKPLRIRAAGFCQIISMQLYPWAVKPILNIEPDPSTVHVIGLDAGWQRFADDLARIVAHRGYAEAIDCYQEFVCKTAYRHKHDVMLIRTAGHLLHQSH
Ga0157330_102746713300012514SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAH*
Ga0137335_101145533300012670SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVRPILNIDADPSTVHVIRLDADWQRFADDLTQIVAHRGYGEAIDC
Ga0137335_101726213300012670SoilMLYQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLHPWAVKPILNIDADPSNVHVIGLDIDWQRFADDLTQIVAHRGYGE
Ga0157284_1003635013300012893SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVNLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLTHIRTAGHLLHRSHGQIRMADLAAQSYLSSSQ
Ga0157299_1027110213300012899SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEYGRLELPRAFVNPLQNKTLRIRAAGFCQMISMQLYPWAVKPILNIEADPSTMHVMGLDADWQRFADDLTQIVAHRGYGEA
Ga0157291_1032007013300012902SoilMFHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDANWQRFADDLTQIVAHRGYEEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSH
Ga0157289_1011666613300012903SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLLLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSSVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFICKTAYRHQHDL
Ga0157283_1002098213300012907SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQHKPLRIRAAGLCQMISMQLHPWAVRPILNIDADPSTVHVIGLDAAWQRFAEDLTQIVAHR
Ga0162652_10010501513300012941SoilMLHQPTYAPVPMLAADVNCFWALEQDQESHNREVYLPDAYIEVIINVGAPLVLESERGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0126375_1087260913300012948Tropical Forest SoilMLAADVNCFWTLEQDQDTYNNEEFLPHFYAELVIVTGAPLLLETENGMVELPRAFVNPIQNKPLRFRAVGYCQTLAMKLYPWALKPILNIDADPSKVHVIGLDSEWQHFANDLTQIVIHQGYEEAIHCFQDYICKIAYRGKHDIAPIRTAGHMLRRSQGQICMTDLAAQSNLSASQFERRFKQY
Ga0164305_1085698323300012989SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMVESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYQHKHDVTLIRTA
Ga0157373_1057790413300013100Corn RhizosphereMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLLRTAGHLLHRSHGQIRIAD
Ga0134079_1052613913300014166Grasslands SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFINPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLAAQSY
Ga0075318_105150623300014256Natural And Restored WetlandsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLKQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLL
Ga0075321_107445713300014300Natural And Restored WetlandsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAVGFCQMISMQLYPWAVKPILNIDADSSTVHVIGLDADWQRFADDLMQIVAHRGYG
Ga0075353_104861313300014321Natural And Restored WetlandsMLAADVNCFWALEQDQEIYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILKIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLAAQSYLSSSQLERQFKHYTAISP
Ga0157380_1022218913300014326Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRS
Ga0182000_1044426113300014487SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFANDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRIADLA
Ga0119905_119970713300014832Activated SludgeMLASDVNCFWALEQDQESYNREVYLPDAYVEVLINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNLEADPSTVHVIGLDAGWQRFADYLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLTLIRTAGHLLH
Ga0180087_104218613300014872SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVPH*
Ga0180084_111887413300014874SoilMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLLRTAGHLLHCSHGQIRMADLAAQ
Ga0180083_102771523300014875SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTHIVAHRGYGEAIDCYQEFVCKTAYR
Ga0182827_1003284313300014967Microbial MatMLNQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVLINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTVGHLLHRSHGQIRMADLAA
Ga0180071_101749623300015249SoilMLAADVNCFWALEQDQESYNREVYLPDSYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0180093_113856213300015258SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYVEAIDCYQEFVCKTAYRHKHDLMLIPTAGHL
Ga0180085_104977713300015259SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGLLLHRSHGQIRIADLAAQSY
Ga0184604_1012819413300018000Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAVGFCQMISMQLHPWAVKPILNIDADPSNVHVIRLDIDWQRFADDLTLIVAHRGYG
Ga0184617_112871923300018066Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVRLILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDC
Ga0184632_1024751613300018075Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHD
Ga0184612_1026712113300018078Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGY
Ga0190265_1201989713300018422SoilMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVVINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDADWQRFADDLTHIVAHRGYGDAIDCY
Ga0190275_1151708323300018432SoilMLHQPTYAPVPLLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHR
Ga0190268_1090914713300018466SoilQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLATQSHLSSSQLERQFKHYTAISPKAYARIVRFGSLQASLLVNPSIRLVDLADVY
Ga0184646_133195413300019259Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLHPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRIADL
Ga0173479_1017469813300019362SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGVPLGLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIEADPSTVHVIALDDKWQRFADGLTQIVAHRGYGDAIDCYQEFVCKTA
Ga0187892_1037838323300019458Bio-OozeMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPHQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLAT
Ga0193741_106358913300019884SoilMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGQLLHRSHG
Ga0210378_1011436713300021073Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRT
Ga0210380_1051245713300021082Groundwater SedimentHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRAHGQIRIADLAAQSYLSS
Ga0222622_1082186613300022756Groundwater SedimentMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRIADLAAQ
Ga0207649_1088027823300025920Corn RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQQYVCKT
Ga0207652_1180084313300025921Corn RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNINADPSTVHVIGLDADWQRFADDLTQIVAHRGYG
Ga0207690_1176796513300025932Corn RhizosphereMRHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTL
Ga0207686_1179791113300025934Miscanthus RhizosphereMLHQPTYDPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLL
Ga0207670_1074905913300025936Switchgrass RhizosphereMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRM
Ga0210116_112295513300025959Natural And Restored WetlandsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEF
Ga0207668_1080376113300025972Switchgrass RhizosphereMLHQPIFDPVPMLAADVNCFWALEQDQESYNREVYLPDAYVEVVINIGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTL
Ga0207640_1166520613300025981Corn RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVT
Ga0207703_1105916213300026035Switchgrass RhizosphereMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHVLHRSHG
Ga0208537_103683213300026071Natural And Restored WetlandsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLKQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLTLLRTAGHLLHRSHGRIRIADLAAQSYLSSSQFER
Ga0207702_1224923213300026078Corn RhizosphereMLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDITLIR
Ga0208912_106090313300026090Natural And Restored WetlandsMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLAT
Ga0207676_1083065333300026095Switchgrass RhizosphereMLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYG
Ga0208291_10681631 3300026111 Natural And Restored Wetlands MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLMQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLL
Ga0207591_1102741 3300026827 Soil NCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHNVTLIRTAGHLLHRSHGQIRMADLAAQSYL
Ga0209844_10037231 3300027027 Groundwater Sand MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQIHLSSSQLERQFKHYTAISPKAYARIVRFGSLQASLLVNPSI
Ga0209876_10133401 3300027041 Groundwater Sand MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLML
Ga0209898_10530531 3300027068 Groundwater Sand PIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRIADLAAQS
Ga0209861_10424082 3300027332 Groundwater Sand MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQ
Ga0209818_12302981 3300027637 Agricultural Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNINADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAY
Ga0209983_11471641 3300027665 Arabidopsis Thaliana Rhizosphere MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLMQIVAHRGYGEAIDCYQEYVCKT
Ga0209077_11213121 3300027675 Freshwater Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIHAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRT
Ga0209966_10400941 3300027695 Arabidopsis Thaliana Rhizosphere MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMA
Ga0209703_12881081 3300027723 Freshwater Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTA
Ga0209462_100984561 3300027761 Agave MLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQSHLSSSQLERQFKHYTAISPKAYA
Ga0209287_101938322 3300027792 Freshwater Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYREAIDCYQEFVCKTAYRH
Ga0209397_101857673 3300027871 Wetland MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLEVPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRM
Ga0209293_103645581 3300027877 Wetland MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVVHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHL
Ga0209481_102431351 3300027880 Populus Rhizosphere MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHRSH
Ga0209858_10184091 3300027948 Groundwater Sand MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHAMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAY
Ga0209820_11326222 3300027956 Freshwater Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHMIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRIADLAAQSYLS
Ga0256864_10558583 3300027964 Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIR
Ga0209705_101371101 3300027979 Freshwater Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLI
(restricted) Ga0233417_103223481 3300028043 Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGTLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPLAVKPILNIDADPSTVHVIGLDAGWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDLMPIRTAGHLLHRSHGQIRMADLAAQSYLSSSQLERQF
Ga0247818_113779121 3300028589 Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTACRHKHDVT
Ga0311022_146042271 3300029799 Anaerobic Digester Digestate MLAPDVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRATGFCQMISMQLYPWAVKPIMNIDADPSTVHVIGLDADWQRFADDLTQIVAHRCYGEAIDCYQEYGCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQIHLSSSQLERQ
Ga0311334_111681191 3300029987 Fen MIHQPKIIPAPMLAADVNCFWALEQDQETFNDEEFLPDSYIEVMIATGAPLMLETTSGLVELPRAFMNPIQNKPLRFRATGYCQAISMKLYPWAVTPILNINADPSNMHVIGLDADWQRFAADLTLIVAHRGYAEAIDCFQDYVCKIAYRGKHDVTPIRTAGHLM
Ga0311337_107704353 3300030000 Fen MRHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLLLESEHGMAELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSSVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYR
Ga0311350_113366701 3300030002 Fen MLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPLAVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGQLLHG
Ga0307499_101277962 3300031184 Soil MLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVVINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTGIVAHRGYGEAIDCYQEFVCKTAYRHKHDVMLMRTAGHLLHRSHGQIRMADL
(restricted) Ga0255310_101831841 3300031197 Sandy Soil MLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLLLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIEADPSTVHVIGLDANWQRFAEYLTQIVAHRGYGEAIGCYQEFVCKTAYRHKHDVT
Ga0302323_1033908381 3300031232 Fen MLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLLLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSPVHVIGLDANWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTL
Ga0311364_116511491 3300031521 Fen MIHQPKITPAPMLVADVNCFWVLEQDQETFNDEEFLPDSYIEVMIATGAPLMLETTSGLVELPRAFMNPIQNKPLRFRATGYCQAISMKLYPWAVTPILNINADPSNMHVIGLDADWQRFAADLTLIVAHRGYAEAIDCFQDYVCKIAYHGKHDVTPIRTAGHLMRRS
Ga0310888_104134701 3300031538 Soil MLHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLATQSHLSSSQLERQFKHYTA
Ga0310813_112035772 3300031716 Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHK
Ga0302321_1018722321 3300031726 Fen MRHQPIYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVMINVGAPLLLESEYGMLELPRAFVNPLQNKPLRIRVAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLRQIVAHRGYGEAIDCYQEFVYKRAYRHKHDVML
Ga0310907_104214961 3300031847 Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNMDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIGCYQEFVCKTAYRHKHDLMPIRTAG
Ga0307407_113397531 3300031903 Rhizosphere MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTA
Ga0310885_103736732 3300031943 Soil MLHQPTYAPVPMLAADVNCFWALAQDQESYNREVYLPDAYIEVMINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCK
Ga0310897_105519511 3300032003 Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNQPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRS
Ga0307414_106805992 3300032004 Rhizosphere MLNQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYVEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLMLIRTAGHLLHRSHGQIRMADLAA
Ga0326726_110864141 3300033433 Peat Soil MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHRMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGRLLH
Ga0335028_0352324_3_413 3300034071 Freshwater MLHQPIFAPVPILAADVSCFWALEQDQESYNREVYLPDAFIEVIINVGAPLLLESEHGLLELPRAFVNPLQNKPLRIRTAGFCQMISMQLYPWAVKPILNIEADPSTVHVIGLDADWQRFADDLTQIAAHRGYEEAI
Ga0364932_0125094_2_511 3300034177 Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVTLIRTAGHLLHSSHG
Ga0364934_0117172_484_1002 3300034178 Sediment MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDVRLIRTAGQLLHRSHGQIR
Ga0314781_100446_2_547 3300034660 Soil MLHQPTHAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRATGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLTLIRTAGHLLHRSHGQIRMADLAAQSC
Ga0314782_208888_3_503 3300034661 Soil MLHQPMYAPAPMLAADVSCFWALEQDQESYNREVYLPDAYIEVIINVGAPLMLESEYGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDAGWQRFADDLTQIVARRGYGEAIDCYQEFVCKTAYRYKHDVTLLRTAGHLLHR
Ga0373917_0011492_1_561 3300034692 Sediment Slurry MLHQPTYAPVPMLAADVNCFWALEQDQESYNREVYLPDAYIEVIINVGAPLVLESEHGMLELPRAFVNPLQYKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEYVCKTAYRHKHDVTLIRTAGHLLHRSHGQIRMTDLATQCHLSSSQ
Ga0373950_0134446_58_555 3300034818 Rhizosphere Soil MLAADVNCFWALEQDQESYNREVYLPDAYIEVLINVGAPLVLESEHGMLELPRAFVNPLQNKPLRIRAAGFCQMISMQLYPWAVKPILNIDADPSTVHVIGLDADWQRFADDLTQIVAHRGYGEAIDCYQEFVCKTAYRHKHDLRLIRTAGHLLHRSHGQIRMADL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice