NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F038326



Overview

Basic Information
Family ID F038326
Family Type Metagenome / Metatranscriptome
Number of Sequences 166
Average Sequence Length 213 residues
Representative Sequence MNINCEDRDRIFEDGTPAEWAALEAHSANCAVCSEELRAWKAISVAAKEMRDYSDSPSLWPRIERALTAEAAAKTHRAQRWSWLSLGLGLSLGWQTAAAAALVLILTVSAGWVYLHRTGPVSDRDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Number of Associated Samples 105
Number of Associated Scaffolds 166

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 90.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.60 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42


Most Common Taxonomy
Group Unclassified (93.976 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(32.530 % of family members)
Environment Ontology (ENVO) Unclassified
(51.807 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(75.904 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 70.16%    β-sheet: 0.81%    Coil/Unstructured: 29.03%

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.42

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
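
The confidence rule stated above can be written as a short check. This is a minimal Python sketch, not NMPFamsDB code: the function and constant names are hypothetical, and only the 0.7 pTM cutoff and the four pLDDT legend ranges come from this page. How non-integer pLDDT values between two ranges are assigned is an assumption.

```python
# Hypothetical helper names; only the cutoff and bin ranges are taken from the page.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model falls under the page's low-confidence rule (pTM < 0.7)."""
    return ptm < PTM_LOW_CONFIDENCE_CUTOFF


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend ranges.

    Assumption: values between two ranges (e.g. 50.5) go to the higher range.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT is expected to lie in [0, 100]")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


if __name__ == "__main__":
    # pTM-score reported for family F038326 on this page
    print(is_low_confidence(0.42))
```

With the pTM-score of 0.42 reported for this family, the check returns True, which matches the "Low Quality Model" label.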



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 166 Family Scaffolds
PF08281    Sigma70_r4_2    35.54
PF00293    NUDIX    4.22
PF11611    DUF4352    4.22
PF13620    CarboxypepD_reg    3.61
PF01255    Prenyltransf    3.61
PF00266    Aminotran_5    2.41
PF13601    HTH_34    2.41
PF13345    Obsolete Pfam Family    1.81
PF06271    RDD    1.81
PF10099    RskA    1.81
PF00324    AA_permease    1.20
PF07244    POTRA    1.20
PF00781    DAGK_cat    1.20
PF01135    PCMT    1.20
PF00535    Glycos_transf_2    1.20
PF01642    MM_CoA_mutase    0.60
PF04357    TamB    0.60
PF16757    Fucosidase_C    0.60
PF13116    DUF3971    0.60
PF00282    Pyridoxal_deC    0.60
PF00696    AA_kinase    0.60
PF01957    NfeD    0.60
PF01381    HTH_3    0.60
PF15902    Sortilin-Vps10    0.60
PF09832    DUF2059    0.60
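
Because the frequencies above are given as percentages of the 166 family scaffolds, they can be converted back to approximate scaffold counts. The following is a minimal Python sketch, assuming each percentage was computed as count / 166 and rounded to two decimals. The helper name is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SCAFFOLDS = 166  # total taken from the table header


def scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Approximate number of scaffolds behind a reported percentage.

    Assumption: percent == 100 * count / total, rounded to two decimals.
    """
    return round(percent / 100 * total)


if __name__ == "__main__":
    # Three rows copied from the Pfam table above
    for pfam_id, percent in [("PF08281", 35.54), ("PF00293", 4.22), ("PF01642", 0.60)]:
        print(pfam_id, scaffold_count(percent))
```

Under that assumption, 35.54 % corresponds to 59 scaffolds, 4.22 % to 7, and 0.60 % to a single scaffold.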

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 166 Family Scaffolds
COG0020    Undecaprenyl pyrophosphate synthase    Lipid transport and metabolism [I]    3.61
COG1597    Phosphatidylglycerol kinase, diacylglycerol kinase family    Lipid transport and metabolism [I]    2.41
COG1714    Uncharacterized membrane protein YckC, RDD family    Function unknown [S]    1.81
COG0531    Serine transporter YbeC, amino acid:H+ symporter family    Amino acid transport and metabolism [E]    1.20
COG0833    Amino acid permease    Amino acid transport and metabolism [E]    1.20
COG1113    L-asparagine transporter or related permease    Amino acid transport and metabolism [E]    1.20
COG1115    Na+/alanine symporter    Amino acid transport and metabolism [E]    1.20
COG2226    Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG    Coenzyme transport and metabolism [H]    1.20
COG2518    Protein-L-isoaspartate O-methyltransferase    Posttranslational modification, protein turnover, chaperones [O]    1.20
COG2519    tRNA A58 N-methylase Trm61    Translation, ribosomal structure and biogenesis [J]    1.20
COG4122    tRNA 5-hydroxyU34 O-methylase TrmR/YrrM    Translation, ribosomal structure and biogenesis [J]    1.20
COG0076    Glutamate or tyrosine decarboxylase or a related PLP-dependent protein    Amino acid transport and metabolism [E]    0.60
COG1884    Methylmalonyl-CoA mutase, N-terminal domain/subunit    Lipid transport and metabolism [I]    0.60
COG2911    Phospholipid transport to the outer membrane protein TamB    Cell wall/membrane/envelope biogenesis [M]    0.60



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
Unclassified    root    N/A    93.98 %
All Organisms    root    All Organisms    6.02 %

Associated Scaffolds


Scaffold    Taxonomy    Length
3300005537|Ga0070730_10000420    All Organisms → cellular organisms → Bacteria    50167
3300006804|Ga0079221_10058221    All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae    1759
3300010343|Ga0074044_10000138    All Organisms → cellular organisms → Bacteria    71928
3300021404|Ga0210389_10093380    All Organisms → cellular organisms → Bacteria    2313
3300021407|Ga0210383_10000001    All Organisms → cellular organisms → Bacteria → Acidobacteria    568166
3300027795|Ga0209139_10000361    All Organisms → cellular organisms → Bacteria    15433
3300027884|Ga0209275_10028968    All Organisms → cellular organisms → Bacteria    2544
3300031090|Ga0265760_10001691    All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae    6483
3300031708|Ga0310686_111078947    All Organisms → cellular organisms → Bacteria    2118
3300033134|Ga0335073_10003776    All Organisms → cellular organisms → Bacteria    21163




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    32.53%
Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil    16.87%
Bog Forest Soil    Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil    6.63%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    6.63%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    5.42%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    4.82%
Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil    4.22%
Agricultural Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil    3.61%
Freshwater Sediment    Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment    2.41%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    2.41%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    2.41%
Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere    2.41%
Iron-Sulfur Acid Spring    Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring    1.81%
Peatlands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil    1.81%
Boreal Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil    1.20%
Freshwater Wetlands    Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands    0.60%
Watersheds    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds    0.60%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    0.60%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    0.60%
Tropical Peatland    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland    0.60%
Bog Forest Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil    0.60%
Plant Litter    Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter    0.60%
Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere    0.60%



Associated Samples

Taxon OID    Sample Name    Habitat Type
3300004080    Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3    Environmental
3300004082    Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3    Environmental
3300004091    Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3    Environmental
3300004092    Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3    Environmental
3300004152    Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3    Environmental
3300005169    Soil and rhizosphere microbial communities from Laval, Canada - mgHPA    Environmental
3300005537    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1    Environmental
3300005542    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1    Environmental
3300005591    Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1    Environmental
3300005602    Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2    Environmental
3300005712    Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4    Environmental
3300005921    Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6    Environmental
3300006052    Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013    Environmental
3300006176    Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5    Environmental
3300006755    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter    Environmental
3300006804    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200    Environmental
3300006806    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100    Environmental
3300006893    Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG    Environmental
3300006954    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control    Environmental
3300007982    Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG    Environmental
3300009698    Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_3_AS metaG    Environmental
3300010343    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1    Environmental
3300010379    Sb_50d combined assembly    Environmental
3300010876    Boreal forest soil eukaryotic communities from Alaska, USA - W5-5 Metatranscriptome (Eukaryote Community Metatranscriptome)    Environmental
3300010880    Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome)    Environmental
3300011270    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG    Environmental
3300012189    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012205    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG    Environmental
3300012361    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG    Environmental
3300012931    Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG    Environmental
3300017930    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5    Environmental
3300017936    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1    Environmental
3300017955    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2    Environmental
3300017959    Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MG    Environmental
3300018012    Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5    Environmental
3300020199    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)    Environmental
3300020579    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M    Environmental
3300020580    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M    Environmental
3300020581    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M    Environmental
3300020582    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O    Environmental
3300020583    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M    Environmental
3300021168    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M    Environmental
3300021171    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M    Environmental
3300021181    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O    Environmental
3300021401    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O    Environmental
3300021402    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O    Environmental
3300021404    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O    Environmental
3300021405    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O    Environmental
3300021406    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O    Environmental
3300021407    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O    Environmental
3300021420    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M    Environmental
3300021432    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M    Environmental
3300021433    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O    Environmental
3300021474    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O    Environmental
3300021475    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O    Environmental
3300021477    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O    Environmental
3300021478    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M    Environmental
3300021479    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M    Environmental
3300021559    Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M    Environmental
3300022557    Paint Pots_combined assembly    Environmental
3300023030    Soil microbial communities from Bohemian Forest, Czech Republic ? CSU2    Environmental
3300024227    Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic - CZU4    Host-Associated
3300026551    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)    Environmental
3300027109    Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF008 (SPAdes)    Environmental
3300027660    Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)    Environmental
3300027725    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)    Environmental
3300027737    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM3 (SPAdes)    Environmental
3300027765    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)    Environmental
3300027783    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM2 (SPAdes)    Environmental
3300027795    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 (SPAdes)    Environmental
3300027812    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)    Environmental
3300027829    Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes)    Environmental
3300027842    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)    Environmental
3300027857    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)    Environmental
3300027875    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)    Environmental
3300027879    Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes)    Environmental
3300027884    Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)    Environmental
3300027889    Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)    Environmental
3300027905    Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)    Environmental
3300027908    Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)    Environmental
3300028016    Rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE1    Host-Associated
3300028023    Rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE5    Host-Associated
3300028906    Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)    Environmental
3300029636    Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)    Environmental
3300030759    Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSU1 (Metagenome Metatranscriptome)    Environmental
3300030940    Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSI1 (Metagenome Metatranscriptome)    Environmental
3300031090    Metatranscriptome of rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZI1 (Metagenome Metatranscriptome)    Environmental
3300031240    Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-8-27 metaG    Host-Associated
3300031708    FICUS49499 Metagenome Czech Republic combined assembly    Environmental
3300031715    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05    Environmental
3300031718    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05    Environmental
3300031753    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515    Environmental
3300031754    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515    Environmental
3300031823    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05    Environmental
3300031962    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515    Environmental
3300032180    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515    Environmental
3300032515    FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)    Environmental
3300032770    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5    Environmental
3300032783    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3    Environmental
3300032828    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4    Environmental
3300032898    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.1    Environmental
3300032954    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2    Environmental
3300032955    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5    Environmental
3300033134    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2    Environmental





Family Sequences

Protein ID    Sample Taxon ID    Habitat    Sequence
Ga0062385_100357631    3300004080    Bog Forest Soil    LEAHSTNCAVCAEELRSWRALSAAAREMRDYSDTPSLWPRIERALAEEAAAKRNRSERWSWLSLGFGLSLGWQTAAAAALVLILTVSASWFYVHRTPSVDNRDQSLLRSPALAEVERSQAAYEQAIDKLAADAKPALENPASPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0062385_101555252    3300004080    Bog Forest Soil    MNMKCEDRDRVFEDGTPAEWAALETHSTNCAACAEELRDWKTLSVAAKELRDYSDTPSLWLRIEPALAEAAFKKQSAGRWSWLTLGFGLSLGWQTAAAAAMVVLLTVSAGWVYWHRPTPGNRGDKDQSLLKTPALAEVERTQAAYEQAIDKLALDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEGKR*
Ga0062384_1005842751    3300004082    Bog Forest Soil    MNITCEDRDRILEDGTPAEWAALEAHSANCAACAEELRAWKAISVAAKELRDYSDSPFLWPRIERALAAETAAKTHRAQRWSWLSLGLGFSLGWQTAAAAAFVLILTASVGWIYLHPTKPLPSPDLSLLKSPALAEVERTQAAYEQAIDKLAADAKPRLGNTATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0062387_1000134313    3300004091    Bog Forest Soil    MNLKCEDRDRIFEDGTPAEWAALEAHSANCLECTEELRAWKALSVAAKELRDYSDTPSLWLRIEPALAEAAAAKQRDAGRWSWLTLGFGLSLGWQTAMAAALVVLLTVSAGWVYLHPPRPGNRGDKDQSLLKSPALAEVERAQAAYEQAIDKLALNAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKRQTLQDILEEKR*
Ga0062389_1004169992    3300004092    Bog Forest Soil    MNMKCEDRDRVFEDGTPAEWAALETHSTNCAACAEELRDWKTLSVAAKELRDYSDTPSLWLRIEPALAEAAFKKQSAGRWSWLTLGFGLSLGWQTAAAAAMVVLLTVSAGWVYWHRPTPGNRGDKDQSLLKTPALAEVERTQAAYEQAIDKLALDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHL
Ga0062386_1000082202    3300004152    Bog Forest Soil    MNIKCEDRDRIFEDGTPAEWAALEAHSASCAVCAEELRSWKALSVAAQEMRDYSNWLSLWPRIERALAEEAAAKRNRSARWSWLSLGFGLSLGWQTTAAAALVLLLTVSAGWIYLLRPTSLTPADQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQARMNPSNAHLRQQLLAMYQEKQQTLEDILEGKR*
Ga0066810_101211341    3300005169    Soil    ISVAAKELRDYSGSPALWPRIEQALTAEAAAKKIRAWRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLPVKPVPNGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070730_1000042019    3300005537    Surface Soil    MNIKCEDRDKILAEGTPDEWAALEAHATICDACGEELRSWRALSGAAVEMREYSDTPSLWPRIERALVEQAAAQDRHSEGWGWLSFGRLFPLRLQTAAAAVLVLILTASAGWIYIHRQTPPVVQGDQSLLKSPALAEVERAQAAYERAIDKLAAQAKPQLENPTTPLQANYREKLLVLDSAINDLRAQTGMNPSNAHVRQQLLAMYQEKQQTLEDILEEKR*
Ga0070732_100465764    3300005542    Surface Soil    MNIKCEDRNRVFEDGTPAEWAALEAHSTSCAVCAEELLSWKALSVAARELRDYSDAPSLWPRIERSLTEEGAAQKQRAERWRWLSLGFGFSLGWQTAAAAAMVLILTVSGVWVYLHRTPPAGSADQSLLKSPALAEVERAQTAYEQAIDKLAAQAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHL
Ga0070761_101751172    3300005591    Soil    MNINCEDRDRIFEDGTPAEWAALEAHSANCAVCSEELRAWKAISVAAKEMRDYSDSPSLWPRIERALTAEAAAKTHRAQRWSWLSLGLGLSLGWQTAAAAALVLILTVSAGWVYLHRTGPVSDRDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070761_101876192    3300005591    Soil    AEELRSWKALSAAAKEMRDYSDAPSLWPRIERALTAEASAKQQRPERWRWLTFGFGLSLGWQTAAATALVLILTVSAVWVYEHPPTPVGRADQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070761_104231992    3300005591    Soil    MNIKCEDRDRIFEDGTPTEWAALEAHSASCAVCAEELRAWKAISVAAKELRDYSDTPSLWPRIERALTAEAAVKKHRTERWGWLSLGFGLSLGWQTAAVAAVVLILTVSTGWVYLHRTGRGSGSDRDQSLLKSPALAEVERAQAAYQQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLR
Ga0070762_100361343    3300005602    Soil    MNITCEDRDRIFLDGTPAEWAALEAHSGNCSVCTEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALTAEAARKKHRAEGWNWLSLGLGLSLGWQTAAAAALVLILTASAGWIYLHPPKSRPSPDLSLLKSSALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070762_100595381    3300005602    Soil    MNITCEDRDRIFEDGTPSEWAALEAHSATCPACAEELRAWKAISVAAKELRDYSDSPSLWPRIEHALAAEAQAKKHRTERWSWLSLGFGLSLGWQTALVAAMVVVLTVSGSWIYLHRTGRGPGSDRDQSLLKTPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070762_101685222    3300005602    Soil    MNIKCEDRDRIFEDGTPAEWAALEAHSVNCAVCAEELRSWKALSVAAKELRDYSDTPSLWPRIESALTEEAAAKKQRAERWSWLSLGFGLSLGWQTAGAAVLVLVLTVSAGWVYFHRIGFEPSPDQSLLKSSALADVERAQAAYEHAIDKLAADAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKK*
Ga0070762_101927612    3300005602    Soil    MNITCEDRDRIFHDGTPSEWAALEAHSANCAVCAEELRSWKALSVAAKELRDYSYTPSLWPRIERSLTEEAAAKKQRAERWGWLSLRFGLSLEWQTAAAAAFVLILTVSAGWVYLHRKPVADPGDQSLLKSRALAEVERAQAVYEQAIDKLATEAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070762_103931272    3300005602    Soil    SWKALSLAAHEMRDYSDTPSLWPSIERALSEEASAKKARVGRWGWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLQRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070762_105801772    3300005602    Soil    SWKALGAAAQEMRDYSDSPLLWSQIERALAAEAAAKNQRSGRWAWLSIGFGLSLGWQTAAAATLVLILTVSAGWLYLHRAGPGPGGDQSLLKSPALAEVERAQTAYVQAIDKLAAEAKPALENSATPFEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070764_100786532    3300005712    Soil    MNITCEDRDRIFEDGTPAEWAALEAHGVNCGICAEELRAWKAISVAAKEMRDYSDSPSLWPRIEQALTAETAAKTHRTQRWSWLSLGFGLSLGWQTAAAAALVLILTASAGWIYLHPTRPVPHPDLSLLKSPALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070766_102801382    3300005921    Soil    MNITCEDRDRIFLDGTPAEWAALEAHSGNCSVCTEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALTAEATRKKHRAEGWNWLSLGLGLSLGWQTAAAAALVLILTASAGWIYLHPPKSRPSPDLSLLKSSALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0075029_1009893901    3300006052    Watersheds    ELRAWKAISVAAKELRDYSDTPSLWPRIEPSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSGAWIYLPRGGVVDSTDQSLLKSPALAEVERAQAAYELAIDKLAADAKPQLDNPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070765_1000016733    3300006176    Soil    MNIACKDRDGIFEDGTPAEWAAFEAHSANCASCAEELRSWKALSVTAKELRDYSDTPSLWPSIESALTEEAAAKKQRAERWGWLSLGFGLSLGWQTAAAAALVLVLTVSAGWIYFHRIGPVPGGDQSLLKSSALANVERTQAAYEQAIDKLAADAKTELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070765_1000086833    3300006176    Soil    MNIICEDRDRIFEDGTPAEWAALEAHSANCAACAEELRAWKAISVAAKELRDDSDTPSLWPRIERSLIAEAAAKNQHAGRWSWLSLGFGLSLGWQTAAAAALVLVLSVSGAWIYVHRGGVVDSPDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0070765_1000155332    3300006176    Soil    MMNITCEDRDRIFEDGTPAEWSALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWPRIEHALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070765_1000236437    3300006176    Soil    MNIKCEDRDRIFEDGTPAEWAALEAHSVNCAVCAEELRSWKALSVAAKELRDYSDTPSLWPRIESALTEEAAAKKQRAERWSWLSLGFGLSLGWQTAGAAVLVLVLTVSAGWVYFHRIGFEPSPDQSLLKSSALADVERAQAAYEHAIDKLAADAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEK
Ga0070765_1001026502    3300006176    Soil    MNIKCEDRDRIFEDGTPAEWVALEAHSANCAVCAEELRSWKALSLAAHEMRDYSDTPSLWPRIERALSEEASAKKVQAGRWGWLSLGFGLSLGWQSAAAAALIVLLTVSAGWIYLHRPTSITPVDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0070765_1002254092    3300006176    Soil    MNIKCEDRDRIFEDGTPAEWAALEAHSASCVSCAKEIRGWKALSVAAKELRDYSETPSLWPRIGRALTEEAATRKKRPEGWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWVYWHRTGPAPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAAEAKPELENPATPLQTSYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYHEKQQTLQDILEEKR*
Ga0070765_1006838791    3300006176    Soil    MNIQCADRGRIFEDGTPAEWAAFEAHSANCASCAEELQGWKALSVQARELRDYSDTPSLWPRIARALTDEAATGKKRAERWGWLSLGFGLSLGWPTAAAAVLALILTVSAGWVYWHGTGPRPIGDQSLLKSPALAEVERTQTAYEQAIDKLAVEAKPELENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDVLEEKR*
Ga0079222_109116001    3300006755    Agricultural Soil    MNIKCEDRERILAEGTPAEWAALEAHATNCDECGEELRSWRALSAAAVEMREYSDTPSLWPRIERALVEQAAAQNLRSEGRGWLSFDRLFSLRLQTAVAAALVLILTAFTGWIYVRRQTPPVMQGNQSLLKSPALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQANYREKLLVLDSAINDLRAQTGLNPSNAHVRQQLLAMYEEKQQTLEDILEEKR*
Ga0079221_100582212    3300006804    Agricultural Soil    MNINCEDRDRILAEGTPAEWAALEAHAANCDECGEELRSWRALSAAAVEMREYSDTPSLWPRIERALVEQAAAQNLRSEGRGWLSFDRLFSLRLQTAVAAALVLILTAFTGWIYVRRQTPPVMQGNQSLLKSPALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQANYREKLLVLDSAINDLRAQTGLNPSNAHVRQQLLAMYEEKQQTLEDILEEKQ*
Ga0079220_113055361    3300006806    Agricultural Soil    ALEAHSANGPVCAEELRAWKSLSTAAQDWRDYSDSPSLWLRIERALLEQKAARNIREVRWGWLSFGRPFSLGLQTAAAGALLVILTVSAGWIYLHRPAPPVVQGDQSLLKSPALAEVERAQAAYERAIEKLAVQAKPELEKPTTPLQANYREKLLVLDSAINDLRAQAGMNPSNAHVRHQLLAMYVEKQQTLEDILGEKQ*
Ga0073928_100827344    3300006893    Iron-Sulfur Acid Spring    MNITCEDRDRMLEDGTPAEWAALEAHSTSCAVCAEELRAWKALSVAAKEMRDYSDTPSLWPRIERSLTEEAATRKQRAERRGWLSLGFGLSLGWQTAAAAAMVVILTVSAGWFYGHRTGPVASSDQSLLKSSALADVERAQIAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQA
Ga0079219_100866562    3300006954    Agricultural Soil    MNINCEDRDRILAEGTPAEWAALEAHAANCDECGEELRSWTALSAAAVEMREYSDTPSLWPRIERALVEQAAAQNLRSEGRGWLSFDRLFSLRLQTAVAAALVLILTAFTGWIYVRRQTPPVMQGNQSLLKSPALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQANYREKLLVLDSAINDLRAQTGLNPSNAHVRQQLLAMYEEKQQTLEDILEEKQ*
Ga0102924_100065411    3300007982    Iron-Sulfur Acid Spring    MNIQCEDRDRIFEDGTPAEWVALEAHSANCAACAAELRSWKALSGAAQELRDYSDTPSLWPRIERALVKEAGFQKHRSGRWGWLSMRGGFTLGLQTAAAAALVLVLIVSSGWLYLHRSKPVEQGDHSLLKSSALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQASYREKLLVLDSAIDDLREQAGINPSNAQLRQQLLAMYQEKQQTLEDILEEKR*
Ga0116216_107799421    3300009698    Peatlands Soil    TPMNITCKDRDRIFEDGTPAEWAALEAHSANCGDCAEELRAWKAISVAAKELRDYSGSPALWPSIEQALTAEAASKKIRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHPAKPVSGGDQSLLKSPALAEVERAQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAG
Ga0074044_1000013816    3300010343    Bog Forest Soil    MNIICEDRDRIFEDGKPEEWAALEAHSANCAACADELRAWKALSVAAQELRDYSHTPSLWPRIQLSLTEAAAAKKQRAERWGWLSHGFGLSLGWQTAAATALVLILTISAGWIYVHSPKRVDHGDQSLLRSPALAEVERTQAAYELAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR*
Ga0136449_10002026410    3300010379    Peatlands Soil    MNITCKDRDRIFEDGTPAEWAALEAHSANCGDCAEELRAWKAISVAAKELRDYSGSPALWPSIEQALTAEAASKKIRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHPAKPVSGGDQSLLKSPALAEVERAQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAYLRQQLLAMYQEKQQTLQDILEEKR*
Ga0126361_107575702    3300010876    Boreal Forest Soil    MNIKCEDRDRIFEDGTPAEWAALEAHTANCAACTEELRAWKALSVAAQELRDYSDTPSLWPRIESALAQEAAAKKQRAERWSWLSLGFGLSLGWQTAVAAALVLVLTVSAGWIYFHRVGPVTSGDQSLLKSSALAGVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0126350_1085114623300010880Boreal Forest SoilMNIKCEDRDRIFEDGTPAEWAALEAHTANCAACTEELRAWKALSVAAQELRDYSDTPSLWPRIESALAEEAAAKKQRAERWSWLSLGFGLSLGWQTAVAAALVLVLTVSAGWVYFHRVGPVTSGDQSLLKSSALAGVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR*
Ga0137391_1126234813300011270Vadose Zone SoilLSVAAQQLRDYSDRPSLWPRIESALAAEAAAKKQRPERWGWLSLGFSLSLGWQTAVAAALVLVLTVSAGWVYLHRTPPVAPGDQSLLKSPALAEVERAQAAYEQAIDKLAAEAKPELENPATPVEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR*
Ga0137388_1010637923300012189Vadose Zone SoilMNVTCNDRDRIFEDGTSAEWAALEAHTTSCAACAEELRAWKALSAAAMELRDYSDSPSLWPRIERALSEQAPAKLRRAERWRSLSFWRNLPLSWQTASAGTFVLLLTVSAGWFYLHPPKTRGLADQSLLKNSALAEVERTETAYVQAIDELAAEAKPQLENPTTSLQANYREKLFVLDSAIDDLRAQAGLNPSNAQLRYELLAVYQEKQRTLEEILEEKR*
Ga0137363_1010780423300012202Vadose Zone SoilMNVTCNDRDRIFEDGTPAEWAALEAHTASCVACAEELRAWKALSAAASELRDYSDSPSLWPRIERALSERTATKLRRAERWSWLSFRRDAPLSWQSAAAGAFVLLLTVSAGWFYLHPPKPRGPVDQSLLKNSALADVERTETAYVQAIDKLAAEAKPQLENPATPLQANYREKLFVLDSAIDELRAQAGLNPSNAHLRYELLAVYQEKQRTLEGILEEKR*
Ga0137362_1017757623300012205Vadose Zone SoilMNVTCNDRDRIFEDGTPAEWAALEAHTASCVACAEELRAWKALSAAASELRDYSDSPSLWPRIERALSERTATKLRRAERWSWLSFRRDAPLSWQSAAAGAFVLLLTVSAGWFYLHPPKPRGPVDQSLLKNSALADVERTETAYVQAIDKLAAEAKPQLENPATPLQANYREKLLVLDSAIDELRAQAGLNPSNAHLRYELLAVYQEKQRTLEGILEEKR*
Ga0137360_1000287253300012361Vadose Zone SoilMTMNFTCQDRDRILEDGTPAEWAELEAHGANCAVCANELRAWKALSVAAQQLRDYSDRPSLWPRIESALAAEAAAKEQRPERWGWLSLGFSLSLGWQTAVAAALVLVLTVSAGWVYLHRTPPVAPGDQSLLKSPALAEVERAQAAYEQAIDKLAAEAKPELENPATPVEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR*
Ga0137360_1004904333300012361Vadose Zone SoilMNVTCNDRDRIFEDGTPAEWAALEAHTASCAACAEELRAWKALSAAASELRDYSDSPSLWPRIERALSERTATKLRRAERWSWLSFRRDAPLSWQSAAAGAFVLLLTVSAGWFYLHPPKPRGPVDQSLLKNSALADVERTETAYVQAIDKLAAEAKPQLENPATPLRANYREKLLVLDSAIDELRAQAGLNPSNAHLRYELLAVYQEKQRTLEGILEEKR*
Ga0153915_1006218063300012931Freshwater WetlandsMNIRCEDREKIFENGTPAEWVALEAHSANCAVCAEELRTWKSLSTAAQELRDYSDTPSLWPRIERALVEQAAARKHRAERWGWLSFGLGFSLGWQTAAAATLVLILTVSAGWIYLHRPTQVVQGEQSLLKRPALAEVERAQAAYEQAIDKLAAQAKPELEKPTTSLQANYREKLLVLDSAINDLRAQAGMNPSNAHVRQQLLAMYQEKQQTLEDILEEKR*
Ga0187825_1022295513300017930Freshwater SedimentMTMAIQCEDRDRIFLDGTAEEWAALEAHSLDCPACAAELQSWSALSAAANQLRDYSDSPVLWPRIRRELQEQSERHRNSMRWKLFSFGGFTLGLQTIASAALVLLLSISGAWLYWHRQPPVAPADQSLLKNSALADVERTQAAYEQAIDKLAAQARKQIENPVTPLQASYKEKLLVLDSAIDDL
Ga0187821_1013353313300017936Freshwater SedimentMNITCEDRDRIFGDGTPADWAALEAHSANCAACAEELRAWKAISVAAKELRDYSDTPSLWPRIERSLITEAATKKQRAGRWSWLSLGFGVSLGWQTAAAAALVLILTVSAGWIYFPPVKPVPNGDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0187817_1074059713300017955Freshwater SedimentKCEDRDKIFEDGTPAEWAALEAHSANCVVCAEELRSWKALSVAAQELRDYSVTPPLWSRIAQALTEEAAAKKHRSDRWSWLSLGFGLSLRWQTAAAAALVLILTVSAVWVYVHPPADGGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQ
Ga0187779_1014014723300017959Tropical PeatlandMAMNISCKDRDRIFADSTPAEWAALEAHSANCAICAEELRGWKALSAAAQELRDYSDSPSLWRSIERALVQQAAAKKQRWSWLHPGTGIALGWQVAAAAALSVIIALGSVWVYVHRTSKSAPQDLADRFLLKSPALAEVERTQEAYEQAIEKMAVEAKSQIDNPTTPLQESYREKLLVLDSAIEDLRAQAGLNPSNAHLRQQLLGMYQEKQQTLQDILEEKR
Ga0187810_1004234223300018012Freshwater SedimentDKIFEDGTPAEWAALEAHSANCAVCAEELRSWKALSVTAQELRDYSVTPPLWSRIAQALTEEAAAKKHRSDRWSWLSLGFGLSLRWQTAAAAALVLILTVSAVWVYVHPPADGGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0179592_1005240133300020199Vadose Zone SoilMNIACEDRDRIFEDGSPAEWAALEAHSANCAACAEELRSWKAISVLAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAAAALAMILTVSAGWIYLHPAKPVSSGDQSLLKSSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210407_1001685373300020579SoilMTMNTTCEDRDRIFEDGTPAEWAAFEAHSANCAVCSEQLRSWKALSAAAQEMRDYSDSPLLWFRIERALTREAAAGKQRSGPWAWLSPGSGLPLGWQTAAAATLVLILTVSVGWVYLHRTGPGPNGDQSLLKSPALAEVERAQAVYVRAIDKLAAEARPAFENPATPFEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210407_1011465233300020579SoilMNIKCEDRDRIFEDGTPAEWAALEAHSASCVSCAKELRGWKALSVAAKELRDYSDTPSLWPQIGRALTEEAATRKKLPERWGWLSLGFSLSLGWQTAAAAAVVLILTVSAGWVYLHRTGPVPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYHEKQQTLQDILEEKR
Ga0210407_1013964123300020579SoilMNFTCNDRDRIFEDGAPAEWAALEAHAAVCPACAVELRAWKSLSLAAQELRDYTESPALWPRIEQALGKDAAAEKRRKQRWGWLSFGTKLPFGWQSAVAAACVLILTISVYWIFRPAPGGHGNGDQSLLKRPALAEVERTQTAYEQAIDKLAAEAKAQIENPTTSLQTSYREKLLVLDSAIDDLRLQAGFNPSNAHLRQQLLAMYREKQQTLEDILEEKR
Ga0210407_1015864823300020579SoilIFENGTPAEWAAFEAHSANCALCSEQLRNWKALSAAAQEMRDYSDSPLLWTRIERALTAEAAARNQRAGRWAWLSLGLGLSLGWQTAAAATLVLILTVSAGWIYVHPTGPGPSGDQSLLKSPALAEVERAQTAYVQAIDKLVAEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210407_1023188923300020579SoilMNITCEDRDRIFEDGTPAEWAALEAHSANCAACADELRAWKGISIAAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAVAALVLILTLSAGWIFLQPTKPVPSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210403_10000742293300020580SoilMNITCEDRDRVLEDGTPAEWAALETHSANCAACAEELRAWKAISVAAKELRDYSGSPALWTRIEQALTAEAAAKKTRTGRWNWLSLGFGLSLGWQTAAAAALVLILTVSAGWLYLHPIKPVSSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210403_1020504423300020580SoilMNIKCEDRDRILEDGTPAERAALEEHGASCAVCAEELRSWKALSIAANELRDYSDTPSLWPRIERALAAEASAKQQRANRWGWLSLGFGLSLGWRTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210403_1021776613300020580SoilMNITCEDRDRILEDGTPSEWAALEAHSANCTACAEELGAWKALSAAAKEMRDYSDTPSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAATLVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLED
Ga0210403_1093195113300020580SoilSANCAVCAEELRAWKAVSVAAKELRDYSDAPSLWPRIERALTAEAAVKKHRAERWGWLTLGFGLSLGWQTAAVAAMVLILTVSTGWVYLHRAGRGPGGDRDQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQELLAMYQEKQQTLQDILEEKR
Ga0210399_1000424323300020581SoilMNIKCEDRDRIFEDGTPAQWAALESHSANCAVCAEELRSWKALSAAAKELRDYSDTPSLWPRIERALIGEAAANKKRAERWGWLSLGFGLSLGWQTAAAAALLLILTVSVGWVYWPPTPRDGGDQSLLKTSALAEVERAQTAYEHAIDKLAAEAKPELENPATPLQASYREKLLVLDTAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210395_10000742103300020582SoilMNIKCEDRDRILEDGTPAEWAALEGHGASCAVCGEELRSWKALSAAAKELRDYSDTPSLWPRIERALAAEASAKQQRANRWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210395_10001412103300020582SoilMNIKCEERDRIFEDGTPAEWAALEAHSANCAVCAEELRAWKAVSVAAKELRDYSDAPSLWPRIERALTAEAAVKKHRAERWGWLTLGFGLSLGWQTAAVAAMVLILTVSTGWVYLHRAGRGPGGDRDQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQTGMNPSNAHLRQELLAMYQEKQQTLQDILEEKR
Ga0210401_1001745293300020583SoilMNITCEDRDRILEDGTPSEWAALEAHSANCTACAEELGAWKALSAAAKEMRDYSDTPSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAATLVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210401_1004992043300020583SoilMNIKCEDRDRIFEDGTPAEWAALEAHSANCAVCAEELRAWKALSIAAQELRDYSDTPSLWPRIESALTEEAAAKKQRAERWSWLSLGFGLSLGWQTAVAAALVLVLTVFAGWIYLHPTGHVPSGDQSLLKSSALADVERAQAAYEHAIDKLAADAKPELENPTTPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210401_1006807823300020583SoilMNIKCEDRDRILEDGTPAEWAALEGHGASCAVCGEELRSWKALSAAAKELRDYSDTPSLWPRIERALTAEASAKQQRANRWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210401_1022455223300020583SoilMTMNTICEDRDRIFEDGTPAEWAAFEAHSANCALCSEQLRSWKALTAAAQEMRDYSDSPLLWTRIERALTAEAAARNQRAGRWAWLSLGLGLSLGWQTAAAAALVLILTISAGWIYVHRTGPGASGDQSLLKSPALAEVERAQTAYVQAIDKLVGEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210401_1024031713300020583SoilMNIKCEDRDRVFEDGTPAEWAALDAHSANCAICAEELRSWRALSVAAQEIRDYSDTPSLWPRIERALTEETAAKKARAGRWSWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLHRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLED
Ga0210401_1064353713300020583SoilMNIQCEDRERIFEDGTPAEWAALEAHSADCAVCAEELSSWKALSLAAHEMRDYSDTPSLWPSIERALSEEASAKKARVGRWGWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLQRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210401_1094286213300020583SoilMNIKCEERDRIFEDGTPAEWAALEAHSANCAVCAEELRAWKAVSVAAKELRDYSDAPSLWPRIERALTAEAAVKKHRAERWGWLTLGFGLSLGWQTAAVAAMVLILTVSTGWVYLHRAGRGPGGDRDQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQTGMNPSNAHLRQELLAMYQEKQQT
Ga0210406_1006893623300021168SoilMNPTCEDRDRIFENGTPAEWAAFEAHSANCALCSEQLRNWKALSAAAQEMRDYSDSPLLWTRIERALTAEAAARNQRAGRWAWLSLGLGLSLGWQTAAAATLVLILTVSAGWIYVHPTGPGPSGDQSLLKSPALAEVERAQTAYVQAIDKLVAEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210406_1033880723300021168SoilMNIKCEDRERIFEDGTPTEWAALEAHGANCAACTEELRRWKALSAAAKQLRDYSDAPSLWPRIERALIEEAAAKKHRAERWSWLSLGFSLSLGWQTAAAAALVLILAVSAGWIYLHPLTPVVPSDQSLLKSPALAEVERAQAAYEQAIEKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLR
Ga0210406_1042479223300021168SoilMNITCEDRDRIFEDGTPAEWAALEAHSANCAACADELRAWKGINIAAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAVAALVLILTLSAGWIFLQPTKPVPSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210405_10000511143300021171SoilMNIKCEDRDRVFEDGTPAEWAALDAHSANCAICAEELRSWRALSVAAQEIRDYSDTPSLWPRIERALTEETAAKKARAGRWSWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLHRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210405_10001943103300021171SoilMNITCEDRDRIFEDGTPAEWAALEAHGASCAVCAEELRAWKALSVAAQELRNYADTPSLWPRIEGVLTAEAATKKQRAERWGWRSLGFGLSLGWQTAAAAAMVVILTVSAGWVYWHRTRPVLSSDQSLLKSPALAEVERAQAAYEQAIDKLAADAKLQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210388_10000460173300021181SoilMNITCKDRDRIFEDGTPAEWAALEAHSANCDACAEELRAWKAISIAAKEMRDYSDSPSLWPRIEQALAAETAAKTHRAQRWSWLSLGFGLSLGWQTAAAAALILILTASAGWIYLHPTKPVPSSDLSLLKTPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210388_10004032103300021181SoilMNITCEDRDRVLEDGTPAEWAALETHSANCAACAEELRAWKAISVAAKELRDYSGSPALWTRIEQALTAEAAAKKTRTGRWNWLSLGFGLSLGWQTAAAAALVLILTVSAGWLYLHPIKPVSSGDQSLLKSPALADVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210388_1003354823300021181SoilMNIKCEERDRIFEDGTPAEWAALEAHSANCAVCAEELRAWKAVSVAAKELRDYSDAPSLWPRIERALTAEAAVKKHRAERWGWLTLGFGLSLGWQTAAVAAMVLILTVSTGWVYLHRAGRGPGGDRDQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQELLAMYQEKQQTLQDILEEKR
Ga0210388_1018294313300021181SoilKALSAAAKEMRDYSDVPSLWPRIERSLTEAAAAKEQRAERWSWFSLGFGLSLGWQTAAAAALVLILTASAGWVYLHRAGPAPSSDQSLLKTPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRLQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210393_10000123323300021401SoilMNIKCEDRDRIFEDGTPAEWAALEAHSANCAVCAEELRAWKAVSVAAKELRDYSDAPSLWPRIERALTAEAAVKKHRAERWGWLTLGFGLSLGWQTAAVAAMVLILTVSTGWVYLHRAGRGPGGDRDQSLLKSPALAEVERAQAAYEQAIDKLAADAKTQLENPATPLQASYREKLLVLDSAINDLRAQTGMNPSNAHLRQELLAMYQEKQQTLQDILEEKR
Ga0210393_1002584653300021401SoilMNIKCEDRDRIFEDGTPAEWAALEEHGASCAVCAEELRSWRALSAAAKELRDYSATPSLWPRIERALTAEAPAKQQRASRWGWVSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210393_1013709023300021401SoilMNITCEDRDRILEDGTPSEWAALEAHSANCTACAEELGAWKALSAAAKEMRDYSDTPSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAAILVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEGKR
Ga0210393_1062290913300021401SoilIFEDGAPAEWAALEAHAAACPACAEELRAWKSLSLAAQELRDYTESPALWPRIEQALGKDAAAEKRRKQRWGWLSFGTKLPFGWQSAVAAACVLILTISVYWIFRPAPGGHGNGDQSLLKRPALAEVERTQTAYEQAIDKLAAEAKAQIENPTTSLQTSYREKLLVLDSAIDDLRLQAGFNPSNAHLRQQLLAMYREKQQTLEDILEEKR
Ga0210385_1022696923300021402SoilAWKAISIAAKEMRDYSDSPSLWPRIEQALAAETAAKTHRAQRWSWLSLGFGLSLGWQTAAAAALILILTASAGWIYLHPTKPVPSSDLSLLKTPALAEVERTQAAYEQAIDKLAADAKPQLENPANPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210389_1009338023300021404SoilMNMTCEDRDRIFEDGTHAEWAALEAHATNCAVCAEELRAWKALSVAAQELRDYSEAPALWTRIEESLTEEAAAKKEKADRWSRLSLGFGSSLVWQSVAAAFVVLVLTVSAGWIYVHRSVNVLSTDHSLLKSSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRVQAGMNPSNAHLRQQLLAMYEEKQQTLEDILEIKR
Ga0210387_10000333143300021405SoilMNITCEDRDRIFLDGTPAEWAALEAHSGNCTVCSEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALTAEAARKKHRAEGWNWLSLGLGLSLGWQTAAAAALVLILTASAGWIYLHPPKSRPSPDLSLLKSSALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210387_1001293333300021405SoilMNIKCEDRDRILEDGTPAERAALEEHGASCAVCAEELRSWKALSIAANELRDYSDTPSLWPRIERALAAEASAKQQRANRWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210386_1056767123300021406SoilMNITCEDRDRIFEDGTPSEWAALEAHSATCPACAEELRAWKAISVAAKELRDYSDSPSLWPRIEHALAAEAQAKKHRTERWSWLSLGFGLSLGWQTALVAAMVVVLTVSGGWIYLHRTGRGLGGDRDQSLLKTPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210383_10000001623300021407SoilMTITCEDRDRIFEDGTPAEWAALEAHSVNCVGCAEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALAAETAAKAHRAQRWSWLSLGFGLSLGWQTAAAAALILILTASAGWIYLHPTKPVPSLDLSLLKSPALAEVERTQAAYEQAIDKLATDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210383_1000491283300021407SoilMTMNTTCRDRDRIFEDGTPAEWAAFEAHTANCAVCSEQLRSWKALSAAAQEMREYSDNPLLWPRIESALTAEAAAKKQRTGRWAWLSLGFGLSLGWQTAVAATLVLILTVSAGWVYWHPTGPGPSGDQSLLKSPALAEVERAQAAYVQAIDKLAAQAKPALENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYEEKQQTLEDILEEKR
Ga0210394_1000485493300021420SoilMNISCEDRDRILEDGTPSEWAALEAHSANCTACAEEVGAWKALSAAAKEMRDYSDTPSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAATLVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210384_1001602353300021432SoilMTMNTICEDRDRIFEDGTPAEWAAFEAHSANCALCSEQLRSWKALTAAAQEMRDYSDSPLLWTRIERALTAEAAARNQRAGRWAWLSLGLGLSLGWQTAAAATLVLILTISAGWIYVHRSGPGASGDQSLLKSPALAEVERAQTAYVQAIDKLVAEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210384_1013091543300021432SoilMNIKCEDRDRIFEDGTPAEWAALEAHSASCVSCAKELRGWKALSVAAKELRDYSDTPSLWPQIGRALTEEAATRKKLPERWGWLSLGFSLSLGWQTAAAAAVVLILTVSAGWVYLHRTGPVPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYH
Ga0210391_1000314583300021433SoilMMNITCEDRDRIFEDGTPAEWSALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWPRIEHALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210390_1011933533300021474SoilMNITCEDRDRILEDGTPSEWAALEAHSANCTACAEELGAWKALSAAAKEMRDYSDTPSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAAILVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210392_1053456313300021475SoilMNITCEDRDRIFEDGTPAEWAALEAHSANCAACADELRAWKGISIAAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAVAALVLILTLSAGWIFLQPTKPVPSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLD
Ga0210398_1050817313300021477SoilLEDGTPAEWAALEGHGASCAVCGEELRSWKALSAAAKELRDYSDTPSLWPRIERALAAEASAKQQRANRWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210398_1064028123300021477SoilLSVAAQEIRDYSDTPSLWPRIERALTEETAAKKARAGRWSWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLHRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210402_10000727203300021478SoilMNIQCEDRDRIFEDGTPAEWAALEAHSANCGLCAEELRTWKSLSTTARELRDYSDTPSLWPRIERALVEQAAARKPRAERWGWLSFGRGFSLGLQAAAAGALVLILTVSAGWVYWHRLTPVKQRDQSLLKSPALAEVERAQVAYEQAIDKLAVQAKPELEKPSTPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210402_1002493743300021478SoilMNIACEDRDRILEDGTPAEWAALEAHSANCAACAEELRAWKAISVAAKELRDYSDTPSLWPRIERSLIAEDATKKQRAGRWSWLSLGFGLSLGWQTAAAAALVLVLTVSAGWLYLQPPKPVSGGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0210402_1027326733300021478SoilMNITCEDRDRIFEDGTPAEWAALEAHSANCAACADELRAWKGISIAAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGLSLGWQTAAVAALVLILTLSAGWIFLQPTKPVPSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQ
Ga0210410_1000969693300021479SoilMNFTCNDRDRIFEDGAPAEWAALEAHAAVCPACAVELRAWKSLSLAAQELRDYTESPALWPHIEQALGKDAAAEKRRKQRWGWLSFGTKLPFGWQSAVAAACVLILTISVYWISRPAPGGHGNGDQSLLKRPALTEVERTQTAYEQAIDKLAAEAKAQIENPTTSLQTSYREKLLVLDSAIDDLRLQAGFNPSNAHLRQQLLAMYREKQQTLEDILEEKR
Ga0210410_1010076133300021479SoilMTMNTICEDRDRIFEDGTPAEWAAFEAHSANCALCSEQLRSWKALTAAAQEMRDYSDSPLLWTRIERALTAEAAARNQRAGRWAWLSLGLGLSLGWQTAAAATLVLILTISAGWIYVHRTGPGASGDQSLLKSPALAEVERAQTAYVQAIDKLVGEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0210409_1019028223300021559SoilMNIKCEDRDRIFEDGTPAEWAALEAHSASCVSCAKELRGWKALSVAAKELRDYSDTPSLWPQIGRALTEEAATRKKLPERWGWLSLGFSLSLGWQTAAAAAVVLILTVSAGWVYLHRTGPVPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAEEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYHEKQQTLQDILEEKR
Ga0212123_10003664193300022557Iron-Sulfur Acid SpringMNIQCEDRDRIFEDGTPAEWVALEAHSANCAACAAELRSWKALSGAAQELRDYSDTPSLWPRIERALVKEAGFQKHRSGRWGWLSMRGGFTLGLQTAAAAALVLVLIVSSGWLYLHRSKPVEQGDHSLLKSSALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQASYREKLLVLDSAIDDLREQAGINPSNAQLRQQLLAMYQEKQQTLEDILEEKR
Ga0224561_100749613300023030SoilMTMNTTCEDRDRIFADGTPAEWAALEAHSANCAVCTEELRSWKALGAAAQEMRDYSDSPLLWSQIERALAAEAAAKNQRSGRWAWLSIGFGLSLGWQTAAAATLVLILTVSAGWLYLHRAGPGPGGDQSLLKSPALAEVERAQTAYVQAIDKLAAEAKPALENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0228598_102750513300024227RhizosphereSEWAALEAHSANCAACAEELRSWKALSVAAKEMRDYSDTPSLWPRIERALTEESAAKKQRAERWNWLSLGFGLSLGWQTAAAAALVLILTISGVWIYVHRPTPLDNRDQSLLKTPALAEVVRTQAAYEQAIEKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209648_1016476723300026551Grasslands SoilMTMNFTCQDRDRILEDGTPAEWAELEAHGANCAVCADELRAWKALSVAAQQLRDYSDRPSLWPRIESALAAEAAAKKQRPERWGWLSLGFSLSLGWQTAVAAALVLVLTVSAGWVYLHRTPPVAPGDQSLLKSPALAEVERAQAAYEQAIDKLAAEAKPELENPATPVEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR
Ga0208603_100898423300027109Forest SoilMNITCEDRDRILEDGTPSEWAALEAHSANCAACAEELRSWKALSVAAKEMRDYSDTPSQWPRIERALTEESAAKKQRAERWNWLSLGFGLSLGWQTAAAAVLVLILTVSGVWIYGHRPTPLDIRDQSLLKTPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209736_1000118143300027660Forest SoilMNITCEDRDRIFEDGAPAEWTALETHSANCAVCAEELGAWKALSFAAQELRDYSDTPSLWPRIERALTLEAAAKNQRSGRWAWLSFGFGLSLGWQTAAAAALVLILTVSAGWVYLHRTGPLPSGDQSLLKSPALAEVEHAQTAYVQAIDKLAAEAKPALENPGTPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209178_106109523300027725Agricultural SoilMNINCEDRDRILAEGTPAEWAALEAHAANCDECGEELRSWRALSAAAVEMREYSDTPSLWPRIERALVEQAAAQNLRSEGRGWLSFDRLFSLRLQTAVAAALVLILTAFTGWIYVRRQTPPVMQGNQSLLKSPALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQANYREKLLVLDSAINDLRAQTGLNPSNAHVRQQLLAMYEEKQQTLEDILEEKQ
Ga0209038_1016937813300027737Bog Forest SoilMITKCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRSWRALSAAAHEMRDYSDTPSLWPRIERALAEEAAAKRNRSERWSWLSLGFGLSLGWQTAAAAALVLILTVSASWFYVHRTPSVDNRDQSLLRSPALAEVERSQAAYEQAIDKLAADAKPALENPASPLQASYREKLLVLDSA
Ga0209073_1005126523300027765Agricultural SoilMNINCEDRDRILAEGTPAEWAALEAHATNCDECGEELRSWRALSAAAVEMREYSDTPSLWPRIERALVEQAAAQNLRSEGRGWLSFDRLFSLRLQTAVAAALVLILTAFTGWIYVRRQTPPVMQGNQSLLKSPALAEVERAQAAYEQAIDKLAAQAKPQLENPATPLQANYREKLLVLDSAINDLRAQTGLNPSNAHVRQQLLAMYEEKQQTLEDILEEKQ
Ga0209448_1006749423300027783Bog Forest SoilLSAAAKEMRDYSDTPSLWPRIERALTEEVATKKQRSERWSWFSLGFGLSLGWQTAAAAVLVLVLTVSGVWIYVHPPKRMDRADQSLLKSPALAEVERAQAAYEQAIDKLAADAKPRLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209139_10000361113300027795Bog Forest SoilMNLKCEDRDRIFEDGTPAEWAALEAHSANCLECTEELRAWKALSVAAKELRDYSDTPSLWLRIEPALAEAAAAKQRDAGRWSWLTLGFGLSLGWQTAMAAALVVLLTVSAGWVYLHPPRPGNRGDKDQSLLKSPALAEVERAQAAYEQAIDKLALNAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKRQTLQDILEEKR
Ga0209656_1010997613300027812Bog Forest SoilMNIKCEDRDRIFEDGTPAEWAALEAHSASCAVCAEELRSWKALSVAAQEMRDYSNWLSLWPRIERALAEEAAAKRNRSARWSWLSLGFGLSLGWQTTAAAALVLLLTVSAGWIYLLRPTSLTPADQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQARMNPSNAHLRQQLLAMYQEKQQTLEDIL
Ga0209773_1004159313300027829Bog Forest SoilEAHSANCLECTEELRAWKALSVAAKELRDYSDTPSLWLRIEPALAEAAAAKQRDAGRWSWLTLGFGLSLGWQTAMAAALVVLLTVSAGWVYLHPPRPGNRGDKDQSLLKSPALAEVERAQAAYEQAIDKLALNAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKRQTLQDILEEKR
Ga0209580_1000294143300027842Surface SoilMNIKCEDRNRVFEDGTPAEWAALEAHSTSCAVCAEELLSWKALSVAARELRDYSDAPSLWPRIELGLAVEAAADKKRAERWGWLSIGFGLSLGWQTAAAVALVLILTVSVGLVYWPPTTRDRGDQSLLRTSALAEVERAQSAYEHAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR
Ga0209166_1000315873300027857Surface SoilMNIKCEDRDKILAEGTPDEWAALEAHATICDACGEELRSWRALSGAAVEMREYSDTPSLWPRIERALVEQAAAQDRHSEGWGWLSFGRLFPLRLQTAAAAVLVLILTASAGWIYIHRQTPPVVQGDQSLLKSPALAEVERAQAAYERAIDKLAAQAKPQLENPTTPLQANYREKLLVLDSAINDLRAQTGMNPSNAHVRQQLLAMYQEKQQTLEDILEEKR
Ga0209283_1015393823300027875Vadose Zone SoilMNVTCNDRDRIFEDGTSAEWAALEAHTTSCAACAEELRAWKALSAAAMELRDYSDSPSLWPRIERALSEQAPAKLRRAERWRSLSFWRNLPLSWQTASAGTFVLLLTVSAGWFYLHPPKTRGLADQSLLKNSALAEVERTETAYVQAIDELAAEAKPQLENPTTSLQANYREKLFVLDSAIDDLRAQAGLNPSNAQLRYELLAVYQEKQRTLEEILEEKR
Ga0209169_1000960323300027879SoilMNITCEDRDRIFEDGTPAEWAALEAHGVNCGICAEELRAWKAISVAAKEMRDYSDSPSLWPRIEQALTAETAAKTHRTQRWSWLSLGFGLSLGWQTAAAAALVLILTASAGWIYLHPTRPVPHPDLSLLKSPALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0209275_1002896823300027884SoilMNITCEDRDRIFLDGTPAEWAALEAHSGNCSVCTEELRAWKAISTAAKEMRDYSDSPSLWPRIEQALTAEAARKKHRAEGWNWLSLGLGLSLGWQTAAAAALVLILTASAGWIYLHPPKSRPSPDLSLLKSSALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0209275_1008685823300027884SoilMNIKCEDRDRIFEDGTPAEWAALEAHSVNCAVCAEELRSWKALSVAAKELRDYSDTPSLWPRIESALTEEAAAKKQRAERWSWLSLGFGLSLGWQTAGAAVLVLVLTVSAGWVYFHRIGFEPSPDQSLLKSSALADVERAQAAYEHAIDKLAADAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKK
Ga0209275_1029183013300027884SoilSWKALSLAAHEMRDYSDTPSLWPSIERALSEEASAKKARVGRWGWLSLGFGLSLGWQTAAAAALVLLLTVSAGWIYLQRPTSMTPADHSLLKSPALAEVERTQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209380_1016536423300027889SoilMNITCEDRDRIFHDGTPSEWAALEAHSANCAVCAEELRSWKALSVAAKELRDYSYTPSLWPRIERSLTEEAAAKKQRAERWGWLSLRFGLSLEWQTAAAAAFVLILTVSAGWVYLHRKPVADPGDQSLLKSRALAEVERAQAVYEQAIDKLATEAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209380_1026044723300027889SoilMNITCEDRDRIFLDGTPAEWAALEAHSGNCSVCTEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALTAEATRKKHRAEGWNWLSLGLGLSLGWQTAAAAALVLILTASAGWIYLHPPKSRPSPDLSLLKSSALAEVERTQAAYEQAIDKLAADAKAQLENPATPLEASYREKILVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0209415_10005721143300027905Peatlands SoilMNITCKDRDRIFEDGTPAEWAALEAHSANCGDCAEELRAWKAISVAAKELRDYSGSPALWPSIEQALTAEAASKKIRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHPAKPVSGGDQSLLKSPALAEVERAQAAYEQAIDKLAVDAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAYLRQQLLAMYQEKQQTLQDILEEKR
Ga0209006_1001370473300027908Forest SoilMNIKCEDRERIFEDGTPTEWAALEAHGANCAACTEELRRWKALSAAAKQLRDYSDAPSLWPRIERALIEEAAAKKHRAERWSWLSLGFSLSLGWQTAAAAALVLILAVSAGWIYLHPLTPVVPSDQSLLKSPALAEVERAQAAYEQAIEKLAVNAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0209006_1001678893300027908Forest SoilMTMNTACEDRDRIFADGTPAEWAALEAHSANCAVCSEELRGWKALSAAAQEMRDYSDSPLLWSRMQRALAAEAAAKNQRSGRWAWLSIGFGLSLGWQTAAAATLVLILTVFAGWLYWHPAGPGPGGDQSLLKSRALADVERAQTAYVQAIDKLAAEAKPALENPATPFEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0265354_1000057143300028016RhizosphereMMNITCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDHSLLKSPALAEVERTQAAYEQAIDKLAADARPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0265357_100032533300028023RhizosphereMMNITCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWLRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0308309_10001211103300028906SoilMNIACKDRDGIFEDGTPAEWAAFEAHSANCASCAEELRSWKALSVTAKELRDYSDTPSLWPSIESALTEEAAAKKQRAERWGWLSLGFGLSLGWQTAAAAALVLVLTVSAGWIYFHRIGPVPGGDQSLLKSSALANVERTQAAYEQAIDKLAADAKTELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEKR
Ga0308309_1000535583300028906SoilMNIICEDRDRIFEDGTPAEWAALEAHSANCAACAEELRAWKAISVAAKELRDDSDTPSLWPRIERSLIAEAAAKNQHAGRWSWLSLGFGLSLGWQTAAAAALVLVLSVSGAWIYVHRGGVVDSPDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0308309_1022926023300028906SoilMNIKCEDRDRIFEDGTPAEWAALEAHSASCVSCAKELRGWKALSVAAKELRDYSETPSLWPRIGRALTEEAATRKKRSEGWGWLSLGFGLSLGWQTAAAAALVLILTVSAGWVYWHRTGPAPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAAEAKPELENPATPLQTSYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYHEKQQTLQDILEEKR
Ga0308309_1059092323300028906SoilMNIQCADRGRIFEDGTPAEWAAFEAHSANCASCAEELQGWKALSVQARELRDYSDTPSLWPRIARALTDEAATGKKRAERWGWLSLGFGLSLGWPTAAAAVLALILTVSAGWVYWHGTGPRPIGDQSLLKSPALAEVERTQTAYEQAIDKLAVEAKPELENPATPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDVLEEKR
Ga0222749_1025882013300029636SoilDGTPAEWAALEAHSASCVSCAKELRGWKALSVAAKELRDYSDTPSLWPQIGRALTEEAATRKKLPERWGWLSLGFSLSLGWQTAAAAAVVLILTVSAGWVYLHRTGPVPLVDQSLLKSPALAAVEHAQTAYEQAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRHQLLAMYHEKQQTLQDILEEKR
Ga0265745_100205123300030759SoilMMNITCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWLRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADARPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0265740_100823613300030940SoilRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWLRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADARPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0265760_1000169183300031090SoilMMNITCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWLRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDHSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0265320_1021485023300031240RhizosphereSANCSVCAEELRAWKAISVAAKELRDYSDAPSLWPRIEQALATEAQAKKHLAGRWRWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLRPVRDVSHTDQSLLKSPALAEVERAQAAYEQAIDKLAADAKAQLDNPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0265320_1051383413300031240RhizosphereAHSANCPICAGEVRAWKAISVAAKELRDYSDAPSLWPRIEQALATEAATSRHRTGRWSWLSLGFGLSLGWQTAAAAALVLILTVSGGWIYLHRTPVPGRDRDQSLLKTPALAEVERAQAAYEQAIDKLAAGAKAQLDNPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQ
Ga0310686_100710876103300031708SoilMNIKCEDRDRILEDGTPAECAALEAHGASCTVCAEELRSWKALSIAANELRDYSDTPSLWPRIERALIGEAAANKKRAERWGWLSLGFGLSLGWQTAAAVALVLILTVSVGWVYWPPTPRDGGDQSLLKTSALAEVERAQTAYEHAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR
Ga0310686_10138008313300031708SoilDRILEDGTPSEWAALEAHSANCAACAEELRSWKALSVAAKEMRDYSDTPSQWPRIERALTEESAAKKQRAERWNWLSLGFGLSLGWQTAAAAALVLILTVSGVWIYVHRPTPLDNRDQSLLKTPALAEVVRTQAAYEQAIEKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQT
Ga0310686_10491754923300031708SoilAALEAHSTNCAVCAEELRAWKAISGAAKEMRDYSDSPSLWPSIERALTVEAAAKKHRAERWSWLSLGFGLSLGWQTAVAAALVLILTASAGWIYLHRGPVPSRDRDQSLLKTPALAEVERTQAAYEQAIDKLAADAQPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0310686_11107894733300031708SoilMMNITCEDRDRIFEDGTPAEWAALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWPRIEQALAAETAVKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDHSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0310686_11772535113300031708SoilMNITCEDRDRILEDGTPSEWAALEAHSANCTACAEELGAWKALSAAAKEMRDYSDTLSLWPQIERSLKEEAAGKKQRTERWSWLSLGFGLSLGWQTAAAATLVLILSVSAGWVYLHRAGPAPAGDESLLKTSALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVVDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTL
Ga0307476_1054492213300031715Hardwood Forest SoilMNITCEDRDRIFEDGSPADWAALEAHSANCAACAEELSAWKAISVAAKELRDYSDTPSLWPRIERSLVAEAATKKQRAGRWNWLSLGFGFSLGWQTAAAAALVLILTVSAGWIYLHPAKPVPNGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQAKQQTLQDILEEKR
Ga0307474_1000766633300031718Hardwood Forest SoilMNIKCEDRDRIFENGTPAEWAALEAHSASCLSCAEELRGWKALSVRARELRDYSDMPSLWPRIERALTEEAATRKRRAERWGWLSVGFGLSLGWQTAAAAALVVILTVSAGWIYWHRTGPAPIGDQSLLKSPALAEVERAQTAYEQAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQDILEEKR
Ga0307474_103394782 3300031718 Hardwood Forest Soil MNITCEDRDRIFENGTPAEWAALEAHSANCAACAEELSAWKAISVAAKELRDYSDTPSLWPRIERSLIAEAATKKQRAGRWSWLSLGFGFSLGWQTAAAAALVLVLTVSAGWIYLHPIKPVSSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQAKQQTLQDILEEKR
Ga0307474_110752171 3300031718 Hardwood Forest Soil RDRIFLDGTPAEWAAFEAHSANCAVCSEQLRSWKTLSAAAQEMRDYSDSPLLWSGIERALTAAAAAKQRRSGRWAWLSPGSGLSLGWQIAAAASLVLILTVSVGWVYLHRTGPGPNGDQSLLKSPALAEVERAETAYVQAIDKLAAEARPALENPGTPLEASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLEDILEEK
Ga0307477_102313661 3300031753 Hardwood Forest Soil MNITCNDRDRIFEDGTPAEWAALEAHAASCAGCAEEVRAWKALSTAATELRDYKESPALWPRIERALGEQAAVTARRAARWNWLRSWRNISLGWQTAAVGALVLLLTVSVGWFILHPKPPVAPDQSLLKSSALAEVERAENAYIQAIDKLAAEAKPQIENPATPLQASYKEKLLVLDSAIDDLRAQAGLNPSNAHLRNQLLAMYQEKKQTFEDILEEK
Ga0307475_102433632 3300031754 Hardwood Forest Soil NCAACAEELRAWKAISVAAKELRDYSDTPSLWPRIERSLVAEAATKKQRAGRWSWLSLGFGFSLGWQTAAAAALVLILTVSAGWIYLHPTKPVSSGDQSLLKSPALAEVERAQAAYEQAIDKLAADAKPQLENPATPLQANYREKLLVLDSAINDLRAQAGMNPSNAYLRQQLLAMYQAKQQTLQDILEEKR
Ga0307478_1000059423 3300031823 Hardwood Forest Soil MNIKCEDRDRIFEDGTPAEWAALEEHGASCAVCAEELRSWRALSAAAKELRDYSDTPSLWPPIERALTAEASAKQQRASRWGWLSLGFSLSLGWPTAAAAALVLILTVSAGWIYLHRTPLVGPGDQSLLKSPALAEVERAQAAYEEAIDKLAAEAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYEEKQQTLQDILEEKR
Ga0307479_101949102 3300031962 Hardwood Forest Soil MNIKCEDRDRIFEDGTPAEWAALEAHSASCAACGEELRSWKALSVAAKELRDYSDTPSLWPRIEGALTAEAAANKKRAERWGWLSLGFGLSLGWQTAAAVALVLILTVSVGLVYWPPTTRDGGDQSLLKSPALADVERAQTAYEQAIDKLAAQAKPELENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQEILEEKR
Ga0307479_104027042 3300031962 Hardwood Forest Soil MNIKCEDRDRILEDGTPAEWAALEAHSANCAVCTEELRAWKALSVAARELRDYSDMPSLWPRIENALTQEAAAKKHRAERWGWLSLGFSLSLGWQTAAAAALVLVLTVSAGWVYFHRTPPVVPVDQSLLKSPALAEVERAQAAYEQAIDKLAVEAKPELENPVTSLQASYREKLLVLDSAINDLRAQAGMNPSNAHLRQQLLAMYQEKQQTLQELLEEKR
Ga0307471_1000254682 3300032180 Hardwood Forest Soil MNIKCEDRDRIFEDGTPAEWAALEAHSANCGLCAEELRTWKSLSTTAQELRDYSDTPSLWPRIERALVEQAAAGKHRAERWRWLSFGRRFSLGLQTAVAGALVLILTVSAGWLYWHGSKLVEQPNHSLLKSPALAEVERAQVAYEQAIDKLAVQAKPELEKPSTPLQANYREKLLVLDSAINDLRAQAGMNPSNAHLRQLLLAMYQEKQQTLEDILEEKR
Ga0307471_1001492642 3300032180 Hardwood Forest Soil MNENCSDRNRIFKDGTLAEWAALEAHTASCVACAEELRAWKALSAAASELRDYSDSPSLWPRIERAFSEQTAAKMRRAERWSWLSFWRDAPLSWQTAAAGAFVLLLTVSAGWFYLHPPTPRGPADQSLLKNSALADVERTETAYVKAIDKLAAEAKPQLENPTTPLQANYREKLFVLDSAIDELRAQAGLNPSNAHLRYELLAVYQEKQRTLEGILEEKR
Ga0348332_113572741 3300032515 Plant Litter ALEAHSTNCAVCAEELRAWKAISAAAKEMRDYSDSPSLWLRIEQALAAETAAKKHRAGRWSWLSLGFGLSLGWQTAAAAALVLILTVSAGWIYLHRTSTGEHGDQSLLKSPALAEVERTQAAYEQAIDKLAADAKPQLENPATPLQASYREKLLVLDSAINDLRAQAGMNPSNSHLRQQLLAMYQEKQQTLEDILEEKR
Ga0335085_1000321419 3300032770 Soil MNIKCEDRDRILEDGTAAEWAALEEHSLNCAACTEELRGWKALSVAAKQLRDYSDTPSLWPRIARMLAQETKTKSHRSGPWAWFSLGSGFTLGLQATAAAAVVLILTVSAGWIYLHPPKPVAQVDHSLLKSSALAEVERTQAAYEEAIDKLASQAKPQLANPTTPLQASYREKLLVLDSAINDLRAQAGLNPSNAQLRQQLLAMYQEKQQTLEDILEEK
Ga0335079_1000074622 3300032783 Soil VPMNIDCKDRDRILEDGTPAEWAALEAHSAVCLACAEELRAWKALSVAAAELRDYSDSPILWQRIERALVEQETARQERANRWAWLRLGSGSMLGWQVAAAATLAVLLAIAGIWVYRHETGSAIGRPPIAEKTPLLKSPALAEVERTQTAYEQAIDKLAAEAKAQIDNPTTPLQESYREKLLVLDSAIDDLRAQAGLNPSNAHLRQQLLAMYEEKQRTLQEILEEKQ
Ga0335080_1000116111 3300032828 Soil MNIDCKDRDRILEDGTPAEWAALEAHSAVCLACAEELRAWKALSVAAAELRDYSDSPILWQRIERALVEQETARQERANRWAWLRLGSGSMLGWQVAAAATLAVLLAIAGIWVYRHETGSAIGRPPIAEKTPLLKSPALAEVERTQTAYEQAIDKLAAEAKAQIDNPTTPLQESYREKLLVLDSAIDDLRAQAGLNPSNAHLRQQLLAMYEEKQRTLQEILEEKQ
Ga0335072_100453012 3300032898 Soil MNIQCEDRDRIFEDGTPAEWAALEAHSANCAVCAEELRSWRALSAAAQKLRDYSDAPSLWPRIERALAEQAGRKPHARWRGWLNLGSGFTLGLQTAAAAALVLILTVSAGWVYFHRSGPVAPDDPSLLKSSALAEVERTQAAYEQAIDKLAAQAKPQLENPTTPLQASYREKLLVLDSAIDDLRAQAGLNPSNAQLRQQLLAMYQEKQQTLEDILEEKQ
Ga0335083_1000046419 3300032954 Soil MNIKCEDRDRILEDGTAAEWAALEAHSLNCAACTEELRGWKALSVAAKQLRDYSDTPSLWPRIARMLAQETKTKTHRSGPWAWFSLGSGFTLGLQATAAAAVVLILTVSAGWIYLHPPKPVAQVDHSLLKSSALAEVERTQAAYEEAIDKLASQAKPQLANPTTPLQASFREKLLVLDSAINDLRAQAGLNPSNAQLRQQLLAMYQEKQQTLEDILEEK
Ga0335076_100963055 3300032955 Soil MNIDCKDRDRIFEDGTPAEWAGLEAHSASCEGCAEEMRAWKALSVAAQELRDYSDAPSLWPRIGQALAEQASAKRMRWLNLGTSLALGWQLAAAAALALLVTLGAVWVYVHRTPRNTPQEMADKFLLKSPALAEVERTQAAYEQAIDKLAAEAKPQIDNPATPLQASYREKLLVLDSAIDDLRAQAGLNPSNAQLRQQLLAMYQEKQQTLQDILEEKR
Ga0335073_1000377615 3300033134 Soil MNIDCNDRDRILEDGTPEEWAALEAHSASCPACAEELRAWKALSAAAAELRDYSDSPMLWQRIERTLVQEQTTRQQPANRWAWLRLGSGLMLGWQVAAAAALAVLLGITGVWVYLHETGNPIGRPPIAEKSPLLKSPALAEVERTQAAYEQAIDKLAGEAKAQIDNPTTPLQENYREKLLVLDSAIDDLRAQAGLNPSNAHLRQQLLAMYEEKQQTLQEILEEKR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice