NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F062047

Metagenome / Metatranscriptome Family F062047

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F062047
Family Type Metagenome / Metatranscriptome
Number of Sequences 131
Average Sequence Length 273 residues
Representative Sequence MSAPDTMMPLISAPWLDADGNPLFVTPLPGFRSEVIQSLEWSYPGILSPPMKALLGSCCGLAGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEIGNKDLPGPIWCVFPDPEVAVYVNDDLAAFMATIREHTCRGEMHAWLHGLTAQARTVWSRRHAWAMRPHEAARSDRGIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPRPRSRDASTRASARAEVSFAIETPRSGRKPRPNPENP
Number of Associated Samples 110
Number of Associated Scaffolds 131

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 70.99 %
% of genes near scaffold ends (potentially truncated) 42.75 %
% of genes from short scaffolds (< 2000 bps) 62.60 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (56.489 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(17.557 % of family members)
Environment Ontology (ENVO) Unclassified
(21.374 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(35.878 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 20.13%    β-sheet: 19.80%    Coil/Unstructured: 60.07%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.45
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 131 Family Scaffolds
PF00011HSP20 22.90
PF00166Cpn10 3.05
PF00106adh_short 1.53
PF01557FAA_hydrolase 1.53
PF13610DDE_Tnp_IS240 1.53
PF07726AAA_3 1.53
PF12411Choline_sulf_C 0.76
PF02895H-kinase_dim 0.76
PF01814Hemerythrin 0.76
PF03641Lysine_decarbox 0.76
PF03466LysR_substrate 0.76
PF04185Phosphoesterase 0.76
PF08447PAS_3 0.76
PF10938YfdX 0.76
PF01266DAO 0.76
PF03960ArsC 0.76
PF13692Glyco_trans_1_4 0.76
PF09123DUF1931 0.76
PF01694Rhomboid 0.76
PF08668HDOD 0.76
PF13545HTH_Crp_2 0.76
PF04542Sigma70_r2 0.76
PF06742DUF1214 0.76
PF00155Aminotran_1_2 0.76
PF02423OCD_Mu_crystall 0.76
PF01292Ni_hydr_CYTB 0.76
PF13356Arm-DNA-bind_3 0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 131 Family Scaffolds
COG0071Small heat shock protein IbpA, HSP20 familyPosttranslational modification, protein turnover, chaperones [O] 22.90
COG0234Co-chaperonin GroES (HSP10)Posttranslational modification, protein turnover, chaperones [O] 3.05
COG0643Chemotaxis protein histidine kinase CheASignal transduction mechanisms [T] 1.53
COG5402Uncharacterized protein, contains DUF1214 domainFunction unknown [S] 0.76
COG5361Uncharacterized conserved proteinMobilome: prophages, transposons [X] 0.76
COG4941Predicted RNA polymerase sigma factor, contains C-terminal TPR domainTranscription [K] 0.76
COG4117Thiosulfate reductase cytochrome b subunitInorganic ion transport and metabolism [P] 0.76
COG3658Cytochrome b subunit of Ni2+-dependent hydrogenaseEnergy production and conversion [C] 0.76
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.76
COG3038Cytochrome b561Energy production and conversion [C] 0.76
COG2864Cytochrome b subunit of formate dehydrogenaseEnergy production and conversion [C] 0.76
COG2423Ornithine cyclodeaminase/archaeal alanine dehydrogenase, mu-crystallin familyAmino acid transport and metabolism [E] 0.76
COG1969Ni,Fe-hydrogenase I cytochrome b subunitEnergy production and conversion [C] 0.76
COG1611Nucleotide monophosphate nucleosidase PpnN/YdgH, Lonely Guy (LOG) familyNucleotide transport and metabolism [F] 0.76
COG1595DNA-directed RNA polymerase specialized sigma subunit, sigma24 familyTranscription [K] 0.76
COG1393Arsenate reductase or related protein, glutaredoxin familyInorganic ion transport and metabolism [P] 0.76
COG1191DNA-directed RNA polymerase specialized sigma subunitTranscription [K] 0.76
COG0705Membrane-associated serine protease, rhomboid familyPosttranslational modification, protein turnover, chaperones [O] 0.76
COG0568DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)Transcription [K] 0.76


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A56.49 %
All OrganismsrootAll Organisms43.51 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004080|Ga0062385_10130308Not Available1269Open in IMG/M
3300004114|Ga0062593_100858718Not Available911Open in IMG/M
3300004152|Ga0062386_100373874Not Available1144Open in IMG/M
3300004156|Ga0062589_100110058All Organisms → cellular organisms → Bacteria → Proteobacteria1782Open in IMG/M
3300005337|Ga0070682_100045232All Organisms → cellular organisms → Bacteria → Proteobacteria2728Open in IMG/M
3300005337|Ga0070682_100081723All Organisms → cellular organisms → Bacteria → Proteobacteria2093Open in IMG/M
3300005437|Ga0070710_10855439Not Available653Open in IMG/M
3300005439|Ga0070711_100007553All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6614Open in IMG/M
3300005537|Ga0070730_10103709Not Available1966Open in IMG/M
3300005537|Ga0070730_10404522Not Available883Open in IMG/M
3300005541|Ga0070733_10325425Not Available1017Open in IMG/M
3300005546|Ga0070696_100000391All Organisms → cellular organisms → Bacteria27027Open in IMG/M
3300006175|Ga0070712_100716083Not Available854Open in IMG/M
3300006893|Ga0073928_10023375All Organisms → cellular organisms → Bacteria → Proteobacteria6234Open in IMG/M
3300007788|Ga0099795_10014636All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2437Open in IMG/M
3300009093|Ga0105240_10030155All Organisms → cellular organisms → Bacteria → Proteobacteria7054Open in IMG/M
3300009143|Ga0099792_10019914All Organisms → cellular organisms → Bacteria2977Open in IMG/M
3300009500|Ga0116229_10002394All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria30270Open in IMG/M
3300009500|Ga0116229_10258977Not Available1479Open in IMG/M
3300009709|Ga0116227_10068236All Organisms → cellular organisms → Bacteria → Proteobacteria3054Open in IMG/M
3300009709|Ga0116227_10287062Not Available1261Open in IMG/M
3300009709|Ga0116227_10354384Not Available1120Open in IMG/M
3300010159|Ga0099796_10024533Not Available1894Open in IMG/M
3300010339|Ga0074046_10053435All Organisms → cellular organisms → Bacteria2677Open in IMG/M
3300010860|Ga0126351_1030860Not Available947Open in IMG/M
3300012202|Ga0137363_11036616Not Available697Open in IMG/M
3300012361|Ga0137360_10273279Not Available1395Open in IMG/M
3300012362|Ga0137361_10401133Not Available1260Open in IMG/M
3300012582|Ga0137358_10032976All Organisms → cellular organisms → Bacteria → Proteobacteria3405Open in IMG/M
3300012683|Ga0137398_10037584All Organisms → cellular organisms → Bacteria → Proteobacteria2793Open in IMG/M
3300012892|Ga0157294_10001019All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium3490Open in IMG/M
3300012896|Ga0157303_10021831Not Available1093Open in IMG/M
3300012917|Ga0137395_10059166All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2439Open in IMG/M
3300012923|Ga0137359_10028560All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria4774Open in IMG/M
3300012925|Ga0137419_10080496All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2200Open in IMG/M
3300012927|Ga0137416_10132483All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter1926Open in IMG/M
3300012927|Ga0137416_10789885Not Available839Open in IMG/M
3300012929|Ga0137404_10063069All Organisms → cellular organisms → Bacteria2881Open in IMG/M
3300012930|Ga0137407_10570983Not Available1060Open in IMG/M
3300012943|Ga0164241_10038494All Organisms → cellular organisms → Bacteria → Proteobacteria3572Open in IMG/M
3300012944|Ga0137410_10102569All Organisms → cellular organisms → Bacteria → Proteobacteria2123Open in IMG/M
3300013314|Ga0175859_1077462Not Available1148Open in IMG/M
3300014158|Ga0181521_10115902All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter1607Open in IMG/M
3300014168|Ga0181534_10498584Not Available688Open in IMG/M
3300014201|Ga0181537_10018266All Organisms → cellular organisms → Bacteria → Proteobacteria4997Open in IMG/M
3300014201|Ga0181537_10511407Not Available821Open in IMG/M
3300014201|Ga0181537_10537156Not Available799Open in IMG/M
3300014654|Ga0181525_10426887Not Available729Open in IMG/M
3300014655|Ga0181516_10111445Not Available1382Open in IMG/M
3300014838|Ga0182030_10331010All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1647Open in IMG/M
3300015200|Ga0173480_10005311All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria5041Open in IMG/M
3300015245|Ga0137409_10000248All Organisms → cellular organisms → Bacteria → Proteobacteria57177Open in IMG/M
3300015245|Ga0137409_10699899Not Available847Open in IMG/M
3300015264|Ga0137403_10124392All Organisms → cellular organisms → Bacteria → Proteobacteria2568Open in IMG/M
3300018019|Ga0187874_10319258Not Available631Open in IMG/M
3300018025|Ga0187885_10117231Not Available1281Open in IMG/M
3300018042|Ga0187871_10047125All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2612Open in IMG/M
3300018043|Ga0187887_10053185All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2486Open in IMG/M
3300018044|Ga0187890_10045745All Organisms → cellular organisms → Bacteria → Proteobacteria2621Open in IMG/M
3300020199|Ga0179592_10058936All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter1752Open in IMG/M
3300020579|Ga0210407_10001833All Organisms → cellular organisms → Bacteria → Proteobacteria19067Open in IMG/M
3300020579|Ga0210407_10088734All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2338Open in IMG/M
3300020579|Ga0210407_10689443Not Available793Open in IMG/M
3300020580|Ga0210403_10012596All Organisms → cellular organisms → Bacteria6844Open in IMG/M
3300020580|Ga0210403_10234358Not Available1506Open in IMG/M
3300020583|Ga0210401_10230447Not Available1711Open in IMG/M
3300021168|Ga0210406_10135598All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2072Open in IMG/M
3300021171|Ga0210405_10074250All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2679Open in IMG/M
3300021181|Ga0210388_10108162All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium2387Open in IMG/M
3300021401|Ga0210393_10291526Not Available1325Open in IMG/M
3300021402|Ga0210385_10691587Not Available780Open in IMG/M
3300021405|Ga0210387_10303345Not Available1405Open in IMG/M
3300021406|Ga0210386_10192095Not Available1725Open in IMG/M
3300021406|Ga0210386_10263630Not Available1472Open in IMG/M
3300021420|Ga0210394_10113663All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2344Open in IMG/M
3300021432|Ga0210384_10445352Not Available1166Open in IMG/M
3300021432|Ga0210384_10539616Not Available1049Open in IMG/M
3300021474|Ga0210390_10487277Not Available1039Open in IMG/M
3300021478|Ga0210402_10107711All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2509Open in IMG/M
3300021478|Ga0210402_10199497Not Available1839Open in IMG/M
3300021479|Ga0210410_10073059All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium3008Open in IMG/M
3300022557|Ga0212123_10044140All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4219Open in IMG/M
3300022880|Ga0247792_1039419Not Available854Open in IMG/M
3300022906|Ga0247766_1057423Not Available988Open in IMG/M
3300025913|Ga0207695_10074681All Organisms → cellular organisms → Bacteria → Proteobacteria3451Open in IMG/M
3300025945|Ga0207679_11224841Not Available689Open in IMG/M
3300026320|Ga0209131_1131096Not Available1290Open in IMG/M
3300026482|Ga0257172_1037920Not Available876Open in IMG/M
3300026557|Ga0179587_10115481Not Available1641Open in IMG/M
3300027857|Ga0209166_10072819Not Available1954Open in IMG/M
3300027857|Ga0209166_10261888Not Available916Open in IMG/M
3300027860|Ga0209611_10004209All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria24700Open in IMG/M
3300027860|Ga0209611_10190023Not Available1264Open in IMG/M
3300027860|Ga0209611_10228614Not Available1122Open in IMG/M
3300027869|Ga0209579_10249524Not Available953Open in IMG/M
3300027903|Ga0209488_10112647All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter2044Open in IMG/M
3300027993|Ga0247749_1021501Not Available692Open in IMG/M
3300028047|Ga0209526_10099643All Organisms → cellular organisms → Bacteria → Proteobacteria2045Open in IMG/M
3300028536|Ga0137415_10209992Not Available1764Open in IMG/M
3300029952|Ga0311346_11024325Not Available664Open in IMG/M
3300030007|Ga0311338_10328382Not Available1675Open in IMG/M
3300030503|Ga0311370_10133040All Organisms → cellular organisms → Bacteria → Proteobacteria3456Open in IMG/M
3300030617|Ga0311356_10755999Not Available926Open in IMG/M
3300030677|Ga0302317_10164486Not Available1032Open in IMG/M
3300030688|Ga0311345_10452267Not Available1126Open in IMG/M
3300031022|Ga0138301_1442335Not Available941Open in IMG/M
3300031234|Ga0302325_10632981All Organisms → cellular organisms → Bacteria1564Open in IMG/M
3300031236|Ga0302324_100345661Not Available2251Open in IMG/M
3300031547|Ga0310887_10050370All Organisms → cellular organisms → Bacteria → Proteobacteria1866Open in IMG/M
3300031562|Ga0310886_10352935Not Available855Open in IMG/M
3300031715|Ga0307476_10674294Not Available766Open in IMG/M
3300031718|Ga0307474_10292420Not Available1254Open in IMG/M
3300031754|Ga0307475_10430644Not Available1060Open in IMG/M
3300031823|Ga0307478_10184454All Organisms → cellular organisms → Bacteria1672Open in IMG/M
3300031908|Ga0310900_10446799Not Available993Open in IMG/M
3300031940|Ga0310901_10171403Not Available846Open in IMG/M
3300031962|Ga0307479_10779105Not Available933Open in IMG/M
3300032144|Ga0315910_10174715All Organisms → cellular organisms → Bacteria1605Open in IMG/M
3300032144|Ga0315910_10476661Not Available961Open in IMG/M
3300032157|Ga0315912_10008723All Organisms → cellular organisms → Bacteria → Proteobacteria8772Open in IMG/M
3300032157|Ga0315912_10090467All Organisms → cellular organisms → Bacteria → Proteobacteria2397Open in IMG/M
3300032174|Ga0307470_10002767All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria6691Open in IMG/M
3300032805|Ga0335078_10412885All Organisms → cellular organisms → Bacteria1767Open in IMG/M
3300032896|Ga0335075_10032913All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales7463Open in IMG/M
3300033134|Ga0335073_10094424All Organisms → cellular organisms → Bacteria3878Open in IMG/M
3300033134|Ga0335073_10283074Not Available2000Open in IMG/M
3300034129|Ga0370493_0178342Not Available704Open in IMG/M
3300034130|Ga0370494_052626Not Available1030Open in IMG/M
3300034659|Ga0314780_032013Not Available967Open in IMG/M
3300034660|Ga0314781_030454Not Available891Open in IMG/M
3300034661|Ga0314782_025575Not Available1040Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil17.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.11%
Host-AssociatedHost-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated6.11%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog5.34%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil5.34%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil4.58%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.58%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa4.58%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland3.82%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil3.05%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.05%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.53%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring1.53%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.53%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.53%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil1.53%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.53%
BogEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Bog1.53%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.53%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.76%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil0.76%
BogEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Bog0.76%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.76%
Plant LitterEnvironmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter0.76%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.76%
Moss AssociatedHost-Associated → Plants → Phyllosphere → Unclassified → Unclassified → Moss Associated0.76%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.76%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005337Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3L metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009500Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MGHost-AssociatedOpen in IMG/M
3300009709Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fb - Sphagnum magellanicum MGHost-AssociatedOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010339Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3EnvironmentalOpen in IMG/M
3300010860Boreal forest soil eukaryotic communities from Alaska, USA - C5-2 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012892Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1EnvironmentalOpen in IMG/M
3300012896Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012943Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013314Moss microbial communities from three moss species from boreal forest in Fairbanks, Alaska, USA ReanalysisHost-AssociatedOpen in IMG/M
3300014158Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaGEnvironmentalOpen in IMG/M
3300014168Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_10_metaGEnvironmentalOpen in IMG/M
3300014201Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_10_metaGEnvironmentalOpen in IMG/M
3300014654Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_10_metaGEnvironmentalOpen in IMG/M
3300014655Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin01_10_metaGEnvironmentalOpen in IMG/M
3300014838Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015200Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018019Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_150EnvironmentalOpen in IMG/M
3300018025Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_20_100EnvironmentalOpen in IMG/M
3300018042Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_10EnvironmentalOpen in IMG/M
3300018043Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_7_10EnvironmentalOpen in IMG/M
3300018044Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_21_10EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300022880Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S106-311C-6EnvironmentalOpen in IMG/M
3300022906Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L223-509R-6EnvironmentalOpen in IMG/M
3300025913Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025945Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027860Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG (SPAdes)Host-AssociatedOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027993Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S199-509C-5EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300029952II_Bog_N3 coassemblyEnvironmentalOpen in IMG/M
3300030007I_Palsa_E1 coassemblyEnvironmentalOpen in IMG/M
3300030503III_Palsa_E3 coassemblyEnvironmentalOpen in IMG/M
3300030617II_Palsa_N2 coassemblyEnvironmentalOpen in IMG/M
3300030677Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Palsa_N3_3EnvironmentalOpen in IMG/M
3300030688II_Bog_N2 coassemblyEnvironmentalOpen in IMG/M
3300031022Forest soil microbial communities from Spain - ITS-tags Site 9-Mixed-thinned forest site A3_MS_spring Metatranscriptome (Eukaryote Community Metatranscriptome) (version 2)EnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031236Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1EnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300031940Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D2EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M
3300032896Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4EnvironmentalOpen in IMG/M
3300033134Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.2EnvironmentalOpen in IMG/M
3300034129Peat soil microbial communities from wetlands in Alaska, United States - Sheep_creek_fen_01D_16EnvironmentalOpen in IMG/M
3300034130Peat soil microbial communities from wetlands in Alaska, United States - Collapse_03_16EnvironmentalOpen in IMG/M
3300034659Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034660Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R2 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034661Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0062385_1013030813300004080Bog Forest SoilMSASDTTMPLISAPWLNAEGKPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLGGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLASFLATLRERTCQGEVSHWLGELTAQARAVWSQRYAWAMRPHEAHRSDPAIRGWLMTLPASAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRRARVAHANAPALEEVSFAVETPGNRRRPPLKR
Ga0062593_10085871813300004114SoilMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTSRRVAPAILGTAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRTW
Ga0062386_10037387423300004152Bog Forest SoilMSAPDTMMPLVSAPWMDAEGNRLSVRPLPGLRSDVIQSLECSYPGILKPAMKNLLSTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAIDDAERRWIAEIGDKELPGPIWCVFPDPEVAVYVCDDLAAFVAALAERTCRGEMQAWLQDLITQARTVWSRRHARASRPHEACHLDRAIRGWLLGLPFDAYVYDLRSPRTPRGWPYGVLGPSGRQYRCGRLPVFAVAGLPAEGWRAPHPTARD
Ga0062589_10011005823300004156SoilMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTSRRVAPAILGTAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRTWGRSRKVPLETRPAARTFRANALAKTGAYS*
Ga0070682_10004523223300005337Corn RhizosphereMATPDTIWPLVSAPWLDADGNPLTVRPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA*
Ga0070682_10008172313300005337Corn RhizosphereMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIVSPQMKALLGTCCGLADTDLGSIDFTGCWFDEEPSTVFRPALTLAIDDAGRRWIAEVGNGDLPGPVWCVFPDPEVIVHVSDDLAAFVSTLHEYTCRGEMQTWLHRLNDEARSVWSQRHVLAFRPHEIFKSDRAIRAWVSGLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTPRRVAPAILGMAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRAWGRSRKVPLETRPAARTFRANAPAKTGAYS*
Ga0070710_1085543913300005437Corn, Switchgrass And Miscanthus RhizosphereLGTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAVDDAERRWVAEIGDKDLPGPVWCVFPDPEVAVYVSDDLAAFVAALGERTCRGEMRAWLQDLTTQARTVWSRRHAWASRPHEACHVDRAIRGWLSGLPVDAYVYDLRAPRTPRGWPYGVLGPSGRQFRCGRLPVFAVAGSPAEGWRTPHPRTRDATTRTPATAAVSFAIETRESGRQRRSH
Ga0070711_10000755333300005439Corn, Switchgrass And Miscanthus RhizosphereMSALDTMMPLVSAPWLNTDGNPLSVTALPGLRSEVVQGLECTYTGILTPAMKALLGTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAVDDAERRWVAEIGDKDLPGPVWCVFPDPEVAVYVSDDLAAFVAALGERTCRGEMRAWLQDLTTQARTVWSRRHAWASRPHEACHLDRAIRGWLSGLPVDAYVYDLRAPRTPRGWPYGVLGPSGRQFRCGRLPVFAVAGSPAEGWRTPHPRTRDATTRTPATAAVSFAIETRESGRQRRSHSENPGRWPFSRRGSAYRRPIQGAQNVVSELRPCA*
Ga0070730_1010370923300005537Surface SoilMMSRSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFLEEPYPVFRPALTLAIDDAERRWIGEIGNQDLPGPIWCVFPRPEVAVYVCDDLAAFLELLRERTCRGEMSAWLQNLTAQARAVWSRRHALALRPHEAHRADRAIRGWLRTLPSDAYVYDLRTPTDARGWPYGVVGPSGRYYRCGRELVFAVAGLPAQGWRAPHPRARIASTPLGPANAEVSFAIETARGRPRPPVNPEHPRWISSRRGGSSPPRPLGQRAQTTPWELRPCA*
Ga0070730_1040452213300005537Surface SoilPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGMLSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRGRVPSLPQPAPSAEVSFAIGAAGGRRQPRARPERPARWVSGRRTAGRLSPSVGRR
Ga0070733_1032542523300005541Surface SoilPMKDLVGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRREMGTWLRNLTAQARAVWSRRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLSAQGWRVPHPRGRVPSLPQAAASAEVSFTIGAAGGRRQLRARPERPARWVSGRRMAGRLPPSVGRRVEPAVLEERPCA*
Ga0070696_100000391103300005546Corn, Switchgrass And Miscanthus RhizosphereMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIRSLECSYPGILSPPMKDLLGSCCGLARTELGSIDFTGCWFQKEPYRVFRPALTLAIDDAERRWIGEVGYKDLPGPIWCVFPKPEVAVYVSDDLPAFLGVLRERTCRGEMGTWLQNLTAQARTVWSRRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRQYRCGRELVFAVAGLPAQGWRVPHPRGRVPSLPQAAASAEVSFAIGAAGGRRQPRARPERPARWVSGRRTAGRLSPSVGHRVEAAVLLEERPCA*
Ga0070712_10071608313300006175Corn, Switchgrass And Miscanthus RhizosphereMSALDTMMPLVSAPWLNTDGNPLSVTALPGLRSEVVQGLECTYTGILTPAMKALLGTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAVDDAERRWVAEIGDKDLPGPVWCVFPDPEVAVYVSDDLAAFVAALGERTCRGEMRAWLQDLTTQARTVWSRRHAWASRPHEACHLDRAIRGWLSGLPVDAYVYDLRAPRTPRGWPYGVLGPSGRQFRCGRLPVFAVAGSPAEGWRAPHPRAR
Ga0073928_1002337583300006893Iron-Sulfur Acid SpringMSAPDTMMPLVSAPWLKADGNPLSVRALPGLRSEVIQGLECSYPGILSPIMKTLLSTCCGLAGIELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGNQDLPGPVWCVFPDPEVAVYVCDDLAAFVATLREHTSRGEMHAWLQGLTAEARTVWSRRHALAMRPHEAYHSDRAIRGWLLGLPFDAYVYDLRSPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPHAAPAPGMQASAELSFAIETPQSGLQASI*
Ga0099795_1001463633300007788Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSRRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPESPARASSLSRRASSRLHGRESSAVEVRPCA*
Ga0105240_1003015563300009093Corn RhizosphereMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDFAAFLEVLRERTCRGEMGTWLRHLTAQARAVWSRRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAQGWRVPHPRGRVPSLPQAAASAEVSFAIGAAGGRRLLRARPERPARWVSGRRMAGRLSPSVGRRLETAVLEERPCA*
Ga0099792_1001991433300009143Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRASSRLHGRESAAVEVQPCA*
Ga0116229_10002394413300009500Host-AssociatedMMPLISAPWLDADGNPLSVKALPGLRAEVIQNLECSYPGILSPTMKDLLGTCHGLAGTALGSIDFTGCWFPEEPCAVFRPALTLAVDDEQRRWIAEVGNRDLPGPVWCVFPDPQVAVYVCDDLAAFLATLREHTARGEMHAWLQRLTAEACAVWSQRHGAALRPHQAHRSDRAIRSWLSGLPFDAYVYDLRAPSTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRRRAAPTPGMAASTELSFAIETPQSGRKRRSNPEDPGRWPFCRRTAGRTRRSPTHDAQIVGLELRPCA*
Ga0116229_1025897713300009500Host-AssociatedKMSAPDTMMPLISAPWLDADGNPSSVKALPGLRPEVIQSLECSYPGILSPRMKALLGACCGLTGTELGSIDFTGCWFPEEPCAVFRPALTLAVDDDGRRWIAEVGNRDLPGPVWCVFPEPEVAVYVSDDLAAFLATLHDHTSRGEMHAWLRDLTAEARSVWSQRHLSAMRPHEAHRSDRAIRGWLLGLPFDAYVYDLRAPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWWAPYPRPRSAPTPGMSAIAEVSFAIDTPQSGRKRRSNPAHPGRRDPEPARVSRAPDPQTRQPRTAVGVRRSVPSLSWRISGRPRRHPIQDTRAAALELRPCA*
Ga0116227_1006823623300009709Host-AssociatedMKDLLSRSCGLGGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDFPGPIWCVFPRPEVAVYVSDDLATFLATLRERTCRGEVSHWLDELTAQARAVWAQRHAGAIRPHQAHRADPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRATRRKPRVAHAKAPAMEDVSVAIETARSRRRPAIQREIAVPIAA*
Ga0116227_1028706223300009709Host-AssociatedMKEVLSRSCGLAGTALGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEIGTGDLPGPIWCVFPDPEVAVHVSDDLAAFLATLHGRTGQGEVSRWLDELTAQARAVWSQRHAGAIRPYEAHRTDPAIRGWLMTLPAGAYVYDLRERSAARGWPYGVAGPSGRYYRCGRLPVFGVAGSPAEGWRASRRAVRAAHAKASAGEEASFAVEAARGRRWPPIKRETPVPRTGGGRALKWGGWRSGLHRMATAQHELRPCA*
Ga0116227_1035438423300009709Host-AssociatedMKRLLSTCCGLGGTELGSIDFTGCWFPEEPCAVFRPALTLAIDDAGRRWIAEVGNRDLPGPVWCVFPDPEVAVYVGDDLAAFLAALREHSSRGEMQAWLQRLTAEARSVWSQRHRAALRPHEAHRSDPAIRGWLLGLPFDAYVYDLRSPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRLRAAPTRGMPASAEISFAIDTPQSGRKR
Ga0099796_1002453313300010159Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPESPARASSLSRRASSRLHGRESAAVEVRPCA*
Ga0074046_1005343533300010339Bog Forest SoilMDAEGNRLSVRPLPGLRSDVIQSLECSYPGILKPAMKNLLSTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAIDDAERRWIAEIGDKELPGPIWCVFPDPEVAVYVCDDLAAFVAALAERTCRGEMQAWLQDLITQARTVWSRRHARASRPHEACHLDRAIRGWLLGLPFDAYVYDLRSPRTPRGWPYGVLGPSGRQYRCGRLPVFAVAGLPAEGWRAPHPTARDAAAKAPAHAKVPFAIETPQSGRQRRSHPEVPRRWPLSRRGSACRRPIQGTQTAVSELRPCA*
Ga0126351_103086013300010860Boreal Forest SoilMKALLGTCCGLAGTEIGSIDFTGCWFLEEPYAVFRPALTLAVDDAERRWIAEIGDKDLPGPVWCVFPDPEVAVYVSDDVAAFISTIRAHTCRGEMHAWLHGLTAQARTVWSRRHAWAMRPHEAARLDRGIRGWLLGLPMDAYVYDLRVPGLARGWPYGVAGPSGRLYRCGRLPVFAVAGFPAEGWRAPRPRSRDATTRLPAHAEVSFAIETPRSGRKPLHNPEIPAPWPSSRRAWSRPFRPPIHGGQTD
Ga0137363_1103661613300012202Vadose Zone SoilGHPLSVTPLPGLRSEVIKGFECSYPGILSPRMKEVLGSCCGLAGTELGSIDFTGCWFLEEPYAVFRPALTLAVDDAERRWIAEIGDKDLPGPVWCVFPDPEVAVYVSDDLAAFMAAIREYTGRGEMHAWLHRLTARARTVWSRRHALAMRPHEAAHSDRAIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRFYRCGRLPVFAVAGLPAEGWRAPRPRSRDAATRAP
Ga0137360_1027327913300012361Vadose Zone SoilMKALLGNSCGLAGTEIGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEIGDRDLPGPVWCVFPDPEVAVYVSDDLAAFMAAIREYTGRGEMHAWLHGLTARARTVWSRRHALAIRPHEAAHSDRAIRGWLLGLPMDAYVYDLRVQGIVRGWPYGVAGPSGRFYRCRRLPVFAVAGLPAEGWRAPRPRSRDATVRAPASAEVSFAIETPHSGRKPRPNPENPGRWPLSRRASGRQLRPPIRGRTAMVELRPCA*
Ga0137361_1040113323300012362Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVMGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRAASRLHGRESAAVEVRPCV*
Ga0137358_1003297643300012582Vadose Zone SoilMQALLGTCCGLAGTEIGSIDFTGCWFLEEPCAVFRPALTLAIDDAERRWIAEIGDRDLPGPVWCVFPDPEVAVYVSDDLAAFMAAIREYTGRGEMHAWLHRLTARARTVWSRRHALAMRPHEAAHSDRAIRGWLLGLPMDAYVYDLRVQGIVRGWPYGVAGPSGRFYRCGRLPVFAVAGLPAEGWRTPRPRSRDACTKAPVRAEVSFAIQTPHSGRKPRANPENPGRWPLSRRASAARYRPLPRVGLPWWSYAHAREKSC*
Ga0137398_1003758413300012683Vadose Zone SoilEPWAVFRPALTLAIDDAERRWIAEVGDKELPGPVWCVFPHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHDAAHSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSSSRRASSRLHGRESAAVEVRPCA*
Ga0157294_1000101933300012892SoilMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTSRRVAPAILGTAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRTWGRSRKVPLETRPAARTFRANALAKTGAYS*
Ga0157303_1002183123300012896SoilMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTS
Ga0137395_1005916633300012917Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLRNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRASSRLHGRESAAVEVRPCA*
Ga0137359_1002856033300012923Vadose Zone SoilMQALLGTCCGLAGTEIGSIDFTGCWFLEEPCAVFRPALTLAIDDAERRWIAEIGDRDLPGPVWCVFPDPEVAVYVSDDLAAFMAAIREYTGRGEMHAWLHRLTARARTVWSRRHALAMRPHEAAHSDRAIRGWLLGLPMDAYVYDLRVQGIVRGWPYGVAGPSGRFYRCGRLPVFAVAGLPAEGWRTPRPRSRDACTKAPVRAEVSFAIQTPHSGRKPRANPENPGRWPLSRRASGSPLPPPAQGRTAMVELRPCA*
Ga0137419_1008049643300012925Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWTVFRPALTVAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAAQSDRSIRGWLLALPSDAYVYDLRRRSDARGWPYGVMGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRASSRLHGRESAAVEVRPCA*
Ga0137416_1013248323300012927Vadose Zone SoilMQALLSGCCGLAGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEIGDRDLPGPVWCVFPDPEVAVYVSDDLATFMATIRESTGRGEMHAWLHGLTAQARTVWSWRHTLAMRPHEAAHSDRAIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRFYRCGRLPVFAVAGLPAEGWRAPRPRSRDAATRAPASAEVSFAIETPHSGRKPRPTPENPGRWPLGRRASGSPLRPPVRGRTAMVELRPCA*
Ga0137416_1078988513300012927Vadose Zone SoilLKTLLGTFCGLAGTELGGIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAAHSDRSIRGWLLALPSDAYVYDLRRRSDARGWPYGVMGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRASSRLHGRESSAVEVRPCA*
Ga0137404_1006306943300012929Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWAVFRPALTLAIDDAERRWIAEVGDKELPGPVWCVFPHPEVTIYVSDDLADFMATLRESTCRGEMGAWLQNLTAQARAVWSRRHALAMRPHDAAHSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSSSSRRASSRLHGRESAAVEVQPCA*
Ga0137407_1057098313300012930Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWAVFRPALTLAIDDAERRWIAEVGDKELPGPIWCVFPHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHDAAHSDRSIRGWLLALPSDAYVYDLRRRSDARGWPYGVMGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSSSSRRASSRLHGRESAA
Ga0164241_1003849413300012943SoilMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA*
Ga0137410_1010256913300012944Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSSSSRRASSRLHGRESAAVE
Ga0175859_107746213300013314Moss AssociatedKALPGLRPEVIQGLECSYPGILSPRMKALLGACCGLTGTELGSIDFTGCWFPEEPCAVFRPALTLAVHDNGRRWIAEVGNRDLPGPVWCVFPDPEVAVYVSDDLAAFLATLHDHTSRGDMHAWLRDLTAEARSVWSQRHLSAMRPHEAHRSDRAIRGWLLGLPFDAYVYDLRAPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWWAPYPRPRSAPTPGMSAIAEVSFAIDTPQSGRKRRSNPADPGRWPMSRRTLGRPRRLPIQDSRSAALELRPCA*
Ga0181521_1011590223300014158BogMDAEGNPLSVSPLPGLRCDVIQSLECSYPGILKPAMKNLLSTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAIDDAERRWIGEIGDKELPGPIWCVFPNPEVAVYVCDDLAAFVAALAERTCRGEMQAWLQDLITQARTVWSRRHARASRPHEACHLDRAIRGWLLGLPCDAYVYDLRSPRTPRGWPYGVLGPSGRQYRCGRLPVFAVAGLPAEGWRAPHPTARDAATKAPAHAEVPFAIETPQSRRQRRSRPENPGRWPLSRRGSACRRPIQGTQTAASELRPCA*
Ga0181534_1049858413300014168BogDFTGCWFLEEPYTIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSHWLDELTAQARAVWAQRHAGAIRPHQAHRADPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAKAPDLAEASVAIETARSRRRPAIQREIPAPWIGGRRALEWGGWRSGLQRMS
Ga0181537_1001826633300014201BogMKELLSRSCGLGGTELGSIDFTGCWFLEEPYSIFRPALTLAIDDRERRWIAEVGNGNLPGPIWCVFAQPEVAVYVSDDLAGFLATLRERTCRGEVSHWLDELTAQARAVWSQRHAWAIRPHEAHRSDPAIRGWLMTLPSGAYVYDLRAPSAVRGWPYGVAGPSGRHYRCGRIPVFAVAGLPAEGWRASRRRARVVYAKAPASEEVSFAVETAGSRRRFPVKREPPAPWTWGRRALKGDGRSARELRSCA*
Ga0181537_1051140713300014201BogLSRSCGLAGTELGSIDFTGCWFLEEPYTIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVLPHPEVAVYVSDDLATFLATLRERTCRGEVSHWLDELTAQARAVWAQRHAVAIRPHQAHRSDPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAKASALEEVSFAIETARSRRRPAIQREIPAPWTGGKRALESGGWRSGFRVMPAAQRELRSCA*
Ga0181537_1053715613300014201BogFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSRWLDALGAQARAVWSQRHVRAIRPHEAHRSDPAIRGWLMTLPAGAYVYDLRARSVARGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRKARVAHAKPPAMEEVSFAIETARSRRRPSIQRERPAPRTWSGWVLKGDGYRSGFHHMPAPQAELRSCA*
Ga0181525_1042688713300014654BogSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDFPGPIWCVFPRPEVAVYVSDDLATFLATLRERTCRGEVSHWLDELTAQARTVWAQRHAGAIRPHQAHRSDPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRKARVAHANAPAMEELSFAIETARSRRRPAIQREISAPIAA*
Ga0181516_1011144513300014655BogMKDLLSRSCGLGGTELGSIDFTGCWFLEEPYTIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPRPEVAVYVSDDLATFLATLRERTCRGEVSHWLDELTAQARAVWAQRHAGAIRPHEAHRSDPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRKARVVHAKAPDLAEASVAIETARSRRRPAIQREIPAPWIGGRRALEWGGWRSGLQRMSAAQRELQSCA*
Ga0182030_1033101013300014838BogMKALLGTCCGLAGTDLGSIDFTGCWFLEEPYSVFRPALTLAVDDAERRWIAEVGNQDFPGPVWCVFPEPEVAVYVCDDLATFVATLREHTCRGEMRAWLQHLTAQARTVWSRRHALAMRPHEAYHSDRAIRGWLVGLPFDAYVYDLRAPSTARGWPYGVAGPLGRLYRCGRLPVFAVAGLPSEGWRAAHPRPRVAPTPGIGASVELSFAIETTGSGPKRRSNPENPARWALSRHTSGRARRPYFE
Ga0173480_1000531143300015200SoilMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTPRRVAPAILGTAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRTWGRSRKVPLETRPAARTFRANALAKTGAYS*
Ga0137409_10000248113300015245Vadose Zone SoilMKALLGNSCGLAGTEIGSIDFTGCWFLEEPYAVFRPALTLAIDDAEHRWIAEIGDRDLPGPVWCVFPDPEVAVHVSDDLSAFMAAIREYTGRGEMHAWLHGLTARARTVWSRRHALAIRPHEAAHTDRAIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRFYRCGRLPVFAVAGLPAEGWRAPRPRSRDATTKVPVRAEGSFAIETPQSGRKPRPNPENPGRWPLSRRESGRPLRPPIRGRTAMVELRPCA*
Ga0137409_1069989913300015245Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRK
Ga0137403_1012439243300015264Vadose Zone SoilMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWTVFRPALTLAIDDAERRWIAEVGDKELPGPVWCVFPHPEVTIYVSDDLADFMATLRESTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHDAAHSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSSSSRRASSRLHGRESAAVEVQPCA*
Ga0187874_1031925813300018019PeatlandTGCWFLEESYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSRWLDELTAQARAVWWQRHAWAIRPYEAHRMDPAIRGWLMTLPSGAYVYDLRAQSTVRGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGGRASRRRARVAHAKAPATAEVAFGIEAARSRRRPVIQREIAAPDTG
Ga0187885_1011723113300018025PeatlandPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLGGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSEDLATFLATLRERTCHGDVSRWLGELTAQARAVWWQRHAWAIRPYEAHRMDPAIRGWLMTLPSGAYVYDLRAQSTVRGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRRARVAHAKAPATAEVAFAIEAARSRRRPVIQREIAAPDTGVRRAPEWDGWRSGFHSMPAGQRELRSCA
Ga0187871_1004712523300018042PeatlandMKELLSRSCGLGGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSEDLATFLATLRERTCHGDVSRWLGELTAQARAVWWQRHAWAIRPYEAHRMDPAIRGWLMTLPSGAYVYDLRAQSTVRGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRRARVAHAKAPATAEVAFAIEAARSRRRPVIQREIAAPDTGVRRAPEWDGWRSGFHSMPTGQRELRSCA
Ga0187887_1005318533300018043PeatlandMSTSDTTMPLISAPWLNGEGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLGGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSEDLATFLATLRERTCHGDVSRWLGELTAQARAVWWQRHAWAIRPYEAHRMDPAIRGWLMTLPSGAYVYDLRAQSTVRGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRRARVAHAKAPATAEVAFAIEAARSRRRPVIQREIAAPDTGVRRAPEWDGWRSGFHSMPAGQRELRSCA
Ga0187890_1004574543300018044PeatlandMSTSDTTMPLISAPWLNREGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLGGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSEDLATFLATLRERTCHGDVSRWLGELTAQARAVWWQRHAWAIRPYEAHRMDPAIRGWLMTLPSGAYVYDLRAQSTVRGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGWRASRRRARVAHAKAPATAEVAFAIEAARSRRRPVIQREIAAPDTGVRRAPEWDGWRSGFHSMPAGQRELRSCA
Ga0179592_1005893613300020199Vadose Zone SoilMSAPDTLMPLVSAPWLDADGHPLSVTSLSGLRSEVIQNLECSYPGILKPAMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGA
Ga0210407_1000183393300020579SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGYRVKPAVLEERPCA
Ga0210407_1008873443300020579SoilMSTPDTMMPLVSAPWLNANGYPLSVRALPGLRSEVIQGLECSYPGILSPTMKALLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGYQDLPGPVWCVFADPEVAVYVCDDLAAFVTTLREHSSRGEMHAWLQGLTAEARTVWSQRHALAMRPNEAYHSHRAIRGWLLGLPFDAYVYDLRAPRTVHGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRAVPAPGMPASAELSFATESPQSGRKRRSNPEDPARGSLSRRTSGRPRRLPIQGAQTAVLELRPCA
Ga0210407_1068944313300020579SoilFTGCWLPEESNAVFRPALTLAIDAAGRRWIAESGDKDLPGPVWCVFPDPEVAVYVSDDLASFVATIHEYTGRGEMHAWLHGLTAQARTVWSRRHAWATRPHEAARSDRGIRGWLLGLPMDAYVYDLCVQGIARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRARRPRSRDASTRASARAEVSFAIETPRSGRKPRPDPANPRRWHVSRRAWSRPLRPLIQGGQTDVSELRPCA
Ga0210403_1001259613300020580SoilMSAPDTMMPLVSAPWLKADGNPLSVRALPGLRSEVIQSLECSYPGILSPTMKTLLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGTQDLPGPVWCVFPDPEVAVYVCDDLAAFVATLREHTSQGEMHAWLQGLTAEARTVWSRRHAVAMRPHEAYHSDRAIRGWLLGLPFDAYVYDLRAPRTMHGWPYGVAGPAGRLYRCGRLPVFAVAGLPAEGWRAPSPRPRAAPTPEMPASAEVSFAIETPRSGRKRRSDPEDPARWPLSRRTSGRPRRLPIQGAQAAVLELRPCA
Ga0210403_1023435823300020580SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGVAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGHRV
Ga0210401_1023044723300020583SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRCEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPSLPQAARSAEVSFAIGAAVGRGQPRARPERPARWVSGRRTAGRLPPSVGHRVKPAVLEERPCA
Ga0210406_1013559823300021168SoilMSTPDTMMPLVSAPWLNANGYPLSVRALPGLRSEVIQGLECSYPGILSPTMKALLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGYQDLPGPVWCVFADPEVAVYVCDDLAAFVTTLRENTSRGEMQAWLQGLTAEARTVWSQRHALAMRPNEAYHSHRAIRGWLLGLPFDAYVYDLRAPRTVRGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRAVPAPGMPASAELSFATESPQSGRKRRSNPEDPARGSLSRRTSGRPRRLPIQGAQTAVLELRPCA
Ga0210405_1007425023300021171SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGHRVKPAILEERPCA
Ga0210388_1010816243300021181SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRCEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDAPGWPYGVLGPSGRHYRCGRELVFAVAGLPAEGWPVGHRVKPAILEERPCA
Ga0210393_1029152623300021401SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRCEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGHRVKPAILEERPCA
Ga0210385_1069158713300021402SoilMSASDTTMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLAGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVWVSDDLATFLATFRERTCQGEVSCWLDELTAQARAVWSQRHAWAIRPHEAHRSDPGIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRR
Ga0210387_1030334513300021405SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRCEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRLDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGYRVKPAVLEERPCA
Ga0210386_1019209513300021406SoilMSASDTTMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLAGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVWVSDDLATFLATFRERTCQGEVSCWLDELTAQARAVWSQRHAWAIRPHEAHRSDPGIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRRARVVHAKAPATMPATEEVSFAIETARSRRKPAIQREIPVAWTGGRRALEWGGWRSTFHRMPAARRESRSCA
Ga0210386_1026363023300021406SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGELGDKDLPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPSLPQAATSAEVSFAIGAAVGRRQRRARLERPARWVSGRRTAGRLPPSVGHRVKPAVLEERPCA
Ga0210394_1011366323300021420SoilMSTPDTMMPLVSAPWLDADGSPLSVRALPGLRSEVIQNLECSYPGILSPAMKALLGTCCGLAGTDLGSIDFTGCWFLEEPYSVFRPALTLAVDDAERRWIAEVGNQDLPGPVWCVFPDPEVAVHVCDDLAAFLATLHQHTCRGETHAWLRGLNAQARTVWSRRHALAMRPHDAHHSDRAIRGWLSGLPLDAYVYDLRAPRNARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRVAPTPGLPASAELSFAIETPKDVRMRRSNPENPGRRTLSRQTSGRVRRPPIQDAQAGVLEVRRCA
Ga0210384_1044535213300021432SoilMSTPDTMMPLVSAPWLNANGYPLSVRALPGLRSEVIQGLECSYPGILSPTMKALLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGYQDLPGPVWCVFADPEVAVYICDDLAAFVTTLREHSSRSEMQAWLQGLTAEARTVWSQRHALAMRPNEAYHSHRAIRGWLLGLPFDAYVYDLRAPRTVRGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRAVPAPGMPASAELSFATESPQSGHKRRSNPEDPARGSLSRRTSGRPRRLPIQGAQTAVLESRPCA
Ga0210384_1053961613300021432SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDTERRWIGEVGDKDIPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPILPEAATSAEVPPSVGY
Ga0210390_1048727723300021474SoilMSASDTTMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKELLSRSCGLAGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVWVSDDLATFLATFRERTCQGEVSCWLDELTAQARAVWSQRHAWAIRPHEAHRSDPGIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRRARVVHAKAPATMPAT
Ga0210402_1010771123300021478SoilMSAPDTMMPLVSAPWLNANGYPLSVRALPGLRSDVIQGLECSYPGILSPTMKALLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGYQDLPGPVWCVFSDPEVAVYVCDDLAAFVTTLRENTSRGEMQAWLQGLTAEARTVWSQRHALAMRPNEAYHSDRAIRGWLLGLPFDAYVYDLRAPRTVRGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRAVPAPGMPASAELSFATESPQSGHKRRSNPEDPARGSLSRRTSGRPRRLPIQGAQTAVLELRPCA
Ga0210402_1019949723300021478SoilMSAPDTMMPLISAPWLDADGNPLFVTPLPGLRSEVIQSLEWSYPGNLTPAMKALLSTCCGLAGTEIGSIDFTGCWFPEESNAVFRPALTLATDAAGRRWIADSGDKDLPGPVWCLFPDPEVAVYVSDDLASFMATIHEHTGRGEMHAWLHGLTAQARMVWSRRHAWATRPHEAARSDRGIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPRPRSRDASTRASARAEVSFAIETPRSGRKPRPDPANPRRWPVSRRAWSRPLRPPIQGGQTDVSELRSCA
Ga0210410_1007305933300021479SoilMSAPDTMMPLVSAPWLDADGNPLSVTPLPGLRSEVIQSLEWSYPGNLTPAMKALLSTCCGLAGNEIGSIDFTGCWFPEESNAVFRPALTLAIDAARRRWIAESGDKDLPGPVWCVFPDPEVAVYVSDDLASFIATIHEHTGRGEMHAWLHGLTAQARTVWSRRHAWATRPHEAARSDRGIRGWLLGLPMDAYVYDLRVYGIARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPRPRSRDASTRASARAEVSFAIETPRSGRKPRPDPANPRRWHVSRRAWSRPLRPLIQGGQTDVSELRPCA
Ga0212123_1004414033300022557Iron-Sulfur Acid SpringMSAPDTMMPLVSAPWLKADGNPLSVRALPGLRSEVIQGLECSYPGILSPIMKTLLSTCCGLAGIELGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGNQDLPGPVWCVFPDPEVAVYVCDDLAAFVATLREHTSRGEMHAWLQGLTAEARTVWSRRHALAMRPHEAYHSDRAIRGWLLGLPFDAYVYDLRSPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPHAAPAPGMQASAELSFAIETPQSGLQASI
Ga0247792_103941913300022880SoilMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTSRRVAPAILGTAAGEPAFAIDNSVGRRS
Ga0247766_105742313300022906Plant LitterMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTSRRVAPAILGTAAGEPAFAIDNSVGRRSRRSGRRNPRRLPAGHRTWGRSRKVPLETRPAARTFRANALAKTGAYS
Ga0207695_1007468163300025913Corn RhizosphereMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDFAAFLEVLRERTCRGEMGTWLRHLTAQARAVWSRRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAQGWRVPHPRGRVPSLPQAAASAEVSFAIGAAGGRRLLRARPERPARWVSGRRMAGRLSPSVGRRLETAVLEERPCA
Ga0207679_1122484113300025945Corn RhizosphereMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRTWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAG
Ga0209131_113109613300026320Grasslands SoilMSTSDAMMPLISAPWLNADGHPLSVTPLPGLRSEVIQGFECSYPGILSPRMKEVLGSCCGLAGTELGSIDFTGCWFLEEPYPVFRPALTLAIDDAERRWIAEIGSKDLPGPIWCVFPKPEVAVYVSDDLAAFLEVVRERTCRGEMGAWLQNLTAQARTVWSRRHAWALRPHEAHRSDRAMRGWLTTLPSDAYVYDLRTRSDARGWPYGVVGPSGRQYRCGRELVFAVAALPAQGWWAPHPRAHIGSRPFAPEGAETAFASEIARERRKPRVEPERPARWTSGRRTSSRPPSPLVQGAEVAAALESRPCA
Ga0257172_103792013300026482SoilDTMMPLVSAPWLNADGNPLSVRALPGLRSEVIQGLECSYPGILSPTMKALLRTCCGLAVTDLGCIDFTGCWFLEEPCAVFRPALTLAVDDAERRWIAEVGYQDLPGPVWCVFPDPEVAVYVCDDLAAFVATLREHTSQGEMHAWLQGLTAEARIVWSQRHALAMRPHEAYHSDRAIRGWLLGLPFDAYVYDLRAPRTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRPRAASTPEMPASAELSFAIETPQSGRKRRSNPEDPARWPLSRRTSGRPRRLPIQG
Ga0179587_1011548123300026557Vadose Zone SoilMSAPDTLMPLVSAPWLDADGHPLSVRSLSGLRPEVIQNLECSYPGILKPAMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSSSSRRASSRLHGRESAAVEVRPCA
Ga0209166_1007281933300027857Surface SoilMMSRSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFLEEPYPVFRPALTLAIDDAERRWIGEIGNQDLPGPIWCVFPRPEVAVYVCDDLAAFLELLRERTCRGEMSAWLQNLTAQARAVWSRRHALALRPHEAHRADRAIRGWLRTLPSDAYVYDLRTPTDARGWPYGVVGPSGRYYRCGRELVFAVAGLPAQGWRAPHPRARIASTPLGPANAEVSFAIETARGRPRPPVNPEHPRWISSRRGGSSPPRPLGQRAQTTPWELRPCA
Ga0209166_1026188813300027857Surface SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGMLSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDLAAFLGVLRERTCRGEMGTWLQILTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRGRVPSLPQPAPSAEVSFAIGAAGGRRQPRARPERPARWVSGRRTAGRLSPSVGRRVK
Ga0209611_1000420983300027860Host-AssociatedMSAPNTMMPLISAPWLDADGNPLSVKALPGLRAEVIQNLECSYPGILSPTMKDLLGTCHGLAGTALGSIDFTGCWFPEEPCAVFRPALTLAVDDEQRRWIAEVGNRDLPGPVWCVFPDPQVAVYVCDDLAAFLATLREHTARGEMHAWLQRLTAEACAVWSQRHGAALRPHQAHRSDRAIRSWLSGLPFDAYVYDLRAPSTARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRRRAAPTPGMAASTELSFAIETPQSGRKRRSNPEDPGRWPFCRRTAGRTRRSPTHDAQIVGLELRPCA
Ga0209611_1019002323300027860Host-AssociatedMSAPDTMMPLVSAPWLDADGNPLSVKALPGLRPDVIQSLESSYPGILSPTMRALLGTCCGLAGTELGSIDFTGCWFLEEPCAVFRPALTLAVDDAGRRWIAEVGNQDLPGPLWCMFPDPEVALYVCDDLAAFLTMLRQHSAQGEMHAWLQRLTVEARLVWSQRHVWAMRPHEAHRADRGIRGWLWGLPRDAYVYDMRAPLTLRGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPRRWPLSRRPLGRPRASRVEDAQNGALELRPCA
Ga0209611_1022861413300027860Host-AssociatedMSASDTTMPLIAAPWLNAEGNPLSVIPLPGLRDEVIQSLECSYPGILSPPMKDLLSRSCGLGGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAGRRWIAEVGNGEFPGPIWCVFPSPEVAVYVSDDLATFLATFRERTCRGEVSHWLDELTAQARAVWARRHAGAIRPHHAHRSDPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRATRRKPRVAHAKAPDLEEASFAIETARSRRRPVIQRE
Ga0209579_1024952413300027869Surface SoilMSAPDTMMPLVSAPWVNAEGNPLSVRPLPGLRSDVIQSLECSYPGILTPAMKNLLSTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAIDDAERRWIAEIGDKDLPGPVWCVFPDPEVVVYVCDDLAAFVAALGERPRAGEMQAWLRDLTTQARTVWSRRHAWASRPHEACHLDRAIRGWLSGLPFDAYVYDLRSPGTPRGWPYGVLGPSGRQYRCGRLPVFAVAGLPADGWRAPHPRERAVTTPMPPMSAEVCSAIETLQ
Ga0209488_1011264733300027903Vadose Zone SoilMSAPDTLMPLVSAPWLDADGHPLSVTSLSGLRSEVIQNLECSYPGILKPAMKTLLGTCCGLAGTELGSIDFTGCWFLEEPWSVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLADFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRCDARGWPYGVVGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPRAKPEPPARGSSLSRRASSRLHGRESAAVEVRPCA
Ga0247749_102150113300027993SoilPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIITPQLKALLGTCCGLADTDLGSLDFTGCWFQEEPSTVFRPALTLAIDDAGRRWIAEVGNHDLPGPVWCVFPDPEVTVYVSNDLAAFVSTLHEYTCRGEMQTWLHRVYDEARSVWSQRHVLAFRPHEVFKSDRAIRAWVSRLPFNAYVYDLRERNVERGWPYGVAGPSGRHFRCGRLPVFAVAGLPVE
Ga0209526_1009964333300028047Forest SoilMSAPDTMMPFVSAPWLKADGNPPSVSALPGLRSEVIQSLECSYPGILSPTMKTLLSTCCGLAGTELGCIDFTGCWFLEEPCAVFRPALTLAVDDADRRWIAEVGTQDLPGPVWCVFPDPEVAVYVCDDLAAFVATLREHTSQGEMHAWLQGLTAEARTVWSRRHAVAMRPHAAYHSDRTIRGWLLGLPFDAYVYDLRAPRTVRGWPYGVAGPAGRLYRCGRLPVFAVAGLPAEGWRAPPPRPRAAPTPEMPASDEVSFAIETPRSGRKRRSDPEDPARWPLSRRT
Ga0137415_1020999233300028536Vadose Zone SoilVRSFSGLRPEVIQNLECSYPGILKPAMKTLLGTCCGLAGTELGSIDFTGCWLLEEPWAVFRPALTLAIDDAERRWIAEVGDRDLPGPVWCVFSHPEVTIYVSDDLSDFMATLRERTCRGEMGAWLQNLTAQARAVWSQRHALAMRPHEAPQSDRSIRGWLLALPSDAYVYDLRRRSDARGWPYGVMGPSGRLYRCGRLPVFAVAGLPAEGWRAPHPRARIAPSLGAQVSAEVAFAIETPLSRRKPQAKPEHPARGSPSSRRASSPLHGRESAAVEVRPCA
Ga0311346_1102432513300029952BogCGLAGTELGSIDFTGCWFREEPYAIFRPALTLAIDDADRRWIAEVGNGDLPGPIWSVFPHPEVAVYVSDDLAGFLATLRERTCQGEVSHWLDELSAQARAVWSQRHAWAIRPHEAHRSDSAIRGWLMTLPASAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGSRASRRRARVANAKAPPMEEVSFAVETAGSRRRPFIKRATSGPWT
Ga0311338_1032838213300030007PalsaMSASDATMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSYPGILSPAMKELLSRSCGLAGTEFGSIDFTGCWFLEEPHAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSRWLDELTAKARAVWSQRHAWAIRPYEAHRSDPAIRGWLMTLPAGAYVYDLRARSAVRGWPYGVAGPSGRHYRCGRLPVFAVAGLPEEGWRASRRRARVAHAKAPFLEEASFPVETARSRCRYHIRREIPAQRTWGGRAPKWGEWRSGFHSMPAAQGESRLCA
Ga0311370_1013304063300030503PalsaMSAPDTTMPLISAPWLNAEGNPLSVIPLPGLREEVIRSLECSYPGILSPAMKELLSRSCGLAGTEFGSIDFTGCWFLEEPHAVFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSRWLDELTAKARAVWSQRHAWAIRPYEAHRSDPAIRGWLMTLPAGAYVYDLRARSAVRGWPYGVAGPSGRHYRCGRLPVFAVAGLPEEGWRASRRRARVAHAKAPFLEEASFPVETARSRCRYHIRREIPAQRTWGGRAPKWGEWRSGFHSMPAAQGESRLCA
Ga0311356_1075599923300030617PalsaNPLPGLREEVIQSLECSYPGILSPPMKELLRRSCGLAGTELGSIDFTGCWFMEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPVWCVFPHPEVAVYVSDDVGTFLATLRERTCQGEVSRWLDELTAQARAVWSQRHAWAIRPHEAHRSDPEIRGWLMTLPAGAYVYDLRARSVARGWPYGVAGPSARHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAKAPAMEWRSGFHPIPAAQAELRSCA
Ga0302317_1016448623300030677PalsaSPMSTSDTTMPLISAPWLNAEGNPLSVNPLPGLREEVIQSLECSYPGILSPPMKELLRRSCGLAGTELGSIDFTGCWFMEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPVWCVFPHPEVAVYVSDDVGTFLATLRERTCQGEVSRWLDELTAQARAVWSQRHAWAIRPHEAHRSDPEIRGWLMTLPAGAYVYDLRARSVARGWPYGVAGPSARHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAKAPAMEWRSGFHPIPAAQAELRSCA
Ga0311345_1045226713300030688BogSPPMKELLGRSCGLGGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVLPQPEVAVYVSDDLAGFLATLRERTCQGEVNQWLEELTAQARAVWSQRHAAAIRPHEAHRADPSIRAWLMTLPASAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGSPAEGSRASRRRARVANAKAPPMEEVSFAVETAGSRRRPFIKRATSGPWTWGPGALEVSGRPGFPNIPSASGELRLCA
Ga0138301_144233513300031022SoilMSAPAMMPLVSAPWLDADGNPLSARPLSGLRSEVIQSLECSYPGILKPAMKKLLSTCCGLAGTELGSIDFTGCWFLEEPSAVFRPALTLAIDDAERRWIAEIGDKDLPGPVWCVFSDPEVAVYVSDDLAAFVATLGERTCRGAMQAWLQDLTTQARTAWSRRHSSASRPHEACHLDRAIRGWLSGLPFDAYVYDLRAPRTPRGWPYGVLGPSGRQFRCGRLPVFAVAGLPAEGWRAPDPTARDAATRGVARAEVSFAIEIRQSGRQRRSHPENPGRWKPWQVA
Ga0302325_1063298123300031234PalsaMSTSDTTMPLISAPWLNAEGNPLSVNPLPGLREEVIQSLECSYPGILSPPMKELLRRSCGLAGTELGSIDFTGCWFMEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPVWCVFPHPEVAVYVSDDVGTFLATLRERTCQGEVSRWLDELTAQARAVWSQRHAWAIRPHEAHRSDPEIRGWLMTLPAGAYVYDLRARSVARGWPYGVAGPSARHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAKAPAMEWRSG
Ga0302324_10034566143300031236PalsaMSASDATMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSYPGILSPPMKKLLSRSCGLGGTELGSIDFTGCWFLEEPYAIFRPALTLAIDDAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLATFLATLRERTCQGEVSHWLDDLTAQARAVWAQRHAGAIRPHQAHRLDPAIRGWLMTLPASAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRRKARVAHAQAPAMEEVSFVVETAGSRRRPPVNRETPAPWTWGPRELKGRGRSGFHSMPAAQRELRSCA
Ga0310887_1005037023300031547SoilMATPDTIWPLVSAPWLDADGNPLTVRPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQVRMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA
Ga0310886_1035293513300031562SoilMATPDTIWPLVSAPWLDADGNPLTVRPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPG
Ga0307476_1067429413300031715Hardwood Forest SoilECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPCGVFRPALTLAIDDAERRWIGEVGDKNLPGPIWCVFPKPEVAVYVSDDLAAFLGVLHKRTCRGEMGTWLQNLTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPSLPQAATSAEVSFAIGAAVGRRQPRARPERPARWVSGWRTAGRLPPSVG
Ga0307474_1029242013300031718Hardwood Forest SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKNLPGPIWCVFPKPEVAVYVSDDLAAFLGVLHERTCRGEMGTWLQNLTAQARAVWSRRHSLALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPSLPQAATSAEVSFAIGAAVGRRQPRARPERPARWVSGWRT
Ga0307475_1043064413300031754Hardwood Forest SoilMSAPDTMMPLISAPWLDADGNPLFVTPLPGFRSEVIQSLEWSYPGILSPPMKALLGSCCGLAGTELGSIDFTGCWFLEEPYAVFRPALTLAIDDAERRWIAEIGNKDLPGPIWCVFPDPEVAVYVNDDLAAFMATIREHTCRGEMHAWLHGLTAQARTVWSRRHAWAMRPHEAARSDRGIRGWLLGLPMDAYVYDLRVQGIARGWPYGVAGPSGRLYRCGRLPVFAVAGLPAEGWRAPRPRSRDASTRASARAEVSFAIETPRSGRKPRPNPENP
Ga0307478_1018445423300031823Hardwood Forest SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYGVFRPALTLAIDDAERRWIGEVGDKNLPGPIWCVFPKPEVAVYVSDDLAAFLGVLHERTCRGEMGTWLQNLTAQARAVWSQRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAEGWRVPHPRARVPSLPQAATSAEVSFAIGAAVGRRQPRARPERPARWVSGRRTAGRLPPSVGHRVKPAVLEERPCA
Ga0310900_1044679913300031908SoilCEMSTPDTMLPFISAPWLDADGNPLSVRPLPGLRADVIQSLECSYPGIVSPQMKALLGTCCGLADTDLGSIDFTGCWFDEEPSTVFRPALTLAIDDAGRRWIAEVANGDLPGPVWCVFPDPEVIVHVSDDLAAFVSTLHEYTCRGEMQTWLHRLNDEARSVWSQRHVLAFRPHEIFKSDRAIRAWLSGLPFNAYVYDLRERNVARGWPYGVAGPSGRHFRCGRLPVFAVAGLPVEGWRAPTPRRVAPAILGMTASEPPFAIDNSAGRRARRSGRRNPGRLSPGRRIWGRSRKARLEIKPAARTFRANAPAKTGAYS
Ga0310901_1017140313300031940SoilPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA
Ga0307479_1077910513300031962Hardwood Forest SoilMSAPDTMMPLVSAPWLDADGHPLSVRPLPGLRSEAIQSLECSYPGTLTPAMKALLGTCCGLAGTAIGSIDFTGCWFFEEPYAVFRPALTLAVDDAERRWIAEIGDRDLPGPVWCVFPDPEVAVYVSDDVAAFMSTIRAHTCRGEMHAWLHGLTAQARTVWSRRHAWAMRPHEAARSDRGIRGWLLGLPMDAYVYDLRVPGIARGWPYGVAGPSGRLYRCGRLPVFAVAGFPAEGWRAPRPRSRDATTRAPARAEVSFAIETPHSGRKPPPNP
Ga0315910_1017471513300032144SoilLVSAPWLDADGNPLTVRPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA
Ga0315910_1047666113300032144SoilMATLDTMRPFVSAPWLDADGNPLSVKPLPGLRSDAIQNLECSYPGILSTRMKALLGTSCGLTGTDLGAIDFTGCWFIEEPYTVFRPALTLAIDNAERRWIAEIGNQDLPGPVWCVFSEPEVTLHVCDDLATFVQFVHEHTCEGTMETWLRSLNAQARTVWSHRHASAMRPHEAFKSDRAIRSWLLGLPFNAYVYDLRAPMAARGWPYGVAGPTGRLYRCKRLPVFAVAGSPAEGWQAPRPRRSAVPIPEVTAEPSLAFDTLRSGARRVGPHIPRRRSGRRERWSPVLQN
Ga0315912_1000872313300032157SoilPLPGLRSDAIQSLECSYPGILSPRMKALLGTSCGLAGTDLGAIDFTGCWFIEEPYTVFRPALTLAIDNAERRWIAEIGNHDLPGPVWCVFSEPEVTLHVCDDLATFVQFVHEHTCEGTMETWLRSLNAQARTVWSHRHASAMRPHEAFKSDRAIRSWLLGLPFNAYVYDLRAPMAARGWPYGVAGPTGRLYRCKRLPVFAVAGSPAEGWQAPRPRRSAVPIPEVTAEPSLAFDTLRSGARRVGPHIPRRRSGRRERWSPVLQNTA
Ga0315912_1009046723300032157SoilMATPDTIWPLVSAPWLDADGNPLTVRPLPGLRSDVIQSLECSYPGILSPQMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKAA
Ga0307470_1000276723300032174Hardwood Forest SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRPEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFQEEPYRVFRPALTLAIDDAERRWIGEVGDKDLPGPIWCVFPKPEVAVYVSDDLTAFLGMLRERTCRGEMGTWLQSLTAQARAVWSRRHALALRPHEAHRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAQGWRIPHPRGRVPNLPQAAASAEVSFAIGAARTAGRLSPSVGHRVEAAVLEERLCA
Ga0335078_1041288533300032805SoilMSRSDVTMPLISAPWLNADGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFLEEPYPVFRPALTLAIDDAERRWIGEVGNKDLPGPIWCVFPKPEVAVYVCDDLAAFLGLLRERTCRGDMAAWLQHLTAQARAVWSRRHALALRPHEAHRTDRAIRGWLRTLPSDAYVYDLRTPTDARGWPYGVVGPSGRYYRCGRELVFAVAGLPAQGWRAPHPRARIASTPRGPANAEVSFAIETARGRPRPRVNPEHPRWTSSRRTGSSPPRPRVQRARITPLELRPCA
Ga0335075_1003291373300032896SoilVFRPALTLAVDDAERRWIAEIGNQDLPGPIWCVFPHPEVAVYVSDDLAGFLRVLRERTCGGEVGAWLRDLAAQARTVWSRRHVLAIRPHQAHRLDRAIRGWLTTLPCDAYVYDLRAHTEARGWPYGVVGPSGRHYRCGRELIFAVAGLPAQGWRAPHPRARVASIPSGPANAEVSFAVETARGRPGPRITPERPTRWTFSGRTSSDPPSPPAQRAQTRALQLRPCA
Ga0335073_1009442443300033134SoilMSRSDVTMPLISAPWLNAGGNPLSVTPLPGLRSEVIQSLECSYPGILSPPMKDLLGSCCGLAGTELGSIDFTGCWFLEEPYPVFRPALTLAIDDAERRWIGELGNKDLPGPIWCVFPKPEVAVYVCDDLAAFLGLLRERTCRGDMAAWLQHLTAQARAVWSRRHALALRPHEAHRTDRAIRGWLRTLPSDAYVYDLRTPTDARGWPYGVVGPSGRYYRCGRELVFAVAGLPAQGWRAPHPRARIASTPRGPANAEVSFAIETARGRPRPRVNPEHPRWTSSRRTGSSPPRPRVQRARITPLELRPCA
Ga0335073_1028307423300033134SoilMSPSDVTMPLISAPWLNADGNPLSVTPLPGLRSDVIQSLECSYPGILSPPMKDLLGNACGLAGTELGSIDFTGCWFQDEPYSVFRPALTLAIDDAERRWIGEVGNEDLPGPIWCVFPKPEVAVYVSDDLSAFLGLLRERTCKGEMGTWLQSLTSQARAVWSRRHAQALRPHEAYRSDRAIRGWLMTLPSDAYVYDLRTPTDARGWPYGVVGPSGRHYRCGRELVFAVAGLPAQGWRVPHPRARVPRLPQASASADVSFAFRTAGGRRQPRARPERPARWVSGRRTSGRLSPSVGHRIGPAVLEERPCG
Ga0370493_0178342_51_7043300034129Untreated Peat SoilMKALLGTCCGLAGTELGCIDFTGCWFSEEPFAVFRPALTLAVDDAERRWIAEVGNGDLPGPVWCVFPRPEVAVYVGDDLAAFLTTLREHTSRGEMHGWLQDLTADAHLVWSQRHARALRPHEAHRSDRAIRGWLVGLPFDAFVYDLRAPSIARGWPYGAAGPSGRLYRCGRLPVFAVAGLPAEGWRAPYPKPRALLTPGMPASTDLSFAIETPQSVRK
Ga0370494_052626_38_9613300034130Untreated Peat SoilMSASDTTMPLISAPWLNAEGNPLSVIPLPGLREEVIQSLECSFPGILTPPMKELLSRSCGLAGTELGSIDFTGCWFAEEPYAIFRPALTLAIDGAERRWIAEVGNGDLPGPIWCVFPHPEVAVYVSDDLAAFLATLRDQTCQGEASHWLDELTAQARVVWSQRHARAIRPHEAHRSDPAIRGWLMTLPAGAYVYDLRARSAARGWPYGVAGPSGRHYRCGRLPVFAVAGLPAEGWRASRGGAQVAHAKAPALEEASFAVETAGTRRRPPPSERRPLHGHGYHGRSTAAGAPAFTVCLAAQRELRSCA
Ga0314780_032013_219_9443300034659SoilMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQRSVRRPRWTPVLQKA
Ga0314781_030454_1_6783300034660SoilMKALLGTCCGLAETDLGTIDFTGCWFVEEPYTVFRPALTLAIDDAERRWIAEVGNHDLPGPVWCVFSEPEVTLNVCDDLATFVKTVHDHTCEGKMRTWLRHLNDQARMVWSQRHALAMRPHEAFKSDRAIRGWLSGLPFNAYVYDLRAQKAARGWPHGVAGPTGRLFRCKRLPVFAVAGSPAEGWQAPRPRRSAAPIPEVIAEPSFAFDTLRGGARRLGPRIPGQR
Ga0314782_025575_2_9433300034661SoilEKACALKQETHRSTDKDQEEKMMSTPDTLMPYVSAPWLDADGNPLSVRPLPGLRSDVIQSIECSYPGILSSRMKALLGTCCGLAGTELGAIDFTGCWFAEEPYTVFRPALTLAIDDAERRWIAEIGNHDLPGPVWCVFSEPEVTLHVCDDLATFVETVHEHTCAGKMKAWLRGLTDQARTVWSQRHALAMRPHEAFKSDRAIRGWLLGLPFNAYVYDLRAPRAARGWPYGIAGPSGRLYRCKRLPVFAVAGSPAEGWRAAHPRRRAAPIPEVIAEPSFAFDSLRSGARRLGPKIPRQWSVSRRMWRPVLQNAA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.