NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105597

Metagenome / Metatranscriptome Family F105597

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105597
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 351 residues
Representative Sequence MKKHRLAGIIVAVVVVGMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETTTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRHITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPKGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPATPQRKPFPNKSTIRN
Number of Associated Samples 72
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 45.00 %
% of genes near scaffold ends (potentially truncated) 20.00 %
% of genes from short scaffolds (< 2000 bps) 39.00 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction Yes
3D model pTM-score0.74

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(41.000 % of family members)
Environment Ontology (ENVO) Unclassified
(49.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(82.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 39.53%    β-sheet: 23.04%    Coil/Unstructured: 37.43%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.74
Powered by PDBe Molstar

Structural matches with PDB biological assemblies

PDB IDStructure NameBiol. AssemblyTM-score
3k8jSTRUCTURE OF CRYSTAL FORM III OF TP045310.6053
3k8iSTRUCTURE OF CRYSTAL FORM IV OF TP045310.58451
3k8gSTRUCTURE OF CRYSTAL FORM I OF TP045310.58255
3k8hSTRUCTURE OF CRYSTAL FORM I OF TP045310.5806
3k8gSTRUCTURE OF CRYSTAL FORM I OF TP045320.55681


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF01551Peptidase_M23 41.00
PF00009GTP_EFTU 8.00
PF02769AIRS_C 5.00
PF00905Transpeptidase 4.00
PF00012HSP70 3.00
PF00586AIRS 2.00
PF01556DnaJ_C 2.00
PF07681DoxX 2.00
PF00912Transgly 1.00
PF03544TonB_C 1.00
PF01242PTPS 1.00
PF13462Thioredoxin_4 1.00
PF08546ApbA_C 1.00
PF02151UVR 1.00
PF09286Pro-kuma_activ 1.00
PF07291MauE 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0443Molecular chaperone DnaK (HSP70)Posttranslational modification, protein turnover, chaperones [O] 3.00
COG0484DnaJ-class molecular chaperone with C-terminal Zn finger domainPosttranslational modification, protein turnover, chaperones [O] 2.00
COG2259Uncharacterized membrane protein YphA, DoxX/SURF4 familyFunction unknown [S] 2.00
COG4270Uncharacterized membrane proteinFunction unknown [S] 2.00
COG07206-pyruvoyl-tetrahydropterin synthaseCoenzyme transport and metabolism [H] 1.00
COG0744Penicillin-binding protein 1B/1F, peptidoglycan transglycosylase/transpeptidaseCell wall/membrane/envelope biogenesis [M] 1.00
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 1.00
COG1893Ketopantoate reductaseCoenzyme transport and metabolism [H] 1.00
COG4934Serine protease, subtilase familyPosttranslational modification, protein turnover, chaperones [O] 1.00
COG4953Membrane carboxypeptidase/penicillin-binding protein PbpCCell wall/membrane/envelope biogenesis [M] 1.00
COG5009Membrane carboxypeptidase/penicillin-binding proteinCell wall/membrane/envelope biogenesis [M] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300003220|JGI26342J46808_1000590All Organisms → cellular organisms → Bacteria2622Open in IMG/M
3300003224|JGI26344J46810_1000659All Organisms → cellular organisms → Bacteria2099Open in IMG/M
3300003352|JGI26345J50200_1001985All Organisms → cellular organisms → Bacteria1867Open in IMG/M
3300004091|Ga0062387_100066491All Organisms → cellular organisms → Bacteria1804Open in IMG/M
3300004091|Ga0062387_100079767All Organisms → cellular organisms → Bacteria1691Open in IMG/M
3300004092|Ga0062389_100242601All Organisms → cellular organisms → Bacteria1819Open in IMG/M
3300004092|Ga0062389_100678002All Organisms → cellular organisms → Bacteria1202Open in IMG/M
3300005406|Ga0070703_10020283All Organisms → cellular organisms → Bacteria1933Open in IMG/M
3300005518|Ga0070699_100010546All Organisms → cellular organisms → Bacteria → Acidobacteria7997Open in IMG/M
3300005526|Ga0073909_10019058All Organisms → cellular organisms → Bacteria2208Open in IMG/M
3300005536|Ga0070697_100071874All Organisms → cellular organisms → Bacteria2838Open in IMG/M
3300005537|Ga0070730_10000564All Organisms → cellular organisms → Bacteria41755Open in IMG/M
3300005537|Ga0070730_10092320All Organisms → cellular organisms → Bacteria2104Open in IMG/M
3300005545|Ga0070695_100199504All Organisms → cellular organisms → Bacteria1429Open in IMG/M
3300005549|Ga0070704_100002012All Organisms → cellular organisms → Bacteria11263Open in IMG/M
3300005591|Ga0070761_10124511All Organisms → cellular organisms → Bacteria1498Open in IMG/M
3300005591|Ga0070761_10137267All Organisms → cellular organisms → Bacteria1427Open in IMG/M
3300005602|Ga0070762_10018612All Organisms → cellular organisms → Bacteria3597Open in IMG/M
3300005712|Ga0070764_10184296All Organisms → cellular organisms → Bacteria1164Open in IMG/M
3300005921|Ga0070766_10005469All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6433Open in IMG/M
3300006176|Ga0070765_100101588All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2487Open in IMG/M
3300006176|Ga0070765_100242178All Organisms → cellular organisms → Bacteria1651Open in IMG/M
3300006176|Ga0070765_100391410All Organisms → cellular organisms → Bacteria1296Open in IMG/M
3300006755|Ga0079222_10121737All Organisms → cellular organisms → Bacteria1422Open in IMG/M
3300006804|Ga0079221_10012286All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium3305Open in IMG/M
3300010343|Ga0074044_10004682All Organisms → cellular organisms → Bacteria11270Open in IMG/M
3300010343|Ga0074044_10090058All Organisms → cellular organisms → Bacteria2068Open in IMG/M
3300010373|Ga0134128_10016188All Organisms → cellular organisms → Bacteria8859Open in IMG/M
3300010401|Ga0134121_10002358All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia16427Open in IMG/M
3300012202|Ga0137363_10034918All Organisms → cellular organisms → Bacteria3503Open in IMG/M
3300012361|Ga0137360_10002962All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis10164Open in IMG/M
3300012931|Ga0153915_10285922All Organisms → cellular organisms → Bacteria1837Open in IMG/M
3300012960|Ga0164301_10271579All Organisms → cellular organisms → Bacteria1125Open in IMG/M
3300017924|Ga0187820_1000006All Organisms → cellular organisms → Bacteria34553Open in IMG/M
3300018007|Ga0187805_10011725All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3709Open in IMG/M
3300020579|Ga0210407_10037809All Organisms → cellular organisms → Bacteria3591Open in IMG/M
3300020579|Ga0210407_10150234All Organisms → cellular organisms → Bacteria1795Open in IMG/M
3300020580|Ga0210403_10053499All Organisms → cellular organisms → Bacteria3229Open in IMG/M
3300020580|Ga0210403_10185575All Organisms → cellular organisms → Bacteria1705Open in IMG/M
3300020580|Ga0210403_10233138All Organisms → cellular organisms → Bacteria1510Open in IMG/M
3300020581|Ga0210399_10004770All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis10525Open in IMG/M
3300020581|Ga0210399_10008419All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis8059Open in IMG/M
3300020582|Ga0210395_10000782All Organisms → cellular organisms → Bacteria24481Open in IMG/M
3300020583|Ga0210401_10144261All Organisms → cellular organisms → Bacteria2226Open in IMG/M
3300021168|Ga0210406_10048381All Organisms → cellular organisms → Bacteria3733Open in IMG/M
3300021171|Ga0210405_10002146All Organisms → cellular organisms → Bacteria → Acidobacteria20873Open in IMG/M
3300021171|Ga0210405_10198243All Organisms → cellular organisms → Bacteria1591Open in IMG/M
3300021180|Ga0210396_10000161All Organisms → cellular organisms → Bacteria117759Open in IMG/M
3300021180|Ga0210396_10159631All Organisms → cellular organisms → Bacteria2023Open in IMG/M
3300021181|Ga0210388_10014642All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6277Open in IMG/M
3300021401|Ga0210393_10039240All Organisms → cellular organisms → Bacteria3713Open in IMG/M
3300021404|Ga0210389_10270534All Organisms → cellular organisms → Bacteria1330Open in IMG/M
3300021405|Ga0210387_10015323All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5934Open in IMG/M
3300021405|Ga0210387_10025199All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4655Open in IMG/M
3300021405|Ga0210387_10040236All Organisms → cellular organisms → Bacteria3725Open in IMG/M
3300021420|Ga0210394_10034877All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae4486Open in IMG/M
3300021420|Ga0210394_10742588All Organisms → cellular organisms → Bacteria859Open in IMG/M
3300021432|Ga0210384_10016328All Organisms → cellular organisms → Bacteria7283Open in IMG/M
3300021432|Ga0210384_10018976All Organisms → cellular organisms → Bacteria6677Open in IMG/M
3300021433|Ga0210391_10000094All Organisms → cellular organisms → Bacteria → Acidobacteria106930Open in IMG/M
3300021433|Ga0210391_10034519All Organisms → cellular organisms → Bacteria4052Open in IMG/M
3300021433|Ga0210391_10084384All Organisms → cellular organisms → Bacteria2507Open in IMG/M
3300021474|Ga0210390_10238183All Organisms → cellular organisms → Bacteria1540Open in IMG/M
3300021479|Ga0210410_10198672All Organisms → cellular organisms → Bacteria1796Open in IMG/M
3300021559|Ga0210409_10007499All Organisms → cellular organisms → Bacteria → Acidobacteria11235Open in IMG/M
3300021559|Ga0210409_10014778All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis7721Open in IMG/M
3300021559|Ga0210409_10285105All Organisms → cellular organisms → Bacteria1491Open in IMG/M
3300021559|Ga0210409_10423934All Organisms → cellular organisms → Bacteria1189Open in IMG/M
3300024178|Ga0247694_1001151All Organisms → cellular organisms → Bacteria4764Open in IMG/M
3300024179|Ga0247695_1006198All Organisms → cellular organisms → Bacteria1710Open in IMG/M
3300024182|Ga0247669_1003296All Organisms → cellular organisms → Bacteria3726Open in IMG/M
3300024246|Ga0247680_1002244All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae3437Open in IMG/M
3300024290|Ga0247667_1004640All Organisms → cellular organisms → Bacteria2984Open in IMG/M
3300024323|Ga0247666_1014472All Organisms → cellular organisms → Bacteria1718Open in IMG/M
3300024323|Ga0247666_1017138All Organisms → cellular organisms → Bacteria1565Open in IMG/M
3300025885|Ga0207653_10029318All Organisms → cellular organisms → Bacteria1773Open in IMG/M
3300025910|Ga0207684_10012381All Organisms → cellular organisms → Bacteria → Acidobacteria7422Open in IMG/M
3300026551|Ga0209648_10055362All Organisms → cellular organisms → Bacteria3381Open in IMG/M
3300027698|Ga0209446_1002817All Organisms → cellular organisms → Bacteria4259Open in IMG/M
3300027701|Ga0209447_10000040All Organisms → cellular organisms → Bacteria30260Open in IMG/M
3300027787|Ga0209074_10070400All Organisms → cellular organisms → Bacteria1119Open in IMG/M
3300027795|Ga0209139_10072304All Organisms → cellular organisms → Bacteria1210Open in IMG/M
3300027812|Ga0209656_10044743All Organisms → cellular organisms → Bacteria2539Open in IMG/M
3300027857|Ga0209166_10000542All Organisms → cellular organisms → Bacteria40282Open in IMG/M
3300027857|Ga0209166_10015546All Organisms → cellular organisms → Bacteria → Acidobacteria4824Open in IMG/M
3300027857|Ga0209166_10033133All Organisms → cellular organisms → Bacteria3129Open in IMG/M
3300027884|Ga0209275_10011253All Organisms → cellular organisms → Bacteria3836Open in IMG/M
3300027889|Ga0209380_10021047All Organisms → cellular organisms → Bacteria3689Open in IMG/M
3300028906|Ga0308309_10268303All Organisms → cellular organisms → Bacteria1434Open in IMG/M
3300028906|Ga0308309_10291013All Organisms → cellular organisms → Bacteria1378Open in IMG/M
3300029636|Ga0222749_10047659All Organisms → cellular organisms → Bacteria1883Open in IMG/M
3300030862|Ga0265753_1000222All Organisms → cellular organisms → Bacteria3102Open in IMG/M
3300030940|Ga0265740_1001195All Organisms → cellular organisms → Bacteria1572Open in IMG/M
3300031708|Ga0310686_109640957All Organisms → cellular organisms → Bacteria1735Open in IMG/M
3300031708|Ga0310686_109646810All Organisms → cellular organisms → Bacteria3044Open in IMG/M
3300031708|Ga0310686_111130274All Organisms → cellular organisms → Bacteria1563Open in IMG/M
3300031715|Ga0307476_10271016All Organisms → cellular organisms → Bacteria1242Open in IMG/M
3300031823|Ga0307478_10001223All Organisms → cellular organisms → Bacteria → Acidobacteria22847Open in IMG/M
3300032955|Ga0335076_10103071All Organisms → cellular organisms → Bacteria2781Open in IMG/M
3300033412|Ga0310810_10321964All Organisms → cellular organisms → Bacteria1646Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil41.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil12.00%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil11.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil7.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil6.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil3.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment2.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil2.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil2.00%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300003220Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1EnvironmentalOpen in IMG/M
3300003224Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3EnvironmentalOpen in IMG/M
3300003352Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300017924Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_5EnvironmentalOpen in IMG/M
3300018007Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300024178Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35EnvironmentalOpen in IMG/M
3300024179Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36EnvironmentalOpen in IMG/M
3300024182Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10EnvironmentalOpen in IMG/M
3300024246Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21EnvironmentalOpen in IMG/M
3300024290Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08EnvironmentalOpen in IMG/M
3300024323Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK07EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027698Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027701Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027795Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027812Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030862Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030940Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300032955Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI26342J46808_100059063300003220Bog Forest SoilMIGVFGAPSRSRESTEGRIFLEFYPGPEGWLGRQTDKSSNGDSEDSVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVAXGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN*
JGI26344J46810_100065953300003224Bog Forest SoilMIGVFGAPSRSRESTEGRIFLEFYPGPEGWLGRQTDKSSNGDSEDSVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGXATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN*
JGI26345J50200_100198533300003352Bog Forest SoilVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN*
Ga0062387_10006649123300004091Bog Forest SoilNGDSEDSVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN*
Ga0062387_10007976723300004091Bog Forest SoilMSKRHLAGAILAAIVAAAIGSYAYRRWSGSDSSARNDLLAQMPANTNAVLYIDLDALRQSPFLAELYKWAPQAKVDADYSQFLQSTGFNYETDLNRVCIAFLNQGQDATIYAVGDGRFDRKKISAYASQTGTRGSKNGQETFSVPLNGGTRRITFTFLHKDRVALTNGPSLDLSASAPRSDSDAQAWRERFRRLAGSPVFAVVRQDSSSGATLSAQAHGGMQSPQLAALLDQLQWITVAGNPEGDRLRVVLEGEGTAATATRHLSDMLNGLLVMAQVGLSEPKMRQQLQPDAREAYLELLKSTDVSQIDRGDMKSVRLIFDLTPKFLDVARTALPVAPAAPESKVPPNKSLANKGTIRN*
Ga0062389_10024260123300004092Bog Forest SoilMSKRHLAGAILAAIVAAAIGSYAYRRWSGSDSSARNDLLAQMPANTNAVLYIDLDALRQSPFLAELYKWAPQAKVDADYSQFLQSTGFNYETDLNRVCIAFLNQGQDATIYAVGDGRFDRKKISAYASQTGTRGSKNGQETFSVPLNGGTRRITFTFLHKDRVALTNGPSLDLSASAPRSDSDAQAWRERFRRLAGSPVFAVVRQDSSSGATLSAQAHGGMQSPQLAALLDQLQWITVAGKPEGDRLRVVLEGEGTAATATRHLSDMLNGLLVMAQVGLSEPKMRQQLQPDAREAYLELLKSTDVSQIDRGDMKSVRLIFDLTPKFLDVARTALPVAPAAPESKVPPNKSLANKGTIRN*
Ga0062389_10067800223300004092Bog Forest SoilFYAHWRWSGSESNPRNELLAQIPANANTVLYIDLDALRQSPFLAELYKWAPQARVDADYSQFLQSTGFNYETDLNRICIAMLSQGQDATIYAVADGQFDQKKISAYAAQTGTRENRNGREIFLVPLNGSTRRITFTFLRKDRVALTNGPNLDLSASAPRSDSDAEAWRERFRRLAGSPVFGVVRQDSSSGANFGAQAHGSIQSPQLAALLDQLQWITVAGKPEGDRLRVVLEGEGTAAAPTRQLSDMLNGLLVMAQVGLNNPKLRQQLQSDAREAYLEMLKSADVSQVDRGDTKSVRLIFDLTPKFLDAARTAIPLAAPAPENKVPPNKSLGNKSTIRN*
Ga0070703_1002028323300005406Corn, Switchgrass And Miscanthus RhizosphereMRNRRLAGTIVAVVVVGMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFELTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN*
Ga0070699_10001054653300005518Corn, Switchgrass And Miscanthus RhizosphereMRNRRLAGTIVAVVVVGMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSGVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFELTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN*
Ga0073909_1001905863300005526Surface SoilMKKHRLAGIIVAVVVVGMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETTTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRHITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPKGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPATPQRKPFPNKSTIRN*
Ga0070697_10007187463300005536Corn, Switchgrass And Miscanthus RhizosphereMRKHRLAGIIVAIVVVGLILVFGYERWSGSGYDPRNDVLAQMPAESSAVLYIDLDALRQSPFLTELYKWAPQAKADADYAQFLQSTGFNYETDLHRVGIAFLKHDETTTLLAVAEGRFDRKKISAYASQTGTRENHGGMEIFSVPLSGSTRRITFTFLRNDRIALTNGTSLEANLSLRPVDSDAQSWRERFRRLAGSPVFAVMRQDAGAGAALSTQAPRGWRSPQLSALIDQLQWITVAGRPDADRLRVVLEGESGAAAPTRQLSDVINGLLVLAQAGLSDRKMREQLRPEEREAYLELLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPAAPGVPQTKPFPNKSTIRN*
Ga0070730_1000056433300005537Surface SoilMNKQRLTGAIVAVVVVGAIVLFGYYRWRGSGVDPRIDILANMPSDASAVLFVDLDGLRRSPFLAELYKWAPQTTADADYTQFLQATGFNYESDLERVGIALLKRGQDTFVFAVAEGRFDRNKISAYALQTGTRENRAGREIFSLPRSSTARRITFTFLRNDRMALMNSDGLESLLSQKHSDIDAQAWRERFRRLAGSPVFAVIRQDAGASAALGAGAAGGFHSPQLSALVDQLQWITVAGKPDADRLRVVVEGEGSADAPTRQLSDVVNGLLVLAQAGLGDPKLRQKLPAEEREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLDAARAALPVAPAVPQTKRFPNKSTVRN*
Ga0070730_1009232023300005537Surface SoilMKNRRLAGTIVAIVVVGIALVFGYERWRGSGYDPRNDLLAQLPADSSAVLYIDLDGLRQSPFLAELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFSKRGEATTLFALAEGRFDRKKISSYASQSGTRENHGGREIFSVPLIGSARRVTFTFLRNDRIALTNGTNLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQAAGAGTALSTQAPRGLQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESSAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLELLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTTMPVAPAVPQGKPFPNKSTIRN*
Ga0070695_10019950413300005545Corn, Switchgrass And Miscanthus RhizosphereMRKHRLAGIIVAIVVVGLILVFGYERWSGSGYDPRNDVLAQMPAESSAVLYIDLDALRQSPFLTELYKWAPQAKADADYAQFLQSTGFNYETDLHRVGIAFLKHDETTTLLAVAEGRFDRKKISAYASQTGTRENHGGMEIFSVPLSGSTRRITFTFLRNDRIALTNGTSLEANLSLRPVDSDAQSWRERFRRLAGSPVFAVMRQDAGAGAALSTQAPRGWRSPQLSALIDQLQWITVAGRPDADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDRKMREQLRPEEREAYLELLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPAAPGVPQTKPFPNKSTIRN*
Ga0070704_10000201253300005549Corn, Switchgrass And Miscanthus RhizosphereMRNRRLAGTIVAVVVVGMTLVFGYERWRGSAYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSGVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN*
Ga0070761_1012451123300005591SoilMNKRRLGEIILAVILVGAMVFYGYQRWSSSGSRSPNDVLRHMPADAEAVLYIDLDALRQSPFLSELYKWAPEPKADPDYTQFLESTGFNYESDLNRAGIALSKHGQETTLFAVADGRFDRRKIAAYAQQTGTRESQGGKEIFSVPLLGGTRRITFTFLPNDQIALTNGSQLLSSLSPPPADSDAEAWRERFRRLSGSPVFAVVRQNARAGTALDERTPHGFRSPQLSALIDQLQWITLAAKPEADRLRVALEGEGSADAPTRQLSDVINGLLLLAEAGLSDQKMRQQLEPEVRETYLEMLKSADVRQIDRGETKSVRLLFDLTPKFLEAAR
Ga0070761_1013726723300005591SoilMNKRTITRTLPAIVLAGAVVFYGYERWGGSKYSPRNDVLALMPEEAIAVLYIDLDELRQSPFLAELYKWAPETKADADYAQFLQSTGFNYESDLNHISIAWLKRGNDTTLFAVANGRFDRKKISAYASQTGARENRSGKEIFSVPLSGSGRRITFTFLSNDRIALTNSTDLESLLSQPRADSDTLAWRERFRRLAGSPVFAVVRQSAAAGAALSAQAPGGLQSPQLSALIDQLQWITLAGKPEADHLRVVLEGEGAADAPTRQISEVLNGLLVLAQAGLSDQKMRQQLQPDVRAAYLEMLKSADVSQIDRGETKSVRLVFDLTPKFFGAARAAMPVSPAIPQNMAAPNKGTIRN*
Ga0070762_1001861243300005602SoilMTKRSVALGILAILVAGALALYGYRRFGVSGPSARDQLLGEMPAGASAVLFLDLDALRQSPFLAELYKWAPEPKADPDYAQFLGSSGFNYETDLSRVSVAVMKHGQESDLFAVADGKFDRKKISAYASETGTRVSRGGREIFSVPISGSARRITFTFLGNNRMALTNGTSLEATLSEPPGGSDREAWRERFRRLAGSQVFAVVRQDRGAGAALSARAPGGLQSPQLSALIDQLQWITVAGKTEGDRLRVVTEGEGSSDAPARQLSDVLNGLLVLAQAGLHAEKLRHELPPEVREAYLELLKSADVSEIDRGETKSVRLILDVTPEFLEAARASIPAAPAAPQNKSLPNKSTIRN*
Ga0070764_1018429613300005712SoilMTKRSVALGILAILVAGALALYGYRRFGGSGPSARDQLLGEMPAGASAVLFLDLDALRQSPFLAELYKWAPEPKADPDYAQFLGSSGFNYETDLSRVSVAVMKHGQESDLFAVADGKFDRKKISAYASETGTRISRGGREIFSVPIAGSARRITFTFLGNNRMALTNGTSLEATLSEPPGGSDREAWRERFRRLAGSQVFAVVRQDRGAGAALSARAPGGLQSPQLSALIDQLQWITVAGKTEGDRLRVVTEGEGSSDAPARQLSDVLNGLLVLAQAGLHAEKLRHELPPEVREAYLELLKSADVSEIDRG
Ga0070766_1000546933300005921SoilVGVRVDRMNKRRLGEIILAVILVGAMAFYGYQRWSSSGSRSPNDVLRHMPADAEAVLYIDLDALRQSPFLSELYKWAPEPKADPDYTQFLESTGFNYESDLNRAGIALSKHGQETTLFAVADGRFDRRKIAAYAQQTGTRESQGGKEIFSVPLLGGTRRITFTFLPNDQIALTNGSQLLSSLSPPPADSDAEAWRERFRRLSGSPVFAVVRQNARAGTALDERTPHGFRSPQLSALIDQLQWITLAAKPEADRLRVALEGEGSADAPTRQLSDVINGLLLLAEAGLSDQKMRQQLEPEVRETYLEMLKSADVRQIDRGETKSVRLLFDLTPKFLEAARAARPVAPVAPQRKAPRNPGTIRN*
Ga0070765_10010158813300006176SoilDLDGLRQSPFLAELYKWAPQAKADADYAQFLQSTGFNYESDLNRLSVALLNHGQDSTVYAVVDGRFDRKKISAYASQTGTRENRNGREIFSVPLNGSTRQITLMFLRNDRIALTNGSSLESPVSPPHTDSDAQAWRERFRRLAGSPVFAVIRQDAAAGAAFSAQTPRGLQSPQLSALLDQLQWITVAGKPDADRLRVVLEGEGTAEAPTRQLSDVLNGLVVLAQAGLSDQKMRQQLQPEVREAYLEMLKSAEVSQIDRGETKSVRLIFDLTPKFLEAARTALPVVPPAPQKKALPNKGTIRN*
Ga0070765_10024217823300006176SoilMNKRRLAGTILVAILVGAIALYGYQRWRSSEDSPRNDLLAQMPADASAVFYIDLDALRQSPFLAELYKWAPQTKADADYAQFLQSTGFNYESDLHRASIALLKHGQETTLFTVADGRFDRKKIAAYASQTGTIENRSGREIFSVPLSGSAKRITFTFLRKDRIALTNGAALDALLSPVHADSDSLAWRERFRRLGGSPLFAVVRQDAAAGSALSAQAPGGLQSPQLSALIDQLHWITVAGKPDADHLRVVLEGEGSADAPTRQLSEVINGLLVLAQAGLSDRKVRQELQPEVRESYLEMLKSADVSQIDRGETKSVRLMFDLTPKFLETARTAMPIAPAVPQNKPQANKNT
Ga0070765_10039141013300006176SoilMNKRAVAGTVLAVIVAGAIVFYAYQRFGGSGYSPRDEMLAQMPADANAVLHIDLDALRQSPFLAELYKWAPQSRADADYSQFMQSTGFNYESDLNRVSIALLKSGKDSVLFAVAEGRFDRKKISAYASQTGTRENRSGKEIFSVPLNGTTQRITFTFLRSDRIALTNGSNIEGRLSAPHEDADSKTWRERFRRLAGAPVFAVVRQDAAAGTALSAQTQRGLQSPQLSALIDQLQWITIAGKPEGDHLRVVVEGEGAADAPIRQLSDVLNGLLVLAQAGLSDQKMRQQLQPDVREA
Ga0079222_1012173723300006755Agricultural SoilQLRGISDFGAGNMNKQRLAGAIVAAVVVAAIVVFGYEHWSGSGFDPRNDILANMPSEASAVLFIDLEGLRQSPFLAELYKWAPQTTADADYAQFLQATGFNYESDLKRVCIALLKHGEETIVFAVAEGRFDRKKISAYALQTGTRENRAGREIFSLPRSGTSRRFTFTFLRNDRIALMNSDGLDSLLSQRPSEIDAQAWRERFRRLAGSSVFAVIRQDAGAGAALGAGAPGGFHSPQLSALVDQLQWITVAGKPDSDRLRVVVEGEGSADAPTRQLSDVVNGLLVLAQGGLEDPKLRQQLPAEGREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLEAARAALPVAPAVPQTKRFPNKSTIRN*
Ga0079221_1001228633300006804Agricultural SoilMNKQRLAGAIVAAVVVAAIVVFGYEHWSGSGFDPRNDILANMPSEASAVLFIDLEGLRQSPFLAELYKWAPQTTADADYAQFLQATGFNYESDLKRVCIALLKHGEETIVFAVAEGRFDRKKISAYALQTGTRENRAGREIFSLPRSGTSRRFTFTFLRNDRIALMNSDGLDSLLSQRPSEIDAQAWRERFRRLAGSSVFAVIRQDAGAGAALGAGAPGGFHSPQLSALVDQLQWITVAGKPDSDRLRVVVEGEGSADAPTRQLSDVVNGLLVLAQGGLEDPKLRQQLPAEEREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLEAARAALPVAPAVPQTKRFPNKSTIRN*
Ga0074044_10004682103300010343Bog Forest SoilMSKQRIVVTAVAVLVVGAIVLYGYQRWAGTRSSPRDELLAQMPADAGAVLFLDLDALRQSPFLAELYKWAPPPKTDTDYAQFLQSTGFNYETDLNLVSIALLKHGQESTLFAVAEGRFDRKKISAYASQTGTRESRGGREIFSVPVTGGTRRITFTFLRNDRIALTNASTLESSLSLPHADSDSLAWRERFRRLAGSPLFAVVRQDAGAGAALSAQAPGGLQSPQLSALIDQLQWITVAGKPEADHLRVVFEGEGTSDATTRQLSDVINGLLVLAQAGLYNQKMRQQLQPDVREAYLELLKSADVSQIDRGDTKSIRLMFDLTPRFLEAARTTMPVAPAAPPNKALSNKGTIRN*
Ga0074044_1009005843300010343Bog Forest SoilMIGVFGAPSRSRESTEGRIFLEFYPGPEGWLGRQTDKSSNGDSEDSVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN*
Ga0134128_1001618853300010373Terrestrial SoilMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFELTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN*
Ga0134121_10002358143300010401Terrestrial SoilMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN*
Ga0137363_1003491833300012202Vadose Zone SoilMNKLRLAGAILAVLLIGAIVYLGYRHWGGASYSPRDEVLRQMPADASAVLYIDLNALRQSPFLSELYKWAPQPQTDADYSQFLLSTGFNYERDLNRVSIALLRSGKDTILFAVAEGRFDRKKISAYALQTGTREIRSGKEIFSVPLSGPARQIAFTFLSNDRIALTNGSDFAAALSVPHEDADSQVWREHFRRLAGSPVFAAVRQDAAAGTALSAEAPRGFESPQLSALIDQLQWITAAGKPEGDRLRVVLEGEGSSEAPIRQLSDVLNGLLMMAQVGLSDQKMRQQLQPDVREAYQEMLKSADVSQIDRGETKSVRLMFDLTPKFLEAARTATPVAPVAPQNKALRNKRTIRN*
Ga0137360_1000296283300012361Vadose Zone SoilMNKLRLAGAILAVLLIGAIVYLGYRHWGGASYSPRDEVLRQMPADASAVLYIDLNALRQSPFLSELYKWAPQPQTDADYSQFLLSTGFNYERDLNRVSIALLRSGKDTILFAVAEGRFDRKKISAYALQTGTREIRSGKEIFSAPLSGPARQIAFTFLSNDRIALTNGSDFAAALSVPHEDADSQVWREHFRRLAGSPVFAAVRQDAAAGTALSAEAPRGFESPQLSALIDQLQWITAAGKPEGDRLRVVLEGEGSSEAPIRQLSDVLNGLLMMAQVGLSDQKMRQQLQPDVREAYQEMLKSADVSQIDRGETKSVRLMFDLTPKFLEAARTATPVAPVAPQNKALRNKRTIRN*
Ga0153915_1028592223300012931Freshwater WetlandsMKKHKLAGTIVVVVVFGMALVFGYERWRGSAYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLAELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFLKHGEATTLFAVAEGRFDRKKISTYASQTGTRENHGGREIFSVPLSGSTRRITFTFLRNDRIALTNGTNLEASSQRPADSDAQAWRVRFRRLAGSPVFAVVRQDAGAGAALSMQAPRGLQSAQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEGPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEVARTAMPVAPTVPQKKPFPNKSTIRN*
Ga0164301_1027157913300012960SoilTLVFGYERWRGSGYDARNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYSQFLQSTGFNYETDLHRVSIAFLKHGETTTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSSGTRRITFTFLRNDRIALTNGTNLEASLSQRSADSDAQAWRERFRRLAGSPIFAVVRQDAGAGAALSTQAPRGLQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVVPAVPQRRSFPNKSTIRN*
Ga0187820_1000006293300017924Freshwater SedimentMSTQRLNKRKLAGAIALALVAGGSLFYGYQRLGGSGYSPRNDVLEPMPADASVVLFIDLDALKQSPFLAELYKWAPQTKADADYAQFLKSTGFNYETDLNCASIAVLNRGTETTLFAIAEGRFDRKKISAYALQTGTRENRSGAEIFSVPLAASTRRITFTFLRNNRIALTNGANLESLLSRTQADPDTQAWRERFRRLAGSPIFAVARQDAAVGAAMDAQAPGGLRSPQLSALIHQLEWITVAGKPNADRMRVVLEGEGAPDAPTRQLSDVINGLLVLAQAGLSEPKMRQQLQPGVREAYLEMLKSADVSQIDRGETKSVRLMFDVSPQFLEAARAGMPTAPAAPQNKVLPNKSAIRN
Ga0187805_1001172513300018007Freshwater SedimentYSPRNDVLAPMPADASVVLFIDLDALKQSPFLAELYKWAPQTKADADYAQFLKSTGFNYETDLNCASIAVLNRGTETTLFAIAEGRFDRKKISAYALQTGTRENRSGAEIFSVPLAASTRRITFTFLRNNRIALTNGANLESLLSRTQADPDTQAWRERFRRLAGSPIFAVARQDAAVGAAMDAQAPGGLRSPQLSALIHQLEWITVAGKPNADRMRVVLEGEGAPDAPTRQLSDVINGLLVLAQAGLSEPKMRQQLQPGVREAYLEMLKSADVSQIDRGETKSVRLMFDVSPQFLEAARAGMPTAPAAPQNKVLPNKSAIRN
Ga0210407_1003780953300020579SoilMNKRRLGGIILAAILVGAMVFYGYQRWGGSAPRSPNDVLAHMPADAGAVLYIDLDALRQSPFLSELYKWAPQPQRDPDYTQFLESTGFNYESDLHRVGIALGRHGQQTTLFAVADGRFDRGKIAAYARRAGTSRSQGGKEIFSVPLASGTRQITFTFFEDDRIALTSGTGLLSSFPPPPADSDAEAWRERFRRLAGSPVFAVVRQNASAGTALSAEAPRGFRSPQLSALIDQLQWITVAGKPEADRLRVALEGEGSADGPTRQLSDVIKGLLVLAEAGLNDQKMRQQLEPEVREAYLEMLKSADARQIDRGETKSVRLVFDLTPRFLEAARAAMPVAPAAPQGKAARNKGTIRN
Ga0210407_1015023423300020579SoilLGGDGANRMNKRRLAGAILGLVGIGAIACLSYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPEADADYSQFMQSTGFNYERDLNRVSIALLKSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNANTRRIAFTFLRSDRIALTNGSNFETSLSASHEDAESRAWRERFRRLAGAPVFAVVRQDAAAGTALSAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSTDVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPISPVAPQNKALPNKGTIRN
Ga0210403_1005349933300020580SoilMNKRRLAGTILAVILVAAIAVYGYQRWRGSADSARNDLLAQMPADASAVFYIDLDALRQSPFLAELYKWAPQTKADADYAQFLQSTGFNYENDLHRASIALLKHGQETILFAVADGRFDRKKIAAYASQSGTTENRSGREIFSVPLSGSTRRITFTFLRKDRIALTNGAALDALLSPVHADSDSLAWRERFRRLGGSSLFAVMHQDAAAGSALSAQAPGGLQSPQLSSLIDQLHWITVAGKPEADHLRVVLEGEGGADTPTRQLSEVINGLLVLAQAGLSDRKVRQELQPEVRESYLELLKSADVSQIDRGETKSVRLMFDLTPKLLETARTSLPVAPPVPQSKPQPNKNTIRN
Ga0210403_1018557513300020580SoilSVVLYIDLDALRQSPFLAELYKWAPQPKADADYAQFQQSTGFNYESDLNRVSIALVRHGQESTLLAVAEGRFDRKKIAAYASQTGTREARGGKEIFSVPMAGGTRRITFTFLRSDRIALTNDASLESTLSQPRADSDTQAWRERFRRLAGSPVFVVVRQDAGAGTALGAQAPGGLQSPQLSALIDQLHWITVAGKPEADHLRVVLEGEGGADAPTRQLSDMINGLLVLAQAGLHDQKLRQQLPPDVREAYLELLKSADVSQIDRGETKSVRLMFDLTPGFLEAARTIMPVVPPAPENKVPPHKSTIRN
Ga0210403_1023313813300020580SoilMMGAIAYLGYLHWGGSGHNPSDEVLAQTPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGAAAGTALSSQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAKDAPTRQLSEVLNGLLLLAQAGLSDQKMRQQLQPDVREAYLEILKSADVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210399_1000477023300020581SoilMNKQRLGGIILAAILVGAMVFYGFQRWGGSGSLPANEVLAHMPVDAGAVLFIDLDALRPSPFLSELYKWAPEPKADPEYTQFLESTGFNYESDLHRVGIALSRHGQQTTLFAVADGRFDRRKIAAYAQQAGTRESQGGKEIFSVPLSSGTRRITFTFLESDRIALTSGSNLLSCWSPAPADSDALAWRERFRRLAGSPVFAVVHQNARAGTALSEQTPRGLRSPQLAALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLSEVIRGLLELAEAGLSDQKMRQQLEPEVREAYLEMLKSGDARQIDRGDTKSVRLIFDLTPKFLEAARTAMPVVPVAPQRKAPRNKATIRN
Ga0210399_1000841923300020581SoilMNKRRLAGAIVALVVMGAIAYLSYLHWGGSGHNPRDEVLAQMPADASAVLYIDLDALRQSPFLVELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLKGGKETILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLLSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQDAAAGTALGAQTPRGFQSPQLSALIDQLQWITAAGKPEGDHLRVVLDGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDIREAYLEMLKSADVSLIDRRDTKSVRLTFDLTPKFLEAARTGMPIAPVAPQKRAIPNKGTIRN
Ga0210395_10000782233300020582SoilMNKRRLGEIILAVILVGAMVFYGYQRWSSSGSRSPNDVLRHMPADAEAVLYIDLDALRQSPFLSELYKWAPEPKADPDYTQFLESTGFNYESDLNRAGIALSKHGQETTLFAVADGRFDRRKIAAYAQQTGTRESQGGKEIFSVPLLGGTRRITFTFLPNDQIALTNGSQLLSSLSPPPADSDAEAWRERFRRLSGSPVFAVVRQNARAGTALDERTPHGFRSPQLSALIDQLQWITLAAKPEADRLRVALEGEGSADAPTRQLSDVINGLLLLAEAGLSDQKMRQQLEPEVRETYLEMLKSADVRQIDRGETKSVRLLFDLTPKFLEAARAARPVAPVAPQRKAPRNPGTIRN
Ga0210401_1014426123300020583SoilMMGAIAYLGYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGAAAGTALSSQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAKDAPTRQLSEVLNGLLLLAQAGLSDQKMRQQLQPDVREAYLEILKSADVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210406_1004838133300021168SoilMNPAFRLAAPPGPIPKAAFYPSNVPGHRLGVSRQKRTQKHKVEMLWRWANRMNKRRLLGIILAVILVGALVFYGYQRWGGSGSRSPNDVLRHLPADASAVLYIDLDALRQSPFLSELYKWAPEPKADPEYAQFLESTGFNYERDLHRVGIALYKHGQQTTLFAVADGRFDRRKIAAYAEQAGTRESQAGKEVFSVPLASGARRITFTFLQGDRIALTNGASLELSSSPPPADSDAEAWRERFRRLAGSPVFAVVRQDARAGTALSGETARGFASPQLSALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLADVIKGLLVLAEAGLSDQKMRQQLRPEVREAYLEMLKSADTRQIDRGETKSVRLIFDLTPKFLEAARAAMPVAPVAPQGKATRNKATIRN
Ga0210405_10002146193300021171SoilMNKLRLAGGILVVVLSGAIIYVGYRHWGGPSYSPRDEVLAQMPADASAVLYIDVDALRPSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLKSGKDTILFAVAEGRFDREKISAYASQTGTHENRTGQEIFSVPLNGTTRRIAFTFLRSDRIALTNGSDFEASLSAPHEDADSQAWRERFRRLAGSPVFAVVHQDAAAGTALSAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGASDAPIRQLSDVLNGLLMMAQVGLSDQKMRQQLQPDVREAYQEMLKSTDVSQIDRGETKSVRLMFDLTPKFLEAARIAMPVAPVAPQNKALRNKGTYSKLN
Ga0210405_1019824323300021171SoilMMGAIAYLGYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSITLLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGAAAGTALSSQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAKDAPTRQLSEVLNGLLLLAQAGLSDQKMRQQLQPDVREAYLEILKSADVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210396_1000016163300021180SoilLRGDGANRMNKRRLAGAILALVMMGAIAYLGYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSITLLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGAAAGTALSSQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAKDAPTRQLSEVLNGLLLLAQAGLSDQKMRQQLQPDVREAYLEILKSADVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210396_1015963123300021180SoilVSSLPNRKTTLQIAAVILATIVAAVIAFNAYHRWSDSAFNARNDLLAQLPADASTVLYMDLDALRQSSILAELYKWAPQGKTDADYTQFLQSTGFNYESDLNRLSVALLNHGQDSTVFAVADGRFDRKKISAYASQAGTRTNRNGKEIFSVPLSGSTRRITFTFLRSDRIALTNGPNLESSSFGPPSDSDAQAWRERFRRLAGSPVFAVIRQDGTGSMFSTQAPRGLQSPQLSALLDQLQWITIAGKPDADRLRVVLEGEGSADAPTRQISDVLNGLLVLAQAGLSEPKMRQQLQPEVREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLEAARTALPLAPPAPQNKTLPNAKSTIR
Ga0210388_1001464233300021181SoilVGVRVDRMNKRRLGEIILAVILVGAMVFYGYQRWSSSGSRSPNDVLRHMPADAEAVLYIDLDALRQSPFLSELYKWAPEPKADPDYTQFLESTGFNYESDLNRAGIALSKHGQETTLFAVADGRFDRRKIAAYAQQTGTRESQGGKEIFSVPLLGGTRRITFTFLPNDQIALTNGSQLLSSLSPPPADSDAEAWRERFRRLSGSPVFAVVRQNARAGTALDERTPHGFRSPQLSALIDQLQWITLAAKPEADRLRVALEGEGSADAPTRQLSDVINGLLLLAEAGLSDQKMRQQLEPEVRETYLEMLKSADVRQIDRGETKSVRLLFDLTPKFLEAARAARPVAPVAPQRKAPRNPGTIR
Ga0210393_1003924023300021401SoilMTKPRLAGIILAAILVGAMVFYGYQRWSAPGSRSANDVLRRMPADAGAVLYIDLDALRPSPFLSELYKWAPEPKADPDYTQFLESTGFHYESDLHRVGIALSGHGQDTTLFAVADGRFDRGKIAAYAEQAGTRESQGGKEIFSVPLSGDTRRITFTFLERDRIALTNRGQLLSSWSPPPADADALAWRERFRRLAGSPLFAVVRQNARAGTALSERTPGGFGSPQLSALIDQLQWITVAGKPEGDRLRVALEGEGSADAPTRQLSEVIQGLLGLAEAGLSDQKMRQQLAPEVREAYLEMLKSADAREIDRGETKSVRLIFDLTPKFFEAARTAMPVRPAAPRRKAPGNKDTIRN
Ga0210389_1027053413300021404SoilMNKRRLAGAIVALVVMGAIAYLSYLHWGGSGHNPRDEVLAQMPADASAVLYIDLDALRQSPFLVELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLKGGKETILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLLSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQDAAAGTALGAQTPRGFQSPQLSALIDQLQWITAAGKPEGDHLRVVLDGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDIREAYLEMLKSADVSLIDRRDTKSVRLTFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210387_1001532323300021405SoilMTKRSVALGILAILVAGALALYGYRRFGVSGPSARDQLLGEMPAGASAVLFLDLDALRQSPFLAELYKWAPEPKADPDYAQFLGSSGFNYETDLSRVSVAVMKHGQESDLFAVADGKFDRKKISAYASETGTRISRGGREIFSVPISGSARRITFTFLGNNRMALTNGTSLEATLSEPPGGSDREAWRERFRRLAGSQVFAVMRQDRGAGAALSARAPGGLQSPQLSALIDQLQWITVAGKTEGDRLRVVTEGEGSSDAPARQLSDVLNGLLVLAQAGLHAEKLRHELPPEVREAYLELLKSADVSEIDRGETKSVRLILDVTPEFLEAARASMPAAPAAPQNKSLPNKSTIRN
Ga0210387_1002519943300021405SoilLRGDGANRMNKRRLAGAILALVMMGAIAYLGYLHWGGSGHNPSDEVLAQTPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSARRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGAAAGTALSSQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAKDAPTRQLSEVLNGLLLLAQAGLSDQKMRQQLQPDVREAYLEILKSADVSLIDRRDTKSVRLIFDLTPKFLEAARAAMPIAPVAPQNKALPNKGTIRN
Ga0210387_1004023623300021405SoilMNKQRLGGIILAAILVGAMVFYGYQRWGGSGSLPANEVLAHMPVDAGAVLFIDLDALRPSPFLSELYKWAPEPKADPEYTQFLESTGFNYESDLHRVGIALSRHGQQTTLFAVADGRFDRRKIAAYAQQAGTRESQGGKEIFSVPLSSGTRRITFTFLESDRIALTSGSNLLSCWSPAPADSDALAWRERFRRLAGSPVFAVVHQNARAGTALSEQTPRGLRSPQLAALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLSEVIRGLLELAEAGLSDQKMRQQLEPEVREAYLEMLKSADARQIDRGDTKSVRLIFDLTPKFLEAARTAMPVAPVAPQRKAPRNKATIRN
Ga0210394_1003487733300021420SoilLGASWMNKRTIAATIVAVLGVGAIVLYGYQRWSGSGSSSRNELLAQMPADTSVVLYIDLDALRQSPFLAELYKWAPQPKADADYAQFLQSTGFNYERDLNRVSIALVKHGQESTLLAVAEGRFDRKKIAAYASQTGTREARGGKEIFSVPMAGGTRRITFTFLRSDRIALTNDASLESTLSQPRADSDTQAWRERFRRLAGSPVFVVVRQDAGAAAALSAQAPGGLQSPQLSALLDQLQWITVAGKPEADHLRVVLEGEGGADAPTKQLSDVINGLLVLAQAGLHDQKLRQQLPPDVREAYLELLKSADVSQIDRGETKSVRLMFDLTPGFLEAARTIMPVVPPAPENKVPPHKSTIRN
Ga0210394_1074258813300021420SoilGRGSSPQNEVLAQMPADASAVLYLDLDALRQSPFLAELYKWAPQPEADPDYAQFLKSTGFNYESDLNRVGLALLGHGQQTTLFAVVDGRFDRPKIAAYASQAGARESEGGREIFSVPLQGSTQRITFAFLANDRLALTNGTTLRASFSQPPADSDRQAWRERFRRLAGSPVFAVVRQNGRAGTAFSAETPRALRSPQLSALIDQLQWITVAGKPEADRLRVVVEGEGSADAPTRQLSDLIKGLLVLAEAGLSDHKMRQQLEPEVREAYLEMLKSAEATEIDRGET
Ga0210384_1001632883300021432SoilMNKQRLLGTILAVVLVGAMVLYGFQRWGGSGSRSPNDVLRHLPADASAVLYIDLDALRQSPFLSELYKWAPEPKADPEYAQFLESTGFNYERDLHRVGITLSKHGQQTMLFAVAEGRFDRRKIAAYAEQAGTRESQAGKEVFSVPLASGARRITFTFLQGDRIALTNGASLELSFSPPPADSDTEAWRERFRRLAGSPVFAVVRQNARVGTALSGETARGFASPQLSALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLADVIKGLLVLAEAGLSDQKMRQQLRPEVREAYLEMLKSADARQIDRGETKSVRLIFDLTPKFLEAARAAMPVAPVAPQGKATRNKDTIRN
Ga0210384_1001897653300021432SoilMRKRTLVGTVAAVIAAGAVVLYAYQRWGDSGSNARNDVLGQMPPDAGAVLFLDVDALRQSPFLAELYKWAPQPKADPDYSQFLQSTGFNYERDLHRVSVALLKRGRNSQLFVVADGRFDQRKIAAYLSPSGVHENQGGREILSVPLTGRAGRITFTFLRNDRIALTNGDGLEALLSAKPGGANSDSEAWRERFRRLAGSPVFAVVRQDAAAGAALSAQAPGGLRSPQLSVLIDQLPWITIAGKPEADRMRIVLEGEGAADTNTRQLSDVLNGLLVLAEAGLSDQKMRRQLPPEAREAYLELLKSADVSQIDRGEAKSVRLIFDLTPRFLEAARTARPVAPAAPENKTLPNKSTIRN
Ga0210391_1000009433300021433SoilMTKRSVALGILAILVVGALALYGYRRFGVSGPSARDQLLGEMPAGASAVLFLDLDALRQSPFLAELYKWAPEPKADPDYAQFLKSSGFNYETDLSRVSVAVMKHGQESDLFAVADGKFDRKKISAYASETGTRISRGGREIFSVPISGSARRITFTFLGNNRMALTNGTSLEATLSEPPGGSDREAWRERFRRLAGSQVFAVVRQDRGAGAALSARAPGGLQSPQLSALIDQLQWITVAGKTEGDRLRVVTEGEGSSDAPARQLSDVLNGLLVLAQAGLHAEKLRHELPPEVREAYLELLKSADVSEIDRGETKSVRLILDVTPEFLEAARASIPAAPAAPQNKSLPNKSTIRN
Ga0210391_1003451913300021433SoilVAAAIALYSYHRWGGSGSGQRNDLLSQMPADSSAVLFIDLDALRQSPFLAELYKWAPQPKTDADYSQFLQSTGFNYERDLERVSIALLKHGQESTLLIVAEGRFDRKKIAAYASQTGTRESRGGKEIFSVPVAGGTRRIAFTFLRSDRIALTNDASLESSLLQPHADSDTEAWGERFRRLEGSPVFAVIRQDAGTGAALSAQAPGGLQSPQLSALIDQLQWITVAGKPEADHLRVVLEGEGAADAPTKQLSDVISGLLVLAQAGLHDQKLRQQLQPDVREAYLELLKSADVSRIDRGETKSVRLMFDLTPQFLEAARTPTLPASPEATPKKPSPNKSTIRN
Ga0210391_1008438413300021433SoilMNAAVLHEARQARAVGLILSLKRALCKARSVGQKRVKQPLSRLRALGARWMNKRTIAATIVAALGVGAIVPYGYQRWSGSGSSSRNELLAQMPADTSVVLYIDLDALRQSPFLAELYKWAPQPKADADYAQFLQSTGFNYERDLNRVSIALVKHGQESTLLAVAEGRFDRKKIAAYASQTGTREARDGKEIFSVPMAGGTRRITFTFLRSDRIALTNDASLESTLSQPRADSDTQAWRERFRRLAGSPVFVVVRQDAGAAAALSAQAPGGLQSPQLSALLDQLQWITVAGKPEADHLRVVLEGEGGADAPTKQLSDVINGLLVLAQAGLHDQKLRQQLPPDVREAYLELLKSADVSQIDRGETKSVRLMFDLTPGFLEAARTIMPVVPPAPENKVPPHKSTIRN
Ga0210390_1023818323300021474SoilMMGAIAYLGYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPKADADYSQFLQSTGFNYERDLNRVSITLLNSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNGSTRRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGATAGTALSAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSADVSLIDRRETKSVRLMFDLTPKFLEAARAPMPIAPVAPQNKALPNKGTIRN
Ga0210410_1019867213300021479SoilMNKQRLGGIILAAILVGAMVFYGFQRWGGSGSLPANEVLAHMPVDAGAVLFIDLDALRPSPFLSELYKWAPEPKADPEYTQFLESTGFNYESDLHRVGIALSRHGQQTTLFAVADGRFDRRKIAAYAQQAGTRESQGGKEIFSVPLSSGTRRITFTFLESDRIALTSGSNLLSCWSPAPADSDALAWRERFRRLAGSPVFAVVHQNARAGTALSEQTPRGLRSPQLAALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLSEVIRGLLELAEAGLSDQKMRQQLEPEVREAYLEMLKSADARQIDRGETKSVRLIFDLTPKFLEAARTAMPVVPVAPQRKAPRNKATIRN
Ga0210409_1000749963300021559SoilMNKRTIAATIVAVLGVGAIVLYGYQRWSGSGSSSRNELLAQMPADTSVVLYIDLDALRQSPFLAELYKWAPQPKADADYAQFLQSTGFNYERDLNRVSIALVKHGQESTLLAVAEGRFDRKKIAAYASQTGTREARGGKEIFSVPMAGGTRRITFTFLRSDRIALTNDASLESTLSQPRADSDTQAWRERFRRLAGSPVFVVVRQDAGAAAALSAQAPGGLQSPQLSALLDQLQWITVAGKPEADHLRVVLEGEGGADAPTKQLSDVINGLLVLAQAGLHDQKLRQQLPPDVREAYLELLKSADVSQIDRGETKSVRLMFDLTPGFLEAARTIMPVVPPAPENKVPPHKSTIRN
Ga0210409_1001477813300021559SoilMRKRTLVGTVAAVIAAGAVVLYAYQRWGDSGSSARNDVLGQMPPDAGAVLFLDVDALRQSPFLAELYKWAPQPKADPDYSQFLQSTGFNYERDLHRVSVALLKRGRNSQLFVVADGRFDQRKIAAYLSPSGVHENQGGREILSVPLTGRAGRITFTFLRNDRIALTNGDGLEALLSAKPGGANSDSEAWRERFRRLAGSPVFAVVRQDAAAGAALSAQAPGGLRSPQLSVLIDQLPWITIAGKPEADRMRIVLEGEGAADTNTRQLSDVLNGLLVLAEAGLSDQKMRRQLPPEAREAYLELLKSADVSQIDRGEAKSVRLIFDLTPRFLEAARTAMPVAPAAPENKTLPNKSTIRN
Ga0210409_1028510523300021559SoilMNPAFRLAAPPGPIPKGAFYPANVLGHRLGLSRQKRTQQLWRWANRMNKQRLLGTILAVVLVGAMVLYGFQRWGGSGSRSPNDVLRHLPADASAVLYIDLDALRQSPFLSELYKWAPEPKADPEYAQFLESTGFNYERDLHRVGIALSKHGQQTMLFAVAEGRFDRRKIAAYAEQAGTRESQGGKEVFSVPLASGARRITFTFLQGDRIALTDGASLELSFSPPPADSDTEAWRERFRRLAGSPVFAVVRQNARAGTALSGETARGFASPQLSALIDQLQWITVAGKPEADRLRVALEGEASADAPIRQLADVIKGLLVLAEAGLSDQKMRQQLRPEVREAYLEMLKSADARQIDRGETKSVRLIFDLTPKFLEAARAAMPVAPVAPQGKATRNKDTIRN
Ga0210409_1042393423300021559SoilMNKRALAGTVLAVMVAGAIVFYAYQRFGGSGYSPRDEVLAQMPADANAVLHIDLDALRQSPFLSELYKWAPQPRADADYSQFLQSTGFNYEGDLNRVSIALLKSGKDTVLLALAEGRFDRKKISAYASQTGTRENRGGKEIFSVPVNGSTQRITFTFLRSDRIALTNGSNLEAWLSAPHEDADSKTWRERFRRLAGTPVFAVVRQDAAAGTALGAQTPRGLQSPQLSALIDQLQWITIAGKPEGDHLRVVVEGEGTAEGPIRQLSDVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSADVSQIDRGETKSVRLMFDLTPKFLEAARTAMPVVPVAPQKKALPNKRTIRN
Ga0247694_100115133300024178SoilMNKRRLAGTIVAVVVVSMALVFGYQRWTSSSYDPRNDLLAQMPAESSAVLYIDLDGLRQSPFLAEIYKWAPQAKADADYVQFLQSTGFNYETDLHRVSIAFLKRAEATTLFVVAEGQFDRKKISAYASQMGTRENHKGREIFSVQPGGGTRRITFTFLRNDRIALTNGANLEASLSRPAADSDAQAWRERFHRLAGSPVFAVVRQDAGAGEALSRQAPRGLQSSQLSALIDQLQWITIAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLTLAQAGLSDQKMREQLEPDVREAYLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAIPVAPAAPQAKPFPNKSTIRN
Ga0247695_100619823300024179SoilMRNRRLAGTIVAVAVVSMTLVFGYERWRDSGYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFLKHDETTTLFAVAEGRFDRKKISAYASQTGTRENHGGREIFSVPSSGSPHRITFTFLRNDRIALTNGTNLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQDAGAGTALSTQAPRSWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLMLAQAGLSDQKMREQLQPDVREAYLEMLKTVDVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVTPAVPQRKPFPNKSTIRN
Ga0247669_100329673300024182SoilMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFELTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN
Ga0247680_100224433300024246SoilMTLVFGYERWRDSGYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFLKHDETTTLFAVAEGRFDRKKISAYASQTGTRENHGGREIFSVPSSGSPHRITFTFLRNDRIALTNGTNLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQDAGAGTALSTQAPRSWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLMLAQAGLSDQKMREQLQPDVREAYLEMLKTVDVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVTPAVPQRKPFPNKSTIRN
Ga0247667_100464063300024290SoilMTLVFGYERWRDSGYDPRNDVLAQMPAESSAVLYIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFLKHDETTTLFAVAEGRFDRKKISAYASQTGTRENHGGREIFSVPSSGSPHRITFTFLRNDRIALTNGTNLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQDAGAGTALSTQAPRSWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLMLAQAGLSDQKMREQLQPDVREAYLEMLKTADVSQINRGETKSVRLIFDLTPKFLEAARTAMPVTPAVPQRKPFPNKSTIRN
Ga0247666_101447243300024323SoilMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSVLIDQLQWITIAAKPEADRLRVVLEGESGAEAPTRQLSGVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN
Ga0247666_101713813300024323SoilMKNRRLAGTIVAIVVVGIALVFGYERWRGSGYDPRNDLLAQLPADSSAVLYIDLDGLRQSPFLAELYKWAPQTKADADYAQFLQATGFNYETDLHRVSIAFLKHGEATTLFALAEGRFDRKKISSYTSQTGTRENHGGREIFSVPLIGSTRRVTFTFLRNDRIALTNGANLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQAAGVGTALSTQAPRGLQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESSAEGPTRQLSDVINGLLVVAQAGLSDQKMREQLQPDVREAYLEILKSADVSQIDRGETKSVRLIFDLTPRFLEAARTAMPVAPTVPQGKPFPNKSTIRN
Ga0207653_1002931823300025885Corn, Switchgrass And Miscanthus RhizosphereMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN
Ga0207684_1001238163300025910Corn, Switchgrass And Miscanthus RhizosphereMRNRRLAGTIVAVVVVGMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESGAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVRLIFELTPKFLEAARTAMPVAPAAPQRKPFPNKSTIRN
Ga0209648_1005536233300026551Grasslands SoilMNKLRLAGAILAVLLIGAIVYLGYRHWGGASYSPRDEVLRQMPADASAVLYIDLNALRQSPFLSELYKWAPQPQTDADYSQFLLSTGFNYERDLNRVSIALLRSGKDTILFAVAEGRFDRKKISAYALQTGTREIRSGKEIFSVPLSGPARQIAFTFLSNDRIALTNGSDFAAALSVPHEDADSQVWREHFRRLAGSPVFAAVRQDAAAGTALSAEAPRGFESPQLSALIDQLQWITAAGKPEGDRLRVVLEGEGSSEAPIRQLSDVLNGLLMMAQVGLSDQKMRQQLQPDVREAYQEMLKSADVSQIDRGETKSVRLMFDLTPKFLEAARTATPVAPVAPQNKALRNKRTIRN
Ga0209446_100281723300027698Bog Forest SoilVPSSMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN
Ga0209447_10000040283300027701Bog Forest SoilMNKSRIAAIVAAAVVLCAIVLFGYERWGGSKLSAREELLAQLPADASAVFYIDLDALRQSPFLEELYKWAPQSKADPEYAQFLQATGFNYENDLNRLSVAVLKRGQDTTLFAVADGRFDRKKISVYATQTGTHENRGGKEIFSVPISGAARRITFTFLRNDRIALTNEASLGTSSPQTSNDSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTRAPGGLQSPQLSALIDQLQWVTVAGKPEGDHLRVVLEGEATAEGTTRQLSDLLNGLLVLAQAGLSDSKLRQQLQPQVRNAYQEMLKSADVSQIDRGETKSVRLIFDVTPSFLEAARSGAPVTPAIPQGKVLPDKKATIRN
Ga0209074_1007040023300027787Agricultural SoilFLAELYKWAPQTTADADYAQFLQATGFNYESDLKRVCIALLKHGEETIVFAVAEGRFDRKKISAYALQTGTRENRAGREIFSLPRSGTSRRFTFTFLRNDRIALMNSDGLDSLLSQRPSEIDAQAWRERFRRLAGSSVFAVIRQDAGAGAALGAGAPGGFHSPQLSALVDQLQWITVAGKPDSDRLRVVVEGEGSADAPTRQLSDVVNGLLVLAQGGLEDPKLRQQLPAEEREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLEAARAALPVAPAVPQTKRFPNKSTIRN
Ga0209139_1007230413300027795Bog Forest SoilMPANTNAVLYIDLDALRQSPFLAELYKWAPQAKVDADYSQFLQSTGFNYETDLNRVCIAFLNQGQDATIYAVGDGRFDRKKISAYASQTGTRGSKNGQETFSVPLNGGTRRITFTFLHKDRVALTNGPSLDLSASAPRSDSDAQAWRERFRRLAGSPVFAVVRQDSSSGATLSAQAHGGMQSPQLAALLDQLQWITVAGNPEGDRLRVVLEGEGTAATATRHLSDMLNGLLVMAQVGLSEPKMRQQLQPDAREAYLELLKSTDVSQIDRGDMKSVRLIFDLTPKFLDVARTALPVAPAAPESKVPPNKSLANKGTIRN
Ga0209656_1004474313300027812Bog Forest SoilNVAPPRRRVFGQKRTKQLLSRFRIAGANPMNKQRIAATIVVVLVAGAIAFYAYERWSGPGNNPRNELLAQMPADASAVFYVDLDALRQSPFLAELYKWAPQPKADADYAQFLQSTGFNYESDLNRVSIALLKHGRESTLLAVAEGRFDRKKISAYSSQTGTRETRGGREIFSMPVSGSARPITFTFLRNDRIALTNDASLESSLSQPHADSDEQGWRERFRRLAGSPVFVVVRQDAGAGAALNAQAPGGLQSPQLSALIDQLQWITVAGKPEADHLRVVIEGEGAADAPTKQLSEVIRGLLMLAQAGLNDPKLRQQLQPDVREAYVELLKSADISQIDRGDTKSVRLMFDLTPKFLEAARLPTVPAPPEAPPKKIPSNKGTIRN
Ga0209166_1000054263300027857Surface SoilMNKQRLTGAIVAVVVVGAIVLFGYYRWRGSGVDPRIDILANMPSDASAVLFVDLDGLRRSPFLAELYKWAPQTTADADYTQFLQATGFNYESDLERVGIALLKRGQDTFVFAVAEGRFDRNKISAYALQTGTRENRAGREIFSLPRSSTARRITFTFLRNDRMALMNSDGLESLLSQKHSDIDAQAWRERFRRLAGSPVFAVIRQDAGASAALGAGAAGGFHSPQLSALVDQLQWITVAGKPDADRLRVVVEGEGSADAPTRQLSDVVNGLLVLAQAGLGDPKLRQKLPAEEREAYLEMLKSADVSQIDRGETKSVRLIFDVTPKFLDAARAALPVAPAVPQTKRFPNKSTVRN
Ga0209166_1001554623300027857Surface SoilMKNRRLAGTIVAIVVVGIALVFGYERWRGSGYDPRNDLLAQLPADSSAVLYIDLDGLRQSPFLAELYKWAPQAKADADYAQFLQSTGFNYETDLHRVSIAFSKRGEATTLFALAEGRFDRKKISSYASQSGTRENHGGREIFSVPLIGSARRVTFTFLRNDRIALTNGTNLEASLSQRPADSDAQAWRERFRRLAGSPVFAVVRQAAGAGTALSTQAPRGLQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESSAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLELLKSADVSQIDRGETKSVRLIFDLTPKFLEAARTTMPVAPAVPQGKPFPNKSTIRN
Ga0209166_1003313313300027857Surface SoilGVSEIGVQQMKNRRLAGTIVAIVVVGIALVFGYERWRGSGYDPRNDLLAQLPADSSAVLYIDLDGLRQSPFLAELYKWAPQAKADADYAQFLQATGFNYETDLHRVSIAFLKHGEATTLFALAEGRFDRKKISSYASQTGTRENHGGREIFSVPPIGSTRRVTFTFLRNDRIALTNGANLEASLSQRPADSDAQAWRERFRRLAGSPIFAVVRQAAGVGTALSTQAPRGLQSPQLSALIDQLQWITVAGKPEADRLRVVLEGESSAEAPTRQLSDVINGLLVLAQAGLSDQKMREQLQPDVREAYLEILKSADVSQIDRGETKSVRLIFDLTPKFLEAARTAMPVAPTVPQGKPFPNKSTIRN
Ga0209275_1001125363300027884SoilMTKRSVALGILAILVAGALALYGYRRFGVSGPSARDQLLGEMPAGASAVLFLDLDALRQSPFLAELYKWAPEPKADPDYAQFLGSSGFNYETDLSRVSVAVMKHGQESDLFAVADGKFDRKKISAYASETGTRVSRGGREIFSVPISGSARRITFTFLGNNRMALTNGTSLEATLSEPPGGSDREAWRERFRRLAGSQVFAVVRQDRGAGAALSARAPGGLQSPQLSALIDQLQWITVAGKTEGDRLRVVTEGESSSDAPARQLSDVLNGLLVLAQAGLHAEKLRHELPPEVREAYLELLKSADVSEIDRGETKSVRLILDVTPEFLEAARASMPAAPAAPQNKSLPNKSTIRN
Ga0209380_1002104763300027889SoilVGVRVDRMNKRRLGEIILAVILVGAMAFYGYQRWSSSGSRSPNDVLRHMPADAEAVLYIDLDALRQSPFLSELYKWAPEPKADPDYTQFLESTGFNYESDLNRAGIALSKHGQETTLFAVADGRFDRRKIAAYAQQTGTRESQGGKEIFSVPLLGGTRRITFTFLPNDQIALTNGSQLLSSLSPPPADSDAEAWRERFRRLSGSPVFAVVRQNARAGTALDERTPHGFRSPQLSALIDQLQWITLAAKPEADRLRVALEGEGSADAPTRQLSDVINGLLLLAEAGLSDQKMRQQLEPEVRETYLEMLKSADVRQIDRGETKSVRLLFDLTPKFLEAARAARPVAPVAPQRKAPRNPGTIR
Ga0308309_1026830313300028906SoilMNKRRLAGTILVAILVGAIALYGYQRWRSSEDSPRNDLLAQMPADASAVFYIDLDALRQSPFLAELYKWAPQTKADADYAQFLQSTGFNYESDLHRASIALLKHGQETTLFTVADGRFDRKKIAAYASQTGTIENRSGREIFSVPLSGSAKRITFTFLRKDRIALTNGAALDALLSPVHADSDSLAWRERFRRLGGSPLFAVVRQDAAAGSALSAQAPGGLQSPQLSALIDQLHWITVAGKPDADHLRVVLEGEGSADAPTRQLSEVINGLLVLAQAGLSDRKVRQELQPEVRESYLEMLKSADVSQIDRGETKSVRLMFDLTPKFLETARTAMPIAPAVPQ
Ga0308309_1029101313300028906SoilVELTDPMNKRAVAGTVLAVIVAGAIVFYAYQRFGGSGYSPRDEMLAQMPADANAVLHIDLDALRQSPFLAELYKWAPQSRADADYSQFMQSTGFNYESDLNRVSIALLKSGKDSVLFAVAEGRFDRKKISAYASQTGTRENRSGKEIFSVPLNGTTQRITFTFLRSDRIALTNGSNIEGRLSAPHEDADSKTWRERFRRLAGAPVFAVVRQDAAAGTALSAQTQRGLQSPQLSALIDQLQWITIAGKPEGDHLRVVVEGEGAADAPIRQLSDVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSADVSQIDRGETKSVRLMFDLTPKFLEAARTAMPVVPVTPQNKGLPNKGTIRN
Ga0222749_1004765913300029636SoilMNPAFRLAAPPGPIPKGAFYPANVPGHRLGLSRQKRTEQLWRWANRMNKQRLLGTILAVVLVGAMVLYGFQRWGGSGSRSPNDVLRHLPADASAVLYIDLDALRQSPFLSELYKWAPEPKADPEYAQFLESTGFNYERDLHRVGITLSKHGQQTMLFAVAEGRFDRRKIAAYAEQAGTRESQAGKEVFSVPLASGARRITFTFLQGDRIALTNGASLELSFSPPPADSDTEAWRERFRRLAGSPVFAVVRQNARVGTALSGETARGFASPQLSALIDQLQWITVAGKPEADRLRVALEGEGSADAPIRQLADVIKGLLVLAEAGLSDQKMRQQLRPEVREAYLEMLKSADARQIDRGETKSVRLIFDLTPKFLEAARAAMPVAPVAPQGKATRNKDTIRN
Ga0265753_100022223300030862SoilVNRRRIAATIAGAIVAGAIALYGYHRWGGSGSGQRNDLLSQMPADASAVVFIDLDALRQSPFLAELYKWAPQPKTDADYSQFLQSTGFNYERDLDRATIALLKHGQESTLLIVAEGRFDRKKIAAYASQTGTRESRGGKDIFSVPVAGGTRRITFTFLRSDHIALTNDASLESSLLQPHADSDTEAWRERFRRLAGSPVFAVVRQDAGTGAALSAQAPGGLQSPQLSALIDQLQWITVAAKPEADHLRVVLEGEGTADAPTKQLSDVISGLLVLAQAGLHDQKLRQQLQPDVREAYLELLKSADVSRIDRGETKSVRLMFDLTPQFLEAARTPTLPASPEATPKKPSPNKSTIRN
Ga0265740_100119513300030940SoilIVAGAVALYGYHRWGGFGSNPRNDLLSQMPADASAVLFIDLDALRQSPFLAELYKWAPQPKTDADYSQFLQSTGFNYERDLDRATIALLKHGQESTLLIVAEGRFDRKKIAAYASQTGTRESRGGKDIFSVPVAGGTPRITFTFLRSDRIALTNDASLESSLSQPHADSDTEAWRERFRRLAGSPVFAVVRQDAGTGAALSAHAPGGLQSPQLSALIDQLQWITVAAKPEADHLRVVLEGEGTADAPTKQLSDVISGLLVLAQAGLHDQKLRQQLQPDVREAYLELLKSADVSRIDRGETKSVRLMFDLTPQFLAAARTPTLPASPEATPKKPSPNKGTIRN
Ga0310686_10964095723300031708SoilMNKRRLAGAILAPVVMGALAYLGYLHWGGSGYNPRDEVLAQMPADASAILYIDLDALRQSPFLAELYNWAPQPKADADYSQFLQSTGFNYERDLNRVSIALLKSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPKNGNTRRIAFTFLRSDRIALTNGSNFEASLSAPHEDAESRAWRERFRRLAGAPVFAVVRQGATAGTALGAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGAADAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSADVSKIDRGETKSVRLMFDLTPKFLEAVRTAMPIPPVAPRNKALPNKGTIRN
Ga0310686_10964681033300031708SoilMNWRRISAIIAGAIVAGAIALYGYHRWGGSGSGQRNDLLSQMPADASAVVFIDLDALRQSPFLAELYKWAPQPKTDADYSQFLQSTGFNYERDLDRATIALLKHGQESTLLIVAEGRFDRKKISAYASQTGTRESRGGKDIFSVPVAGGTRRITFTFLRSDRIALTNDASLESSLSQPHADSDTEAWRERFRRLAGSPVFAVVRQDAGTGAALSAQAPGGLQSPQLSALIDQLQWITVAAKPEADHLRVVLEGEGTADAPTKQLSDVISGLLVLAQAGLHDQKLRQQLQPDVREAYLELLKSADVSRIDRGETKSVRLMFDLTPQFLEAARTPTLPASPEATPKKPSPNKSTIRN
Ga0310686_11113027423300031708SoilRDELLAQMPTDASTVLFLDLDALRQSPFLAELYKWAPQTKADPDYAQFLQSTGFNYETDLSRVSIAVLKHGQETALFAVADGKFDRKKISAYASQTGTRESRGRKEIFSVPVNGSARRIAFTFLRDNRIALTNDATLESSLSAQHGASDAQAWRERFRRLAGSPVFAVVRQDAAAGTALSARAPGGLQSPQLTALIDQLQWITVAGKPEADRLRVILEGEGTSDAPTRQLSDVLNGLLVLAQAGLHDQKLRQQLQPDVREAYLEMLKSADVSQIDRGETKSVRLIFDVTPQFLEAARASIPAASPSPQNKVLPNKHTIRN
Ga0307476_1027101613300031715Hardwood Forest SoilMNKRRLAGAILGLVGIGAIACLSYLHWGGSGYNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPEADADYSQFMQSTGFNYERDLNRVSIALLKSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNANTRRIAFTFLRSDRIALTNGSNFETSLSASHEDAESQPWRERFRRLAGAPVFAVVRQDAAAGTALSAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSTDVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPISPVAPQNKALPNKGTIRN
Ga0307478_10001223193300031823Hardwood Forest SoilMNKRRLAGAILGLVGIGAIACLSYLHWGGSGHNPSDEVLAQMPADASAVLYIDLDALRQSPFLAELYKWAPQPEADADYSQFMQSTGFNYERDLNRVSIALLKSGKDTILFAVAEGRFDREKISAYALQTGTRENHGGKEIFSVPQNANTRRIAFTFLRSDRIALTNGSNFETSLSASHEDAESQPWRERFRRLAGAPVFAVVRQDAAAGTALSAQTPRGLQSPQLSALIDQLQWITAAGKPEGDHLRVVLEGEGATDAPTRQLSEVLNGLLVLAQAGLSDQKMRQQLQPDVREAYLEMLKSTDVSLIDRRDTKSVRLMFDLTPKFLEAARAAMPISPVAPQNKALPNKGTIRN
Ga0335076_1010307153300032955SoilMTRNKITAIVAASVALAAILLLSYQHWRDSGPGGREELLAKLPADATAVLYIDLDALRQSPFLAELYKWVPQAEADTDYAEFLKSTGFNYEADLHRAAIAILKRGQDSAFCAVAEGSFDRKRIAAYASQSGTRETRNGKEVFSVPINGMPRRISFTFLRDDRIVLTNDSNLLTPSTHLVTDLDAQAWRERFRRLAGSPIFAVVRQDAAAGEAISARAPGGLQSPQLSALIDQLQWISVAGNPEADHLRVVLEGEGAADGPTRQISDLLTGLLSLAEAGLNDPKVRRELQPQVREAYLELFKSADVSQIDRGDTKSVRLIFDVTPALLEAARSSSPGTPASPQEKAPPGKKATIRN
Ga0310810_1032196433300033412SoilMTLVFGYERWRGSGYDPRNDVLAQMPAESSAVLFIDLDGLRQSPFLGELYKWAPQAKADADYAQFLQSTGFNYETDLRRVSIAVLKHGETNTLFAVAEGRFDRNKISAYASQTGTRENHGGREIFSVPPSGGTRRITFTFLRNDRIALTNGANLEASLSQRSADSDAQAWRERFRRLAGSPVFAVVRQDAGAGAALSTQAPRGWQSPQLSVLIDQLQWITIAAKPEADRLRVVLEGESGAEAPTRQLSGVINGLLVLAQAGLSDQKMREQLQPDVREACLEMLKSADVSQIDRGETKSVR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.