NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F062774



Overview

Basic Information
Family ID F062774
Family Type Metagenome / Metatranscriptome
Number of Sequences 130
Average Sequence Length 174 residues
Representative Sequence MKAGEKFFLLTPKGTEELRGRTPKLDATARSILSLIDQGTTVAESILQRSKFTRDQVIEGLRSLMSNGFIAASTSDATVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Number of Associated Samples 91
Number of Associated Scaffolds 130

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 14.62 %
% of genes near scaffold ends (potentially truncated) 30.77 %
% of genes from short scaffolds (< 2000 bps) 59.23 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42


Most Common Taxonomy
Group Bacteria (63.077 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(40.000 % of family members)
Environment Ontology (ENVO) Unclassified
(42.308 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(62.308 % of family members)
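
The percentages reported in this overview can be converted back to approximate member counts, because the family has 130 sequences. The following is a minimal Python sketch, assuming each percentage is a rounded fraction of those 130 members; the helper name `pct_to_count` is illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 130  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole member count."""
    return round(pct / 100 * total)


# Percentages copied from the Overview section of this page.
reported = {
    "genes with valid RBS motifs": 14.62,
    "genes near scaffold ends": 30.77,
    "genes from short scaffolds": 59.23,
    "members assigned to Bacteria": 63.077,
    "members from GOLD forest soil": 40.000,
}

for label, pct in reported.items():
    print(f"{label}: about {pct_to_count(pct)} of {FAMILY_SIZE}")
```

Under that assumption, 14.62 % corresponds to 19 genes and 63.077 % to 82 family members.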




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 50.00%, β-sheet: 4.95%, Coil/Unstructured: 45.05%

Predicted 3D Structure

pTM-score: 0.42

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
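
The low-confidence rule stated above can be expressed as a simple threshold check. This is a minimal Python sketch, assuming the pTM cutoff of 0.7 quoted on this page is the only criterion; the function name is illustrative and not taken from NMPFamsDB.

```python
PTM_CUTOFF = 0.7  # cutoff quoted on this page for a low-confidence model


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F062774
print(is_low_confidence(0.42))
```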



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 130 Family Scaffolds
PF17201 | Cache_3-Cache_2 | 13.85
PF00903 | Glyoxalase | 10.77
PF13561 | adh_short_C2 | 6.15
PF06127 | Mpo1-like | 3.85
PF11196 | DUF2834 | 2.31
PF00135 | COesterase | 2.31
PF01872 | RibD_C | 2.31
PF01252 | Peptidase_A8 | 1.54
PF01810 | LysE | 1.54
PF07676 | PD40 | 1.54
PF08238 | Sel1 | 1.54
PF00106 | adh_short | 1.54
PF00543 | P-II | 0.77
PF14814 | UB2H | 0.77
PF13023 | HD_3 | 0.77
PF13442 | Cytochrome_CBB3 | 0.77
PF07690 | MFS_1 | 0.77
PF12697 | Abhydrolase_6 | 0.77
PF07282 | OrfB_Zn_ribbon | 0.77
PF14329 | DUF4386 | 0.77
PF00067 | p450 | 0.77
PF00528 | BPD_transp_1 | 0.77
PF14300 | DUF4375 | 0.77
PF05163 | DinB | 0.77
PF08818 | DUF1801 | 0.77
PF06296 | RelE | 0.77
PF06439 | 3keto-disac_hyd | 0.77
PF01209 | Ubie_methyltran | 0.77
PF01636 | APH | 0.77
PF12399 | BCA_ABC_TP_C | 0.77

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 130 Family Scaffolds
COG4539 | 2-hydroxy fatty acid dioxygenase MPO1 (alpha-oxidation of fatty acids) | Lipid transport and metabolism [I] | 3.85
COG0597 | Lipoprotein signal peptidase | Cell wall/membrane/envelope biogenesis [M] | 3.08
COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 2.31
COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 2.31
COG2272 | Carboxylesterase type B | Lipid transport and metabolism [I] | 2.31
COG0347 | Nitrogen regulatory protein PII | Signal transduction mechanisms [T] | 0.77
COG2124 | Cytochrome P450 | Defense mechanisms [V] | 0.77
COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 0.77
COG2227 | 2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol methylase | Coenzyme transport and metabolism [H] | 0.77
COG2318 | Bacillithiol/mycothiol S-transferase BstA/DinB, DinB/YfiT family (unrelated to E. coli DinB) | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.77
COG4430 | Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 family | Function unknown [S] | 0.77
COG4737 | Uncharacterized conserved protein | Function unknown [S] | 0.77
COG5646 | Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis) | Posttranslational modification, protein turnover, chaperones [O] | 0.77
COG5649 | Uncharacterized conserved protein, DUF1801 domain | Function unknown [S] | 0.77



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 63.08 %
Unclassified | root | N/A | 36.92 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000546|LJNas_1029037 | Not Available | 594
3300002245|JGIcombinedJ26739_100713414 | All Organisms → cellular organisms → Bacteria | 881
3300003220|JGI26342J46808_1008821 | Not Available | 927
3300004080|Ga0062385_10050191 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1800
3300004080|Ga0062385_10823633 | Not Available | 609
3300004092|Ga0062389_100708809 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1180
3300004092|Ga0062389_101968581 | Not Available | 761
3300004152|Ga0062386_100040431 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 3494
3300004152|Ga0062386_100746526 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 804
3300004635|Ga0062388_100410216 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1180
3300005537|Ga0070730_10000002 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 606246
3300005602|Ga0070762_10040302 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2538
3300005602|Ga0070762_11144529 | Not Available | 537
3300005650|Ga0075038_10698842 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Chromobacteriaceae → Formivibrio → Formivibrio citricus | 508
3300006050|Ga0075028_100086025 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1580
3300006052|Ga0075029_100021506 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3620
3300006052|Ga0075029_100555495 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Chromobacteriaceae → Formivibrio → Formivibrio citricus | 763
3300006102|Ga0075015_100955294 | Not Available | 522
3300006426|Ga0075037_1735204 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Chromobacteriaceae | 1830
3300007819|Ga0104322_132560 | Not Available | 1654
3300007819|Ga0104322_147566 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 16747
3300009633|Ga0116129_1003130 | All Organisms → cellular organisms → Bacteria | 8040
3300010343|Ga0074044_10262210 | All Organisms → cellular organisms → Bacteria | 1140
3300012205|Ga0137362_10151824 | Not Available | 1981
3300012924|Ga0137413_10001396 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9352
3300012924|Ga0137413_10064947 | Not Available | 2166
3300012929|Ga0137404_10192008 | Not Available | 1725
3300012930|Ga0137407_10861782 | Not Available | 856
3300014501|Ga0182024_10022740 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11284
3300014501|Ga0182024_10337790 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1973
3300014838|Ga0182030_10394035 | Not Available | 1447
3300015051|Ga0137414_1230892 | All Organisms → cellular organisms → Bacteria | 6247
3300015051|Ga0137414_1230893 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2925
3300015051|Ga0137414_1252319 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1514
3300015160|Ga0167642_1002276 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3854
3300015160|Ga0167642_1008689 | Not Available | 1807
3300015206|Ga0167644_1000043 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 126221
3300019882|Ga0193713_1003507 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5071
3300019886|Ga0193727_1001211 | All Organisms → cellular organisms → Bacteria | 10580
3300019887|Ga0193729_1021731 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Phycisphaerae → Phycisphaerales → Phycisphaeraceae → unclassified Phycisphaeraceae → Phycisphaeraceae bacterium | 2775
3300019889|Ga0193743_1208542 | Not Available | 613
3300019890|Ga0193728_1001611 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11579
3300020004|Ga0193755_1000746 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9492
3300020021|Ga0193726_1001020 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 24288
3300020021|Ga0193726_1006444 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6781
3300020021|Ga0193726_1007119 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6400
3300020021|Ga0193726_1012280 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4564
3300020034|Ga0193753_10067878 | Not Available | 1855
3300020060|Ga0193717_1012534 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 3965
3300020060|Ga0193717_1164638 | Not Available | 637
3300020061|Ga0193716_1096061 | Not Available | 1281
3300020199|Ga0179592_10000226 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 22812
3300020579|Ga0210407_10004140 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11511
3300020579|Ga0210407_10005189 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10098
3300020579|Ga0210407_10221426 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1472
3300020579|Ga0210407_10307596 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1238
3300020580|Ga0210403_10011577 | All Organisms → cellular organisms → Bacteria | 7154
3300020580|Ga0210403_10034584 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4039
3300020580|Ga0210403_10748424 | Not Available | 780
3300020580|Ga0210403_10831578 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 732
3300020581|Ga0210399_10004884 | All Organisms → cellular organisms → Bacteria | 10409
3300020581|Ga0210399_10033800 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4090
3300020581|Ga0210399_10221185 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1577
3300020582|Ga0210395_10588573 | Not Available | 835
3300020583|Ga0210401_10409499 | Not Available | 1218
3300021168|Ga0210406_10005692 | All Organisms → cellular organisms → Bacteria | 13001
3300021168|Ga0210406_10007353 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11202
3300021168|Ga0210406_10013763 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7768
3300021168|Ga0210406_10340127 | Not Available | 1212
3300021170|Ga0210400_10001438 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 23683
3300021180|Ga0210396_10080311 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2962
3300021401|Ga0210393_10094289 | Not Available | 2380
3300021401|Ga0210393_10122664 | All Organisms → cellular organisms → Bacteria | 2078
3300021401|Ga0210393_10190990 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1654
3300021401|Ga0210393_10426785 | Not Available | 1081
3300021401|Ga0210393_10644454 | All Organisms → cellular organisms → Bacteria | 865
3300021402|Ga0210385_10796893 | Not Available | 725
3300021405|Ga0210387_10007200 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8522
3300021405|Ga0210387_10040958 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3695
3300021405|Ga0210387_10044430 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3555
3300021406|Ga0210386_10242299 | All Organisms → cellular organisms → Bacteria | 1536
3300021407|Ga0210383_11119108 | Not Available | 664
3300021420|Ga0210394_10001741 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 32959
3300021420|Ga0210394_10006875 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 12176
3300021420|Ga0210394_10434213 | Not Available | 1156
3300021474|Ga0210390_10196275 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1707
3300021475|Ga0210392_10328061 | Not Available | 1103
3300021475|Ga0210392_11157359 | Not Available | 579
3300021477|Ga0210398_10007012 | All Organisms → cellular organisms → Bacteria | 10271
3300021477|Ga0210398_11336413 | Not Available | 562
3300021478|Ga0210402_11365494 | Not Available | 636
3300021479|Ga0210410_11144664 | Not Available | 669
3300021559|Ga0210409_10069365 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3282
3300021559|Ga0210409_10212069 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1761
3300022726|Ga0242654_10092204 | Not Available | 938
3300023255|Ga0224547_1000381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 6164
3300024222|Ga0247691_1004356 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae | 2496
3300025463|Ga0208193_1007308 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Rhodocyclales → Rhodocyclaceae → Azospira → Azospira inquinata | 3686
3300026374|Ga0257146_1009926 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1546
3300026557|Ga0179587_10595196 | Not Available | 728
3300027562|Ga0209735_1016930 | All Organisms → cellular organisms → Bacteria | 1467
3300027635|Ga0209625_1023590 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1368
3300027674|Ga0209118_1004989 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5095
3300027698|Ga0209446_1021445 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1604
3300027701|Ga0209447_10004706 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4090
3300027737|Ga0209038_10026616 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1711
3300027812|Ga0209656_10204821 | Not Available | 954
3300027829|Ga0209773_10020509 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2589
3300027857|Ga0209166_10000004 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 606044
3300027879|Ga0209169_10035657 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2622
3300027908|Ga0209006_10032580 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4725
3300027908|Ga0209006_10402260 | Not Available | 1153
3300027911|Ga0209698_10267602 | Not Available | 1361
3300028017|Ga0265356_1014550 | Not Available | 856
3300029636|Ga0222749_10393670 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 733
3300030626|Ga0210291_11160944 | Not Available | 500
3300030738|Ga0265462_10658670 | Not Available | 814
3300030740|Ga0265460_11799889 | Not Available | 628
3300030743|Ga0265461_12241303 | Not Available | 634
3300030776|Ga0075396_1870884 | Not Available | 544
3300030991|Ga0073994_12243997 | Not Available | 642
3300031231|Ga0170824_111925121 | Not Available | 599
3300031231|Ga0170824_114656121 | Not Available | 718
3300031469|Ga0170819_10559963 | All Organisms → cellular organisms → Bacteria | 532
3300031525|Ga0302326_11545770 | Not Available | 886
3300031708|Ga0310686_114139918 | Not Available | 736
3300031715|Ga0307476_10546104 | All Organisms → cellular organisms → Bacteria | 859
3300031718|Ga0307474_10153256 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1740
3300031823|Ga0307478_10400969 | Not Available | 1136
3300033545|Ga0316214_1013116 | All Organisms → cellular organisms → Bacteria | 1134
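
Each scaffold identifier above joins a numeric prefix and a scaffold name with `|`, and the prefix appears to match the Taxon OID column of the Associated Samples table. The following is a minimal Python sketch of splitting an identifier, assuming that layout holds for every entry; `ScaffoldId` and `parse_scaffold_id` are illustrative names, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str  # numeric prefix before "|" (assumed to be the Taxon OID)
    scaffold: str   # scaffold name after "|"


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split '<taxon_oid>|<scaffold>' into its two parts."""
    taxon_oid, sep, scaffold = raw.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, scaffold)


parsed = parse_scaffold_id("3300004080|Ga0062385_10050191")
print(parsed.taxon_oid, parsed.scaffold)
```

Grouping parsed identifiers by `taxon_oid` would then give the number of family scaffolds contributed by each sample.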




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 40.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.54%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 10.00%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 7.69%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.62%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.85%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 2.31%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 2.31%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.31%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.31%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 1.54%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.54%
Permafrost Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Permafrost Soil | 1.54%
Permafrost Soil | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil | 1.54%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 1.54%
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 0.77%
Bog | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Bog | 0.77%
Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Soil | 0.77%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.77%
Quercus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Quercus Rhizosphere | 0.77%
Roots | Host-Associated → Plants → Roots → Unclassified → Unclassified → Roots | 0.77%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 0.77%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000546 | Quercus rhizosphere microbial communities from Sierra Nevada National Park, Granada, Spain - LJN_Illumina_Assembled | Host-Associated
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003220 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 | Environmental
3300004080 | Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300004635 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005650 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate RNA 2013_055 (Metagenome Metatranscriptome) | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006102 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013 | Environmental
3300006426 | Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate RNA 2013_054 (Metagenome Metatranscriptome) | Environmental
3300007819 | Permafrost core soil microbial communities from Svalbard, Norway - sample 2-1-2 Soapdenovo | Environmental
3300009633 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 | Environmental
3300010343 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1 | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300014838 | Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly) | Environmental
3300015051 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction) | Environmental
3300015160 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G7C, Adjacent to main proglacial river, mid transect (Watson river)) | Environmental
3300015206 | Arctic soil microbial communities from a glacier forefield, Russell Glacier, Kangerlussuaq, Greenland (Sample G8B, Adjacent to main proglacial river, end of transect (Watson river)) | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental
3300019890 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020034 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c2 | Environmental
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental
3300020061 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021401 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O | Environmental
3300021402 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022726 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2) | Environmental
3300023255 | Peat soil microbial communities from Stordalen Mire, Sweden - 717 P2 10-14 | Environmental
3300024222 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK32 | Environmental
3300025463 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 (SPAdes) | Environmental
3300026374 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-A | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027562 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027698 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes) | Environmental
3300027701 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3 (SPAdes) | Environmental
3300027737 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM3 (SPAdes) | Environmental
3300027812 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes) | Environmental
3300027829 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027879 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300028017 | Rhizosphere microbial communities from Maridalen valley, Oslo, Norway - NZE4 | Host-Associated
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300030626 | Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO410-VDE110SO (Eukaryote Community Metatranscriptome) | Environmental
3300030738 | Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VDE Co-assembly | Environmental
3300030740 | Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ARE Co-assembly | Environmental
3300030743 | Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assembly | Environmental
3300030776 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA10 Emin (Eukaryote Community Metatranscriptome) | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031469 | Fir Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300033545 | Spruce roots microbial communities from Maridalen valley, Oslo, Norway - NRE4 | Host-Associated

Geographical Distribution
[Interactive map, powered by OpenStreetMap]

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
LJNas_102903713300000546Quercus RhizosphereMKAGEKYFLLTPKGIEELRGRAPKLDADIRIVLSLIDQGFTSADAILQRSKSSRDEMIDTLRSLLSNGFVATATSDGTVKAAAPEPTPSVADSVSERLRLKQGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPALVACVREINETDYDG*
JGIcombinedJ26739_10071341413300002245Forest SoilMKSGEKYFVLTPKGIEELRGRAPKLDANSRSVLSLIEQGFTSADPLLQRSKSTRDEMIDLLRLLLSNGFISTATSDAAVKAPTPEPTPSVADSVSERLRLKQGISPSQARFVLSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDS
JGI26342J46808_100882113300003220Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVTCVREINETDF*
Ga0062385_1005019133300004080Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVTSLQMALDNIRTEVK
Ga0062385_1082363313300004080Bog Forest SoilTHKLDAGSRNILSLIELGAASPDAILQRSKSPRDEVIDHLRSLMTHGFVATAASEGTTHSGTRDSTSSVAGSISERLRLKPGISPAQGRFLLTNFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF*
Ga0062389_10070880913300004092Bog Forest SoilMKAGDKFFLLTPKGMEELRGRASKLDTNARNFLSLIEQGYTTAEVMLQRSKVSREDVIDGLRLLLGGGFVATGISDGTTSAAVPEATPSVADSMNERLRLKPGVSPSQARFVLSNFCLDQFGAQGKDLADVVEFCTDVPSLQLALDNIRTEVKKLCPERRPALVACVREINETDY*
Ga0062389_10196858123300004092Bog Forest SoilMKVGEKYFVLTPKGIEELRGRAPKLDANTRGVLSQIEQGFRSAEALLQRSKSSRDEMIDLLRLLLSNGFISTATSDGSVRAPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSVRSEVRKICPDRRPALVACVREINETDYDG*
Ga0062386_10004043143300004152Bog Forest SoilMCGLPHFAYSSEVPQTANAGEYMKAGEKYFLLTPKGMEELRGRTHKVDAVARNILSLIELGATSPDAILHRSKSPRDEVIDHLRSLLSAGFVATATSGGTAQPGTPDSTSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTDGQVLADAVEFCTDVASLQLALDNIRTEVKKLGPERRSALVACVREINETDF*
Ga0062386_10074652613300004152Bog Forest SoilMKAGEKYFLLTPKGMEELRGRAHKLGAAARNILTLIEHGATSPEAILQRVKSPRDEVIDQLRSLLSQAYLATATSDGTAQPGTPDSAGSVSGSSSERLRLKPGISPAQARFLLSKFCLDQFGTGGQILADAVDFCTDVTSLQMALDNIRTE
Ga0062388_10041021613300004635Bog Forest SoilMCCGCPVWHIVRRSAGGVRVGEPMKAGDKFFLLTPKGMEELRGRASKLDANARNFLSLIEQGYTTAEVMLQRSKVSREDVIDGLRLLLGGGFVATGISDGTTSAAVPEATPSVADSMNERLRLKPGVSPSQARFVLSNFCLDQFGAQGKDLADVVEFCTDVPSLQLALDNIRTEVKKLCPERRPALVACVREINETDY*
Ga0070730_100000021243300005537Surface SoilMKAGEKYFALTPKGVEELRGRAAKLDANTRNILSLIEQGFTSADALLQRSKSTRDEMIDMLRLLLGNGFVSTAVSDGTVKAPTPEPTPSVADSISERLRLKQGISPSQARFALSNFCLDQFGTAGKDLADVVDLCEDVAGLQMALDSIRSEVKRVCPDQRPALVACVREINETDYDG*
Ga0070762_1004030233300005602SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEHGSTNPDAILQRSTAPRETVIDGLRWLISHGFVATGTSDGTPSQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF*
Ga0070762_1114452913300005602SoilDAGAKNILSLIEQGAGSPDAILQRSKAPREAVIDGLRWLISHGFVASGTGEGTTSHGAHDAASVTSSISERLRLKPGISPAQARFLLTNFCLDQFGAEGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRSALVACVREINETDF*
Ga0075038_1069884213300005650Permafrost SoilYFLLTPKGVDELRARASKLEAGAKNFLSLIEQGYTTAETILQRSKASREEVIDGLRLLLSSGFISTSTSEATVQPKPESTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDSLFTVAFLLYLVL*
Ga0075028_10008602523300006050WatershedsMKAGEKFFLVTPKGTEELRGRAHKLDAGAKNILSLIEQGATNPDAILQRSKAPRDAVLDGLRWLMSHGFVATGTSDGTTSPGAHDATSSVASSISERLRLKPGISPAQARFLLTNFCLDQLGADGQVLANAVEFCTDVASLQMALDNIRTEVKRLGPEGRTALVACVHEINESDF*
Ga0075029_10002150643300006052WatershedsMKAGEKYFLVTPKGKEELRGRAHKLDAAAKSILSLIEQGSTNPDAILQRSKASREAVIEGLRWLINHGFVTTGTDDGTTPQGSAAASRVADSISERLRLKPGVSPAQARFLLTNFCLDQFGAAGQVLADAVELCTDVTSLQMALDNIRTEVKKLGPEGRAALVGCVREINDTDY*
Ga0075029_10055549513300006052WatershedsVTPKGVDELRGRAHKLDAGAKNILSLIEQGATNPDAILQRSKAPRDAVLDGLRWLMSHGFVATGTSDGTTSPGAHDATSSVASSISERLRLKPGISPAQARFLLTNFCLDQLGADGQVLANAVEFCTDVASLQMALDNIRTEVKRLGPEGRTALVACVHEINESDF*
Ga0075015_10095529413300006102WatershedsGATNPDAILQRSKAPRDAVLDGLRWLMSHGFVATGTSDGTTSPGAHDATSSVASSISERLRLKPGISPAQARFLLTNFCLDQLGADGQVLANAVEFCTDVASLQMALDNIRTEVKRLGPEGRTALVACVHEINESDF*
Ga0075037_173520423300006426Permafrost SoilDELRARASKLEAGAKNFLSLIEQGYTTAETILQRSKASREEVIDGLRLLLSNSFIATATSDGTTPAATPEATPSVADSISERLRLKPGISPSQARFALSNFCLDQFGPAGKDFADAVEFCTDVASLQLALDNIRTEVKKLCPERRPALVACVREINDTDY*
Ga0104322_13256023300007819Permafrost SoilMKAGEKYFLLTPKGMEELRGRAPKLDANARSALSLIDQGFTSADAILQRSKSSRDEMIDTLRSLLGNGFVATATSDGTVKAAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGAAGKDLADVVDLCVDVAGLQLALDSIRSELKRVCPERRPLLVACVREINDTDF*
Ga0104322_14756683300007819Permafrost SoilMKSGEKYFLLTPKGTEELRGRTPKLDAKTRNVLSLIDQGCTSADAILQRSKSSRDDMIDVLRLLLSNDFVATATSDGTVKAVTPEPTPSVADSVSERLKLKPGISPSQARFSLSNFCLDQFGTAGKELADVVDLCADVAGLQLALDTIRSEVKKICPDRRPALVACVREINDTDF*
Ga0116129_100313023300009633PeatlandMKAGEKYFVLTPKGIEELRGRAPKRDANTRSVLSLIEQGFTSADALLQRSKSTRDEMIDMLRLLLSNGFISTAISDGTVKAPTPEPTPSVADSVSERLRLKQGVSPSQARFVLSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG*
Ga0074044_1026221013300010343Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASVTNSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVTSLQMALGNIRTEVKRLGPEGRAALVACVREINETDF*
Ga0137362_1015182423300012205Vadose Zone SoilMKTGEKCFLLTPKGTEELRGRTPKLDANARSILSLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTGDGAVPPPPSAMPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLAEAVGFCTDVASLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137413_10001396173300012924Vadose Zone SoilMKAGEKFFLLTPKGMEELRGRTPKLAATARSILSLIDQGTTVAESILQRSKLTRDEVIEGLRSLLSSGLISTSTSDVTAQSTPESTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137413_1006494723300012924Vadose Zone SoilMKAGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEGTVQAPPASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137404_1019200823300012929Vadose Zone SoilMKAGEKFFLLTPKGTEELRSQTPKLDVSARGILSLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTGDGAVPPPPSAMPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLAEAVGFCTDVASLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137407_1086178213300012930Vadose Zone SoilIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEGTVQAPPASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0182024_10022740153300014501PermafrostMKAGEKCFLLTPKGVEELRGRTPKLDATAMSILSLIDHGTTVAESILQRSKFTRDQVIEGLRVLLSNGFISTSTSDVTVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVSFCTDVDSLQLALDNIRTEVKKLFPERRAALVACVREINETDY*
Ga0182024_1033779023300014501PermafrostMKAGEKYFVLTPKGIEELRGRAPKLDANTRSVLSLIEQGFTSADALLQRSKSTRDEMIDTLRVLLSNGFISTAISDGTVKAPMPGPTPSVSDSVSERLRLKQGVSPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG*
Ga0182030_1039403513300014838BogMKAGEKFFVLTPKGVEELRGRTSKLDANARNFLSLIEQGYTTAEAILLRSKAAREDVIDGLRLLLSNNFVATATSDGTTQAGTPDSTPSVADSISERLRLKPGISPSQARFMLSNFCLDQFGTDGKDLADAVEFCVDVDSLQLALDNIRTEVKKLFPERRPELVACVREINETDF*
Ga0137414_1230892133300015051Vadose Zone SoilMKAGEKFFLLTPKGMEELRGRTPKLAATARSILSLIDQGTTVAESILQLSKLTRDEVIEGLRSLLSSGLISTSTSDATAQSTPESTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137414_123089313300015051Vadose Zone SoilMKAGEKFFLLTPKGMEELRGRTPKLAATARSILSLIDQGTTVAESILQRSKLTRDEVIEGLRSLLSSGLISTSTSDATAQSTPESTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0137414_125231913300015051Vadose Zone SoilMKAGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEGTVQAPPVSTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0167642_100227643300015160Glacier Forefield SoilMKAGEKFFLLTPKGTEELRGRTPKLDATARSILSLIDQGTTVAESILQRSKFTRDQVIEGLRSLMSNGFIAASTSDATVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF*
Ga0167642_100868923300015160Glacier Forefield SoilMKAGEKYFLLTPKGTEELRGSAPKLDANPRSVLSLIDQGFTSADAILQRSKSSRDELIDTLRSLLSNGFVATATSDETVKAAPPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVFPDRRPLLVACVREINDTDF*
Ga0167644_10000431093300015206Glacier Forefield SoilMKAGEKYFLLTPKGIEELRGRAPKLDANPRSVLSLIEQGLTSADALLQRSKSTRDEMIDVLRSLLSNGFISTALSDGTVKAPTPEPAPSVADSVSERLRLKQGVSPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG*
Ga0193713_100350743300019882SoilMKAGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEQGTAVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEGTVQAPPASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193727_100121153300019886SoilMKAGEKFFLLTPKGMEELRGRTPKLDATARSILSLIDQGTTVAESILQRSKLTRDEVIEGLRSLLSSGLISTSTSDVTVQPTPESKPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193729_102173113300019887SoilMKAGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEHGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEETVQAPPGSTSAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193743_120854213300019889SoilLSLIEQGTAVAESILQRSKFTRDQVIEGLRVLLSSGFISTSTSEGTVQAPPSSTPAVSVADSVSERLRLKPGISPSQARFMLSNFCLDQFGTNGQDLADAVGFCTDVTSLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193728_1001611163300019890SoilMKAGEKFFLLTPKGMEELRGRSPKLDATARSILSLIDQGTAVAESILQRSKLTRDEVIEGLRSLLSSGLISTSTSDLTVQPTPESKPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193755_100074613300020004SoilMKAGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEQGTAVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEGTVQAPPASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLF
Ga0193726_1001020253300020021SoilMRAGEKFFLLTPKGMEELRGRTPKLDANARSLLSLIDQGTTVAESILQRSKFTRDQVIEGLRVLLSSGFISTSIGDGTVQASPDSAPAVSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGLCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINDTDY
Ga0193726_1006444103300020021SoilMKAGEKFFLLTPKGTEELRGRTPKLDAASRSILSLIDQGTSVAESILKRSKFSRDQVIEGLRSLLSSGFISTSTSDVTVQPTPESTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDY
Ga0193726_100711983300020021SoilMKIGEKFFLLTPKGTEELRSQTPKLDVIARGILSLIEQGTTVAESILQRSKFTRDQVIEGLRVLLSSGFISTSTSEGTVQAPPASTLASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVPSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193726_101228023300020021SoilMKTGEKFFLLTPKGMEELRGRTPKLDAASRSILSLIDQGTAVAESILKRSKFTRDQVIEGLRSLLSNGFISTSTSDATVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDY
Ga0193753_1006787813300020034SoilMKAGEKFFLLTSKGTEELRGRTPKLDATARSILSLIDQGTTVAESILQRSKLTRDEVIEGLRSLLSSGLISTSTSDVTVQPTPESKPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193717_101253463300020060SoilMKAGEKFFLLTPKGTEELRSRTPKLDAIAMSILSLIEQGTAVAESILQRSKFTRDQVIEGLRVLLSSGFISTSTSEGTVQAPPRSTPAVSVADSVSERLRLKPGISPSQARFMLSNFCLDQFGTNGQDLADAVGFCTDVTSLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193717_116463813300020060SoilMKAGEKFFLLTPKGTEELRGRTPKLDTAARSILSLIEQGTTVAESILQRSKLTRDQVIEGLRVLLSSGFISSSISDGTVQAPPGSTPAVSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDVVSLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0193716_109606123300020061SoilMKAGEKFFLLTPKGTEELRGRTPKLDATARSILSLIEQGTTVAESILQRSKLTRDQVIEGLRLLLSNGFISSSMSDGTAQAPPAPTPAVSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0179592_1000022683300020199Vadose Zone SoilMKTGEKCFLLTPKGTEELRGRTPKLDANARSILSLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTGDGAVPPPPSAMPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLAEAVGFCTDVASLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0210407_1000414083300020579SoilMMCGLPQLAYSSQYMKAGEKYLMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVVCVREINETDF
Ga0210407_10005189133300020579SoilMKAGEKFFLLTPKGMEELRGRTPKLDAAARSILSLIDQGTTVAESILQRSKFTRDQVIEGLRVLLSSGFISTSTSDTPVQPAPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTELKKLFPERRAALVACVREINETDF
Ga0210407_1022142623300020579SoilMTAGEKFFLLTPKGVEELRGKTPKLDATARSILSLIEQGTIVAESILQRSKFTRDQVIEGLRSLLSNGFISSSMSDVPVQPTPEATPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0210407_1030759623300020579SoilMVRLSDPRSGVDCLISCLRQDVRQAPNLAYSSHDRRRSEAGELMKAGEKYFLLTPKGTEELRGRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0210403_1001157763300020580SoilMKAGEKYFLLTPKGIEELRGRAPKLDANTRSVLSLIDQGFTSADAILQRSKSSRDELIDMLRSLLSNGFVATATSDGTVKTAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINDTDY
Ga0210403_1003458443300020580SoilMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVVCVREINETDF
Ga0210403_1074842413300020580SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEHGSTNPDAILQRSTAPRETVIDGLRWLISHGFVATGTSDGTPSQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF
Ga0210403_1083157823300020580SoilMKAGEKYFLLTPKGMEELRLRSHKVDAGARNILSLIELGATSADAILQRSKSPRDEVLDHLRSLMSTGFVATAASDGTAQPGTPDATSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTAGQVL
Ga0210399_1000488443300020581SoilMKAGEKHLLVTPKGVEELRGRAHKLDAGAKNILSLIEQGAGSPDAILQRSKAPREAVIDGLRWLISHGFVASGTGEGTTSHGAHDAASVTSSISERLRLKPGISPAQARFLLTNFCLDQFGAEGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRSALVACVREINETDF
Ga0210399_1003380043300020581SoilMKAGEKYFLLTPKGMEELRLRSHKVDAGARNILSLIELGATSADAILQRSKSPRDEVLDHLRSLMSTGFVATAASDGTAQPGTPDATSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTAGQVLADAVDFCTDVASLQIALDNIRTEVKKLGPERRAALVACVREINETDY
Ga0210399_1022118523300020581SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210395_1058857323300020582SoilKVGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTHPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCADVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210401_1040949923300020583SoilVKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCADVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210406_10005692203300021168SoilMKAGEKFFLLTPKGMEELRGRTQKLDAAARGILSLIDQGTTVAESILQRSRFNRDQVIEGLRVLLSSGFISTSTSDAPVQPSPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTELKKLFPERRAALVACVREINETDF
Ga0210406_1000735323300021168SoilMCGKRPHLAYSSQGRRRFQTGESMKAGEKYFLLTPKGMEELRGRAPKLDANTRSVLSLIDQGFTSADAILQRSKSSRDELIDMLRSLLSNGFVATATSDGTVKTAAPEPPPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINDTDY
Ga0210406_1001376323300021168SoilVRLSDPRSGVDCLISCPRQDVRQAPNLAYSSQDRRRSDAGEIMKAGEKYFLLTPKGTEELRDRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0210406_1034012713300021168SoilQELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVVCVREINETDF
Ga0210400_10001438283300021170SoilMCGLPQLAYSSQYMKAGEKYLMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVVCVREINETDF
Ga0210396_1008031133300021180SoilMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVACVREINETDF
Ga0210393_1009428933300021401SoilGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEHGSTNPDAILQRSTAPRETVIDGLRWLISHGFVATGTSDGTPSQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF
Ga0210393_1012266413300021401SoilMCGLPQLAYSSQYMKAGEKYLMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVACVREINETDF
Ga0210393_1019099013300021401SoilMVRLSDPRSGVDCLISCPRQDVRQAPNLAYSSQDRRRSDAGEIMKAGEKYFLLTPKGTEELRGRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRS
Ga0210393_1042678523300021401SoilHAGENMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210393_1064445423300021401SoilMKAGEKFFLLTPKGTEELRGRAPKLDATARSILSLIDQGTIVAESILQRSKLTRDQVIEGLRSLLSNGFISSSMSDVTVQPTPEPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGSNGQDLAEAVGFCTDVDSLQLALDNIRTEVKKLFPERRAALVACVREINETDY
Ga0210385_1079689313300021402SoilVKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSMNPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCADVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210387_1000720073300021405SoilMKAGEKFFLLTPKGMEELRSRASKIDANAKIILSLIDQGTTTAEAILQRSKSTRDDVIEGLRVLLSNAFVATATSDGTVQPKSPDPAPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGAKGQDLADAVGFCTDVPSLQLALDNIRTEVKTLCPEQRPALVACVREINDTDF
Ga0210387_1004095813300021405SoilMKAGEKYFLLTPKGIEELRGRAPKLDANTRSVLSLIDQGFTSADAILQRSKSSRDELIDMLRSLLSNGFVATATSDGTVKTAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVCP
Ga0210387_1004443033300021405SoilMKAGEKYFLLTPKGTEELRDRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0210386_1024229923300021406SoilMKAGEKYFVLTPKGIEELRGRAAKLDANSRNVLSLIEQGFTSADALLQRSKSTRDEMIDMVRLLLSNGFISTAISDGTVKTPTPTPAPAPSVADSVSERLRLKQGVSPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG
Ga0210383_1111910813300021407SoilGAKNILSLIEHGSTNPDAILQRSTAPRETVIDGLRWLISHGFVATGTSDGTPSQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF
Ga0210394_10001741433300021420SoilMKAGEKYFLLTPKGMEELRLRSHKVDAGARNILSLIELGATSADAILQRSKSPRDEVLDHLRSLVSTGFVATAASDGTAQPGTPDSTSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTAGQVLADAVDFCTDVASLQIALDNIRTEVKKLGPERRAALVACVREINETDY
Ga0210394_10006875133300021420SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLISHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVVCVREINETDF
Ga0210394_1043421323300021420SoilMVRLSDPRSGVDCLISCPRQDVRQAPNLAYSSQDRRRSDAGEIMKAGEKYFLLTPKGTEELRDRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0210390_1019627523300021474SoilMKAGEKFFLVTPKGIEELRGRAHKLDAGAKSILSLIEQGSTNPDAILQRSKEPRETVIDGLRWLLSHGFVATGTSDGATPQGAHDATSVTSSISERLRFKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210392_1032806113300021475SoilMKAGEKFFLLTPKGMEELRGRTPKLDAAARSILSLIDQGTTVAESILQRSKFTRDQVIEGLRVLLSSGFISTSTSDTPVQPAPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTELKKLFPERRAALVA
Ga0210392_1115735913300021475SoilMVRLSDPRSGVDCLISCLRQDVRQAPNLAYSSHDRRRSEAGELMKAGEKYFLLTPKGTEELRGRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAG
Ga0210398_1000701243300021477SoilVKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTHPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0210398_1133641313300021477SoilLRGKTPKLDATARSILSLIEQGTIVAESILQRSKFTRDQVIEGLRSLLSNGFISSSMSDVPVQPTPEATPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0210402_1136549413300021478SoilMKAGEKYFLLTPKGIEELRGRAPKLDANTRSVLSLIDQGFTSADAILQRSKSSRDELIDMLRSLLSNGFVATATSDGTVKTAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVAC
Ga0210410_1114466413300021479SoilPKGVEELRCRAHKLDAGAKNILSLIEHGSTNPDAILQRSTAPRETVIDGLRWLISHGFVATGTSDGTPSQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPKGRAALVACVREINETDF
Ga0210409_1006936523300021559SoilMMCGLPQLAYSSQYMKAGEKYLMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVACVREINETDF
Ga0210409_1021206923300021559SoilVKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLVSHGFVATATSDGTASQGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLADAVEFCSDVTSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0242654_1009220433300022726SoilKAGEKYFLLTPKGTEELRDRTPKLDANVRSVLSLIDQGFTSSDAILQRSKSSRDELIDMLRSLLSNGFVATATSDGTVKTAAPEPPPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDY
Ga0224547_100038123300023255SoilMKAGEKCFLLTPKGVEELRGRTPKLDATAMSILSLIDHGTTVAESILQRSKFTRDQVIEGLRVLLSNGFISTSTSDVTVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLAEAVSFCTDVDSLQLALDNIRTEVKKLFPERRAALVACVREINETDY
Ga0247691_100435623300024222SoilMKAGEKYFLLTPKGTEELRSQTPKLDAIARGILTLIEQGTTVAESILQRSKFTRDEVIEGLRVLLSNGFISTSTSEATVQAPPASTPAVSVADSVSERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLADVVGFCTDVASLQLALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0208193_100730853300025463PeatlandMKAGEKYFVLTPKGIEELRGRAPKRDANTRSVLSLIEQGFTSADALLQRSKSTRDEMIDMLRLLLSNGFISTAISDGTVKAPTPEPTPSVADSVSERLRLKQGVSPSQARFVLSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG
Ga0257146_100992613300026374SoilMKAGEKFFLLTPKGMEELRGRTPKLDATARSILSLIDQGTTVAESILQRSKLTRDQVIEGLRVLLSSGFISTSTSEAPVQPTPEPTPSVADSVSERLRLKPGISPSQARFALSNFCLDQFGTNGQDLADAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVAVVREINETDF
Ga0179587_1059519613300026557Vadose Zone SoilMKTGEKYFLLTPKGIEELRGRTPKLDANTRNVLSLIEQGCTSADAILQRSKSSRDAMIDVLRSLLSNDFVATATSDGTVKAATPEPTPSVADSVSERLKLKPGISPSQARFSLSNFCLDQFGTAGKELADVVDLCVDVAGLQLALDSIRSEVKKICPDRRPALVACVREINDTDF
Ga0209735_101693023300027562Forest SoilMMCGLPQLAYSSQYMKAGEKYLLLTPKGMEELRGHVHNLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFAATATSDGTAVPGTPDSKSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPEKRSALVACVREINETDF
Ga0209625_102359023300027635Forest SoilMKAGEKYFLLTPKGIEELRGQAPKLDADIRSVLYLIDQGFTSADAILQRSKCSRDEMIEMLRSLLSDGFVATATSDGTVKAAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKELADVVDLCVDVAGLQLALDSIRSEVKKVCPDRRPLLVACVREINDTDF
Ga0209118_100498973300027674Forest SoilMKAGEKYFLLTPKGTEELRGRAPKLDANIRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATTPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0209446_102144523300027698Bog Forest SoilMKAGEKFFLVTPKGTDELRGRAHKLDAGAKNILSLIEQGATNPDAILQRSKAPREAVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVTCVREINETDF
Ga0209447_1000470663300027701Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVTSLQMALDNIRTEVKRLGPEGRAALVTCVREINETDF
Ga0209038_1002661623300027737Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTASQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVSSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0209656_1020482113300027812Bog Forest SoilAGEYMKAGEKYFLLTPKGMEELRGRTHKVDAVARNILSLIELGATSPDAILHRSKSPRDEVIDHLRSLLSAGFVATATSGGTAQPGTPDSTSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTDGQVLADAVEFCTDVASLQLALDNIRTEVKKLGPERRSALVACVREINETDF
Ga0209773_1002050923300027829Bog Forest SoilMKAGEKFFLVTPKGVEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSQGGHDAASTSSISERLRLKPGISPAQARFLLTDFCLDQLGADGQVLANAVEFCTDVSSLQMALDNIRTEVKRLGPEGRAALVACVREINETDF
Ga0209166_100000041233300027857Surface SoilMKAGEKYFALTPKGVEELRGRAAKLDANTRNILSLIEQGFTSADALLQRSKSTRDEMIDMLRLLLGNGFVSTAVSDGTVKAPTPEPTPSVADSISERLRLKQGISPSQARFALSNFCLDQFGTAGKDLADVVDLCEDVAGLQMALDSIRSEVKRVCPDQRPALVACVREINETDYDG
Ga0209169_1003565733300027879SoilMKVGEKFFLLTPKGVEELRAKASKLDANARSYLSLIEQGYPTAEGMLQRSKAAREDVIEGLRVLLSNGFIATATSDGTTPAPTSEPAPTVADSISERLKLKPGISPSQARFILSNFCLDQFGTAGKDLSDAVEFCTDVPSLQMALDNIRTEVKKICPERRPALVACVREINDTDY
Ga0209006_1003258023300027908Forest SoilMKAGEKYFLLTPKGIEELRGQAPNLDADIRSVLYLIDQGFTSADAILQRSKCSRDEMIDTLRSLLSDGFVATATSDGTVKAAAPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKELADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINDTDY
Ga0209006_1040226023300027908Forest SoilVDSLTSRLRQDVLQAPHLAYSSQDRRRSEAGECMKAGEKYFLLTPKGTEELRGRTPKLDADVRSVLSLIDQGFTSSDAILQRSKSSRDELIDTLRSLLSNGFVATAVSDGTVKATPPEPTPSVADSISERLRLKSGISPSQARFALSNFCLDQFGTAGKDLADVVDLCVDVAGLQMALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0209698_1026760223300027911WatershedsMKAGEKYFLVTPKGKEELRGRAHKLDAAAKSILSLIEQGSTNPDAILQRSKASREAVIEGLRWLINHGFVTTGTDDGTTPQGSAAASRVADSISERLRLKPGVSPAQARFLLTNFCLDQFGAAGQVLADAVELCTDVTSLQMALDNIRTEVKKLGPEGRAALVGCVREINDTDY
Ga0265356_101455013300028017RhizosphereMKAGEKYFVLTPKGIEELRGRASKLDANSRSVLSLIEQGFTSADPLLQRSKCTRDEMIDMLRLLLSNGFISTAISDAAVKAPTPEPTPSVADSVSERLRLKHGVSPSQARFVLSNFCVDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRIC
Ga0222749_1039367013300029636SoilMMCGLPQLAYSSQYMKAGEKYLMLTPKGMEELRLHAHKLDAGARNILSLIELGATSPEAILQRSKSTREEVIDHLRSLIGHGFVATATSDGTAAPGAPDSTSSVSSSTSERLRLKPGISPAQARFLLTNFCLDQFGAAGQVLADAVDFCTDVTSLQMALDNIRTEVKKLGPDKRSALVAC
Ga0210291_1116094413300030626SoilMKAGEKFFLVTPKGVEELRGRAHKLDAAAKSILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSPGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDN
Ga0265462_1065867023300030738SoilMKAGEKFFLVTPKGVEELRGRAHKLDAAAKSILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSPGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF
Ga0265460_1179988913300030740SoilLLVTSKGTEELRGRAHKLDAGAKNILSLIEQGSTNPDAILQRSKAPREAVIDGLRWLMSHGFVATGTSDGTASHGAHDATSAASSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKKLGPEGRAALVACVREINETDF
Ga0265461_1224130313300030743SoilEKFFLVTPKGVEELRGRAHKLDAAAKSILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLMSHGFVATGTSDGTPSPGAHDAASVTSSISERLRLKPGISPAQARFLLTDFCLDQFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKRLGPNGRSALVACVREINETDF
Ga0075396_187088413300030776SoilGESMKAGEKYFLLTPKGMEELRGRAPKLDADIRIVLSLIDQGFTSADAILQRSKSSRDEMIDTLRLLLSNGFVATATSDGTVKAAAPEPTPSVADSISERLRLKQGISPSQGRFALSNFCLDQFGTSGKELADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0073994_1224399713300030991SoilMRSQAGETMKTGEKYFLLTPKGIEELRGRTPKLDTSTRNVLSLIEQGCTSADAILLRSKSSRDDMIDVLRLLLSNDFVATATSDGTVKPAAPEPTPSVADSVSERLKLKPGISPSQARFSLSNFCLDQFGTAGKELADVVDLCVDVAGLQLALDSIRSEVKKICPDRRPALVACVREINDTDF
Ga0170824_11192512113300031231Forest SoilSQSGESMKAGEKYFLLTPKGMEELRGRAPKLDADIRIVLSLIDQGFTSADAILQRSKSSRDEMIDTLRLLLSNGFVATATSDGTVKAAAPEPTPSVADSISERLRLKQGISPSQGRFALSNFCLDQFGTSGKELADVVDLCVDVAGLQLALDSIRSEVKKVCPDRRPALVACVREINETDYDG
Ga0170824_11465612113300031231Forest SoilLIEQGAGSPEAILQRTKAPRETVIEGLRWLISHGFVAAGTGEGTASQGAHDAASVTSSISERLRLKPGISPSQARFVLSNFCLDQFGTNGQDLAEAVGFCTDVDSLQMALDNIRTEVKKLFPERRAALVACVREINETDF
Ga0170819_1055996313300031469Forest SoilSQSGESMKAGEKYFLLTPKGMEELRGRAPKLDADIRIVLSLIDQGFTSADAILQRSKSSRDEMIDTLRLLLSNGFVATATSDGTVKAAAPEPTPSVADSISERLRLKQGISPSQGRFALSNFCLDQFGTSGKELADVVDLCVDVAGLQLEEARGVVVDDPNDDLIDLWAAAPVVVVG
Ga0302326_1154577023300031525PalsaDKFFLLTPKGLEDLRGKASKLDPHARNFLSLIEQGYTTADVILQRSKAPREDVIEGLRLLLSKGFVATGLSDGTTPVPAPEASPSVADSISERLRLKPGVSPSQARFLLSNFCLDQFGTQGKDLADAVEFCADVPSLQLALDNIRTEVKKICPERRPALVACVREINETDY
Ga0310686_11413991813300031708SoilMKAGEKFFLVTPKGVEELRGRADKLDAGAKNILSLIEQGSTNPDAILQRSKAPRETVIDGLRWLISHGFVATGTSDGTTSPAAHDAVSVTSSISERLRLKPGISPAQARFLLTDFCLDHFGADGQVLANAVEFCTDVTSLQMALDNIRTEVKRLGPEGRALLVACVREINETDF
Ga0307476_1054610423300031715Hardwood Forest SoilMKAGEKYFVLTPKGIEELRCRAPKLDANSRNVLSLIEQGFTSADALLQRSKTTRDEMIDMVRLLLSNGFISTVISDGTVKAPAPAPAPTLTPAPTPSVADSVSERLRLKHGVSPSQARFVLSNFCLDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYDG
Ga0307474_1015325623300031718Hardwood Forest SoilMKAGEKYFLLTPKGMEELRGRAPKLDAGTRSVLSLIDQGFTSADAILQRSKSSRDEMIDTLRSLLSNGFVATATSDGTVKATPPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKDLADIVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINDTDF
Ga0307478_1040096923300031823Hardwood Forest SoilMVRLSDPRSGVDCLISCPRQDVRQAPNLAYSSQDRRRSDAGEFMKAGEKYFLLTPKGTEELRGRVQKLDANTRSVLSLIDQGFTSADAILQRSKTSRDEMIDLLRLLLSNGFIATAVSDGTVKAAPPEPTPSVADSISERLRLKSGISPSQGRFALSNFCLDQFGTAGKELADVVDLCVDVAGLQLALDSIRSEVKRVCPDRRPLLVACVREINETDF
Ga0316214_101311613300033545RootsMREDVCGQHPLWHIVPRTAGVLQAGASMKAGEKYFVLTPKGIEELRGRASKLDANSRSVLSLIEQGFTSADPLLQRSKCTRDEMIDMLRLLLSNGFISTAISDAAVKAPTPEPTPSVADSVSERLRLKHGVSPSQARFVLSNFCVDQFGTAGKDLADVVDLCVDVAGLQLALDSIRSEVKRICPDRRPALVACVREINETDYD
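
Each row of the Family Sequences table carries four fields: protein ID, sample taxon ID, habitat, and amino-acid sequence, with an optional trailing `*` on the sequence. The following is a minimal sketch, not part of the database itself, for splitting a row into those fields and writing it out as a FASTA record. It assumes rows are available one per line, that every taxon ID is ten digits beginning with `3300` (true of the rows listed here, not guaranteed elsewhere), and that habitat names end in a lowercase letter. The names `SeqRow`, `parse_row`, and `to_fasta` are hypothetical. The pattern also accepts rows whose fields run together without separators.

```python
import re
from typing import NamedTuple, Optional


class SeqRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str  # trailing "*" removed


# Assumed layout: protein ID, 10-digit taxon ID starting with "3300",
# habitat (ends in a lowercase letter), uppercase sequence with optional "*".
# Separators (" | " or whitespace) between fields are optional.
ROW = re.compile(
    r"^(?P<protein>\S+?)\s*\|?\s*"
    r"(?P<taxon>3300\d{6})\s*\|?\s*"
    r"(?P<habitat>[A-Z][A-Za-z ]*?[a-z])\s*\|?\s*"
    r"(?P<seq>[A-Z]+\*?)$"
)


def parse_row(line: str) -> Optional[SeqRow]:
    """Split one table row into its fields; return None if it does not fit."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return SeqRow(m["protein"], m["taxon"], m["habitat"], m["seq"].rstrip("*"))


def to_fasta(row: SeqRow, width: int = 60) -> str:
    """Format a parsed row as a FASTA record, wrapping the sequence lines."""
    header = f">{row.protein_id} taxon={row.taxon_id} habitat={row.habitat}"
    body = [row.sequence[i:i + width] for i in range(0, len(row.sequence), width)]
    return "\n".join([header, *body])


if __name__ == "__main__":
    # One row copied from the table above (fields run together).
    example = (
        "Ga0210403_1083157823300020580Soil"
        "MKAGEKYFLLTPKGMEELRLRSHKVDAGARNILSLIELGATSADAILQRSKSPRDEVLDHLRSLMS"
        "TGFVATAASDGTAQPGTPDATSSVSSSTSERLRLKPGISPAQARFLLSNFCLDQFGTAGQVL"
    )
    row = parse_row(example)
    if row is not None:
        print(to_fasta(row))
```

Lines that do not match, such as the column header, yield `None`, so a whole saved table can be filtered with `parse_row` before conversion.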




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice