NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F095819

Metagenome / Metatranscriptome Family F095819

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F095819
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 328 residues
Representative Sequence VLLTDCFRESVRRARLFAVVCVVVCAGFFVAAATTPDQPVGDAVGLIDGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLNGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPRNDMPPTTVENASGETARNSSDAAGAGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAAATTPQPPVRNAPANAAPAPPQGSVVDRVRSFFRKLWTRGS
Number of Associated Samples 75
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.38 %
% of genes near scaffold ends (potentially truncated) 34.29 %
% of genes from short scaffolds (< 2000 bps) 34.29 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score0.39

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.048 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(45.714 % of family members)
Environment Ontology (ENVO) Unclassified
(53.333 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(79.048 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 15.08%    β-sheet: 23.81%    Coil/Unstructured: 61.11%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 105 Family Scaffolds
PF05187ETF_QO 21.90
PF04028DUF374 11.43
PF13365Trypsin_2 8.57
PF01121CoaE 4.76
PF00491Arginase 4.76
PF13180PDZ_2 3.81
PF00005ABC_tran 1.90
PF02882THF_DHG_CYH_C 1.90
PF01012ETF 1.90
PF00041fn3 1.90
PF02801Ketoacyl-synt_C 1.90
PF12698ABC2_membrane_3 0.95
PF03009GDPD 0.95
PF00698Acyl_transf_1 0.95
PF12704MacB_PCD 0.95
PF01336tRNA_anti-codon 0.95
PF13570PQQ_3 0.95
PF07228SpoIIE 0.95
PF01019G_glu_transpept 0.95
PF04055Radical_SAM 0.95
PF00534Glycos_transf_1 0.95
PF03551PadR 0.95
PF13561adh_short_C2 0.95
PF00550PP-binding 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 105 Family Scaffolds
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 21.90
COG2440Ferredoxin-like protein FixXEnergy production and conversion [C] 21.90
COG2121Uncharacterized conserved protein, lysophospholipid acyltransferase (LPLAT) superfamilyFunction unknown [S] 11.43
COG0010Arginase/agmatinase family enzymeAmino acid transport and metabolism [E] 4.76
COG0237Dephospho-CoA kinaseCoenzyme transport and metabolism [H] 4.76
COG01905,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolaseCoenzyme transport and metabolism [H] 1.90
COG0686Alanine dehydrogenase (includes sporulation protein SpoVN)Amino acid transport and metabolism [E] 1.90
COG2025Electron transfer flavoprotein, alpha subunit FixBEnergy production and conversion [C] 1.90
COG2086Electron transfer flavoprotein, alpha and beta subunitsEnergy production and conversion [C] 1.90
COG0405Gamma-glutamyltranspeptidaseAmino acid transport and metabolism [E] 0.95
COG0584Glycerophosphoryl diester phosphodiesteraseLipid transport and metabolism [I] 0.95
COG1695DNA-binding transcriptional regulator, PadR familyTranscription [K] 0.95
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 0.95
COG1846DNA-binding transcriptional regulator, MarR familyTranscription [K] 0.95


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms99.05 %
UnclassifiedrootN/A0.95 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001356|JGI12269J14319_10109720All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81317Open in IMG/M
3300001593|JGI12635J15846_10016546All Organisms → cellular organisms → Bacteria6112Open in IMG/M
3300004091|Ga0062387_100550545All Organisms → cellular organisms → Bacteria → Acidobacteria816Open in IMG/M
3300004092|Ga0062389_100092446All Organisms → cellular organisms → Bacteria → Proteobacteria2642Open in IMG/M
3300005336|Ga0070680_100061590All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_83072Open in IMG/M
3300005406|Ga0070703_10035492All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81527Open in IMG/M
3300005444|Ga0070694_100040492All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_83106Open in IMG/M
3300005445|Ga0070708_100005588All Organisms → cellular organisms → Bacteria9966Open in IMG/M
3300005458|Ga0070681_10070837All Organisms → cellular organisms → Bacteria3451Open in IMG/M
3300005518|Ga0070699_100268713All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81526Open in IMG/M
3300005542|Ga0070732_10208304All Organisms → cellular organisms → Bacteria → Acidobacteria1169Open in IMG/M
3300005549|Ga0070704_100053828All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_82845Open in IMG/M
3300005591|Ga0070761_10004470All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae8263Open in IMG/M
3300005602|Ga0070762_10001205All Organisms → cellular organisms → Bacteria11758Open in IMG/M
3300005610|Ga0070763_10023632All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2743Open in IMG/M
3300005712|Ga0070764_10133079All Organisms → cellular organisms → Bacteria → Acidobacteria1355Open in IMG/M
3300005921|Ga0070766_10015827All Organisms → cellular organisms → Bacteria3939Open in IMG/M
3300005921|Ga0070766_10061776All Organisms → cellular organisms → Bacteria2133Open in IMG/M
3300005921|Ga0070766_10138240All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81480Open in IMG/M
3300006176|Ga0070765_100039941All Organisms → cellular organisms → Bacteria → Proteobacteria3746Open in IMG/M
3300006176|Ga0070765_100530230All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81108Open in IMG/M
3300006804|Ga0079221_10045275All Organisms → cellular organisms → Bacteria1945Open in IMG/M
3300006893|Ga0073928_10006080All Organisms → cellular organisms → Bacteria → Acidobacteria16468Open in IMG/M
3300010401|Ga0134121_10021002All Organisms → cellular organisms → Bacteria5224Open in IMG/M
3300010876|Ga0126361_11124591All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81344Open in IMG/M
3300010880|Ga0126350_11589586All Organisms → cellular organisms → Bacteria2161Open in IMG/M
3300012202|Ga0137363_10000355All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae25386Open in IMG/M
3300012361|Ga0137360_10008761All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia6355Open in IMG/M
3300012960|Ga0164301_10340193All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81027Open in IMG/M
3300020579|Ga0210407_10012102All Organisms → cellular organisms → Bacteria6415Open in IMG/M
3300020580|Ga0210403_10000551All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae38637Open in IMG/M
3300020580|Ga0210403_10171050All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81780Open in IMG/M
3300020582|Ga0210395_10000010All Organisms → cellular organisms → Bacteria302746Open in IMG/M
3300020582|Ga0210395_10000886All Organisms → cellular organisms → Bacteria → Acidobacteria23014Open in IMG/M
3300021168|Ga0210406_10004093All Organisms → cellular organisms → Bacteria → Acidobacteria15767Open in IMG/M
3300021168|Ga0210406_10042044All Organisms → cellular organisms → Bacteria → Acidobacteria4048Open in IMG/M
3300021168|Ga0210406_10195936All Organisms → cellular organisms → Bacteria1677Open in IMG/M
3300021170|Ga0210400_10231820All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81506Open in IMG/M
3300021170|Ga0210400_10542417All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_8960Open in IMG/M
3300021171|Ga0210405_10001038All Organisms → cellular organisms → Bacteria → Acidobacteria33625Open in IMG/M
3300021171|Ga0210405_10001337All Organisms → cellular organisms → Bacteria → Acidobacteria28396Open in IMG/M
3300021171|Ga0210405_10005207All Organisms → cellular organisms → Bacteria11957Open in IMG/M
3300021178|Ga0210408_10294690All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81293Open in IMG/M
3300021180|Ga0210396_10126349All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2306Open in IMG/M
3300021181|Ga0210388_10023285All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae5039Open in IMG/M
3300021401|Ga0210393_10000003All Organisms → cellular organisms → Bacteria385019Open in IMG/M
3300021401|Ga0210393_10372959All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81162Open in IMG/M
3300021401|Ga0210393_10501615All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_8991Open in IMG/M
3300021404|Ga0210389_10027667All Organisms → cellular organisms → Bacteria4365Open in IMG/M
3300021405|Ga0210387_10004929All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae10241Open in IMG/M
3300021405|Ga0210387_10061249All Organisms → cellular organisms → Bacteria → Acidobacteria3055Open in IMG/M
3300021407|Ga0210383_10003578All Organisms → cellular organisms → Bacteria13889Open in IMG/M
3300021407|Ga0210383_10010023All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae8093Open in IMG/M
3300021420|Ga0210394_10066297All Organisms → cellular organisms → Bacteria3130Open in IMG/M
3300021432|Ga0210384_10005114All Organisms → cellular organisms → Bacteria → Acidobacteria14419Open in IMG/M
3300021432|Ga0210384_10126134All Organisms → cellular organisms → Bacteria → Acidobacteria2296Open in IMG/M
3300021432|Ga0210384_10300775All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81444Open in IMG/M
3300021433|Ga0210391_10029813All Organisms → cellular organisms → Bacteria4385Open in IMG/M
3300021433|Ga0210391_10057346All Organisms → cellular organisms → Bacteria3093Open in IMG/M
3300021433|Ga0210391_10086542All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_82473Open in IMG/M
3300021433|Ga0210391_10498370All Organisms → cellular organisms → Bacteria → Acidobacteria956Open in IMG/M
3300021477|Ga0210398_10005777All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae11475Open in IMG/M
3300021477|Ga0210398_10081780All Organisms → cellular organisms → Bacteria → Acidobacteria2624Open in IMG/M
3300021478|Ga0210402_10005564All Organisms → cellular organisms → Bacteria11268Open in IMG/M
3300021478|Ga0210402_10064911All Organisms → cellular organisms → Bacteria → Acidobacteria3215Open in IMG/M
3300021478|Ga0210402_10125423All Organisms → cellular organisms → Bacteria2323Open in IMG/M
3300021479|Ga0210410_10000696All Organisms → cellular organisms → Bacteria → Acidobacteria34768Open in IMG/M
3300021559|Ga0210409_10030746All Organisms → cellular organisms → Bacteria → Acidobacteria5167Open in IMG/M
3300022557|Ga0212123_10001501All Organisms → cellular organisms → Bacteria64488Open in IMG/M
3300024178|Ga0247694_1000088All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia34311Open in IMG/M
3300024179|Ga0247695_1005892All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81757Open in IMG/M
3300024182|Ga0247669_1000936All Organisms → cellular organisms → Bacteria9901Open in IMG/M
3300024182|Ga0247669_1000937All Organisms → cellular organisms → Bacteria → Acidobacteria9895Open in IMG/M
3300024246|Ga0247680_1000015All Organisms → cellular organisms → Bacteria → Acidobacteria127471Open in IMG/M
3300024271|Ga0224564_1009267All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81628Open in IMG/M
3300024290|Ga0247667_1004454All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3063Open in IMG/M
3300024331|Ga0247668_1044879All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_8901Open in IMG/M
3300025885|Ga0207653_10013343All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_82572Open in IMG/M
3300025910|Ga0207684_10031923All Organisms → cellular organisms → Bacteria → Acidobacteria4481Open in IMG/M
3300025917|Ga0207660_10047247All Organisms → cellular organisms → Bacteria → Acidobacteria3042Open in IMG/M
3300027660|Ga0209736_1000774All Organisms → cellular organisms → Bacteria10292Open in IMG/M
3300027842|Ga0209580_10036847All Organisms → cellular organisms → Bacteria2243Open in IMG/M
3300027853|Ga0209274_10056607All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1871Open in IMG/M
3300027853|Ga0209274_10105315All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81394Open in IMG/M
3300027855|Ga0209693_10021770All Organisms → cellular organisms → Bacteria3094Open in IMG/M
3300027857|Ga0209166_10004973All Organisms → cellular organisms → Bacteria → Acidobacteria9174Open in IMG/M
3300027889|Ga0209380_10003253All Organisms → cellular organisms → Bacteria → Acidobacteria10131Open in IMG/M
3300027889|Ga0209380_10018204All Organisms → cellular organisms → Bacteria3981Open in IMG/M
3300027908|Ga0209006_10051958All Organisms → cellular organisms → Bacteria3679Open in IMG/M
3300028906|Ga0308309_10003267All Organisms → cellular organisms → Bacteria9700Open in IMG/M
3300028906|Ga0308309_10191034All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81681Open in IMG/M
3300028906|Ga0308309_10513309All Organisms → cellular organisms → Bacteria → Acidobacteria1037Open in IMG/M
3300029636|Ga0222749_10208373All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_8980Open in IMG/M
3300030940|Ga0265740_1009162All Organisms → cellular organisms → Bacteria → Acidobacteria876Open in IMG/M
3300031057|Ga0170834_103747173All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81082Open in IMG/M
3300031128|Ga0170823_10152014All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_8924Open in IMG/M
3300031708|Ga0310686_101045973All Organisms → cellular organisms → Bacteria11318Open in IMG/M
3300031708|Ga0310686_101161883All Organisms → cellular organisms → Bacteria → Acidobacteria5325Open in IMG/M
3300031708|Ga0310686_110682550All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1887Open in IMG/M
3300031718|Ga0307474_10195560All Organisms → cellular organisms → Bacteria1538Open in IMG/M
3300031720|Ga0307469_10222587All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81494Open in IMG/M
3300032770|Ga0335085_10007284All Organisms → cellular organisms → Bacteria → Acidobacteria17232Open in IMG/M
3300032898|Ga0335072_10351833All Organisms → cellular organisms → Bacteria → Acidobacteria1612Open in IMG/M
3300033412|Ga0310810_10582413All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium 13_1_20CM_3_53_81082Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil45.71%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil16.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.67%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.67%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.86%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.86%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere2.86%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil1.90%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring1.90%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil1.90%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.90%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.90%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.90%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil1.90%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.95%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.95%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.95%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001356Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010876Boreal forest soil eukaryotic communities from Alaska, USA - W5-5 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300010880Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300024178Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35EnvironmentalOpen in IMG/M
3300024179Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36EnvironmentalOpen in IMG/M
3300024182Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10EnvironmentalOpen in IMG/M
3300024246Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21EnvironmentalOpen in IMG/M
3300024271Soil microbial communities from Bohemian Forest, Czech Republic ? CSU5EnvironmentalOpen in IMG/M
3300024290Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08EnvironmentalOpen in IMG/M
3300024331Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027853Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030940Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSI1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031128Oak Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032898Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.1EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12269J14319_1010972013300001356Peatlands SoilMVCAGIFVAAAAAPDQPVGDAVGLIEGEDIAVTGPMSVEVVAGVAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSILKAGGAVTVALESGAIHARLEREPALSVYTAQIQGKPLAIGDEPRDFLVGFESTGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLTNGQLEGMGNGAGRCKCELQIAKAPPAPTVAPAVNVAPAPSKDTASSEVVATPGDAPAAMEKPAQKEVPIYTVDMPPLRFDATAKVQPEPDARLMVIVR
JGI12635J15846_1001654653300001593Forest SoilMLCAGIFVAAAAAPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGVAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGAVTVALESGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDTLGNGTGHCKCELQIAKSATVPMPRNEMPAMTVESAKSETTQKSGDSAGTGSGEKLSKKEEPIYTVNMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEPVATAAATPPPPIAVAPTNTAPAPQGSVVDRVRSFFRRLWSRGA*
Ga0062387_10055054513300004091Bog Forest SoilSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGALTVALESGAIHARVEREPALTVYTAEVQAQTVAIGDDPREILVGFENPGMMCIRTYRGALRVEQQLSGRSVIVPQGVDVTLANGQIDAIRNGSGHCQCELQIAKAPPVPAPGNGTSARVGETGSAETAPNASDAPTSGERNEKKEEPIYTVIMPPLRFDANAKVQPEPDPRLMMIVRRVRVRPTLVFQGRVEGEAISAAATGAPQPPAPVVSAPAKAAPPAQGSVVDRVRSFF
Ga0062389_10009244623300004092Bog Forest SoilLRVLLKDCFRESVRRARLFAVVSVMIGAGFFVAAATTPDQPAGDAVGLIEGEDLAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGAGEQPSKNEEVIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPALIFQGRVEGETVATAAAATPQPAVRNAPANAAPAPAQGSVVDRVRSFFRKLWTRGS*
Ga0070680_10006159013300005336Corn RhizosphereAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW*
Ga0070703_1003549213300005406Corn, Switchgrass And Miscanthus RhizosphereVGDELRVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW*
Ga0070694_10004049223300005444Corn, Switchgrass And Miscanthus RhizosphereMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW*
Ga0070708_10000558873300005445Corn, Switchgrass And Miscanthus RhizosphereVGDELRVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHAPAPPEASVFDRVRSFFRRLW*
Ga0070681_1007083723300005458Corn RhizosphereVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW*
Ga0070699_10026871323300005518Corn, Switchgrass And Miscanthus RhizosphereVGDELRVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLL*
Ga0070732_1020830423300005542Surface SoilLIVLLESYLRESVRRVGILAAACAMVCAGVFVAATATADQPGGDAVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGSVTVALESGAIRARLESEPAMSVYTPQIQGQPVAIGDEPREFLVGFENAGVMCVRTFRGAMRLENQLSGQSVIVPQGGDVMLTNGQIESLRNGAGHCNCELQIVKAPAARLPRNEMPAAALENARGETPQNSADSAGTGSAVRSRTASATATEEKPSKKEEPIYTVDMPPLRFNASAKVQPAPDPRLMVIVRRVRVRPTLIFQGRVEGESVTTAAIVSPQP
Ga0070704_10005382813300005549Corn, Switchgrass And Miscanthus RhizosphereMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGEDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW*
Ga0070761_1000447023300005591SoilMIGAGFFVAAATTPDQPAGDAVGLIEGEDLAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNAPVNAAPAPAEGSVVDRVRSFFRKLWTRGS*
Ga0070762_10001205133300005602SoilMQLKAYLPRLARRARLSAAVCGMVCAGLLVAAAAPDQPAGDAVGLIEGQDIRVTGPMSVEVVDGQTKTILRSGSDVQVKSGQARISLQEGGQISICGPAHFSVLKSAGALTLALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIEEMRNGAGQCKCELQIAKIPAAPAGGSEAAANSNDTAGSEKPEAKEEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEGEPAATSAATTAPPKPPV
Ga0070763_1002363223300005610SoilMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISICGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVILANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEAAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRQVRVRPALVFQGRVEGEATATATAVVRTAPPKPPAASAAEKKTTPAEGSVVDRMRSFFRKLWSRGG*
Ga0070764_1013307913300005712SoilMKLKTYLPRLARRARLSAAVCGMVCAGLLVAAAAPDQPAGDAVGLIEGQDIRVTGPMSVEVVDGQTKTILRSGSDVQVKSGQARISLQEGGQISICGPAHFSVLKSAGALTLALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIEEMRNGAGQCKCELQIARIPAAPARGSEAAANSNDTAVSEKPEAKEEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEGEPAATSAATTAPPKPPVASAPAKKAAPAEGSVVDRMRSFFRKLWTRGG*
Ga0070766_1001582733300005921SoilMKLKTYLPRLARRARLSAAVCGMVCAGLLVAAAAPDQPAGDAVGLIEGQDIRVTGPMSVEVVDGQTKTILRSGSDVQVKSGQARISLQEGGQISICGPAHFSVLKSAGALTLALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIEEMRNGAGQCKCELQIAKIPAAPAGGSEAAANSNDTAVSEKPEAKEEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEGEPAATSAATTAPPKPPVASAPAKKAAPAEGSVVDRMRSFFRKLWTRGG*
Ga0070766_1006177633300005921SoilMVCVMIGAGFFVAAATTPDQPAGDTVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNAPVNAAPAPAEGSVVDRVRSFF
Ga0070766_1013824023300005921SoilMLCAGIFVAAADTPDQPVRDAVGLIEGEDIAVTGPMSVEVVGGQTKTILRSGSDVRVKSGQARISLVEGGQISVCGPAHLSVLKSGGAVTLALESGAIRARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLNGQSVMVPQGGDVTLANGQIDALGNGAGHCTCELQIAKAPAVPLPRNEMPARTVEGAKEETVQNSGDAAGAGEKSSKKEEPIYTVDMPPLRFDATAKVQPKPDPRLMVIVRRVRVRPTLLFQGRVEGEVVAAATITAAPPQPPVARAPANTPAQGSVVDRVRSFFRKLWTRGG*
Ga0070765_10003994113300006176SoilVGGQTKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGTVTVALESGAIRARLEREPALSVYTAQIQGQPIAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVMLANGQIDALGNGSGHCTCELQIAKAPAVPLPRNEMPARTVESSKEGTVQNSGDASAAGEKSSKKEEPIYTVDMPPLRFDASAKVQAEPDPRLMVIVRRVRVRPTLIFQGRVEGETVAVAAAIPPQAPAAPARVTPAAPPSQGSVIDRVRSFFRNLWTRGG*
Ga0070765_10053023013300006176SoilTTATADQPAGDSVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIRARLEGEPAISVYTPQLQGQPVAIGDEPRDFLVGFLNPGVMCVRTFRGAMRLEHQLSGQSVIVPQGGDVMLTNGQIDSLRNGAGHCNCELQIAKAPAIPLPKNEVPATAVESAKGQTAESSGGAAGAETVEKPSKKEEPIFTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEAQPATAAAVVTPAPPVSAPAKTPAPTQGSVVDRVRSFFRRLWTRGA*
Ga0079221_1004527533300006804Agricultural SoilMDRRELARRATLFAAVSGMLCLAVFVSASAAPDQPAGDTVGVIDGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLAEGGQLSICGPAHLSVLKSGGSLTVALESGTIHALLEQEPALTVYTAQIQGKTLAIGNDMRELLVGFENPGLMCIHTVRGALRLEQQLSGQSIIVPQGGDVLLANGQIEGMQNGQGHCNCSLQIAKSAPIRSPGSATPSNSAEIATRESAQPSSEPARIPTGAPSSVEKPASKDQPIFQVDMPPLRFDANAKIQTEPDP
Ga0073928_10006080123300006893Iron-Sulfur Acid SpringMVFAGIFVAGAAAPDQPVGDAVGLIEGEDIAITGPMSVEVVGGVAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSSGAVTVALESGAIHARLEREPALSVYTAQIQGQPVAIGDEPREFLVGFEGAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDTLGNGAGHCKCELQIAKAPVVPLPRNEMPAATVESAKSDTTQSSGDAAGTGVGEKLSKKEGPIYTVDTPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEVVAAAAVPPPPVVSAPANAAPAPTQGSVVDRVRSFFRKLWTRGG*
Ga0134121_1002100223300010401Terrestrial SoilMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHAPAPPEASVFDRVRSFFRRLW*
Ga0126361_1112459113300010876Boreal Forest SoilVLLASYSRELRRRAVILAGVCAMVCAGIFVAAAAAPDQPVGDAVGLIEGEDIAVTGPMSVEIVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIHARLEREPALSVYTAQIQGRPVAIGDEPRDFLVGFESAGIMCVRTYRGAMHLEQQLTGQSVMVPQGGDVTLANGQIDSLGNGAGRCKCELQIAKAPAVPAAAPAGSGMPAATGESPNNGTVANSGDAAGAVEKPSKNAEPIYTVDMPPLRFDASAKVQPEPDPRLMAIVRRVRVRPTLIFQGRVEGEPVTAAAATPPQPPAAKAPANTAPPSQGSVVDRVRSFFHRLWTRGG*
Ga0126350_1158958623300010880Boreal Forest SoilVLLASYSRELRRRAVILAGVCAMVCAGIFVAAAAAPDQPVGDAVGLIEGEDIAVTGPMSVEIVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIHARLEREPALSVYTAQIQGRPVAIGDEPRDFLVGFESAGIMCVRTYRGAMHLEQQLTGQSVMVPQGGDVTLANGQIDSLGNGAGRCKCELQIAKAPAVPAAAPAGSGMPAATGESPNNGTVANSGDAAGAVEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMAIVRRVRVRPTLIFQGRVEGEPVTAAAATPPQPPAAKAPANTAPPSQGSVVDRVRSFFHRLWTRGG*
Ga0137363_10000355173300012202Vadose Zone SoilMVCAGIFVRAATPDQSVGDMVGLIEGEDIAVTGPMSVEVVGGIAKTILRSGSDVRVKSGHARISLVEGGQISICGPARLSVLKSGGAVTVALESGAIHARLESELALSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLNGQSVIVPQGGDVTLANGQIDALGNGSGHCACELEIAKAPAVPLPRNAMSARTVEGAKGETVQNSGDAAGAGEKSSKKEEPIYTVDMPPLRFDATAKVQPEPDARLMVIVRRVRVRPTLIFQGQVEGEIVATAAATPAPTVVTRPTPSTPPQAQGSVVDRVRLFFRRLWTRGG*
Ga0137360_1000876123300012361Vadose Zone SoilMVCAGIFVRAATPDQPVGDMVGLIEGEDIAVTGPMSVEVVGGIAKTILRSGSDVRVKSGHARISLVEGGQISICGPARLSVLKSGGAVTVALESGAIHARLESELALSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLNGQSVIVPQGGDVTLANGQIDALGNGSGHCACELEIAKAPAVPLPRNAMSARTVEGAKGETVQNSGDAAGAGEKSSKKEEPIYTVDMPPLRFDATAKVQPEPDARLMVIVRRVRVRPTLIFQGQVEGEMVATAAATPAPTVVTRPTPSTPPQAQGSVVDRVRLFFRRLWTRGG*
Ga0164301_1034019313300012960SoilVLLRVPERELPRRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTGALDSGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCVRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESTIPARSEAGETAKSEPAQNASNSAAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMAIVRRIRVRPTLIFQGRVEGEP
Ga0210407_1001210263300020579SoilMLCAGIFVAAAAVPDQLVGDAVGLIEGEDIAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHAQLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFEGAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCTCELQIAKAPALPATAPVMNVSPAPGKDAARTEIVANPGDAAASEKPMQKTEPIYTVDMPPLRFDATAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAVTPPPVVSAPAKPAAASPAQGSMVDRVRSFFHRLWSPSS
Ga0210403_10000551113300020580SoilLRVLLADYFRESVRRARLFAVVCVMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSILKSGGAVTVALDSGAIHARLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARGQAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLLVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPTNAAAASAQGSIVDRLRSFFRKLWTRGS
Ga0210403_1017105023300020580SoilVQLTSYLRESLRLARLFACVCAMLCAAVVFASASTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREILVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGQCKCELQIAKAPQVPPAVNSSPAGENGKIESVQSSSGIAGSGEKPEKKEEPIYTVVMPPLRFDASAKVQPEPDPQLMVLVRRVRVRPTLIFQGRVEGEPVATVAVAPQQPPVANVPAHAAPPPQASVFDRVRSFFHRLWSGGA
Ga0210395_100000101023300020582SoilVRLKIYLRELARRARLLAGVYGIVCAGLLVVAAAAPDQPAGDTVGLIEGQDIRVTGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210395_10000886203300020582SoilLRVLLKDYFRESVRRARLFAVVSVMIGAGFFVAAATTPDQPAGDAVGLIEGEDLAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNAPVNAAPAPAEGSVVDRVRSFFRKLWTRGS
Ga0210406_10004093163300021168SoilVAGEWLRVLLTDCVRESVRRARLFAVVCVMVCAGFFVAAATTPDQPVGDAVGLIDGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPRNEMPPTTVEGASGETARNSSDAAGAGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAAATTPQPPVRNAPANAAPAPPQGSVVDRVRSFFRKLWTRGS
Ga0210406_1004204443300021168SoilVAGESLRVLLTDYFRESVRRARLFAVVCMMVCAGFFVAAAIPPDQLVSDAVGLIDGENIAVTGPMSVEVVSGEVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGHCKCELQIAKAPVIPLPRNEMPPTTVESAKGETAGNSGDAAGTGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAVATPPPIASAPANAAPVPAQGSVVDRVRSFFRKLWTRGS
Ga0210406_1019593613300021168SoilAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHARLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCNCELQIAKAPAVPATAPVMNVSPAPSKDAARNEIVANPGEAAGASEKPIQKTEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAATPPPVVSAPAKPAAAPPAQGSMVDRVRSFFHRLWSRSS
Ga0210400_1023182023300021170SoilIEGEDIAITGPMSVEMVGGTAKMILRSGSDVRVKSGQARISLAEGGQISICGPAHLSVLKSGGAVTLALESGAIHARLEREPKLSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLTNGQIESLGNGAGHCKCELQIAKAPAMPTTAPVVNAAPPAANKEMTSSDAAASSGETTPASEKPAQKTEPIYTVDMPPLRFDANARVQPEPDPRLMVIVRRVRVRPTLIFQGHVEGETVATAVAAVPPPPVASAPAKAAPGQPGQGSVVDRVRTFFHKLWSRSN
Ga0210400_1054241713300021170SoilVAGESLRVLLTDYFRESVRRARLFAVVCMMVCAGFFVAAAIPPDQLVSDAVGLIDGENIAVTGPMSVEVVSGEVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFEGAGLMCVRTYRGAMRLEHQLSGQSVMVPQGADITLTNGQIESLVNGAGHCKCELQIAKAPAVPTTPPAVNFAPAGSKETASTDAAVNPGEATSANDKPAQKTEPIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGTVEGETVTAA
Ga0210405_1000103853300021171SoilLRVLVRNYRREFLRRAGVSAAACAMLCGGIFVAAAGVPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKAGGTVTVALESGAIHARLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCNCELQIAKAPAVPATAPVMNVSPAPSKDAARNEIVANPGEAAGASEKPIQKTEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAATPPVVSAPAKPAAAPPTQGSMVDRVRSFFHRLWSRSS
Ga0210405_10001337203300021171SoilMRGSVRRVRLLAAVCAVLCAGFLVVAAAKIAAGTAPDQPAGDTVGLIEGEDIRVTGPMSVEVVGGQTKTILRSGSDVLVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHAMLEREPALTVYTAEIQAQTVAIGDDPREVLVGFESPGMMCLRTYRGALRVEQQLTGRSVIVPQGVDVMLQNAQIDAMRNGAGQCKCELQIARSRTIPAPGIGTAARVGESGNGETAPRASEAPAGADEKSDKKEEPIYTVIMPPLRFDANAKVQAEPDPRLMVIVRRVRVRPTLVFQGRVEEETVATAAATAPPPPTVAAPKTATPAPAQGSVVDRMRSFFHRLWSRSG
Ga0210405_1000520753300021171SoilVLRKGYLPESMRRARLLAGVCAMVCGGIFVAGAAAPDQPVGDTVGLIEGEDISVTGPMSVEVVGGVAKTILRSGSDVRVKSGHARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIHARLETEPALSVYTAQIQGRPVAIGEEPRDFLVGFEGAGTMCVRTYRGAMRLEQQLSGLSVMVPQGGDVTLANGQIDALGNGAGRCKCELQIAKDVTVPAVPPPGSAMPGSTGEGVKSETAQNSNDAPGTAEKRSKNEEPIYTVDMPPLRFDASAKVQVEPDPRLMVIVRRVRVRPTLIFQGRVDGETVATAVATPAQPPAASAPGNAGPPPAQGSVVDRVRSFFRRLWTRAS
Ga0210408_1029469013300021178SoilLRVLLTDYFRESVRRARLFAVVCMMVCAGFFVAAAIPPDQLVSDAVGLIDGENIAVTGPMSVEVVSGEVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLNGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPRNDMPPTTVENASGETARNSSDAAGAGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAAATTPQPPVRNAPANAAPAPPQGSVVDRVRSFFRKLWTRGS
Ga0210396_1012634923300021180SoilMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210388_1002328553300021181SoilLLAGVCGIVCAGLLVVAAAAPDQPAGDTVGLIEGQDIRVTGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210393_100000032603300021401SoilVRLKIYLRELARRARLLAGVCGIVCAGLLVVAAAAPDQPAGDTVGLIEGQDIRVTGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210393_1002864713300021401SoilARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHAQLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFEGAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCTCELQIAKAPALPATAPVMNVSPAPGKDAARTEIVANPGDAAASEKPMQKTEPIYTVDMPPLRFDATAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAVTPPPVVSAPAKPAAASPAQGSMVDRVRSFFHRLWSPSS
Ga0210393_1037295913300021401SoilLKVLLKDYFRESVRRARLFAMVCVMIGAGFFVAAATTPDQPAGDTVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPSLSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCIRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGRCKCELQIAKAPAVPLPRNEMPPTTSESARSETARNSGDAGGAGEKPSKNEEPIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVR
Ga0210393_1050161513300021401SoilPDQPAGDAVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDRGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFENAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGRCKCELQVAQAPLVPLPRNEMPPTTVESARTETARNSGDTGEAGERASKEEEPIYTVDMPPLRFDASAMVQPEPDPRLMVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVRNAPANAAPAPAQGSVVDRVRSFFRKLWTRGT
Ga0210389_1002766733300021404SoilLRVLVRNYRREFLRRAGVSAAACAMLCGAIFVAAAAVPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHARLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCNCELQIAKAPAVPASAPVMNASPAPGKDAARNEIVANLGEAAGASEKPIQKTEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAATPPVVSAPAKAAAAPPTQGSMVDRVRSFFHRLWSRSS
Ga0210387_1000492923300021405SoilMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSILKSGGAVTVALDSGAIHARLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARGQAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLLVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPTNAAAASAQGSIVDRLRSFFRKLWTRGS
Ga0210387_1006124933300021405SoilMLCGAIFVAAAAVPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHARLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCNCELQIAKAPAVPATAPVMNVSPAPSKDAARNEIVANPGEAAGASEKPIQKTEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEVVAAATPPPVVSAPAKPAAAPPAQGSMVDRVRSFFHRLWSRSS
Ga0210383_10003578113300021407SoilVLLKDYFRESVRRARLFAMVCVMIGAGFFVAAATTPDQPAGDTVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPSLSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCIRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGRCKCELQIAKAPAVPLPRNEMPPTTSESARSETARNSGDAGGAGEKPSKNEEPIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETAATAAAATPPPAARNAPASPAQGSVVDRVRSFFRKLWTRGS
Ga0210383_1001002323300021407SoilLRVLLADYFRESVRRARLFAVVCVMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSILKSGGAVTVALDSGAIHARLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARGQAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLLVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPTNAAAASAQGSIVDRLRSFLRKLWTRGS
Ga0210394_1006629713300021420SoilMRGSVRRVRLLAAVCAVLCAGFLVVAAAKIAAGTAPDQPAGDTVGLIEGEDIRVTGPMSVEVVGGQTKTILRSGSDVLVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHAMLEREPALTVYTAEIQAQTVAIGDDPREVLVGFESPGMMCLRTYRGALRVEQQLTGRSVIVPQGVDVMLQNAQIDAMRNGAGQCKCELQIARSRTIPAPGIGTAARVGESGNGETAPHASEAPAGTDEKSDKKEEPIYTVIMPPLRFDANAKVQAEPDPRLMVIVRRVRVRPTLVFQGRVEEETVATAAATAPPPPTVAAPKTATPAPAQGSVVDRMRSFFHRLWSRSG
Ga0210384_1000511473300021432SoilVLLTDCFRESVRRARLFAVVCVVVCAGFFVAAATTPDQPVGDAVGLIDGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLNGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPRNDMPPTTVENASGETARNSSDAAGAGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAAATTPQPPVRNAPANAAPAPPQGSVVDRVRSFFRKLWTRGS
Ga0210384_1012613423300021432SoilMRRARIFAGVCAVAGAGVFVATTATADQPAGDSVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIRARLEGEPAISVYTPQLQGQPVAIGDEPRDFLVGFLNPGVMCVRTYRGAMRLEQQLSGQSVIVPQGGDVMLTNGQIDSLRNGAGHCHCELQIAKAPAVPLPKNEVPATAEESARGETAQSSGGAAGAETGEKPSKKEEPIFTVDMPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEAQPVTAAAVVTPAPPPAKTPAPTQGSVVDRVRSFFRRLWTRGG
Ga0210384_1030077513300021432SoilVLVRNYKREFLRRAGVSAAACAMLCGAIFVAAAAVPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGMAKTILRSGSDVRVKSGQARISLVEGGQISICGPARLSVLKAGGTVTVALESGAIHARLEQEPKLSVYTAQIQGQPVAIGDEPREFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIEALGNGAGHCNCELQIAKAPAVPATAPVMNVSPAPSKDAARNEIVANPGEAAGASEKPIQKTEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLI
Ga0210391_1002981323300021433SoilVLLKDYFRESVRRARLFAVVSVMIGAGFFVAAATTPDQPAGDAVGLIEGEDLAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGTVTVALDSGAIHALLEREPALSVYTAQIEGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNAPVNAAPAPAEGSVVDRVRSFFRKLWTRGS
Ga0210391_1005734633300021433SoilVCGIVCAGLLVVAAAAPDQPAGDTVGLIEGQDIRVTGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210391_1008654213300021433SoilGFWVVAAAKIAAGTAPDQPAGDTVGLIEGEDIRVTGPMSVEVVGGQTKTILRSGSDVLVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHAMLEREPALTVYTAEIQAQTVAIGDDPREVLVGFESPGMMCLRTYRGALRVEQQLTGRSVIVPQGVDVMLQNAQIDAMRNGAGQCKCELQIARSRTIPAPGIGTAARVGESGNGETAPHASEAPAGTDEKSDKKEEPIYTVIMPPLRFDANAKVQAEPDPRLMVIVRRVRVRPTLVFQGRVEEETVATAAATAPPPPTVAAPKTATPAPAQGSVVDRMRSFFHRLWSRSG
Ga0210391_1049837013300021433SoilVLLRVDLRGSVRRARLLAAVCAVVSTGFFVAAAARTSIAAKNATVATPDQPAGDSVGLIEGEDIAVTGPMSVEVVGGQTKTVLRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGALTVALESGSIHAMVEREPALTVYTAEIQAQTLAIGEEPREIVVGFESPGMMCIRTYRGALRVEQQLSGRSVIVPQGVDVMLANAQIDAMRNGTGQCKCELQIAKSRTIPAPGTTAAVHAGETGNSEAAPNTSDAPASGEKTEKREEPIYTVIMPPLMFDAKAKVQAEPDPRLMMIVRRVRVRPSL
Ga0210398_1000577793300021477SoilIYLRELARRARLLAGVCGIVCAGLLVVAAAAPDQPAGDTVGLIEGQDIRVTGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISVCGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLMGFPSPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEEAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRRVRVRPALVFQGRVEEEATATAVATKAPPKPPAASAAAKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0210398_1008178023300021477SoilMVCVMIGAGFFVAAATTPDQPAGDTVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPSLSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCIRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGRCKCELQIAKAPAVPLPRNEMPPTTSESARSETARNSGDAGGAGEKPSKNEEPIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETAATAAAATPPPAARNAPASPAQGSVVDRVRSFFRKLWTRGS
Ga0210402_1000556443300021478SoilVQLTSYLRESLRLARLFACVCAMLCAAVVFAAGSTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREILVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGQCKCELQIAKAPQVPPAVNSSPAGENAKSESAPNASGTAGSGEKSEKKEEPIYTVIMPPLRFDASTKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEPVATVAAAPSQPPVTNVPAHAAPPPQASVFDRVRSFFHRLWSGGA
Ga0210402_1006491123300021478SoilLRVLLADYFRESVRRARLFAVVCVMIGAGFFVAEATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTHRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARGETARNSGDAGEAGERASQEEEPIYTVDMPPLRFDASAKVQPEPDPRLLVIVRRVRVRPTLLFQGRVEGETVTTPVAATPQPPVRNAPANAAPAPAQGSMVDRVRSFFRKLWTRGS
Ga0210402_1012542323300021478SoilMLCAAVVFASGSTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREILVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGQCKCELQIAKAPQVPPAVNSSPAGENGKIESVQSSSGIAGSGEKPEKKEEPIYTVVMPPLRFDASAKVQPEPDPQLMVLVRRVRVRPTLIFQGRVEGEPVATVAVAPQQPPVANVPAHAAPPPQASVFDRVRSFFHRLWSGGA
Ga0210410_1000069693300021479SoilVLLTDYFRESVRRARLFAVVCVMVCAGFFVAAATPPDQLVSDAVGLIDGEDIAVTGPMSVEVVSGEVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGHCKCELQIAKAPVIPLPRNEMPPTTVESAKGETAGNSGDAAGTGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAVATPPPVASAPANAAPVPAQGSVVDRVRSFFRKLWTRGS
Ga0210409_1003074623300021559SoilLRVLLTDYFRESVRRARLFAVVCVMVCAGFFVAAATPPDQLVSDAVGLIDGEDIAVTGPMSVEVVSGEVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGAGHCKCELQIAKAPVIPLPRNEMPPTTVESAKGETAGNSGDAAGTGEKPSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVTTAVATPPPVASAPANAAPVPAQGSVVDRVRSFFRKLWTRGS
Ga0212123_1000150173300022557Iron-Sulfur Acid SpringLRVLLTIYLRESVRHAWIFAAVCAMVFAGIFVAGAAAPDQPVGDAVGLIEGEDIAITGPMSVEVVGGVAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSSGAVTVALESGAIHARLEREPALSVYTAQIQGQPVAIGDEPREFLVGFEGAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDTLGNGAGHCKCELQIAKAPVVPLPRNEMPAATVESAKSDTTQSSGDAAGTGVGEKLSKKEGPIYTVDTPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEVVAAAAVPPPPVVSAPANAAPAPTQGSVVDRVRSFFRKLWTRGG
Ga0247694_1000088203300024178SoilVLLTVYLRAPVGLVRLFFCVSGMLCAAALVAAMAASDQPLGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHARLEREPALTVYTAQIQAQTVAIGDDPREILVGFENSGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVMLANGQIDTLRNAAGHCTCELQIAKAPVVPKPGSAISSRSGAEETARSETPQNTGGSPAPAEKPGTKEEPIYEVYPPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEPAATAAAIAPLQPPVASAPARVAPPPQSSVFDRMRSFIHRLWSRGA
Ga0247695_100589223300024179SoilVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHAPAPPEASVFDRVRSFFRRLW
Ga0247669_1000936103300024182SoilMLCAAALVAAMAASDQPLGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHARLEREPALTVYTAQIQAQTVAIGDDPREILVGFENSGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVMLANGQIDTLRNAAGHCTCELQIAKAPVVPKPGSAISSRSGAEETARSETPQNTGGSPAPAEKPGTKEEPIYEVYPPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEPAATAAAIAPLQPPVASAPARVAPPPQSSVFDRMRSFIHRLWSRGA
Ga0247669_100093793300024182SoilMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHAPAPPEASVFDRVRSFFRRLW
Ga0247680_1000015643300024246SoilVLLTVYLRAPVGLVRLFFCVSGMLCAAALVAAMAASDQPLGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIHARLEREPALTVYTAQIQAQTVAIGDDPREILVGFENSGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVMLANGQIDTLRNAAGHCTCELQIAKAPVVPKPGSAISSRSGAEETARSEAPQNTGGSPAPAEKPGTKEEPIYEVYPPPLRFDATAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEPAATAAAIAPLQPPVASAPARVAPPPQSSVFDRMRSFIHRLWSRGA
Ga0224564_100926713300024271SoilVLLADYFRESVRRARLFAVVCMMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSILKSGGAVTVALDSGAIHARLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARSEAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLMVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPANAAAASAQGSIVDRIRSFFRKLWTRGN
Ga0247667_100445433300024290SoilVLLRVHERELPRRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVAGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDAREILVGFESPGTMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAAPMPESAIPARSEEGEIAKSEPAQNASNSAALAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPPQPSVASAPVHAPAPPEASVLDRVRSFFRRLWSRGG
Ga0247668_104487913300024331SoilVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILASGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLM
Ga0207653_1001334323300025885Corn, Switchgrass And Miscanthus RhizosphereVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW
Ga0207684_1003192323300025910Corn, Switchgrass And Miscanthus RhizosphereMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW
Ga0207660_1004724713300025917Corn RhizosphereGPVSQRSSMRALRNQVDVAAILQVIKEARSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATPRQPPVASAPVHALAPPEASVFDRVRSFFRRLW
Ga0209736_100077443300027660Forest SoilMLCAGIFVAAAAAPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGVAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGAVTVALESGAIHARLEREPALSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDTLGNGTGHCKCELQIAKSATVPMPRNEMPAMTVESAKSETTQKSGDSAGTGSGEKLSKKEEPIYTVNMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGEPVATAAATPPPPIAVAPTNTAPAPQGSVVDRVRSFFRRLWSRGA
Ga0209580_1003684713300027842Surface SoilVLLESYLRESVRRVGILAAACAMVCAGVFVAATATADQPGGDAVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGSVTVALESGAIRARLESEPAMSVYTPQIQGQPVAIGDEPREFLVGFENAGVMCVRTFRGAMRLENQLSGQSVIVPQGGDVMLTNGQIESLRNGAGHCNCELQIVKAPAARLPRNEMPAAALENARGETPQNSADSAGTGSAVRSRTASATATEEKPSKKEEPIYTVDMPPLRFNASAKVQPAPDPRLMVIVRRVRVRPTLIFQGRVEGESVTTA
Ga0209274_1005660713300027853SoilVRLKIYLREWAGRARLLAGVCGIVSAGLLAAAAAAPDQPAGDTVGLIEGQDIRVSGPMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISICGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVILANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEAAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRQVRVRPALVFQGRVEGEATATATAVVRTAPPKPPAASAAEKKTTPAEGSVVDRMRSFFRKLWSR
Ga0209274_1010531523300027853SoilLRVLLKDYFRESVRRARLFAVVSVMIGAGFFVAAATTPDQPAGDAVGLIEGEDLAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNA
Ga0209693_1002177033300027855SoilMSVEVVAGQTKTILRSGSDVQVKSGQARISLVEGGQISICGPAHFSVLKSGGALTVALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVILANGQIDEMRNGAGQCKCELQIAKIPAAPAGGSDAAASSSEAAADEKPEAREEPIYTVIMPPLEFDAKAKVQPEPDMRLMTIVRQVRVRPALVFQGRVEGEATATATAVVRTAPPKPPAASAAEKKTTPAEGSVVDRMRSFFRKLWSRGG
Ga0209166_1000497333300027857Surface SoilVLLREHERELPRRGRIFAIVGAMLWVAVLVAVGAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPELTVYTAQIQAQTVAIGDDPREILVGFESPGLMCVRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAAPRSGNAIPTPSEAGETAKDEPAQNAVGSPATAEKPATKEEPIYEVYPPPLRFDASAKVQPEPDPQLMVIVRRVRVRPTLIFQGRVEGEPVAAAAAAAPPQPPVASTPARVAQPPQASVFDRVRSFFRRLWSRGG
Ga0209380_1000325393300027889SoilMVCVMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVAGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHALLEREPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGGGRCKCELQIAKAPVVPLPKNEMPPTTAESARSETARNSGDAGGASEQPSKNEELIYTVDMPPLRFDANAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEGETVATAAAATPQPAVRNAPVNAAPAPAEGSVVDRVRSFFRKLWTRGS
Ga0209380_1001820423300027889SoilMKLKTYLPRLARRARLSAAVCGMVCAGLLVAAAAPDQPAGDAVGLIEGQDIRVTGPMSVEVVDGQTKTILRSGSDVQVKSGQARISLQEGGQISICGPAHFSVLKSAGALTLALESGAIHARVDREPALTVYTAEIQAQTMAIGDDPRDVLVGFASPGMMCIHTYRGALRVEQQLSGRSVIVPQGVDVMLANGQIEEMRNGAGQCKCELQIAKIPAAPAGGSEAAANSNDTAVSEKPEAKEEPIYTVIMPPLEFDAKAKVQPEPDMQLMTIVRRVRVRPALVFQGRVEGEPAATSAATTAPPKPPVASAPAKKAAPAEGSVVDRMRSFFRKLWTRGG
Ga0209006_1005195833300027908Forest SoilLKELFKTCLRESRRRTRILAGVCAMLCAGIFVAAADAPDQPVGDTVGLIEGEDIAITGPMSVEMVGGTAKMILRSGSDVRVKSGQARISLAEGGQISICGPAHLSVLKSGGAVTLALESGAIHARLEREPKLSVYTAQIQGQPVAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLTNGQIESLGNGAGHCKCELQIAKAPAMPTTAPVVNAAPPAANKEMTSSDAAASSGETTPASEKPAQKTEPIYTVDMPPLRFDANARVQPEPDPRLMVIVRRVRVRPTLIFQGHVEGETVATAVAAVPPPPVASAPAKAAPAQPGQGSVVDRVRTFFHKLWSRSN
Ga0308309_1000326783300028906SoilMVCAGIFVAAADTPDQPVRDAVGLIEGEDIAVTGPMSVEVVGGQTKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGTVTVALESGAIRARLEREPALSVYTAQIQGQPIAIGDEPRDFLVGFESAGVMCVRTYRGAMRLEQQLSGQSVMVPQGGDVMLANGQIDALGNGSGHCTCELQIAKAPAVPLPRNEMPARTVESSKEGTVQNSGDASAAGEKSSKKEEPIYTVDMPPLRFDASAKVQAEPDPRLMVIVRRVRVRPTLIFQGRVEGETVAVAAAIPPQAPAAPARVTPAAPPSQGSVIDRVRSFFRNLWTRGG
Ga0308309_1019103413300028906SoilMVCAGFFVAAATTRDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEQEPALSVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARGQAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLLVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPTNAAAASAQGSIVDRLRSFLRKLWTRGS
Ga0308309_1051330913300028906SoilVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIRARLEGEPAISVYTPQLQGQPVAIGDEPRDFLVGFLNPGVMCVRTFRGAMRLEHQLSGQSVIVPQGGDVMLTNGQIDSLRNGAGHCNCELQIAKAPAIPLPKNEVPATAVESAKGQTAESSGGAAGAETVEKPSKKEEPIFTVDMPPLRFDASAKVQPEPDPRLMVIVRRVRVRPTLIFQGRVEAQPATAAAVVTPAPPVSAPAKTPAPTQGSVVDRVRSFFRRLWTRGA
Ga0222749_1020837313300029636SoilMRRARIFAGVCAVAGAGVFVAITATADQPAGDSVGLIEGEDIAVTGPMSVEVVGGTAKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALESGAIRARLEGEPAISVYTPQLQGQPVAIGDEPRDFLVGFLNPGVMCVRTYRGAMRLEQQLSGQSVIVPQGGDVMLTNGQIDSLRNGAGHCHCELQIAKAPAVPLPKNEVPATAEESARGETAQSSGGAAGAETGEKPSKKEEPIFTVDMPPLRFDASAKVLPEPDARLVVIVRRVRVRPTLIFQGRVEAQPVTAAAVVTPAPPPAKTPAPTQGSVVDRVR
Ga0265740_100916213300030940SoilAKTATLAASDQQAGDSVGLIEGEDIAVTGPMTVEVVGGLTKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSAGALTVALESGSIHALVEREPALTVYTAEIQAQTLAIGEEPREILVGFESPGMMCIHTFRGALRVEQQLSGRSVIVPQGVDVMLANAQIDAMRNGTGQCKCDLPIAKSRTIPAPGTTAAVRAGETGNSEAAPNSSEAQATDEKAEKKEEPIYTVIMPPLLFDAKAKVQTEPDPRLMVIVRRVRVRPSLVFQGRVEGEPIATAVATPPQPPAP
Ga0170834_10374717313300031057Forest SoilMLCAAVSFASGSTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREMLVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGQCKCELQIAKAPRVPPAVNSSPAGETAKSESAPNASGTAGSGEKTEKKEEPIYTVIMPPLRFDASAKVQPEPDPQLMVLVRRVRVRPTLIFQGRVE
Ga0170823_1015201413300031128Forest SoilLAGEQLRVHLTSYLRESARPARLFGCVCAMLCAAVSFASGSTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREMLVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGQCKCELQIAKAPRVPPAVNSSPAGETAKSESAPNASGTAGSGEKTEKKEEPIYTVIMPPLRFDASAKVQPEPDPQLMVLVRRVRVRPTLIFQGRVE
Ga0310686_10104597323300031708SoilMVSAMVCAGFFVAAAATPDQPVGDTVGLIEGEDIAVTGPMSVEVVGGLVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIHARLEREPALNVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVLVPQGGDVTLANGQIDAPGIGAGHCKCELQIAKAPVVPLPRNEMPPTTGENAKGETVRNSGDAAGAGEKSSKKEEPIYTVDMPPLRFDASAKVQPEPDPRLMVMVRRVRVRPTLIFQGRVEGETVTTAVATPPQPPAASVPASAAPAPAQGSVVDRVRSFFRKLWTRGS
Ga0310686_10116188323300031708SoilMMIGAGFFVAAATTPDQPVGDAVGLIEGEDIAVTGPMSVEVVGGQAKTILRSGSDVRVKSGEARISLVEGGQISICGPAHLSILKSGGAVTVALDSGAIHARLEREPALTVYTAQIQGQPVAIGDEPREFLVGFESAGIMCVRTYRGAMRLEQQLSGQSVMVPQGGDVTLANGQIDALGNGTGRCKCELQIAKAPVVPLPRNEMPPTTVESARSEAARNSGDAGETGEKASKEEEPIYTVDMPPLRFDANAKAQPEPDPRLMVIVRRVRVRPTLLFQGRVEGETVTTAVAATPQPPVVNAPANAAAASAQGSIVDRIRSFFRKLWTRGN
Ga0310686_11068255023300031708SoilVLGRVYLRGPVRRARLLGGVCAVLCAGLFVAAAAKTATVATSDQPAGDAVGLIEGEDIAVTGPMSVEVVGGQTKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGSIHALVEREPALTVYTAEIQAQTVAIGDQPREILVGFESPGMMCIRTYRGALRVEQQLSGRSVIVPQGVDVMLANAQIDAMRNGTGQCKCELQIAKSRTIPAPRDTTAARVGESGNSKAAPNASDAPATEEKTEKKEQPIYTVIMPPLRFDANTKVQAEPDPRLMLIVRRVRVRPTLVFQGRVEEETVATAAAMPPPPPVASAPAKAEAAPQGSVVDRVRSFFRKLWTRGG
Ga0307474_1019556023300031718Hardwood Forest SoilSGSDVRVKSGQARISLVEGGQISICGPAHLSVLKSGGAVTVALDSGAIRARLEGAPAMNVYTPQIQGQPVAIGDEPRDFLVGFENPGVMCVRTFRGAMRLEQQLGGQSVIVPQGGDVVLTNGQIDSLRNGAGHCNCELQIAKAPAVPLPKNEPTPRVVESAKGETTDAAGSATVEKPSKKEEPIFTVDTPPLRFDASAKVQPEPDLRLMVIVRRVRVRPTLIFQGRVEGEPVTAAAVAPPAPPVSAAPAKAPAQTEGSVVDRVRSFFRRLWTRGA
Ga0307469_1022258723300031720Hardwood Forest SoilLRVQFTSYLRESLRLARLFACVCAMLCAAVVFASASTPDQPVGDAVGVIEGEDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTLALESGVIHARMEREPALTFYTAQIQAQTVAIGDEPREILVGFDSPGMMCIRTYRGAMRLEQQFSSQSIMVPQGGDLMLANGHIDTLRGGTGLCKCELQIAKAPQVPPAVNSSPAGENAKSELAQNASGTAGPGEKPEKKEEPIYTVIMPPLRFDASTKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEPVATVAAAPQQPPVANVPAHTAPPPQASVFDRVRSFFHRLWSGGA
Ga0335085_1000728473300032770SoilMHEREVRWRVRLLGTVCALVLAATFVAVAGVPDQPSGDAVGVIEGEDINVTGPMSVDVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKAGGALTVALESGTIHARVLQTPALTFYTAQIQAQTVAIGDDPREILVGFENPGLMCIRTYRGAIRVEQQLSGQSVMVPQGEDVTLVNGQIDAMGSGAGHCNCELQVAKAAPMPRISGGEQPSARGSTNTQEETTEKGQVLSGEANTATSRAEKKEEPIYQVDMPPLTFDAKARVQPAPNPQLMAIVRRVRVRPTLIFHGSVEEEPVAKAVTPPPPIAATPKKVDAAPTRGSVMERVRSFFRRLWS
Ga0335072_1035183323300032898SoilVCAMVCAAVFFAAAGTPDQATGDAVGVIEGEDISVTGPVSMDVVGGQTKTILRSGSDVRVKSGQARISLVEGGQISICGPTHFSVLKSGGSLTVALESGAIHALLEQQPALTVYTAQIQAQTLAIGDDPREILVGFENPGLMCIRTYRGAIRVEQQLSGRSVIVPQGEDVLLADGQIDAMGNGAGHCNCELQMAKTSTAALPKSVAPPTAQERAIAETTPNSSEAAVGAPSSSEKPEAKEEPIYQVDMPPLQFDASAKVQPEPDPALMAVVRRIRVRPALIFQGRVEDAPVAVAAAASPKPPVAIAPPKTMPAQGSLLNRMRSYLKRLWTRSG
Ga0310810_1058241313300033412SoilVGDELRVLLREHERELARRARFFGVVGAMLWVAVLVAVAAASDQPAGDAVGVIEGQDIAVTGPMSVEVVGGQVKTILRSGSDVRVKSGQARISLVEGGQISICGPAHFSVLKSGGSLTVALESGAIRARLEREPALTVYTAQIQAQSVAIGDDPREILVGFESPGMMCIRTYRGAMRLEQQLSGQSVMVPQGGDVILANGQIDTLRNGAGHCNCELQIAKAPAVPRPESAIPARSEAGETAKNEPAQNASNSEAPAEKPAAKEEPIYEVYPPPLRFDASAKVQPEPDARLMVIVRRVRVRPTLIFQGRVEGEAVATAAATP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.