NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F069972

Overview

Basic Information
Family ID F069972
Family Type Metagenome
Number of Sequences 123
Average Sequence Length 154 residues
Representative Sequence MSADNGDELTRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGSDLQTPLAELQTHVVLEGLGRFVTVALKAAIPRRSDCGP
Number of Associated Samples 102
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 61.79 %
% of genes near scaffold ends (potentially truncated) 43.90 %
% of genes from short scaffolds (< 2000 bps) 89.43 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66

Most Common Taxonomy
Group Unclassified (52.033 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (22.764 % of family members)
Environment Ontology (ENVO) Unclassified (33.333 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.650 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 42.62%, β-sheet: 18.58%, Coil/Unstructured: 38.80%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 123 Family Scaffolds
PF01925	TauE	7.32
PF04199	Cyclase	5.69
PF03795	YCII	5.69
PF13460	NAD_binding_10	3.25
PF00069	Pkinase	3.25
PF13628	DUF4142	2.44
PF05154	TM2	1.63
PF01638	HxlR	1.63
PF05163	DinB	1.63
PF08264	Anticodon_1	1.63
PF13527	Acetyltransf_9	1.63
PF00072	Response_reg	0.81
PF00106	adh_short	0.81
PF11196	DUF2834	0.81
PF00392	GntR	0.81
PF13603	tRNA-synt_1_2	0.81
PF00924	MS_channel	0.81
PF03706	LPG_synthase_TM	0.81
PF14534	DUF4440	0.81
PF13540	RCC1_2	0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 123 Family Scaffolds
COG0515	Serine/threonine protein kinase	Signal transduction mechanisms [T]	13.01
COG0730	Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 family	Inorganic ion transport and metabolism [P]	7.32
COG1878	Kynurenine formamidase	Amino acid transport and metabolism [E]	5.69
COG2350	YciI superfamily enzyme, includes 5-CHQ dehydrochlorinase, contains active-site pHis	Secondary metabolites biosynthesis, transport and catabolism [Q]	5.69
COG1733	DNA-binding transcriptional regulator, HxlR family	Transcription [K]	1.63
COG2314	Uncharacterized membrane protein YozV, TM2 domain, contains pTyr	General function prediction only [R]	1.63
COG2318	Bacillithiol/mycothiol S-transferase BstA/DinB, DinB/YfiT family (unrelated to E. coli DinB)	Secondary metabolites biosynthesis, transport and catabolism [Q]	1.63
COG0392	Predicted membrane flippase AglD2/YbhN, UPF0104 family	Cell wall/membrane/envelope biogenesis [M]	0.81
COG0668	Small-conductance mechanosensitive channel	Cell wall/membrane/envelope biogenesis [M]	0.81
COG3264	Small-conductance mechanosensitive channel MscK	Cell wall/membrane/envelope biogenesis [M]	0.81
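
The frequency columns in the Pfam and COG tables above are percentages of the 123 family scaffolds, so each value can be converted back to an approximate scaffold count. The following is a minimal sketch, assuming the percentages were rounded to two decimals from whole-number counts; the function name `approx_scaffold_count` is hypothetical and not part of NMPFamsDB.

```python
# Total taken from "Number of Associated Scaffolds" in Basic Information.
FAMILY_SCAFFOLDS = 123


def approx_scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent * total / 100)


# A few values copied from the tables above.
examples = [
    ("PF01925 TauE", 7.32),
    ("COG0515 Serine/threonine protein kinase", 13.01),
    ("PF00072 Response_reg", 0.81),
]

for label, pct in examples:
    print(f"{label}: ~{approx_scaffold_count(pct)} of {FAMILY_SCAFFOLDS} scaffolds")
```

Because the published values are rounded, the result is only an estimate of the underlying count.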


Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	52.03 %
All Organisms	root	All Organisms	47.97 %

Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300000363|ICChiseqgaiiFebDRAFT_14096379	Not Available	860
3300000956|JGI10216J12902_108426434	Not Available	751
3300000956|JGI10216J12902_109502749	All Organisms → cellular organisms → Bacteria	1573
3300003321|soilH1_10090705	All Organisms → cellular organisms → Bacteria	4067
3300003324|soilH2_10032928	All Organisms → cellular organisms → Bacteria	8246
3300003990|Ga0055455_10116445	Not Available	798
3300004156|Ga0062589_100777996	All Organisms → cellular organisms → Bacteria	863
3300004156|Ga0062589_101778475	Not Available	617
3300004157|Ga0062590_100155042	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	1577
3300004479|Ga0062595_100341044	All Organisms → cellular organisms → Bacteria	1037
3300004480|Ga0062592_100520103	All Organisms → cellular organisms → Bacteria	989
3300004643|Ga0062591_100068199	All Organisms → cellular organisms → Bacteria	2136
3300004778|Ga0062383_10304899	Not Available	765
3300004779|Ga0062380_10131146	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium	962
3300004808|Ga0062381_10400214	Not Available	526
3300005093|Ga0062594_101256465	All Organisms → cellular organisms → Bacteria	739
3300005178|Ga0066688_10675741	Not Available	659
3300005181|Ga0066678_10223896	All Organisms → cellular organisms → Bacteria	1209
3300005332|Ga0066388_106326249	Not Available	597
3300005434|Ga0070709_11267427	Not Available	594
3300005444|Ga0070694_100345088	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	1152
3300005445|Ga0070708_100095743	All Organisms → cellular organisms → Bacteria	2711
3300005445|Ga0070708_100208298	All Organisms → cellular organisms → Bacteria	1832
3300005455|Ga0070663_101119165	Not Available	689
3300005467|Ga0070706_101167535	Not Available	708
3300005518|Ga0070699_100161845	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	1981
3300005518|Ga0070699_101140161	Not Available	715
3300005536|Ga0070697_100351487	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	1273
3300005577|Ga0068857_102327417	Not Available	526
3300005577|Ga0068857_102410119	Not Available	517
3300005598|Ga0066706_10190077	All Organisms → cellular organisms → Bacteria	1572
3300005618|Ga0068864_100631458	Not Available	1042
3300006755|Ga0079222_10805422	All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiales genera incertae sedis → Thiomonas	768
3300006845|Ga0075421_100051439	All Organisms → cellular organisms → Bacteria	5213
3300006853|Ga0075420_100355510	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Deinococcales → Deinococcaceae → Deinococcus → Deinococcus peraridilitoris → Deinococcus peraridilitoris DSM 19664	1268
3300006853|Ga0075420_101931803	Not Available	504
3300006871|Ga0075434_102476082	Not Available	520
3300009012|Ga0066710_101519397	Not Available	1032
3300009156|Ga0111538_11200193	All Organisms → cellular organisms → Bacteria	959
3300010166|Ga0126306_10112564	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes	1982
3300010166|Ga0126306_10333942	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium	1174
3300010400|Ga0134122_12054643	Not Available	611
3300011444|Ga0137463_1094697	Not Available	1123
3300012208|Ga0137376_10227087	All Organisms → cellular organisms → Bacteria	1617
3300012211|Ga0137377_10351326	All Organisms → cellular organisms → Bacteria	1411
3300012353|Ga0137367_10906186	Not Available	607
3300012360|Ga0137375_10990866	All Organisms → cellular organisms → Bacteria	662
3300012361|Ga0137360_10951175	Not Available	741
3300012362|Ga0137361_11495035	Not Available	597
3300012469|Ga0150984_100239257	Not Available	581
3300012917|Ga0137395_10134891	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	1677
3300012923|Ga0137359_10224579	All Organisms → cellular organisms → Bacteria	1677
3300012929|Ga0137404_10413829	Not Available	1191
3300012929|Ga0137404_11816403	Not Available	567
3300012930|Ga0137407_10189713	All Organisms → cellular organisms → Bacteria	1838
3300012930|Ga0137407_10375278	All Organisms → cellular organisms → Bacteria	1313
3300012941|Ga0162652_100084797	Not Available	554
3300012955|Ga0164298_10476610	Not Available	828
3300012955|Ga0164298_10741000	Not Available	695
3300012955|Ga0164298_11202417	Not Available	574
3300012957|Ga0164303_10743952	Not Available	667
3300012957|Ga0164303_10994806	Not Available	596
3300012958|Ga0164299_10074523	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	1680
3300012960|Ga0164301_10113718	All Organisms → cellular organisms → Bacteria	1583
3300012961|Ga0164302_10352634	Not Available	984
3300012985|Ga0164308_10674877	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	888
3300012986|Ga0164304_10226266	All Organisms → cellular organisms → Bacteria	1238
3300012986|Ga0164304_10266181	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	1157
3300012986|Ga0164304_10279574	All Organisms → cellular organisms → Bacteria	1133
3300012986|Ga0164304_10353893	All Organisms → cellular organisms → Bacteria	1027
3300012986|Ga0164304_11451442	Not Available	565
3300012989|Ga0164305_11558327	Not Available	588
3300015264|Ga0137403_10099933	All Organisms → cellular organisms → Bacteria	2910
3300015371|Ga0132258_10125332	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	6113
3300015371|Ga0132258_12614579	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae	1261
3300015371|Ga0132258_12884064	Not Available	1195
3300015374|Ga0132255_104584182	Not Available	585
3300018000|Ga0184604_10262082	Not Available	606
3300018027|Ga0184605_10189517	All Organisms → cellular organisms → Bacteria	932
3300018071|Ga0184618_10008235	All Organisms → cellular organisms → Bacteria	3048
3300018429|Ga0190272_10514569	All Organisms → cellular organisms → Bacteria	1021
3300019789|Ga0137408_1143585	Not Available	519
3300019889|Ga0193743_1230672	Not Available	558
3300021078|Ga0210381_10125431	All Organisms → cellular organisms → Bacteria	854
3300021080|Ga0210382_10186807	Not Available	897
3300021090|Ga0210377_10114427	All Organisms → cellular organisms → Bacteria	1790
3300025552|Ga0210142_1001564	All Organisms → cellular organisms → Bacteria	4545
3300025556|Ga0210120_1108613	Not Available	554
3300025906|Ga0207699_10674277	Not Available	756
3300025910|Ga0207684_10195559	All Organisms → cellular organisms → Bacteria → Proteobacteria	1745
3300025912|Ga0207707_10291994	All Organisms → cellular organisms → Bacteria → Proteobacteria	1410
3300025916|Ga0207663_11308387	Not Available	584
3300026067|Ga0207678_10085048	All Organisms → cellular organisms → Bacteria	2704
3300026088|Ga0207641_10251668	All Organisms → cellular organisms → Bacteria	1650
3300026088|Ga0207641_12297751	Not Available	539
3300026095|Ga0207676_10407807	Not Available	1272
3300026095|Ga0207676_10426088	Not Available	1245
3300026095|Ga0207676_10475468	Not Available	1182
3300026116|Ga0207674_11554053	Not Available	630
3300026377|Ga0257171_1040993	All Organisms → cellular organisms → Bacteria	797
3300026497|Ga0257164_1092870	Not Available	518
3300027787|Ga0209074_10552324	Not Available	507
3300027909|Ga0209382_10014293	All Organisms → cellular organisms → Bacteria	9751
3300028380|Ga0268265_10781854	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	929
3300028717|Ga0307298_10064382	All Organisms → cellular organisms → Bacteria	1017
3300028771|Ga0307320_10201154	Not Available	778
3300028792|Ga0307504_10405681	Not Available	537
3300028796|Ga0307287_10056226	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium	1453
3300028812|Ga0247825_10051464	All Organisms → cellular organisms → Bacteria	2736
3300028824|Ga0307310_10114828	All Organisms → cellular organisms → Bacteria	1212
3300028828|Ga0307312_11188060	Not Available	504
3300028881|Ga0307277_10101588	Not Available	1220
3300028885|Ga0307304_10425903	Not Available	602
(restricted) 3300031197|Ga0255310_10172925	Not Available	599
3300031455|Ga0307505_10611316	Not Available	530
3300031716|Ga0310813_11728194	Not Available	586
3300032002|Ga0307416_100060887	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	3077
3300032005|Ga0307411_10618440	Not Available	934
3300032126|Ga0307415_100544666	Not Available	1023
3300032205|Ga0307472_101090224	Not Available	755
3300032421|Ga0310812_10039998	All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium	1764
3300034817|Ga0373948_0032718	Not Available	1056
3300034820|Ga0373959_0186409	Not Available	542

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
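
Each scaffold identifier in the table above appears to combine a numeric prefix and a scaffold name separated by `|`. In the rows listed on this page, the prefix matches a Taxon OID in the Associated Samples table. A minimal sketch of splitting an identifier on that assumption follows. The names `ScaffoldRef` and `parse_scaffold_id` are hypothetical, and the handling of the `(restricted)` marker is only a guess at how such rows could be treated.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # assumed to be the Taxon OID of the source sample
    scaffold_name: str  # scaffold name within that dataset
    restricted: bool    # True if the row carried a "(restricted)" marker


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts."""
    restricted = "(restricted)" in raw
    cleaned = raw.replace("(restricted)", "").strip()
    taxon_oid, sep, scaffold_name = cleaned.partition("|")
    # Reject anything that does not look like '<digits>|<name>'.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


ref = parse_scaffold_id("3300000363|ICChiseqgaiiFebDRAFT_14096379")
print(ref.taxon_oid, ref.scaffold_name, ref.restricted)
```

Grouping the parsed `taxon_oid` values would then give the number of family scaffolds contributed by each sample.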



Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	22.76%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	11.38%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	8.94%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	5.69%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	5.69%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	4.88%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	4.88%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	3.25%
Groundwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment	2.44%
Wetland Sediment	Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment	2.44%
Natural And Restored Wetlands	Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands	2.44%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	2.44%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	2.44%
Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere	2.44%
Serpentine Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil	1.63%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.63%
Sugarcane Root And Bulk Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil	1.63%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	1.63%
Rhizosphere Soil	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil	1.63%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere	1.63%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	0.81%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.81%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	0.81%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	0.81%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	0.81%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	0.81%
Corn Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere	0.81%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.81%
Soil	Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil	0.81%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere	0.81%

Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
3300000363	Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil	Environmental
3300000956	Soil microbial communities from Great Prairies - Kansas, Native Prairie soil	Environmental
3300003321	Sugarcane bulk soil Sample H1	Environmental
3300003324	Sugarcane bulk soil Sample H2	Environmental
3300003990	Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragC_D2	Environmental
3300004156	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1	Environmental
3300004157	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2	Environmental
3300004479	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs	Environmental
3300004480	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4	Environmental
3300004643	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3	Environmental
3300004778	Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare3Fresh	Environmental
3300004779	Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare3Fresh	Environmental
3300004808	Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh	Environmental
3300005093	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks	Environmental
3300005178	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005434	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG	Environmental
3300005444	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005455	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG	Host-Associated
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005577	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2	Host-Associated
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300005618	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2	Host-Associated
3300006755	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter	Environmental
3300006845	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5	Host-Associated
3300006853	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4	Host-Associated
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009156	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)	Host-Associated
3300010166	Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27	Environmental
3300010400	Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2	Environmental
3300011444	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2	Environmental
3300012208	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG	Environmental
3300012211	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG	Environmental
3300012353	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG	Environmental
3300012360	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG	Environmental
3300012361	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG	Environmental
3300012362	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG	Environmental
3300012469	Combined assembly of Soil carbon rhizosphere	Host-Associated
3300012917	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012930	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)	Environmental
3300012941	Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t4i015	Environmental
3300012955	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG	Environmental
3300012957	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG	Environmental
3300012958	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG	Environmental
3300012960	Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG	Environmental
3300012961	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG	Environmental
3300012985	Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG	Environmental
3300012986	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG	Environmental
3300012989	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG	Environmental
3300015264	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300015374	Col-0 rhizosphere combined assembly	Host-Associated
3300018000	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex	Environmental
3300018027	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex	Environmental
3300018071	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1	Environmental
3300018429	Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T	Environmental
3300019789	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)	Environmental
3300019889	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2	Environmental
3300021078	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redo	Environmental
3300021080	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo	Environmental
3300021090	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo	Environmental
3300025552	Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragC_D2 (SPAdes)	Environmental
3300025556	Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragA_D2 (SPAdes)	Environmental
3300025906	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)	Environmental
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025912	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)	Environmental
3300025916	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)	Environmental
3300026067	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG (SPAdes)	Host-Associated
3300026088	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)	Host-Associated
3300026095	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)	Host-Associated
3300026116	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes)	Host-Associated
3300026377	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-B	Environmental
3300026497	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B	Environmental
3300027787	Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)	Environmental
3300027909	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)	Host-Associated
3300028380	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)	Host-Associated
3300028717	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158	Environmental
3300028771	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369	Environmental
3300028792	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S	Environmental
3300028796	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141	Environmental
3300028812	Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48	Environmental
3300028824	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197	Environmental
3300028828	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202	Environmental
3300028881	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116	Environmental
3300028885	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185	Environmental
3300031197 (restricted)	Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1	Environmental
3300031455	Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_S	Environmental
3300031716	Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3	Environmental
3300032002	Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3	Host-Associated
3300032005	Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1	Host-Associated
3300032126	Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-2	Host-Associated
3300032205	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05	Environmental
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features listed below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiFebDRAFT_1409637913300000363SoilMSPSRARSISACLFTTLSGVTTMSDDGGNDLTRILVKHREDVAAARLAAEQRARNDATARHDCEAPLRAVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPTVRFRIAVGGQNIGADLQTPLDNLATPVVVEGLGRFVTKALQATISHRSDCGP*
JGI10216J12902_10842643413300000956SoilMSVYTEDLLTQILTRHREDVAAARLAAQKRVHDDEDARHDCEAPLRDVALPVLREWSKRLAVEGYPTSVEDLLGCRPPSLVFRLTARGGRESSLTLACESGPTVHFKINIEGKDAGNDIRTPLAELRTDGVVQGLGRFVSVALAATIPKRSDCGP*
JGI10216J12902_10950274923300000956SoilMSADNGDELARIIAKHREEVAAARAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFKMNIDGKDVGTDLQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSDCGP*
soilH1_1009070543300003321Sugarcane Root And Bulk SoilMLTHDDDTLTHILTSHREAVAAARLAAARRAHDDEHTSHDCETPLRDVALPLLKDWSRRLAVEGYPTSIEDLLGCRPPALIIRLVPRGGPESSLTLACEPGPAVRLRITIDGRDTGDDVRTPLAELRAPVVLDGLGRFVEAALAATIPKRSDCRG*
soilH2_1003292873300003324Sugarcane Root And Bulk SoilMLTHDDDTLTHILTSHREAVAAARLAAARRAHDDEHTSHDCETPLRDVALPLLKDWSRRLAVEGYPTSIEDLLGCRPPALIIRLVPRGGPESSLTLACEPGPAVRLRITIDGRDTGDNVRTPLAELRAPVVLDGLGRFVE
Ga0055455_1011644523300003990Natural And Restored WetlandsMLQSAFSEVHDMSADNGDELSRIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGIAMPLLRDWSKRLSAEGYPTSVEDRLGCRPASLVLRLAPHKGPESSLTLACEAGPAVRFRMSVDGKDIGGDTQTPLGELQSGVVLEGLGRFVTAALKAAIPRRSDCGPP*
Ga0062589_10077799623300004156SoilMSADRGDELTRILATHREHVAAARVAAQQRARNAENARHDCEAPLRSVAVPALRQWSVQLAAEGYPASVEDRLGCRPPSLVFRLAPHGAPASSLTLVCEAGPSVRFRINVDGQDLGDDLQTPLAELQTHVVEQLGRFVTAALEATIPRRSDCGP*
Ga0062589_10177847513300004156SoilRPPRRGVGPSGDPETPPCFVSALREVHDMSTDDSDELTRIVAKHREAVAAARVAAQQRARNDESAQHDCESPLRGVALPLLRDWSKRLSVEGYPTSVEDRLGCRPPSLVFRLAPPGGLESSLTLACEAGPAVRFRMNVDGKDVGADLPTPLAELQPRIVLEGLGRFVTAALEAAIPKRSDCGP*
Ga0062590_10015504223300004157SoilMSDDGDVLTEILVQHRSDVAAARLAAEQRARNDATARHDCEGPLRSVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRGGPESSLTLACEVGPAVRFQMVVGGKEIGTDLQTPLADLATPVVREGLGRFVTKALAATIAKRSDCGP*
Ga0062595_10034104413300004479SoilMIPVILIEVHDMSADNGDELSRIIAKHREDVAAARAAAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMTVDGKDLGSDLQTPLAQLQPQVVLEGLGRFVTEALAAAIPRRSDCGPP*
Ga0062592_10052010313300004480SoilMSADRGDELTRILATHREHVAAARVAAQQRARNAENARHDCEAPLRSVAVPALRQWSVQLAAEGYPASVEDRLGCRPPSLVFRLAPHGAPASSLTLVCEAGPSVRFRMNVDGQDLGDDLQTPLAELQTHVVEQ
Ga0062591_10006819933300004643SoilMSDDGDVLTEILVQHRSDVAAARLAAEQRARNDATARHDCEGPLRSVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRGGPESSLTLACEVGPAVRFQIVVGGKEIGTDLQTPLADLATPVVREGL
Ga0062383_1030489913300004778Wetland SedimentMSADNSDELTRILATHRENVAAARVAAEQRARSDEHVRHECEAPLREVALPLLREWSKRLSVEGYPTRVEDRLGCRPPSLVFRLAPHRGTESFLTIACEAGPAVRFMMNVDGKDVGADSRTPLAELQTRVVLEGLGRFVTAALEATIPRRSDCGP*
Ga0062380_1013114613300004779Wetland SedimentMSADNSDELTRILARHRENVAAARVAAEQRARSDEHVRHECEAPLREVALPLLREWSRRLSVEGYPTSVEDRLGCRPPSLVFRLAPHRGTESFLTIACEAGPAVRFTMNVDGKDVGADSRTPLAELQTRVVLEGLGRFVTAALEATIPRRSDCGP*
Ga0062381_1040021413300004808Wetland SedimentMSADNSDELTRILARHRENVAAARVAAEQRARSDEHVRHECEAPLREVALPLLREWSRRLSVEGYPTSVEDRLGCRPPSLVFRLAPHRGTESFLTIACEAGPAVRFTMNVDGKDVGADSRTPLAELQTRVVLEGLGRFVTAA
Ga0062594_10125646523300005093SoilMLLESLHEVHDMSADNGHELTRILATHREHVAAARMAAEQRARNADNARHDCEAPLRRVALPPLREWSARLATEGYPFSVEDRLGCRPPSLVFRLAPHGALEASLTLVCDTGPGVRFRMNVDGQDVGDDLHTPLAELQTDV
Ga0066688_1067574113300005178SoilNPILRRRIARPPVRPQIAFVSSLEVHDMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0066678_1022389633300005181SoilMSTDNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0066388_10632624913300005332Tropical Forest SoilVAAAKLAAEQRARDDEHTRHDCEAPVRAVAIPLLREWSERLAVEGYPTSVEDRLGCRPPALVFHFAPRGSPESSLTLACEAGPVVRFKIAVEGQPAGGDLRTPLAELGPDIVREGLSRFVTAALAATIPRRSDCGPHAKG*
Ga0070709_1126742713300005434Corn, Switchgrass And Miscanthus RhizosphereMSVDVGDQLTRIIAKHRIDVAAAKIAAEQRARNDESVRHDCEAPLRALASPLLRDWAKRLLVEGYPASVEDRLDCRPSSLVFRLTPRGAPESSLTLACEPGPAVRFRINVHGEEVDADFQTPLAELQPQDVLDGLDRFVTAALEATIPRRSDCR*
Ga0070694_10034508813300005444Corn, Switchgrass And Miscanthus RhizosphereMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENARHACEAPLRALALPVLRDWSKRLAVEGYPTNVEDRLGCRPSTLVFRLAPRGGTESSLTLACEPGPTVRFRITVDGKHLGGDSQTPLAELQTHDLLEGLGRFLTAALAATIPKRSDCGP*
Ga0070708_10009574333300005445Corn, Switchgrass And Miscanthus RhizosphereMSADNGDELTRIIAKHRVDVAAAKVAAEQRARNDENVRHDCEDPLRRIALPLLRDWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPHKAPESSLTLACEPGPAVRFRINVQGKDVGADLQTPLAELETRVLLEGLGRFVTVALAAAIPKRSDCGP*
Ga0070708_10020829823300005445Corn, Switchgrass And Miscanthus RhizosphereMSSDNGDELTRIIAKHREEVAAARAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGQDVGADLQTPLAELQPRIVLEGLGRFVTVALKAAIPRRSDCGP*
Ga0070663_10111916523300005455Corn RhizosphereMIPVILIEVHDMSADNGDELSRIIAKHREDVAAARAAAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESTLTLACESGPAVRFRMNVDGKDLGSDLRTPLGELQPHII
Ga0070706_10116753513300005467Corn, Switchgrass And Miscanthus RhizosphereMSVDVDVGDQLTRIIAKHRIDVAAAKVAAEQRARNDESVRHDCEAPLRALALPLLRDWAKRLLVEGYPANVEDRLGCRPSSLVFRLTPRGAPESSLTLVGAPGPAVRFRINVQGEEVDTDFQTPLAELQPQDVLDGLDRFVTAALEATIPRRSDCR*
Ga0070699_10016184533300005518Corn, Switchgrass And Miscanthus RhizosphereMSADNGDELTRIIAKHRVDVAAAKVAAEQRARNDENVRHDCEDPLRRIALPLLRDWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPQKAPEYSLTLACEPGPAVRFKINVQGKDVGADLQTPLAELETRVLLEGLGRFVTVALAAAIPKRSDCGP*
Ga0070699_10114016113300005518Corn, Switchgrass And Miscanthus RhizosphereMPVDTDDLLTRIIAKHRIDVAAAKIAAQQRARNDESVRHDCEAPLRALALPLLRDWAKRLLVEGYPANVEDRLGCRPSSLVFRLTPRGAPESSLTLVGAPGPAVRFRINVQGEEVDTDFQTPLAELQPQDVLDGLDRFVTAALEATIPRRSDCR*
Ga0070697_10035148713300005536Corn, Switchgrass And Miscanthus RhizosphereRAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGQDVGADLQTPLAELQPRIVLEGLGRFVTVALKAAIPRRSDCGP*
Ga0068857_10232741713300005577Corn RhizosphereMLTEIGDELARIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGSDLQTPLAELQTGIVLEGLGRF
Ga0068857_10241011913300005577Corn RhizosphereAARAAAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMTVDGKDLGSDLQTPLAQLQPQVVLEGLGRFVTQALAAAIPRRSDCGPP*
Ga0066706_1019007713300005598SoilMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHACEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0068864_10063145823300005618Switchgrass RhizosphereMSDDDGDDLTQIITRHREDVAAARDAAAQRARNDATARHDCEAPLRGVALPLFRDWSKRLLVEGYPTSIEDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPAIRFKIAVDGKNIGADLQTPLADLAPPVVLEGLGRFVSKALAATISHRSDCGP*
Ga0079222_1080542223300006755Agricultural SoilMPVDTADELTRILAQHHEAVEAARRAATQRAHDDEHTRHDCETPLRDIALPVLREWSKRLAVEGYPANIEDLLGCRPPALVFRLAPRGGPESSLTLACESGPAVRFRMSVDGHVDAGVRTPLAELRPPVVVDGLARFVEAALAATIPKRSDCGP*
Ga0075421_10005143933300006845Populus RhizosphereMLLESLHEVPDMSAHDGDELTRILATHREHVAAAQVAAQQRARNDENARHDCEAPLRRVAFPPLREWSLRLATEGYPATIEDRLGCHPPSLVFRLAPHGAPASSLTLACEVGPAVRFRMNVDGREIVDDLQTPLQDLQTSVVLEQLGRFVTAALAATIPSRSDCWP*
Ga0075420_10035551023300006853Populus RhizosphereVAAQQRARNDENARHDCEAPLRRVAFPPLREWSLRLATEGYPATIEDRLGCHPPSLVFRLAPHGAPASSLTLACEVGPAVRFRMNVDGREIVDDLQTPLQDLQTSVVLEQLGRFVTAALAATIPSRSDCWP*
Ga0075420_10193180313300006853Populus RhizosphereELTRILATHRERVAAAQVAAQQRARNDEYARHDCEAPVRRLAFPTLREWSLRLAAEGYPASVEDRLGCRPPSLVFRLAPHGAPASSLTLACEAGPAVRFTMNVDGKDVDDDLQTPLAELQTRVVLEQLGRFVTAALAATIPRRSDCGP*
Ga0075434_10247608213300006871Populus RhizosphereMSTDTAEQLTRIIEKHREDVEAARTAALQRARSDDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPAVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEATIPKRSDCRP*
Ga0066710_10151939723300009012Grasslands SoilMPVDTEDQLTRIIAKHRTHVAAAKIAAEQRARNDESVRHDCEAPLRALALPLLRDWAKRLLVEGYPANVEDRLGCRPSSLVFRLTPRGAPESSLTLVGAPGPAVRFRINVQGEEVDTDFQTPLAELQSQDVLDGLDRFVTAALEATIPRRSDCR
Ga0111538_1120019313300009156Populus RhizosphereNSPMLLESLHEVHDMSADNGHELTRILATHREHVAAARMAAEQRARNADNARHDCEAPLRRVALPPLREWSARLATEGYPFSVEDRLGCRPPSLVFRLAPHGALEASLTLVCDTGPGVRFRMNVDGQDVGDDLHTPLAELQTDVVVEQLGRFVTAALEATIPRRSDCGP*
Ga0126306_1011256423300010166Serpentine SoilMLMGSLHEVHDMSADDGDELTRILAAHREHVAAARVAAQQRARNAESARHDCEAPLRRVAFPLLREWSVRLGTEGYPASVEDRLGCRPPSLVFRLAPHGAPASSLTLACEAGPAVRFKMNVDGKDLDKDLQTPLVELETHVVVEQLGRFVASALAATIPRRSDCGP*
Ga0126306_1033394223300010166Serpentine SoilMPADNGDELSRILAIHREHVAAARVAAEQRARTDENARHDCEAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPPSLVFRLAPHKGPESSLTLACEAGPAVRFRMTVDGKDVGTDLQTPLAELQTGIVLEGLGRFVTAALAAAIPRRSDCGPP*
Ga0134122_1205464323300010400Terrestrial SoilADHGDELARILATHRQHVAAAKAAAEQRARKDETAWHDCEAPLRGVALPLLQTWAKRLSAEGYPTSVEDRVGCRPSCLVFRLAPHKGPESTLTLAVETGPAVRFRMNVDGEELGDALRTPLADLRPTVVLQGLGQFVTAALEAAIPRRSDCGP*
Ga0137463_109469723300011444SoilMSADNGDELTRIIAKHREEVAAARAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADLQTPLAELQPRIVLEGLGRFVTVALKAAIPRRSDCGP*
Ga0137376_1022708723300012208Vadose Zone SoilMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0137377_1035132633300012211Vadose Zone SoilPPVRPQIAFVSSLEVHDMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0137367_1090618613300012353Vadose Zone SoilRKLPHASVSLLEVHDMSADNGDELTRILAKHREDVAAARVAAEQRARNDENARHDCEAPLRGVALPLLRDWSKRLSAEGYPTSVEDRLGCRPAGLVFRLAPHRGPESSLTLVCEAGPAVRFRMNVDGKDVGADLQTPLAELQTRVVLEGLGRFVTVALEATIPRRSDCGP*
Ga0137375_1099086623300012360Vadose Zone SoilAAARAAAEQRARNDASARHDCEAPLRGVALPPLREWAKRLSAEGYPTTVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDLGSDLQTPLAELQPRIVLEGLGRFVTAALEAAIPRRSDCGPP*
Ga0137360_1095117513300012361Vadose Zone SoilMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKEAIPKRSDCGPWRSPAGALATHLRRH
Ga0137361_1149503513300012362Vadose Zone SoilIPLHVILSGAKDRVGGTSEVPRVTRMREPILRFAQDDPGRRRIARPPVRPQIAFVSSLEVHDMSTDIADQLTRIIANYSEDVAAAKVAAEQRARNDEDTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNIQGKDVDAELQPPLAELKSPVLLEGLS
Ga0150984_10023925713300012469Avena Fatua RhizosphereMSADNGDELARIIAKHREDVAVARAAAEQRARNDENSRHGCDAPLRGMAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFKMNVDGKDLGSDLQTPLAELQPRVIIEGLGRFVTEALAAAIPRRSDCGPP*
Ga0137395_1013489123300012917Vadose Zone SoilMSADNGDELTRIIAKHRTDVAAARVAAEQRARNDENVRHDCEDPLRRIALPLLRDWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPHKAPESSLTLACEPGPAVRFRINVQAKDVGADLQTPLAELETRVLLEGLGRFVTVALAAAIPKRSDCGP*
Ga0137359_1022457933300012923Vadose Zone SoilMSTDIADQLTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNIQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0137404_1041382913300012929Vadose Zone SoilMPVDTDDQLTRIIAKHRTHVAAAKIAAEQRARNDESVRHDCEAPLRALALPLLRDWAKRLLVEGYPANVEDRLGCRPSSLVFRLTPRGAPESSLTLVGAPGPAVRFRINVQGEEVDTDFQTPLAELQPQDVLDGLDRFVTAALEATIPRRSDCR*
Ga0137404_1181640313300012929Vadose Zone SoilVAAAKAAAEQRARNDENARHGCDAPLRGVAMPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADVQTPLAELQTRVVLEGLGRFVTAALEAAIPRRSDCGP*
Ga0137407_1018971333300012930Vadose Zone SoilMSADNGDELTRIIATHREHVAAARVAAEQRAHNDENARHDCGAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADSQTPLAELQTGIVLEGLERFVTVALKAAIPRRSDCGP*
Ga0137407_1037527813300012930Vadose Zone SoilEVHDMSADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNIQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALKAAIPKRSDCGP*
Ga0162652_10008479713300012941SoilDELTRILAKHREDVAAARVAAEQRARQDENVRHDCVAPLRGVALPVLRDWSQRLSAEGYPTSVEDRLGCRPPSLVFRLAPHGAPEATLTLVCDAGPGVRFRMNVDGQDVGDDLQTPLAELQTHVVVEQLGRFVTAALEATIPRRSDCGP*
Ga0164298_1047661023300012955SoilMSTETAEQLTRIIEKHREDVEAARTAALQRARSEDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPAVRFRINIEGKEVSADLQTSLAELEPDVVIEGLGRFVTAALEATIPKRSDCRP*
Ga0164298_1074100013300012955SoilMEQLEVSMSAHRVLEAHEMSADGGDQLNRIIAKHKDAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRMNVDGKDVQGDSQTPLAELQPRDVLEGLGRFVTAALAASIPKRSDCGP*
Ga0164298_1120241713300012955SoilARDAAAQRARNDATARHDCEAPLRGVALPLFRDWSKRLLVEGYPTSIEDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPAIRFKIAVDGKNIGADLQTPLGDLAPPIVLEGLGRFVSKALAATISHRSDCGP*
Ga0164303_1074395213300012957SoilAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRMNVDGKDVQGDSQTPLAELQPRDVLEGLGRFVTAALAASIPKRSDCGP*
Ga0164303_1099480613300012957SoilGHDMLTDNADQLTRIIEKHREDGAAAKSAAEQRARNDLNARHDCEAPLRGVALPLLREWSKRLSVEGYPTSVEDRLGCRPASLVFRLAPHGGMESSSLTLACEAGPAVRFKMHVQGKDLGADSQTPLAELESHVVLEGLGRFVTVALAAAIPKRSDCGP*
Ga0164299_1007452323300012958SoilMSTDTGEQLTRIIEKHREDVEAARTAALQRARSEDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPVVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEAAIPKRSDCRP*
Ga0164301_1011371823300012960SoilMSTDTAEQLTRIIEKHREDVEAARTAALQRARSEDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPAVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEATIPKRSDCRP*
Ga0164302_1035263413300012961SoilMEQREVSMSAQRVLEVHEMSADGGDQLNRIIAKHKDAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRLNVDGKDLEGDLQTPLAELQPRDVLDGLGRFVTAALAAAIPKRSDCGP*
Ga0164308_1067487713300012985SoilMEQREVSMSAQRVLEAHEMSADGGDQLNLILAKHKDAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRMNVDGKDVQGDSQTPLAELQPRDVL
Ga0164304_1022626633300012986SoilMSDDNGDDLTQIIVRHREDVAAARDAAAQRARNDATARHDCEAPLRGVALPLFRDWSKRLLVEGYPTSIEDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPAIRFKIAVDGKNIGADLQTPLADLAPPIVLEGLGRFVSKALAATISHRSDCGP*
Ga0164304_1026618123300012986SoilMSDDGGDVLTQILVQHRSDVAAARLAAEQRARNDATARHDCEGPLRSVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRGGPESSLTLACEVGPAVRFQMVVGGKEIGTDLQTPLADLATPVVREGLGRFVTKALAATIAKRSDCGP*
Ga0164304_1027957423300012986SoilMSTDTAEQLTRIIEKHREDVEAARTAALQRARSEDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPVVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEATIPK
Ga0164304_1035389323300012986SoilMLTDNADQLTRIIEKHREDVAAAKSAAEQRARNDLNARHDCEAPLRGVALPLLREWSKRLSVEGYPTSVEDRLGCRPASLVFRLAPHGGMESSSLTLACEAGPAVRFKMHVQGKDLGADSQTPLAELESHVVLEGLGRFVTVALAAAIPKRSDCGP*
Ga0164304_1145144213300012986SoilLEVHDMSANNGDELSRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVAMPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPAVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEATIPKRSDCRP*
Ga0164305_1155832713300012989SoilMLTDNADQLTRIIEKHREDVAAAKSAAEQRARNDLNARHDCEAPLRGVALPLLREWSKRLSVEGYPTSVEDRLGCRPASLVFRLAPHGGMESSSLTLACEAGPAVRFKMNVQGKDLGADSQTPLAELESHVVLEGLGRFVTVALAAAIPKRSDCGP*
Ga0137403_1009993333300015264Vadose Zone SoilVAAARVAAEQRARNDQNARHDCEAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADVQTPLAELQTRVVLEGLGRFVTAALEAAIPRRSDCGP*
Ga0132258_1012533243300015371Arabidopsis RhizosphereMSDNGGDDLTRILVQHREDVAAARLAAEQRARNDATARHDCEGPLRTVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRAGPESSLTLACEVGPAVRFRMVVGGKEVGTDLQTPLADLASPIVLDGLGRFVTKALAATIAKRSDCGP*
Ga0132258_1261457923300015371Arabidopsis RhizosphereMSTDTAEQLTRIIEKHREDVEAARTAALQRARSEDNVRHDCEAPLRAVALPLLRDWSKRLGVEGYPTRVEDRLGCRPASLVFRLAPHGAPESSLTLACEAGPVVRFRINIEGKEVSTDLQTSLAELEPDVVIEGLGRFVTAALEATIPKRSDCRP*
Ga0132258_1288406423300015371Arabidopsis RhizosphereRLAAQQRVHDDEDARHDCEAPLRDVALPVLREWSKRLAVEGYPTSVEDLLGCRPPSLVFRLTARGGRESSLTLACESGPTVHFKINIEGKDAGNDIRTPLAELRTDGVVQGLGRFVSVALAATIPKRSDCGP*
Ga0132255_10458418213300015374Arabidopsis RhizosphereGGDQLNLILAKHKDAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRMNVDGKDVQGDSQTPLAELQPRDVLEGLGRFVTAALAASIPKRSDCGP*
Ga0184604_1026208213300018000Groundwater SedimentMSADNGDELTRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGRMVRLKMNVDGKDAGADAPTPLAELQTRDILEGLGRFVTEALAAAIPRRSDCGPP
Ga0184605_1018951723300018027Groundwater SedimentMSADNGDELSRIIAQHREGVAKAKAAAEQRARNDENARHGCDAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMDVDGKNVGADLQTPLAELQPSVVLEGLGVFVTAALEAAIPRRSDCGP
Ga0184618_1000823523300018071Groundwater SedimentMSADNGDELSRIIAQHREGVAKAKAAAEQRARNDENARHGCDAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMDVDGKNVGADLQTPLAELQPSVVLEGLGRFVTEALAAAIPRRSDCGPP
Ga0190272_1051456923300018429SoilMLLESLHEVHDMSADNGAELTRILAAHREHVAAARAAAQQRALNVESVRHECEAPLRRVAFPLLREWSLRLGVEGYPASVEDRLGCRPPGLVFRLAPHGTPASSLTLACEAGPAVRFKMNVDGKDLDDDLQTPLVELETGVVVEQLGRFVTAALAATIPRRSDCRP
Ga0137408_114358513300019789Vadose Zone SoilGPRHVCADNGDELTRIIAKHREDVAAAKVAAEQRARNDENTRHDCEAPLRAAALPLLREWSKRLGVEGYPTSVEDRLGCRPPSLVFRLAPRGGPESSLTLACEPGPAVRFRMNVQGKDVDAELQTPLAELKTPVLLEGLSRFVTVALRAAIPKRSDCGP
Ga0193743_123067213300019889SoilRPTWVSPFQRPGNSPILPVILLEVHDMSADNGDELTRIIVKHREDVAAARVAAEQRARNDENARHGCDAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMSVDGKDVGADSQTPLAELQTGVVLEGLGRFVTEALAAAIPRRSDCG
Ga0210381_1012543123300021078Groundwater SedimentDMSADNGDELTRIIVKHREDVAAARTAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLIFRLAPHKGPESTLTLACESGPAVRFRMNVDGKDLGSDLQTPLAELQPRIIIEGLGRFVTEALAAAIPRRSDCGP
Ga0210382_1018680713300021080Groundwater SedimentAAAEQRARNDENARHGCDAPLRGVALPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMDVDGKNVGADLQTPLAELQPSVVLEGLGVFVTAALEAAIPRRSDCGP
Ga0210377_1011442723300021090Groundwater SedimentMSADQGDELTRILAKHREDVAQARVAAEQRARNAEHVRHDCEAPLRGIALPLLREWSKRLSAEGYPTSVEDLLGCRPPSLVFRLAPHKGPESSLTLACEAGPAVRLRMNVDGKDVGADLQTPLAELQTGVVLEGLGRFVTAALEATIPRRSDCGP
Ga0210142_100156453300025552Natural And Restored WetlandsMSADNGDELSRIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGIAMPLLRDWSKRLSAEGYPTSVEDRLGCRPASLVLRLAPHKGPESSLTLACEAGPAVRFRMSVDGKDIGGDTQTPLGELQSGVVLEGLGRFVTAALKAAIPRRSDCGPP
Ga0210120_110861313300025556Natural And Restored WetlandsMSADNGDELSRIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGIAMPLLRDWSKRLSAEGYPTSVEDRLGCRPASLVLRLAPHKGPESSLTLACEAGPAVRFRMSVDGKDIGGDTQTPLGELQSGVVLEGLGRFVTAALKAAIPRRSDCGP
Ga0207699_1067427723300025906Corn, Switchgrass And Miscanthus RhizosphereMSVDVGDQLTRIIAKHRIDVAAAKIAAEQRARNDESVRHDCEAPLRALASPLLRDWAKRLLVEGYPASVEDRLDCRPSSLVFRLTPRGAPESSLTLACEPGPAVRFRINVHGEEVDADFQTPLAELQPQDVLDGLDRFVTAALEATIPKRSDCR
Ga0207684_1019555923300025910Corn, Switchgrass And Miscanthus RhizosphereMSSDNGDELTRIIAKHREEVAAARAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGQDVGADLQTPLAELQPRIVLEGLGRFVTVALKAAIPRRSDCGP
Ga0207707_1029199423300025912Corn RhizosphereMIPVILIEVHDMSADNGDELSRIIAKHREDVAAARAAAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMTVDGKDLGSDLQTPLAQLQPQVVLEGLGRFVTEALAAAIPRRSDCGPP
Ga0207663_1130838713300025916Corn, Switchgrass And Miscanthus RhizosphereMSAQRVLEVHEMSADGGDQLNRIIAKHKDAVAAAKAAAEQRARNDANARHDCEAPLRRVVLPLLRDWSERLAVEGYPTSVEDRLGCRPSSLVFRLAPHGGPESSLTFGCVAGPAVRFRLNVDGKDLEGDLQTPLAELQPRDVLDGLGRFVAAALAASIPKRSDCGP
Ga0207678_1008504853300026067Corn RhizosphereMIPVILIEVHDMSADNGDELSRIIAKHREDVAAARAAAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFKMNVDGKDLGSDLQTPLAQLQPQFVLEGLGRFVSVALAGAIPRRSDCGP
Ga0207641_1025166823300026088Switchgrass RhizosphereMSDDDGDDLTQIITRHREDVAAARDAAAQRARNDATARHDCEAPLRGVALPLFRDWSKRLLVEGYPTSIEDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPAIRFKIAVDGKNIGADLQTPLADLAPPVVLEGLGRFVSKALAATISHRSDCGP
Ga0207641_1229775113300026088Switchgrass RhizosphereVAAARLAAEQRARNDATARHDCEGPLRTVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRGGPESSLTLACEVGPAVRFQMVVGGKEIGTDLQTPLADLATPVVREGLGRFVTKALAATIAKRSDCGP
Ga0207676_1040780723300026095Switchgrass RhizosphereMSADNGDELSRIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESTLTLACEAGPAVRFRMNVDGKDLGSDLRTPLAELQPGIIIEGLGRFVTEALAAAIPRRSDCGPP
Ga0207676_1042608833300026095Switchgrass RhizosphereMSDDDGDDLTQIITRHREDVAAARDAAAQRARNDATARHDCEAPLRGVALPLFRDWSKRLLVEGYPTSIEDRLGCRPPSLVLRLAPRGGPESSLTLACEVGPAIRFKIAVDGKNIGADLQTPLADLAPPVVLEGLGRFVSKALAATISHRSDCG
Ga0207676_1047546823300026095Switchgrass RhizosphereMSADNGDELSRIIVKHREEVAAAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMSVDGKDLGFDLQTPLAELQPRVILEGLGRFVTEALAAAI
Ga0207674_1155405323300026116Corn RhizosphereMLTEIGDELARIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGSDLQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSDCGP
Ga0257171_104099323300026377SoilRAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADSQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSDCGP
Ga0257164_109287013300026497SoilEDVAAARAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADSQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSDCGP
Ga0209074_1055232413300027787Agricultural SoilAMPVDTADELTRILAQHHEAVEAARRAATQRAHDDEHTRHDCETPLRDIALPVLREWSKRLAVEGYPANIEDLLGCRPPALVFRLAPRGGPESSLTLACESGPAVRFRMSVDGHVDAGVRTPLAELRPPVVVDGLARFVEAALAATIPKRSDCGP
Ga0209382_1001429323300027909Populus RhizosphereMLLESLHEVPDMSAHDGDELTRILATHREHVAAAQVAAQQRARNDENARHDCEAPLRRVAFPPLREWSLRLATEGYPATIEDRLGCHPPSLVFRLAPHGAPASSLTLACEVGPAVRFRMNVDGREIVDDLQTPLQDLQTSVVLEQLGRFVTAALAATIPSRSDCWP
Ga0268265_1078185423300028380Switchgrass RhizosphereAEQRARSDENARHGCDAPLRGIAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFLLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGSDLQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSDCGP
Ga0307298_1006438223300028717SoilVLRSPCLLNASSGRLTQPLETTRKLPHPSVILLQVDHMSADNGDELARIIAKHREDVAVAKAAAEQRARNDENSRHGCDAPLRGMAMPLLRQWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDLGSDLRTPLAELQPGIIIEGLGRFVTEALAAAIPRRSDCGPP
Ga0307320_1020115423300028771SoilMSADNGDELTRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDIGADLQTPLAELQTGIVLEGLGRFVTAALKAAIPRRSDCGP
Ga0307504_1040568113300028792SoilMLTDQADELTRIIEKHREDVAAAKAAAEQRARNDLNARHDCEAPLRTVALPLLREWSKRLSVEGYPTSVEDRLGCRPASLVFHLAPRGGPESSLTLACEAGPAVRFKMNIQGKDVGDDSQTPLAQLESHVVLEGLGRF
Ga0307287_1005622623300028796SoilMSADNGDELTRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGSDLQTPLAELQTHVVLEGLGRFVTVALKAAIPRRSDCGP
Ga0247825_1005146453300028812SoilMSAVPMSADHGDELARILATHRQHVAAAKAAAEQRARKDEIAWHDGEAPLRGVALPLLHTWAKRLSAEGYPTSVEDRVGCRPSCLVFRLAPHKGRESTLTLAVESGPTVRFRMNVDGEELGDELKTPLADLRPPIVLQGLGQFVTAALEAAIPRRSDCGP
Ga0307310_1011482823300028824SoilMSADNGDELARIIAKHREDVAVARAAAEQRARNDENSRHGCDAPLRGMAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESTLTLACEAGPAVRFRMNVDGKDLGSDLRTPLAELQPGIIIEGLGRFVTEALAAAIPRRSDCGPP
Ga0307312_1118806013300028828SoilLLEVDHMSADNGDELARIIAKHREDVAVARAAAEQRARNDENSRHGCDAPLRGMAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDLGSDLQTPLTELQPRIIIDGLGRFVTEALAAAIPRRSDCGPP
Ga0307277_1010158823300028881SoilMSADNGDELTRILATHREQVAAAQVAAQQRSRNAEHARHDCEAPLRRVAFPPLREWSARLATEGYPTTIEDRLGCLPPSLVFRLAPHGGPASSLTLACEAGPAVRFRMDVDGKAVEDDLQTPLAELETRVILEQLGRFVAVALAASIPRRSDCGP
Ga0307304_1042590313300028885SoilMSADNGDELTRIIAKHREDVAAAKAAAEQRARNDENARHGCDAPLRGVAMPLLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMDVDGKNVGADLQTPLAELQPSVVLEGLGVFVTAALEAAIPRR
(restricted) Ga0255310_1017292513300031197Sandy SoilMSTDNGDELTRIIAKHREEVAAARMAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESSLTLACEAGPAVRFRMNVDGKDVGADSQTPLAELQTGIVLEGLGRFVTVALKAAIPRRSD
Ga0307505_1061131613300031455SoilPMLHADLMEAHDMSADNSEELTRILDTHREHVAAARMAAEQRARHDEHTRHDCEAPLRGVALPVLRDWSKRLSGEGYPTSVEDRLGCRPPGLVFRLAPRGGPESSLTLACEAGPAVRFRMQVDGKEVGADLQTPLAELQAPVVLEELGRFVTAALEATIRRRSDCGP
Ga0310813_1172819413300031716SoilTRILVQHREDVAAARLAAEQRARNDATARHDCEGPLRTVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRAGPESSLTLACEVGPAVRFRMVVGGKEVGTDLQTPLADLASPIVLDGLGRFVTKALAATIAKRSDCGP
Ga0307416_10006088733300032002RhizosphereMSTDNGDELTRILATHREHVAAAKVAAEQRARNDETARHDCESPLRGVASPLLREWAKRLSAEGYPTHVEDRLGCRPPSVVFRLAPHKGPESSLMLVCVAGPAVRFRMNIDGKDVGDDWQTPLAELQPRVIQEGLGRFVTIALEAIIPRRSDCGP
Ga0307411_1061844023300032005RhizosphereMSTDNGDELTRILATHREHVAAAKVAAEQRARNDETARHDCESPLRGVASSLLREWAKRLSAEGYPTHVEDRLGCRPPSVVFRLAPHKGPESSLTLVCVAGPAVRFRMNIDGKDVGDDWQTPLAELQPRVIQEGLGRFVTIALEAIIPRRSDCGP
Ga0307415_10054466623300032126RhizosphereMSTDNGDELTRILATHREHVAAAKVAAEQRARNDETARHDCESPLRGVASPLLREWAKRLSAEGYPTHVEDRLGCRPPSVVFRLAPHKGPESSLTLVCVAGPAVRFRMNIDGKDVGDDWQTPLAELQPRVIQEGLGRFVTIALEAIIPRRSDCGP
Ga0307472_10109022413300032205Hardwood Forest SoilMSVYTEDLLTQILTRHREDVAAARLAAQQRVHDDEDARHDCEAPLRDVALPVLREWSKRLAVEGYPTSVEDLLGCRPPSLVFRLTARGGRESSLTLACESGPTVHFKINIEGKDAGNDIRTPLAELRTDGVVQGLGRFVSVALAATIPKRSDCGP
Ga0310812_1003999833300032421SoilMSDNGGDDLTRILVQHREDVAAARLAAEQRARNDATARHDCEGPLRTVALPLFREWSKRLGVEGYPTSIDDRLGCRPPSLILRLAPRAGPESSLTLACEVGPAVRFRMVVGGKEVGTDLQTPLADLASPIVLDGLGRFVTKALAATIAKRSDCGP
Ga0373948_0032718_3_4913300034817Rhizosphere SoilLFEVNDMSSDNGDELTRIIAKHREEVAAARAAAEQRARNDENARHGCDAPLRGVALPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESTLTLACEAGPAVRFRMNVDGKDLGSDLRTPLAELQPGIIIEGLGRFVTEALAAAIPRRSDCGPP
Ga0373959_0186409_24_4943300034820Rhizosphere SoilMLTEIGDELARIIAKHREDVAVAKAAAEQRARNDENARHGCDAPLRGIAMPVLREWSKRLSAEGYPTSVEDRLGCRPASLVFRLAPHKGPESTLTLACEAGPAVRFRMNVDGKDLGSDLRTPLAELQPGIIIEGLGRFVTEALAAAIPRRSDCGPP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice