NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F063180

Metagenome Family F063180

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F063180
Family Type Metagenome
Number of Sequences 130
Average Sequence Length 150 residues
Representative Sequence AIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK
Number of Associated Samples 117
Number of Associated Scaffolds 130

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 4.62 %
% of genes near scaffold ends (potentially truncated) 95.38 %
% of genes from short scaffolds (< 2000 bps) 90.00 %
Associated GOLD sequencing projects 111
AlphaFold2 3D model prediction Yes
3D model pTM-score0.36

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (79.231 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil
(14.615 % of family members)
Environment Ontology (ENVO) Unclassified
(44.615 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(39.231 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 62.21%    β-sheet: 0.00%    Coil/Unstructured: 37.79%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.36
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 130 Family Scaffolds
PF00012HSP70 6.92
PF02518HATPase_c 4.62
PF09084NMT1 2.31
PF03401TctC 1.54
PF10162G8 1.54
PF00528BPD_transp_1 1.54
PF00004AAA 0.77
PF13458Peripla_BP_6 0.77
PF04392ABC_sub_bind 0.77
PF02371Transposase_20 0.77
PF13384HTH_23 0.77
PF13229Beta_helix 0.77
PF00005ABC_tran 0.77
PF13472Lipase_GDSL_2 0.77
PF13683rve_3 0.77

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 130 Family Scaffolds
COG0443Molecular chaperone DnaK (HSP70)Posttranslational modification, protein turnover, chaperones [O] 6.92
COG0715ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic componentInorganic ion transport and metabolism [P] 2.31
COG4521ABC-type taurine transport system, periplasmic componentInorganic ion transport and metabolism [P] 2.31
COG3181Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctCEnergy production and conversion [C] 1.54
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.77
COG3547TransposaseMobilome: prophages, transposons [X] 0.77


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms79.23 %
UnclassifiedrootN/A20.77 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001431|F14TB_100744049All Organisms → cellular organisms → Bacteria665Open in IMG/M
3300004633|Ga0066395_10450932All Organisms → cellular organisms → Bacteria733Open in IMG/M
3300004779|Ga0062380_10137098All Organisms → cellular organisms → Bacteria945Open in IMG/M
3300004779|Ga0062380_10443338All Organisms → cellular organisms → Bacteria570Open in IMG/M
3300004808|Ga0062381_10043359All Organisms → cellular organisms → Bacteria1252Open in IMG/M
3300005289|Ga0065704_10051853Not Available697Open in IMG/M
3300005328|Ga0070676_11348726Not Available546Open in IMG/M
3300005343|Ga0070687_101378023Not Available526Open in IMG/M
3300005347|Ga0070668_101048237All Organisms → cellular organisms → Bacteria734Open in IMG/M
3300005456|Ga0070678_100671294Not Available931Open in IMG/M
3300005564|Ga0070664_101111634All Organisms → cellular organisms → Bacteria744Open in IMG/M
3300005569|Ga0066705_10853533Not Available541Open in IMG/M
3300005719|Ga0068861_100747857All Organisms → cellular organisms → Bacteria913Open in IMG/M
3300005764|Ga0066903_102835791All Organisms → cellular organisms → Bacteria940Open in IMG/M
3300005764|Ga0066903_107850258All Organisms → cellular organisms → Bacteria548Open in IMG/M
3300005829|Ga0074479_10762445All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300005829|Ga0074479_11032819All Organisms → cellular organisms → Bacteria627Open in IMG/M
3300005836|Ga0074470_11116658Not Available552Open in IMG/M
3300006046|Ga0066652_100738122Not Available939Open in IMG/M
3300006845|Ga0075421_100201634All Organisms → cellular organisms → Bacteria2462Open in IMG/M
3300006845|Ga0075421_102558281Not Available531Open in IMG/M
3300006852|Ga0075433_11921616All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300006853|Ga0075420_101878555All Organisms → cellular organisms → Bacteria512Open in IMG/M
3300006854|Ga0075425_101085287All Organisms → cellular organisms → Bacteria912Open in IMG/M
3300006871|Ga0075434_101474182Not Available690Open in IMG/M
3300006881|Ga0068865_100913276All Organisms → cellular organisms → Bacteria764Open in IMG/M
3300006904|Ga0075424_102635073All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300006969|Ga0075419_10232293All Organisms → cellular organisms → Bacteria1229Open in IMG/M
3300009053|Ga0105095_10555252All Organisms → cellular organisms → Bacteria638Open in IMG/M
3300009111|Ga0115026_10676282All Organisms → cellular organisms → Bacteria793Open in IMG/M
3300009137|Ga0066709_101374409All Organisms → cellular organisms → Bacteria1029Open in IMG/M
3300009143|Ga0099792_10576312All Organisms → cellular organisms → Bacteria714Open in IMG/M
3300009147|Ga0114129_10048644All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales5958Open in IMG/M
3300009167|Ga0113563_13989431All Organisms → cellular organisms → Bacteria500Open in IMG/M
3300009174|Ga0105241_11257819All Organisms → cellular organisms → Bacteria703Open in IMG/M
3300009176|Ga0105242_12176554All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300009609|Ga0105347_1025187All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2058Open in IMG/M
3300010040|Ga0126308_10075514All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria2017Open in IMG/M
3300010366|Ga0126379_11826334All Organisms → cellular organisms → Bacteria712Open in IMG/M
3300010397|Ga0134124_12407158All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300010401|Ga0134121_10807317Not Available902Open in IMG/M
3300010401|Ga0134121_11333621All Organisms → cellular organisms → Bacteria725Open in IMG/M
3300010403|Ga0134123_11421331All Organisms → cellular organisms → Bacteria735Open in IMG/M
3300010403|Ga0134123_11739698All Organisms → cellular organisms → Bacteria675Open in IMG/M
3300011003|Ga0138514_100005218Not Available1905Open in IMG/M
3300011406|Ga0137454_1064532All Organisms → cellular organisms → Bacteria606Open in IMG/M
3300011409|Ga0137323_1013648All Organisms → cellular organisms → Bacteria1854Open in IMG/M
3300011413|Ga0137333_1162640All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300011430|Ga0137423_1174168All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300011437|Ga0137429_1151049All Organisms → cellular organisms → Bacteria720Open in IMG/M
3300011441|Ga0137452_1313520All Organisms → cellular organisms → Bacteria522Open in IMG/M
3300011442|Ga0137437_1005760All Organisms → cellular organisms → Bacteria → Proteobacteria4382Open in IMG/M
3300012038|Ga0137431_1023728All Organisms → cellular organisms → Bacteria1674Open in IMG/M
3300012038|Ga0137431_1083094All Organisms → cellular organisms → Bacteria891Open in IMG/M
3300012041|Ga0137430_1019896All Organisms → cellular organisms → Bacteria → Proteobacteria1731Open in IMG/M
3300012142|Ga0137343_1029909All Organisms → cellular organisms → Bacteria747Open in IMG/M
3300012143|Ga0137354_1075642All Organisms → cellular organisms → Bacteria535Open in IMG/M
3300012199|Ga0137383_10675856Not Available754Open in IMG/M
3300012202|Ga0137363_10267812All Organisms → cellular organisms → Bacteria1393Open in IMG/M
3300012205|Ga0137362_11089176All Organisms → cellular organisms → Bacteria679Open in IMG/M
3300012206|Ga0137380_11279614All Organisms → cellular organisms → Bacteria618Open in IMG/M
3300012232|Ga0137435_1119217All Organisms → cellular organisms → Bacteria799Open in IMG/M
3300012355|Ga0137369_10905471All Organisms → cellular organisms → Bacteria592Open in IMG/M
3300012480|Ga0157346_1010052All Organisms → cellular organisms → Bacteria680Open in IMG/M
3300012484|Ga0157333_1031854All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300012499|Ga0157350_1007649Not Available842Open in IMG/M
3300012511|Ga0157332_1029250Not Available693Open in IMG/M
3300012923|Ga0137359_11610543Not Available537Open in IMG/M
3300012929|Ga0137404_11037824All Organisms → cellular organisms → Bacteria751Open in IMG/M
3300012961|Ga0164302_10055406All Organisms → cellular organisms → Bacteria1992Open in IMG/M
3300013297|Ga0157378_10025753All Organisms → cellular organisms → Bacteria5182Open in IMG/M
3300013306|Ga0163162_10103072All Organisms → cellular organisms → Bacteria2947Open in IMG/M
3300013308|Ga0157375_10926455All Organisms → cellular organisms → Bacteria1014Open in IMG/M
3300013308|Ga0157375_12153293Not Available664Open in IMG/M
3300014874|Ga0180084_1056402All Organisms → cellular organisms → Bacteria786Open in IMG/M
3300014876|Ga0180064_1134154All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus chondroitinus525Open in IMG/M
3300014879|Ga0180062_1051371Not Available887Open in IMG/M
3300015258|Ga0180093_1079751Not Available779Open in IMG/M
3300015373|Ga0132257_100414258All Organisms → cellular organisms → Bacteria1640Open in IMG/M
3300015373|Ga0132257_102485401All Organisms → cellular organisms → Bacteria673Open in IMG/M
3300015374|Ga0132255_105965638All Organisms → cellular organisms → Bacteria515Open in IMG/M
3300018053|Ga0184626_10003464All Organisms → cellular organisms → Bacteria5874Open in IMG/M
3300018056|Ga0184623_10107250All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1291Open in IMG/M
3300018072|Ga0184635_10054456All Organisms → cellular organisms → Bacteria1548Open in IMG/M
3300018081|Ga0184625_10031312All Organisms → cellular organisms → Bacteria2604Open in IMG/M
3300018084|Ga0184629_10096356All Organisms → cellular organisms → Bacteria1441Open in IMG/M
3300018422|Ga0190265_10481579All Organisms → cellular organisms → Bacteria1351Open in IMG/M
3300018422|Ga0190265_11614244Not Available760Open in IMG/M
3300018429|Ga0190272_13026306All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300019377|Ga0190264_11057356All Organisms → cellular organisms → Bacteria656Open in IMG/M
3300019878|Ga0193715_1077772Not Available690Open in IMG/M
3300019885|Ga0193747_1095263All Organisms → cellular organisms → Bacteria724Open in IMG/M
3300020002|Ga0193730_1084614All Organisms → cellular organisms → Bacteria894Open in IMG/M
3300020015|Ga0193734_1073142Not Available600Open in IMG/M
3300020197|Ga0194128_10430809All Organisms → cellular organisms → Bacteria621Open in IMG/M
3300021051|Ga0206224_1017226All Organisms → cellular organisms → Bacteria840Open in IMG/M
3300021080|Ga0210382_10545657All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300021082|Ga0210380_10494643All Organisms → cellular organisms → Bacteria560Open in IMG/M
3300021344|Ga0193719_10015774All Organisms → cellular organisms → Bacteria3201Open in IMG/M
(restricted) 3300023208|Ga0233424_10420246Not Available500Open in IMG/M
3300025324|Ga0209640_11320085All Organisms → cellular organisms → Bacteria533Open in IMG/M
3300025903|Ga0207680_10133611All Organisms → cellular organisms → Bacteria1638Open in IMG/M
3300025916|Ga0207663_11725095Not Available504Open in IMG/M
3300025934|Ga0207686_10625441Not Available850Open in IMG/M
3300025935|Ga0207709_10759120Not Available780Open in IMG/M
3300026555|Ga0179593_1089305All Organisms → cellular organisms → Bacteria2025Open in IMG/M
3300027513|Ga0208685_1007047All Organisms → cellular organisms → Bacteria2905Open in IMG/M
3300027765|Ga0209073_10315648All Organisms → cellular organisms → Bacteria623Open in IMG/M
3300027778|Ga0209464_10389187All Organisms → cellular organisms → Bacteria513Open in IMG/M
3300027843|Ga0209798_10263589All Organisms → cellular organisms → Bacteria834Open in IMG/M
3300027874|Ga0209465_10277936All Organisms → cellular organisms → Bacteria839Open in IMG/M
3300027877|Ga0209293_10425930All Organisms → cellular organisms → Bacteria690Open in IMG/M
3300027909|Ga0209382_10190433All Organisms → cellular organisms → Bacteria2357Open in IMG/M
3300027909|Ga0209382_11600865All Organisms → cellular organisms → Bacteria644Open in IMG/M
3300027909|Ga0209382_12238815Not Available516Open in IMG/M
3300028381|Ga0268264_11451111All Organisms → cellular organisms → Bacteria697Open in IMG/M
3300028828|Ga0307312_11127111All Organisms → cellular organisms → Bacteria519Open in IMG/M
3300031170|Ga0307498_10331950All Organisms → cellular organisms → Bacteria578Open in IMG/M
3300031226|Ga0307497_10386409All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae → Paenibacillus → Paenibacillus alginolyticus665Open in IMG/M
3300031720|Ga0307469_10440163All Organisms → cellular organisms → Bacteria1125Open in IMG/M
3300031820|Ga0307473_10408459All Organisms → cellular organisms → Bacteria892Open in IMG/M
3300031943|Ga0310885_10780798All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300032180|Ga0307471_101715380All Organisms → cellular organisms → Bacteria781Open in IMG/M
3300032180|Ga0307471_102096672All Organisms → cellular organisms → Bacteria711Open in IMG/M
3300032211|Ga0310896_10817584Not Available535Open in IMG/M
3300033406|Ga0316604_10476275All Organisms → cellular organisms → Bacteria686Open in IMG/M
3300033485|Ga0316626_10495591All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1039Open in IMG/M
3300033813|Ga0364928_0194124All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300034155|Ga0370498_170363All Organisms → cellular organisms → Bacteria532Open in IMG/M
3300034354|Ga0364943_0248959All Organisms → cellular organisms → Bacteria663Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil14.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil11.54%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere9.23%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil6.92%
Wetland SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment3.85%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment3.85%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil3.85%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.08%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.08%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)2.31%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.31%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.31%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.31%
WetlandEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland1.54%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.54%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.54%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.54%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.54%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.54%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.54%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.77%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.77%
Freshwater LakeEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake0.77%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.77%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.77%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.77%
Unplanted SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Unplanted Soil0.77%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.77%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.77%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.77%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.77%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.77%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.77%
Deep Subsurface SedimentEnvironmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment0.77%
SoilEnvironmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil0.77%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.77%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere0.77%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere0.77%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.77%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.77%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.77%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300004779Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare3FreshEnvironmentalOpen in IMG/M
3300004808Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1FreshEnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005829Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBCEnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300009053Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009111Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009167Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG - Illumina Assembly (version 2)EnvironmentalOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300010040Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300011406Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT539_2EnvironmentalOpen in IMG/M
3300011409Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT423_2EnvironmentalOpen in IMG/M
3300011413Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT231_2EnvironmentalOpen in IMG/M
3300011430Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT600_2EnvironmentalOpen in IMG/M
3300011437Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2EnvironmentalOpen in IMG/M
3300011441Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2EnvironmentalOpen in IMG/M
3300011442Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT138_2EnvironmentalOpen in IMG/M
3300012038Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT800_2EnvironmentalOpen in IMG/M
3300012041Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT754_2EnvironmentalOpen in IMG/M
3300012142Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT499_2EnvironmentalOpen in IMG/M
3300012143Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT790_2EnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012232Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2EnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012480Arabidopsis rhizosphere microbial communities from North Carolina - M.Oy.5.yng.040610Host-AssociatedOpen in IMG/M
3300012484Unplanted soil (control) microbial communities from North Carolina - M.Soil.2.old.190510EnvironmentalOpen in IMG/M
3300012499Unplanted soil (control) microbial communities from North Carolina - M.Soil.2.yng.030610EnvironmentalOpen in IMG/M
3300012511Unplanted soil (control) microbial communities from North Carolina - M.Soil.8.old.080610_10EnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014874Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT660_2_16_10DEnvironmentalOpen in IMG/M
3300014876Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200_16_10DEnvironmentalOpen in IMG/M
3300014879Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10DEnvironmentalOpen in IMG/M
3300015258Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1DaEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019878Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1m2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300020015Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1m1EnvironmentalOpen in IMG/M
3300020197Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015037 Kigoma Deep Cast 65mEnvironmentalOpen in IMG/M
3300021051Subsurface sediment microbial communities from Mancos shale, Colorado, United States - Mancos A1EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300023208 (restricted)Freshwater microbial communities from Lake Towuti, South Sulawesi, Indonesia - Watercolumn_Towuti2014_125_MGEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025903Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027513Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027778Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh (SPAdes)EnvironmentalOpen in IMG/M
3300027843Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare3Fresh (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027877Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031170Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_SEnvironmentalOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032211Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1EnvironmentalOpen in IMG/M
3300033406Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day20_CTEnvironmentalOpen in IMG/M
3300033485Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D5_AEnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
F14TB_10074404913300001431SoilKRLLHMGTILQIPQAGLAATEEKLKTKRSEVIEVLKACIDGLEFTLAERDXXXXMIAKWMGLSPAQGVKAYDSVKDTFSRNGVPTEEQSKAYIAMLSSTAGVSADLVPATIFDFSLAAAAAKELASKK*
Ga0066395_1045093213300004633Tropical Forest SoilIAGTGFGNLLAYEIQFLIDRYKLGPKTTIVTAQSSLDRLIAIQKGLADGAIIPAPADLKGEEMGLKRLLQMGTVLQIPQAGLAAIDEKIKTKHGEVIEVLKASIEGLDFTLTQPEEATAIIGKWMALTPAQAAKAYESVKETYSRDGLPTQEQSKAYIAFLAATAGLSPDLPAATIFDFSLSSTAAKELAAKK*
Ga0062380_1013709823300004779Wetland SedimentIQFLIERYKLGPKTTIINAPSSIDRLIAVQKGLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIRTQRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPAQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSMSTVAAKELKK*
Ga0062380_1044333813300004779Wetland SedimentLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTQRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPIQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELKK*
Ga0062381_1004335923300004808Wetland SedimentFGNLVAYEIQFLIDRYKLGPKTTIINAPSSIDRLIAVQRGLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTQRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPIQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELKK*
Ga0065704_1005185313300005289Switchgrass RhizosphereRLVQMGSILQIPQAGLAATDEKIKTQRSEVIEVLKAAIDGLEYMATQREDAIALIGKWMALTPTQAAKAYETVKDTYSRDGVPTPEQSKAYIAMLAATAGLHADLPAATIFDFSLSAAAAKELSAVRSKW*
Ga0070676_1134872613300005328Miscanthus RhizosphereEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0070687_10137802323300005343Switchgrass RhizosphereTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIIGKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAAKK*
Ga0070668_10104823713300005347Switchgrass RhizosphereALHKGIADAAIISAPLDLKGEEMGFKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0070678_10067129423300005456Miscanthus RhizosphereSEVIEVLKAGIEGLEYTLADREDNADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPTDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK*
Ga0070664_10111163413300005564Corn RhizosphereLQTRLAEAAIVASPLDLKGEEMGLKRLLHMGTILQIPQAGLAATDETIKNKRGEALEVLKASIQGLDYTFSQREPTSELIANWMALSPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLAATADVSAKLPSAVIFDFSLASAAAKELATKK*
Ga0066705_1085353313300005569SoilKRLLHMGTILQIPQAGLAATDEKLKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKYTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0068861_10074785723300005719Switchgrass RhizosphereKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0066903_10283579113300005764Tropical Forest SoilQFLIDRYKLGPKTTIVTAQSSLDRLIAIQKGLADGAIIPAPADLKGEEMGLKRLLQMGTVLQIPQAGLAAIDEKIKTKHGEVIEVLKASIEGLDFTLTQPEEATAIISKWMALTPAQAAKAYESVKETYSRDGLPTQEQSKAYIAFLAATAGLSPDLPAATIFDFSLSSTAAKELAAKK*
Ga0066903_10785025813300005764Tropical Forest SoilRLLQMGTVLQIPQAGLAATEEKMKTNRSEVMDVLKAAIEGLEYTATQREDATALIAKWLALTPTQASKAYDAVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSATAAKDLAVRK*
Ga0074479_1076244513300005829Sediment (Intertidal)LSRAVDYTIIPSTIATAAARGAAAKVIHFSSVKLQHTLMARPEFTTVAELAGKRFGSGGLGNLTAYEINFLIERYNLGPKTTILAIASTTDRLIAMQKGIAEAAIIAAPMDLKGEEMGLRRLLHMGTVMPIPQAGLAAIDDKIKNKRGEVMEVLKASIEGLDFTATQREETADLIAKWMALTPAQSLKAYDAAKDTFSRDGVPTPEQSKAYITMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK*
Ga0074479_1103281913300005829Sediment (Intertidal)GTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIEGLEYTANQREDATAMIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNAELPLSTIFDFSLSAAAAKELAAKR*
Ga0074470_1111665823300005836Sediment (Intertidal)EIGLKRLIQMGALLQIPQAGLAATDEKLKTNRGEVLEVLKAAIDGLEYTSDQRDNATALIGNWMALTPEQSKAYIAMLAATAGLNANLPAASTFDFSLSAAAAKELATKR*
Ga0066652_10073812223300006046SoilKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0075421_10020163433300006845Populus RhizosphereMGLKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLENTHSERAENSEIISQWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYLAMLAATAGVSADLKPESIFDFSLAAAAAKDLAPKNSGFVGPSRPSF*
Ga0075421_10255828113300006845Populus RhizosphereHMGTILQIPQAGLAATEEKIKTKRSEVIEVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPELIFDFSLAAAAAKDLAQKK*
Ga0075433_1192161613300006852Populus RhizosphereTIVSVISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0075420_10187855513300006853Populus RhizosphereAEAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTDEKLKTKRSEVIDVLKSSIEGLEYTLADREDNSDIISKWMGLTAAQGVKAYDSVKDTYSRNGIPTDEQSKAYIAMLAATAGVSADLAPATIFDFSFAAAAAKELAGKK*
Ga0075425_10108528713300006854Populus RhizosphereISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTDEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGARAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0075434_10147418213300006871Populus RhizosphereAGLAATDEKLKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0068865_10091327613300006881Miscanthus RhizosphereGEELGLKRLLHMGTILQIPQAGLATTDEKIKTKRSEVIEVLKAGIEGLEYTLADREDNADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPTDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK*
Ga0075424_10263507313300006904Populus RhizosphereRLLQMGTVLQIPQAGLAATEEKMKTNRSEVMDVLKAAIEGLEYTATQREDATALIAKWLALTPTQASKAYDAVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSATAAKDLAARK*
Ga0075419_1023229323300006969Populus RhizosphereLAEAAIVASPLDLKGEEMGLKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLENTHSERAENSEIISQWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYLAMLAATAGVSADLKPESIFDFSLAAAAAKDLAPKK*
Ga0105095_1055525213300009053Freshwater SedimentFGNLVAYEIQFLIERYKLGPKTTIVNAPSSIDRLIAVQRDLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAIKAYDSVKDTYSRDGVPTLEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAARK*
Ga0115026_1067628223300009111WetlandTAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLVQMGTLMPIPQAGLGATDEKIRTNRGEVIEVLKAAIEGLEYTANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSISAAAAKELAIKR*
Ga0066709_10137440923300009137Grasslands SoilIAEAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLAATDETIKTRRGEVLEVLKASIEGLEYTLSDREDNSAIISKWMALSTSQGAKAYDSVKDTYSRNGVPTDEQSKAYIAMLAATAGVSADLKPEMIFDFSLAAAAAKELAAKR*
Ga0099792_1057631223300009143Vadose Zone SoilLIDRYRLGPKTTIVSVISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAR
Ga0114129_1004864473300009147Populus RhizosphereAGKKIAASGFGNLTSYEIQFLIDRYRLGPKTTIVSVISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLENTHSERAENSEIISQWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYLAMLAATAGVSADLKPESIFDFSLAAAAAKDLAPKK*
Ga0113563_1398943113300009167Freshwater WetlandsTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIEGLEYTTNQREDATALIGKWMALTPIQAAKAYDTVRDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSAAAAKELATKR*
Ga0105241_1125781913300009174Corn RhizosphereEKLKTKRSEVIDVLKAGIEGLEYTLNEREDNSDIISKWMGLTPAQGVKAYDSVRDTFSRNGIPTDEQSKAYIAMLAATAGISADLSPASIFDFSFAAAAAKELAARK*
Ga0105242_1217655413300009176Miscanthus RhizosphereLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0105347_102518713300009609SoilQAGLATTDEKLKTKRSEVIEVLKASIEGLEYTLAEPEDGSAIISKWMGLTAVQGVKAYDSVKDTFSRNGVPTDEQSKAYIAMLAATAGVSADLSPATIFDFSFAAAAAKELAGRK*
Ga0126308_1007551423300010040Serpentine SoilVPGLFCKPCPACTLVAYEIQFLIDRYKLGPKTTIVSAPSSLDRLITVQKGIAEAAIIAAPADIKGEEMGLKRLVQMGSILQIPQAGLAATDEKIKAQRSEVIEVLKAAIDGLDYTANQREDATSLIGKWMALTPPQAVKAYDTVKDTYSRDGVPTAEQSKAYIAMLAGTAGLSADLPASTIFDFSLSVAAAKELAAKR*
Ga0126379_1182633413300010366Tropical Forest SoilGAIIPAPADLKGEEMGLKRLLQMGTVLQIPQAGLAAIDEKIKTKHGEVIEVLKASIEGLDFTLTQPEEATAIIGKWMALTPAQAAKAYESVKETYSRDGLPTQEQSKAYIAFLAATAGLSPDLPAATIFDFSLSSTAAKELAAKK*
Ga0134124_1240715813300010397Terrestrial SoilRYKLGPKTTIVSAPSSMDRLLAVQRGIAEAAVVAAPADLKGEEMGLKRLVQMSSILQIPQAGLAATDEKIKTQRNEVIEVLKAAIDGLDYTANNREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKQ*
Ga0134121_1080731713300010401Terrestrial SoilHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALTPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0134121_1133362123300010401Terrestrial SoilGIAEAAVVPAPADLKGEEMGLKRLVQMSSILQIPQAGLAATDEKIKTQRNEVIEVLKAAIDGLDYTANHREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKQ*
Ga0134123_1142133123300010403Terrestrial SoilYEIQFLIDRYRLGSKTTIVSVISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0134123_1173969813300010403Terrestrial SoilTIVSAPSSMDRLLAVQRGIAEAAVVAAPADLKGEEMGLKRLVQMSSILQIPQAGLAATDEKIKTQRIEVIEVLKAAIDGLDYTANNREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKQ*
Ga0138514_10000521813300011003SoilLKAGIEGLEYTLADREDNADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPPDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK*
Ga0137454_106453213300011406SoilEIQFLIERYKLGAKTTIINAPSSIDRLIAVQKGLAEAAIIAAPADIKGEEMGLKRLIQMGSLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPSQAIKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLAAAAIFDFSLSASAAKELGVKK*
Ga0137323_101364813300011409SoilMFASVKLQHTLVARADIMAVTELAGKKIAGSGFGNLVAYEIQFLIDRYKLGPKTTIVSAPSSIDRLITVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGTVLQIPQAGLAATDEKIRTSRGEVIEVLKAAIDGLDYTATQKEDAIALIGKWMALTPAQASKAYDTVRDTYSRDGVPTAEQNKAYIAMLAATAGLNADLPPATIFDFSLSATAAKEIAAKK*
Ga0137333_116264013300011413SoilDIKGEEMGLKRLIQMGTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK*
Ga0137423_117416813300011430SoilAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK*
Ga0137429_115104923300011437SoilIKGEEMGLKRLVQMGTLMPIPQAGLAATDEKIRTNRGEVIEVLKAAIEGLEYTANQREDATVLIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSAAAAKELATKR*
Ga0137452_131352013300011441SoilLGPKTTFVTAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLVQMGTLMPIPQAGLAATDEKIRSNRGEVIEVLKAAIEGLEYTANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSAAAAKELATKR*
Ga0137437_100576033300011442SoilVAYEIQFLIDRYKLGPKTTIVSAPSSIDRLIAVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGSVLQIPQAGLATTDEKIRNSRVEVVEVLKAAIDGLEYTATQKEDATALIGKWMALTPAQAGKAYDTVRDTYSRDGVPTAEQSKAYIAMLAATAGLNPDLSAATIFDFSLSAAAAKELATRK*
Ga0137431_102372823300012038SoilKRLIHMGTILQIPQAGLATTDEKLKTKRSEVIEVLKASIEGLEYTLAEPEDGSAIISKWMGLTAVQGVKAYDSVKDTFSRNGVPTDEQSKAYIAMLAATAGVSADLSPATIFDFSFAAAAAKELAGRK*
Ga0137431_108309413300012038SoilVDYTIIPSTIATAAARGAAAKVIMFASVKLQHTLVARADIMAVTELAGKKIAGSGFGNLVAYEIQFLIDRYKLGPKTTIVSAPSSIDRLITVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGTVLQIPQAGLAATDEKIRTSRGEVIEVLKAAIDGLDYTATQKEDAIALIGKWMALTPAQASKAYDTVRDTYSRDGVPTAEQNKAYIAMLAATAGLNADLPPATIFDFSLSATAAKEIAAKK*
Ga0137430_101989633300012041SoilYKLGPKTTIVSAPSSIDRLIAVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGTVLQIPQAGLAATDEKIRTSRGEVIEVLKAAIDGLDYTATQKEDATALIGKWMALTPAQAGKAYDTVRDTYSRDGVPTAEQSKAYIAMLAATAGLNPDLSAATIFDFSLSAAAAKELATRK*
Ga0137343_102990913300012142SoilFGNLTSYEIQFVIDRYKLGPKTTIVSVISSTDRLLALHKGIADAAIISAPLDLKGEEMGLKRLIHMGTILQIPQAGLATTDEKLKTKRSEVIEVLKASIEGLEYTLAEPEDGSAIISKWMGLTAVQGVKAYDSVKDTFSRNGVPTDEQSKAYIAMLAATAGVSADLSPATIFDFSFAAAAAKELAGRK*
Ga0137354_107564213300012143SoilNLVAYEIQFLIERYKLGPKTTIINAPSSIDRLIAVQRGLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIRTNRGEVIEVLKAAIEGLEYTANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSISAAA
Ga0137383_1067585613300012199Vadose Zone SoilIPQAGLAATDEKIKTKRGELIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0137363_1026781213300012202Vadose Zone SoilVTDLAGKKIAGSGFGNLVAYEIQFLIDRYKLGSKTTIVSAASSIDRLIAVQKGIAEAAIIAAPADLKGEEMGLKRLVQMGSILQIPQAGLAATDEKTKTRRGEVIEVQKAAIDGLEYTATQREDATALIGKWMALTPTQAAKAYETVKDTYSRDGVPTPEQSKAYVAMLAARRDFTPICRRRLFSI
Ga0137362_1108917613300012205Vadose Zone SoilAGKRIVAAGFGNLTSYEVQFLIDRYRLGPNTTLVSAPSSTDRLIALQKGLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0137380_1127961413300012206Vadose Zone SoilVAAGFGNLTSYEVQFLIDRYRLGPNTTLVSAPSSTDRLIALQKGLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR*
Ga0137435_111921713300012232SoilRYKLGPKTTIVSAPSSIDRLIAVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGSVLQIPQAGLATTDEKIRNSRVEVVEVLKAAIDGLEYTATQKEDATALIGKWMALTPAQAGKAYDTVRDTYSRDGVPTAEQSKAYIAMLAATAGLNPDLSAATIFDFSLSAAAAKELATRK*
Ga0137369_1090547113300012355Vadose Zone SoilISSTDRLIAMHKGITDAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLEYTLSEREENSEIISKWMALNPSQGARAYDSVKDTFSRNGIPTDEQSKAYIAMLAATASVSAELKPESIFDFSLAAAATKDLAQKK*
Ga0157346_101005223300012480Arabidopsis RhizosphereRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0157333_103185413300012484SoilHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAKLKPESIFDFSLAAAAAKDLAPKK*
Ga0157350_100764913300012499Unplanted SoilTDEKIKGRRAEVIEVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0157332_102925013300012511SoilKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0137359_1161054313300012923Vadose Zone SoilGLKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLEYTLSEREENSEIISKWMALNPSQGIRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAATKDLAQKK*
Ga0137404_1103782413300012929Vadose Zone SoilGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLRASIQGHEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAATKDLAQKK*
Ga0164302_1005540613300012961SoilQRGIAEAAVVAAPADLKGEEMGLKRLVQMSSILQIPQAGLAATDEKIKTQRNEVIEVLKAAIDGLDYTANHREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKQ*
Ga0157378_1002575313300013297Miscanthus RhizosphereAVVAAPADLKGEEMGLKRLVQMSSILQIPQAGLAATDEKIKTQRNEVIEVLKAAIDGLDYTANHREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKQ*
Ga0163162_1010307213300013306Switchgrass RhizosphereEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLARKK*
Ga0157375_1092645513300013308Miscanthus RhizosphereKLGPKTTLVSVISSTDRLLAMHKGIADAAIISAPLDLKGEELGLKRLLHMGTILQIPQAGLATTDEKIKTKRSEVIEVLKAGIEGLEYTLADREDNADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPTDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK*
Ga0157375_1215329313300013308Miscanthus RhizosphereVLKASIEGLEYTLSEREENSEIISKWMALTPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0180084_105640213300014874SoilELAGKKIAGSGFGNLVAYEIQFLIERYKLGAKTTIINAPSSIDRLIAVQKGLAEAAIIAAPADIKGEEMGLKRLIQMGTLLQIPQAGLAATDEKSKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAIKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK*
Ga0180064_113415413300014876SoilNLTSYEIQFVIDRYKLGPKTTIVSVISSTDRLLALHKGIADAAIISAPLDLKGEEMGLKRLIHMGTILQIPQAGLATTDEKLKTKRSEVIEVLKASIEGLEYTLAEPEDGSAIISKWMGLTAVQGVKAYDSVKDTFSRNGVPTDEQSKAYIAMLAATAGVSADLSPATIFDFSFA
Ga0180062_105137123300014879SoilTTDEKIKTKRSEVIEVLKAGIEGLEYTLNEREDTADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPTDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK*
Ga0180093_107975123300015258SoilLIQMGSLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK*
Ga0132257_10041425823300015373Arabidopsis RhizosphereDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLATTAGVSAELKPESIFDFSLAAAAAKDLAPKK*
Ga0132257_10248540113300015373Arabidopsis RhizosphereKGEEMGLKRLLHMGTILQIPQAGLAATDETIKNKRGEALEVLKASIEGLDYTFSQREPTTELIANWMALNPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLAATAGVNAKLPSAAIFDFSLAGAAAKELATKK*
Ga0132255_10596563813300015374Arabidopsis RhizosphereFLIEHYHLGPQTTIVSASSSTDRLIALQTRLAEAAIVASPLYLKGEEIGLKRLLHMGTILQIPQAGLAATDETIKNKRGEALEVLKASIEGLDYTFSQREPTTELIANWMALSPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLAATAGVSAKLPSAAIFDFSLASAAA
Ga0184626_1000346473300018053Groundwater SedimentMGTILQIPQSGLATTDEKLKTKRSEVIEVLKACIDGLEYTLAEREDNSDIIAKWMGLNSAQGMKAYDYVKDTFLRNGVPTDEQSNAYIAMLAATAAVSADLQTATIFDFSLAAAAAKELGSIK
Ga0184623_1010725023300018056Groundwater SedimentEEMGLKRLLQMGTVLQIPQAGLAATDEKIKTTRAEVIEVLKAAIDGLDYTATQKEDATALIGKWMALTPTEASKAYDSVKDTYSRDGVPTAEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAVRK
Ga0184635_1005445613300018072Groundwater SedimentKIAGSGFGNLVAYEIQFLIDRYKLGSKTTIVSAASSIDRLIAVQKGIAEAAIIAAPADLKGEEMGLKRLVQMGSILQIPQAGLAATDEKTKTQRSEVIEVLKAAIDGLEYTATQREDATALIGKWMALTPTQAAKAYETVKDTYSRDGVPTPEQSKAYIAMLAATAGLHADLPAATIFDFSLSAAAAKELSAVRSK
Ga0184625_1003131233300018081Groundwater SedimentMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSVDLKTESIFDFSLAAAAAKDLAPKK
Ga0184629_1009635623300018084Groundwater SedimentIQFLIDRYKLGPKTTIVSAPSSIDRLIAVQRGIAEAAIIAAPADIKGEEMGLKRLLQMGSVLQIPQAGLATTDEKIRNSRVEVVEVLKAAIDGLEYTATQKEDATALIGKWMALTPAQAGKAYDTVRDTYSRDGVPTAEQSKAYIAMLAATAGLNPDLSAATIFDFSLSAAAAKELATRK
Ga0190265_1048157913300018422SoilDEKLKTKRGEVIEVLKASIEGLEYTVTEREDNSDIISKWMGLTAAQGVRAYDSVKDTYSRNGIPTDEQSKAYIAMLAATAGVSADLAPATIFDFSFAAAAAKELAGKK
Ga0190265_1161424413300018422SoilKRLLHMGSILQIPQAGLATTDEILKTKRGEVIEVFKASIEGLEYTLAEREDNSDIITKWMGLTAAQGVRAYDSVKDTYSRNGIPTDEQSKAYIAMLAATAGVSADLAPATIFDFSFAAAAAKELTGKK
Ga0190272_1302630613300018429SoilAGSGFGNLVAYEIQFLIERYKLGAKTTIVNAPSSIDRLIAVQRGLADAAIIAAPADIKGEEMGLKRLVQMGTLLHIPQAGLAATDEKIKANRAEVIEVLKGAIEGLEYTANQREDATALIGKWMALTPIQASKAYESVKDTYSRDGVPTSEQSKAYIAMLAATAGLSADLPAA
Ga0190264_1105735613300019377SoilHGQRSRRKENRRQRFGNLVAYEIQFIIDRYKLGPKTTIISAPSSIDRLIAVQKGIAEGAIIAAPADLKGEEMGVKRLLQMGTILQIPQAGLAATDEKLKTNRGEVIEVLKAAIDGLDYTAVQKEDAAALIGKWMSLTPAQASKAYDTVKDTFSRDGVPTQEQSKAYIAMLAATAGLSADLPAPTAPRNATRSPGWMARVIP
Ga0193715_107777223300019878SoilQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAVAAKELSAKR
Ga0193747_109526313300019885SoilLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAAIAGVSPDLKPEMIFDFSLAAAAAKELAAKR
Ga0193730_108461413300020002SoilRLIALQKGLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAAIAGVSPDLKPEMIFDFSLAAVAAKELSAKR
Ga0193734_107314213300020015SoilEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAVAAKELSAKR
Ga0194128_1043080913300020197Freshwater LakeGFGNLVAYEIQFLIDRYKLGPKTTIINAPSSIDRLIAIQKGLAEGAIIAAPADIKGEEMGLKRLIQMGMVLQIPQAGLAATDEKIKASRGEVSEVLKAAIEGLDYTATQREEATALIGKWMALTPAQAVKAYDSVRDTYSRDGVPSAEQSKAYIAMLAATVGLNADLPPATIFDFSLSAAAAKEIAAKK
Ga0206224_101722623300021051Deep Subsurface SedimentMTEVSDQRSIDRYKLGPKTTIISAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLLQMGTILQILQAGLTATDEKIKNNRAEVIEVLKAAIDGLEYTVTQKEDATALIGKWMALTPAQASQAYDTVKDTFCRDGVPTPEQSKAYIAMLATTAGLSADLPAATI
Ga0210382_1054565713300021080Groundwater SedimentEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAVAAKELSAKR
Ga0210380_1049464313300021082Groundwater SedimentLIERYKLGPKTTIINAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLIQMGSLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK
Ga0193719_1001577443300021344SoilDRYRLGPNTTLVSAPSSTDRLIALQKGLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAAIAGVSPDLKPEMIFDFSLAAAAAKELAAKR
(restricted) Ga0233424_1042024613300023208FreshwaterFLIDRYKLGPKTTMVTAASSIDRMIAVQKGLADAAVIAAPADIKGEEMGLKRLLVIGSVLQIPQAGLAATDEKLKTHRSEVLEVLKGAIEGLEYTANQKEDATALIGKWMALTPPQAARAYETVKDTYSRDGVPTPEQSKAYIAMLAATAGVSPDLAAATIFDFSL
Ga0209640_1132008513300025324SoilVQKGIAEAAIIAAPADIKGEEMGLKRLLQMGTILQIPQAGLAATDEKIKNNRAEVIEVLKAAIDGLEYTATQKEDATALIGKWMALTPAQASKAYDTVKDTFCRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFS
Ga0207680_1013361113300025903Switchgrass RhizosphereQIPQAGLAATDEKIKTQRNEVIEVLKAAIDGLDYTANHREEAVALIGKWMALTPAQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLSPDLQAATIFDFSLSAAAASAVRSKRTHVT
Ga0207663_1172509523300025916Corn, Switchgrass And Miscanthus RhizosphereTKRSEVLDVLKSSIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLARKK
Ga0207686_1062544123300025934Miscanthus RhizosphereEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK
Ga0207709_1075912023300025935Miscanthus RhizosphereRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK
Ga0179593_108930543300026555Vadose Zone SoilMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELNPNRFLISHSLPQRPKIWRQKNSRLAGPSRPSF
Ga0208685_100704713300027513SoilGLATTDEKLKTKRSEVIEVLKASIEGLEYTLAEPEDGSAIISKWMGLTAVQGVKAYDSVKDTFSRNGVPTDEQSKAYIAMLAATAGVSADLSPATIFDFSFAAAAAKELAGRK
Ga0209073_1031564823300027765Agricultural SoilQFLIERYHLGPQTTIVSASSSTDRLIALQTRLAEAAIVASPTDLKGEEMGLKRFLHMGTILQIPQAGLAATDETIKNKRGEALEVLKASIEGLDYTFSQREPTTELIANWMALNPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLGHCRC
Ga0209464_1038918713300027778Wetland SedimentGFGNLVAYEIQFLIDRYKLGPKTTIINAPSSIDRLIAVQRGLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTQRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPIQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAAT
Ga0209798_1026358923300027843Wetland SedimentIERYKLGPKTTIINAPSSIDRLIAVQKGLAEAAIIAAPADIKGEEMGLKRLVQMGTLLQIPQAGLAATDEKIKTNRGEVLEVLKAAIDGLEYTANQREDATALIGKWMALTPIQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLTAAAAKELAAKK
Ga0209465_1027793613300027874Tropical Forest SoilLAGKRIAGTGFGNLLAYEIQFLIDRYKLGPKTTIVTAQSSLDRLIAIQKGLADGAIIPAPADLKGEEMGLKRLLQMGTVLQIPQAGLAAIDEKIKTKHGEVIEVLKASIEGLDFTLTQPEEATAIIGKWMALTPAQAAKAYESVKETYSRDGLPTQEQSKAYIAFLAATAGLSPDLPAATIFDFSLSSTAAKELAAKK
Ga0209293_1042593023300027877WetlandQKGIAEAAIIAAPADIKGEEMGLKRLVQVGTLMPIPQAGLGATDEKIRTNRGEVIEVLKAAIEGLEYTANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSISAAAAKELAIKR
Ga0209382_1019043313300027909Populus RhizosphereMGFKRLLHMGTILQIPQAGLAATEEKIKTKRSEVLEVLKASIEGLENTHSERAENSEIISQWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYLAMLAATAGVSADLKPESIFDFSLAAAAAKDLAPKNSGFVGPSRPSF
Ga0209382_1160086513300027909Populus RhizosphereYYEIQFLIERYHLGPQTTIVSASSSTDRLIALQKRLAEAAIVASPLDQKGEEMGLKRLLHMGTVLQIPQAGLAATDETIKNKRSEVLEVLKASIEGLEHTSSRREPTIELIGNWMALNSVQAIKAYDSVRDTYSQNGIPTEEQAKAYIAMLAATAGVSGNLPSAAIFDFSLASAAAKELAAKK
Ga0209382_1223881513300027909Populus RhizosphereTEEKIKTKRSEVIEVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPELIFDFSLAAAAAKDLAQKK
Ga0268264_1145111123300028381Switchgrass RhizosphereVASEAAAAAIVASPLDLKGEEMGLKRLLHMGTILQIPQAGLAATDETIKNKRGEALEVLKASIQGLDYTFSQREPTSELIANWMALSPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLAATAGVSAKLPSAAIF
Ga0307312_1112711113300028828SoilAAVNDLAGKRIVAAGFGNLTSYEVQFLIDRYRLGPNTTLVSAPSSTDRLIALQKGLAEAAIIAAPAHLKGEEMGLKRLLHMGTILQIPQAGLAATDEKIKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAA
Ga0307498_1033195013300031170SoilGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAQRPKIWRQKNKDLVEPSLSLP
Ga0307497_1038640913300031226SoilFLIDRYRLGSKTTIVSVISSTDRLIAMHKGIADAAIISAPLDLKGEEMGLKRLLHMGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK
Ga0307469_1044016323300031720Hardwood Forest SoilLGTILQIPQAGLATTEEKIKTKRSEVLDVLKASIEGLEYTLSEREENSEIISKWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSAELKPESIFDFSLAAAAAKDLAPKK
Ga0307473_1040845923300031820Hardwood Forest SoilLVSAPSSTDRLIALQKGLAEAAIIAAPADLKGEEMGLKRLLHMGTILQIPQAGLAATDEKLKTKRGEVIEVLKASIEGLEYTLSDREDNSAIIAKWMALNASQGVRAYDSVKDTFSRNGIPTDEQSKAYIAMLAATAGVSPDLKPEMIFDFSLAAAAAKELAAKR
Ga0310885_1078079813300031943SoilEMGLKRLLHMGTILQIPQAGLAATDETIKNKRREALEVLKASIEGLDYTFSQREPTTELIANWMALSPIQANKAYDSVKDTYSQNGIPTEEQAKAYIAMLAATADVSAKLPSAVIFDFSLASAAAKELATKK
Ga0307471_10171538023300032180Hardwood Forest SoilIQKGLAEGAVVASPTDIKGEEMGLKRLLQMGTVLQIPQAGLAATEEKIKTNRSEVMEVLKAAIEGLDYTATQREDAIAMIAKWMALTPAQANKAYEAVKDTYSRDGVPTAEQSKAYIAMLAATAGLSPDLSPTTIFDFSLSATAAKELAARK
Ga0307471_10209667213300032180Hardwood Forest SoilSSIDRLIAVQKGIAEAAIIAAPADLKGEEMGLKRLVQMGSILQIPQAGLAATDEKTKTQRGEVIEVLKAAIDGLEYTATQREDATALIGKWMALTPTQAAKAYETVKDTYSRDGVPTPEQSKAYIAMLAATAGLHADLPAATIFDFSLSAAAAKELSAVRSKW
Ga0310896_1081758423300032211SoilASIEGLENTHSERAENSEIISQWMALNPSQGVRAYDSVKDTFSRNGIPTDEQSKAYLAMLAATAGVSADLKPESIFDFSLAAAAAKDLAPKNSGFVGPSRPSF
Ga0316604_1047627523300033406SoilIQFLIDRYKLGPKTTIVTAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLVQMGTLMPIPQAGLAATDDKIKTNRGEVIDVLKAAIEGLEYTANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSLSAAAAKELATKR
Ga0316626_1049559123300033485SoilAPADIKGEEMGLKRLVQMGTLMPIPQAGLAATDDKIKTNRGEVIDVLKAAIEGLEYAANQREDATALIGKWMALTPIQAAKAYDTVKDTYSRDGVPTPEQSKAYIAMLAATAGLNADLPASTIFDFSISAAAAKELAIKR
Ga0364928_0194124_3_5093300033813SedimentNLVAYEIQFLIDRYRLGPKTTIIHAPSSIDRLIAVQKGIAEAAIIAAPADIKGEEMGLKRLLQMGTVLQIPQAGLAATDEKIKNSRGEVIEVLKAAIDGLDYTANQREDATALIGKWMALTPAQANKAYDAVRDTYSRDGVPTAEQSKAYIAMLAATAGLNADLPPGTI
Ga0370498_170363_1_4113300034155Untreated Peat SoilLKGEELGLKRLLHMGTILQIPQAGLATTDEKIKTKRSEVIEVLKAGIEGLEYTLADREDNADIIAKWMGLTAAQGVKAYDSVKDTFSRNGIPTDEQSKAYISMLAATAGVSPDLAPTTIFDFSLAAAAAKELAVKK
Ga0364943_0248959_264_6593300034354SedimentMGLKRLIQMGSLLQIPQAGLAATDEKIKTNRGEVIEVLKAAIDGLEYTANQREDATALIGKWMALTPTQAAKAYDSVKDTYSRDGVPTPEQSKAYIAMLAATAGLSADLPAATIFDFSLSAAAAKELAAKK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.