NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F086238

Metagenome Family F086238

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F086238
Family Type Metagenome
Number of Sequences 111
Average Sequence Length 270 residues
Representative Sequence VRFIPPAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAMQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Number of Associated Samples 95
Number of Associated Scaffolds 111

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 8.11 %
% of genes near scaffold ends (potentially truncated) 92.79 %
% of genes from short scaffolds (< 2000 bps) 92.79 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.099 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(18.919 % of family members)
Environment Ontology (ENVO) Unclassified
(27.928 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.856 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 52.17%    β-sheet: 4.01%    Coil/Unstructured: 43.81%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.45
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 111 Family Scaffolds
PF02540NAD_synthase 4.50
PF17137DUF5110 0.90
PF05990DUF900 0.90
PF14023DUF4239 0.90
PF13439Glyco_transf_4 0.90

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 111 Family Scaffolds
COG0171NH3-dependent NAD+ synthetaseCoenzyme transport and metabolism [H] 4.50
COG4782Esterase/lipase superfamily enzymeGeneral function prediction only [R] 0.90


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_16842061All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1146Open in IMG/M
2088090014|GPIPI_17359008All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1030Open in IMG/M
2166559005|cont_contig19132All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1277Open in IMG/M
2199352025|deepsgr__Contig_122137All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1009Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100615969All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium829Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100616258All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium836Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100616495All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium930Open in IMG/M
3300000955|JGI1027J12803_108956986All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium991Open in IMG/M
3300000955|JGI1027J12803_108995950All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium922Open in IMG/M
3300004114|Ga0062593_101011999All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium853Open in IMG/M
3300004156|Ga0062589_100911933All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium810Open in IMG/M
3300004157|Ga0062590_100702939All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium911Open in IMG/M
3300004463|Ga0063356_102640555All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium771Open in IMG/M
3300004643|Ga0062591_100174810All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1538Open in IMG/M
3300005167|Ga0066672_10215563All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1228Open in IMG/M
3300005171|Ga0066677_10447029All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium740Open in IMG/M
3300005178|Ga0066688_10533534All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium755Open in IMG/M
3300005179|Ga0066684_10002472All Organisms → cellular organisms → Bacteria7715Open in IMG/M
3300005179|Ga0066684_10411249All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium909Open in IMG/M
3300005180|Ga0066685_10678279All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium708Open in IMG/M
3300005184|Ga0066671_10343732All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium945Open in IMG/M
3300005294|Ga0065705_10305777All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1043Open in IMG/M
3300005294|Ga0065705_10433037All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium844Open in IMG/M
3300005294|Ga0065705_10509351All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium771Open in IMG/M
3300005295|Ga0065707_10236324All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1171Open in IMG/M
3300005332|Ga0066388_100513073All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1833Open in IMG/M
3300005332|Ga0066388_103673347All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium783Open in IMG/M
3300005354|Ga0070675_100617270All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium984Open in IMG/M
3300005354|Ga0070675_101344932All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium658Open in IMG/M
3300005434|Ga0070709_10706417All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium785Open in IMG/M
3300005451|Ga0066681_10351256All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium906Open in IMG/M
3300005548|Ga0070665_101177433All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium777Open in IMG/M
3300005568|Ga0066703_10330180All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium919Open in IMG/M
3300005574|Ga0066694_10328693All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium726Open in IMG/M
3300005575|Ga0066702_10311639All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium959Open in IMG/M
3300005764|Ga0066903_104350785All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Cereibacter → Cereibacter sphaeroides757Open in IMG/M
3300006028|Ga0070717_11115407All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium718Open in IMG/M
3300006031|Ga0066651_10311919All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium843Open in IMG/M
3300006163|Ga0070715_10249877All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium926Open in IMG/M
3300006854|Ga0075425_101103880All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium903Open in IMG/M
3300007788|Ga0099795_10207866All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium828Open in IMG/M
3300009012|Ga0066710_100533188All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1774Open in IMG/M
3300009012|Ga0066710_101464985All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1055Open in IMG/M
3300009137|Ga0066709_101441583All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium998Open in IMG/M
3300009137|Ga0066709_101552210All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium951Open in IMG/M
3300009137|Ga0066709_102511519All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium694Open in IMG/M
3300010043|Ga0126380_10686051All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium821Open in IMG/M
3300010047|Ga0126382_10677815All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium861Open in IMG/M
3300010048|Ga0126373_11572119All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium722Open in IMG/M
3300010359|Ga0126376_10894481All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium877Open in IMG/M
3300010360|Ga0126372_11490696All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium712Open in IMG/M
3300010376|Ga0126381_102563041All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium730Open in IMG/M
3300010376|Ga0126381_102647587All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium717Open in IMG/M
3300010863|Ga0124850_1038710All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1519Open in IMG/M
3300012207|Ga0137381_10708706All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium875Open in IMG/M
3300012209|Ga0137379_10295488All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1530Open in IMG/M
3300012285|Ga0137370_10236073All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1080Open in IMG/M
3300012353|Ga0137367_10159009All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1647Open in IMG/M
3300012355|Ga0137369_10509617All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium849Open in IMG/M
3300012356|Ga0137371_10533133All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium905Open in IMG/M
3300012918|Ga0137396_10071302All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2428Open in IMG/M
3300012922|Ga0137394_10760787All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium813Open in IMG/M
3300012958|Ga0164299_10047085All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1992Open in IMG/M
3300012961|Ga0164302_10426674All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium913Open in IMG/M
3300013306|Ga0163162_11441294All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium784Open in IMG/M
3300014325|Ga0163163_10812592All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium998Open in IMG/M
3300015371|Ga0132258_13314631All Organisms → Viruses → Predicted Viral1108Open in IMG/M
3300015374|Ga0132255_101263426All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1113Open in IMG/M
3300016357|Ga0182032_10198371All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1526Open in IMG/M
3300018071|Ga0184618_10077501All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1264Open in IMG/M
3300018076|Ga0184609_10170883All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1007Open in IMG/M
3300018431|Ga0066655_10120445All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1507Open in IMG/M
3300018433|Ga0066667_10720434All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium839Open in IMG/M
3300018468|Ga0066662_11619255All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium676Open in IMG/M
3300018482|Ga0066669_10954454All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium771Open in IMG/M
3300019867|Ga0193704_1028945All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1121Open in IMG/M
3300019869|Ga0193705_1001042All Organisms → cellular organisms → Bacteria6241Open in IMG/M
3300019873|Ga0193700_1011739All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1418Open in IMG/M
3300019877|Ga0193722_1079859All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium804Open in IMG/M
3300020002|Ga0193730_1063153All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1061Open in IMG/M
3300021078|Ga0210381_10063114All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1139Open in IMG/M
3300021080|Ga0210382_10334247All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium668Open in IMG/M
3300021086|Ga0179596_10333591All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium760Open in IMG/M
3300021168|Ga0210406_10561627All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium895Open in IMG/M
3300021344|Ga0193719_10021017All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2790Open in IMG/M
3300022756|Ga0222622_10236668All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1230Open in IMG/M
3300023058|Ga0193714_1011598All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1338Open in IMG/M
3300025931|Ga0207644_10328528All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1238Open in IMG/M
3300025939|Ga0207665_10387511All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1062Open in IMG/M
3300025960|Ga0207651_10759160All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium858Open in IMG/M
3300026310|Ga0209239_1185241All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium777Open in IMG/M
3300026312|Ga0209153_1115087All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1017Open in IMG/M
3300026316|Ga0209155_1115399All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium943Open in IMG/M
3300026530|Ga0209807_1004981All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium6547Open in IMG/M
3300026547|Ga0209156_10194665All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium968Open in IMG/M
3300027512|Ga0209179_1087505All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium690Open in IMG/M
3300028711|Ga0307293_10103895All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium902Open in IMG/M
3300028791|Ga0307290_10012910All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2840Open in IMG/M
3300028793|Ga0307299_10136647All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium921Open in IMG/M
3300028793|Ga0307299_10207141All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium738Open in IMG/M
3300028807|Ga0307305_10000593All Organisms → cellular organisms → Bacteria13414Open in IMG/M
3300028811|Ga0307292_10259817All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium722Open in IMG/M
3300031231|Ga0170824_101639911All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium820Open in IMG/M
3300031474|Ga0170818_107045782All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium782Open in IMG/M
3300031474|Ga0170818_112209709All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1227Open in IMG/M
3300031474|Ga0170818_114912454All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium698Open in IMG/M
3300031910|Ga0306923_10595255All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1242Open in IMG/M
3300031947|Ga0310909_10783594All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium788Open in IMG/M
3300032001|Ga0306922_11231531All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium760Open in IMG/M
3300032076|Ga0306924_10892579All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium984Open in IMG/M
3300032180|Ga0307471_100006062All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia7639Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil18.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil13.51%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil9.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil9.01%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil6.31%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere3.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.60%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil3.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.60%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.60%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.60%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere2.70%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.80%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.80%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.80%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.80%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.80%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.90%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.90%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.90%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.90%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.90%
Switchgrass RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere0.90%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.90%
SimulatedEngineered → Modeled → Simulated Communities (Sequence Read Mixture) → Unclassified → Unclassified → Simulated0.90%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2166559005Simulated microbial communities from Lyon, FranceEngineeredOpen in IMG/M
2199352025Soil microbial communities from Rothamsted, UK, for project Deep Soil - DEEP SOILEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005574Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010863Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (PacBio error correction)EnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016357Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019867Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1EnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019873Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s1EnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300023058Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2m1EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025960Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026312Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300027512Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031474Fir Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_005677602088090014SoilLVYSIHYASHQWDSASNIPQHGNEGNRGSKAHRSPGYRIILADAGRGAELLKLRSDNATIGELIARYRANASQRPDTVRSNVRSLRMIVKTVHPGDPDAKPTSLLTASLIRQFEKRQFERAQKHATAETRATAIQRVRSSTSSYVRQGRSIVALRKMKFYDGMKLPDLAGFRGENVESPKRSLPRPLDMKALEAMEGAAPALAKDDLGAYVAHLLFSRLGLRNIEILNARTHWINAGSIGIINRTEEDFFPKGCEGWVPIAPDVLKEVLSFQPLCVDGYLVPGANLTERHHAVYRRQIEKGQPSSFVKSDDLAVNN
GPIPI_013025402088090014SoilLVYSIHYASHQWDSASNIPQHGNEGNRGSKAHRSPGYRIILADAGRGAELLKLRSDNATIGELIARYRANASQRPDTVRSNVRSLRMIVKTVHPGDPDAKPTSLLTASLIRQFEKRQFERAQKHATAETRATAIQRVRSSTSSYVRQGRSIVALRKMKFYDGMKLPDLAGFRGENVESPKRSLPRPLDMKALEAMEGAAPALAKDDLGAYVAHLLFSRLGLRNIEILNARTHWINAGSIGIINRTEEDFFPKGCEGWVPIAPDVLKEVLSFQPLCVDGYLVPGANLTERHHAVYRR
cont_0132.000017702166559005SimulatedHVRFTPPSTDGTRRVIFRSTGTKEIAAAKRIAARIIESFWTDAGRGAEPLKLHNNNASVGELITKYRGNAAQRPDTLRSNIRSLRMVIKTVHHGDPDTKSTSVLTPSLIREFEKRQLERAEKRATAVSRSAVIQRVRSSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPKRSLPRPLDMKALTAMEAAEAALAKGDPGVYVAHLLFSRLGLRNIEIVNARIHWISDGSIGIVNRSEEDFFPKGCEGWVPIAPDVLKEYSLSSRCVPMAI
deepsgr_019495402199352025SoilPPAVDGVSRVVFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIERYRASAAQRPTTIRSNVRSLRMIVKTVHSGDPDIKPTSILTANLIREFEKRQLERVEKRPAGVSRVSAMQRVRTSTGSYVRQARAIVALRKMKFYEGLKLPDLTGFRGESVEAPKRSLPRPLDMKALAAMEAAAPLLARDDPGAYVAHLLFSRLGAPQHRNRERPHSLDQRRQHWHHQSPRGGXXXERL
INPhiseqgaiiFebDRAFT_10061596913300000364SoilSIK*RVLLXLGTMRDRKLLIPARCTTRKFYLQKPPKGRNDWHVRFSPPPVNGMSRVVFRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEALKLRNDNAIIGELIERYESNAVQRPRTIRGNVRSLRLIVKTVHNGDPDQKPTTVLTSSLIRDFEKRRILRAEERATATTRAGVIQRTRNSTASFVRQARSIIAMRKMKFYEDLKLPELAPFRGESVEPPHRALPRPLDMKALNAMEVATPTLATNDPAVYVAHLLFSRLGLRNIEIVNARTHWIS
INPhiseqgaiiFebDRAFT_10061625813300000364SoilMSRVVFRSTGTKEIAAAKRIAAQIIESFWNXAGRGAEALKLRNDNATIGELIGRYERNAVQRPRTIRGNVRSLRLIVRTVHNGDPDQRPTTVLTSSLIRDFEKRRIVRAEERATVATRAAVIQRTRNSTASFVRQARSIVALRKMKFYEDLKLADFAAFRGESVETPHRSLPRPLDMKALSAMEGATPKLAAGDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGS
INPhiseqgaiiFebDRAFT_10061649513300000364SoilVKRDPRLLIPVRCTTRKIFLQHPPKGRNDWHVRFTAPAIDGSRREIFRSTGTKEVAAAKRIAAQIIESFWSDAGRGAEPLKLRNDNATIRELIERYERNAVQRPRTIRGNVRSLRLIVKTVRNGNPDDKPTTVLTASLIREFEKRRLQKAERRATSSTRAAVIHRTRNSTASFVRQARSIVALRKMKFYEDLKLPELGAFRGESVETPQRSLPRPLDMKALQGMEGAVPKLALEDSAVYVAHLLFSRLGLRNIEILNARTHWISDGXIGIINRP
JGI1027J12803_10895698613300000955SoilPVRCTTRNLYLHRPPPGRNDWHVRFTPPSVDGMRRVVFRSTGTKEIAAAKRIAAQIIESFWIDAGRGAETLKLRNNNATIGELITTYQDNAAQRPSTVRSNIRSLRMIMKTVHRGDPDIKPTSLLTANLIREFEKRQLARGEKHATSATRSAVIQRVRTSTASYVRQARSIVALRKMKFYETLKLPDLSGFRGESVETPQRSLPRPLDMKALAAMEAAEPALDKNDPGAYVSHLLFSRIGLRNIEIVNARTHWISDGSIGIVSRPEEDXXXXAAMNAAAPVLAESDPGAYVAHLLFSRVGLRNIEIVNARVHWISNGSIGIVNRPEEDFF
JGI1027J12803_10899595013300000955SoilMKDPKLLVPVRCTTRKFYLHKPPPGRNDWHVRFTPPAIDGIRRVIFRSTGTKEIGAAKRIAAEIIKSFWIDSGRSAVRLKLRDDNATIGELIANYKQRAAQRPGTIRSNIRSLRMVIKTVHNGDPDLKSSSVLTANLIREFEKRQLERAEKRATPATRASVVQKVRISTASYVRQARAIVALRKMKFYERLKLPDLSGFRGESVEAPKRSLPRPLDMKALAAMEEAAPGLASDDPGAYVAHLLFSRLGLRNIEIVNARTHWIN
Ga0062593_10101199913300004114SoilEIASAKRIAARIIESFWTDAGRGAEPLKLRNNNALIGELITKYKENARQRPDTLRSNIRSLRMITKTVHRGDPDTKSTALLTANLIREFEKRQLERAEKRATAATRSTVIQRVRSSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPKRSLPRPLDMKALTAMEAAEAALAKDDPGVYVAHLLFSRLGFRNIEIVNARIHWISDGSIGIVNRPEEDFFPKGCEGWVPIAPDVLKKILSFQPLSTDGYLVPGANQTERHDAVYRRHSKWISQWIKG
Ga0062589_10091193313300004156SoilPGRNDWHVRFTPPSTDGTRRVIFRSTGTKEIAAAKRIAARIIESFWTDAGRGAEPLKLRNNNVSIGELITKYKENARQRPDTLRSNIRSLRMIIKTVHHGNPDTKSTALLTANLIREFEKRQLERAEKRATPASRSAAIQRVRSSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPKRSLPRPLDMKALAAMEAAEATLAKDDPGAYVAHLLFSRLGLRNIEIVNARVPWVSDGSIGIVNRPEEDFFPKGCEGWVPIAPDVL
Ga0062590_10070293913300004157SoilMYRSVIIDPVKDRKLLIPIRCTTRNLYLHRPPPGRNDWHVRFTPPSTDGTRRVIFRSTGTKEIAAAKRIAARIIESFWTDAGRGAEPLKLRNNNVSIGELITKYKENARQRPDTLRSNIRSLRMIIKTVHHGNPDTKSTALLTANLIREFEKRQLERAEKRATPASRSAAIQRVRSSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPKRSLPRPLDMKALAAMEAAEATLAKDDPGAYVAHLLFSRLGLRNIEIVNAR
Ga0063356_10264055513300004463Arabidopsis Thaliana RhizosphereEVAAAKRIAAQIIESFWNDAGLGAEPLKLRNTNASIGELIARYEDRATQRWDTVRGNIRSLRMIVKTVYSGDPDDRSTSVLTPELIREFEKRLLKAAEKDVTQTQRAAAIQRARNSIGSYVRQARSIVALRKMKFYDGMALPNFAPFRGESVESPRRSLPKPLNMRGLAEMEAATPELAESDPGAYVAHLLFSRLGLRNIEIVSARTHWINDGSIGIVDRPEEDFFPKGCEGWVPIAPDVLAEVLRFQPLCTDGFL
Ga0062591_10017481013300004643SoilMKDRKLRIPARCTTRKFFLHPPPPGRNDWHVRFTPPAIDGVRRVIFRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEDLKLRNDHATIGELIERYEERAAQRPTTVRSNSRSLRMIVRTVHPGDPDERSTSALTADLIREFEKRQLARVEKRATPSTRTVAIQRVRNSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPRRSLPRPLDMEQLAKMEEATPDLAEKYPAVYVAHLLFSRLGLRNIEIVMRGEHLALTNVLNSASLLPVPAMILLRVQCA*
Ga0066672_1021556313300005167SoilMKDRKLLIPVRSTTRKFYLHPPPAGRNDWHVRFTPPAINGSRRVVFRSTGTKEIAAAKRIASQIIESFWNDAGRGAEPLKLRNDNATIGELIERYEGNAVQRPRTIRGNVRSLRLIVKTVHNGDPDQKPTTVLTSSLIRDFEKRRLQRAEERATAATRAGVIQRTRNSTASFVRQARSIVALRKMKFYEDLKLPDLAAFRGESVETPHRSLPRPLDMKALSAMEAATPKLASEDPAVYVAHLLFSRLGLRNIEIVNARR
Ga0066677_1044702913300005171SoilGEHLHMKNPKLLIPIRCTTRKIYLQPPPQGRNDWHVRFTAPSVDGSRREIFRSTGTKEISAAKRIAAQIIESFWTDAGRWAEPLKLRNDNATVGALIERYRANARQRRDTIQGNVQSLRLMIRTVHHGDPDRHPSTVLNGRLVREFERKRIAEAEKLATAETRASILQRVRTSTASYLRQARSVVAPAKIKFYEGMKLPDLSGFRAERVETPQRSLPRPLDMKALAAMEGATPNLATEDPAVYVAH
Ga0066688_1053353413300005178SoilGEHLHMKNPKLLIPIRCTTRKIYLQPPPQGRNDWHVRFTAPSVDGSRREIFRSTGTKEISAAKRIAAQIVESFWTDAGRGAEPLKLRNDNATVGALIERYRANARQRRDTIQGNVQSLRLMIRTVYRGDPDRHPSTVLNARLVREFERKRIAEAEKLATPETRASILQRVRTSTASYLRQARSVVAPAKAKFYEGMKLPDLSGFRAERVETPQRSLPRPLDMKALAAMEGATPKLATEDPAVYVAHLLFSR
Ga0066684_1000247293300005179SoilMKDRKLLIPVRCTTRKIYLQHPPAGRNDWHVRFTSPAIDGSRREIFRSTGTKEIAAAKRIAAQIIESFWTEGGRGAERLKLRNDNATIGELLERYERAAAQRPRTVQCNARSLRLIVRTVHSGNPDQKPTSVLTAGLIREFEKRRIQRIEKHATSLNRAVLIQRTRNSTASFVRQARSVVALRKMKFYEDLRLSDLTAFRGESVEPPQRSLPRPLDMKALKAMEAAVPTLAEKDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRDEENFFPK
Ga0066684_1041124913300005179SoilMKDPRLLIPVRCTTRKIFLQHPPAGRNDWHVRFTAPAIDGSRREIFRSTGSKEIAAAKRIAARIIESFWTDAGRGVEVLKLRNDNATIRELIERYERNAVQRPRTIRGNVRSLRLIVKTVHNGDPDQKPTTLLTSSLIRDFEKRRLQRAEERATAATRAGVIQRTRNSTASFVRQARSVVALRKMKFYEDLKLPDLGAFRGESVETPHRSLPRPLDMKVLSAMQAATPKLATEDPAVYVAHLLFSRLGLRNIEIINARAHWISDGNIGIINRPEENFFPK
Ga0066685_1067827913300005180SoilKEIGAAKRIAADIIKSFWIDSGRSAVRLKLRDDNATIGELIAKYRQRAAQRPGTIQSNVRSLRMVMKTVHGGDPDSKSTSVLTGNLIREFEKRQLERAEKRATPTTRASVVQKVRISTASYVRQARAIVALRKMKFYEGLKLPDLAGFRGESVETPKRSLPRPLEMSALAAMEAAAPSLARDDPGAYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFPKGCEGWV
Ga0066671_1034373213300005184SoilMSRVVFRSTGTKEIAAAKRIAAQIIESFWNDAGSGAETLKLRNDTATIGELIERYRQNAEQRPGTVRGNARSLRIIVRTMYGEDPGQKPTSVLRADLIRQFEKRQIEAAEERATPTTRAALIQQTRNSTASFVRQARSIVTLRKMKFFKDLKLPELRAFRGESVEMPHTGRCPVLSTMKALCAMEAAVPKLAEKDPAVYVAHLLFSRLGLRNIEIANSPPHIESAMAALA
Ga0065705_1030577723300005294Switchgrass RhizosphereMRDRKLLIPVRATTRKFYLHPPPVGRNDWHVRFRAPCIDGVRRNIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAEPLKLRNENATVGELLERYTRSAVQRPRTIQCNARSLRLIVRAVHGGNPDEKSTSVLTANLIREFEKRRIQRVEENTGASNRIALIQRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRAESVETPHRSLPRPLDMKALSAMEAAVPALAEKDPAGYVAHLLFSRLGLRNIEIVNARSHWISNGSIGIINRPEENFFPKG
Ga0065705_1043303713300005294Switchgrass RhizosphereAAQIIESFWNDTGRGAEPLKLPNNNASIGELIAAYECNAVQRPRTIRGNVRSLRLIVRTVHSGNPEQKPTTLLTASLIREFEKRRLQQGEARATPTTRASLIQSTRNSTASFVRQARSIVALRKMRFYEDLKLPDLAAFRGESVDTPHRSLPRPLDMKALGAMEGATAKLAVKDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRSEENFFPKGCEGWVPVARDVLAEIMKFQPLATDNYLVPGKNQTERHEAVYRRHSKWVSQWIKGRAKTS
Ga0065705_1050935113300005294Switchgrass RhizosphereGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNNNATVGELLERYGRSAVQRPRTIQCNARSLRLIVRTVHGGNADEKSTSVLTADLIREFEKRRIQRVEENADVSNRTALLHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWVPIASDVLTEIRRFQPLATD
Ga0065707_1023632423300005295Switchgrass RhizosphereVKDPRLLIPVRCTTRKIFLQHPPKGRNDWHVRFTAPAVDGSRREIFRSTGTKEIAAAKRIAAQIIDSFWNDAGRGAEPLKLRNDNATIGQLIERYERNAVQRPRTIRGNVRSLRLIVRTIHNGDPNDKPTTVLTSNLIREFEKRRLQKAEQRATGSTRATVIHRTRNSTASFVRQARSIVAMRKMKFYEGLKLPDLAAFRGESVETPHRSLPRPLDMKALSAMEAATPKLASEDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRAEENFFPKGCEGWVPIAPDVLAEIIRFQPVATESYLVPGRNQTERHEA
Ga0066388_10051307313300005332Tropical Forest SoilMYGCVSIGPMKNRKLLIPVRCTTRKFYLQPPPPGRNDWHVRFVPPSGDGVGREICRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIERYRQNATQRPGTIRSNARSLRMVVKTALSGDPDLKPTSILTANLIRNFEKLQLERVEKRPASVTRLSALQRLRVSTASYVRQARAIVALRKLKFYEGLKLPDLAGFRGESVDTPKRSLPRPLDMKALAAMEAAAPALARDDPGAYIAHLLFSRLGLRNIEIV
Ga0066388_10367334713300005332Tropical Forest SoilKFYLQPAPPGKNNWYIRFTTPAINGIRRVIFRSTGSKEIAAAKRIAAQIIESFWADAGRGAELLKLRSDSATIGELIARYRENATQRPDTVRSNVRSLRMIVKTVHPGDPDKKPTSLLTASLIREFEKRQFESAQKRATAENRATAIQRVRSSTSSYVRQGRSIIALRKMKFYDGMKLPDLAGFRGENVESSRRSLPRPLDMKALEAMEAATPALAKDDPAAYVAHLLFSRLGLRNIEIVSARTHWIHEGIIGIINRPEED
Ga0070675_10061727013300005354Miscanthus RhizosphereMRDRKLLIPVRATTRKFYLHPPPVGRNDWHVRFRAPSIDGTRRNIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAEPLKLRNNNATVGELLERYGRSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSVLTASLIREFEKRRIQRVEENADVPNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWVPIASDVLTEIRRFQPLATDGYLV
Ga0070675_10134493213300005354Miscanthus RhizosphereRVIFRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEDLKLRNDHATIGELIERYEERAAQRPTTVRSNSRSLRMIVRTVHPGDPDERSTSALTADLIREFEKRQLARVEKRATPSTRTVAIQRVRNSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPRRSLPRPLDMEQLAKMEEATPDLAEKYPAVYVAHLLFSRLGLRNIEIVMRGEHL
Ga0070709_1070641713300005434Corn, Switchgrass And Miscanthus RhizosphereAGRGAEPLKLRNDNATVGALLERYTRSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSVLTANLIREFEKRRIQRVEESPNASNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWVPIARDVLAEITKYQPLTSDGYLVPGRNQTERHEAVYRRHSKWVSQW
Ga0066681_1035125613300005451SoilRKFYLHRPPPGRNDWHVRFTPPAMNGVRRVVFRSTGTKEIAAAKRIGAQIIESFWTDSGRGAEPLKLRSDNATIGELIMRYAQNASQRPSTIRSNARSLRMIVKTVYSGDPDQKSTALLTANLIREFEKRQIERAEKRATAATRSKFIERVRTSTASYVRQARSIIALRKMKFHEGMKLPDLSGFRGETVETPHRSLPRPLDMKALTEMNAAAPALAKRDPGAYVGHLLFSRVGLRNIEIVNARVHWISDGSIGIVNRPEEDFFAKGCEGWVPIAPDVLKEILTFQPLCTEGYLVPGANR
Ga0070665_10117743313300005548Switchgrass RhizosphereRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEPLKLRNNNATVGELLERYERSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSVLTASLIREFEKRRIQRVEENADVPNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPELASFRGESVETPQRSLPRPLDMKALSAMEAAVPALAEKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEENFFPKGCEGWVPIARDVLAEIMKFQSLATD
Ga0066703_1033018013300005568SoilHLHMKNPKLLIPIRCTTRKIYLQPPPQGRNDWHVRFTAPSVDGSRREIFRSTGTKEISAAKRIAAQIIESFWTDAGRGAEPLKLRNDNATVGALIERYRANARQRRDTIQGNVQSLRLMIRTVYRGDPDRHPSTVLNARLVREFERKRIAEAEKLATRETRASILQRVRTSTASYLRQARSVVAPAKIKFYEGMKLPDLSGFRAERVEPPQRSLPRPLDMKALAAMEGATPKLATEDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRAEENFFPKVVKAGCRSRPTFSRKLFVSSD*
Ga0066694_1032869313300005574SoilNTGAKEIGPAKRIAAKIIESFWADAGRGADRLKLRNDNVKIGELIARYERNASQRRATIRSNVRSLRMIVRTVHGGDPDQKSMAVLTANLIREFEKRQVDSAEKRATPATRAVVIQRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLSGFRGETVETPHRSLPRPLDMKALTAMNAAAPTLAKKDPGAYVAHLLFSRLGLRNIEIVNARVHWISDGSIGIVDRLEEDFFPKGCEGWVPMA
Ga0066702_1031163923300005575SoilMKDRKLLIPVRSTTRKFYLHPPPAGRNDWHVRFTPPAINGSRRVVFRSTGTKEIAAAKRIASQIIESFWNDAGRGAEPLKLRNDNATIGELIERYEGNAVQRPRTIRGNVRSLRLIVKTVHNGDTDQKPTTVLTSSLIRDFEKRRLQRAEERATAATRAGVIQRTRNSTASFVRQARSIVALRKMKFYEDLKLPDLAAFRGESVETPHRSLPRPLDMKALSAMEAATPKLASEDPAVYVAHLLFS
Ga0066903_10435078513300005764Tropical Forest SoilGRNDWHVRFTPPSIDGTRRVIFRSTGTKEVGAAKRIAAQIIESFWSDAGRGAEPLKLRNNHATIGELIAKYQQNAVQRRSTIRSNIRSLRMIVKTVHRGDPDRKPTSLLTPNLIREFEKWQLDRALKRATASTRSVAIQRVRVSTASYVRQARSIVARRKMKFYEGLNLPDLIGFRGESVETPQRSLPRPLDMKALSEMEAAEPALARNDPGAYVAHLLFSRLGLRNIEIVNARVDWISDGSIGIVNRPEED
Ga0070717_1111540713300006028Corn, Switchgrass And Miscanthus RhizosphereGTKEVAAARRIAAQIIESFWADAGRGAELLKLRSDNATIGELIARYRENASQRPDTVRSNVRSLRMILKTVHSGDPDIKPTSLLTASLIREFEKRQFDCAQKRATAATRSTVIQRVRTSTASYVRQARSIVALRKMKFYDGMKLPDLAGFRGESVESPQRPLPRPLDMKALRAMEAAEPTLAKQDPGAYVAHLLFSRLGLRNIEIVNARVHWISDGSIGIVNRPEEDFFSKGCEGWVPI
Ga0066651_1031191913300006031SoilVSIAPMKDRKLLIRVRCTTRKFYLHRPLPGRNDWHVRFTAPAIDGGRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGDLIARYERNALQRRATIRSNIRSLRMIVKTAHGADPDTKPTSVLTANLIREFEKRQFDRAQKRATAQTRATAIQRVRTSTSSYVRQARSIVALRKMKFYDGMKLPDLAGFRGETVESPQRSLPRPLDMKALQAMEAAEPTLAENDPAAYVAHLLFSRVGLRNIEIVNARVHWINDGSIGIVNRPEEDF
Ga0070715_1024987713300006163Corn, Switchgrass And Miscanthus RhizosphereTTRKFYLHSPPSGRNDWHVRFAPPAADGVRRVIFRSTGTKEVAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIERYKQSAAQRPGTVRSNVRSLRMIVKTVLSGDPDTKPMSILTANLIREFEKRQLVRAEKHSTGVSRLSAIQRVRTSTASYVRQARAIVALRKMKFYEELKLPDLTGFRGESVEAPRRSLPRPLDMKALTAMEAAAPILAKDDPGAYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRPEEDFFPKGCEGWVPIAPDVLKELLSLQSLCTDDYLVPGANRTERHDA
Ga0075425_10110388013300006854Populus RhizosphereVKDPRLLIPIRCTTRKIYLQPPPAGRNDWHVRFTAPAVNGTRREIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAEPLKLRNDNATIGLLIERYERSAVQRPRTIRGNVRSLRFIVKTVHNGDPDQKPTTVLTAGLIREFEKRRLQQTEQRASASTRVGMIQRTRNSTASFIRQARSIVALRKMRFYEDLKLPDLVAFRGESVEMPPRSLPRPLDMKALRAMEAAAPELAKEDPGAYVAHLLFSRLGLRNIEIVSARTHWI
Ga0099795_1020786613300007788Vadose Zone SoilAQIIESFWTDAGRGAERLKLRNDHATIGELIDRYKASALQRPATIRSNVRSVRMIVKTVHPGDPDTKPTSILTANLIRDFEKRQLERVEKRAVGVTRLSAIQRVRTSTTSYVRQARAIVALRKMKFYEGLKLPDLAGFRGESVETPKRSLPRPLDMKPLAAMDAAAPALARDDPAAYVAHLLFSRLGLRNIEIVNARTRWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVLKEILSFQPVSTDGYLVPGANRTERHDAVYRRHSRWVSHWIKDRT
Ga0066710_10053318813300009012Grasslands SoilVLALDSVKSRKLLVPVRSTTRKFYLQPAPPGKNNWYIRFTTPAINGIQRVIFRSTGSKEIAAAKRIAAQIIESFWADAGRGAELLKLRSDNATIGELIARYRERASQRPDTVRSNVRSLRMIVKTVHPGDPDTKPTSLLTASLIREFEKRQFESAQKRATTETRATAIQRVRSSTSSYVRQGRSIVALRKMKFYDGMKLPDLAGFRGENVESPKRSLPRPLDMKALEAMQRAAPAMAKDDPGAYVAHLLFSRLGLRNIEIVNA
Ga0066710_10146498513300009012Grasslands SoilMKDRKLLIRVRCTTRKFYLHRPRPGRNDWHVRFTAPAIDGARRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIASYERNALQRRATIRSNVRSLRMIVRTVHGGDPDAKSTSLLTANLIREFEKRQIESAEKRATAATHSVVILRVRTSTASYVRQARSIVALRKMKFYETIKLPDLTGFRGETVETPHRSLPRPLDMKGLTEMNAAAPALAKRDPGAYVAHLLFSRMGLRNIEIVNARVHWISDGSIGILNRPEEDFFSKGCEGWV
Ga0066709_10144158313300009137Grasslands SoilMKDRKLLIRVRCTTRKFYLHPPRPGRNDWHVRFTAPEINGSRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIGRYERNASQRPATIRSNVRSIRMIVKTVHCGDPDQKSTAVMTANLIREFEKRQVDSAEKRATAATRSVVIQRVRTSTASYVRQARSIVALRKMKFYDGMKLPDLTGFRGETIETPRRSLPRPLDMKALAEMNAAAPALAKSDPGAYVAHLLFSRMGLRNIEIVNARVYWISDGSIGIVNRP
Ga0066709_10155221013300009137Grasslands SoilVRFTSPAIDGSRREIFRSTGTKEIAAAKRIAAQIIESFWTEGGRGAERLKLRNDNATIGELLERYERAAAQRPRTVQCNARSLRLIVRTVHSGNPDQKPTSVLTAGLIREFEKRRIQRIEKHATSLNRAVLIQRTRNSTASFVRQARSVVALRKMKFYEDLRLSDLTAFRGESVETPERSLPRPLDMKALRAMEAAVPTLAEKDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRDEENFFPKGCEGWVPIARDVLREIMKFQSLTT
Ga0066709_10251151913300009137Grasslands SoilESFWIDAGRSAERLKLRNDHASIGELIERYRASAAQRPDTIRSNVRSLRMIVKTVHTGDPDIKPTSILTANLIREFEKRQLERVEKPPASVSRVSALQRVRTSIGSYVRQARAIVALRKMKFYEGLRLPDLTGFRGETVEAPKRSLPRPLDMKALAAMEAAAPALAKDDPGAYAAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFPKGCEGWVPIARDVLKEV
Ga0126380_1068605113300010043Tropical Forest SoilTRKFYLNPPRPGRNDWHVRFTAPGINGTRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIARYQRNAVQRHDTIRSNIRSLRMIVKTVHGGDPDTKPTSLLNANLIREFEKRQVDAAEKRATAATRSVVIQRVRTSTASYVRQARSIVALRKMKFYEGMKLPDFTGFRGETVETPHRSLPRPLDMKALTAMNAAAPALAENDPGAYVAHLLFSRVGLRNIEIANARVHWINDGRMGIVNRPEEDFFPKGCEGWVP
Ga0126382_1067781513300010047Tropical Forest SoilEIAAAKRIAAQIIESFWIDAGRGAERLKLRNDHANIHELIERYKQNAAQRPETIRSNVRSLRMIVKTVHSGDPDAKPTSILTANLIREFEKRQLERVEKRPANVTRLSALQRVRASTASYVRQARAIVALRKMKFYEGLKLPDFGGFRGESVETPHRSLPRPLDMKALAAMEAAAPSLATDDPGAYVAHLLFSRLGLRNIEIVNARTHWISDGNIGIINRPEEDFFPKGCEGWVPIAPDVLKEIQTFQPLCANGYLVPGTNRTERHDAVYRRHSKWVSQWVKDRTK
Ga0126373_1157211913300010048Tropical Forest SoilDAGRGAELLKLRSDSATIGELIARYRENATQRPDTVRSNVRSLRMIVKTVHPGDPDKKPTSLLTASLIREFEKRQFESAQKRATAENRATAIQRVRSSTSSYVRQGRSIIALRKMKFYDGMKLPDLAGFRGENVESPKRSLPRPLDMKALEAMEAASPALARDDPGAYVAHLLFSRLGLRNIEIVNARTHWISDGSLGIINRPEENFFAKGSEGWLPIAPDVLKEILSFQPLCTNGYLVP
Ga0126376_1089448113300010359Tropical Forest SoilTRRVIFRSTGTKEIAAAKRIAAQIIESFWSDAGRGAEPLKLRNNHATIGELITKYQQNAVQRPSTIRSNIRSLRMILKTVHRGDPDRKPTSLLTPNLIREFEKRQLDRAEKHATAATRSVAIQRVRVSTASYVRQARSILARRKMKFYEELKLPDLIGFRGESVETPQRSLPRPLDMKALTAMEAAEPMLARRDPGAYAAHLLFSRLGLRNIEIVNARVHWISEGSIGIVNRPEEDFFSKGCEGWVPVAPDVLKEILSFQPRCTDGYLVPGANRTERHDAVYRRHSRWVSLW
Ga0126372_1149069613300010360Tropical Forest SoilSKEIAAAKRIAAQIIESFWADAGRGAELLKLRSDSATLGELIARYRENATQRPDTVRSNVRSLRMIVKTVHPGDPDKKPTSLLTASLIREFEKRQFESAQKRATAENRATAIQRVRSSTSSYVRQGRSIIALRKMKFYDGMKLPDLAGFRGENVESSRRSLPRPLDMKALEAMEAATPALAKDDPAAYVAHLLFSRLGLRNIEIVSARTHWIHEGIIGIINRPEEDFFPKGCEGWVP
Ga0126381_10256304113300010376Tropical Forest SoilPPSRNDWHVRFVPPSADGIGREIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHASIGELIERYKQRAAQRPGTIRSNVRSLRMIVKTVHSGDPDPKPTSILTENLIREFEKRQLERIERRGAGVSRLLALQRVRTSTASYVRQARAIVALRKMKFYEGMKIPDLTGFRGESVEAPKRSLPRPLDMKALAAMEAAAPALARDDPGAYVAHLLFSRLGLRNIEIVNARTHWINE
Ga0126381_10264758713300010376Tropical Forest SoilKEIAAAKRIAAQIIESFWADAGRRAELLKLRSDNATIGELIKRYAQNAAQRPTTIRSNSRSLRMIVKTVHGGDPDQKSTGLLTANLIREFEKRQFERIEKRSSELNRTSAIQRVRTSTASYIRQARSIVALRKMKFYEGLKLPDLTGFRGEAVETPKRSLPRPLDMKALAAMEAAAAALAKSDPGAYVAHLLFSRLGLLNIEIVNARVHWIAHGSIGIINRPEEDFFPKGCEGWVPIAP
Ga0124850_103871013300010863Tropical Forest SoilVRFVPPSVDGVAREIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATVGELIERYRQNAAQRPATIRSNVRSLRMIVKTVHSGDSDIKATSILTANLIRDFEKRQLERVQKRGAGVSRLSALQRVRVSTASYVRQARAIVALRKMKFYEGLKLPDFTGFRGESVEAPHRSLPRPLDMKALAAMEAAAPSLARDDPGAYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIIKIGPRRISFRKAAKDGCRLRPTF*
Ga0137381_1070870613300012207Vadose Zone SoilPRPGRNDWHVRFTAPANDGRRRLFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIDRYERNALQRRSTIRSNVRSLRMIVRTVYGGDPDDKSTSLLTANLIREFEKRQIESAEKRATVATRSVVILRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTEMNAAAPALAKRDPGAYVAHLLFSRMGLRNIEIVNARVHWISDGSIGIVNRPEEDFFAKGCEGWVPIAPDVLKEILRFQPLCTDGYLVPGEN
Ga0137379_1029548813300012209Vadose Zone SoilMKDRKLLIRVRCTTRKFYLHPPRPGRNDWHVRFTAPAIDGGRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIDRYERNALQRRSTIRSNVRSLRMIVRTVYGGDPDDKSTSLLTANLIREFEKRQIESAEKRATVATRSVVILRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTEMNAAAPALAKRDPGAYVAHLLFSRMGLRNIEIVNAR
Ga0137370_1023607323300012285Vadose Zone SoilMKDRKLLIRVRCTTRKFYLHPPRPGRNDWHVRFTAPAIDGGRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIARYERNASQRHATIRSNVRSLRMIVRTVHGGDPDQKSTALLTANLIREFEKRQVDCAEKRVTAATRSVVIQRVRTSTASYVRQARSIVALRKMKFYEGMKFPDLIGFRGETVETQHRSLPRPLDMNALSDMNAAAPALAKTDPGAYVAHLLFSRMGLRNIEIVNARVHWISDGSIGIVDRLEEDFFPKGCEGWVPMAPD
Ga0137367_1015900933300012353Vadose Zone SoilMKDRKLLIRVRCTTRKFYLHPPPPGRNDWHVRFTAPAIDGGRRIFRNTGAKEIGPAKRIAAKIIESFYADAGRGADRLRLRNDNAKIGELIARYERNASQRHTTIRSNVRSLRMIVRTVHGGDPDQKSTALLTANLIREFEKRQVDSAEKRATAATRTVVIQRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLSGFRGETVETPHRSLPRPLDMKALSDMNAAAPALAKKDPGAYVAHLLFSRMGLRNIEIVNARVHWISDGSIGIVDRPEEDFFPKGREGWVPMAPDVLKEILSFQPLCTDSYLVPG
Ga0137369_1050961713300012355Vadose Zone SoilRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIARYERNASQRHATIRSNVRSLRMIVKTVYGADPNSKSTSLLTANLIREFEKRQVDSAEKRATAATRSVVIERVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTDMNAAAPALAKRDPGAYVAHLLFSRLGLRNIEILNARVHWISDGSIGIVNRLEEDFFPKGCEGWVPMAPDVLKEILRFQPLCADGYLVPGANQTERHDAVYRRHSKWV
Ga0137371_1053313313300012356Vadose Zone SoilMKDRKLLVRVRCTTRKFYLQPPRPGRNDWHVRFAAPTVDGTRQIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIDRYERNALQRRSTIRSNVRSLRMIVRTVYGGDPDDKSTSLLTANLIREFEKRQIESAEKRATAATRSVVILRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTEMNAAASALAKRDAGAYVAHLLCSRVGLRNIEI
Ga0137396_1007130233300012918Vadose Zone SoilVFRSTGTKEIPAARRIAAQIIESFWTDSRRGAEPLKHHNDNVTVGELIKRYEQNAAQRPSTVRSNARSLRMVVKTVHGGDPDQKSTSLLTANLIREFEKRQLERAEGRATAATRSKLIERVRTSTASYVRQARSIIALRKIKFYEGMKLPDLIGFRGETVETPHRSVPRPLDMKGLTAMNTAAPALAKTDPGAYVAHLLFSRLGLRNIEIVNARVHWISDGSIGIVNRSEEDFFPKGCEGWVPMAPDVLNEILKFQDLCTDGYLVPG
Ga0137394_1076078713300012922Vadose Zone SoilSPAIDGSRREIFRSTGTKEIAAAKRIAAQIIESFWTEGGRGAEPLKLRNDNATIGELLERYERAASQRPRTVQCNARSLRLIVRTVHSGNPDEKPTSVLTAGLIREFEKRRIQRIEKHATSLNRAVLIQRTRNSTASFVRQARSVVALRKMKFYEDLRLPDLTAFRGESVETPQRSLPRPLDMKALSAMEAATPKLATEDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRPEENFFPKGCEGWVPIARDVLAEIMTFRSLTT
Ga0164299_1004708543300012958SoilVRFRAPSIDGARRNIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAEPLKLRNDNATVGALLERYTRSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSILTANLIREFEKRRIQRVEENADVSNRIALIHRARNSTASFVRQARSIVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWVPIARDVLAEITKYQPLTSDGYLVPGRNQTERHEAVYRRHSKWVS
Ga0164302_1042667413300012961SoilVRFTPPAIDGVRRVVFRSTGTKEIAAAKRIAADIIKSFWIDSGRSAVRLKLRDDNATIGELIAKYKQHAAQRPGTIRSNVRSLRMVIKTVHVGDPDSKSSSVLTANLIREFEKRQVERAEKRSTPATRASVIQKVRISTASYVRQARAIVALRKMKFYERLKLPELSGFRSESVEAPKRSLPRPLDMKALAAMEAAEGALAKEDPGAYVAHLLFSRLGLRNVEIVNARTHWINDGRIGIINRPEEDFFQKG
Ga0163162_1144129413300013306Switchgrass RhizosphereVVFRSTGTKEIGAAKRIAADIIKSFWIDSGRSAVRLKLRDDNATIGELIAKYQQRATQRPGTIRSNVRSMRMLIKTVHGSDPDSKSSSVLTANLIREFEKRQLERAEKRATPATRASVVQKVRISTASYVRQARAIVALRKMKFYEGLKLPDLTGFRGESVEAPHRSLPRPLDMKALTGMEAAAAALAKEDPGAYVAHLLFSRLGLRNIEIVNARTHWINDGNIGI
Ga0163163_1081259213300014325Switchgrass RhizosphereMKDRKLLVPVRCTTRKFYLQPPPPGRNDWHVRFVPPSIDGVPREIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELLERYRASATQRPDTIRSNVRSLRMIVKTVHSGDPDIKPTSILTANLIREFEKRQLERIEKRSVDVSRLAATQRVRTSTASYVRQARAIVALRKMKFYEGLKLPDLVEFRGESVEPPKRSLPRPLDMKALAAMEATAPSLARVDPGAYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRPEEDFFPKGCEGWVPIAPDVLR
Ga0132258_1331463123300015371Arabidopsis RhizosphereNNATIGELIQRYERNAAQRPRTIRGNVRSLRLIVRTVHNGDPDEKSATVLTSNLIREFEKRRIQKAEQRATSFTRVAMIHRTRNSTASFVRQARSIVACRKMKFYEDLKLPDLAAFRGESVETPHQSLPRPLDMKALIAMEAAVPDLANEDPAAYVAHLLFSRLGLRNIEILNARSHWISDGNIGIINRPEENFFPKGCEGWVPIAADVLSEILRFQPLTTAGYLVPGRNQTERHKAIYRRH*
Ga0132255_10126342623300015374Arabidopsis RhizosphereMNGMSRVVFRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEPLKLRNNNATVGELLERYERSAVQRPRTIQCNVRSLRLIVRTVHGGNPDEKSTSVLTASLIREFEKRRIQRVEENADVPNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWV
Ga0182032_1019837113300016357SoilMYRCASIGPMKNRKLLIPVRCTTRKFYLQPPPPGRNDWHVRFVPPSIDGVPREIFRSTGTKEIAAAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELLDRYKASAAQRPDTIRSNVRSLRMVIKTVHSGDPDTKPTSILTPNLIREFEKRQLERVGERAAGVSRLTALQRVRTSTASYVRQARAIVALRKMKFYEGLKFPDLSGFRGESVEAPRRSLPRPLDMKALEEMEAAAPRLAREDPAVYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDF
Ga0184618_1007750113300018071Groundwater SedimentVYFRAAKIGSRFGSIKCVVVLALEPMKNRKLLIPVRCTTRKFYLHSPPSGRNDWHVRFTPPSVDGVRHVVFRSTGTKEIAAARRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIERYKQSAAQRPATIRSNVRSLRMIVKTVHAGDPDTKPTSILTANLIREYEKRQLERVEKRPASVTRLSALQRVRASTASYVRQARAIVALRKMKFYGSLRLPDLTDFRGETVETPKRSLPRPLDMKALTAMEAAEPALAKNDPGAYVAHLLFSRLGLRNIEIVNARVHWINNGSIGIIDR
Ga0184609_1017088313300018076Groundwater SedimentMPCCVSISPMKDRKLLIRVRSTTRKFYLHPPRPGRNDWHVRFTAPTIDGTRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGELIARYEQNAAQRPSTIRSNARSLRMIVKTVHGGDPDQKSTALLTANLIREFEKRQIERAEKRATAITRSKFIERVRTSTASYVRQARSIIALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTAMNAAAPALAESDPGAYVAHLLFSRVGLRNIEIVNARVHWISDGSIGIVNRPEEDFFSKGCEGWVPIAPDVLKEILKFQPLCTDGYLVP
Ga0066655_1012044513300018431Grasslands SoilFRSTGSKEIAAAKRIAARIIESFWTDAGRGVEVLKLRNDNATIRELIERYERNAVQRPRTIRGNVRALRLIVKNVHNGDPDQKPTTVLTSGLIRDFEKRRLQRVEKRATAATCAGVIQRTRNSTASFVRQARSIVALRKMKFYEDLKLPDLAAFRGESVETPHRSLPRPLDMRALSAMEAAAPKLAFEDPAVYVAHLLFSRLGLRNIEIVNARTHWISGGSIRIINRAEENFFPKVVKAGCRSRPTFSRKLFVSSD
Ga0066667_1072043413300018433Grasslands SoilLHRPPPGRNDWHVRFTPPAINGIRRVVFRSTGTKEVAAAKRIGAQIIESFWMDSGRGAEPLKLRNDNATIGELITRYEETAAQRPSTIRGNERSLRMIVKTVHGGDPDEKSTALLTANLIREFEKRQIERAEKRATAATRSKFIERVRTSTGSYVRQARSIIALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALRAMNTAAPALAKSDPGAYVAQLLFSRIGLRNIEIVNARVHWISDGSIGIVNRPEEDFFPKGCEGWVPIAPDVLKEIL
Ga0066662_1161925513300018468Grasslands SoilIAAAKRIASQIIESFWNDAGRGAEPLKLRNDNATIGELIERYESNAVQRLRTIRGNVRSLRLIVKTVHNGDPDQKPTTVLTSSLIRDFEKRRVQRAEKRATAATHAGVIQRTRNSTASFVRQARSIVALRKMKFYEDLKVPDLSAFRGESVETPHRSLPRPLDMKALSAMETATPKLATEDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGRVGIINRPEENF
Ga0066669_1095445413300018482Grasslands SoilIIESFWADAGRGADRLRLRNDNAKIGELIARYERNASQRRATIRSNIRSLRMIVRTVHGGDPDQKSTALLTANLIREFEKRQVDSAEKRATAATRSVLIQRVRMSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMTALTQMNAAAPALSKNDPGTYVAHLLFSRLGLRNIEITNARVHWISNGSIGIVNRPEEDFFAKGCEGWVPVAPDVLNEILSFQPLCTDGYLVPGANRTERHD
Ga0193704_102894523300019867SoilSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAIQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0193705_100104223300019869SoilVRFIPPAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAMQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0193700_101173923300019873SoilVRFIPPAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAIQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0193722_107985913300019877SoilKIYLQPAPQGRNDWHVRFTAPSVDGSRREIFRSTGTKEIPAAKRIAAQIIESFWTDAGRGAEPLKLRNDNATVGALIERYRANARQRCDTIQGNVQSLRLMIRTVYRGDPDRHPSTVLNARLVREFERKRIAEAEKLSTPETRASILQRVRTSTASYLRQARSVVAPAKIKFYEGMKLPDLSGFRAERVEPPQRSLPRPLDMKALAAMEGATPKLATEDPAVYVAHLLFSRLGLRNIEIINARTHWISDGSIGIIDRPEENFFPKGCE
Ga0193730_106315313300020002SoilMKDRKLLIRVRCTTRKFYLHPPRPGRNDWHVRFTAPAINGTRRIFRNTGAKEIGPAKRIATKIIESFWADAGRGADRLRLRNDNAKIGELIASYERNASQRPATIRSNVRSLRMIVKTIHSGDPDQKSTAVMTATLIREFEKRQVDSAEKRATATTRSVVIERVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFCGETVETPHRSLPRPLDMKALNAMNAAAPLLGKSDPGAYVAHLLFSRLGMRNIEIVNTRVHWISDGSIGIVNRPEEDFFPKGCEGWIPMAPDVLKEILGFQP
Ga0210381_1006311413300021078Groundwater SedimentMKDPKLLVPVRCTTRKFYLHKPPPGRNDWHVRFTPPAIDGVRRIVYRSTGTKEIGAAKRIAADIIKSFWIDSGRSATRLKLRDDNATIGELIANYKQRAPQRPGTIRSNVRSLRMVIKTVHAGDPDSKSSSVLTANLIREFEKRQFERAEKRATPATRTSVVQRVRISTASYVRQARAIVALRKMKFYDRLKLPDLTGFRGESVEAPKRSLPRPLDMKAVAAMEAAAAALAKNDAGAYVAHLLFSRLGLRNIEIVNARTHWINDGNIGIINRPEEDFFPKGCEGWVPIALDVLEEVLGFQSLCVDGYLVPGANQTE
Ga0210382_1033424713300021080Groundwater SedimentTKEIAAARRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIERYKQSAAQRPATIRSNVRSLRMIVKTVHAGDPDTKPTSILTANLIREFEKRQLERVEKRPASVTRLSALQRVRASTASYVRQARAIVALRKMKFYGSLRLPDLTDFRGETVETPKRSLPRPLDMKALTAMEAAEPALAKNDPGAYVAHLLFSRLGLRNIEIVNARVHWINNGSIGIIDR
Ga0179596_1033359113300021086Vadose Zone SoilSTGTKEMAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIDRYKASAVQRPATIRSNVRSVRMIVKTVHPGDPDTKPTSILTANLIRDFEKRQLERVEKRAVGVTRLSAIQRVRTSTTSYVRQARAIVALRKMKFYEGLKLPDLAGFRGESVETPKRSLPRPLDMKPLAAMDAAAPALARDDPAAYVAHLLFSRLGLRNIEIVNARTRWINDGSIGIINRPEEDFFSKGCEGWVPIAPDVLKEILSF
Ga0210406_1056162713300021168SoilQDLHMKNPRLLIPVRCTTRKIFLQHPPAGRNDWHVRFTAPSVDGSRREIFRSTGTKEVAAAKRIAARIVESFWTDAGRGAEPLKLRNDHATVGELLERYERSAGQRPSTIQCNARSLRFIVRTVHGGNPDEKSTSVLTANLIREFEKRRIQRVEENADASNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLRLPDLTAFRGESVDPPQRSLPRPLDMKALRAMEAAVPTLAEKDPAVYVAHLLFSRLGLRNIEILNARSHWISGGSIGIINRPEEDFFPKGCEGWVPIAPDVLG
Ga0193719_1002101713300021344SoilVYVRAAKIGSEIGSIKCSVLLALGTMRDRKLLIPARCTTRKFYLQKPPKGRNDWHVRFWPPPVNGVSRIVFRSLGTKEIAAAKRVAAKIIESFWNDAGRGAEPLKLRSDNATIGALIERYERNAVQRPRTIRGNVRSLRLIVRTVHNGDADQKPTTILTAGLIREFEKRRIQQAEQRPTIGTRAGVLQRTRNSTASFVRQARSVVALRKMKFYEDLKLPDLAAFRGESVETPHRSLPRPLDMKALSAMEERVPKLASEDPAVYVTHLLFSRLGLRNIEIVNARTHWISDGSIGIINRPEE
Ga0222622_1023666823300022756Groundwater SedimentMKDRKLLIRVRCTTRKFYLHPPRPGRNDWHVRFTAPAINGTRRIFRNTGAKEIGPAKRIATKIIESFWADAGRGADRLRLRNDNAKIGELIASYERNASQRPATIRSNVRSLRMIVKTIHSGDPDQKSTAVMTATLIREFEKRQVDSAEKRATATTRSVVIERVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFCGETVETPHRSLPRPLDRKALNAMNAAAPLLGKSDPGAYVAHLLFSRLGMRNIEIVNTRVQWISDGSIGIVNRPEEDFFPKGCEGWIPMAPDVLKEILGFQPLC
Ga0193714_101159823300023058SoilMKDPKLLVPVRCTTRKFYLHKPPPGRNDWHVRFIPPAIDGVRRVVFRSTGTKEIGAAKRIAADIIKSFWIDSGRGAVRLKLRDDNATIGELISKYKQRASQRPETIRSNVRSLRMVTKTVHGGDPDSKSTSVLTANLIREFEKRQLERAEKRATPTTRNSVIQKVRISTTSYVRQARAIVALRKMKFYEALKLPDLSGFRSESVETAKRSLPRPLDMKALAAMEATALALARDDPAAYVAHLLFSRLGLRNIEIVNARTHWISDSSIGIINRA
Ga0207644_1032852823300025931Switchgrass RhizosphereMKDRKLRIPARCTTRKFFLHPPPPGRNDWHVRFTPPAIDGVRRVIFRSTGTKEIAAAKRIAAQIIESFWNDAGRGAEDLKLRNDHATIGELIERYEERAAQRPTTVRSNSRSLRMIVRTVHPGDPDERSTSALTADLIREFEKRQLARVEKRATPSTRTVAIQRVRNSTASYVRQARSIVALRKMKFYEGLKLPDLTAFRGETVETPRRSLPRPLDMEQLAKMEEATPDLAEKYPAVYVAHLLFSRLGLRNIEIVMRGEHLALTNVLNSASLLPVPAMILLRVQCA
Ga0207665_1038751123300025939Corn, Switchgrass And Miscanthus RhizosphereMRDRKLLIPVRATTRKFYLHPPPAGRNDWHVRFRAPSIDGTRRNIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAEPLKLRNNNATVGELLERYGRSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSVLTASLIREFEKRRIQRVEENADVPNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRGESVETPQRSLPRPLDMKALSAMEAAVPALADKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDSSIGIINRPEEDFFPKGCEGWVPIARDVLAEITKYQPL
Ga0207651_1075916013300025960Switchgrass RhizosphereAAAKRIAAQIIESFWTDAGRGAEPLKLRNDNATVGKLLERYTRSAVQRPRTIQCNARSLRLIVRTVHGGNPDEKSTSVLTANLIREFEKRRIQRVEENADASNRIALIHRARNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRAESVETPHRSLPRPLDMKALSAMEAAVPALAEKDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIINRPEEDFFPKGCEGWVPIARDVLAEIMKYQPLTSDGYLVPGRNQTERHEAVYRRHSKWVSQWIKGRAKT
Ga0209239_118524113300026310Grasslands SoilLRNDNAKIGDLIARYERNALQRRATIRSNIRSLRMIVKTAHGADPDTKPTSVLTANLIREFEKRQFDRAQKRATAQTRATAIQRVRTSTSSYVRQARSIVALRKMKFYDGMKLPDLAGFRGETVESPQRSLPRPLDMKALQAMEAAEPTLAENDPAAYVAHLLFSRVGLRNIEIVNARVHWINDGSIGIVNRPEEDFFPKGCEGWVPIAPDVLKEIFKFQPLCTDGYLVPGANRTERHHAVYRRHSRWVSQWIKDRTK
Ga0209153_111508713300026312SoilMSRVVFRSTGTKEIAAAKRIAAQIIESFWNDAGSGAETLKLRNDTATIGELIERYRQNAEQRPGTVRGNARSLRIIVRTMYGEDPGQKPTSVLRADLIRQFEKRQIEAAEERATPTTRAALIQQTRNSTASFVRQARSIVTLRKMKFFKDLKLPELRAFRGESVEMPHTGRCPVLSTMKALCAMEAAVPKLAEKDPAVYVAHLLFSRLGLRNIEIANSPRTLNQRWQHWHYQRLEENFFPKRCEGWVPIARDVLGE
Ga0209155_111539913300026316SoilMKDRKLLIRVRCTTRKFYLHRPPPGRNDWHVRFTAPAIDGGRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGADRLRLRNDNAKIGDLIARYERNALQRRATIRSNIRSLRMIVKTAHGADPDTKPTSVLTANLIREFEKRQFDRAQKRATAQTRATAIQRVRTSTSSYVRQARSIVALRKMKFYDGMKLPDLAGFRGETVESPQRSLPRPLDMKALQAMEAAEPTLAENDPAAYVAHLLFSRVGLRNIEIVNARVHWINDGSIGIVNRPEEDFFPKGCEGWVPI
Ga0209807_100498153300026530SoilMKDPRLLIPVRCTTRKIFLQHPPAGRNDWHVRFTAPAIDGSRREIFRSTGSKEIAAAKRIAARIIESFWTDAGRGVEVLKLRNDNATIRELIERYERNAVQRPRTIRGNVRALRLIVKNVHNGDPDQKPTTVLTSGLIRDFEKRRLQRAEKRATAATCAGVIQRTRKSTASFVRQARSVVALRKMKFYEDLKLPDLGAFRGESVETPHRSLPRPLDMKVLSAMQAATPKLATEDPAVYVAHLLFSRLGLRNIEIINARAHWISDGSIGIINRAEENFFPKVVKAGCRSRPTFSRKLFVSS
Ga0209156_1019466513300026547SoilMKDRKLLIPVRCTTRKIYLQHPPAGRNDWHVRFTSPAIDGSRREIFRSTGTKEIAAAKRIAAQIIESFWTEGGRGAERLKLRNDNATIGELLERYERAAAQRPRTVQCNARSLRLIVRTVHSGNPDQKPTSVLTAGLIREFEKRRIQRIEKHATSLNRAVLIQRTRNSTASFVRQARSVVALRKMKFYEDLRLSDLTAFRGESVEPPQRSLPRPLDMKALKAMEAAVPTLAEKDPAVYVAHLLFSRLGLRNIEIVNARTHWISDGSIGIINRDEENFFPKGCEGWVPIARDVL
Ga0209179_108750513300027512Vadose Zone SoilIAAQIIESFWTDAGRGAERLKLRNDHATIGELIDRYKASALQRPATIRSNVRSVRMIVKTVHPGDPDTKPTSILTANLIRDFEKRQLERVEKRAVGVTRLSAIQRVRTSTTSYVRQARAIVALRKMKFYEGLKLPDLAGFRGESVETPKRSLPRPLDMKPLAAMDAAAPALARDDPAAYVAHLLFSRLGLRNIEIVNARTRWINDGSIGIINRPEEDFFPKGCEGWVPI
Ga0307293_1010389523300028711SoilAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAMQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0307290_1001291033300028791SoilVRFIPPAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGGPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAMQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0307299_1013664713300028793SoilMKDRKLLIRVRCTTRKFYLNPPRPGRNDWHVRFIAPGINGNGTRRIFRNTGAKEIGPAKRIAAKIIESFWADAGRGAEHLRLRNDNAKIGELIAMYERNASQRHATIRSNVRSLRMIVRTVHGGDPDAKSTSLLTANLIREFEKRQIESAEKRATPATRSVVILRVRTSTASYVRQARSIVALRKMKFYEGIKLPDLAGFRGETVETPHRSLPRPLDMKALTAMNAAAPALAKRDPGAYVAHLLFSRMGLRNIEIVNACVHWISDGNIGIVNRPEEDFFAKGCEGWVPI
Ga0307299_1020714113300028793SoilTKEVAAAKRIGAQIIESFWMDSGRGAEPLKLRNDNATIGELITRYEENAAQRPSTIRSNARSLRMIVKTVHGGDPDEKSTALLTANLIREFEKQQLDSAEKRATAATRSVVIQRVRTSTASYVRQARSIVALRKMKFYEGMKLPDLTGFRGETVETPHRSLPRPLDMKALTEMNAAAPALAKKDPGAYVAHLLFSRLGLRNIEMVNARVHWISDGSIGIVNRPEEDFFPKGCEGWVPMATDVLKEI
Ga0307305_1000059363300028807SoilVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDTKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAMQAASPTLARNDPGAYVAHLLFSRLGLRNIEIVNARVHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVDRDCMRNAR
Ga0307292_1025981713300028811SoilSNAGILSASSPSGRNDWHVRFIPPAINGVRRVVFRSTGTKEIAAAKRIASQIIESFWTDAGRGAEPLKLRNDNATIGELIIKYEENAAQRPATVRSNIRSLRMIVKTVHRGDPDIKSTSLLTANLIREFEKRQIDRAEKRATAATRSVIIQRVRSSTASYVRQARSIVAVRKMKFYEGLKLPDLTAFRGESVETPHRSLPRPLDMKALTAIQAASPTLARNDPGAYVAHLLFSRLGLRNI
Ga0170824_10163991113300031231Forest SoilPPPGRNDWHVRFVPPSVDGVPREIFRSTGTKKIAVAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELIERYQVNAAQRPGTIRSNVRSLRMIIKAVHSGDPDNKPTSILTANLIRDFEKRQVERIEKRVAGVTRLSAMHRVRTSTASYVRQARAIIALRKMKFYEGLRLPDLVGFRGESVEAPKRSLPRPLDMKALAAMEAAAPSLAGHDPGAYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFPKGCEGWVPIAPDVL
Ga0170818_10704578213300031474Forest SoilFWTDAGRGAERFKLRNDHATIGELIERYRASAAQRPATVRSNVRSLRMIVKTVHSGDPDVKPTSILTANLIREFEKRQLERVEKRPAGKSRIAAMQRVRTSTGSYVRQARAIVALRKMKFYEGLRLPDLTGFRGETVEAPKRSLPRPLDMNALAAMEVAAPALAKDDPGAYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRAEEDFFPKGCEGWVPIAPDVLQAILSFQPLCVEGYLVPGANRTERHDAVYRRHSK
Ga0170818_11220970923300031474Forest SoilMRRCVSIVPMKNRKLLIPVRCTTRKFYLQSPPPGRNDWHVRFVPPSVDGVPREIFRSTGTKKIAVAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELIERYQVNAAQRPGTIRSNVRSLRMIIKAVHSGDPDNKPTSILTANLIRDFEKRQVERIEKRVAGVTRLSAMHRVRTSTASYVRQARAIIALRKMKFYEGLRLPDLVGFRGESVEAPKRSLPRPLDMKALATMEAAAPSLAKHDPGAYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIIDRSEEDFFPKGCEGWVPIAPDVL
Ga0170818_11491245413300031474Forest SoilGVRRIVFRSTGAKEIAAAKRIAAQIIESFWSDAGRGAEPLKLRNNNATLGELIERYRRSAVQRPRTIQCNARSLRLIVRAVHGGNPDEKSTSVLTANLIREFEKRRIQRVEENADASNRIALIHRVRNSTASFVRQARSVVALRKMKFYEDLKLPDLASFRAESVETPQRSLPRPLDMKALSAMEAAVAALADEDPAVYVAHLLFSRLGLRNIEIVNARSHWISDGSIGIIN
Ga0306923_1059525523300031910SoilMYRCASIGPMKNRKLLIPVRCTTRKFYLQPPPPGRNDWHVRFVPPSIDGVPREIFRSTGTKEIAAAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELLDRYKASAAQRPDTIRSNVRSLRMVIKTVHSGDPDTKPTSILTPNLIREFEKRQLERVGERAAGVSRLTALQRVRTSTASYVRQARAIVALRKMKFYEGLKFPDLSGFRGESVEAPRRSLPRPLDMKALEEMEAAAPRLAREDPAVYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFSKGCEGWVPIAPDVLTEI
Ga0310909_1078359413300031947SoilTGTKEIAAAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELLDRYKASAAQRPDTIRSNVRSLRMVIKTVHSGDPDTKPTSILTPNLIREFEKRQLERVGERAAGVSRLTALQRVRTSTASYVRQARAIVALRKMKFYEGLKLPDLSGFRGESVEAPRRSLPRPLDMKALEEMEAAAPRLAREDPAVYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFSKGCEGWVPIAPDVLTEILRFQPLCTNGYLVP
Ga0306922_1123153113300032001SoilEIFRSTGTKEIAAAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELLDRYKANAAQRPDTIRSNVRSLRMVIKTVHSGDPDTKPTSILTPNLIREFEKRQLERVGERAAGVSRLTALQRVRTSTASYVRQARAIVALRKMKFYEGLKFPDLSGFRGESVEAPRRSLPRPLDMKALEEMEAAAPRLAREDPAVYVAHLLFSRLGLRNIEIVNARTHWINDGSIGIINRPEEDFFSKGCEGWVPIAPDVLTEI
Ga0306924_1089257913300032076SoilMYRCASIGPMKNRKLLIPVRCTTRKFYLQPPPPGRNDWHVRFVPPSIDGVPREIFRSTGTKEIAAAKRIGAQIIESFWTDAGRGAERLKLRNDHATIGELLDRYKANAAQRPDTIRSNVRSLRMVIKTVHSGDPDTKPTSILTANLIREFEKRQLERVGERAAGVSRLTALQRVRTSTASYVRQARAIVALRKMKFYEGLKFPDLSGFRGESVEAPRRSLPRPLDMKALEEMEAAAPRLAREDPAVYVAHLLFSRLGLRNIEI
Ga0307471_10000606213300032180Hardwood Forest SoilMKSRKLLIPVRCTTRKFYLQSPPAGRNDWHVRFVPPSVDGVPREIFRSTGTKEIAAAKRIAAQIIESFWTDAGRGAERLKLRNDHATIGELIARYRASAAQRPATIRSNVRSLRMIVKTVHPGDPDLRPTSIMTANLIREFEKRQLERAEKRAAGMSRLIVIQRVRTSTASYLRQARSIIALRKMKFYDGIKLPDLAAFRGESVESPQRSLPRPLDMKALRAMEAAEPTLAKQDPGAYVAHLLFSRLGLRNIEIVNARV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.