NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F051434



Overview

Basic Information
Family ID F051434
Family Type Metagenome / Metatranscriptome
Number of Sequences 144
Average Sequence Length 86 residues
Representative Sequence MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLR
Number of Associated Samples 100
Number of Associated Scaffolds 144
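
The representative sequence above can be inspected with a few lines of standard-library Python. This is a minimal sketch, not something provided by NMPFamsDB: the names `REPRESENTATIVE` and `composition` are illustrative, and the sequence string is copied verbatim from this record.

```python
from collections import Counter

# Representative sequence of family F051434, copied from the record above.
REPRESENTATIVE = "MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLR"


def composition(seq):
    """Return the fraction of each residue type in seq."""
    counts = Counter(seq)
    total = len(seq)
    return {residue: n / total for residue, n in counts.items()}


length = len(REPRESENTATIVE)
comp = composition(REPRESENTATIVE)

# Three most frequent residues, highest fraction first.
top3 = sorted(comp.items(), key=lambda kv: kv[1], reverse=True)[:3]
print(length, top3)
```

The printed length can be compared with the family's average sequence length to see how typical the representative is.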

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 53.15 %
% of genes near scaffold ends (potentially truncated) 34.03 %
% of genes from short scaffolds (< 2000 bps) 92.36 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.82

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (50.694 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (52.083 % of family members)
Environment Ontology (ENVO) Unclassified (53.472 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (57.639 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 52.73%, β-sheet: 0.00%, Coil/Unstructured: 47.27%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.82

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.74.1.0: automated matches | d3mi9b1 | 3mi9 | 0.67211
a.74.1.1: Cyclin | d2i53a1 | 2i53 | 0.66619
c.2.1.7: Aminoacid dehydrogenase-like, C-terminal domain | d1pj3a1 | 1pj3 | 0.65641
a.286.1.1: Sama2622-like | d2pv4a1 | 2pv4 | 0.65223
d.92.1.3: Leishmanolysin | d1lmla_ | 1lml | 0.65104



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 144 Family Scaffolds
PF04392 | ABC_sub_bind | 4.17
PF00072 | Response_reg | 4.17
PF00510 | COX3 | 3.47
PF02518 | HATPase_c | 0.69
PF03734 | YkuD | 0.69
PF09745 | NSRP1_N | 0.69
PF03466 | LysR_substrate | 0.69
PF14706 | Tnp_DNA_bind | 0.69
PF03070 | TENA_THI-4 | 0.69
PF11154 | DUF2934 | 0.69
PF00293 | NUDIX | 0.69
PF13442 | Cytochrome_CBB3 | 0.69
PF09351 | DUF1993 | 0.69
PF13701 | DDE_Tnp_1_4 | 0.69
PF00313 | CSD | 0.69
PF06035 | Peptidase_C93 | 0.69
PF01402 | RHH_1 | 0.69
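
The percent frequencies in the Pfam table above can be turned back into approximate scaffold counts. The sketch below assumes each percentage is a whole-number count divided by the 144 family scaffolds and rounded to two decimals, which is consistent with the column header but not stated explicitly on the page. The helper name `percent_to_count` is illustrative.

```python
FAMILY_SCAFFOLDS = 144  # from the column header "% Frequency in 144 Family Scaffolds"

# Percent frequencies copied from the Pfam table above (first four rows).
pfam_percent = {
    "PF04392": 4.17,
    "PF00072": 4.17,
    "PF00510": 3.47,
    "PF02518": 0.69,
}


def percent_to_count(percent, total=FAMILY_SCAFFOLDS):
    """Convert a rounded percentage to the nearest whole scaffold count."""
    return round(percent / 100 * total)


counts = {pfam: percent_to_count(p) for pfam, p in pfam_percent.items()}
print(counts)
```

Under that assumption, 4.17 % corresponds to 6 scaffolds, 3.47 % to 5, and 0.69 % to a single scaffold, so most of the neighboring domains listed were observed only once.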

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 144 Family Scaffolds
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 4.17
COG1845 | Heme/copper-type cytochrome/quinol oxidase, subunit 3 | Energy production and conversion [C] | 3.47
COG1376 | Lipoprotein-anchoring transpeptidase ErfK/SrfK | Cell wall/membrane/envelope biogenesis [M] | 0.69
COG3034 | Murein L,D-transpeptidase YafK | Cell wall/membrane/envelope biogenesis [M] | 0.69
COG3672 | Predicted transglutaminase-like protein | Posttranslational modification, protein turnover, chaperones [O] | 0.69



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 50.69 %
Unclassified | root | N/A | 49.31 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
2088090014|GPIPI_16822513All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2623Open in IMG/M
2088090014|GPIPI_17200335All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium2306Open in IMG/M
3300000787|JGI11643J11755_11755188Not Available787Open in IMG/M
3300000956|JGI10216J12902_101031777Not Available658Open in IMG/M
3300001867|JGI12627J18819_10143010All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium976Open in IMG/M
3300002906|JGI25614J43888_10052373All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1224Open in IMG/M
3300002911|JGI25390J43892_10124460All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Hyphomicrobium → Hyphomicrobium denitrificans590Open in IMG/M
3300004479|Ga0062595_101843682Not Available577Open in IMG/M
3300005160|Ga0066820_1014173All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2568Open in IMG/M
3300005166|Ga0066674_10459799Not Available580Open in IMG/M
3300005166|Ga0066674_10507557All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → Pseudomonas stutzeri group → Pseudomonas stutzeri subgroup → Pseudomonas stutzeri542Open in IMG/M
3300005167|Ga0066672_10303523All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1038Open in IMG/M
3300005174|Ga0066680_10253586All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1118Open in IMG/M
3300005174|Ga0066680_10445514All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei818Open in IMG/M
3300005177|Ga0066690_10820688All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2602Open in IMG/M
3300005181|Ga0066678_11121977Not Available505Open in IMG/M
3300005445|Ga0070708_101607869All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65605Open in IMG/M
3300005467|Ga0070706_101579910All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2599Open in IMG/M
3300005518|Ga0070699_100556271Not Available1044Open in IMG/M
3300005552|Ga0066701_10343844Not Available924Open in IMG/M
3300005552|Ga0066701_10758453All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2580Open in IMG/M
3300005553|Ga0066695_10395305All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium858Open in IMG/M
3300005557|Ga0066704_10232457All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1250Open in IMG/M
3300005561|Ga0066699_10833641Not Available647Open in IMG/M
3300005575|Ga0066702_10182178All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1263Open in IMG/M
3300005937|Ga0081455_10026706All Organisms → cellular organisms → Bacteria → Proteobacteria5302Open in IMG/M
3300006034|Ga0066656_10113578All Organisms → cellular organisms → Bacteria → Proteobacteria1659Open in IMG/M
3300006046|Ga0066652_100211245All Organisms → cellular organisms → Bacteria1676Open in IMG/M
3300006172|Ga0075018_10269112Not Available830Open in IMG/M
3300006172|Ga0075018_10623457All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium576Open in IMG/M
3300006791|Ga0066653_10438228Not Available667Open in IMG/M
3300006794|Ga0066658_10460322Not Available692Open in IMG/M
3300006796|Ga0066665_11353385Not Available549Open in IMG/M
3300006854|Ga0075425_102500715All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2572Open in IMG/M
3300007076|Ga0075435_101052261All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium711Open in IMG/M
3300007258|Ga0099793_10487378Not Available612Open in IMG/M
3300007258|Ga0099793_10510109Not Available598Open in IMG/M
3300007265|Ga0099794_10569379Not Available599Open in IMG/M
3300009012|Ga0066710_100175448All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3020Open in IMG/M
3300009012|Ga0066710_103090921Not Available644Open in IMG/M
3300009012|Ga0066710_103147321All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium636Open in IMG/M
3300009038|Ga0099829_11174156Not Available636Open in IMG/M
3300009038|Ga0099829_11244787Not Available616Open in IMG/M
3300009088|Ga0099830_10303258Not Available1275Open in IMG/M
3300009088|Ga0099830_11154842Not Available642Open in IMG/M
3300009090|Ga0099827_10698805Not Available876Open in IMG/M
3300009137|Ga0066709_103478315All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300010301|Ga0134070_10371121Not Available559Open in IMG/M
3300010301|Ga0134070_10442208Not Available519Open in IMG/M
3300010335|Ga0134063_10103311All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_71294Open in IMG/M
3300010399|Ga0134127_13358751Not Available524Open in IMG/M
3300011269|Ga0137392_10236101Not Available1502Open in IMG/M
3300011269|Ga0137392_11181301Not Available623Open in IMG/M
3300011270|Ga0137391_10254401Not Available1521Open in IMG/M
3300011271|Ga0137393_10691464All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7874Open in IMG/M
3300012096|Ga0137389_10455329Not Available1096Open in IMG/M
3300012096|Ga0137389_10967831Not Available730Open in IMG/M
3300012198|Ga0137364_10952152Not Available650Open in IMG/M
3300012199|Ga0137383_10270387All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1245Open in IMG/M
3300012199|Ga0137383_10878291Not Available655Open in IMG/M
3300012199|Ga0137383_11363155Not Available503Open in IMG/M
3300012201|Ga0137365_10223616Not Available1404Open in IMG/M
3300012201|Ga0137365_11014209Not Available601Open in IMG/M
3300012202|Ga0137363_10103853All Organisms → cellular organisms → Bacteria → Proteobacteria2165Open in IMG/M
3300012202|Ga0137363_10244040All Organisms → cellular organisms → Bacteria1458Open in IMG/M
3300012204|Ga0137374_10179826Not Available1848Open in IMG/M
3300012204|Ga0137374_10248898All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1491Open in IMG/M
3300012204|Ga0137374_10547871Not Available890Open in IMG/M
3300012205|Ga0137362_10112643All Organisms → cellular organisms → Bacteria2301Open in IMG/M
3300012205|Ga0137362_10697244All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium873Open in IMG/M
3300012206|Ga0137380_10540541All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1023Open in IMG/M
3300012207|Ga0137381_11797714Not Available502Open in IMG/M
3300012208|Ga0137376_10104585All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2402Open in IMG/M
3300012208|Ga0137376_11396205Not Available591Open in IMG/M
3300012209|Ga0137379_10255345All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1665Open in IMG/M
3300012209|Ga0137379_10490579All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1136Open in IMG/M
3300012210|Ga0137378_10557394Not Available1055Open in IMG/M
3300012211|Ga0137377_10072309All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae3231Open in IMG/M
3300012211|Ga0137377_10224581All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1804Open in IMG/M
3300012211|Ga0137377_10456221All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1216Open in IMG/M
3300012211|Ga0137377_11709381All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300012211|Ga0137377_11851134Not Available522Open in IMG/M
3300012285|Ga0137370_10117220All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1515Open in IMG/M
3300012285|Ga0137370_10353726All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria884Open in IMG/M
3300012285|Ga0137370_10488878Not Available753Open in IMG/M
3300012349|Ga0137387_10849399Not Available661Open in IMG/M
3300012350|Ga0137372_10230921All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1464Open in IMG/M
3300012350|Ga0137372_10396450All Organisms → cellular organisms → Bacteria1046Open in IMG/M
3300012351|Ga0137386_10117103Not Available1894Open in IMG/M
3300012354|Ga0137366_10134786All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1862Open in IMG/M
3300012355|Ga0137369_10296522Not Available1201Open in IMG/M
3300012355|Ga0137369_10888021Not Available600Open in IMG/M
3300012356|Ga0137371_10050522All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3215Open in IMG/M
3300012357|Ga0137384_10218540All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1593Open in IMG/M
3300012357|Ga0137384_11187336All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria607Open in IMG/M
3300012358|Ga0137368_10164093All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1623Open in IMG/M
3300012358|Ga0137368_10265673All Organisms → cellular organisms → Bacteria1178Open in IMG/M
3300012359|Ga0137385_10376477Not Available1213Open in IMG/M
3300012359|Ga0137385_10769212All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. th.b2801Open in IMG/M
3300012359|Ga0137385_10783489All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria793Open in IMG/M
3300012361|Ga0137360_10129491Not Available1974Open in IMG/M
3300012361|Ga0137360_10857776All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales782Open in IMG/M
3300012361|Ga0137360_11253345All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria641Open in IMG/M
3300012361|Ga0137360_11295243Not Available629Open in IMG/M
3300012362|Ga0137361_10524835All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1087Open in IMG/M
3300012362|Ga0137361_10712816All Organisms → cellular organisms → Bacteria → Proteobacteria916Open in IMG/M
3300012362|Ga0137361_11119721All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria708Open in IMG/M
3300012532|Ga0137373_10254108All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1417Open in IMG/M
3300012917|Ga0137395_10528556All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium850Open in IMG/M
3300012923|Ga0137359_10232408All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1646Open in IMG/M
3300012923|Ga0137359_10419002Not Available1186Open in IMG/M
3300012923|Ga0137359_10631011Not Available938Open in IMG/M
3300012929|Ga0137404_10982156Not Available772Open in IMG/M
3300012930|Ga0137407_11194604Not Available722Open in IMG/M
3300012930|Ga0137407_12079821Not Available542Open in IMG/M
3300012957|Ga0164303_11277847Not Available542Open in IMG/M
3300012961|Ga0164302_11266485Not Available594Open in IMG/M
3300012984|Ga0164309_10600112Not Available859Open in IMG/M
3300012985|Ga0164308_10609420All Organisms → cellular organisms → Bacteria930Open in IMG/M
3300012989|Ga0164305_10312235All Organisms → cellular organisms → Bacteria1167Open in IMG/M
3300015373|Ga0132257_103183346Not Available598Open in IMG/M
3300017657|Ga0134074_1162490Not Available784Open in IMG/M
3300018431|Ga0066655_10883571Not Available610Open in IMG/M
3300018433|Ga0066667_10680795All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria862Open in IMG/M
3300018468|Ga0066662_11036560All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria815Open in IMG/M
3300020579|Ga0210407_10424639Not Available1040Open in IMG/M
3300021168|Ga0210406_10964308Not Available637Open in IMG/M
3300022726|Ga0242654_10363679Not Available547Open in IMG/M
3300025939|Ga0207665_11228185Not Available598Open in IMG/M
3300026304|Ga0209240_1002168All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales7509Open in IMG/M
3300026317|Ga0209154_1080862All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1401Open in IMG/M
3300026319|Ga0209647_1091690All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1471Open in IMG/M
3300026354|Ga0257180_1052139Not Available582Open in IMG/M
3300026494|Ga0257159_1045219All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria745Open in IMG/M
3300026498|Ga0257156_1025520All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylobacteriaceae1186Open in IMG/M
3300026532|Ga0209160_1141433Not Available1130Open in IMG/M
3300026557|Ga0179587_10784253Not Available628Open in IMG/M
3300027548|Ga0209523_1073367Not Available706Open in IMG/M
3300028536|Ga0137415_10260314All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae1542Open in IMG/M
3300028881|Ga0307277_10287297Not Available728Open in IMG/M
3300031720|Ga0307469_12155629Not Available542Open in IMG/M
3300032180|Ga0307471_100540982All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium1318Open in IMG/M
3300032180|Ga0307471_103830332Not Available532Open in IMG/M
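
Each scaffold identifier in the table above has two parts separated by `|`. The numeric prefix appears to be the IMG/M Taxon OID of the source sample, since the same numbers are listed under Associated Samples; the remainder is taken here to be the scaffold name within that sample. The sketch below splits identifiers on that assumption and counts scaffolds per sample. The function name `split_scaffold_id` is illustrative, and only a handful of identifiers from the table are included.

```python
from collections import Counter

# A few identifiers copied from the Associated Scaffolds table above.
scaffold_ids = [
    "2088090014|GPIPI_16822513",
    "2088090014|GPIPI_17200335",
    "3300005166|Ga0066674_10459799",
    "3300005166|Ga0066674_10507557",
    "3300012211|Ga0137377_10072309",
]


def split_scaffold_id(identifier):
    """Split 'taxon_oid|scaffold_name' into its two parts.

    Assumes the text before the first '|' is the Taxon OID.
    """
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Number of family scaffolds contributed by each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
print(per_sample.most_common())
```

Grouping this way makes it easy to see which samples contribute more than one scaffold to the family.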




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 52.08%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 16.67%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.64%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.17%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.17%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.78%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.78%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.08%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.39%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.39%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.69%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.69%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.69%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.69%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.69%




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001867Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)EnvironmentalOpen in IMG/M
3300002906Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005160Soil and rhizosphere microbial communities from Laval, Canada - mgLMBEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026319Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)EnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026498Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-AEnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027548Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028881Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_03761460 | 2088090014 | Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR
GPIPI_02284740 | 2088090014 | Soil | LAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR
JGI11643J11755_117551881 | 3300000787 | Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPR*
JGI10216J12902_1010317772 | 3300000956 | Soil | MLGFLKEHPEVFDPDDAFDKVWQTAQASGVVYPEAQAEAARAILAKHIIEAAKQGERDYARLCDGALRALAQSNLRTAPRPQR*
JGI12627J18819_101430102 | 3300001867 | Forest Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVIYPKAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALAQTNLRNASHQLR*
JGI25614J43888_100523732 | 3300002906 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
JGI25390J43892_101244601 | 3300002911 | Grasslands Soil | MRNLLAECPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0062595_1018436822 | 3300004479 | Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0066820_10141731 | 3300005160 | Soil | MRGFLKEHSGAFDPDEVHTLVAAFDKAWETVQASGVVYHEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALARSNLPNSRPPPRR*
Ga0066674_104597991 | 3300005166 | Soil | GQRMRTFLAEHSEAFDPDEVHTLIAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0066674_105075571 | 3300005166 | Soil | AARHATGRPLPSPVDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0066672_103035231 | 3300005167 | Soil | MRTFLAEHSEAFNPDEVHTLIVAFDRAWETIQASGVVYPEAKAEIVRAILARHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0066680_102535861 | 3300005174 | Soil | WFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0066680_104455142 | 3300005174 | Soil | WFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0066690_108206881 | 3300005177 | Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATGVVYAEAKAEAVRTILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNLPPPPPR*
Ga0066678_111219772 | 3300005181 | Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0070708_1016078691 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MRNLLAEYPAWFEPDEVQILVAAFDKAWEAVQASVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0070706_1015799101 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MGSGEMRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVIFPKAQAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0070699_1005562712 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNP*
Ga0066701_103438442 | 3300005552 | Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNL*
Ga0066701_107584531 | 3300005552 | Soil | MRTFLAEHSEGAFDPDEVHTLIAAFDRAWETIQASGVVYPEAKAAIVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0066695_103953051 | 3300005553 | Soil | MRGYLKDHVGVFEPDEVVILLAAFDNAWGAVQASGVRYPADKLEFVRAILAKHIIAAAMEGERDVGRLRDRALLALAQSNLRSG*
Ga0066704_102324573 | 3300005557 | Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWGAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0066699_108336412 | 3300005561 | Soil | LLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNL*
Ga0066702_101821781 | 3300005575 | Soil | MRTFLAEHSEAFDPDEVHTLIAAFDKAWETIQASGVVYPEAKAAIVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0081455_100267063 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MRDLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYAEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRTAPDPLR*
Ga0066656_101135782 | 3300006034 | Soil | RPLPSPVDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0066652_1002112451 | 3300006046 | Soil | MRSLLAEYPAWFDPDEVQILLAAFDKAWEAVQASGVTYPADKIESVRTILSKHIIAAAMDGERDLGRLRDGALLALAQSNLRSGPASKLS*
Ga0075018_102691121 | 3300006172 | Watersheds | MRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSSASPLP*
Ga0075018_106234571 | 3300006172 | Watersheds | MRDLLAEYPAWFEPGEVQILVAAFDKAWETVQASGMVYTKANAEAVRAILAKHIIEAARQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0066653_104382282 | 3300006791 | Soil | MRTFLAEHSEAFDPDEVHTLIAAFDRAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNL
Ga0066658_104603222 | 3300006794 | Soil | WFEHDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0066665_113533852 | 3300006796 | Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATGVVYAEAKAEAVRTILAKHIVEAAKQGERDHARLREGALLALAQSNLRNSPPPPPR*
Ga0075425_1025007151 | 3300006854 | Populus Rhizosphere | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0075435_1010522611 | 3300007076 | Populus Rhizosphere | MRNLLAEYPAWFEPDEIQILVAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0099793_104873781 | 3300007258 | Vadose Zone Soil | VGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLK
Ga0099793_105101091 | 3300007258 | Vadose Zone Soil | SGKMRDLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0099794_105693791 | 3300007265 | Vadose Zone Soil | VDSGEMRNLLAEYPAWFEPDEVQILLAAFDNAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0066710_1001754483 | 3300009012 | Grasslands Soil | MRNFLAEHSGAFDPDEVHTLVAAFDKAWEAIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLCDGALLALAQANLRTSDRPPR
Ga0066710_1030909211 | 3300009012 | Grasslands Soil | MRTFLAEHSEGAFDPDEVHTLIAAFDRAWETIQASGVVYPEAKAAIVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT
Ga0066710_1031473211 | 3300009012 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATGVVYAEAKAEAVRTILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNLPPPPPR
Ga0099829_111741562 | 3300009038 | Vadose Zone Soil | RWLSRPERGGGEMRGFLKEHSGAFDPDEVHTLVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNAPPPPPR*
Ga0099829_112447871 | 3300009038 | Vadose Zone Soil | VGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSSASPLP*
Ga0099830_103032581 | 3300009088 | Vadose Zone Soil | MHGFLAEHSGAFTPDEVHTLVAAFDKAWETIQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHARLRDGALLALAKSNLRNAPPPPPR*
Ga0099830_111548421 | 3300009088 | Vadose Zone Soil | VDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0099827_106988051 | 3300009090 | Vadose Zone Soil | MRNLLAEYPAWFERDEVQVLLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0066709_1034783152 | 3300009137 | Grasslands Soil | WFEPDEVQILVAAFDKAWETVQASGMVYAQANAEAVRAILAKHIIEAARQDERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0134070_103711212 | 3300010301 | Grasslands Soil | MRTFLAGHSEAFDPDEVRTLVAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQS
Ga0134070_104422082 | 3300010301 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASVVIYPEAKAEAARAILAKHIIEAARQGERDHARLRDGALLALAQSNLRNSPPRPPR*
Ga0134063_101033112 | 3300010335 | Grasslands Soil | MRTFLAEHSEAFDPDEVHTLVAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0134127_133587511 | 3300010399 | Terrestrial Soil | MRNLRAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGAL
Ga0137392_102361012 | 3300011269 | Vadose Zone Soil | MHGFLAEHSGAFTPDEVHTLVAAFDKAWETIQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNAPPPPPR*
Ga0137392_111813011 | 3300011269 | Vadose Zone Soil | MRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLKVMWKKQNF*
Ga0137391_102544012 | 3300011270 | Vadose Zone Soil | MHGFLAEHSGAFTPDEVHTLVAAFDKAWGTIQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNAPPPPPR*
Ga0137393_106914642 | 3300011271 | Vadose Zone Soil | VGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLKVMWKKQ
Ga0137389_104553292 | 3300012096 | Vadose Zone Soil | MHGFLAEHSGAFTPDEVHTLVAAFDKAWETIQASGVVYPEAKAEAVRAILAKHIIEAAKQCERDHAQLRDRALLALAQSNLRNAPPPPPR*
Ga0137389_109678311 | 3300012096 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGAL
Ga0137364_109521521 | 3300012198 | Vadose Zone Soil | EPDEVQILVAAFDKAWETVQASGVIYPKAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137383_102703872 | 3300012199 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSSASTLQ*
Ga0137383_108782911 | 3300012199 | Vadose Zone Soil | MRGLLAEHSGVFTPDEVRTLVVAFDKAWETIQASGVVYPEAKAEAARAILAKHIIATATNGERDHARLRDGALLALAQSNLRNSPPPPPR*
Ga0137383_113631551 | 3300012199 | Vadose Zone Soil | ESVGSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLCDGALLALAQSNL*
Ga0137365_102236161 | 3300012201 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137365_110142091 | 3300012201 | Vadose Zone Soil | SPVGSGKMRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYAEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPHERRGDAMRYLDNRGKIS*
Ga0137363_101038531 | 3300012202 | Vadose Zone Soil | MRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGERDYARLRDGALLALAQSNLSSASPLP*
Ga0137363_102440402 | 3300012202 | Vadose Zone Soil | VGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLKVMWKKQNF*
Ga0137374_101798263 | 3300012204 | Vadose Zone Soil | RLLPSPVGSGEMRNLLAEYPAWFEPDEVQILVAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0137374_102488984 | 3300012204 | Vadose Zone Soil | MRGYLKDHVGVFEPDEVVILLAAFDKSWGAVQASGVRFPPDELEFVRAILAKHIVAAAMKGERDVGRLRDGALLALAQSNLRSGQA
Ga0137374_105478711 | 3300012204 | Vadose Zone Soil | MRSLLAEYPAWFDPDEVQILLAAFDKAWEAVQASGVTYPADKIESVRTILSKHIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0137362_101126435 | 3300012205 | Vadose Zone Soil | SREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGERDYARLRDGALLALAQSNLSSASPLP*
Ga0137362_106972442 | 3300012205 | Vadose Zone Soil | MRNFLAEHSGAFDPNEVHTLVAAFDKAWEAVQASGVVYPEAKAEAARAILAKHIIAAAKHGERDHARLRDGALLALAQSNLRTAPDPLR*
Ga0137380_105405412 | 3300012206 | Vadose Zone Soil | MRTFLAGHSEAFDPDEVHTLVAAFDKAWETIQASGLVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0137381_117977141 | 3300012207 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRHTNLRNASHQPR*
Ga0137376_101045853 | 3300012208 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATRVVYAEAEAEAVRTILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNSPPPPPR*
Ga0137376_113962051 | 3300012208 | Vadose Zone Soil | MRNFLAEHSGAFDPNEVHTLVAAFDKAWEAVQASGVVYPQAKAEAARAILAKHIIAAAMHGKRDQARLRDGALLALAQSNLR
Ga0137379_102553451 | 3300012209 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASPVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALL
Ga0137379_104905792 | 3300012209 | Vadose Zone Soil | MRTFLAEHSEAFDPDEVHTLIAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0137378_100771081 | 3300012210 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRL
Ga0137378_105573941 | 3300012210 | Vadose Zone Soil | LAEHSEAFNPDEVHTLIVASPEAKAEIVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0137377_100723094 | 3300012211 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATGVAYAEAKAEAVRTILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNSPPPPPR*
Ga0137377_102245815 | 3300012211 | Vadose Zone Soil | DPDEVQILLAAFDKAWEAVQASGVTYPADKIESVRTILSKHIIAAAMSGERDLGRLRDGALLTLAQSNLRSGSASLLS*
Ga0137377_104562212 | 3300012211 | Vadose Zone Soil | VGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGERDYARLRDGALLALAQSNLSSASPLP*
Ga0137377_117093811 | 3300012211 | Vadose Zone Soil | NLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVIYPKAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALAQTNLRNASHQLR*
Ga0137377_118511341 | 3300012211 | Vadose Zone Soil | SPVDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0137370_101172201 | 3300012285 | Vadose Zone Soil | MRTFLAEHSEAFDPDEVHTLVAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLAL
Ga0137370_103537262 | 3300012285 | Vadose Zone Soil | VEKSLPGGRIGEMRGFLKEHSGVFGPDEVHILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNLRSGPASKLS*
Ga0137370_104888782 | 3300012285 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQATGVVCAEAKAEAVRTILAKHIIEAAKQGERDHARLRDGALLALAQSNLRNSPPPPPR*
Ga0137387_108493991 | 3300012349 | Vadose Zone Soil | VDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAIDGERDLGRLRDGALYSPWLNQIRSRP*
Ga0137372_102309211 | 3300012350 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLVAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0137372_103964502 | 3300012350 | Vadose Zone Soil | GSDEVHILVAAFDKAWETVQASGVRYPEAKAEQVRAILAKHIIATAMNGERDLGRLRDGALLALAQSNLRSGSASLLS*
Ga0137386_101171034 | 3300012351 | Vadose Zone Soil | MRNLLAEYPAWFEPNEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNL*
Ga0137366_101347861 | 3300012354 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQTSVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137369_102965221 | 3300012355 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQAGVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALAQSNL*
Ga0137369_108880212 | 3300012355 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGWLRDGALLALAQSTL*
Ga0137371_100505225 | 3300012356 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASPVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137384_102185404 | 3300012357 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRVRDGALLALAQSNL*
Ga0137384_111873361 | 3300012357 | Vadose Zone Soil | MRTFLAEHSEAFDPDEVHTLVAAFDKAWETIQASGVVYPKAKAEIVRAILARHIIAAAKKGERDHGRLRDGALLALAQSNLRT*
Ga0137368_101640933 | 3300012358 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQTSVVIYPEAKAEAARAILAKHIRAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137368_102656732 | 3300012358 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLCDGALLALAQSNL*
Ga0137385_103764771 | 3300012359 | Vadose Zone Soil | PSPVDSGEMRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137385_107692121 | 3300012359 | Vadose Zone Soil | VDSGEMRNLLAEYPAWFEPDEVQILLVAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALYSPWLNQIRSRP*
Ga0137385_107834891 | 3300012359 | Vadose Zone Soil | MRTFLAEHSEAFNPDEVHTLIVAFDRAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0137360_101294913 | 3300012361 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLPAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGRLRDGALLALAQSNP*
Ga0137360_108577762 | 3300012361 | Vadose Zone Soil | MRDLLAEYRAWFEPDEVQILVAAFDKAWETVQASGMVYTEANAEAVRAILAKHIIEAARQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0137360_112533451 | 3300012361 | Vadose Zone Soil | SGEMRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASVVIYPEAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALRLTNLRNASHQPR*
Ga0137360_112952431 | 3300012361 | Vadose Zone Soil | MRNFLAEHSGAFDPNEVDTLVAAFDKAWEAVQASGVVYPEAKAEAARAILAKHIIAAAMHGERDHDGALLALAQSNLRTAPDPR*
Ga0137361_105248352 | 3300012362 | Vadose Zone Soil | MKMRNLLAEYPAWFEPDEVQIFVAAFDDAWEAVQASGVIYPEATAKAARAILAKHIIEAAKQGERDHARLRDGALAALARSNLRSGPASQLR*
Ga0137361_107128162 | 3300012362 | Vadose Zone Soil | SPVGSREMRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLKVMWKKQNF*
Ga0137361_111197211 | 3300012362 | Vadose Zone Soil | PVDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLDRLRDGALLALAQSNL*
Ga0137373_102541082 | 3300012532 | Vadose Zone Soil | VDSGEMRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLGWLRDGALLALAQSNL*
Ga0137395_105285562 | 3300012917 | Vadose Zone Soil | MRDLLAWFDPDEVEILIAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSSASPLP*
Ga0137359_102324081 | 3300012923 | Vadose Zone Soil | MRNFLAEQSEAFDPNEVHTLVAAFDKAWEAVQASGVVYPEAKAEAARAILAKHIIAAAKHGERDHARLRDGALLALAQSNLQT
Ga0137359_104190022 | 3300012923 | Vadose Zone Soil | MKMRNLLAEYPAWFEPDEVQIFVAAFDDAWEAVQASGVIYPEATAKAARAILAKHIIEAAKQGERDHARLRDGALAALARS
Ga0137359_106310111 | 3300012923 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLPAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP*
Ga0137404_109821562 | 3300012929 | Vadose Zone Soil | MRTFLAENSEAFDPDEVHTLVAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLRT*
Ga0137407_111946041 | 3300012930 | Vadose Zone Soil | VGSGEIRNLLAEYPGWFEPDEVQILVAAFDKAWETVQASGIVYTKANAEAVRAILAKHIIEAARQDERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0137407_120798212 | 3300012930 | Vadose Zone Soil | AEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNL*
Ga0164303_112778471 | 3300012957 | Soil | MRNFLAEHSGAFDANEVDTLVAAFDKAWEAVQASGVVYPQAKAEAARAILAKHIIAAAMHGERDHARLRDGALLALAQSNLRTTPDPL
Ga0164302_112664852 | 3300012961 | Soil | PDEVQFLVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0164309_106001122 | 3300012984 | Soil | MRDLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR*
Ga0164308_106094202 | 3300012985 | Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQS
Ga0164305_103122352 | 3300012989 | Soil | MRNLLAEYPAWFEPDEVQILVAAFEKAWGTVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQS
Ga0132257_1031833462 | 3300015373 | Arabidopsis Rhizosphere | PVGSGEMRNLLAEYPAWFEPDEVQILVGAFYKAWETVQATGVVYAEAKAEAVRTILAKHIIEAAEQGERDHTRLRDGALLALAQSNLRNAPPPPPR*
Ga0134074_11624902 | 3300017657 | Grasslands Soil | MRTFLAEHSEAFDPDEVHTLLAAFDKAWETIQASGVVYPEAKAETVRAILAKHIIAAAKNGERDHGRLRDGALLALAQSNLQT
Ga0066655_108835711 | 3300018431 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP
Ga0066667_106807952 | 3300018433 | Grasslands Soil | PAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP
Ga0066662_110365602 | 3300018468 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNL
Ga0210407_104246391 | 3300020579 | Soil | YLRAEYPAWFEPDEVQFLVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR
Ga0210406_109643082 | 3300021168 | Soil | MRDLLAEYPAWFEPDEVQILVAAFDKAWETVQASGMVYAEANAEAVRAILAKHIIEAARQGERDHTRLRDGALLALAQSNLRNSPPPPPR
Ga0242654_103636792 | 3300022726 | Soil | GKMRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLRNSPPPPPR
Ga0207665_112281851 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MRNLLAEYPAWFEPDEVQILVGAFDKAWETVQASGVVYPEAKAEAVRAILAKHIIEAAKQGERDHTRLRDGALLALAQSNLR
Ga0209240_10021681 | 3300026304 | Grasslands Soil | MRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSDISLKVMWKKQNF
Ga0209154_10808622 | 3300026317 | Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERHLGRLRDGALLALAQSNP
Ga0209647_10916902 | 3300026319 | Grasslands Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWEAVQASSIRYPADKLESVRAILAKRIIAAAMDGERDLDRLRDGALLALAQSNL
Ga0257180_10521391 | 3300026354 | Soil | MRDLLAWFDPDEVEILIAAFDKAWETVQASGVVYPEAKAEAARAILAKHIIAAARDGERDYARLRDGALLALAQSNLSSASPLP
Ga0257159_10452192 | 3300026494 | Soil | MRDLLAEYPAWFEPGEVQILVAAFDKAWETVQASGMVYTKANAEAVRAILAKHIIEAARQGERDHTRLRDGALLALAQSNLRNSPPPPPR
Ga0257156_10255203 | 3300026498 | Soil | MRDLLAWLDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGELDYARLRDGALLALAQSNLSSASPLP
Ga0209160_11414331 | 3300026532 | Soil | MRNLLAEYPAWFEPDEVQILLAAFDKAWGAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP
Ga0179587_107842531 | 3300026557 | Vadose Zone Soil | MRDLLAWFDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDGERDYARLRDGALLALAQSNLSSASPLP
Ga0209523_10733672 | 3300027548 | Forest Soil | MRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVIYPKAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALAQTNLRNASHQLR
Ga0137415_102603142 | 3300028536 | Vadose Zone Soil | MRNLLAEYPAWFEPDEVQILLAAFDNAWEAVQASSIRYPADKLESVRAILAKRIIAAAVDGERDLGRLRDGALLALAQSNP
Ga0307277_102872972 | 3300028881 | Soil | MLGFLKEHSGVFDLDEIRTLAAAFDKAWQTAQASGVVYPEAQAEAAREILAKYIIAAAMDGERDCARLRDGALLALAQSNLRTATRPQR
Ga0307469_121556291 | 3300031720 | Hardwood Forest Soil | ELSARFLQSPVGTGKMRNLLAEYPAWFEPDEVQILVAAFDKAWETVQASGVIYPKAKAEAARAILAKHIIEAAKQGERDHARLRDGALLALAQTNLRNASHQQR
Ga0307471_1005409822 | 3300032180 | Hardwood Forest Soil | MRDLLAWLDPDEVEILVAAFDKAWETVQASGVVYSEAKAEAVRAILAKHIIAAARDSEVDYARLRDGALLALAQSNLSSASPLP
Ga0307471_1038303322 | 3300032180 | Hardwood Forest Soil | MRDLLARFDPDEVQILVAAFDKAWETVQASGVIYPEAKAKAARAILAKHIIEAAKQGERDHARLRDGALLALAQANLGNASDQRR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice