NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F069013


Overview

Basic Information
Family ID F069013
Family Type Metagenome
Number of Sequences 124
Average Sequence Length 161 residues
Representative Sequence MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG
Number of Associated Samples 108
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 75.00 %
% of genes near scaffold ends (potentially truncated) 38.71 %
% of genes from short scaffolds (< 2000 bps) 75.81 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.59
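
The percentages in this record can be read back as whole-number counts. The following is a minimal sketch, assuming each percentage is taken over the 124 family sequences reported under Basic Information; the page does not state the denominator, and `percent_to_count` is a hypothetical helper, not part of NMPFamsDB.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Convert a reported percentage to the nearest whole number of members."""
    return round(percent / 100 * total)


# Assumption: the denominator is the "Number of Sequences" value (124).
FAMILY_SIZE = 124

reported = {
    "genes with valid RBS motifs": 75.00,
    "genes near scaffold ends": 38.71,
    "genes from short scaffolds": 75.81,
}

for label, pct in reported.items():
    print(f"{label}: about {percent_to_count(pct, FAMILY_SIZE)} of {FAMILY_SIZE}")
```

Under that assumption, each reported percentage falls within rounding of an integer count, which is consistent with a denominator of 124.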


Most Common Taxonomy
Group Bacteria (64.516 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (14.516 % of family members)
Environment Ontology (ENVO) Unclassified (37.903 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (41.935 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.84%, β-sheet: 32.99%, Coil/Unstructured: 39.18%

Predicted 3D Structure

pTM-score: 0.59

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
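
The note above gives a simple cutoff rule, sketched below. This is only an illustration: the 0.7 threshold comes from the note, while the function name and constant are hypothetical and do not correspond to any NMPFamsDB code.

```python
# Threshold quoted in the note above (pTM < 0.7 means low confidence).
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a predicted model's pTM-score falls below the cutoff."""
    return ptm < threshold


# This family's model has a pTM-score of 0.59, so it is flagged.
print(is_low_confidence(0.59))
```

The comparison is strict, so a model scoring exactly 0.7 would not be flagged under this reading of the note.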



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF02687 | FtsX | 20.97
PF13360 | PQQ_2 | 3.23
PF00118 | Cpn60_TCP1 | 1.61
PF00873 | ACR_tran | 1.61
PF12770 | CHAT | 1.61
PF13193 | AMP-binding_C | 1.61
PF01208 | URO-D | 1.61
PF12704 | MacB_PCD | 0.81
PF13472 | Lipase_GDSL_2 | 0.81
PF13237 | Fer4_10 | 0.81
PF03476 | MOSC_N | 0.81
PF11104 | PilM_2 | 0.81
PF01011 | PQQ | 0.81
PF00206 | Lyase_1 | 0.81
PF13671 | AAA_33 | 0.81
PF00106 | adh_short | 0.81
PF05137 | PilN | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG0407 | Uroporphyrinogen-III decarboxylase HemE | Coenzyme transport and metabolism [H] | 1.61
COG0459 | Chaperonin GroEL (HSP60 family) | Posttranslational modification, protein turnover, chaperones [O] | 1.61
COG3166 | Type IV pilus assembly protein PilN | Cell motility [N] | 1.61
COG3217 | N-hydroxylaminopurine reductase subunit YcbX, contains MOSC domain | Defense mechanisms [V] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 67.74 %
Unclassified | root | N/A | 32.26 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000363|ICChiseqgaiiFebDRAFT_11421910 | Not Available | 693
3300004156|Ga0062589_100101518 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1833
3300004463|Ga0063356_103934215 | Not Available | 640
3300004479|Ga0062595_100217523 | All Organisms → cellular organisms → Bacteria | 1202
3300005093|Ga0062594_100494066 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → unclassified Symbiodinium → Symbiodinium sp. CCMP2592 | 1032
3300005332|Ga0066388_103033862 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 858
3300005353|Ga0070669_101765132 | Not Available | 540
3300005353|Ga0070669_102023685 | Not Available | 503
3300005354|Ga0070675_100020313 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 5300
3300005355|Ga0070671_102106566 | Not Available | 502
3300005356|Ga0070674_100244327 | Not Available | 1407
3300005364|Ga0070673_100262713 | Not Available | 1508
3300005438|Ga0070701_10892967 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 613
3300005440|Ga0070705_100410958 | All Organisms → cellular organisms → Bacteria | 1005
3300005440|Ga0070705_101935929 | Not Available | 502
3300005459|Ga0068867_100211105 | Not Available | 1559
3300005543|Ga0070672_100882286 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 790
3300005543|Ga0070672_101760616 | Not Available | 557
3300005546|Ga0070696_100150056 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1710
3300005841|Ga0068863_101399013 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 707
3300005844|Ga0068862_100514702 | All Organisms → cellular organisms → Bacteria | 1138
3300006196|Ga0075422_10378457 | Not Available | 622
3300006844|Ga0075428_100294160 | All Organisms → cellular organisms → Bacteria | 1746
3300006852|Ga0075433_10007155 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 8837
3300006871|Ga0075434_100000678 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 26446
3300006876|Ga0079217_10038645 | All Organisms → cellular organisms → Bacteria | 1841
3300006880|Ga0075429_101053032 | Not Available | 711
3300006881|Ga0068865_101076000 | Not Available | 707
3300006894|Ga0079215_10299747 | All Organisms → cellular organisms → Bacteria | 887
3300006918|Ga0079216_10814375 | Not Available | 688
3300007004|Ga0079218_10032547 | All Organisms → cellular organisms → Bacteria | 3041
3300007004|Ga0079218_10032866 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3032
3300007004|Ga0079218_10255914 | All Organisms → cellular organisms → Bacteria | 1386
3300007004|Ga0079218_12369883 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 623
3300009078|Ga0105106_10077342 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2451
3300009082|Ga0105099_10408246 | Not Available | 811
3300009094|Ga0111539_10009404 | All Organisms → cellular organisms → Bacteria | 12333
3300009094|Ga0111539_11257479 | Not Available | 859
3300009148|Ga0105243_12005508 | Not Available | 613
3300009156|Ga0111538_10505826 | Not Available | 1531
3300009156|Ga0111538_11001304 | All Organisms → cellular organisms → Bacteria | 1057
3300009176|Ga0105242_10484945 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1172
3300009597|Ga0105259_1000242 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 8457
3300009610|Ga0105340_1017462 | All Organisms → cellular organisms → Bacteria | 2810
3300009802|Ga0105073_1021344 | Not Available | 690
3300009870|Ga0131092_10039910 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6599
3300009873|Ga0131077_10003085 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 40699
3300009987|Ga0105030_101950 | All Organisms → cellular organisms → Bacteria | 1821
3300010362|Ga0126377_11855767 | Not Available | 678
3300010400|Ga0134122_11826487 | Not Available | 640
3300011421|Ga0137462_1022189 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1253
3300011423|Ga0137436_1032011 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1322
3300011429|Ga0137455_1039890 | All Organisms → cellular organisms → Bacteria | 1312
3300012140|Ga0137351_1000830 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3686
3300012225|Ga0137434_1000871 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 2202
3300012901|Ga0157288_10191948 | Not Available | 644
3300012904|Ga0157282_10292992 | Not Available | 570
3300012905|Ga0157296_10035903 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → unclassified Symbiodinium → Symbiodinium sp. CCMP2592 | 1085
3300012912|Ga0157306_10029036 | All Organisms → cellular organisms → Bacteria | 1252
3300014299|Ga0075303_1001022 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Sinobacteraceae → Nevskia → Nevskia soli | 2577
3300014326|Ga0157380_10842721 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → unclassified Symbiodinium → Symbiodinium sp. CCMP2592 | 938
3300015200|Ga0173480_10257651 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 954
3300015258|Ga0180093_1013670 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1604
3300015371|Ga0132258_12252147 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1367
3300015374|Ga0132255_100605798 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1620
3300018051|Ga0184620_10158316 | Not Available | 736
3300018422|Ga0190265_10215470 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1944
3300018422|Ga0190265_10715342 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1123
3300018422|Ga0190265_11124595 | All Organisms → cellular organisms → Bacteria | 905
3300018429|Ga0190272_10021185 | All Organisms → cellular organisms → Bacteria | 3381
3300018466|Ga0190268_10160112 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1165
3300018476|Ga0190274_10734538 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1036
3300018481|Ga0190271_10881188 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1020
3300018481|Ga0190271_13593166 | Not Available | 519
3300019360|Ga0187894_10000808 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 42210
3300019377|Ga0190264_11033880 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 661
3300019458|Ga0187892_10031372 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4312
3300019487|Ga0187893_10040072 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5024
3300020195|Ga0163150_10000031 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 247308
3300020198|Ga0194120_10311800 | Not Available | 784
3300021082|Ga0210380_10391990 | Not Available | 635
3300025567|Ga0210076_1099974 | Not Available | 635
3300025908|Ga0207643_10159862 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1355
3300025923|Ga0207681_11061601 | Not Available | 680
3300025926|Ga0207659_10166210 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1736
3300025938|Ga0207704_10519720 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → unclassified Symbiodinium → Symbiodinium sp. CCMP2592 | 962
3300025940|Ga0207691_10182961 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1830
3300025960|Ga0207651_10263935 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1415
3300026088|Ga0207641_10244423 | All Organisms → cellular organisms → Bacteria | 1674
3300026089|Ga0207648_11300062 | Not Available | 683
3300026118|Ga0207675_100458128 | All Organisms → cellular organisms → Bacteria | 1264
3300027362|Ga0208320_1003204 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1847
3300027637|Ga0209818_1024984 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1325
3300027639|Ga0209387_1039303 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1002
3300027682|Ga0209971_1012504 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2008
3300027818|Ga0209706_10004204 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7714
3300027886|Ga0209486_10001231 | All Organisms → cellular organisms → Bacteria | 11168
3300027886|Ga0209486_10014588 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales | 3582
3300027907|Ga0207428_10206831 | Not Available | 1476
3300027964|Ga0256864_1229651 | Not Available | 531
3300028380|Ga0268265_10252228 | Not Available | 1564
3300030620|Ga0302046_10000375 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 45991
3300031731|Ga0307405_10003823 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales → Steroidobacteraceae → Steroidobacter → Steroidobacter denitrificans | 7005
3300031740|Ga0307468_100053369 | Not Available | 2129
3300031740|Ga0307468_100485440 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 974
3300031740|Ga0307468_100682166 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 855
3300031824|Ga0307413_10253292 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 1307
3300031847|Ga0310907_10861403 | Not Available | 510
3300031901|Ga0307406_11898839 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 531
3300031903|Ga0307407_10090836 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1871
3300031995|Ga0307409_100466772 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1222
3300032002|Ga0307416_100140873 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium SG8_30 | 2191
3300032004|Ga0307414_10530775 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1046
3300032005|Ga0307411_10032989 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3206
3300032005|Ga0307411_10472333 | All Organisms → cellular organisms → Bacteria | 1054
3300032126|Ga0307415_100093983 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2179
3300032144|Ga0315910_10311405 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1196
3300032174|Ga0307470_11245964 | Not Available | 606
3300032174|Ga0307470_11530639 | Not Available | 556
3300032180|Ga0307471_102245145 | Not Available | 688
3300033412|Ga0310810_10045666 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Nevskiales | 5349
3300034149|Ga0364929_0223381 | Not Available | 628
3300034257|Ga0370495_0058874 | Not Available | 1164
3300034354|Ga0364943_0204386 | All Organisms → cellular organisms → Bacteria | 727
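
Each scaffold identifier above has two parts joined by `|`. In the rows listed here, the numeric prefix matches a Taxon OID in the Associated Samples table. The following is a minimal sketch for splitting an identifier on that basis; it assumes the `<taxon OID>|<scaffold name>` layout holds for every entry, and `split_scaffold_id` and `ScaffoldRef` are hypothetical names, not part of any IMG/M or NMPFamsDB API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric sample identifier before the "|"
    scaffold_name: str  # scaffold name after the "|"


def split_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    # Assumption: the prefix is all digits and both parts are non-empty.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = split_scaffold_id("3300000363|ICChiseqgaiiFebDRAFT_11421910")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the taxon OID as a string avoids any change to the identifier when it is used to look up the corresponding sample.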




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 14.52%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 8.87%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 8.06%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 8.06%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 7.26%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 4.84%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 4.84%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 4.03%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 4.03%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.23%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.42%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.61%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.61%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.61%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.61%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.61%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.81%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.81%
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 0.81%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.81%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.81%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.81%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.81%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.81%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.81%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.81%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.81%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81%
Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 0.81%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated
3300005354 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG | Host-Associated
3300005355 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG | Host-Associated
3300005356 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG | Host-Associated
3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated
3300005438 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated
3300005543 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG | Host-Associated
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300006196 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006876 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 | Environmental
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated
3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental
3300006918 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100 | Environmental
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009082 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 1-3cm May2015 | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009597 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 | Environmental
3300009610 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 | Environmental
3300009802 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 | Environmental
3300009870 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Linkou plant | Engineered
3300009873 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Wenshan plant | Engineered
3300009987 | Switchgrass associated microbial communities from Austin, Texas, USA, to study host-microbe interactions - RS_213 metaG | Host-Associated
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011421 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT769_2 | Environmental
3300011423 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT119_2 | Environmental
3300011429 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2 | Environmental
3300012140 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT690_2 | Environmental
3300012225 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2 | Environmental
3300012901 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S119-311C-1 | Environmental
3300012904 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S029-104C-1 | Environmental
3300012905 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S013-104B-2 | Environmental
3300012912 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S163-409C-2 | Environmental
3300014299 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D1 | Environmental
3300014326 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG | Host-Associated
3300015200 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2) | Environmental
3300015258 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_1Da | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental
3300018476 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 T | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental
3300019377 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 T | Environmental
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental
3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental
3300020195 | Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.P2.IB | Environmental
3300020198 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015019 Mahale Deep Cast 65m | Environmental
3300021082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo | Environmental
3300025567 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2 (SPAdes) | Environmental
3300025908 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 (SPAdes) | Host-Associated
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated
3300025926 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG (SPAdes) | Host-Associated
3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated
3300025940 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaG (SPAdes) | Host-Associated
3300025960 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes) | Host-Associated
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027362Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 (SPAdes)EnvironmentalOpen in IMG/M
3300027637Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027639Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control (SPAdes)EnvironmentalOpen in IMG/M
3300027682Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027818Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027964Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeqEnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031731Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-1Host-AssociatedOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031824Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2Host-AssociatedOpen in IMG/M
3300031847Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D4EnvironmentalOpen in IMG/M
3300031901Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-2Host-AssociatedOpen in IMG/M
3300031903Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-1Host-AssociatedOpen in IMG/M
3300031995Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-2Host-AssociatedOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032004Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-3Host-AssociatedOpen in IMG/M
3300032005Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1Host-AssociatedOpen in IMG/M
3300032126Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-2Host-AssociatedOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300034149Sediment microbial communities from East River floodplain, Colorado, United States - 20_j17EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiFebDRAFT_114219101 | 3300000363 | Soil | SPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRAPKPVAVGGAGFPERIDLFVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSFNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLTSAGR*
Ga0062589_1001015182 | 3300004156 | Soil | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVGWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYTRFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNTSMLPPGHYLLRVEGYARDGRLEHFAEARLAAG*
Ga0063356_1039342152 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MHTEISPWIERLSSHRLAIAAGAAFVALVIALAWVSWRWGVAEDRLAMLQEQAAAGFLEAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLREDGTLLLHADQVARDSNLDLRLSFNTSLLAA
Ga0062595_1002175232 | 3300004479 | Soil | MHTEISPWVEKLSPYRLVIAIGAAVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSKFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGK
Ga0062594_1004940661 | 3300005093 | Soil | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0066388_1030338622 | 3300005332 | Tropical Forest Soil | MPTEISPWVEKLSPHRQAIAAVAVLVVLIVAIAWLAYRWNVADDRLALLQKQADAGFLQAPSTSRTVRIDLRAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHADQMLRDSNNDLRLSFNSSILPAGTYCIRVEGYLRSVPQFMAEARLTSAGR*
Ga0070669_1017651321 | 3300005353 | Switchgrass Rhizosphere | VEKLSPYRLVIAIGAVGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLTAG*
Ga0070669_1020236851 | 3300005353 | Switchgrass Rhizosphere | GAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0070675_1000203135 | 3300005354 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG*
Ga0070671_1021065661 | 3300005355 | Switchgrass Rhizosphere | MRTEISPWVEKLAPYRLVIGAVALLVLLVGAIAWLAYRWNIAEDRLAILQKEADAGFLQAPSTSRSVRIDLHAPRSVAVGGVGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHGDQMVRDSNNDLRLSFNTSILPAG
Ga0070674_1002443271 | 3300005356 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALVWVAWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPGLVSVGGSGFPERLDLLLNARTNRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLV
Ga0070673_1002627132 | 3300005364 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGKLERFAEARL
Ga0070701_108929672 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | GLVLVVALAWVGWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYTRFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNTSMLPPGHYLLRVEGYARDGRLEHFAEARLAAG*
Ga0070705_1004109581 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSFNSSILPAGDYRVRV
Ga0070705_1019359291 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | SPWVEKLSPYRQAIAAGAVLVVLIGAIAWLAYRWNVANDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHADEMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFIAEAPLTSAGR*
Ga0068867_1002111052 | 3300005459 | Miscanthus Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAG
Ga0070672_1008822862 | 3300005543 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG*
Ga0070672_1017606161 | 3300005543 | Miscanthus Rhizosphere | PWVEKLAPYRLVIGAVALLVLLVGAIAWLAYRWNIAEDRLAILQKEADAGFLQAPSTSRTVRIDLHAPRSVAVGGVGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHVDQMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGEAQFMAEAPLTSAGR*
Ga0070696_1001500562 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG*
Ga0068863_1013990131 | 3300005841 | Switchgrass Rhizosphere | AVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG*
Ga0068862_1005147022 | 3300005844 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVGWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYTRFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNTS
Ga0075422_103784571 | 3300006196 | Populus Rhizosphere | MPTEISPWVEKLSPYRQAIAASAVLVVLIGAIAWLAYRWNVADDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHADEMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFMAEAPLTSAGR*
Ga0075428_1002941602 | 3300006844 | Populus Rhizosphere | MHTEISPWIEKLSPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRAPRPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGALQFMAEDRLISAGR*
Ga0075433_100071555 | 3300006852 | Populus Rhizosphere | MPTEISPWVEKLSPYRQAIAASAVLVVLIGAIAWLAYRWNVADDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHVDQMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFMAEAPLTSAGR*
Ga0075434_10000067823 | 3300006871 | Populus Rhizosphere | ASAVLVVLIGAIAWLAYRWNVADDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHVDQMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFMAEAPLTSAGR*
Ga0079217_100386452 | 3300006876 | Agricultural Soil | MHTEISPWIERIKPHRLTIGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPVGQSRIVAG*
Ga0075429_1010530321 | 3300006880 | Populus Rhizosphere | AAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVDARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGALQFMAEDRLISAGR*
Ga0068865_1010760001 | 3300006881 | Miscanthus Rhizosphere | YRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0079215_102997471 | 3300006894 | Agricultural Soil | TEISPWIERIKPHRLTIGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPLGQSRIVAG*
Ga0079216_108143752 | 3300006918 | Agricultural Soil | MHTEISPWIERLSPYRLAIAAGGVFLLLLVALGWMAWRWGVAADRMALMQKQAQAGFLQAPSSTRSVSVDLRAAGAVAVGGREFPERIDLRLNARTNRYARFRVSLLREDGTLLLHAEQLVRDSNQDLRLSLNTSLLPAGRYALRVDGYGRGGGLE
Ga0079218_100325472 | 3300007004 | Agricultural Soil | MHTEISPWIERIKQHRLTIGIGAALLLLAVAFAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRAVSLGGGDFPERVDFLVNARTPRFARFRVSLLCDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGHARGGKLERFGEAQLRAAGR*
Ga0079218_100328662 | 3300007004 | Agricultural Soil | MHTEISPWIERIRPHRLAIGIGTAMLVLVVALVWVAWRWGVAEDRMEIMQRQADAGFLQAPSTNRTVRIDLRAPRAVSVGGGDFPERVDFVLNARTSRFSRFRVSLLREDGTFLLHADQQVRDSNLDLRLSVNSSMLPAGAYVLRVEGYARGGKLERLGEAPLRVAGR*
Ga0079218_102559142 | 3300007004 | Agricultural Soil | MHTEISPWIERIKPHRLTIGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPLGQSRIVAG*
Ga0079218_123698831 | 3300007004 | Agricultural Soil | MHTEISPWIERIKPHRLAIGIGAAMLVLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRAVSLGGGDFPERVDLLVNARTPRFARFRVSLLRDDGTLLLQADQMVRDSNQGLRLSLNTSLLPAGRYVLRVEGYARSGKLERFAE
Ga0105106_100773422 | 3300009078 | Freshwater Sediment | MPTEISPWIERLRPHRLAIGTALLILTLAVALAWVAWRWGVAEDRLAMLERQAEAGFLQAPTSSRTVRIDLRAPGTVSVGGGEFPERIDLRVNARSDRYSRFRVSLVRDDGTLLFHADQLVRDSNYDLRLSFNTSILPAGRYLVRVEGYARAGQLAPFAQALILVSAP*
Ga0105099_104082463 | 3300009082 | Freshwater Sediment | ALAWVAWRWGVAEDRLAMLERRAEAGFLQAPTSSRTVRIDLRAPGTVSVGGGEFPERIDLRVNARSDRYSRFRVSLVRDDGTLLFHADQLVRDSNYDLRLSFNTSILPAGRYLVRVEGYARAGQLAPFAQALILVSAP*
Ga0111539_100094043 | 3300009094 | Populus Rhizosphere | MHTEISPWIEKLSPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRTPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGALQFMAEDRLISAGR*
Ga0111539_112574792 | 3300009094 | Populus Rhizosphere | MPTEISPWVEKLSPYRQAIAASAVLVVLIGAIAWLAYRWNVADDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHVDQMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFIAEAPLTSAGR*
Ga0105243_120055081 | 3300009148 | Miscanthus Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQF
Ga0111538_105058261 | 3300009156 | Populus Rhizosphere | MHTEISPWIEKLSPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRTPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGY
Ga0111538_110013041 | 3300009156 | Populus Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRAPRPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGALQFMAEDRLISAGR*
Ga0105242_104849452 | 3300009176 | Miscanthus Rhizosphere | PYRLVIAIGAVGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG*
Ga0105259_10002426 | 3300009597 | Soil | MHTEISPWVEKLKPHRITIGIGALVLVLLVALIWVAWRWGVAEDRMEMMQKQAEAGFLQAPSSTRMVRIDLRAPRLVSIGGGAFPERIDLLMNARTKQYARFRVSLLRDDGTLLVHADQVVRDSNFDLRFSFNSSILPAGAYVIRVEGYGRGGKLERFAEAKLSAS*
Ga0105340_10174621 | 3300009610 | Soil | MPTEISPWVNTLKEHKVVVGTATALLALVVALVWVAWRWGVAEDRMEMLEQQAAKGFLQAPSSNRSVRIDLRAPRLVPIDGGGFPQRVDLLINARTTQYARFRVSLVRADGTLLVHADQMVRDSNNDLRLSFNTSMLPNGQYEIRVEGYARGGKLAHFGEAPMQVSGR*
Ga0105073_10213441 | 3300009802 | Groundwater Sand | MHTEISPWIEKLKERRVVFGTAGIMLVLAVALIWVAWRWNVAEDRLEMLEKRAGEGFLQAPSTNRTVRIDPRSPRLVVVGGGDFPERIDLLINARTNQYARFRVSLARDDGTLLLHADQMVRDSNYDLRLSLNTSILPVGRYLIRVEGYARGGKMQRFAEAPLQVAGK*
Ga0131092_100399104 | 3300009870 | Activated Sludge | MHTEISPWIEKLSPYRLVIALGAAFLVLLVALAWVAWRWSVAEDRVAMMQQQAAAGFLQAPSSNRTVRIDLRNPSRVGVGGGKFPERIDFLVNARTERYARFRVSLLRDDGTLIVHADQMVRDSNMDLRLSLNTSILPAGHYVLRVEGYGRKGQLERFAEARLSAAPAAR*
Ga0131077_100030857 | 3300009873 | Wastewater | MHTEISPWIEKLSPYRLAIALGAAFLALLVALAWVGWRWSVAEDRMEMMRQQAAAGFLQAPSSNRTVRIDLRKPGNVGIGGGEFPERVDFLVNARTQRYVRFRVSLLRDDGTFVARADQMVRDSNMDLRLSFNTSILPEGQYVLRVEGYGRRGQLERFAEARLTAAPTTR*
Ga0105030_1019503 | 3300009987 | Switchgrass Rhizosphere | MHTEISPWIEKLSPYRLVIALGAAFLALVVALAWVGWRWSVALDRMEIMQRQAAAGFLQAPTSSRTVRIDLRNPSSVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGTLLLHADQLVRDSNMDLRFSLNSSILPQGRYVIRVEGYG
Ga0126377_118557672 | 3300010362 | Tropical Forest Soil | TEISPWVERLSPYRYVIAAGAVLVLLIGAIGWLAWRWSIAEDRLALLQKQADAGFLQAPSSSRTVRIDLRAPRPVSVGGAGFPERIDLLVNARTDRFARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSFNTSILPAGTYRVRVEGYARNGALQFMAEDRLSSAGR*
Ga0134122_118264871 | 3300010400 | Terrestrial Soil | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSFLPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0137462_10221891 | 3300011421 | Soil | ATALLVLLVALIWVAWRWSVAEDRMEMLEAKAAAGFLQAPSSTRTLRVDLRASRLVSVDGGGFPQRIDLLVNARTKQYARFRVSLVRADGTLLFHADQMVRDSNDDLRLSFNTSMLPDGRYEIRVEGFARSGKMERFGEAPLQVSGR*
Ga0137436_10320112 | 3300011423 | Soil | MHTEISAWVEKLSPYRLVIAIGAGVLVLIVALAWVAWRWSVAEDRMELLQKQAEAGFLQAPSTNRTVRIDLRAPASVTVGGGDFPERVDLVLNARTDRYARFRVSLLREDGTLLMHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARSGALERFAEARLAAG*
Ga0137455_10398902 | 3300011429 | Soil | MPTEISPWVNTLKEHKVVVGTATALLALVVALVWVAWRWGVAEDRMEMLEQQAAKGFLQAPSSNRSVRIDLRAPRLVPIDGGGFPQRVDLLINARTTQYARFRVSLVRNDGTLLIHADQMVRDSNNDLRLSFNTSMLPDGRYEIRVEGYARGGKMEHFGEAPMQVSGR*
Ga0137351_10008302 | 3300012140 | Soil | MHTEISPWVEKLKPHRITIGIGALVLVLLVELIWVAWRWGVAEDRMEMMQKQAEAGFLQAPSSTRMVRIDLRAPRLVSIGGGAFPERIDLLMNARTKQYARFRVSLLRDDGTLLVHADQVVRDSNFDLRFSFNSSILPAGAYVIRVEGYGRGGKLERFAEAKLSAS*
Ga0137434_10008712 | 3300012225 | Soil | MHTEISPWVEKLKPHRITIGIGALVLVLLVALIWVAWRWGVAEDRMEMMQKQAEAGFLQAPSSTRMVRIDLRAPRLVSIGGGALPERIDLLMNARTKQYARFRVSLLRDDGTLLVHADQVVRDSNFDLRFSFNSSILPAGAYVIRVEGYGRGGKLERFAEAKLSAS*
Ga0157288_101919481 | 3300012901 | Soil | RARLRRFPLTALGIALVLVVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRTVRIDLRAPKPVAVAGTGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHGDQMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0157282_102929921 | 3300012904 | Soil | PWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRAVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYTRFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0157296_100359032 | 3300012905 | Soil | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYTRFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0157306_100290362 | 3300012912 | Soil | MHTEISPWIEKLSPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR*
Ga0075303_10010223 | 3300014299 | Natural And Restored Wetlands | MHTEISPWIEKLTPYRLVIALGAAFLVLLIALAWVAWRWSVAEDRMEMMQQQATEGFLQAPSSTRTVRIDLRNPSRVGVGGGKFPERIDFLVNARTERYARFRVSLLRDDGTLIVHADQMVRDSNMDLRLSLNTSILPAGHYVLRVEGYGRKGQLERFAEARLTAAPAAR*
Ga0157380_108427211 | 3300014326 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGVVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVLVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFADVRRRFRCETTEELLARAAEARAELEALDQGLDPAEAARAAVE
Ga0173480_102576512 | 3300015200 | Soil | EKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMQLMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG*
Ga0180093_10136701 | 3300015258 | Soil | LVLLVALIWVAWRWGVAEDRMEMMQKQAEAGFLQAPSSTRMVRIDLRAPRLVSIGGGAFPERIDLLMNARTKQYARFRVSLLRDDGTLLVHADQVVRDSNFDLRFSFNSSILPAGAYVIRVEGYGRGGKLERFAEAKLSAS*
Ga0132258_122521472 | 3300015371 | Arabidopsis Rhizosphere | MPTEISPWVEKLSPYRQAIAAGAVLVVLIGAIAWLAYRWNVADDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHVDQMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFIAEAPLTSAGR*
Ga0132255_1006057982 | 3300015374 | Arabidopsis Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNSRTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG*
Ga0184620_101583162 | 3300018051 | Groundwater Sediment | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVGWRWSVAEGRMELMKKQAEIGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYARFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNSSMLPPGHYLLRVEGYARDGRLEHFAEARFVAG
Ga0190265_102154701 | 3300018422 | Soil | MHTEISPWIERLKLHRLAIGIGSAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRAVSLGGGDFPERVDFLVNARTTRYARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGDYLLRVEGFARGGKLERFGEAQLRVAGR
Ga0190265_107153422 | 3300018422 | Soil | MHTEISPWIERIKPHRLAIGIAAALLLLAVALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTNRTVRIDLRAPGGLTVGGRDFPERIDLRMNARSNRHARFRASLLREDGTLLLHADQMVRDSNQDLRLSLNTSLLPAGRYVLRVEGYARGGKLERFAEARLVAN
Ga0190265_111245952 | 3300018422 | Soil | MHTEISPWIERIKPHRLAIGIGAAMLVLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRAVSLGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLERFGEAQLRAAGR
Ga0190272_100211852 | 3300018429 | Soil | MPTEISPWIDKVKEHKVVVGAATALVVLLVALVWVAWRWGVAEDRMEMLEAKAAAGFLQAPSSTRNLRVDLRAPRLVSLDGGGFPQRIDLRVNARTKQYARFRVSLVRADGTLLFHADQMVRDSNDDLRFSFNTSMLPDGRYEIRVEGYARGGKMERFGEAPMQVSGR
Ga0190268_101601122 | 3300018466 | Soil | MHTEISPWIERLSPYRLYIAAAGLILILLAALAWVAWRWGVAEDRMEFMQKQAEAGFLQAPSSTRSVRVDLRAPGTVAVGGRDFPERIDLRLNARTTRYARFRVSLLREDGTLLLHADQLVRDSNMDLRLSFNTSLLPAGPYVLRVEGYARGGKLERVGEARVVAA
Ga0190274_107345381 | 3300018476 | Soil | HRLAIALGGVILFLLITLAWVAWRWTVVEDRMEFMQKQAEAGFLQAPSSTRSVRVDLRAAGTVAVGGRDFPERVDLRLNARTTRYARFRVSLLREDGTLLLHADQLVRDSNQDLRLALNTSLLPAGPYVLRVEGYARGGKLERVGEARVVAA
Ga0190271_108811882 | 3300018481 | Soil | MHIEISPWIERLSAYRLAIALGGVILVLLTTLAWVAWRWTVVEDRMEFMQKQAEAGFLQAPSSTRSVRVDLRAAGTVAVGGRDFPERIDLRLNARTDRYARFRVSLLREDGTLLLHADQLVRDSNMDLRLSFNTSLLPAGPYVLRVEGYARGGKLERVGEARVVVA
Ga0190271_135931661 | 3300018481 | Soil | MHTEISPWIERLSPYRLAIAVGSAILVLLVALVWVAWRWGVAEDRMELMQKQADTGFLQAPSSTRSVRLDLRTPGTVSVGGRDFPERIDLRLNARTDRHSRFRVSLLREDGTLLLHADQLVRDSNQDLRLSINTSILPRGDYELNVEGF
Ga0187894_1000080810 | 3300019360 | Microbial Mat On Rocks | MPTEIGPWIDRLREHRLAVGAVTALVALAVALIWVAWRWGVAEDRMDMLQKQADAGFLQAPSTNRLVRVDLRSPGAVSIGGGDFPERVDFLLNARSDRYARFRVSLLRQDGTLVFRADQLVRDSNMDLRLSLNSSALPQGNYVIRVEGHARDGRTEPFSEVTLRALGKQ
Ga0190264_110338801 | 3300019377 | Soil | MNTEILPLVERLSPHRLAIALGGVILVLLITLAWVAWRWTVVEDRMEFMQKQAEAGFLQAPSSTRSVRADLRAAGTVAVGGRDFPERIDLRLNARTTRYARFRVSLLREDGTLLLHADQLVRDSNMDLRLSFNTSLLPAGPYVLRVEGYARGGKLERVGEARVVAA
Ga0187892_100313724 | 3300019458 | Bio-Ooze | MHTEISSWIEKLKQHRVVVSTAGTMLVLAVALIWVAWRWNIANDRLEMLEQKAAVGFLQAPSSNRTVRLDLRSPRVVVVGGGGFPERVDLLVNARTNQYARFRVSLVRDDGTLLVHADQMVRDSNYDLRLSFNTSMLPVGRYLIRVEGYARGGQMQRFAEAKMQVVGQ
Ga0187893_100400722 | 3300019487 | Microbial Mat On Rocks | MHTEISPWIERLSPYRLAIALGAAFFALLVALIWVAWRWNVAEDRMTLMQRQAEAGFLQAPSTNRTVRIDLRAPRSVAIGGREFPERVDLLVNARTPRHARFRVSLLRDDGTLLLHADQMVRDSNQDLRLSLNTSMLPAGPYRIQVEGYARGGRLERFAAVPLRVGGR
Ga0163150_1000003191 | 3300020195 | Freshwater Microbial Mat | MPTEISPWIERLRPHRLAIGAVLLIVTLAVALAWVAWRWGVAADRMAMLQRQAEAGFLQAPTSSRTVRIDLRAPGTVAVGGGEFPERIDLRVTARTKRYARFRASLARADGTLLFHADQLVRDSNQELRVSLNTSILPQGDYVLRVEGYGRGDKLERFAEAKIRVP
Ga0194120_103118002 | 3300020198 | Freshwater Lake | MPTEIGPWIERLREHRLAVGAATTLIVLAVALAWVAWRWGVAEDRMEMLQKQADAGFLQAPSSTRIVSVDLRNPGAVSIGGGDFPERVDFLLNARSDRYARFRVSVLREDGTLVLHADQMVRDSNLDLRFSLNDSVLPPGEYLIRVEGYARGGNLERFAEAALRAGGR
Ga0210380_103919902 | 3300021082 | Groundwater Sediment | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALVWVAWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRVDLRAPGLVSVGGSGFPERLDLLLNARTNRYARFRVSLLREDGTLLVHAGPLVRDSNFDLRFSLNSSILPVG
Ga0210076_10999741 | 3300025567 | Natural And Restored Wetlands | MHTEISPWIEKLSPYRLVIALGAAFLVLLVALAWVAWRWSVAEDRMDLMQQQAAAGFLQAPSSNRLVRIDLHAPSRVGIGGGEFPERVDFLVNARTKLYARFRVSLLRDDGTLIVHADQMVRDSNMDLRLSLNSSILPAGHYVLRVEGYGRGGRLERFAEARLTAA
Ga0207643_101598622 | 3300025908 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVGWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYTRFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNTSMLPPGHYLLRVEGYARDGRLEHFAEARLAAG
Ga0207681_110616011 | 3300025923 | Switchgrass Rhizosphere | VIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR
Ga0207659_101662102 | 3300025926 | Miscanthus Rhizosphere | AVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG
Ga0207704_105197202 | 3300025938 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG
Ga0207691_101829612 | 3300025940 | Miscanthus Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG
Ga0207651_102639352 | 3300025960 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGKLERFAEARHASG
Ga0207641_102444232 | 3300026088 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAVLVLVVALAWVAWRWSVAEDRMELMQKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSELPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIWFSLNSSILPAGRYLVRVEGYARNGKLERFAEARLASG
Ga0207648_113000622 | 3300026089 | Miscanthus Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAED
Ga0207675_1004581282 | 3300026118 | Switchgrass Rhizosphere | MHTEISPWIEKLSPYRFVIAGAAVLVLLVGAIAWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRTVRIDLRAPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGVLQFMAEDRLISAGR
Ga0208320_10032041 | 3300027362 | Soil | MHTEISPWVEKLKPHRITIGIGALVLVLLVALIWVAWRWGVAEDRMEMMQKQAEAGFLQAPSSTRMVRIDLRAPRLVSIGGGAFPERIDLLMNARTKQYARFRVSLLRDDGTLLVHADQVVRDSNFDLRFSFNSSILPAGAYVIRVEGYGRGGKLERFAEAKLSAS
Ga0209818_10249841 | 3300027637 | Agricultural Soil | IGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPVGQSRIVAG
Ga0209387_10393032 | 3300027639 | Agricultural Soil | MHTEISPWIERIKPHRLTIGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPLGQSRIVAG
Ga0209971_10125042 | 3300027682 | Arabidopsis Thaliana Rhizosphere | MHTEISPWIERLSRHRLAIAAGAAFVALVIALAWVSWRWGVAEDRLAMLQEQAAAGFLEAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLREDGTLLLHADQVARDSNLDLRLSFNTSLLAAGSYLLRVEGYAPGGRLERLGVARIAAS
Ga0209706_100042043 | 3300027818 | Freshwater Sediment | MPTEISPWIERLRPHRLAIGTALLILTLAVALAWVAWRWGVAEDRLAMLERQAEAGFLQAPTSSRTVRIDLRAPGTVSVGGGEFPERIDLRVNARSDRYSRFRVSLVRDDGTLLFHADQLVRDSNYDLRLSFNTSILPAGRYLVRVEGYARAGQLAPFAQALILVSAP
Ga0209486_100012319 | 3300027886 | Agricultural Soil | MHTEISPWIERIKPHRLTIGIGAAMLLLAVALAWVAWRWGVAEDRMELMQKQAEAGFLQAPSTNRTVRIDLRAPRVVSIGGGDFPERVDFLVNARTPRFARFRVSLLRDDGALLLHADQMVRDSNLDLRLSVNTSMLPAGAYLLRVEGYARGGKLEPVGQSRIVAG
Ga0209486_100145882 | 3300027886 | Agricultural Soil | MHTEISPWIERIRPHRLAIGIGTAMLVLVVALVWVAWRWGVAEDRMEIMQRQADAGFLQAPSTNRTVRIDLRAPRAVSVGGGDFPERVDFVLNARTSRFSRFRVSLLREDGTFLLHADQQVRDSNLDLRLSVNSSMLPAGAYVLRVEGYARGGKLERLGEAPLRVAGR
Ga0207428_102068312 | 3300027907 | Populus Rhizosphere | MHTEISPWIEKLSPYRLVIAGAAVLVLLVGAIAWLAYRWNIAEDRLEILQKQADAGFLQAPSSSRAVRIDLRTPKPVAVGGAGFPERIDLLVNARTDRYARFRVSLLRDDGTLIVHADHMVRDSNYDLRLSLNSSILPAGDYRVRVEGYVRGGALQFMAEDRLISAGR
Ga0256864_12296511 | 3300027964 | Soil | KLQDHKVLIGTATAILALVVVLIWVAWRWGVAEDRLEMLEKQAATGFLQAPSSNRTVRIDPRSPRVVTVGGGEFPERIDLLINARTNQHSRFRVSLVRHDGTLLVHADQVVRDSNYDLRLSFNTSMLPEGPYLIRVEGYARGGKLQRFAEAQMRVVGR
Ga0268265_102522281 | 3300028380 | Switchgrass Rhizosphere | MHTEISPWVEKLSPYRLVIAIGAAGLVLGVALAWVGWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERVDFVLNARTQRYTRFRVSLLRDDGALLVHADQLVRDSNFDLRLSFNTS
Ga0302046_1000037511 | 3300030620 | Soil | MPTEISPWIDKLQDHKVLIGTATAILALVVVLIWVAWRWGVAEDRLEMLEKQAATGFLQAPSSNRTVRIDPRSPRVVTVGGGEFPERIDLLINARTNQHSRFRVSLVRHDGTLLVHADQVVRDSNYDLRLSFNTSMLPEGPYLIRVEGYARGGKLQRFAEAQMRVVGR
Ga0307405_100038232 | 3300031731 | Rhizosphere | MPTEISPWIQRLSAHRIAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIAVGGRDFPERIDLRVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVISG
Ga0307468_1000533692 | 3300031740 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAIAAAGLVLVVALVWVAWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPALVSVGGSEFPERIDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRIEGYARNGRLERFAEARLAAS
Ga0307468_1004854402 | 3300031740 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAAGAVLALLLGAIGWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRAVRIDLRAPRAVTVGGVGFPERIDLFVNARTDRYARFRVSLLRDDGTLIVHADQMVRDSNNDLRLSFNTSILPAGAYRVRVEGYVRGGVLQLMAETRLTSAGR
Ga0307468_1006821662 | 3300031740 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEVGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERLDFVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARSGALERFAEARFVAG
Ga0307413_102532922 | 3300031824 | Rhizosphere | MPTENSPWIQRLSAHRIAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIAVGGRDFPERIDLQVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVIAG
Ga0310907_108614031 | 3300031847 | Soil | SMHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRVDLRAPALVSVGGSEFPERLDLVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRVEGYARNGELERFAEARLASG
Ga0307406_118988391 | 3300031901 | Rhizosphere | FAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIAVGGRDFPERIDLQVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVIAG
Ga0307407_100908362 | 3300031903 | Rhizosphere | MPTEISPWIQRLSAHRIAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIEVGGRDFPERIDLRVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVISG
Ga0307409_1004667722 | 3300031995 | Rhizosphere | MHTEISPWIERLSSHRLAIAAGAAFVALVIALAWVSWRWGVAEDRLAMLQEQAAAGFLDAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLREDGTLLLHADQVARDSNLDLRLSFNTSLLAAGSYLLRVEGYAPGGRLERLGVARIAAS
Ga0307416_1001408732 | 3300032002 | Rhizosphere | MPTENSPWIQRLSAHRIAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIAVGGRDFPERIDLRVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVISG
Ga0307414_105307752 | 3300032004 | Rhizosphere | MHTEISPWIERLSSHRLAIAAGAACVALVIAFAWVSWRWGVAEDRLAMLQEQAAAGFLEAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLRDDGTLLLHADQVARDSNLDLRLSFNTSLLAAGSYLLRVDGYAPGGRLERLGEARIAAS
Ga0307411_100329892 | 3300032005 | Rhizosphere | MPTENSPWIQRLSAHRIAVAAGGLLLALAIALAWVAWRWGVAEDRMELLQKQAEAGFLQAPSTTRAVRVDLRAPGTIEVGGRDFPERIDLRVNARTDRFARFRLSLLRDDGTLLVHADQVVRDSNMDLRLSFNTSLLPAGRYVLRVEGYARGGRLEHLGEARVISG
Ga0307411_104723332 | 3300032005 | Rhizosphere | MHTEISPWIERLSRHRLAIAAGAAFVALVIALAWVSWRWGVAEDRLAMLQEQAAAGFLEAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLREDGTLLLHADQVARDSNLDLRLSFNTSLLAAGSYLLR
Ga0307415_1000939832 | 3300032126 | Rhizosphere | MHTEISPWIERLSSHRLAIAAGAACVALVIALAWVSWRWGVAEDRLAMLQEQAAAGFLEAPSSTRAVRVDLRAPGTIPVGGRAFPERIDLRVNARSGRHSRFRVSLLRDDGTLLLHADQVARDSNLDLRLSFNTSLLAAGSYLLRVDGYAPGGRLERLGEARIAAS
Ga0315910_103114052 | 3300032144 | Soil | MHTEISPWIEKLSPYRLAIAAGGAILVLVVALAWVAWRWNVAEGRMELMQKQAEAGFLQAPSSTRSVRLDLRDPGTVSVGGRDFPERIDLRLNARSDRYARFRVSLLREDGTLLLHADQLVRDSNQDLRLAFNTSLLPAGRYVIRVEGYGRGGKLEHFAEARLVAG
Ga0307470_112459641 | 3300032174 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAIAAAGLVLVVALVWVAWRWSVAEGRMELMKKQAEVGFLQAPSTNRTVRIDLRAPALVSVGGSEFPERIDLVLNARTDRYARFRVSLLRDDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYLVRIEGY
Ga0307470_115306391 | 3300032174 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAIGAAGLVLVVALAWVAWRWSVAEDRMELMKKQAEAGFLQAPSTNRTVRIDLRAPASVSVGGGEFPERLDFVLNARTDRYARFRVSLLREDGTLLVHAGPLVRDSNFDIRFSLNSSILPAGRYVVRVEGYARSGALEHFAEARFVA
Ga0307471_1022451451 | 3300032180 | Hardwood Forest Soil | MHTEISPWVEKLSPYRLVIAAGAVLVLLLGAIGWLAYRWNIAEDRLELLQKQADAGFLQAPSSSRAVRIDLRAPRAVTVGGVGFPERIDLFVNARTDRYARFRVSLLRDDGTLIVHADQMVRDSNNDLRLSFNTSILPAGAYRVRVEGYVRGGVLQLMAETRLTSAGR
Ga0310810_100456662 | 3300033412 | Soil | MPTEISPWVEKLSPYRQAIAAGAVLVVLIGAIAWLAYRWNVANDRLALLQQQADAGFLQAPSTSRTVRIDLHAPRAVSVGGVGFPERVDLLVNARTDRYARFRMSLLRDDGTLIVHADEMLRDSNNDLRLSFNSSILPAGAYRVRVEGYLRAGETQFMAEAPLTSAGR
Ga0364929_0223381_199_627 | 3300034149 | Sediment | MHTEISPWVERLSPYRLVIAIGAAGFVLVVALAWVAWRWSVAEDRMDLMKKQAEVGFLQAPSTNRTVRIDLHAPALVSVGGQEFPERVDLVLNARTDRYARFRVSLLREDGTLLVHADQMVRDSNFDIRFSLNSSILPAGRYL
Ga0370495_0058874_255_761 | 3300034257 | Untreated Peat Soil | MPTEISPWVDKVKEHKVVVAGATALLVLLVALAWVAWRWGVAEDRMEMLDAKAAAGFLQAPSSTRNLRVDLRAPKLVAIDGSGFPQRIDLLVNARTKQFARFRVSLVRADGTLLFHADQMVRDSNDDLRISFNTSMLPDGRYEIRVEGYARGGKRVHFGEAPMQVSGR
Ga0364943_0204386_7_480 | 3300034354 | Sediment | LKEHKVVVGTATALLALVVALVWVAWRWGVAEDRMEMLEQQAAKGFLQAPSSNRSVRIDLRAPRLVPIDGGGFPQRVDLLINARTTQYARFRVSLVRNDGTLLIHADQMVRDSNNDLRLSFNTSMLPDGRYEIRVEGYARGGKMEHFGEAPMQVSGR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"