NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F054831



Overview

Basic Information
Family ID F054831
Family Type Metagenome / Metatranscriptome
Number of Sequences 139
Average Sequence Length 129 residues
Representative Sequence ELCPVASLSIGSGPNDIVGSGVDMSSRTFAFGAAIGALVGHSTQVRILPNASFQFANTRLSLDDGTTSAASSESYGLLTLGTGFVFNSRFSINPSISIPMGLDGSNTSFGLVGAMNFGR
Number of Associated Samples 128
Number of Associated Scaffolds 139

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.73 %
% of genes near scaffold ends (potentially truncated) 94.96 %
% of genes from short scaffolds (< 2000 bps) 90.65 %
Associated GOLD sequencing projects 119
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (98.561 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (21.583 % of family members)
Environment Ontology (ENVO) Unclassified (30.216 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (39.568 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 2.04%, β-sheet: 58.50%, Coil/Unstructured: 39.46%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
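
The cutoff quoted above (pTM < 0.7) amounts to a one-line check. The following is a minimal Python sketch; the function and constant names are illustrative and are not part of NMPFamsDB.

```python
# Cutoff taken from the note above; the constant name is hypothetical.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# The pTM-score reported for family F054831.
print(is_low_confidence(0.66))
```

With the reported pTM-score of 0.66, the check flags this family's model as low confidence, matching the note.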



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name               % Frequency in 139 Family Scaffolds
PF14542   Acetyltransf_CG    7.19
PF06762   LMF1               5.04
PF13505   OMP_b-brl          2.88
PF12802   MarR_2             1.44
PF09209   CecR_C             1.44
PF03169   OPT                1.44
PF03741   TerC               1.44
PF07676   PD40               1.44
PF00657   Lipase_GDSL        0.72
PF00069   Pkinase            0.72
PF03575   Peptidase_S51      0.72
PF14023   DUF4239            0.72
PF01381   HTH_3              0.72
PF05726   Pirin_C            0.72
PF00144   Beta-lactamase     0.72
PF08327   AHSA1              0.72
PF00266   Aminotran_5        0.72
PF13360   PQQ_2              0.72
PF09722   Xre_MbcA_ParS_C    0.72
PF13302   Acetyltransf_3     0.72
PF07670   Gate               0.72
PF13570   PQQ_3              0.72
PF13579   Glyco_trans_4_4    0.72
PF00383   dCMP_cyt_deam_1    0.72
PF13432   TPR_16             0.72
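
Because each frequency in the table above is reported relative to the 139 family scaffolds, a percentage can be converted back into an approximate scaffold count. The sketch below assumes the table values are counts divided by 139 and rounded to two decimals; the helper name is hypothetical.

```python
FAMILY_SCAFFOLDS = 139  # total stated in the table header


def scaffold_count(pct_frequency: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a '% frequency' value corresponds to."""
    return round(pct_frequency / 100 * total)


# A few rows from the Pfam table above.
for pfam_id, pct in [("PF14542", 7.19), ("PF06762", 5.04), ("PF00657", 0.72)]:
    print(pfam_id, scaffold_count(pct))
```

Under that assumption, 7.19 % corresponds to 10 scaffolds and 0.72 % to a single scaffold.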

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name                                                    Functional Category                          % Frequency in 139 Family Scaffolds
COG0515   Serine/threonine protein kinase                         Signal transduction mechanisms [T]           2.88
COG0861   Tellurite resistance membrane protein TerC              Inorganic ion transport and metabolism [P]   1.44
COG1297   Predicted oligopeptide transporter, OPT family          General function prediction only [R]         1.44
COG1680   CubicO group peptidase, beta-lactamase class C family   Defense mechanisms [V]                       0.72
COG1686   D-alanyl-D-alanine carboxypeptidase                     Cell wall/membrane/envelope biogenesis [M]   0.72
COG1741   Redox-sensitive bicupin YhaK, pirin superfamily         General function prediction only [R]         0.72
COG2367   Beta-lactamase class A                                  Defense mechanisms [V]                       0.72



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
All Organisms   root   All Organisms   98.56 %
Unclassified    root   N/A             1.44 %

Associated Scaffolds


Scaffold   Taxonomy   Length   IMG/M Link
2170459016|G1P06HT01A1YN1All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300000033|ICChiseqgaiiDRAFT_c0629684All Organisms → cellular organisms → Bacteria526Open in IMG/M
3300000890|JGI11643J12802_10426048All Organisms → cellular organisms → Bacteria545Open in IMG/M
3300000891|JGI10214J12806_12045693All Organisms → cellular organisms → Bacteria580Open in IMG/M
3300003321|soilH1_10187691All Organisms → cellular organisms → Bacteria → Proteobacteria5417Open in IMG/M
3300003987|Ga0055471_10313690All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300003992|Ga0055470_10083947All Organisms → cellular organisms → Bacteria762Open in IMG/M
3300004020|Ga0055440_10068229All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300004643|Ga0062591_100397341All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1138Open in IMG/M
3300004808|Ga0062381_10165210All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300005093|Ga0062594_102805143All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300005329|Ga0070683_101455184All Organisms → cellular organisms → Bacteria658Open in IMG/M
3300005334|Ga0068869_101690195All Organisms → cellular organisms → Bacteria565Open in IMG/M
3300005336|Ga0070680_101224640All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300005356|Ga0070674_101561912All Organisms → cellular organisms → Bacteria594Open in IMG/M
3300005434|Ga0070709_11410486All Organisms → cellular organisms → Bacteria564Open in IMG/M
3300005445|Ga0070708_101720892All Organisms → cellular organisms → Bacteria583Open in IMG/M
3300005471|Ga0070698_100728148All Organisms → cellular organisms → Bacteria → Proteobacteria934Open in IMG/M
3300005536|Ga0070697_100134184All Organisms → cellular organisms → Bacteria2078Open in IMG/M
3300005539|Ga0068853_102158130All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300005546|Ga0070696_100173464All Organisms → cellular organisms → Bacteria1595Open in IMG/M
3300005563|Ga0068855_100889949All Organisms → cellular organisms → Bacteria941Open in IMG/M
3300005577|Ga0068857_102486125All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300005578|Ga0068854_101267506All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300005615|Ga0070702_101521826All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300005841|Ga0068863_102291109All Organisms → cellular organisms → Bacteria550Open in IMG/M
3300005879|Ga0075295_1028721All Organisms → cellular organisms → Bacteria686Open in IMG/M
3300005883|Ga0075299_1036721All Organisms → cellular organisms → Bacteria546Open in IMG/M
3300005889|Ga0075290_1063307All Organisms → cellular organisms → Bacteria524Open in IMG/M
3300005897|Ga0075281_1061461All Organisms → cellular organisms → Bacteria604Open in IMG/M
3300006605|Ga0074057_11675769All Organisms → cellular organisms → Bacteria751Open in IMG/M
3300006853|Ga0075420_100127971All Organisms → cellular organisms → Bacteria → Proteobacteria2235Open in IMG/M
3300006854|Ga0075425_100028589All Organisms → cellular organisms → Bacteria6171Open in IMG/M
3300006871|Ga0075434_100846648All Organisms → cellular organisms → Bacteria930Open in IMG/M
3300006894|Ga0079215_11005670All Organisms → cellular organisms → Bacteria613Open in IMG/M
3300006894|Ga0079215_11669208All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300006904|Ga0075424_100795041All Organisms → cellular organisms → Bacteria1010Open in IMG/M
3300007004|Ga0079218_13761046All Organisms → cellular organisms → Bacteria516Open in IMG/M
3300009090|Ga0099827_10532333All Organisms → cellular organisms → Bacteria1011Open in IMG/M
3300009093|Ga0105240_11373432All Organisms → cellular organisms → Bacteria743Open in IMG/M
3300009148|Ga0105243_11109781All Organisms → cellular organisms → Bacteria800Open in IMG/M
3300009162|Ga0075423_10169793All Organisms → cellular organisms → Bacteria2290Open in IMG/M
3300009162|Ga0075423_11959120All Organisms → cellular organisms → Bacteria634Open in IMG/M
3300009176|Ga0105242_10553010All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1104Open in IMG/M
3300009545|Ga0105237_10224251All Organisms → cellular organisms → Bacteria1880Open in IMG/M
3300010373|Ga0134128_13142166All Organisms → cellular organisms → Bacteria507Open in IMG/M
3300010396|Ga0134126_11327009All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium796Open in IMG/M
3300010397|Ga0134124_10752295All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium969Open in IMG/M
3300011333|Ga0127502_10113004All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300011333|Ga0127502_10467784All Organisms → cellular organisms → Bacteria509Open in IMG/M
3300012200|Ga0137382_10497393All Organisms → cellular organisms → Bacteria866Open in IMG/M
3300012204|Ga0137374_10268899All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1417Open in IMG/M
3300012354|Ga0137366_10701551All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300012930|Ga0137407_10106335All Organisms → cellular organisms → Bacteria2415Open in IMG/M
3300012930|Ga0137407_10753409All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300012930|Ga0137407_11124536All Organisms → cellular organisms → Bacteria745Open in IMG/M
3300012943|Ga0164241_11289307All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300012955|Ga0164298_10045065All Organisms → cellular organisms → Bacteria2083Open in IMG/M
3300012955|Ga0164298_10615206All Organisms → cellular organisms → Bacteria → Proteobacteria748Open in IMG/M
3300012957|Ga0164303_11229552All Organisms → cellular organisms → Bacteria550Open in IMG/M
3300012958|Ga0164299_10536120All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium786Open in IMG/M
3300012958|Ga0164299_11293651All Organisms → cellular organisms → Bacteria557Open in IMG/M
3300012960|Ga0164301_11102747All Organisms → cellular organisms → Bacteria631Open in IMG/M
3300012961|Ga0164302_10235171All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1153Open in IMG/M
3300012961|Ga0164302_11307834All Organisms → cellular organisms → Bacteria586Open in IMG/M
3300012985|Ga0164308_11352636All Organisms → cellular organisms → Bacteria649Open in IMG/M
3300012986|Ga0164304_10475333All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300012987|Ga0164307_10358528All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1061Open in IMG/M
3300012988|Ga0164306_10899245All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium721Open in IMG/M
3300012988|Ga0164306_11254471All Organisms → cellular organisms → Bacteria624Open in IMG/M
3300012989|Ga0164305_10745165All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium806Open in IMG/M
3300013770|Ga0120123_1130051All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → unclassified Actinomycetales → Actinomycetales bacterium589Open in IMG/M
3300013772|Ga0120158_10312260All Organisms → cellular organisms → Bacteria755Open in IMG/M
3300014302|Ga0075310_1135405All Organisms → cellular organisms → Bacteria549Open in IMG/M
3300015201|Ga0173478_10255633All Organisms → cellular organisms → Bacteria769Open in IMG/M
3300015245|Ga0137409_10810875All Organisms → cellular organisms → Bacteria771Open in IMG/M
3300015371|Ga0132258_10348975All Organisms → cellular organisms → Bacteria3658Open in IMG/M
3300015372|Ga0132256_102113975All Organisms → cellular organisms → Bacteria668Open in IMG/M
3300015373|Ga0132257_103087215All Organisms → cellular organisms → Bacteria606Open in IMG/M
3300018059|Ga0184615_10635207All Organisms → cellular organisms → Bacteria551Open in IMG/M
3300018061|Ga0184619_10370255All Organisms → cellular organisms → Bacteria653Open in IMG/M
3300018066|Ga0184617_1056334All Organisms → cellular organisms → Bacteria1013Open in IMG/M
3300018076|Ga0184609_10479547All Organisms → cellular organisms → Bacteria569Open in IMG/M
3300018077|Ga0184633_10450021All Organisms → cellular organisms → Bacteria636Open in IMG/M
3300018082|Ga0184639_10558756All Organisms → cellular organisms → Bacteria567Open in IMG/M
3300018466|Ga0190268_10383096All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300018469|Ga0190270_11555236All Organisms → cellular organisms → Bacteria712Open in IMG/M
3300019377|Ga0190264_11848284All Organisms → cellular organisms → Bacteria546Open in IMG/M
3300022694|Ga0222623_10373659All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300025792|Ga0210143_1001506All Organisms → cellular organisms → Bacteria5311Open in IMG/M
3300025796|Ga0210113_1097936All Organisms → cellular organisms → Bacteria580Open in IMG/M
3300025906|Ga0207699_10763901All Organisms → cellular organisms → Bacteria709Open in IMG/M
3300025908|Ga0207643_10588653All Organisms → cellular organisms → Bacteria715Open in IMG/M
3300025912|Ga0207707_10483674All Organisms → cellular organisms → Bacteria → Proteobacteria1057Open in IMG/M
3300025914|Ga0207671_10612096All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium868Open in IMG/M
3300025921|Ga0207652_10156658All Organisms → cellular organisms → Bacteria2041Open in IMG/M
3300025922|Ga0207646_10176361All Organisms → cellular organisms → Bacteria1930Open in IMG/M
3300025934|Ga0207686_10358185All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1100Open in IMG/M
3300025945|Ga0207679_10811498All Organisms → cellular organisms → Bacteria853Open in IMG/M
3300025959|Ga0210116_1020410All Organisms → cellular organisms → Bacteria1230Open in IMG/M
3300025961|Ga0207712_11308925All Organisms → cellular organisms → Bacteria648Open in IMG/M
3300025972|Ga0207668_10839243All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium815Open in IMG/M
3300025981|Ga0207640_11420778All Organisms → cellular organisms → Bacteria622Open in IMG/M
3300026003|Ga0208284_1022808All Organisms → cellular organisms → Bacteria525Open in IMG/M
3300026041|Ga0207639_11856666All Organisms → cellular organisms → Bacteria563Open in IMG/M
3300026046|Ga0208780_1001216All Organisms → cellular organisms → Bacteria2225Open in IMG/M
3300026142|Ga0207698_11286154All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium746Open in IMG/M
3300026497|Ga0257164_1099646All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300027717|Ga0209998_10037990All Organisms → cellular organisms → Bacteria1085Open in IMG/M
3300027722|Ga0209819_10339776All Organisms → cellular organisms → Bacteria514Open in IMG/M
3300027775|Ga0209177_10271247All Organisms → cellular organisms → Bacteria635Open in IMG/M
3300027778|Ga0209464_10133953All Organisms → cellular organisms → Bacteria863Open in IMG/M
3300027882|Ga0209590_10644957All Organisms → cellular organisms → Bacteria680Open in IMG/M
3300028608|Ga0247819_10547960All Organisms → cellular organisms → Bacteria690Open in IMG/M
3300028771|Ga0307320_10218280All Organisms → cellular organisms → Bacteria748Open in IMG/M
3300028793|Ga0307299_10366129All Organisms → cellular organisms → Bacteria540Open in IMG/M
3300028799|Ga0307284_10388329All Organisms → cellular organisms → Bacteria567Open in IMG/M
3300028811|Ga0307292_10056247All Organisms → cellular organisms → Bacteria → Proteobacteria1481Open in IMG/M
3300028814|Ga0307302_10303375All Organisms → cellular organisms → Bacteria785Open in IMG/M
3300028824|Ga0307310_10747186All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300028828|Ga0307312_10251028All Organisms → cellular organisms → Bacteria → Proteobacteria1145Open in IMG/M
3300028828|Ga0307312_10886726All Organisms → cellular organisms → Bacteria591Open in IMG/M
3300028889|Ga0247827_10822091All Organisms → cellular organisms → Bacteria617Open in IMG/M
3300030006|Ga0299907_10510019All Organisms → cellular organisms → Bacteria949Open in IMG/M
(restricted) 3300031197|Ga0255310_10250210All Organisms → cellular organisms → Bacteria503Open in IMG/M
3300031228|Ga0299914_10223205All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1670Open in IMG/M
3300031548|Ga0307408_101407302All Organisms → cellular organisms → Bacteria657Open in IMG/M
3300031716|Ga0310813_11025397All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium753Open in IMG/M
3300031854|Ga0310904_11106201All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300032002|Ga0307416_102144344All Organisms → cellular organisms → Bacteria660Open in IMG/M
3300032004|Ga0307414_10487980All Organisms → cellular organisms → Bacteria1088Open in IMG/M
3300032012|Ga0310902_10676688All Organisms → cellular organisms → Bacteria693Open in IMG/M
3300032179|Ga0310889_10717373All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300034659|Ga0314780_197965All Organisms → cellular organisms → Bacteria523Open in IMG/M
3300034664|Ga0314786_184907All Organisms → cellular organisms → Bacteria508Open in IMG/M
3300034670|Ga0314795_137136All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300034672|Ga0314797_159008All Organisms → cellular organisms → Bacteria505Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
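
Each scaffold identifier in the table above appears to combine an IMG/M taxon OID and a scaffold name separated by "|" (the prefixes match the Taxon OID values listed under Associated Samples). The following is a minimal sketch for splitting such identifiers, assuming that layout and an optional "(restricted)" prefix; the function name is hypothetical.

```python
def parse_scaffold_id(entry: str) -> dict:
    """Split 'taxon_oid|scaffold_name' and record a leading '(restricted)' flag."""
    entry = entry.strip()
    restricted = entry.startswith("(restricted)")
    if restricted:
        # Drop the flag and any whitespace that follows it.
        entry = entry[len("(restricted)"):].strip()
    taxon_oid, sep, scaffold = entry.partition("|")
    if not sep:
        raise ValueError(f"no '|' separator in {entry!r}")
    return {"taxon_oid": taxon_oid, "scaffold": scaffold, "restricted": restricted}


print(parse_scaffold_id("3300003321|soilH1_10187691"))
print(parse_scaffold_id("(restricted) 3300031197|Ga0255310_10250210"))
```

Keeping the taxon OID as a string avoids any assumption about its numeric range and makes it easy to join against the Taxon OID column of the samples table.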




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil21.58%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil6.47%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil6.47%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.76%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere5.04%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.04%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands4.32%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment4.32%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil3.60%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.88%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere2.88%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.16%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.16%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere2.16%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.16%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere2.16%
Wetland SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment1.44%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil1.44%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.44%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.44%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands1.44%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.44%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.44%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.72%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.72%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.72%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.72%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.72%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.72%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.72%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.72%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.72%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.72%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.72%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID   Sample Name   Habitat Type   IMG/M Link
2170459016Litter degradation ZMR2EngineeredOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000890Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300003321Sugarcane bulk soil Sample H1EnvironmentalOpen in IMG/M
3300003987Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleB_D2EnvironmentalOpen in IMG/M
3300003992Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleB_D1EnvironmentalOpen in IMG/M
3300004020Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300004808Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1FreshEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005329Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaGEnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005577Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2Host-AssociatedOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300005879Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301EnvironmentalOpen in IMG/M
3300005883Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_302EnvironmentalOpen in IMG/M
3300005889Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_201EnvironmentalOpen in IMG/M
3300005897Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_103EnvironmentalOpen in IMG/M
3300006605Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAB (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300011333Cornfield soil microbial communities from Stanford, California, USA - CI-CA-CRN metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012943Backyard soil microbial communities from Emeryville, California, USA - Original compost - Back yard soil (BY)EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012985Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013770Permafrost microbial communities from Nunavut, Canada - A15_5cm_18MEnvironmentalOpen in IMG/M
3300013772Permafrost microbial communities from Nunavut, Canada - A10_80_0.25MEnvironmentalOpen in IMG/M
3300014302Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2EnvironmentalOpen in IMG/M
3300015201Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S014-104B-1 (version 2)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2EnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025792Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025796Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025908Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025914Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025945Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025959Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026003Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_201 (SPAdes)EnvironmentalOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026046Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026142Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300027717Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027722Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027778Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028608Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Xylose_Day6EnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028799Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031228Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57EnvironmentalOpen in IMG/M
3300031548Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3Host-AssociatedOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032004Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-3Host-AssociatedOpen in IMG/M
3300032012Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3EnvironmentalOpen in IMG/M
3300032179Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2EnvironmentalOpen in IMG/M
3300034659Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034664Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034670Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R4 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034672Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24R2 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature of the restricted sequences below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
2ZMR_016033102170459016Switchgrass, Maize And Miscanthus LitterSAPRQPDGRAVPDCEPEPWVWANNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
ICChiseqgaiiDRAFT_062968413300000033SoilDLNLGGGYQIPLQASRTTELCXIASLSLGSGPNDVLGSGVDLSSRTFAFGASVGAVLGRNPQLRILPNAAFQFANTRATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSISFPVGLDGADASFGLMGAMNFGH*
JGI11643J12802_1042604813300000890SoilYQIPLRANRMAELCPIASLSLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTTSSSESQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR*
JGI10214J12806_1204569313300000891SoilLGTTSYDGLDGSSVDFGVNGGYQIALKSSVPVQVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAINFGR*
soilH1_1018769193300003321Sugarcane Root And Bulk SoilNLGGGYQIPLQTTRTAELCPVASLSLGSGPNDVLGTTDISSRTFAFGAAAGAVLGRNPQTRIVPNASFQFANTHTSLDDGTDSVSGSESYAVVTLGTGFVFNSRYSLNPSVSFPIGLEGSSASFGLAGAIQFGH*
Ga0055471_1031369013300003987Natural And Restored WetlandsGYGAPKGLYGEAGLGSTSYDAFDGSSFDLGLGGGYQIPLQTSRMAEVCPVANLSIGSGPNDVMGTGVDMSNRTFSFGASVGALVGRNPKMQILPNASFQFANTRVEMNDGTNSAAGSESYGLLTLGTGFVFNSRFSVNPSVSIPMGLDGSSTSFGLNGAINFGR*
Ga0055470_1008394713300003992Natural And Restored WetlandsVSGGYQIALKSSRPVEVCPVASLSIGSGPNDVLPGVDMSSRTFSMGAALGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGYEGSDASFTIAGAIHFGR*
Ga0055440_1006822913300004020Natural And Restored WetlandsTSYDAFNGSSFDFSVGGGYQLPLHTTRSAELCPVASLSIGSGPNDVLGTGVDMSNRTFAFGASAGALVGHSTQLQILPNAAFQFANTRASADNGTTSSSASESYALLTLGTGFVFNSRFSVNPSISFPIGLDGGNTSFGLVGAMNFGR*
Ga0062591_10039734113300004643SoilGIGTTSYDAFDGSSFDLNLGGGYQIPLQASRTTELCPIASLSLGSGPKDVLGSGVDLSNRTFAFGASVGAVLGRNPQVRILPNAAFQFANSRATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINIPVGLDGASTSFGLMGAMNFGR*
Ga0062381_1016521013300004808Wetland SedimentKGLYGKAGIGSTSYDGFDGSSFDLNFGGGYQIPLQASRVAQVCPVANLSIGSGPNDILGAGVDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRISMDDGTTSASGSESYALLTLGTGFVFNSRFSINPSISVPMGLDGGNTSFGLMGAMNFGR*
Ga0062594_10280514313300005093SoilAGIGTTSYDGLNGSSFDLGLSGGYQVPLHSSRTAELCPVASLSLGSGPKNVLGSGVDMSSRTFALGASVGALLGRNPQLRFVPNASFQFANTRATADDGTTSASASQSYGLLSLGTGFVFNSRFSLNPSISIPVGLDGGDASFGIMGAMNFGR*
Ga0070683_10145518413300005329Corn RhizosphereYQVALKTAKPAELCPVASLSIGSGPNDVGGTGIDMSSRTFSFGGSVGAVLSQTSQMQIVPNAGFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0068869_10169019513300005334Miscanthus RhizosphereTFGYGTPKSLYGKAGIGTTSYDAFDGSSFDLNLGGGYQIPLQASRTTELCPIASLSLGSGPKDVLGSGVDLSNRTFAFGASVGAVLGRNPQVRILPNAAFQFANSRATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINIPVGLDGASTSFGLMGAMNFGR*
Ga0070680_10122464013300005336Corn RhizosphereDMSSRTFSFGAAVGAVVSQSSQMQILPNASFQFANTRLSVDDGTTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0070674_10156191213300005356Miscanthus RhizosphereGTFGYGQPKSFYGKAALGTTSYDGLDGSSVDFGVNGGYQIALKSNVPVQVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAINFGR*
Ga0070709_1141048613300005434Corn, Switchgrass And Miscanthus RhizosphereGGYQVPLHAKQAAELCPIASLSLGSGPNDVLGSGVNLSSRTFAFGASTGVVLGTNPQLRFLPNASFQFANTRATADDGTNSASDSQSYGLLTLGTGFVFNSRFSLNPSISIPIGLDGGDTSFGLMGAMNFGR*
Ga0070708_10172089213300005445Corn, Switchgrass And Miscanthus RhizosphereSYDALNGSSLDLSVGGGYQIPLQASRVAQVCPIASLSINSGPKNVLGGGVDMSGRTFAFGAAIGGLVGHNSQMQIVPNASFQFANTRATVDNGTTSASGSESYGLLTLGTGFVINSRFSVNPSINVPVGLSGSSTSFGLSGAINFGR*
Ga0070698_10072814833300005471Corn, Switchgrass And Miscanthus RhizosphereCPVASLSIGSGPNDIVGSGVDMSSRTFALGAAVGALVGHSSQMRILPNASFQFANGRVSLDDGTTSAASSESYGLLTLGTGFVFNSRFSINPSINIPMGLDGSSTSFGLVGAMNFGR*
Ga0070697_10013418433300005536Corn, Switchgrass And Miscanthus RhizosphereNVQGTGVDMSSRTFAFGASIGALVGRSSRMQILPNASFQFANTRAKVDDGTTSATGSESYGLLTLGTGFVFNSRFSVNPSINVPVGLSGSSASFGLSGAINFGR*
Ga0068853_10215813013300005539Corn RhizosphereVGTGVDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRISMDDGTTSAAGSESYGLLTLGTGFVFNSRFSINPSVSIPMGLDGSSTSFGLVGAMNFGR*
Ga0070696_10017346433300005546Corn, Switchgrass And Miscanthus RhizosphereSSFDLGLSGGYQVPLHSSRTAELCPVASLSLGSGPKNVLGSGVDMSSRTFALGASVGALLGRNPQLRFVPNASFQFANTRATADDGTTSASASQSYGLLSLGTGFVFNSRFSLNPSISIPVGLDGGDASFGIMGAMNFGR*
Ga0068855_10088994923300005563Corn RhizosphereGTGIDMSSRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0068857_10248612513300005577Corn RhizosphereTAKPAELCPVASLSIGSGPNDVGGTGIDMSSRNFSFGAAMGAVLSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRYSLNPSINIPMGLDGSSTSFGLAGAINFGH*
Ga0068854_10126750623300005578Corn RhizosphereLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0070702_10152182623300005615Corn, Switchgrass And Miscanthus RhizosphereSIGSGPNDVGGTGIDMSSRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0068863_10229110913300005841Switchgrass RhizosphereFGYGTPKSLYGKAGIGTTSYDAFNGSSFDFNVGGGYQIPLQASRTTELCPIASLSLGSGPKDVLGSGVDLSSRTFAFGASMGAVLGRNPQLRILPNAAFQFANARATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINIPVGLDGASTSFGLVGAMNFGR*
Ga0075295_102872113300005879Rice Paddy SoilIGSTSYDGLDGSSFDFGVGAGYQIPLETTRTAELCPVANLSIGSGPNDIAGSGVDLSSRTFSFGASVGVLVNKSSKVQILPNAAFQFANTRNTVDDGTTSLSGSESYGLLTLGTGFVFHSRFSVNPSLSIPMGLDGSSTSFGLGGAINFGR*
Ga0075299_103672113300005883Rice Paddy SoilSRTFSFGASVGVLVNKSSKVQILPNAAFQFANTRNTVDDGTTSLSGSESYGLLTLGTGFVFHSRFSVNPSLSIPMGLDGSSTSFGLGGAINFGR*
Ga0075290_106330723300005889Rice Paddy SoilTRTAELCPVANLSIGSGPNDIAGSGVDLSSRTFSFGASVGVLVNKSSKVQILPNAAFQFANTRNTVDDGTTSLSGSESYGLLTLGTGFVFHSRFSVNPSLSIPMGLDGSSTSFGLGGAINFGR*
Ga0075281_106146113300005897Rice Paddy SoilTSYDGLDGSSFDFGVGAGYQIPLETTRTAELCPVANLSIGSGPNDIAGSGVDLSSRTFSFGASVGVLVNKSSKVQILPNAAFQFANTRNTVDDGTTSLSGSESYGLLTLGTGFVFHSRFSVNPSLSIPMGLDGSSTSFGLGGAINFGR*
Ga0074057_1167576913300006605SoilVLGTGTDTSSRTLAFGASVGALLGRNPQLQILPNASFQFANTRASVDDGTDSASDSQSYALLTLGTGFVFNSRFSLNPSINFPIGLDGGDTSFGLVGAMNFGRLRVDPPPSLARLRRQLLRAGGG*
Ga0075420_10012797113300006853Populus RhizosphereYDGMDGSSLDFGVSGGYQIALKSSRPVEVCPIASLSIGSGPNDVLGSGVDMSSRTFSAGAAIGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAINFGR*
Ga0075425_10002858913300006854Populus RhizosphereYGVPKSVYGKALVGRTSYDGLSGSSFDFGVGGGYQIPLHTSRMAELCPIASLSLGSGPHNVLNSGVDMSSRTFAFGGSVGVGVGGSSQVRILPNASFQFANTRATLDNGTISSSASESYGLLTLGSGFVFNSRFSLNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0075434_10084664823300006871Populus RhizosphereCPVASLSIGSGPNDVGGTGIDMSSRNFSFGAAMGAVLSQTSQMQIVPNAAFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFSSRYSLNPSINIPMGLDGASTSFGLAGAINFGH*
Ga0079215_1100567013300006894Agricultural SoilGLDGSSLDFGVSGGYQIALKSSRPVEVCPVASLSIGSGPNDVLGTGTDMSSRTFSMGAAIGAQLGNNPQLQIVPNASFQFANTRLSMDDGSDSVSGSESYGLLTLGTGFIFNSRYSLNPSISLPMGYDGSDASFTLAGAIHFGH*
Ga0079215_1166920813300006894Agricultural SoilPLKSSRTSELCPVANLSLNSGPNDIAGTGIDMSSRTFSFGAAFGTQVGNSPQMQILPNASFQFANTRLTMDDGTESASGSESYALLTLGTGFVFNSRYSLNPSVSFPIGLEGSDASFSLGGAIHFGR*
Ga0075426_1091417823300006903Populus RhizosphereSRTFAFGGSVGVGVGGSSQVRILPNASFQFANTRATLDNGTISSSASESYGLLTLGSGFVFNSRFSLNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0075424_10079504113300006904Populus RhizosphereKAGVGTTSYDGLNGSSFDLGLSGGYQVPLHSSRTAELCPVASLSLGSGPKNVLGSGVDMSSRTFALGASVGALLGRNPQLRFVPNASFQFANTRATADDGTTSASASQSYGLLSLGTGFVFNSRFSLNPSISIPVGLDGGDASFGIMGAMNFGR*
Ga0079218_1376104613300007004Agricultural SoilYQIALKSSRPVEVCPVASLSFGSGPNDIAGTGVDMSSRTFSMGAALGAQLGHNPQLQIVPNGSFQFANTRLSVDDGTESVSGSESYGLLTLGTGFIFNSRYSLNPSISFPMGYDGSDASFTLAGAIHFGH*
Ga0099827_1053233323300009090Vadose Zone SoilMSGRTFAFGASLGALVNKSSQVQILPNAAFQVANTRNTIDDGTTSTSGSESYGRLTLGTGFVFHSRSSVNPRISIPVGHNGSSTSFRLGGAINFGR*
Ga0105240_1137343213300009093Corn RhizosphereYDGLSGSSVDYGVGGGYQIPLRASRMAELCPIASLSLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0105243_1110978113300009148Miscanthus RhizosphereMSSRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVVAGSTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0075423_1016979343300009162Populus RhizosphereDAFNGSSFDLNAGGGYQIPLHTSRTAELCPVANLSLGSGPNDVLGSGVDLSTRTLAFGASVGAILGRNPQLRILPNASFQFANTRATADDGTTSSSDSESYGLLTLGTGFVFNSRFSLNPSISIPVGLTGGDTSFGLVGAMNFGR*
Ga0075423_1195912013300009162Populus RhizosphereMSSRNFSFGAAMGAVLSQTSQMQIVPNAAFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFSSRYSLNPSINIPMGLDGASTSFGLAGAINFGH*
Ga0105242_1055301013300009176Miscanthus RhizosphereSRMAELCPIASLSLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0105237_1022425143300009545Corn RhizosphereRMAELCPIASLSLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR*
Ga0134128_1314216613300010373Terrestrial SoilLCPIASLGLSSGPSDVLGSGVDLSGRTFAFGATVGALLGRTQQLRFVPNAAFQFANTRSTADDGTTSTSASESYGLLTLGTGFVFNSRFSLNPSISFPIGLDGGNTSFGLMGAMNFGR*
Ga0134126_1132700923300010396Terrestrial SoilQIPLQTSRAAELCPIASLGLSSGPSDVLGSGVDLSGRTFAFGATVGALLGRTQQLRFVPNAAFQFANTRSTADDGTTSTSASESYGLLTLGTGFVFNSRFSLNPSISFPIGLDCGNTSFGLMGAMNFGR*
Ga0134124_1075229513300010397Terrestrial SoilTGIDMSSRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0127502_1011300413300011333SoilGVPKSFYGKAALGSTSYDGLDGSSLDIGVSGGYQIALKSSRPVQVCPVASLSFGSGPNDVLGSGVDMSTRTFSMGAALGAQLGNNPQLQIVPNGSFQFANTRLSMDDGAESVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGWDGRDASFTIAGAIHFGR*
Ga0127502_1046778413300011333SoilGVPKSFYGKAALGSTSYDGLDGSSLDIGVSGGYQIALKSSRPVEVCPVASLSFGSGPNDVLGSGVDMSTRTFSMGAALGAQLGNNPQLQIVPNGSFQFANTRLSMDDGAESVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGWDGRDASFTIAGAIHFGR*
Ga0137382_1049739323300012200Vadose Zone SoilCPIANLSIGSGPKDVLSSGVDMSSRTFAFGAAIGGLVGHSTQMQILPNASFQFANTRAVVDDGTTSAAGSESYGLLTLGTGFVFNSRFSVNPSLSFPMGLNGSNTSFGLSGAINFGR*
Ga0137374_1026889933300012204Vadose Zone SoilSLDFGVGGGYQVPLHSSRLAELCPIASLSLGSGPNDVLGTGVDMSSCTFAFGASVGALVGRSSQVRILPNASFQFANTRAKLDDGTTSLSDSQSYGLLTLGTGFVFNSRFSLNPSISFPVGLDGGDASFGLIAAMNFGR*
Ga0137366_1070155113300012354Vadose Zone SoilANLSIGSGPNNVLGTGVDLSSRTFSFGAAVGGLVGHSTQMQIVPNASFQLANTHSTVDDGTTRTSGSESYGLLTLGTGFVFNARYSLNPSIGIPVGLDGSSASFGLSGAINFGR*
Ga0137407_1010633513300012930Vadose Zone SoilNLSIGSGPKNVLGSGVDMSSRTFSFGAAVGGLVGHSTQMQILPNASFQFANTRATADDGTTSASASETYGLLTLGTGFVFSSRFSVNPSVSFPMGLNGSSTTFGLSAAMNFGR*
Ga0137407_1075340923300012930Vadose Zone SoilVGGGYQIPLQTSRKAELCPVASLSIGSGPKNIVGSGVDMSSRTFALGAAIGTFVGHSSQVRILPNASFQFANTRVSLDDGTTSTAGSESYGLLTLGTGFVFNSRFSVNPSISIPMGLDGSSTSFGLVGAMNFGR*
Ga0137407_1112453623300012930Vadose Zone SoilSGPKNVLGSGVDMSSRTFALGASVGALLGRNPQLRFVPNASFQFANTRATADDGTTSASASQSYGLLSLGTGFVFNSRFSLNPSISVPVGLDGGDASFGIMGAMNFGR*
Ga0164241_1128930713300012943SoilLSIGSGPNDVLGSGVDMSSRTFSVGAATGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSASDSQSYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAINFGR*
Ga0164298_1004506533300012955SoilVGGGYQIPLHASRTAELCPIASLGLSSGPNDVLGSGVDLSSRTFALGASVGAVLFRTQQLRILPNAAFQFADTRATADDGTNSASASQSYALLTQGTGFVFNSRFSLNPSISFPIGLDGGSTSFGLMGAMNFGR*
Ga0164298_1061520613300012955SoilSSRTFAFGGSLGVLVGGSSQVRILPNASFQFANTRAKLDNGTSSSSDSQSYGLLTLGTGFVFNSRFSLNPSLSFPVGLDGGDASFGLIGAMNFGR*
Ga0164303_1122955213300012957SoilHASRTAELCPIASLGLSSGPNDVLGSGVDLSSRTFALGASVGAVLGRTQQLRILPNAAFQFADTRATADDGTNSASASQSYALLTLGTGFVFNSRFSLNPSISFPIGLDGGSTSFGLMGAMNFGR*
Ga0164299_1053612033300012958SoilSGPNDVLGSGVDLSSRTFALGASVGAVLGRTQQLRILPNAAFQFADTRATADDGTNSASASQSYALLTLGTGFVFNSRFSLNPSISFPIGLDGGSTSFGLIGAMNFGR*
Ga0164299_1129365113300012958SoilGYQVPLRSSRMAELCPIASLSLGSGPHDVLGTGVNMSSRTFAFGGSLGVLVGGSSQVRILPNASFQFANTRAKLDNGTTSSSASQSYGLLTLGTGFVFNSRFSLNPSLSFPVGLDGGDASFGLIGAMNFGR*
Ga0164301_1110274713300012960SoilMAELCPIASLSLGSGPHDVLGTGVNMSSRTFAFGGSLGVLVGGSSQVRILPNASFQFANTRAKLDNGTSSSSDSQSYGLLTLGTGFVFNSRFSLNPSLSFPVGLDGGDASFGLIGAMNFGR*
Ga0164302_1023517133300012961SoilGGGYQIPLHASRTAELCPIASLGLSSGPNDVLGSGVDLSSRTFALGASVGAVLGRTQQLRILPNAAFQFADTRATADDGTNSASASQSYALLTLGTGFVFNSRFSLNPSISFPIGLDGGSTSFGLMGAMNFGR*
Ga0164302_1118040023300012961SoilLNDVNHTQEIAEIKRLNKKEEGKEDEATLGGLAGGSSFDFNASGGYQVPLHAKQAAELCPIASLSLGSGPNDVLGSGVNLSSRTFAFGASTGVVLGTNPQLRFLPNASFQFANTRATADDGTNSASDSQSYGLLTLGTGFVFNSRFSLNPSLRIPIGLDGGETSFGLL
Ga0164302_1130783413300012961SoilAFNGSSFDFNVGGGYQIPLQASRTTQLCPIARLSLGSAPKDALGSGVDLSSRTFAFGASVGAVLGRNPALRILPNAAFQFANSRATADDGTTSASDSQSYGLLTLATGFVFNSRFSLNPSINIPVGLDGSSASFGLVGAMNFGR*
Ga0164308_1135263613300012985SoilLGGGYQIPLQASKTTELCPIASLSLGSGPKDVLGSGVDLSSRTFAFGASVGAVLGRNPALRILPNAAFQFANSRATADDGTTSASDSQSYGLLTLATGFVFNSRFSLNPSINIPVGLDGSSASFGLVGAMNFGR*
Ga0164304_1047533313300012986SoilVLGSGVDLSSRTFAFGASVGAVLGRNPALRILPNAAFQFANSRATADDGTTSASDSQSYGLLTLATGFVFNSRFSLNPSINIPVGLDGSSASFGLVGAMNFGR*
Ga0164307_1035852833300012987SoilSFDFNVSGGYQVPLHAKQAAELCPIASLSLGSGPNDVLGSGVNLSSRTFAFGASTGVVLGTNPQLRFLPNASFQFGNTRASADDGTNSASDSQSYGLLTLGTGFVFNSRFSLNPSISIPIGLDGGDTSFGLMGAMNFGR*
Ga0164306_1089924523300012988SoilGVNLSSRTFAFGASTGVVLGTNPQLRFLPNASFQFANTRATADDGTNSASDSQSYGLLTLGTGFVFNSRFSLNPSISIPIGLDGGDTSFGLMGAMNFGR*
Ga0164306_1125447113300012988SoilELCPIASLSLGSGPKDVLGSGVDLSSRTFAFGASMGAVLGRNPQVRILPNAAFQFANSRATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINIPVGLDGASTSFGLMGAMNFGR*
Ga0164305_1074516523300012989SoilGIGTTSYDAANGSSFDLNLGGGYQIPLQASRTTELCPIASLSLGSGPKDVLGSGVDLSSRTLAFGASVGAVLGRNPQLRILPNAAFQFANTRATVDDGTTSASDSQSYGLLTLATGFVFNSRFSLNPSINIPVGLDGSSTSFGLMGAMNFGR*
Ga0120123_113005123300013770PermafrostMSSRTFAFGAAIGGLVGHSSQMQIIPNASFQFANTRATVDDGTTSASGSESYGLLTLGTGFVINSRFSVNPSINVPMGLNGSSTSFGL
Ga0120158_1031226023300013772PermafrostYQIPLQSSRVAQVCPVASLSVGSGPKNVLGGGVDMSSRTFAFGAAIGGLVGHSSQMQIIPNASFQFANTRATVDDGTTSASGSESYGLLTLGTGFVINSRFSVNPSINVPMGLNGSSTSFGLSGAINFGR*
Ga0075310_113540513300014302Natural And Restored WetlandsCPVASLSIGSGPNDVGGTSIDMSSRTFSFGGSVGAVLSQTSQMQIVPNAGFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0173478_1025563323300015201SoilLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAINFGR*
Ga0137409_1081087513300015245Vadose Zone SoilNLSIGSGPKNVLSSGVDMSSRTFAFGAAIGGLVGHSTQMQILPNASFQFANTRAVVDDGTTSAAGSESYGLLTLGTGFVINSRFSVNPSINVPMGLNGRF*
Ga0132258_1034897513300015371Arabidopsis RhizosphereSGVDLSSRTFAFGASAGVGLGTNPQLRFLPNASFQFANTRASADDGTTSSSDSQSYGLLTLGTGFVFNSRFSLNPSISLPIGLDGGDTSFGLMGAMNFGH*
Ga0132256_10211397513300015372Arabidopsis RhizosphereRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH*
Ga0132257_10308721513300015373Arabidopsis RhizosphereKDVLGSGVDLSSRTFAFGASMGAVLGRNPQLRILPNAAFQFANARATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINVPVGLDGASTSFGLMGAMNFGR*
Ga0184615_1063520713300018059Groundwater SedimentSVGGGYQIPLQTSRTAELCPVASLSIGSGPNDIVGTGVDMSSRTFAFGAAVGTLVGHSSQVRILPNASFQFANTRVSLDDGTTSAAGSESYALLTLGTGFVFNSRFSVNPSISIPMGLDGSSTSFGLVGAMNFGR
Ga0184619_1037025513300018061Groundwater SedimentYGAPKGLYGKAGVGSTSYDALSGSSLDLNFGGGYQIPLQRSRMAQVCPIANLSIGSGPKNVLSSGVDMSSRTFAFGAAIGGLVGHSTQMQILPNASFQFANTRAVVDDGTTSAAGSESYGLLTLGTGFVFNSRFSVNPSLSFPMGLNGSNTSFGLNGAINFGR
Ga0184617_105633423300018066Groundwater SedimentGPNNVFGGTDFSSRTFSFGASVGGLVGHSTQMQILPNASFQFAHTHSTVDVGTIRASGSESYGLLTLGTGFVFNARYSLNPSIGIPVGLNGSSASVGLSGVINFGR
Ga0184609_1047954713300018076Groundwater SedimentTSYDGTDGSSFDLSVGGGYQIPLQTSRTAELCPIASLGIASGPNDIAGTGVDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRVSLDDGTTSGAGSESYGLLTLGTGFVFNSRFSVNPSISIPVGLSGASTSFGLVGAMNFGR
Ga0184633_1045002123300018077Groundwater SedimentGPNDVLGSGVDMSSRTFAFGAALGAPVGHNPRVQILPNASLQFANTRLEIEDATGSAAGSESYGLLTLGTGFVFNSRISVNPSLSIPMGLDGGDTAFGISAALNFGH
Ga0184639_1055875623300018082Groundwater SedimentDMSSRTFAFGAALGAPVGHNPRVQILPNASLQFANTRLEIEDATGSAAGSESYGLLTLGTGFVFNSRISVNPSLSIPMGLDGGDTAFGISAALNFGH
Ga0190268_1038309613300018466SoilFGYGVPKNFYGKAALGTTSYDGLDGSSLDFGLGGGYQIPLHSRRTAEVCPVASLSIGSGPNDILGTGTDMSSRTFAFGAAMGAHVGNNPQLQIVPNASFQFANTRQELDDGTDSVSGSESYGLLTLGTGFVFNSRYSLNPSISIPMGANGLDASFGLAGAIHFGR
Ga0190270_1155523623300018469SoilYGQPKSFYGKAALGTTSYDGLDGSSVDFGVNGGYQIALKSSVPVQVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNGSFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAIHFGR
Ga0190264_1184828413300019377SoilFDLSVGGGYQIPLQTSRTAELCPVASLSIGSGPNDIVGTGVDMSSRTLAFGAAVGALVGNSSQVRILPNASFQFANTRISTDDGTTSAAGSESYGLLTLGTGFVFNSRFSINPSISIPMGLDGSSTSFGLVGAMNFGR
Ga0222623_1037365913300022694Groundwater SedimentGLDGSSLDFGVGGGYQIPLHSSRKAELCPIASLSFGSGPNDMLGSGVDMSTRDFAFGAALGAQVGNSPQMQILPNASFQFANKRIAFDDVAGSESYGLLTLGTGFVFNSRFSVNPSISIPVGLDGYDTSFGLAGAINFGR
Ga0210143_100150643300025792Natural And Restored WetlandsLGVSGGYQIALKSSRPVEVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAVGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGYEGSDASFTIAGAIHFGR
Ga0210113_109793613300025796Natural And Restored WetlandsLSIGSGPNDVGGTGIDMSSRTFSFGAAVGAVVSQSSQMQILPNASFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH
Ga0207699_1076390123300025906Corn, Switchgrass And Miscanthus RhizosphereAKPRCSGSMRARRSDHHRPHDIENRLLAIVTDGHIDFNASGGYQVPLHAKQAAELCPIASLSLGSGPNDVLGSGVNLSSRTFAFGASTGVVLGTNPQLRFLPNASFQFANTRATADDGTNSASDSQSYGLLTLGTGFVFNSRFSLNPSISIPIGLDGGDTSFGLMGAMNFGR
Ga0207643_1058865313300025908Miscanthus RhizosphereVPVQVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH
Ga0207707_1048367423300025912Corn RhizospherePKSLYGKAGIGTTSYDALDGSSFDFNVAGGYQIPLQTSRTAELCPIASLGLSSGPSDVLGSGVDLSGRTFAFGATVGALLGRTQQLRFVPNAAFQFANTRSTADDGTTSTSASESYGLLTLGTGFVFNSRFSINPSVSIPMGLDGSSTSFGLVGAMNFGR
Ga0207671_1061209613300025914Corn RhizosphereAELCPIASLSLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0207652_1015665833300025921Corn RhizosphereGPNDVGGTGIDMSSRTFSFGAAMGAVVSQTSQMQIVPNASFQFANTRLSVDDGTTSASGSESYGLLTLSTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH
Ga0207646_1017636143300025922Corn, Switchgrass And Miscanthus RhizosphereDMSSRTFAFGAAIGTLVGHSSQVRILPNASFQFANTRVSLDDGTTSTAGSESYGLLTLGTGFVFNSRFSVNPSISIPVGLDGSSTSFGLVGAMNFGR
Ga0207686_1035818513300025934Miscanthus RhizosphereTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0207679_1081149823300025945Corn RhizosphereDGSSLDLGANAGYQVALKTAKPAELCPVASLSIGSGPNDVGGTGIDMSSRNFSFGAAMGAVLSQTSQMQIVPNAAFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRYSLNPSINIPMGLDGSSTSFGLAGAINFGH
Ga0210116_102041033300025959Natural And Restored WetlandsSRKAELCPVASLSIGSGPNDISGTGFDMSSRTFGFGASAGVLVGRSSQVQILPNASFQFANTRVSLDNGTTSTSGSESYGVLTLGTGFVFDSRFSVNPSLSFPMGLDGGSTSFGLMGAINFGR
Ga0207712_1130892513300025961Switchgrass RhizospherePIVTRRMTSAVRTLSVACRAPDDCWLATGVEPAWHFDGSSLDLSVGGGYQIPLQTSRTAELCPVASLSIGSGPNNVLGSGVDMSSRTFAFGASVGALVGRSTRMQILPNASFQFANTRVSLDDGTTSTAGSESYGLLTLGTGFVFNSRFSINPSINIPMGLDGSSTSFGLVGAMNFGR
Ga0207668_1083924323300025972Switchgrass RhizosphereGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0207640_1142077813300025981Corn RhizosphereLGSGPNNVLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0208284_102280823300026003Rice Paddy SoilTRTAELCPVANLSIGSGPNDIAGSGVDLSSRTFSFGASVGVLVNKSSKVQILPNAAFQFANTRNTVDDGTTSLSGSESYGLLTLGTGFVFHSRFSVNPSLSIPMGLDGSSTSFGLGGAINFGR
Ga0207639_1185666623300026041Corn RhizosphereIGSGPNDIVGTGVDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRISMDDGTTSAAGSESYGLLTLGTGFVFNSRFSINPSVSIPMGLDGSSTSFGLVGAMNFGR
Ga0208780_10012161	3300026046	Natural And Restored Wetlands	GGTFGYGQPKSFYGKAALGTTSYDGFDGSSLDLGVSGGYQIALKSSRPVEVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAVGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGYEGSDASFTIAGAIHFGR
Ga0207698_112861541	3300026142	Corn Rhizosphere	VLGTGVDMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTISSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0257164_10996461	3300026497	Soil	QTSRQAELCPVASLSIGSGPKDIVGSGVDMSSRTFALGAAIGTFVGHSSQVRILPNASFQFANTRVSLDDGTTSTAGSESYGLLTLGTGFVFNSRFSVNPSISIPMGLDGSNTSFGLVGAMNFGR
Ga0209998_100379902	3300027717	Arabidopsis Thaliana Rhizosphere	AGYQVALKTAKPAEFCPVASLSIGSGPNDVGGTGIDMSSRTFSFGGSMGAVLSQTSQMQIVPNAGFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH
Ga0209819_103397761	3300027722	Freshwater Sediment	SYDGMDGSSLDFGVSGGYQIALKSSRPVEVCPVASLSIGSGPNDVLPGVDMSSRTFSMGAALGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFIFNSRYSLNPSVSLPMGYDGSDASFTIAGAIHFGH
Ga0209177_102712471	3300027775	Agricultural Soil	SFDLNLGGGYQIPLHASRTAELCPIASLGLSSGPNDVLGSGTDLSSRTFAFGASVGALLGRNPQLRFVPNAAFQFANTRASVNDGTNSASDSQSYALLTLGTGFVFNSRFSLNPSINFPIGLDGADASFGLMGAMNFGH
Ga0209464_101339531	3300027778	Wetland Sediment	KGLYGKAGIGSTSYDGFDGSSFDLNFGGGYQIPLQASRVAQVCPVANLSIGSGPNDILGAGVDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRISMDDGTTSASGSESYALLTLGTGFVFNSRFSINPSISVPMGLDGGNTSFGLMGAMNFGR
Ga0209590_106449571	3300027882	Vadose Zone Soil	MSGRTFAFGASLGALVNKSSQVQILPNAAFQVANTRNTIDDGTTSTSGSESYGRLTLGTGFVFHSRSSVNPRISIPVGPNGSSTSFRLGGAINFGR
Ga0247819_105479602	3300028608	Soil	GTSSFGGSFGYGQPKSFYGKAALGTTSYDGLDGSSVDFGVNGGYQIALKSSVPVQVCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAIHFGR
Ga0307320_102182801	3300028771	Soil	GTSSFGGTFGYGQPKSFYGKAALGTTSYDGLDGSSVDFGVNGGYQIALKSSVPVQVCPVASLSIGSGPNDVLGTGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFVFNSRYSLNPSISLPMGLEGSDASFSIAGAIHFGR
Ga0307299_103661291	3300028793	Soil	ELCPVASLSIGSGPNDIVGSGVDMSSRTFAFGAAIGALVGHSTQVRILPNASFQFANTRLSLDDGTTSAASSESYGLLTLGTGFVFNSRFSINPSISIPMGLDGSNTSFGLVGAMNFGR
Ga0307284_103883292	3300028799	Soil	GSSLDLSVGGGYQVPLQTSRTAELCPVASLSIGSGPNDIVGSGVDMSSRTFAFGAAIGALVGHSTQVRILPNASFQFANTRLSLDDGTTSAASSESYGLLTLGTGFVFNSRFSINPSISIPMGLDGSNTSFGLVGAMNFGR
Ga0307292_100562471	3300028811	Soil	IGSTSYDALSGSSLDLNFGGGYQIPLQRSRMAQVCPIANLSIGSGPKNVLSSGVDMSSRTFAFGAAIGGLVGHSTQMQILPNASFQFANTRAVVDDGTTSAAGSESYGLLTLGTGFVFNSRFSVNPSLSFPMGLNGSNTSFGLNGAINFGR
Ga0307302_103033752	3300028814	Soil	GVDMSSRTFAFGAAVGALVGHSSQVRIVPNASFQFANTRVSLDDGTNSTAGSESYGLLTLGTGFVFNSRFSINPSINIPMGLDGSSTSFGLVGAMNFGR
Ga0307310_107471861	3300028824	Soil	LSIGSGPNNVFGGTDFSSRTFSFGASVGGLVGHSTQMQILPNASFQFANTHSTVDDGTIRASGSESYGLLTLGTGFVFNARYSLNPSIGIPVGLNGSSASVGLSGAINFGR
Ga0307312_102510282	3300028828	Soil	NDIVGSGVDMSSRTFAFGAAIGALVGHSTQVRILPNASFQFANTRLSLDDGTTSAASSESYGLLTLGTGFVFNSRFSINPSISIPMGLDGSNTSFGLVGAMNFGR
Ga0307312_108867261	3300028828	Soil	LSVGGGYQVPLHTSRTAELCPVASLSIGSGPNDIVGSGVDMSSRTFALGAAVGALVGHSTQVRILPNASFQFANTRVSLDDGTDSAAASESYGLLTLGTGFVFNSRFSLNPSVSIPMGLDGSSASFGLVGAMNFGR
Ga0247827_108220911	3300028889	Soil	VALKTAKPAQVCPVASLSIGSGPNDVGGTGIDMSSRNFSFGAAMGAVLSQTSQMQIVPNAAFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRYSLNPSINIPMGLDGSSTSFGLAGAINFGH
Ga0299907_105100192	3300030006	Soil	SGPDDVLGSGIDMSSRTFALGAALGAQLGHNPQLQIVPNASFQLANTRLSVDDGTDSASDSESYGLLTLGTGFVFNSRYSLNPSISFPMGLEGSDASFSLAGAIHFGR
(restricted) Ga0255310_102502101	3300031197	Sandy Soil	VDMSSRTFAFGAAVGALVGHSTQVRILPNASFQFANTRISMDDGTTSAAGSESYALLTLGTGFVFNSRFSVNPSISFPIGLDGGNTSFGLVGAMNFGR
Ga0299914_102232053	3300031228	Soil	SLSIGSGPNDVIGSGVDISSRTFAFGAAVGALVGNSSQVQILPNASFQFANTRLSLDDGSTSASGSESYGLLTLGTGFVFNSRFSLNPSISIPMGLEGSDASFGLGGAINFGR
Ga0307408_1014073021	3300031548	Rhizosphere	LCPVASLSIASGPNDIFGAGVDMSSRTLAFGAAVGALVGHSSQMQILPNASFHLANTRVSVDDGTDSAADSESYGLLTLGTGFVFSSRFSLNPSISIPVGLDGSSASFGLVGAMNFGR
Ga0310813_110253971	3300031716	Soil	SRTFAFGASMGAVLGRNPQLRILPNAAFQFANARATVDDGTTSASDSQSYGLLTLGTGFVFNSRFSLNPSINIPVGLDGASTSFGLVGAMNFGR
Ga0310904_111062011	3300031854	Soil	GQPKSFYGKAALGTTSYDGMDGSSLDFGVSGGYQIALKSSRPVEVCPIASLSIGSGPNDVLGSGVDMSSRTFSAGAAIGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAIHFGR
Ga0307416_1021443442	3300032002	Rhizosphere	NDVGGAGIDMSSRTFSFGGSVGAVLSQTSQMQIVPNAGFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSTSFGLAGAMNFGH
Ga0307414_104879802	3300032004	Rhizosphere	GGYQIPLQASRTTELCPVASLSLNSGPNDIVGSGVDMSSRTFSFGAALGAQVGHNPQMRILPNASFQFANTRLSMDDGTNDVSGSESYGLLTLGTGFVFNSRYSLNPSINIPMGLDGSDVSFGLAGAINFGR
Ga0310902_106766881	3300032012	Soil	KAALGTTSYDGLDGSSLDLGANAGYQVALKTAKPAELCPVASLSIGSGPNDVGGTGIDMSSRNFSFGAAMGAVLSQTSQMQIVPNAAFQFANTRLSVDDGTTSASGSESYGLLTLGTGFVFNSRYSLNPSINIPMGLDGSSTSFGLAGAINFGH
Ga0310889_107173731	3300032179	Soil	DMSSRTFAFGASAGVLVNRSSQVGILPNASFQFANTRAKVDNGTTSSSASQSYGLLTLGTGFVFHSRFSVNPSLSFPVGLDGGDVSFGLIGAMNFGR
Ga0314780_197965_1_501	3300034659	Soil	TFGYGQPKSFYGKAALGTTSYDGMDGSSLDFGVSGGYQIALKSSRPVEVCPIASLSIGSGPNDVLGSGVDMSSRTFSAGAAIGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFVFNSRYSLNPSVSLPMGYEGSDASFTIAGAIHFGH
Ga0314786_184907_1_486	3300034664	Soil	QPKSFYGKAALGTTSYDGMDGSSLDFGVSGGYQIALKSSRPVEVCPIASLSIGSGPNDVLGSGVDMSSRTFSAGAAIGAQLGNNPQLQIVPNASFQFANTRLSLDDGTDSVSGSESYGLLTLGTGFVFNSRYSLNPSVSLPMGYEGSDASFTIAGAIHFGH
Ga0314795_137136_16_519	3300034670	Soil	GTFGYGAPKGLYGKAGVGTTSYDALDGSSFDLNVGGGYQIPLQTSRMAELCPVASLSIGSGPNNVLGSGVDMSSRTFAFGASIGALVGHSTRMQILPNASFQFANTRAKVDDGTTSASGSESYGLLTLGTGFVFNSRFSLNPSINIPVGLDGSSASFGLAGAMNFGH
Ga0314797_159008_30_503	3300034672	Soil	LYGKAGVGTTSYDALDGSSFDLNVGGGYQIPLQTSRMAELCPVASLSIGSGPNDVLGSGVDMSSRTFSMGAAMGAQLGNNPQLQIVPNASFQFANTRLSMDDGTDSASGSESYGLLTLGTGFIFNSRYSLNPSISLPMGLEGSDASFSIAGAIHFGR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice