NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F056976



Overview

Basic Information
Family ID F056976
Family Type Metagenome / Metatranscriptome
Number of Sequences 137
Average Sequence Length 169 residues
Representative Sequence MAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Number of Associated Samples 125
Number of Associated Scaffolds 137

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 80.73 %
% of genes near scaffold ends (potentially truncated) 29.93 %
% of genes from short scaffolds (< 2000 bps) 64.96 %
Associated GOLD sequencing projects 114
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.44


Most Common Taxonomy
Group Unclassified (75.912 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand
(20.438 % of family members)
Environment Ontology (ENVO) Unclassified
(37.956 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(44.526 % of family members)
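
The GOLD and EMPO classifications above are written as arrow-delimited paths. As a minimal sketch, assuming the `→` separator exactly as printed on this page (the function name `split_lineage` is hypothetical and not part of NMPFamsDB), such a path can be split into its levels like this:

```python
def split_lineage(path: str) -> list[str]:
    """Split an arrow-delimited classification path into its levels."""
    return [level.strip() for level in path.split("→")]


# GOLD ecosystem path copied from the "Most Common Ecosystem" block.
gold = "Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand"
levels = split_lineage(gold)

# The first level is the broadest category, the last the most specific one.
print(levels[0], "/", levels[-1])
```

The same helper applies to the EMPO path and to the NCBI taxonomy strings in the scaffold table, since they use the same separator.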




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 80.10%, β-sheet: 0.00%, Coil/Unstructured: 19.90%

Predicted 3D Structure

pTM-score: 0.44

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
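
The note above applies a simple cutoff on the pTM score. A minimal sketch of that check follows; the 0.7 threshold is the one quoted on this page, while the names `LOW_CONFIDENCE_PTM` and `is_low_confidence` are hypothetical and only illustrate the rule:

```python
# Threshold quoted in the "Low Quality Model" note (pTM < 0.7).
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM score falls below the cutoff."""
    return ptm < threshold


# pTM score reported for family F056976.
print(is_low_confidence(0.44))
```

The comparison is strict, so a model scoring exactly 0.7 would not be flagged under this reading of the note.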



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 137 Family Scaffolds
PF00106 | adh_short | 24.82
PF02803 | Thiolase_C | 17.52
PF13561 | adh_short_C2 | 12.41
PF09712 | PHA_synth_III_E | 2.19
PF07167 | PhaC_N | 1.46
PF00108 | Thiolase_N | 1.46
PF01674 | Lipase_2 | 1.46
PF01592 | NifU_N | 0.73
PF03129 | HGTP_anticodon | 0.73
PF00160 | Pro_isomerase | 0.73
PF00174 | Oxidored_molyb | 0.73
PF02678 | Pirin | 0.73
PF12225 | DUF5981 | 0.73

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 137 Family Scaffolds
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 18.98
COG3243 | Poly-beta-hydroxybutyrate synthase | Lipid transport and metabolism [I] | 1.46
COG0124 | Histidyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.73
COG0423 | Glycyl-tRNA synthetase, class II | Translation, ribosomal structure and biogenesis [J] | 0.73
COG0441 | Threonyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.73
COG0442 | Prolyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.73
COG0652 | Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family | Posttranslational modification, protein turnover, chaperones [O] | 0.73
COG0822 | Fe-S cluster assembly scaffold protein IscU, NifU family | Posttranslational modification, protein turnover, chaperones [O] | 0.73
COG1741 | Redox-sensitive bicupin YhaK, pirin superfamily | General function prediction only [R] | 0.73
COG2041 | Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases | Energy production and conversion [C] | 0.73
COG3915 | Uncharacterized conserved protein | Function unknown [S] | 0.73



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 75.91 %
All Organisms | root | All Organisms | 24.09 %


Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001431|F14TB_101936104Not Available684Open in IMG/M
3300002122|C687J26623_10189728Not Available569Open in IMG/M
3300005167|Ga0066672_10078230Not Available1972Open in IMG/M
3300005172|Ga0066683_10404056Not Available844Open in IMG/M
3300005177|Ga0066690_10415485Not Available913Open in IMG/M
3300005180|Ga0066685_10364192Not Available1004Open in IMG/M
3300005355|Ga0070671_101850915Not Available536Open in IMG/M
3300005446|Ga0066686_10589217Not Available756Open in IMG/M
3300005467|Ga0070706_100007204All Organisms → cellular organisms → Bacteria10451Open in IMG/M
3300005468|Ga0070707_100137288Not Available2379Open in IMG/M
3300005526|Ga0073909_10526531Not Available575Open in IMG/M
3300005546|Ga0070696_100193237All Organisms → cellular organisms → Bacteria1516Open in IMG/M
3300005555|Ga0066692_10294594Not Available1028Open in IMG/M
3300005561|Ga0066699_10179162Not Available1465Open in IMG/M
3300005576|Ga0066708_10495389Not Available787Open in IMG/M
3300005586|Ga0066691_10192325Not Available1186Open in IMG/M
3300005598|Ga0066706_10269463All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1331Open in IMG/M
3300005764|Ga0066903_103297834Not Available872Open in IMG/M
3300006032|Ga0066696_10213799All Organisms → cellular organisms → Bacteria1235Open in IMG/M
3300006049|Ga0075417_10322881Not Available752Open in IMG/M
3300006796|Ga0066665_10360580Not Available1189Open in IMG/M
3300006800|Ga0066660_10144376All Organisms → cellular organisms → Bacteria1755Open in IMG/M
3300006846|Ga0075430_101276461Not Available604Open in IMG/M
3300006854|Ga0075425_100732047Not Available1135Open in IMG/M
3300006880|Ga0075429_100592701Not Available971Open in IMG/M
3300009012|Ga0066710_103139480Not Available637Open in IMG/M
3300009081|Ga0105098_10268492Not Available810Open in IMG/M
3300009137|Ga0066709_103161754Not Available601Open in IMG/M
3300009162|Ga0075423_12115461Not Available610Open in IMG/M
3300009553|Ga0105249_11957989Not Available659Open in IMG/M
3300009809|Ga0105089_1005544All Organisms → cellular organisms → Bacteria1446Open in IMG/M
3300009811|Ga0105084_1027681Not Available958Open in IMG/M
3300009815|Ga0105070_1130472Not Available510Open in IMG/M
3300009818|Ga0105072_1006797All Organisms → cellular organisms → Bacteria1959Open in IMG/M
3300009821|Ga0105064_1043616Not Available856Open in IMG/M
3300009822|Ga0105066_1087390Not Available680Open in IMG/M
3300010361|Ga0126378_10355171All Organisms → cellular organisms → Archaea1575Open in IMG/M
3300010398|Ga0126383_11504302Not Available763Open in IMG/M
3300010400|Ga0134122_10199102Not Available1660Open in IMG/M
3300012206|Ga0137380_10195343Not Available1837Open in IMG/M
3300012207|Ga0137381_10480931All Organisms → cellular organisms → Bacteria1084Open in IMG/M
3300012349|Ga0137387_10547391Not Available840Open in IMG/M
3300012957|Ga0164303_10000358All Organisms → cellular organisms → Bacteria10260Open in IMG/M
3300012984|Ga0164309_11176569Not Available642Open in IMG/M
3300012987|Ga0164307_10017854All Organisms → cellular organisms → Bacteria3605Open in IMG/M
3300014150|Ga0134081_10126559Not Available823Open in IMG/M
3300015356|Ga0134073_10062268Not Available1027Open in IMG/M
3300015371|Ga0132258_11407540All Organisms → cellular organisms → Bacteria1762Open in IMG/M
3300017657|Ga0134074_1223694Not Available672Open in IMG/M
3300017997|Ga0184610_1033998Not Available1446Open in IMG/M
3300017997|Ga0184610_1054593Not Available1189Open in IMG/M
3300018000|Ga0184604_10367165Not Available513Open in IMG/M
3300018028|Ga0184608_10047218All Organisms → cellular organisms → Bacteria1685Open in IMG/M
3300018028|Ga0184608_10068420Not Available1435Open in IMG/M
3300018031|Ga0184634_10549053Not Available511Open in IMG/M
3300018051|Ga0184620_10096708Not Available905Open in IMG/M
3300018052|Ga0184638_1000033All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira29903Open in IMG/M
3300018053|Ga0184626_10011890All Organisms → cellular organisms → Bacteria3429Open in IMG/M
3300018054|Ga0184621_10188979Not Available739Open in IMG/M
3300018056|Ga0184623_10001195All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira11299Open in IMG/M
3300018063|Ga0184637_10022382All Organisms → cellular organisms → Bacteria3810Open in IMG/M
3300018072|Ga0184635_10007897All Organisms → cellular organisms → Bacteria3579Open in IMG/M
3300018075|Ga0184632_10177212Not Available940Open in IMG/M
3300018076|Ga0184609_10008968Not Available3667Open in IMG/M
3300018076|Ga0184609_10080001Not Available1444Open in IMG/M
3300018076|Ga0184609_10218049Not Available891Open in IMG/M
3300018078|Ga0184612_10004153All Organisms → cellular organisms → Bacteria7200Open in IMG/M
3300018468|Ga0066662_10322125Not Available1312Open in IMG/M
3300019254|Ga0184641_1301374Not Available672Open in IMG/M
3300019487|Ga0187893_10009863All Organisms → cellular organisms → Bacteria14508Open in IMG/M
3300019879|Ga0193723_1117654Not Available739Open in IMG/M
3300019885|Ga0193747_1083628Not Available784Open in IMG/M
3300020004|Ga0193755_1084206Not Available1018Open in IMG/M
3300020018|Ga0193721_1174154Not Available503Open in IMG/M
3300021073|Ga0210378_10033370Not Available2051Open in IMG/M
3300021476|Ga0187846_10030282All Organisms → cellular organisms → Bacteria2461Open in IMG/M
3300022534|Ga0224452_1141653Not Available740Open in IMG/M
3300022694|Ga0222623_10079274All Organisms → cellular organisms → Bacteria → Proteobacteria1274Open in IMG/M
3300025324|Ga0209640_10001200All Organisms → cellular organisms → Bacteria21605Open in IMG/M
3300025910|Ga0207684_10000555All Organisms → cellular organisms → Bacteria45995Open in IMG/M
3300025931|Ga0207644_11658102Not Available536Open in IMG/M
3300026324|Ga0209470_1030170All Organisms → cellular organisms → Bacteria2794Open in IMG/M
3300026331|Ga0209267_1125479Not Available1078Open in IMG/M
3300026536|Ga0209058_1252691Not Available619Open in IMG/M
3300026548|Ga0209161_10362658Not Available645Open in IMG/M
3300026552|Ga0209577_10156144All Organisms → cellular organisms → Bacteria1790Open in IMG/M
3300027013|Ga0209884_1013678All Organisms → cellular organisms → Bacteria787Open in IMG/M
3300027056|Ga0209879_1006053Not Available1868Open in IMG/M
3300027332|Ga0209861_1057689Not Available572Open in IMG/M
3300027490|Ga0209899_1055208Not Available815Open in IMG/M
3300027561|Ga0209887_1008045All Organisms → cellular organisms → Bacteria2843Open in IMG/M
3300027948|Ga0209858_1006547All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium860Open in IMG/M
3300027952|Ga0209889_1004240All Organisms → cellular organisms → Bacteria3952Open in IMG/M
3300027954|Ga0209859_1037253Not Available807Open in IMG/M
3300027957|Ga0209857_1055806Not Available690Open in IMG/M
3300028536|Ga0137415_10307045All Organisms → cellular organisms → Bacteria1391Open in IMG/M
3300028807|Ga0307305_10551662Not Available514Open in IMG/M
3300028807|Ga0307305_10552050Not Available514Open in IMG/M
3300030988|Ga0308183_1174134Not Available545Open in IMG/M
3300030990|Ga0308178_1040385Not Available836Open in IMG/M
3300030990|Ga0308178_1156987Not Available527Open in IMG/M
3300031677|Ga0307480_1007973Not Available700Open in IMG/M
3300031716|Ga0310813_11703015Not Available591Open in IMG/M
3300031940|Ga0310901_10253270Not Available723Open in IMG/M
3300032012|Ga0310902_10884977Not Available614Open in IMG/M
3300032075|Ga0310890_10159214Not Available1504Open in IMG/M
3300032180|Ga0307471_100005797All Organisms → cellular organisms → Bacteria7768Open in IMG/M
3300032205|Ga0307472_100103220All Organisms → cellular organisms → Bacteria1968Open in IMG/M
3300034681|Ga0370546_007622Not Available1206Open in IMG/M
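
Each scaffold identifier in the table above appears to combine two parts separated by `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table (for example `3300001431`), and a scaffold name. The sketch below splits an identifier under that assumption; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names, not an NMPFamsDB or IMG/M API:

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # sample identifier, assumed to be all digits
    scaffold: str   # scaffold name within that sample


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300001431|F14TB_101936104")
print(ref.taxon_oid, ref.scaffold)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key against the samples table.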




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 20.44%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 14.60%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 13.14%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 10.95%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 7.30%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.65%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.65%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.92%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.92%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.19%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.19%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.19%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.46%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.46%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.46%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 1.46%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.46%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.73%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.73%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.73%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.73%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.73%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.73%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.73%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.73%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.73%
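
The distribution column reports shares of family members, not counts. A rough way to recover approximate counts is sketched below, assuming the denominator is the 137 sequences listed under Basic Information; that denominator is an assumption, and `members_from_percent` is a hypothetical helper:

```python
FAMILY_SIZE = 137  # "Number of Sequences" from Basic Information (assumed denominator)


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into an approximate member count."""
    return round(percent / 100 * family_size)


# Distribution values copied from the habitat table, in table order.
distribution = [20.44, 14.60, 13.14, 10.95, 7.30, 3.65, 3.65, 2.92, 2.92,
                2.19, 2.19, 2.19, 1.46, 1.46, 1.46, 1.46, 1.46] + [0.73] * 9

counts = [members_from_percent(p) for p in distribution]
print(counts[0], sum(counts))
```

Because the percentages are rounded to two decimals, the recovered counts are estimates; comparing their sum against the family size is a quick consistency check on the assumed denominator.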




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300002122Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005355Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaGHost-AssociatedOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009081Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm May2015EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009800Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_30_40EnvironmentalOpen in IMG/M
3300009807Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10EnvironmentalOpen in IMG/M
3300009809Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40EnvironmentalOpen in IMG/M
3300009811Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30EnvironmentalOpen in IMG/M
3300009815Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009818Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40EnvironmentalOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300009823Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_40_50EnvironmentalOpen in IMG/M
3300009836Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20EnvironmentalOpen in IMG/M
3300010029Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012957Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019254Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2EnvironmentalOpen in IMG/M
3300020018Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027013Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027056Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027209Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027277Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027332 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40 (SPAdes) | Environmental
3300027490 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes) | Environmental
3300027561 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes) | Environmental
3300027577 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes) | Environmental
3300027948 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 (SPAdes) | Environmental
3300027952 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes) | Environmental
3300027954 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes) | Environmental
3300027957 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300030988 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_157 (Metagenome Metatranscriptome) | Environmental
3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental
3300031677 | Metatranscriptome of hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 (Metagenome Metatranscriptome) | Environmental
3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental
3300031940 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D2 | Environmental
3300032012 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300034681 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_121 (Metagenome Metatranscriptome) | Environmental

Geographical Distribution
[Interactive map, powered by OpenStreetMap]

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
F14TC_10023059033300000559SoilMATLKSKQPVKSSQDGANRSGEKAQRKAIPSETIMKSQXXXXXTDAQPTRRSTMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQL
F14TB_10193610413300001431SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYENWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARTSRKYAIDLNKKVMDTVISA*
C687J26623_1018972813300002122SoilMAAEEMKKTAEEMQQTGGATADDMRRTGEEFTRAAIAFDVSGMSQAWKQGYLRGLDAFFRSQEQTESLLKETVKQGISGSQQILQGYEKWLEQIQGQAGTASPFVEWSRQLVRSFHSNADPLFKTAADTAESAFNYYQNSFAHPARKYTVDVNKRVMDTVIAV*
Ga0066672_1007823033300005167SoilMAAEDIKKTAEELQQTGRATAEDIKRTSEEFARAAGNFDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVSWSRQLVRSFHTNADPIFKNAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066677_1079953713300005171SoilRGALLAMASPKKKKFAKGPHSRATQFPNRGRGNAMAAGTIKRSHRLSEEGPETQPQRRSTMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVR
Ga0066683_1040405613300005172SoilMATEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVVSA*
Ga0066690_1041548523300005177SoilMAAEDIKKTAEELQQTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNAL
Ga0066685_1036419213300005180SoilMATEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0070671_10185091513300005355Switchgrass RhizosphereEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0066686_1058921723300005446SoilMAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRRMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVVSA*
Ga0070706_100007204113300005467Corn, Switchgrass And Miscanthus RhizosphereMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0070707_10013728833300005468Corn, Switchgrass And Miscanthus RhizosphereMATEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNANPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0073909_1052653113300005526Surface SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTAESAFNYYENALARPSRK
Ga0070696_10019323723300005546Corn, Switchgrass And Miscanthus RhizosphereMTTEHFKERADEARKVYETSAEDMGRIGEAFARAASTFDLKGMSNAWKLGYLRGLEAVFQSQEQTHHFVKETVKQGINGAQQMLQSYDKCLDDVQGKAGTALPFVELSRQFMRSVQHAADPLYKTAADTTESAFNYYEDSLARPSRQYAIDLNKRVLDTIITA*
Ga0066692_1029459423300005555SoilMTAEDIKKTAEQTQKTGAATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAVAGSPFVEWSRQLVRSFHTNADPLFKGAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066699_1017916223300005561SoilMAAEDIKKTAEESQKTGRATAEDIKRTSEEFARAASNYDVKGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAVAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQ
Ga0066708_1049538923300005576SoilEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLYGLETFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066691_1019232523300005586SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAVAGSPFVEWSRQLVRSFHTNADPLFKGAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066706_1026946323300005598SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066903_10329783413300005764Tropical Forest SoilMAAEEIKERAEKMREAGETTAEDLRRTGEEFTKAASTFDLKGMSNAWKQGYLRGLEAFFQSQEQTEHLLKEMVKQGISGSEQILQSYEKWLEQTQGQAGAASPFVGWARQLVHSFRSTADQLFKTAGDTTESAFNYYENALGRPSRQYAVDLNKKVMDTVISA*
Ga0066696_1021379923300006032SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAVAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0075417_1032288113300006049Populus RhizosphereMAAEEMKKRAEEIQKTGEATAEHLKRTGEDFTRAASDFDVKGMSKAWKEGYVRGLEALFVSQEQTGLLFKETVKQGISGSRHMLQAYEKWLEQIQGQAGAASPFVEWSRQFVRSLQGNADPLFKTAADSVENTFTYYENAVGRPSRKYALDLHKKVMDTVISA*
Ga0066665_1036058013300006796SoilMTAEDIKKTAEQTQKTGAATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALAR
Ga0066660_1014437613300006800SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNQKAMDTVISA*
Ga0075430_10127646113300006846Populus RhizosphereMAIEEMKKRAEDIHKTGEATAEHMKRTGEDFTKAAGDFDVKGMNKAWKDGYVRGLEAFFVTQEQTGLLLKETVKQGISGSRQMLQVYEKWLEQIQGQAGAASPFVEWSRQFVHSFQGNADPLFKTAAATVESTYTYYENAVGRPSRKYALDLHKKVIDTVISA*
Ga0075425_10073204723300006854Populus RhizosphereHMKRTGEDFTKAAGDFDVRGMSKAWKDGYVRGLEAFFVTQEQTGLLVKETVKQGISGSRHMLQVYEKWLEQIQGHAGAASPFVEWSRQFVHSFQGNADPLFKTAAATVESTYTYYENAVGRPSRKYTLDLHKKVIDTVISA*
Ga0075429_10059270123300006880Populus RhizosphereMAAEEMKKRAEEIQKTGEATAEHLKRTGEDFTRAASDFDVKGMSKAWKEGYVRGLEALFVSQEQTGLLFKETVKQGISGSRHMLQAYEKWLEQIQGQAGAASPFVEWSRQFVRSLQGNADPLFKTATDSVENTFTYYENAVGRPSRKYALDLHKKVMDTVISA*
Ga0066710_10313948023300009012Grasslands SoilFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0105098_1026849213300009081Freshwater SedimentGTDAQPTRRSTMAAEDIKKKGEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEEWLEQVQGQAGAASPFVEWSRQLVRSFHSATDPLFKTAADTTESAFNYYENALARPSRKYAIDLNKKVMDSVISA*
Ga0066709_10089369823300009137Grasslands SoilMASRKKKKFAKGPHSRTSQSQNRGQGKAMAAGTIKRSHRLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0066709_10316175413300009137Grasslands SoilMATEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLQAFFQSQEQTERLLKETVKQGLSGSQQILQGYEKWLDQIQGQAGAGSLFVEWSRQLVRSFHTNADPSFKSAADATESAFNYY
Ga0075423_1211546113300009162Populus RhizosphereMAAEDIKKRAEDIQKTGEATAEHMKRTGEDFTKAAGDFDVRGMSKAWKDGYVRGLEAFFVTQEQTGLLVKETVKQGISGSRHMLQVYEKWLEQIQGHAGAASPFVEWSRQFVHSFQGNADPLFKTAAATVESTYTYYENAVGRPSRKYTLDLHKKVIDTVISA*
Ga0105249_1195798913300009553Switchgrass RhizosphereISETREKPQKPAVPAAAISQSHRLSEVIHEQPTRRPTMNEDMNKTAEEMQKTGKQTAEDLKRTGEEFAKAARDFDVKGMSKIWKQGYLGGLEAFYQSQEQSERLVKETVKQGISGSQQMFQAYEKWLEQIQGNAGAASPFVDWPRQIVRALHNNADPFFKTAADTADNAFNYYENALARPSRKYTLDLHAKVIDTVIPA*
Ga0105069_101939613300009800Groundwater SandMATPKSKKLVKSSQDGANRSGEKVQRKAIPSETIMKSQRLSDGTDAQPTRRSTMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHSTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0105061_100317513300009807Groundwater SandMATPKRKKPVKSSQDRANHSREMAQRKAIPSETIMKSHRLSDGTGVQPTRRSTMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA*
Ga0105089_100554423300009809Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA*
Ga0105084_102768113300009811Groundwater SandTGEEFTKAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISSSEQILQSYEKWLEQIQDQAGAASPLVEWSRQLVRSFHSRADPLFKTAADTTESAFNYYQNALARPSRKYAVDLNKQVMDSVLSA*
Ga0105070_101530913300009815Groundwater SandMAPPKRKKPAKSSQDRANHSREKAQRKAIPSEAIMKSHRLSDGTGAQPTRRSIMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA*
Ga0105070_113047213300009815Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDL
Ga0105076_100105853300009816Groundwater SandMATRKRKKPVKSSQDRANHSREMAQRKAIPSETIMKSHRLSDGTGVQPTRRSTMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA*
Ga0105076_107063513300009816Groundwater SandDRANHSREKAQRKAIPSETIMKSHRLSDGTGAQATRRSTMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTDSAFNYYENALARPSRKYAVDLNKKVMDTVISA*
Ga0105072_100679743300009818Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSMARPSRNYAIDLNKKVLDTIITA*
Ga0105072_101638613300009818Groundwater SandMAPPKRKKPAKSSQDRANHSREKAQRKAIPSETIMKSHRLSDGTGAQPTRRSTMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA*
Ga0105064_104361623300009821Groundwater SandFNKAASSFDLKGMSNAWKQGYLRGLEAVFQSQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA*
Ga0105066_108057513300009822Groundwater SandMAPPKRKKPAKSSQDRANHSREKAQRKAIPSETIMKSHRLSDGTGAQPTRRSIMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA*
Ga0105066_108739013300009822Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSIHRTADPLLKTAADTTESAFNYYEDSMARPSRNYAIDLNKKVLDTIITA*
Ga0105078_102454813300009823Groundwater SandMATPKSKKPVKSSQDGANRSGEKAQRKAILSETIMKSHRLSDGTGVQPTRRSTMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA*
Ga0105068_106194213300009836Groundwater SandMATPKRKKPAKSSQDRANYSREKVQRKAIPSETIMKSQRLSDGTGAQPTRRSTMDAAELKKRAEEMRKAGETTAEDMRRTGEEFTKAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYY
Ga0105074_105151123300010029Groundwater SandAQRKAIPSETIMKSHRLSDGTGAQATRRSTMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTKAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKQLMDSVLSA*
Ga0134063_1039224813300010335Grasslands SoilMANPKKKKFAQDPHSGASQSRNKAQRKAMAAGTIKRSHRLSEEGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVSWSRQLVRSFHTNAEPIFKNAADATESAFNYYQNALARPS
Ga0126378_1035517123300010361Tropical Forest SoilMAAEEIKERAEKMREAGETTAEDLRRTGEEFTKAASTFDLKGMSNAWKQGYLRGLEAFFQSQEQTEHLLKEMVKQGISGSEQILQSYEKWLEQTQGQAGAASPFVGWARQLVHSFRSTADQLFKTAADTTESAFNYYENALGRPSRQYAVDLNKKVMDTVISA*
Ga0126383_1150430213300010398Tropical Forest SoilMAAEEIEKRAEKMQKAGETTAEDMRRTGEEFTKAASTFDLKGMSNAWKQGYLRGLEAFFQSQEQTEHLLKEMVKQGISGSEQILQSYEKWLEQTQGQTGAASPFVGWARQLVHSFRSTADQLFKTAADTTESAFNYYENALGRPSRQYAVDLNKKVMDTVISA*
Ga0134122_1019910223300010400Terrestrial SoilMTTEHFKERADEARKVYETSAEDMGRIGEAFARAASTFDLKGMSNAWKLGYLRGLETVFQSQEQTHHFVKETVKQGINGAQQMLQSYDKCLDEVQGKAGTALPFVELSRQFMRSVQHAADPLYKTAADTTESAFNYYEDSLARPSRQYAIDLNKRVLDTIITA*
Ga0137364_1024060523300012198Vadose Zone SoilMASPKKKKFAKGPQSRTSQSQNRGQGKAMAAGTIKRSHRLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAVAGSPFVEWSRQLVRSFHPNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137365_1003015663300012201Vadose Zone SoilMASPKKKKFAKGPHSRTSQSQNRGQGKAMAAGTIKRSHRLSEEGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137380_1019534323300012206Vadose Zone SoilMEAGTIKRSHRLFEEGPETQPQRRSTMAAEDIKKTAEELQKTGRETAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137381_1048093123300012207Vadose Zone SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVDWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137376_1120503113300012208Vadose Zone SoilKKKFAKGPHSRTSQSQNRGQGKAMAAGTIKRSHRLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137370_1003383223300012285Vadose Zone SoilMASPKKKKFAKGPHSRTSQSQNRGQGKAMAAGTIKRSHRLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNY*
Ga0137387_1054739113300012349Vadose Zone SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIHGQAGAGSPFVEWSRQLVRSFHSNADPLFKTAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0137386_1114194213300012351Vadose Zone SoilTSQYKNRAQGTAMAAGTIKRSHSLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTEHLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPLFKGAADATESAFNYYQNALARP
Ga0137385_1049859113300012359Vadose Zone SoilMASPKKKKFAKGPHSRTSQSQNRGQGKAMAAGTIKRSHRLSEKGPETQPQRRSTMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA*
Ga0164303_1000035883300012957SoilMATPKSKQPVKSSQDGANRSGEKAQRKAIPSETIMKSQRLSDGTDAQPTRRSTMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENALARPSRKYAIDLNKKVMDTVISA*
Ga0164309_1117656923300012984SoilIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEEILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0164307_1001785413300012987SoilKAEEMRRSSETTAEDMRQTGEEFTRAASAFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0134081_1012655923300014150Grasslands SoilMAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLN
Ga0134073_1006226823300015356Grasslands SoilMAAEDIKKTAEESQKTGRATAEEIKRTSEDFARAASNYDVKGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVVSA*
Ga0134089_1055526913300015358Grasslands SoilADMRKLTMATPKRKHPAKRSQAGATHSREKAQRKTIPSETIMKSPRLSDGTGAQLTRRSTMDAAEIKKRAEEMRKTGETTAEDMRRTGEEFTKAASTFDLKGMSKAWKQGYLRGLEAFFQTQEQTEQFLKETVKQGISGSEQILQSYEKWLEQIQGQAGAASPFVEWSR
Ga0132258_1140754013300015371Arabidopsis RhizosphereMAAEDIQKSDGTDAQPTRRSTMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEEILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0132256_10001433183300015372Arabidopsis RhizospherePVKSSQDGANRSGEKAQRKAIPSETIMKSQRLSDGTHAQPTRRSTMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEEILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA*
Ga0134074_122369413300017657Grasslands SoilTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0184610_103399813300017997Groundwater SedimentGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKNAADTTESAFNYYENAVARPSRKGNFDVQKKVMDTVLSV
Ga0184610_105459323300017997Groundwater SedimentMAEDMRRTGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENAVARPSRKHTLDLHAKVIDTVISANGAFKRE
Ga0184604_1036716513300018000Groundwater SedimentMAEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKSWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENAVARPSR
Ga0184608_1004721823300018028Groundwater SedimentMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLQAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0184608_1006842013300018028Groundwater SedimentMAAEDMKKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHALDLHAKVIDTVISANGAFKRE
Ga0184634_1054905313300018031Groundwater SedimentTMAAEEMKKTAEEIHKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAFFQSQEQTERLVKETVKQGISSSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENAVARPSRKGTFDVQKKVMDTVLTV
Ga0184620_1009670813300018051Groundwater SedimentMTEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHTLDLHAKVIDTVISANGAFKRE
Ga0184638_1000033253300018052Groundwater SedimentMAAEEMKKTAEEIHKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKNAADTTESAFNYYENAVARPSRKGTFDVQKKVMDTVLTV
Ga0184626_1001189013300018053Groundwater SedimentMAAEEMKKTAEEMQKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKNAADTTESAFNYYENAVARPSRKGTFDVQKKVMDTVLSV
Ga0184626_1001209263300018053Groundwater SedimentMATPKRKKPAKSSQDGANHSREKAQRKAIPSETIMKSHRLSDGTGAQPTRRSTMAAAEMKKRAEEMRKTGETTAEDMRRTGEEFTRASSTFDLKGMGKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISV
Ga0184621_1018897913300018054Groundwater SedimentMTEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHTLDLHAKVIDTVISANGAFKR
Ga0184623_1000119543300018056Groundwater SedimentMAAEEMKKTAEEIHKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENAVARPSRKGNFDVQKKVMDTVLSV
Ga0184637_1002238223300018063Groundwater SedimentMAAAEMKKRAEEMRKTGETTAEDMRRTGEEFTRASSTFDLKGMGKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA
Ga0184635_1000789733300018072Groundwater SedimentMKAEEMKKTAEEMQKNGEAMAEDMRRTGEEFTRAASNFDVKGMSQAWKQSYLHGLEAFFQSQEQTERFLKETVKQRISDSQQILQGYEKWLDQIQSQAGLASPFIEWSRQLVRSFHSNADPAFKTAADTTETAFNYYQNALARPSRKYAADLNKKVIDTVIAA
Ga0184632_1017721213300018075Groundwater SedimentMAAEEMRKTAEEMQQTGGATAENMRRTSEEFTRAASAFDVRGMSQAWKHGYLNGLDAFLQSQEQTERLLKETVKQGISGSQQILQNYEKWLEQIQSQTGSASPFVEWSRQLVRSFHSTADPLFKTAADTAETAFNYYQNSLARPARKYTVDLNKKVIDTVLSA
Ga0184609_1000896833300018076Groundwater SedimentMAAEEMKKTAEEIHKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENAVARPSRKGTFDVQKKVMDTVLTV
Ga0184609_1008000113300018076Groundwater SedimentMTEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKIWKQGYLGGLEALYQSQEQNERLLKETVKQGISGSQQMLQVYEKWLEQIQGHVGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHTLDLHAKVIDTVISANGAFKRE
Ga0184609_1021804913300018076Groundwater SedimentMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNFDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQRISDSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0184612_1000415313300018078Groundwater SedimentMAAEEMKKTAEEMQKTGGATAEDMRRTGEEFTRAARDLDLKGMSKAWKYGYLHGLEAVFQSQEQTERLLKETVKQGISGSQQMLQVYDKWLEQIQSQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENAVARPSRKGTFDVQKKVMDTVLTV
Ga0184627_1002679533300018079Groundwater SedimentMATPKRKKPAKSSQDGANHSREKAQRKAIPSEPIMKSHRLSDGTGAQPTRRSTMAAAEMKKRAEEMRKTGETTAEDMRRTGEEFTRASSTFDLKGMGKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGQAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA
Ga0066662_1032212523300018468Grasslands SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVVSA
Ga0184641_130137413300019254Groundwater SedimentMAEDMNKTAEEMQKTGKATAEDMKTRGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMFQVYEKWLEQIQGQAGAASPFVEWSRQAVRAVHNNADPFFKTATDTADHAFNYYENALARPSRKYTLDLHAKVID
Ga0187893_10009863123300019487Microbial Mat On RocksMSAEGFKKRAEEVRKAGETTAEDMGRIGEEFARAAGSFDLKGMSNAWKQGYLRGLEAIFQSHEQTGRLLKETVQHGISGSQQMLQSYDKCLEEIQGKAGAALPFVEWSRQLMCSFHRTADPLFKTAADTTDSAFNYYEDSLARPSRNYAIDLNKKVMDTFISA
Ga0193723_111765423300019879SoilMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLQAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQSQAGTGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAAD
Ga0193747_108362823300019885SoilMATEDIKKTAEEMQKNGEATAEDMRRTSEEFTRAASNFDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAVLPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0193755_108420613300020004SoilMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLQAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQSQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0193735_109950713300020006SoilSQSRTTHSKEKAQKKAIPDETIRKSHRLSEAIHAQPTGRSTMAEDMNKTAEEMQKAGKATAEDMKTRGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQDQSERLLKETVKQGISGSQQMFQVYEKWLEQIQGQAGAASPFVEWSRQAVRAFHSNADPFFKTAADTAENAFNYYENALARPSRKYTLDLHAKVIDTVISA
Ga0193721_117415413300020018SoilMAEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKSWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGQAGVASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSR
Ga0210378_1003337023300021073Groundwater SedimentMAAEDMKKTAEEMQKTAKRPEDMRRTGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHVGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHALDLHAKVIDTVISANGAFKRE
Ga0187846_1003028223300021476BiofilmMAAEEIKKRAEKMRKTGETTAEDMRRTGEEFTKAASTFDLKGMSNAWKQGYLRGLEAFFQSQEQTEHLLKEMVKQGISGSEQILQSYEKWLEQTQGQAGAASPFVGWARQLVHSFRSTADQLFKTAADTTETAFNYYENALGRPSRQYAVDLNKKVMDTVISA
Ga0224452_114165323300022534Groundwater SedimentMTEDMNKTAEEMQKTGKAKAEEMKKTGEEFTRAAREFDIKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHAGAASPFVELSRQAVRAFHNNADPFLKTVADTAENAFNYYENALARPSRQHALDLHAKVIDTVISANGAFKRE
Ga0222623_1007927423300022694Groundwater SedimentMAAEDMKKTAEEMQKTAKRPEDMRRTGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQDQSERLLKETVKQGISGSQQMFQVYEKWLEQIQGQVGAASPFVEWSRQAVRAFHNNADPFFKTVADTTENAFNYYENALARPSRKYTLDLHAKVIDTVISANGAFKRE
Ga0209640_10001200213300025324SoilMAAEEMKKTAEEMQQTGGATADDMRRTGEEFTRAAIAFDVSGMSQAWKQGYLRGLDAFFRSQEQTESLLKETVKQGISGSQQILQGYEKWLEQIQGQAGTASPFVEWSRQLVRSFHSNADPLFKTAADTAESAFNYYQNSFAHPARKYTVDVNKRVMDTVIAV
Ga0207684_10000555393300025910Corn, Switchgrass And Miscanthus RhizosphereMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0207644_1165810213300025931Switchgrass RhizosphereEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEEILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA
Ga0209470_103017053300026324SoilMATEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVVSA
Ga0209267_112547913300026331SoilMAAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0209058_125269113300026536SoilMAEDIKKTAEELQKTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0209161_1036265813300026548SoilMAAEDIKKTAEELQQTGRATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVEWSRQLVRSFHTNADPIFKSAADATESAFNYYQNALARPSRKYAADLNKKAMD
Ga0209577_1015614423300026552SoilMAAEDIKKTAEELQQTGRATAEDIKRTSEEFARAAGNFDVRGMSQAWKQSYLHGLEAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQGQAGAGSPFVSWSRQLVRSFHTNADPIFKNAADATESAFNYYQNALARPSRKYAADLNKKAMDTVISA
Ga0209884_101367813300027013Groundwater SandEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQSQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA
Ga0209879_100605333300027056Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA
Ga0209875_100111913300027209Groundwater SandMAPPKRKKPAKSSQDRANHSREKAQRKAIPSETIMKSHRLSDGTGVQPTRRSTMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA
Ga0209846_100446913300027277Groundwater SandMATPKRKKPVKSSQDRANHSSEKAQRKAIPSETIMKSHRLSDGTGVQPTRRSTMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA
Ga0209861_105768923300027332Groundwater SandMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYEN
Ga0209899_105520813300027490Groundwater SandMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA
Ga0209887_100804523300027561Groundwater SandMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQSYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKQVMDSVLSA
Ga0209874_100130983300027577Groundwater SandMATPKRKKPAKSSQDRANHSREKAQRKAIPSETIMKSHRLSDGTGAQATRRSTMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQDQAGAASPLVEWSRQLVRSLHSTADPLFKTAADTTESAFNYYQNALARPSRKYAVDLNKKVMDTVISA
Ga0209858_100654713300027948Groundwater SandMTAENIKERAEEVRKAAETTAEDMGRIGEEFTKAASSFDLKGMSNAWKQGYLRGLEAVFQSQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAID
Ga0209889_100424083300027952Groundwater SandMDAAEMQKRAEEMRKTGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENALARPSRKYAVDLNKKVMDTVISA
Ga0209859_103725313300027954Groundwater SandMTAENIKERAEEVRKAAETTAEDMRRTGEEFTRAASTFDLKGMSNAWKQGYLRGLEAVFQTQEQTERLLKETVKQGISGSQQLLQRYDKCLEEVQGKAGAALPFVELSRQLMRSVHRTADPLLKTAADTTESAFNYYEDSLARPSRNYAIDLNKKVLDTIITA
Ga0209857_105580613300027957Groundwater SandMDAAELKKRAEEMRKAGETTAEDMRRTGEEFTRAASTFDLKGMSKAWKQGYLRGLEAFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQIQGEAGAASPFVEWSRQLVRSFHSTADPLFKTAADTTESAFNYYENA
Ga0137415_1030704523300028536Vadose Zone SoilMAAEEMKKTAEDLKKTGEATAENMRRTGEEFTRAASTFDVRGMSQAWKQGYLRGLEAFFQSQEQTEHLLKETLKQGISGSEQILKSYEKLLEQIQGQAGSASPFTEWARQLVRSFHSNADPLFKTAADTVESAFNYYEDSLARPSRKYTVDFNKKVMDSVIAA
Ga0307305_1055166213300028807SoilMAAEDIKKTAEELQKTGQATAEDIKRTSEEFARAAGNCDVRGMSQAWKQSYLHGLQAFFQSQEQTERLLKETVKQGISGSQQILQGYEKWLDQIQDQAGAGSPFVEWSRQLVRSFHTNADPFFKGAADATESAFNYYQNALARPSRKY
Ga0307305_1055205013300028807SoilMAEDMNKTAEEMQKTGKATAEDMKTRGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMFQVYEKWLEQIQGQAGAASPFVEWSRQAVRAVHNNADPFFKTVADTTEKAFNYYENALARPSRKYTLDLHAKVIDTVIS
Ga0308183_117413413300030988SoilMAAEEMKKRAEEIQKTGEATAEHLKRTGEDFTRAASDFDVKGMSKAWKDGYVRGLEALFVSQEQTGLLFKETVKQGISSSRQMFQAYEKWLEQIQGQAGAASPFVEWSRQFVRSFQGNADPLFKTAADSVENTFTYYENAVARPSRKYALIYTKKSWTLSSLHEA
Ga0308178_104038513300030990SoilMKKTAEDLKKTGEATAENMRRTGEEFTRAASTFDVKGMSKAWKQSYLHGLEAFFQSQEQTERLLKETVRQGISNSQQILQGYEKWLEQIQGQASVASPLADWSRQLVRSFHSNADPLFKTAADTVESAFNYYEDSLARPSRKYTVDFNKKVMDSVIPA
Ga0308178_115698713300030990SoilMAAEEMKKRAEEIQKTGEATAEHLKRTGEDFTRAASDFDVKGMSKAWKDGYVRGLEALFVSQEQTGLLFKETVKQGISGSRQMLQAYEKWLEQIQGQAGAASPFVEWSRQFVRSFQGNADPLFKTAADSVENTFTYYENAVARPSRKYALDLHKK
Ga0307480_100797323300031677Hardwood Forest SoilMAAEEIEKRAEEIRKAGETTAEDMRRTGEEFTKAASTFDLKGMSNTWKQGYLRGLEAFFQSQEQTEHLLKEMVKQGLSGSEQILQSYEKWLEQTQGQAGAASPFVGGARQLVHSFHSTADQLFKTAADTTESA
Ga0310813_1170301513300031716SoilKKRAEEIQKTGEATAEHLKRTGEDFTRAASDFDVKGMSKAWKEGYVRGLEALFVSQEQTGLLFKETVKQGISGSRHMLQAYEQWLEQIQGQAGAASPFVEWSRQFVRSLQGNADPLFKTAADSVENTFTYYENAVGRPSRKYALDLHKKVMDTVISA
Ga0310901_1025327023300031940SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASAFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENALARPSRKYAIDLNKKVMDTVISA
Ga0310902_1088497713300032012SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYKNALARPSRKYAIDLNKKVMDTVISA
Ga0310890_1015921423300032075SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENAVARPSRKYAIDLNKKVMDTVISA
Ga0307471_10000579733300032180Hardwood Forest SoilMAAEEMKKTAEDLKKTGEATAENMRRTGEEFTRAASTFDVRGMSQAWKFGYLSGLEAFFQSQEQTERLLKETLKQGISGSQQILQGYEKWLEQIQGQAGSASPFTEWSRQLVRSFHSNADPLFKTAADTVESAFNYYEDSLARPSRKYTVDFNKKVMDSVISA
Ga0307472_10010322023300032205Hardwood Forest SoilMAAEDIKKKAEEMRRSSETTAEDMRQTGEEFTRAASTFDLKGMSKAWKQGYLHGLEGFFQSQEQTEHLLKETVKQGISGSEQILQSYEKWLEQVQRQAGAASPFVEWSRQLVRSFHTTTDPLFKTAADTTESAFNYYENALARPSRKYAIDLNKKVMDTVISA
Ga0370546_007622_36_5243300034681SoilMAEDMNKTAEEMQKTGKATAEDMKTRGEEFTRAAREFDVKGMSKIWKQGYLGGLEALYQSQEQSERLLKETVKQGISGSQQMLQVYEKWLEQIQGHVGAASPFVELSRQAVRAFHNNADPFFKTAADTAENAFNYYENALARPSRQHALDLHAKVIDTVISA
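
Each row in the table above runs its four fields together with no separator: protein ID, sample taxon ID, habitat, and sequence. The following Python sketch shows one possible way to split such a row and emit a FASTA record. It rests on assumptions that are not stated on this page: that the sample taxon ID is always exactly ten digits, that the habitat name starts with an uppercase letter and ends with a lowercase letter, and that the sequence is uppercase letters with an optional trailing `*`. The names `FamilyRow`, `parse_row`, and `to_fasta` are hypothetical, and the example row is made up for illustration rather than taken from the table.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumed layout: <protein ID><10-digit taxon ID><habitat><sequence>[*]
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"               # shortest prefix before the taxon ID
    r"(?P<taxon_id>\d{10})"                # ten digits directly before the habitat
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"  # ends at the last lowercase letter
    r"(?P<sequence>[A-Z]+)\*?$"            # residues; a trailing '*' is dropped
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row; return None if it does not fit the layout."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return FamilyRow(**match.groupdict())


def to_fasta(row: FamilyRow, width: int = 60) -> str:
    """Format a parsed row as a FASTA record with wrapped sequence lines."""
    header = f">{row.protein_id} taxon={row.taxon_id} habitat={row.habitat}"
    chunks = [row.sequence[i:i + width] for i in range(0, len(row.sequence), width)]
    return "\n".join([header, *chunks])


if __name__ == "__main__":
    # Hypothetical row, shaped like the table entries but not real data.
    example = "Ga0000001_1000001013300000001Groundwater SandMAAEDIKK*"
    row = parse_row(example)
    if row is not None:
        print(to_fasta(row))
```

Lines that do not fit the assumed layout, such as the column header, yield `None` and can be skipped or logged by the caller.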




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice