NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F068264

Metagenome / Metatranscriptome Family F068264

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F068264
Family Type Metagenome / Metatranscriptome
Number of Sequences 125
Average Sequence Length 143 residues
Representative Sequence MKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Number of Associated Samples 104
Number of Associated Scaffolds 125

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 79.20 %
% of genes near scaffold ends (potentially truncated) 38.40 %
% of genes from short scaffolds (< 2000 bps) 80.00 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (61.600 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(12.000 % of family members)
Environment Ontology (ENVO) Unclassified
(26.400 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(36.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 14.29%    β-sheet: 22.86%    Coil/Unstructured: 62.86%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 125 Family Scaffolds
PF10576EndIII_4Fe-2S 39.20
PF00730HhH-GPD 8.00
PF02195ParBc 4.00
PF14791DNA_pol_B_thumb 3.20
PF00633HHH 2.40
PF12850Metallophos_2 1.60
PF08281Sigma70_r4_2 1.60
PF12679ABC2_membrane_2 1.60
PF13502AsmA_2 0.80
PF07282OrfB_Zn_ribbon 0.80
PF01381HTH_3 0.80
PF12846AAA_10 0.80
PF05960DUF885 0.80
PF02230Abhydrolase_2 0.80

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 125 Family Scaffolds
COG01223-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 8.00
COG0177Endonuclease IIIReplication, recombination and repair [L] 8.00
COG1059Thermostable 8-oxoguanine DNA glycosylaseReplication, recombination and repair [L] 8.00
COG1194Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairsReplication, recombination and repair [L] 8.00
COG22313-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamilyReplication, recombination and repair [L] 8.00
COG4805Uncharacterized conserved protein, DUF885 familyFunction unknown [S] 0.80


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A61.60 %
All OrganismsrootAll Organisms38.40 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000364|INPhiseqgaiiFebDRAFT_101358694All Organisms → cellular organisms → Bacteria687Open in IMG/M
3300000891|JGI10214J12806_10047234All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes8303Open in IMG/M
3300001372|YBBDRAFT_1156042All Organisms → cellular organisms → Bacteria1333Open in IMG/M
3300004114|Ga0062593_100898222All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300004463|Ga0063356_100884331Not Available1259Open in IMG/M
3300004463|Ga0063356_101050664Not Available1168Open in IMG/M
3300005330|Ga0070690_100439829Not Available965Open in IMG/M
3300005332|Ga0066388_102534912Not Available933Open in IMG/M
3300005338|Ga0068868_101403328Not Available651Open in IMG/M
3300005440|Ga0070705_101083054Not Available655Open in IMG/M
3300005444|Ga0070694_100783489Not Available781Open in IMG/M
3300005471|Ga0070698_100848839Not Available858Open in IMG/M
3300005518|Ga0070699_100637935Not Available972Open in IMG/M
3300005518|Ga0070699_101125662Not Available720Open in IMG/M
3300005534|Ga0070735_10118515All Organisms → cellular organisms → Bacteria1656Open in IMG/M
3300005534|Ga0070735_10709408Not Available594Open in IMG/M
3300005536|Ga0070697_101455162All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales612Open in IMG/M
3300005537|Ga0070730_10001068All Organisms → cellular organisms → Bacteria27317Open in IMG/M
3300005538|Ga0070731_10002794All Organisms → cellular organisms → Bacteria16260Open in IMG/M
3300005538|Ga0070731_10077878All Organisms → cellular organisms → Bacteria2199Open in IMG/M
3300005544|Ga0070686_101531486Not Available563Open in IMG/M
3300005549|Ga0070704_101074538Not Available730Open in IMG/M
3300005549|Ga0070704_101534507Not Available613Open in IMG/M
3300005615|Ga0070702_101377673Not Available576Open in IMG/M
3300005617|Ga0068859_100669030Not Available1130Open in IMG/M
3300005719|Ga0068861_101562605Not Available649Open in IMG/M
3300005764|Ga0066903_102233990Not Available1056Open in IMG/M
3300005829|Ga0074479_10146991All Organisms → cellular organisms → Bacteria2654Open in IMG/M
3300005829|Ga0074479_10340047All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium3316Open in IMG/M
3300005829|Ga0074479_10425601All Organisms → cellular organisms → Bacteria1249Open in IMG/M
3300005829|Ga0074479_11022310All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium3416Open in IMG/M
3300005836|Ga0074470_10509677All Organisms → cellular organisms → Bacteria33953Open in IMG/M
3300005952|Ga0080026_10169912Not Available637Open in IMG/M
3300006046|Ga0066652_100285326Not Available1460Open in IMG/M
3300006844|Ga0075428_102374324Not Available545Open in IMG/M
3300006845|Ga0075421_101195948Not Available848Open in IMG/M
3300006846|Ga0075430_100063982All Organisms → cellular organisms → Bacteria3090Open in IMG/M
3300006852|Ga0075433_10685820Not Available898Open in IMG/M
3300006854|Ga0075425_100503953Not Available1394Open in IMG/M
3300006854|Ga0075425_100924510Not Available997Open in IMG/M
3300006854|Ga0075425_101464424Not Available772Open in IMG/M
3300006865|Ga0073934_10213940Not Available1298Open in IMG/M
3300006865|Ga0073934_10772003Not Available549Open in IMG/M
3300006954|Ga0079219_12330720Not Available518Open in IMG/M
3300007004|Ga0079218_10856271Not Available884Open in IMG/M
3300009012|Ga0066710_100563352All Organisms → cellular organisms → Bacteria1725Open in IMG/M
3300009094|Ga0111539_10008367All Organisms → cellular organisms → Bacteria13164Open in IMG/M
3300009100|Ga0075418_10379212Not Available1511Open in IMG/M
3300009147|Ga0114129_10622666Not Available1396Open in IMG/M
3300009157|Ga0105092_10184327Not Available1164Open in IMG/M
3300009162|Ga0075423_11335093Not Available767Open in IMG/M
3300009162|Ga0075423_13158659Not Available505Open in IMG/M
3300009527|Ga0114942_1072952All Organisms → cellular organisms → Bacteria1093Open in IMG/M
3300009678|Ga0105252_10083772All Organisms → cellular organisms → Bacteria1258Open in IMG/M
3300009678|Ga0105252_10219967Not Available821Open in IMG/M
3300010039|Ga0126309_10654078Not Available669Open in IMG/M
3300010337|Ga0134062_10423493All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales656Open in IMG/M
3300010391|Ga0136847_11910296Not Available867Open in IMG/M
3300010397|Ga0134124_10023027All Organisms → cellular organisms → Bacteria5119Open in IMG/M
3300010399|Ga0134127_10938222Not Available923Open in IMG/M
3300010399|Ga0134127_12651954Not Available581Open in IMG/M
3300010400|Ga0134122_10033561All Organisms → cellular organisms → Bacteria3892Open in IMG/M
3300010400|Ga0134122_10300861Not Available1377Open in IMG/M
3300010401|Ga0134121_11681924Not Available657Open in IMG/M
3300010403|Ga0134123_10101006All Organisms → cellular organisms → Bacteria → Acidobacteria → Thermoanaerobaculia → Thermoanaerobaculales → Thermoanaerobaculaceae → unclassified Thermoanaerobaculaceae → Thermoanaerobaculaceae bacterium2295Open in IMG/M
3300010938|Ga0137716_10096132All Organisms → cellular organisms → Bacteria2299Open in IMG/M
3300011434|Ga0137464_1115471Not Available797Open in IMG/M
3300011440|Ga0137433_1118501Not Available838Open in IMG/M
3300012905|Ga0157296_10068859All Organisms → cellular organisms → Bacteria887Open in IMG/M
3300012912|Ga0157306_10080184Not Available900Open in IMG/M
3300012913|Ga0157298_10268945All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes587Open in IMG/M
3300012916|Ga0157310_10215643All Organisms → cellular organisms → Bacteria706Open in IMG/M
3300012944|Ga0137410_10000590All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes25502Open in IMG/M
3300012955|Ga0164298_10870752Not Available652Open in IMG/M
3300012971|Ga0126369_10957195All Organisms → cellular organisms → Bacteria943Open in IMG/M
3300014166|Ga0134079_10580797Not Available554Open in IMG/M
3300015245|Ga0137409_10742729Not Available816Open in IMG/M
3300015374|Ga0132255_104090234All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes619Open in IMG/M
3300017961|Ga0187778_10001594All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes15612Open in IMG/M
3300017965|Ga0190266_11048703Not Available551Open in IMG/M
3300019356|Ga0173481_10726284Not Available540Open in IMG/M
3300019362|Ga0173479_10448618Not Available636Open in IMG/M
3300019458|Ga0187892_10016449All Organisms → cellular organisms → Bacteria7820Open in IMG/M
3300019487|Ga0187893_10000858All Organisms → cellular organisms → Bacteria76448Open in IMG/M
3300019487|Ga0187893_10188770All Organisms → cellular organisms → Bacteria1604Open in IMG/M
3300019877|Ga0193722_1098647Not Available700Open in IMG/M
3300020202|Ga0196964_10256505Not Available821Open in IMG/M
3300020215|Ga0196963_10171371All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium932Open in IMG/M
3300021067|Ga0196978_1034962Not Available932Open in IMG/M
3300021357|Ga0213870_1023225All Organisms → cellular organisms → Bacteria2154Open in IMG/M
3300021432|Ga0210384_10030358All Organisms → cellular organisms → Bacteria5059Open in IMG/M
3300022549|Ga0212091_10118821All Organisms → cellular organisms → Bacteria1028Open in IMG/M
3300024219|Ga0247665_1024594Not Available763Open in IMG/M
3300024288|Ga0179589_10075603Not Available1329Open in IMG/M
3300025324|Ga0209640_10554973Not Available931Open in IMG/M
3300026075|Ga0207708_10423329Not Available1104Open in IMG/M
3300026118|Ga0207675_102093898Not Available582Open in IMG/M
3300027815|Ga0209726_10346327Not Available709Open in IMG/M
3300027857|Ga0209166_10008564All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium6749Open in IMG/M
3300027869|Ga0209579_10000726All Organisms → cellular organisms → Bacteria30195Open in IMG/M
3300027869|Ga0209579_10157080Not Available1213Open in IMG/M
3300027907|Ga0207428_10152814All Organisms → cellular organisms → Bacteria1756Open in IMG/M
3300027909|Ga0209382_11802530Not Available596Open in IMG/M
3300027986|Ga0209168_10415424Not Available654Open in IMG/M
3300031421|Ga0308194_10195777Not Available651Open in IMG/M
3300031576|Ga0247727_10116025All Organisms → cellular organisms → Bacteria2729Open in IMG/M
3300031716|Ga0310813_10680243Not Available917Open in IMG/M
3300031716|Ga0310813_10927406Not Available790Open in IMG/M
3300031718|Ga0307474_10835172Not Available729Open in IMG/M
3300031720|Ga0307469_11031461Not Available770Open in IMG/M
3300031731|Ga0307405_10929458Not Available738Open in IMG/M
3300031754|Ga0307475_10018414All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium4852Open in IMG/M
3300031820|Ga0307473_11023558Not Available604Open in IMG/M
3300031954|Ga0306926_10969661Not Available1014Open in IMG/M
3300032075|Ga0310890_11280644Not Available599Open in IMG/M
3300032174|Ga0307470_10900530Not Available695Open in IMG/M
3300032179|Ga0310889_10589540Not Available572Open in IMG/M
3300032180|Ga0307471_103442630Not Available560Open in IMG/M
3300032205|Ga0307472_101048415Not Available768Open in IMG/M
3300032205|Ga0307472_101350834All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales689Open in IMG/M
3300032261|Ga0306920_101377551Not Available1012Open in IMG/M
3300032421|Ga0310812_10000026All Organisms → cellular organisms → Bacteria76671Open in IMG/M
3300032421|Ga0310812_10138331All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes1027Open in IMG/M
3300032421|Ga0310812_10458188All Organisms → cellular organisms → Bacteria574Open in IMG/M
3300034663|Ga0314784_103907All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → unclassified Planctomycetales → Planctomycetales bacterium 12-60-4595Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil12.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere11.20%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil7.20%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil6.40%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil5.60%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)4.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.20%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil2.40%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil2.40%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil2.40%
SoilEnvironmental → Terrestrial → Soil → Sand → Desert → Soil2.40%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.40%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater1.60%
Hot Spring SedimentEnvironmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment1.60%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil1.60%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.60%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.60%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.60%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.60%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.60%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.60%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.80%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.80%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.80%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.80%
Marine EstuarineEnvironmental → Aquatic → Marine → Intertidal Zone → Sediment → Marine Estuarine0.80%
Hot Spring Fe-Si SedimentEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Neutral → Hot Spring Fe-Si Sediment0.80%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.80%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.80%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.80%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.80%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil0.80%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.80%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze0.80%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.80%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.80%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.80%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.80%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300001372YB-Back-sedEnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005330Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaGEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005538Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1EnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005829Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBCEnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300005952Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 1 DNA2013-045EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006865Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaGEnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009527Groundwater microbial communities from Cold Creek, Nevada to study Microbial Dark Matter (Phase II) - Lower Cold CreekEnvironmentalOpen in IMG/M
3300009678Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100EnvironmentalOpen in IMG/M
3300010039Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300010938Sediment microbial community from Chocolate Pots hot springs, Yellowstone National Park, Wyoming, USA. Combined Assembly of Gp0156111, Gp0156114, Gp0156117EnvironmentalOpen in IMG/M
3300011434Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT814_2EnvironmentalOpen in IMG/M
3300011440Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT840_2EnvironmentalOpen in IMG/M
3300012905Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S013-104B-2EnvironmentalOpen in IMG/M
3300012912Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S163-409C-2EnvironmentalOpen in IMG/M
3300012913Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S043-104R-2EnvironmentalOpen in IMG/M
3300012916Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S213-509R-2EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017965Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 TEnvironmentalOpen in IMG/M
3300019356Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S073-202C-2 (version 2)EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300020202Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10EnvironmentalOpen in IMG/M
3300020215Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_5EnvironmentalOpen in IMG/M
3300021067Soil microbial communities from Anza Borrego desert, Southern California, United States - S3+v_20-13CEnvironmentalOpen in IMG/M
3300021357Freshwater microbial communities from subterranean cave lake in Wind Cave National Park, South Dakota, United States - WICALVC2017EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300022549Cold Creek_combined assemblyEnvironmentalOpen in IMG/M
3300024219Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK06EnvironmentalOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031731Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-1Host-AssociatedOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032179Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300034663Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R1 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10135869423300000364SoilMKLNEITAEDVLAGKVWIIEPGQDGEANPNVKEGVGFTGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKRPEPDRASTHSKIFRDTA
JGI10214J12806_1004723433300000891SoilMKLNEIKAEDVLAGKLWIIIPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPATAKKK*
YBBDRAFT_115604233300001372Marine EstuarineMKLNEITAEDVLAGKLWIIEPGQDGETNPTVKESVGFAGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDK
Ga0062593_10089822223300004114SoilMKLNEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFAAVVKIADGSEHCGLVVKSFPAGGDDIDIYILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKASGHGKTFRETAVRIRQLPGTAKKK*
Ga0063356_10088433113300004463Arabidopsis Thaliana RhizosphereIGMKLSEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFSAVVKIADGSEHVGLVVKSFPAGGDDIDIFILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKASGHGKTFRETAVRIRQLPGAAKK*
Ga0063356_10105066413300004463Arabidopsis Thaliana RhizosphereMKLNEITRADVLEGKCWLLMPGQDGETNPDVRESIGFTSEDTGLFSAVVKIADGSEHVGLVVKSFPAGGDDIDIFLLTNFGWLNIHANGFMRALGKYSHEIFPFDYFLANPWKGGRQPEPDKASSHGKTFRETAIRIRQIPADKKK*
Ga0070690_10043982923300005330Switchgrass RhizosphereMKLNEISAEAVLAGKLWIVLPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0066388_10253491213300005332Tropical Forest SoilMKLNEIKAEDVLGGKLWIIVPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFRDTA
Ga0068868_10140332823300005338Miscanthus RhizosphereMKLNEITAEAVLAGKLWIIPPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFKDTAVRIRQTPAGAKK
Ga0070705_10108305413300005440Corn, Switchgrass And Miscanthus RhizosphereMKLTEITAEDVLAGKLWIIPPGQESEAVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0070694_10078348923300005444Corn, Switchgrass And Miscanthus RhizosphereVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKGSSHPKIFRDTAVRIKGAPSAKKR*
Ga0070698_10084883913300005471Corn, Switchgrass And Miscanthus RhizosphereMKLTEITAEDVLAGKLWIVLPGQDGETNPDVKETIGFTGEDLGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0070699_10063793523300005518Corn, Switchgrass And Miscanthus RhizosphereMKLTEISGEDVLAGKVWIILPGQDGETDPSVKEAIGFTGEDMGLISAVVRIADGSEHLGLVVKPFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPASAKK
Ga0070699_10112566213300005518Corn, Switchgrass And Miscanthus RhizosphereMKLSDLKPEDVLAGKCWLVEAGGDETNPNVKETIGYASDDIGLFSAVVKIADGTEHLALVVKSFPQGGDDIDIFIHTKFGWMNIHAQGFIRAMGKYSHDMFPFDYFLANPWKGGKQPEPDKASPHNKIFRETAVRMKGMPAAKK*
Ga0070735_1011851533300005534Surface SoilMKLNEITAEDVLAGKVWIIEPGQDGEGNPKVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRSSSHPKIFRDTAVRIKQIPGTKKK*
Ga0070735_1070940823300005534Surface SoilRNVGEPMKLSDITAEDVLAGKLWIIDSGQDGNANPTVKESVGFAGDDIGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGRQPEPDRSSSHPKIFRDTAVRIKQVPGGSKKK*
Ga0070697_10145516213300005536Corn, Switchgrass And Miscanthus RhizosphereMKLTEITAEDVLAGKLWIIPPGQESEAVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKASSHPKIFRDTAVRIKGMPSPAKKK*
Ga0070730_10001068243300005537Surface SoilMKLNEITAEDVLAGKVWIIEPGQDGEPNPNVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKSSSHPKIFRDTAVRIKQVPGAKKK*
Ga0070731_1000279473300005538Surface SoilMKLNEISAEDVLAGKLWIIEPGQDGVANPTVKESVGFAGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRNSSHPKIFKDTAIRIKQVPGGVKRK*
Ga0070731_1007787833300005538Surface SoilMKLSDITAEDVLAGKLWIIDSGQDGNANPTVKESVGFAGDDIGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGRQPEPDRSSSHPKIFRDTAVRIKQVPGGSKKK*
Ga0070686_10153148613300005544Switchgrass RhizosphereMKLNEIKADDVLAGKLWIIVPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQL
Ga0070704_10107453823300005549Corn, Switchgrass And Miscanthus RhizosphereGKLWIVLPGQDGETNPDVKETIGFTGEDLGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0070704_10153450713300005549Corn, Switchgrass And Miscanthus RhizosphereVPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0070702_10137767313300005615Corn, Switchgrass And Miscanthus RhizosphereEPMKLNEITAEDVLAGKLWLIEPGQDGEANPNIKESVGFTGEDLGLISAVVRIADGSEHLGLVIKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLCSPWKGGKQPEPDKASSHPKIFRDTAVRIKQVPGTAKRK*
Ga0068859_10066903013300005617Switchgrass RhizosphereLSVGGEDRLNPGNGSMKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0068861_10156260523300005719Switchgrass RhizosphereMKLNEISGEAVLAGKLWIILPGQDGEVNPEVKETIGFTGEDMGLVSAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDRASSHPKIFRDTAVRIRQIPAPAKKK*
Ga0066903_10223399013300005764Tropical Forest SoilMKLNEIGAEAVLAGKVWIILPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFKDTAVRIRQMPASAKKK*
Ga0074479_1014699123300005829Sediment (Intertidal)MKLTEISAEDVLAGKLWIIPPGQENDPIPTVKETIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKASSHPKIFRDTAVRIKQAPTGKKK*
Ga0074479_1034004723300005829Sediment (Intertidal)MKLSDIKREDVLAGKCWLLEPGGEDSNPNVRETVGFAGDDVGLFAAVVKIADGSEHVALVVKSFPQGGDDIDIFIHTKFGWMGIHSPGFMRALGKYSHDMFPFDYFLANPWKGGKQPEPDKASSHNKIFRETAVRIKGMPAGKK*
Ga0074479_1042560123300005829Sediment (Intertidal)MKLTDIKTEDVLAGKCWILEPGGDEANPGVRETIGYASEDIGLFSAVVKLADGSEHTALVVKSFPQGGDDIDIYIHTKFGWMNIHAPGFLRALGKYSHDMFPFDYFLANPWKGGKQPEPDKASPHNKIFRETSVRIKGMPAGNKK*
Ga0074479_1102231043300005829Sediment (Intertidal)MKLSDIKPEEVLAGKCWLMEPGGEETNPNVRETIGFAGEDVGLFAAVVKIADGSEHVALVVKSFPAGGDDIDIFIHTKFGWMGIHSPGFMRALGKYSHDLFPFDYFLANPWKGGRQPEPDQASPHNKIFRETAVRIKGVPAKK*
Ga0074470_10509677113300005836Sediment (Intertidal)MKLNEITAEDVLAGKLWIIEPGQDGETNPTVKESVGFAGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKVSSHPKIFRDTAVRIKQAPGAAKKK*
Ga0080026_1016991213300005952Permafrost SoilMKLNEITAEQVLAGKLWIIDPGQDGESNPTIKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIKQVPGTSKKK*
Ga0066652_10028532623300006046SoilMKLNEITAEAVLAGKLWIIPPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADASEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKRPEPDRASTHSKIFRDTAVRIKQVPGAKKK*
Ga0075428_10237432413300006844Populus RhizosphereKLNEITAEAVLAGKLWIVEPGQDGETNPNVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0075421_10119594813300006845Populus RhizosphereLAEITAESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0075430_10006398233300006846Populus RhizosphereMKLAEITAESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0075433_1068582013300006852Populus RhizosphereMKLNEIKAEDVLAGKLWIVTPGQDGEANPDVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEP
Ga0075425_10050395323300006854Populus RhizosphereMKLNEIKAEDVLAGKLWIVTPGQDGEANPDVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPASAKKK*
Ga0075425_10092451023300006854Populus RhizosphereMKLNEISAEAVLAGKLWIVLPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFKDTAVRIRQMPAGAKKK*
Ga0075425_10146442413300006854Populus RhizosphereMKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0073934_1021394023300006865Hot Spring SedimentMKLSEIKREDVLEGKCWLLLPGQDGVSDPEIRESVGFTSDDIGLFSAVVRIADGSEHVGLVVKSFPAGGDDIDIYLLTNFGWLNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKEPQPDKTSAHGKTFRETAIRIRQSPASSAKKK*
Ga0073934_1077200323300006865Hot Spring SedimentCWLIDPVTVNDPNPKVSDAIGFASTDVGLFSAVVKIADGTEHLGLVVKSFPQGGDDVDIFIYTKFGWMNIHADGFMRAMGKYSHDLFPFDYFLANPWKGGRQPEPDRTSTHAKIFRDTAVRIKQMPPPPAKK*
Ga0079219_1233072023300006954Agricultural SoilMKLTEITAEDVLAGKLWILDPGQEDTANPTVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKI
Ga0079218_1085627123300007004Agricultural SoilMKLNEIKAEDVLAGKLWIIVPGQDGETNPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0066710_10056335223300009012Grasslands SoilLSDIKAEDVLAGKCWLLETGGDDANPNVKDTIGFASEDIGLFSAVVKIADGSEHVALVVKSFPQGGDDIDIFIHTKFGWMNIHAQGFMRAMGKYSHDMFPFDYFLANPWKGGKQPEPDKASPHNKIFRETAVRIKGMPAGKK
Ga0111539_10008367113300009094Populus RhizosphereMKLNEITAEEVLAGKLWIIVPGQDGETNPTVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQVPSGAKKK*
Ga0075418_1037921223300009100Populus RhizosphereMKLAEITAESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0114129_1062266623300009147Populus RhizosphereMKLAEITAESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKLPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0105092_1018432723300009157Freshwater SedimentMKLNEITAEEVLGGKLWIIVPGQDGETNPSVKETIGFTGEDLGLISALVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQLPAGAKKK*
Ga0075423_1133509313300009162Populus RhizosphereMKLNEIKAEDVLAGKLWIVTPGQDGEANPDVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTA
Ga0075423_1315865913300009162Populus RhizosphereMKLNEIKADDVLAGKLWIIVPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAAAKKK*
Ga0114942_107295223300009527GroundwaterMKLSEIKGEDVLQGKSWLLKPGADGEDDPEVSETVGFTQEDVGLFSAAVRIADGSEHLGLVVKSFPAGGDDVDIFIHTQYGWMNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGRMPEPDKASSHGKTFRDTAVRIRQTPPTPKKK*
Ga0105252_1008377213300009678SoilMKLSEIKGEDVLQGKSWLLKPGADGEDDPEVSETVGFTQEDVGLFSAAVRIADGSEHLGLVVKSFPAGGDDVDIFIHTQYGWMNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGRMPEPDKGSS
Ga0105252_1021996723300009678SoilMKLNEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFSAVVKIADGSEHCGLVVKSFPAGGDDIDIYILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKASGHGKTFRETAVRIRQLPGTAKKK*
Ga0126309_1065407823300010039Serpentine SoilMKLNEITAEDVLAGKLWIVEPGQDGETNPTVKETIGFTGEDIGLVSAVVRIADASEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPASAKKK*
Ga0134062_1042349323300010337Grasslands SoilMKLNEITAEAVLAGKLWIVTPGQDGEQNPEVKETIGFTGEDVGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKTPEPDKASSHPKIFKDTAVRIRQTPAGSKKK*
Ga0136847_1191029623300010391Freshwater SedimentMKLTEISAEDVLAGKLWIIPPGQEAESSPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNILANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKASSHPKIFRDTAVRIKQAPGASKKK*
Ga0134124_1002302743300010397Terrestrial SoilMKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0134127_1093822223300010399Terrestrial SoilMDMKLTEITAEDVLAGKLWIVLPGQDGETNPDVKETIGFTGEDLGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFRDTAVRIRQMPASAKKK*
Ga0134127_1265195413300010399Terrestrial SoilMKLSEIKAEDVLAGKLWIIIPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0134122_1003356123300010400Terrestrial SoilMKLTEITAEDVLAGKLWIIEPGQDGEVNPSVKDTIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKASSHPKIFRDTAVRIKGMPSPAKKK*
Ga0134122_1030086123300010400Terrestrial SoilMKLTEITAEDVLAGKLWIIPPGQESEAVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFTFDYFLSNPWKGGKMPEPDKGSSHPKIFRDTAVRIKGAPSAKKR*
Ga0134121_1168192423300010401Terrestrial SoilGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0134123_1010100623300010403Terrestrial SoilMKLTEITAEDVLAGKLWIIPPGQESEAVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKGSSHPKIFRDTAVRIKGAPSAKKR*
Ga0137716_1009613233300010938Hot Spring Fe-Si SedimentMKLSEITPDDVLAGKCWVIEPGQADESDPRIREVTLFTNEDIGLFSAVVKIADGTEHLALVVKSFPQGGDDIDIYIYTRFGWMNIHAPGFMRALGKYSHELFPFDYYLANPWKGGKRPEPDKTSPHTRIFRDTAVRIRQMPPPPAVRK*
Ga0137464_111547113300011434SoilNPDVKDTIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Ga0137433_111850123300011440SoilMKLAEITAESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLANPWKGGRMPEPDKGSSHGK
Ga0157296_1006885913300012905SoilMKLNEITAEQVLAGKLWIIEPGQDGEANPSIKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKLPEPDKASSHPKIFRDTAVRIKQVPGTAKKK*
Ga0157306_1008018423300012912SoilMKLNEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFAAVVKIADGSEHCGLVVKSFPAGGDDIDIYILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLCSPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0157298_1026894513300012913SoilMKLNEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFAAVVKIADGSEHCGLVVKSFPAGGDDIDIYILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK*
Ga0157310_1021564323300012916SoilMKLNEITAEQVLAGKLWIIEPGQDGEANPSIKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKA
Ga0137410_10000590123300012944Vadose Zone SoilMKLSEISAEDVLAGKLWIIEPGQDGNADPGLKEAVGFAGEDIGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYMKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRASSHSKIFKDTAVRLKSKTGAAKKK*
Ga0164298_1087075223300012955SoilMKLNEITAEAVLAGKLWIVTPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHSKIFKDTAVRIRQMPAG
Ga0126369_1095719523300012971Tropical Forest SoilMKLSDIKAEDVLAGKLWIITPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK*
Ga0134079_1058079713300014166Grasslands SoilMKLNEITAEAVLAGKLWIVTPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKTPEPDKASSHPKIFKDTAVRIRQTPAG
Ga0137409_1074272913300015245Vadose Zone SoilMKLNEITAEDVLAGKVWIIDSGQDGETNPSIKDAVGFTGEDLGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKSSSHPKIFRDTAVRIKQVP
Ga0132255_10409023413300015374Arabidopsis RhizosphereMKLTEITAEAVLAGKLWIIEPGQDGETNPNVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFKDTAVRIRQMPAGAKKK*
Ga0187778_1000159463300017961Tropical PeatlandMKLSEIKSEDVLAGKCWIMEPGGDDANPNVRETIGFASEDLGLFSAVVKIADGSEHVALVVKSFPQGGDDIDIFIHTKFGWMNIHAAGFMRAIGKYSHDLFPFDYFLANPWKGRPPEPDKNSPHNKIFQETAVRIKAMPTGKK
Ga0190266_1104870313300017965SoilMKLNEITAEDVLAGKVWIIDPGQDGETNPNIKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRASTHPKIFRDTAVRIKQVPGAKKK
Ga0173481_1072628413300019356SoilGSARRGSRSAGSVRWLKCVRRQGRLRDIGEDMKLNEITAEAVLAGKLWIIQPGQDGESNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFKDTAVRIRQMPAGAKKK
Ga0173479_1044861813300019362SoilMKLNEITAEEVLAGKLWIIEPGQDGEANPSIKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKLPEPDKASSHPKIFRDTAVRIKQVPGTAKKK
Ga0187892_1001644933300019458Bio-OozeMKLSDITREDVLAGKCWLVEQGSEDGNPTVKDAIGFASEDVGLFSAVVKIADGTEHLGLVVKSFPQGGDDIDIFILTKFGWMNIHSPGFMRALGKYSHDLFPFDYFLANPWKGGRQPEPDKASPHSKIFRDTAVRIRKVPGEPKK
Ga0187893_10000858533300019487Microbial Mat On RocksMKLSEIKAADVLAGKSWILKGGGDVDLNSEVQEAIGFTNEDTGLLAAVVKLADGSEHAGLVVKSFPQGGDDVDIYMFTQFGWLNIHAQGFMRALGKYSHEIFPFDYFLANPWKGGKTPEPDKASSHPKIFRDTAVRVKPPSGVPKKK
Ga0187893_1018877033300019487Microbial Mat On RocksMKLSEIKREDVLAGKCWLLEAGGDEKDPSIRESIGFTGEDIGLFSAVVKIADGTEHLAVVVKSFPAGGDDIDIFMWTKFGWMNIHSDGFMRALGKYSHDLFPFDYFLANPWKGGKQPEPDKASSHSKIFRDTAVRIKQMPGEKSKKS
Ga0193722_109864723300019877SoilMKLTEITSEDVLAGKLWILDAGQEGETNPTVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIFIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Ga0196964_1025650513300020202SoilARGSIMKLSEIQREDVLEGKSWLVLPGQDDVADPELRESVGFTSEDIGLFSAVVRIADGSEHVGLVVKSFPAGGDDVDIYLLTSFGWLNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKSSGHGKTFRETAVRIRQSPFPSAKKK
Ga0196963_1017137113300020215SoilMKLSEIAADDVLAGKCWIIKPGQDDQDNPDVDEAIGFTSEDVGLFAAVVKIADGSEHLGLVVKSFPQGGDDVDIYIRTVHGWMNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGRNPEPDKVSSHSKIFRDTAVRIKSAPAPKKR
Ga0196978_103496223300021067SoilDVLAGKCWIIKPGHDEQDNPEVEEAIGFTSEDVGLFSAAVKIADGSEHLGLVVKSFPQGGDDVDIYIRTTHGWMNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGRSPEPDKVSSHSKIFRDTAVRIKSAPAPRKK
Ga0213870_102322523300021357FreshwaterMNLNDIENNDVLDGKVWVIKTGSEGESNPEIREAISFLNEDIGLFSAVVKIADGSEHLGVVVKSFPQGGDDIDIYIKTNFGWLNIHAAGFMRALGKYSHEIFPFDYFLASPWKGGKTPEPDKASSHPKIFRDTAVRIRKTPSVIKKNQP
Ga0210384_1003035863300021432SoilMMKLNEIKPEDVLAGKCWLMESGGEETNPSVRETIGFASDDVGLFSAVVKIADGSEHVALVVKSFPQGGDDIDIFIHTKFGWMNIHAPGFMRAIGKYSHDMFPFDYFLANPWKGGRQPEPDKASPHNKIFRETAVRIKAMPAGKK
Ga0212091_1011882123300022549GroundwaterMKLSEIKGEDVLQGKSWLLKPGADGEDDPEVSETVGFTQEDVGLFSAAVRIADGSEHLGLVVKSFPAGGDDVDIFIHTQYGWMNIHAPGFMRALGKYSHEIFPFDYFLANPWKGGRMPEPDKASSHGKTFRDTAVRIRQTPPTPKKK
Ga0247665_102459413300024219SoilMKLNEITAEDVLAGKLWIIDADQDGVTNPNVKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFKQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYYLSSPWKGGKQPEPDRASSHPKIFRDTAVRIKQMPGAK
Ga0179589_1007560323300024288Vadose Zone SoilMKLNDITAEDVLAGKVWIIDPGQDGEVNPNVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIKQVPGAKKK
Ga0209640_1055497323300025324SoilMKLSDIKAEDVLAGKCWLLESGGDEANPGVRETIGFASEDIGLFSAVVKIADGSEHVALVVKSFPQGGDDIDIYIHTKFGWMGIHSPGFMRALGKYSHDMFPFDYFLANPWKGGKQPEPDKASSHTKIFRETAVRIKGMPAGKK
Ga0207708_1042332913300026075Corn, Switchgrass And Miscanthus RhizosphereMKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGARKK
Ga0207675_10209389823300026118Switchgrass RhizosphereMKLNEISGEAVLAGKLWIILPGQDGEVNPEVKETIGFTGEDMGLVSAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVR
Ga0209726_1034632713300027815GroundwaterPGGEDSNPNVRETVGFAGDDVGLFAAVVKIADGSEHVALVVKSFPQGGDDIDIFIHTKFGWMGIHSPGFMRALGKYSHDMFPFDYFLANPWKGGKQPEPDKASSHNKIFRETAVRIKGMPAGKK
Ga0209166_1000856453300027857Surface SoilMKLNEITAEDVLAGKVWIIEPGQDGEPNPNVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKSSSHPKIFRDTAVRIKQVPGAKKK
Ga0209579_10000726253300027869Surface SoilMKLNEISAEDVLAGKLWIIEPGQDGVANPTVKESVGFAGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRNSSHPKIFKDTAIRIKQVPGGVKRK
Ga0209579_1015708013300027869Surface SoilMKLSDITAEDVLAGKLWIIDSGQDGNANPTVKESVGFAGDDIGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGRQPEPDRSSSHPKIFRDTAVRIKQVPGGSKKK
Ga0207428_1015281413300027907Populus RhizosphereMKLNEITAEEVLAGKLWIIVPGQDGETNPTVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKLPEPDKASSHPKIFRDTAVRIRQMPAGAKKK
Ga0209382_1180253013300027909Populus RhizosphereESVLAGKLWIIEPGQDGETNPNVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK
Ga0209168_1041542413300027986Surface SoilMKLNEITAEDVLAGKVWIIEPGQDGEGNPKVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDRSSSHPKIFRDTAVRIKQIPGTKKK
Ga0308194_1019577713300031421SoilMKLNEITAEDVLAGKLWIIEPGQDGEANPKVKESVGFAGDDVGLISAVVRIADGSEHLGLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKSSSHPKIFRDTAVRIKQIPGSKKK
Ga0247727_1011602533300031576BiofilmMKLSEIKAADVLAGKSWIVKQGTEVDVNAEVQDAIGFTNEDTGLLAAVVKLADGSEHAGLVVKSFPQGGDDVDIYMLTQFGWLNIHAQGFMRALGKYSHEIFPFDYFLANPWKGGKTPEPDKASSHPKIFKDTAVRVRPPVSAQKKK
Ga0310813_1068024313300031716SoilPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK
Ga0310813_1092740613300031716SoilMKLNEIKADDVLAGKLWIIVPGQDGEANPDVKDTIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRI
Ga0307474_1083517223300031718Hardwood Forest SoilMTLNEITAEDVLAGKVWIIEPGQDGEGNPKVKESVGFAGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEP
Ga0307469_1103146123300031720Hardwood Forest SoilMKLTEITAEDVLAGKLWIIPPGQESEAVPTVKEAIGFTDEDTGLISAMIRIADGSEHLSLVVKSFPAGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDK
Ga0307405_1092945813300031731RhizosphereMKLTEIKREDVLDGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFSAVVKIADGSEHVGLVVKSFPAGGDDIDIFILTNFGWLGIHAAGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKASGHGKTFRETAVRIRQLPGAAKK
Ga0307475_1001841433300031754Hardwood Forest SoilMTLNEITAEDVLAGKVWIIEPGQDGEGNPKVKESVGFAGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKSSSHPKIFRDTAVRIKQIPGSKKK
Ga0307473_1102355813300031820Hardwood Forest SoilMKLNDITGEDVLAGKLWIVEPGAETDKNPTVKESVGFTGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHADGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPASAKKK
Ga0306926_1096966113300031954SoilAGKLWIVDPGQDGVADPNVSESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYYLSSPWKGGRQPEPDRNSSHPKIFRDTAVRIKQTPGSKKR
Ga0310890_1128064413300032075SoilGDAMKLNEITAEEVLAGKLWIIVPGQDGETNPSVKETIGFTGEDLGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQVPSGAKKK
Ga0307470_1090053013300032174Hardwood Forest SoilMKLNDITAEDVLAGKVWIIEPGQDGEANPNVKESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Ga0310889_1058954013300032179SoilLIGDRMKLNEITAESVLGGKLWIIEPGQEGETNPSVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPAGAKKK
Ga0307471_10344263013300032180Hardwood Forest SoilMKLTEITAEDVLAGKLWILVPGQDGETNPEVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Ga0307472_10104841513300032205Hardwood Forest SoilMKLNEITAEDVLAGKLWIIEADQDGVANPNVKESVGFTGEDMGLISAVVRIADGSEHLGLVVKSFKQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKMPEPDKASSHPKIFRDTAVRIKGMPSPAKKK
Ga0307472_10135083413300032205Hardwood Forest SoilMKLTEITAEDVLAGKLWIIPPGQETEAVPTVKEAIGFTGEDTGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIFIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIKQMPAGAKKK
Ga0306920_10137755113300032261SoilMKLSEITAEDVLAGKLWIVDSGQDGVADPNVSESVGFAGEDVGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSSPWKGGKQPEPDR
Ga0310812_10000026673300032421SoilMKLNEISAEAVLAGKLWIVLPGQDGEQNPEVKETIGFTGEDIGLVSAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKTPEPDKASSHPKIFKDTAVRIRQMPAGAKKK
Ga0310812_1013833133300032421SoilMKLNEIKAEDVLAGKLWIITPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEPDKASSHPKIFRDTAVRIRQMPASAKKK
Ga0310812_1045818823300032421SoilMKLNEIKAEDVLAGKLWIIIPGQDGEANPDVKETIGFTGEDMGLISAVVRIADGSEHLGLVVKSFPQGGDDVDIYIKTNFGWLNIHANGFMRALGKYSHEIFPFDYFLSNPWKGGKQPEP
Ga0314784_103907_150_5933300034663SoilMKLNEIKREDVLEGKCWLLLPGQDGESNPDVRESIGFTSEDTGLFAAVVKIADGSEHCGLVVKSFPAGGDDIDIYILTNFGWLGIHAPGFMRALGKYSHEIFPFDYFLANPWKGGKQPEPDKASGHGKTFRETAVRIRQLPGTAKKK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.