NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F038468

Overview

Basic Information
Family ID F038468
Family Type Metagenome / Metatranscriptome
Number of Sequences 166
Average Sequence Length 87 residues
Representative Sequence MPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPN
Number of Associated Samples 111
Number of Associated Scaffolds 166

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 77.71 %
% of genes near scaffold ends (potentially truncated) 31.93 %
% of genes from short scaffolds (< 2000 bps) 76.51 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.65


Most Common Taxonomy
Group Bacteria (66.265 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil (15.060 % of family members)
Environment Ontology (ENVO) Unclassified (25.904 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (36.145 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 56.90%, β-sheet: 0.00%, Coil/Unstructured: 43.10%

Predicted 3D Structure

pTM-score: 0.65

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 166 Family Scaffolds
PF01739 | CheR | 22.29
PF01967 | MoaC | 12.65
PF07687 | M20_dimer | 12.05
PF01546 | Peptidase_M20 | 6.02
PF03454 | MoeA_C | 4.22
PF01050 | MannoseP_isomer | 3.61
PF13620 | CarboxypepD_reg | 3.61
PF14602 | Hexapep_2 | 1.20
PF07494 | Reg_prop | 1.20
PF01791 | DeoC | 0.60
PF00483 | NTP_transferase | 0.60
PF00701 | DHDPS | 0.60
PF00593 | TonB_dep_Rec | 0.60
PF13145 | Rotamase_2 | 0.60
PF08448 | PAS_4 | 0.60
PF07698 | 7TM-7TMR_HD | 0.60
PF02463 | SMC_N | 0.60
PF05401 | NodS | 0.60
PF08379 | Bact_transglu_N | 0.60
PF01738 | DLH | 0.60
PF02618 | YceG | 0.60
PF07495 | Y_Y_Y | 0.60
PF01569 | PAP2 | 0.60
PF00795 | CN_hydrolase | 0.60
PF01555 | N6_N4_Mtase | 0.60
PF14805 | THDPS_N_2 | 0.60
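
The frequencies in the Pfam table above are percentages of the 166 family scaffolds, so they can be converted back into approximate scaffold counts. The following is a minimal sketch, assuming each percentage is simply the scaffold count divided by 166 and rounded to two decimals. The function name and the small dictionary are illustrative and not part of NMPFamsDB.

```python
# Assumption: percent = (scaffold count / 166) * 100, rounded to two decimals.
TOTAL_SCAFFOLDS = 166  # "Number of Associated Scaffolds" for family F038468


def approx_scaffold_count(percent, total=TOTAL_SCAFFOLDS):
    """Return the nearest whole number of scaffolds implied by a percentage."""
    return round(percent / 100 * total)


# A few frequencies copied from the Pfam table (hypothetical selection).
pfam_frequencies = {"CheR": 22.29, "MoaC": 12.65, "M20_dimer": 12.05, "DeoC": 0.60}

for name, percent in pfam_frequencies.items():
    print(name, approx_scaffold_count(percent))
```

Under that assumption, the lowest listed frequency (0.60) corresponds to a single scaffold.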

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 166 Family Scaffolds
COG1352 | Methylase of chemotaxis methyl-accepting proteins | Signal transduction mechanisms [T] | 44.58
COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 22.29
COG0315 | Molybdenum cofactor biosynthesis enzyme MoaC | Coenzyme transport and metabolism [H] | 12.65
COG0303 | Molybdopterin Mo-transferase (molybdopterin biosynthesis) | Coenzyme transport and metabolism [H] | 4.22
COG0329 | 4-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyase | Cell wall/membrane/envelope biogenesis [M] | 1.20
COG3292 | Periplasmic ligand-binding sensor domain | Signal transduction mechanisms [T] | 1.20
COG0642 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.60
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 0.60
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 0.60
COG1305 | Transglutaminase-like enzyme, putative cysteine protease | Posttranslational modification, protein turnover, chaperones [O] | 0.60
COG1480 | Cyclic di-AMP-specific phosphodiesterase PgpH, HD superfamily | Signal transduction mechanisms [T] | 0.60
COG1559 | Endolytic transglycosylase MltG, terminates peptidoglycan polymerization | Cell wall/membrane/envelope biogenesis [M] | 0.60
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 0.60



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 66.27 %
Unclassified | root | N/A | 33.73 %


Associated Scaffolds


Scaffold | Taxonomy | Length
2088090015|GPICI_8750031 | All Organisms → cellular organisms → Bacteria | 2107
3300000559|F14TC_105431693 | All Organisms → cellular organisms → Bacteria | 656
3300001431|F14TB_104005254 | All Organisms → cellular organisms → Bacteria | 589
3300003203|JGI25406J46586_10029211 | All Organisms → cellular organisms → Bacteria | 2090
3300003321|soilH1_10020309 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia | 8500
3300003321|soilH1_10168777 | All Organisms → cellular organisms → Bacteria | 1037
3300004062|Ga0055500_10089900 | Not Available | 670
3300004157|Ga0062590_100972185 | All Organisms → cellular organisms → Bacteria | 804
3300004281|Ga0066397_10023405 | Not Available | 941
3300004463|Ga0063356_101293474 | All Organisms → cellular organisms → Bacteria | 1066
3300004479|Ga0062595_101344965 | All Organisms → cellular organisms → Bacteria | 646
3300004480|Ga0062592_100350118 | All Organisms → cellular organisms → Bacteria | 1148
3300004633|Ga0066395_10351041 | Not Available | 820
3300004797|Ga0007764_10206316 | All Organisms → cellular organisms → Bacteria | 502
3300005294|Ga0065705_10318526 | All Organisms → cellular organisms → Bacteria | 1016
3300005328|Ga0070676_10888801 | All Organisms → cellular organisms → Bacteria | 663
3300005332|Ga0066388_100023995 | All Organisms → cellular organisms → Bacteria | 5476
3300005332|Ga0066388_100082767 | All Organisms → cellular organisms → Bacteria | 3592
3300005332|Ga0066388_103804109 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 770
3300005332|Ga0066388_105154691 | Not Available | 663
3300005332|Ga0066388_106444285 | Not Available | 592
3300005332|Ga0066388_108572250 | Not Available | 509
3300005347|Ga0070668_100694323 | All Organisms → cellular organisms → Bacteria | 897
3300005353|Ga0070669_100007276 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 7942
3300005406|Ga0070703_10395332 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 601
3300005416|Ga0068880_1319313 | All Organisms → cellular organisms → Bacteria | 1604
3300005417|Ga0068884_1485431 | All Organisms → cellular organisms → Bacteria | 665
3300005444|Ga0070694_100021319 | All Organisms → cellular organisms → Bacteria | 4139
3300005444|Ga0070694_100429115 | All Organisms → cellular organisms → Bacteria | 1040
3300005445|Ga0070708_100995871 | All Organisms → cellular organisms → Bacteria | 786
3300005468|Ga0070707_100000142 | All Organisms → cellular organisms → Bacteria | 67685
3300005471|Ga0070698_100505165 | All Organisms → cellular organisms → Bacteria | 1147
3300005518|Ga0070699_100824120 | All Organisms → cellular organisms → Bacteria | 849
3300005529|Ga0070741_10000950 | All Organisms → cellular organisms → Bacteria | 95338
3300005529|Ga0070741_10206935 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1904
3300005559|Ga0066700_10846252 | Not Available | 612
3300005559|Ga0066700_10896298 | All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → Caldithrix → unclassified Caldithrix → Caldithrix sp. | 590
3300005618|Ga0068864_101671296 | All Organisms → cellular organisms → Bacteria | 641
3300005713|Ga0066905_100209645 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 1464
3300005713|Ga0066905_100280936 | Not Available | 1296
3300005713|Ga0066905_101761769 | Not Available | 570
3300005719|Ga0068861_102004629 | Not Available | 577
3300005764|Ga0066903_101825529 | All Organisms → cellular organisms → Bacteria | 1162
3300005764|Ga0066903_109048880 | Not Available | 504
3300005937|Ga0081455_10521332 | All Organisms → cellular organisms → Bacteria | 793
3300005937|Ga0081455_10583945 | Not Available | 732
3300005985|Ga0081539_10370831 | Not Available | 598
3300006844|Ga0075428_101045930 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Chloracidobacterium | 863
3300006844|Ga0075428_101197897 | All Organisms → cellular organisms → Bacteria | 801
3300006844|Ga0075428_102299271 | Not Available | 555
3300006845|Ga0075421_100005340 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 15522
3300006845|Ga0075421_100190076 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2547
3300006845|Ga0075421_100321986 | All Organisms → cellular organisms → Bacteria | 1874
3300006845|Ga0075421_100641516 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1242
3300006845|Ga0075421_100643159 | Not Available | 1240
3300006846|Ga0075430_101671858 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 522
3300006847|Ga0075431_101822382 | Not Available | 564
3300006854|Ga0075425_101201121 | Not Available | 862
3300006871|Ga0075434_101952411 | Not Available | 592
3300006880|Ga0075429_100214964 | All Organisms → cellular organisms → Bacteria | 1684
3300006914|Ga0075436_100763545 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 718
3300006969|Ga0075419_10342546 | All Organisms → cellular organisms → Bacteria | 1015
3300007076|Ga0075435_101615964 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 569
3300007212|Ga0103958_1050873 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 4418
3300007216|Ga0103961_1022412 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 4058
3300009094|Ga0111539_11575331 | Not Available | 762
3300009094|Ga0111539_13262382 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 523
3300009100|Ga0075418_10177806 | All Organisms → cellular organisms → Bacteria | 2266
3300009112|Ga0115923_10846960 | Not Available | 577
3300009112|Ga0115923_10866809 | All Organisms → cellular organisms → Bacteria | 2200
3300009162|Ga0075423_12326548 | Not Available | 583
3300009243|Ga0103860_10036347 | Not Available | 944
3300009354|Ga0115925_10249473 | All Organisms → cellular organisms → Bacteria | 1273
3300009354|Ga0115925_10936087 | All Organisms → cellular organisms → Bacteria | 3568
3300009430|Ga0114938_1199439 | All Organisms → cellular organisms → Bacteria | 721
3300009553|Ga0105249_12143256 | Not Available | 632
3300009792|Ga0126374_10882071 | Not Available | 691
3300009792|Ga0126374_10925779 | Not Available | 678
3300009792|Ga0126374_11353509 | Not Available | 578
3300009870|Ga0131092_11046902 | Not Available | 656
3300009873|Ga0131077_10045455 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → Pyrinomonas → Pyrinomonas methylaliphatogenes | 6306
3300009873|Ga0131077_10135341 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2823
3300009873|Ga0131077_11300207 | Not Available | 604
3300010043|Ga0126380_10148758 | All Organisms → cellular organisms → Bacteria | 1496
3300010043|Ga0126380_10213268 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 1304
3300010043|Ga0126380_10651471 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Chloracidobacterium | 838
3300010043|Ga0126380_11169061 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 661
3300010046|Ga0126384_10173971 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1680
3300010046|Ga0126384_10456347 | All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → Caldithrix → unclassified Caldithrix → Caldithrix sp. | 1092
3300010046|Ga0126384_11964866 | Not Available | 559
3300010047|Ga0126382_10579881 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 919
3300010047|Ga0126382_11480737 | Not Available | 623
3300010358|Ga0126370_11797716 | Not Available | 593
3300010360|Ga0126372_10508180 | Not Available | 1133
3300010360|Ga0126372_11570068 | Not Available | 696
3300010360|Ga0126372_12022496 | All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → Caldithrix → unclassified Caldithrix → Caldithrix sp. | 623
3300010361|Ga0126378_12099888 | Not Available | 644
3300010362|Ga0126377_10025678 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → Pyrinomonas → Pyrinomonas methylaliphatogenes | 4909
3300010362|Ga0126377_10030793 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4533
3300010362|Ga0126377_11182902 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 835
3300010362|Ga0126377_11183231 | Not Available | 835
3300010362|Ga0126377_11783586 | Not Available | 691
3300010362|Ga0126377_13618956 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 500
3300010391|Ga0136847_11544927 | Not Available | 512
3300010403|Ga0134123_10886221 | All Organisms → cellular organisms → Bacteria | 896
3300010863|Ga0124850_1078469 | Not Available | 988
3300012081|Ga0154003_1000463 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 12720
3300012081|Ga0154003_1015648 | Not Available | 1531
3300012208|Ga0137376_11248054 | All Organisms → cellular organisms → Bacteria | 633
3300012355|Ga0137369_10056836 | All Organisms → cellular organisms → Bacteria | 3387
3300012944|Ga0137410_10080024 | All Organisms → cellular organisms → Bacteria | 2387
3300012948|Ga0126375_10595069 | All Organisms → cellular organisms → Bacteria | 844
3300012971|Ga0126369_11094477 | Not Available | 886
3300013297|Ga0157378_11078449 | All Organisms → cellular organisms → Bacteria | 840
3300013306|Ga0163162_11492696 | All Organisms → cellular organisms → Bacteria | 770
3300014326|Ga0157380_10832225 | All Organisms → cellular organisms → Bacteria | 943
3300015371|Ga0132258_10511215 | Not Available | 3006
3300015371|Ga0132258_13146985 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1140
3300015371|Ga0132258_13216347 | Not Available | 1126
3300015372|Ga0132256_102253061 | Not Available | 649
3300015374|Ga0132255_105275393 | Not Available | 547
3300017788|Ga0169931_10018159 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 9190
3300017792|Ga0163161_10456347 | All Organisms → cellular organisms → Bacteria | 1034
3300018469|Ga0190270_12764294 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 553
3300019360|Ga0187894_10007457 | All Organisms → cellular organisms → Bacteria | 10312
3300019360|Ga0187894_10016107 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5648
3300019360|Ga0187894_10020946 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4623
3300019360|Ga0187894_10046687 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2582
3300019487|Ga0187893_10099992 | All Organisms → cellular organisms → Bacteria | 2561
3300019487|Ga0187893_10293059 | All Organisms → cellular organisms → Bacteria | 1164
3300020202|Ga0196964_10400366 | Not Available | 662
3300021362|Ga0213882_10256037 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 726
3300023201|Ga0256614_1238112 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 804
3300024298|Ga0255178_1095296 | Not Available | 549
3300024545|Ga0256347_1026580 | All Organisms → cellular organisms → Bacteria | 1386
3300024850|Ga0255282_1010566 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1662
3300024858|Ga0255286_1094169 | All Organisms → cellular organisms → Bacteria | 591
3300025910|Ga0207684_10067724 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3034
3300025922|Ga0207646_10000047 | All Organisms → cellular organisms → Bacteria | 172446
3300025922|Ga0207646_11247546 | Not Available | 650
3300025942|Ga0207689_10103742 | All Organisms → cellular organisms → Bacteria | 2336
3300026562|Ga0255285_1031973 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1063
3300027527|Ga0209684_1036420 | Not Available | 754
3300027793|Ga0209972_10005580 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 9222
3300027793|Ga0209972_10417146 | All Organisms → cellular organisms → Bacteria | 567
(restricted) 3300027799|Ga0233416_10029884 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1817
3300027870|Ga0209023_10122765 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 1808
3300027870|Ga0209023_10314990 | All Organisms → cellular organisms → Bacteria | 989
3300027880|Ga0209481_10484823 | Not Available | 638
3300027909|Ga0209382_10071664 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 4094
3300027909|Ga0209382_10444415 | All Organisms → cellular organisms → Bacteria | 1437
3300027909|Ga0209382_11672066 | Not Available | 626
3300028267|Ga0256358_1121307 | Not Available | 516
3300028647|Ga0272412_1233905 | All Organisms → cellular organisms → Bacteria | 752
3300028647|Ga0272412_1430381 | Not Available | 519
3300028648|Ga0268299_1000212 | All Organisms → cellular organisms → Bacteria | 224528
3300031576|Ga0247727_10014433 | All Organisms → cellular organisms → Bacteria | 13130
3300031576|Ga0247727_10200201 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1830
3300031576|Ga0247727_10292825 | All Organisms → cellular organisms → Bacteria | 1389
3300031576|Ga0247727_10726168 | All Organisms → cellular organisms → Bacteria | 722
3300031854|Ga0310904_11280096 | Not Available | 531
3300031943|Ga0310885_10726219 | Not Available | 560
3300032013|Ga0310906_10025071 | All Organisms → cellular organisms → Bacteria | 2625
3300032157|Ga0315912_11526550 | Not Available | 526
3300034355|Ga0335039_0013927 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5044
3300034670|Ga0314795_137290 | Not Available | 520

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 15.06%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 14.46%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 9.04%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.02%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 3.61%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 3.61%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 3.01%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 3.01%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.41%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 2.41%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.41%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 2.41%
Swimming Pool Sandfilter Backwash | Engineered → Built Environment → Unclassified → Unclassified → Unclassified → Swimming Pool Sandfilter Backwash | 2.41%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 1.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.81%
Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 1.81%
Freshwater And Sediment | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater And Sediment | 1.20%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.20%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 1.20%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.20%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.20%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.20%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.20%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.20%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.20%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.20%
Attine Ant Fungus Gardens | Host-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens | 1.20%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.60%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.60%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 0.60%
River Water | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water | 0.60%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.60%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.60%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.60%
Soil | Environmental → Terrestrial → Soil → Sand → Desert → Soil | 0.60%
Exposed Rock | Environmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock | 0.60%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.60%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.60%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.60%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.60%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.60%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.60%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.60%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2088090015 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental
3300001431 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly | Environmental
3300003203 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300004062 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300004281 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300004797 | Metatranscriptome of freshwater lake microbial communities from Lake Michigan, USA - Fa13.BD.MLB.DN (Metagenome Metatranscriptome) | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005328 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated
3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005416 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel2S_0400h metaT (Metagenome Metatranscriptome) | Environmental
3300005417 | Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel6S_1000h metaT (Metagenome Metatranscriptome) | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300005985 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S4T2R2 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006846 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007212 | Combined Assembly of cyanobacterial bloom in Punggol water reservoir, Singapore (Diel cycle-Bottom layer) 7 sequencing projects | Environmental
3300007216 | Combined Assembly of cyanobacterial bloom in Punggol water reservoir, Singapore (Diel cycle-Surface and Bottom layer) 16 sequencing projects | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009112 | Microbial communities from sand-filter backwash in Singapore swimming pools - KB-2 | Engineered
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009243 | Microbial communities of water from Amazon river, Brazil - RCM13 | Environmental
3300009354 | Microbial communities from sand-filter backwash in Singapore swimming pools - PR-2 | Engineered
3300009430 | Groundwater microbial communities from Big Spring, Nevada to study Microbial Dark Matter (Phase II) - Ash Meadows Big Spring | Environmental
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300009870Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Linkou plantEngineeredOpen in IMG/M
3300009873Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Wenshan plantEngineeredOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300010863Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (PacBio error correction)EnvironmentalOpen in IMG/M
3300012081Attine ant fungus gardens microbial communities from Florida, USA - TSFL087 MetaGHost-AssociatedOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017788Freshwater microbial communities from Lake Kivu, Western Province, Rwanda to study Microbial Dark Matter (Phase II) - Kivu_15m_20LEnvironmentalOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300020202Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10EnvironmentalOpen in IMG/M
3300021362Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R09EnvironmentalOpen in IMG/M
3300023201Activated sludge enriched bacterial communities from WWTP in Fort Collins, Colorado, USA - PNEngineeredOpen in IMG/M
3300024298Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Atlam_RepC_8dEnvironmentalOpen in IMG/M
3300024545Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Colum_RepB_8d (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024850Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Atl_RepB_8d (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024858Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Colum_RepA_8d (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026562Metatranscriptome of freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Atlam_RepC_8d (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300027527Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027793Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel1S_2200h metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027870Freshwater and sediment microbial communities from Lake Erie, Canada (SPAdes)EnvironmentalOpen in IMG/M
3300027880Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028267Metatranscriptome of freshwater microbial communities from Mississippi River, Louisiana, United States - Miss_Colum_RepC_8h (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300028647Metatranscriptome of activated sludge microbial communities from WWTP in Nijmegen, Gelderland, Netherland - WWTP Weurt (Metagenome Metatranscriptome)EngineeredOpen in IMG/M
3300028648Activated sludge microbial communities from bioreactor in Nijmegen, Gelderland, Netherland - NOB reactorEngineeredOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032013Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3EnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300034355Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME18Oct2015-rr0135EnvironmentalOpen in IMG/M
3300034670Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R4 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the features below with restricted sequences requires a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
GPICI_005006502088090015SoilMPTAKFSSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV
F14TC_10543169313300000559SoilMPTAKFSAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDIAARSLEIVFQTRK*
F14TB_10400525413300001431SoilMPTAKFTAEPDSPEYVASDWLITIHNAFRAVDYAIDRSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDVASRAL
JGI25406J46586_1002921123300003203Tabebuia Heterophylla RhizosphereMATAKFTSEPGTPEYQASEWLITIHNAYREIDYAIAKSPNPEEFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVLETLILPNR*
soilH1_10020309113300003321Sugarcane Root And Bulk SoilMPTAKFTHEPGTPEYHASEWLMTIHNAYRELDHAIANSPNPADFIERLKERVRRDYEQNREQMGWTERTRDIADRSLNVVLETLKLPNL*
soilH1_1016877723300003321Sugarcane Root And Bulk SoilMPTAKFTHEPGTPEYHASEWLMTIHNAYRELDYAIANSPNPAEFIERLKERVRRDYEQNREQMGWTERTRDIADRSLNVVLETLKLPNL*
Ga0055500_1008990023300004062Natural And Restored WetlandsMPTAKFTHEPGTPEYQASEWLMTIHNAYREIDHAIEKAPDSDEFVQRLKERVRRDYELNREQMGWTERTRDIASKALEQVLKTRE*
Ga0062590_10097218523300004157SoilMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAV
Ga0066397_1002340513300004281Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0063356_10129347423300004463Arabidopsis Thaliana RhizosphereMPTAKFSAEPDTPEYLASDWLITIHNAYRAIDFAIEKSPDPAGFIERLKERVRRDYEQNREQMGWSERTRDIAGRSLDIVLQTRS*
Ga0062595_10134496513300004479SoilMPTAKFTAEPDSPEYVASDWLITIHNAFRAVDYAIDRSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDVASRALDIVLQTRKQ*
Ga0062592_10035011823300004480SoilMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPEAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAVEIVLQTRT*
Ga0066395_1035104123300004633Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0007764_1020631623300004797Freshwater LakeMSTGKFTSEPGSPEYHASDWLITIHNAYRELDHAIENSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLA
Ga0065705_1031852623300005294Switchgrass RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAVEIVLQTRVP*
Ga0070676_1088880113300005328Miscanthus RhizosphereMPTAKFNAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDITARSLDIVLQTRK*
Ga0066388_10002399543300005332Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0066388_10008276723300005332Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNL*
Ga0066388_10380410923300005332Tropical Forest SoilMATAKFSAEPGTPEYHASDWLITIHNAYREIDHAIEKSPNPEDFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLDLPNR*
Ga0066388_10515469113300005332Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLN
Ga0066388_10644428523300005332Tropical Forest SoilMPTGKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0066388_10857225023300005332Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0070668_10069432323300005347Switchgrass RhizosphereMPTAKFSAEPDSPEYAASNWLMTLHNLYREIDHAIEKSPDQQAFVERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVLQTRK*
Ga0070669_100007276103300005353Switchgrass RhizosphereMPTAKFNAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDIT
Ga0070703_1039533223300005406Corn, Switchgrass And Miscanthus RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRSLEIVLQTRK*
Ga0068880_131931323300005416Freshwater LakeMSTGKFTSEPGSPEYHASDWLITIHNAYRELDHAIENSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR*
Ga0068884_148543123300005417Freshwater LakeMPTAKFTSEPGSPEYHASDWLITLHNAYRELDFAIEKSPNPEEFIQRLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNR*
Ga0070694_10002131933300005444Corn, Switchgrass And Miscanthus RhizosphereMPTAKFTAEPDTPEYVASDWLMTIYNAYREVDYAIEKSPDPQAFIERLKERVRRDYEQNREQMGWTERTRDIAARSLDIVLETRK*
Ga0070694_10042911523300005444Corn, Switchgrass And Miscanthus RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAVEIVLQTRT*
Ga0070708_10099587123300005445Corn, Switchgrass And Miscanthus RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPPAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAVELVLQTRT*
Ga0070707_100000142453300005468Corn, Switchgrass And Miscanthus RhizosphereMPTAKFTEEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRALEIVLQTRK*
Ga0070698_10050516523300005471Corn, Switchgrass And Miscanthus RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWTERTRDIAGRALEIVLQTRT*
Ga0070699_10082412023300005518Corn, Switchgrass And Miscanthus RhizosphereDTELNMPTAKFTHDAGTPEYQASEWLMTIHNAYRELDFAIANSPNPPEYVERLKERVRRDYEQNREQMGWSERTRDIADRSLNVVLETLKLPNL*
Ga0070741_10000950113300005529Surface SoilMPTAKFAAEPDTPEYVASDWLITIHNAYRAIDFAIAKSPDPEAFIERLRERVRRDYEQNREEMGWTERTRDIASRALDIVLETRR*
Ga0070741_1020693523300005529Surface SoilMPTAKFTHEPGTPEYHASEWLITIHNAYRELDHAIESAPDANEFIERLRERVRRDYEQNREQMGWTERTRDIAQNALDYVLKTRS*
Ga0066700_1084625223300005559SoilMPTAKFTAEPDTPEYVASDWLMTIYNAYREVDHAIEKSPDPQAFIERLKERVRRDYEQNREQMGWTERTRDIAARALDIVLETRK*
Ga0066700_1089629823300005559SoilMPTAKFTEEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDVAGRSLEIVLQTRK*
Ga0068864_10167129623300005618Switchgrass RhizosphereMPTAKFTHDPGTPEYQASEWLMTIHNAYRELDFAIASSPNPADYVERLKERVRRDYELNREQMGWSERTRDIADRSLNVVLETLKLPNL*
Ga0066905_10020964533300005713Tropical Forest SoilMPTAKLTSEPGTPEYQASEWLITIHNAYRELDYAIAQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNRL*
Ga0066905_10028093623300005713Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERS
Ga0066905_10176176923300005713Tropical Forest SoilPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0068861_10200462913300005719Switchgrass RhizosphereMPTAKFTHEPGTPEYQASEWLMTIHNAYREIDHAIESAPDSDEFMQRLKERVRRDYELNREQMGWTERTRDIASKALEQVLKTRE*
Ga0066903_10182552923300005764Tropical Forest SoilMPTAKFTAEPDSPEYVASDWLITIHNAFRSVDYAIDRSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDVAARALDIVLQTRKQ*
Ga0066903_10904888023300005764Tropical Forest SoilMPTAKFTAEPDSPEYVASDWLITIHNAYRAVDYAIDRSPDPEAFVERLRERVRRDYEQNREQMGWSERTRDVAARSLEIVLQTRKQ*
Ga0081455_1052133223300005937Tabebuia Heterophylla RhizosphereMPTAKLTSEPGTPEYQASEWLITIHNAYRELDYAIAQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLELVIETLKLPNR
Ga0081455_1058394523300005937Tabebuia Heterophylla RhizosphereMATAKFSSEPGTPEYHASDWLITIHNAYREIDHAIERSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVIETLALPNR*
Ga0081539_1037083123300005985Tabebuia Heterophylla RhizosphereMPTAKFTHEPGTPEYQASEWLMTMHNAYRELDQAIEKAPDANEFIERLRERVRRDYEQNREQMGWTERTRDIAMNALEYVLKTRS*
Ga0075428_10104593013300006844Populus RhizosphereSEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0075428_10119789723300006844Populus RhizosphereMPTAKFSAEPDTPEYLASDWLITIHNAYRAIDFAIEKSPDPAGFIERLKERVRRDYEQNREQMGWSERTRDIA
Ga0075428_10229927123300006844Populus RhizosphereMPTAKFTAEPDTPEYMASDWLMTIHNAYRAIDHAIEKSPDPEAFIERLKERVRRDYEQNREQMGWSERTRDVAARSLEIV
Ga0075421_10000534043300006845Populus RhizosphereMATAKFTAEPDSPEYVASDWLITLHNAYRAIDYAIEKSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLQTRR*
Ga0075421_10019007633300006845Populus RhizosphereMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSL
Ga0075421_10032198613300006845Populus RhizosphereELGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0075421_10064151623300006845Populus RhizosphereMPTAKFTAEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0075421_10064315913300006845Populus RhizosphereELGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0075430_10167185813300006846Populus RhizosphereMPTAKFSSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0075431_10182238223300006847Populus RhizospherePGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0075425_10120112123300006854Populus RhizosphereMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0075434_10195241113300006871Populus RhizosphereASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0075429_10021496433300006880Populus RhizosphereMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNL*
Ga0075436_10076354523300006914Populus RhizosphereMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0075419_1034254613300006969Populus RhizosphereMPTAKFSSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNL*
Ga0075435_10161596413300007076Populus RhizosphereMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVV
Ga0103958_105087343300007212Freshwater LakeMPTAKFTSEPGSPEYHASDWLITLHNAYRELDFAIEKSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNR*
Ga0103961_102241243300007216Freshwater LakeMPTAKFTSEPGSPEYHASDWLITLHNAYRELDFAIEKSPNPEEFIDRLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNR*
Ga0111539_1157533123300009094Populus RhizosphereMPTAIFTFEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA*
Ga0111539_1326238213300009094Populus RhizosphereMPTGKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIA
Ga0075418_1017780613300009100Populus RhizospherePEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0115923_1084696013300009112Swimming Pool Sandfilter BackwashKYENMPTAKFTHEPWSPEYQASEWLMTIHNAYRELDFAIAKSPNPAEFVEKLKERVRRDYEQNREQMGWSERTRDVAERSLNVVLETLKLPNL*
Ga0115923_1086680923300009112Swimming Pool Sandfilter BackwashMPTAKFTHEPGTPEYQASEWLMTLHNAYREIDHAIEKAPDSAEFIERLKERVRRDYEQNREQMGWTERSRDIAMHALEYALKTRS*
Ga0075423_1232654813300009162Populus RhizosphereKFTHEPGTPEYQASEWLMTIHNAYREIDHAIEKAPDSDEFIQRLKERVRRDYELNREQMGWTERTRDIASKALEQVLKTRE*
Ga0103860_1003634713300009243River WaterGTPEYHASDWLITIHNAYREIDHAIESSPNPAEFIDRLRERVRRDYEQNREQMGWSERSRDIAARSLDIVIETLKTPKK*
Ga0115925_1024947323300009354Swimming Pool Sandfilter BackwashMATAKMTHEPDTPEYQAAEWLMTIHNAYREIDFAIEKAPNSEEFIERLKERVRREYEQNREKMGWTERSRDIAMNALEYVLKTRK*
Ga0115925_1093608733300009354Swimming Pool Sandfilter BackwashMPTAKFTHEPGSPEYQASEWLMTIHNAYRELDFAIAKSPNPAEFVEKLKERVRRDYEQNREQMGWSERTRDVAERSLNVVLETLKLPNL*
Ga0114938_119943923300009430GroundwaterMPTGKFTSEPGTPEYHASDWLMTIHTAYREIDHAISQSPDPEQFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVVETLKLPNK*
Ga0105249_1214325623300009553Switchgrass RhizosphereLKPTLLCHHFQTENQTNMPTAKFTHEPGTPEYQASEWLMTIHNAYREIDHAIEKAPDSEEFVQRLKERVRRDYELNREQMGWTERTRDIASKALEQVLKTRE*
Ga0126374_1088207123300009792Tropical Forest SoilKLIDTEPRKGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0126374_1092577913300009792Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLKERVRRDYEQNREQMGWTERTRDIAER
Ga0126374_1135350923300009792Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNA*
Ga0131092_1104690223300009870Activated SludgeMPTAKFTAEPDTPEYVASDWLMTIHNAYRQIDFAIAKSPDPEGFIEKLRERVRRDYEQNREEMGWSERSRDIAARALEIVLETRR*
Ga0131077_1004545513300009873WastewaterMATGKFTSEFGTPEYHASDWLITIHNAFRDIDHAIEKSPNPEEFIDKLRERVRRDYEQNREQMGWSERTRDIAARSLDIV
Ga0131077_1013534123300009873WastewaterMPTAKFTSEYGTPEYHASDWLITIHNAYREIDHAIEKSPNPEEFIDRLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLALPNK*
Ga0131077_1130020713300009873WastewaterMPTAKFTSEPGTPEYHASDWLITIHNAYREIDHAIEKSPHPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNRV*
Ga0126380_1014875813300010043Tropical Forest SoilTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0126380_1021326823300010043Tropical Forest SoilMPTAKFNSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLKERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNA*
Ga0126380_1065147113300010043Tropical Forest SoilQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0126380_1116906123300010043Tropical Forest SoilMPTAKLTSEPGTPEYQASEWLITIHNAYRELDHAIAQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVIETLKLPNRL*
Ga0126384_1017397113300010046Tropical Forest SoilRLVEKFTRSQGKGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0126384_1045634723300010046Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVIETLKLPNA*
Ga0126384_1196486613300010046Tropical Forest SoilKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0126382_1057988113300010047Tropical Forest SoilMPTAKFTSEPGTPEYHASDWLMTIHTAYREIDYAISQSPNPEEFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVVETLKLPNR*
Ga0126382_1148073713300010047Tropical Forest SoilMPTAKLTSEPGTPEYQASEWLITIHNAYRELDYAIAQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVIETLKLPNR*
Ga0126370_1179771623300010358Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLKERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNA*
Ga0126372_1050818013300010360Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPN
Ga0126372_1157006813300010360Tropical Forest SoilMPTGKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0126372_1202249623300010360Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETL
Ga0126378_1209988823300010361Tropical Forest SoilMPTGKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0126377_1002567843300010362Tropical Forest SoilMPTAKLTSEPGTPEYQASEWLITIHNAYRELDHAIAQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVIETLKLPNR*
Ga0126377_1003079323300010362Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDHAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0126377_1118290213300010362Tropical Forest SoilASEWLMTIHNAYREIDQAISQSPDPSAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0126377_1118323123300010362Tropical Forest SoilMATAKFSAEPGTPEYHASDWLITIHNAYRELDHAIEKSPNPEDFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLDLPNR*
Ga0126377_1178358613300010362Tropical Forest SoilGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA*
Ga0126377_1361895613300010362Tropical Forest SoilMPTAKFNSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLN
Ga0136847_1154492723300010391Freshwater SedimentMPTGKFTSESGTPEYHASEWLMTIHNAYREIDYAIEKSPDPQAFIERLKERVRRDYEQNREQMGWTERTRDIAARSLDIVLETRK*
Ga0134123_1088622123300010403Terrestrial SoilMPTAKFTHEPGTPEYQASEWLMTIHNAYREIDHAIEKAPNSDEFIQRLKERVRRDYELNREQMGWTERTRDIASKALKQVLKTRE*
Ga0124850_107846923300010863Tropical Forest SoilKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV*
Ga0154003_100046343300012081Attine Ant Fungus GardensMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA*
Ga0154003_101564823300012081Attine Ant Fungus GardensMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSINVVLETLKLPNA*
Ga0137376_1124805413300012208Vadose Zone SoilMPTAKFTAEPDTPEYVASDWLMTIYNAYREIDHAIEKSPDPQAFIERLKERVRRDYEQNREQMGWTERTRDIAARSLDIVLETRK*
Ga0137369_1005683663300012355Vadose Zone SoilIDTELGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0137410_1008002423300012944Vadose Zone SoilMPTAKFTSEPSTPEYQASEWLMTIHNAYREIDQAILKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA*
Ga0126375_1059506923300012948Tropical Forest SoilMPTAKLTSEPGTPEYQASEWLITIHNAYRELDYAISQSPNPNEFIERLRERVRRDYEQNREQMGWSERTRDIAARSL
Ga0126369_1109447723300012971Tropical Forest SoilMEYSFADLDKSIDTELGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV*
Ga0157378_1107844913300013297Miscanthus RhizosphereMPTAKFTAEPDTPEYVASDWLMTIYNAYREIDQSIEKSPDPQAFIERLKERVRRDYEQNREQMGWTERTRDIAARSLDIVLETRK*
Ga0163162_1149269623300013306Switchgrass RhizosphereMPTAKFSAEPDSPEYVASDWLITIHNAFRAVDYAIDRSPDPQAFVERLRERLRRDYEQNREQMGWSERTRDIASRALDIVLQTRKQ*
Ga0157380_1083222523300014326Switchgrass RhizosphereMPTAKFSAEPDTPEYMASDWLITIHNAYRAIDYAIEKSPDPEAFIERLKERVRRDYEQNREQMGWSERTRDVAARSLDIVLQTRK*
Ga0132258_1051121523300015371Arabidopsis RhizosphereMPTAKFTAEPDSPEYVASDWLITIHNAFRAVDYAIDRSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDIASRSLDIVLQTRKQ*
Ga0132258_1314698513300015371Arabidopsis RhizosphereMPTAKFTAEPDSPEYVASDWLITIHNAFRGVDYAIDRSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDVASRALDIVLQTRKQ*
Ga0132258_1321634723300015371Arabidopsis RhizosphereMPTAKFSAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDITARSLDI
Ga0132256_10225306123300015372Arabidopsis RhizosphereMPTAKFSAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDITARSLDIVLQTRK*
Ga0132255_10527539323300015374Arabidopsis RhizosphereEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA*
Ga0169931_1001815923300017788FreshwaterMPTAKFTSEPGSPEYHASDWLITLHNAYRELDFAIEKSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNR
Ga0163161_1045634723300017792Switchgrass RhizosphereMPTAKFNAEPDTPEYVASDWLITIHNAYRAIDFAIEKSPNPAEFIDRLRERVRRDYEQNREQMGWSERTRDITARSLDIVLQTRK
Ga0190270_1276429413300018469SoilMPTAKFTHEPGTPEYQASEWLMTIHNAFRELDFAIANSPNPGEYVERLKERVRRDYEQNREQMGWSERSRDIAERSLNVVLQTLKLPNL
Ga0187894_1000745773300019360Microbial Mat On RocksMPTAKFTSEPGTPEYQASEWLMTIHNASREIDFAISQSPNPAEFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLVLPNA
Ga0187894_1001610753300019360Microbial Mat On RocksMPTGKFTSEPGTPEYQASEWLMTIHNAYREMDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV
Ga0187894_1002094643300019360Microbial Mat On RocksMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV
Ga0187894_1004668723300019360Microbial Mat On RocksMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAYIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA
Ga0187893_1009999233300019487Microbial Mat On RocksMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAYIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV
Ga0187893_1029305913300019487Microbial Mat On RocksMPTAKFTSEPGTPEYQASEWLMTIHNASREIDFAISQSPNPAEFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV
Ga0196964_1040036613300020202SoilMPTAKFTHEPGTPEYQASEWLMTIHNAYREIDHAIEKAPDANEFIERLRERVRRDYEQNREQMGWTERTRDIAMNALEYVLKTRS
Ga0213882_1025603713300021362Exposed RockMPTAKFTHEPGTPEYEASDWLITIHNAYRQLDHAIEKSPDPEGFIERLRERVRRDYEQNREEMGWSTRTRDIAARALEIVLQTRQ
Ga0256614_123811223300023201Activated SludgeMPTAKFTHEPGTPEYQASEWLMTIHNASRELDFAISNSPNPAEYVDKLKERVRRDYEQNREQMGWSERTRDIADRSLNVLLETLKLPNM
Ga0255178_109529613300024298FreshwaterKFTAEPGSPEYHASDWLITIHNAYRELDHAIENSPNPDEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0256347_102658013300024545FreshwaterMSTGKFTSEPGSPEYHASDWLITIHNAYREIDHAIEDSPNPEEFIEKLRERVRRDYEQNREQMGWSERSRDIAARSLEIVLATLKLPNRPTQ
Ga0255282_101056623300024850FreshwaterMSTGKFTSEPGSPEYHASDWLITIHNAYRELDHAIENSPNPDEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0255286_109416923300024858FreshwaterFTSEPGSPEYHASDWLITIHNAYREIDHAIEQSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0207684_1006772413300025910Corn, Switchgrass And Miscanthus RhizosphereMATAKFSAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAA
Ga0207646_100000471143300025922Corn, Switchgrass And Miscanthus RhizosphereMPTAKFTEEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRALEIVLQTRK
Ga0207646_1124754613300025922Corn, Switchgrass And Miscanthus RhizosphereMPTAKFTAEPDTPEYVASDWLITIHNAYREIDHAIEKSPDPQAFIERLKERVRRDYEQNREQMGWSERTRDIAGRALEIVLQTRK
Ga0207689_1010374233300025942Miscanthus RhizosphereMATAKFTAEPDTPEYVASDWLITIHNAYRAIDHAIEKSPDPQAFIERLRERVRRDYEQNREQMGWSERTRDIAGRAVEIVLQTRVP
Ga0255285_103197323300026562FreshwaterMSTGKFTAEPGSPEYHASDWLITIHNAYRELDHAIENSPNPDEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0209684_103642023300027527Tropical Forest SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAILQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNL
Ga0209972_1000558053300027793Freshwater LakeMSTGKFTSEPGSPEYHASDWLITIHNAYRELDHAIENSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0209972_1041714623300027793Freshwater LakeMPTAKFTSEPGSPEYHASDWLITLHNAYRELDFAIEKSPNPEEFIQRLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLETLKLPNR
(restricted) Ga0233416_1002988433300027799SedimentMATAKFTSEPGTPEYQASEWLITIHNAYREIDHAIAQSPNPEQFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVLETLILPNR
Ga0209023_1012276513300027870Freshwater And SedimentEYQASEWLMTIHNAYRELDFAIANSPNPGDYVERLKERVRRDYEQNREQMGWSERTRDIADRSLNVVLETLKLPNM
Ga0209023_1031499023300027870Freshwater And SedimentMPTAKFTHEPGTPEYQASEWLMTIHNAYRELDHAIAKSSNPPEYVERLKERVRRDYEQNREQMGWSERTRDIADRSLNVVLETLKLPNL
Ga0209481_1048482313300027880Populus RhizosphereTLRNKSIDTELGKMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNL
Ga0209382_1007166443300027909Populus RhizosphereMATAKFTAEPDSPEYVASDWLITLHNAYRAIDYAIEKSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLQTRR
Ga0209382_1044441523300027909Populus RhizosphereMPTAKFTAEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNV
Ga0209382_1167206623300027909Populus RhizosphereMPTAKFSAEPDTPEYLASDWLITIHNAYRAIDFAIEKSPDPAGFIERLKERVRRDYEQNREQMGWSERTRDIAGRSLDIVLQTRS
Ga0256358_112130713300028267FreshwaterGSAAESCGVEKMSTAKFTSEPGSPEYQASDWLITIHNAYRDIDHAIANSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0272412_123390523300028647Activated SludgeMPTAKFTHEPGTPEYQASEWLMTIHNAYRELDFAIANSPNPGEYVERLKERVRRDYEQNREQMGWSERSRDIAERSLNVVLETLKLPNL
Ga0272412_143038113300028647Activated SludgeMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDHAIAKSPNPAEFIERLKERVRRDYEQNREQMGWSERTRDVAERSLNVVLETLKLPNL
Ga0268299_10002122063300028648Activated SludgeMPTAKFTHDPGTPEYQASEWLMTIHNAYRELDFAIANSPNPAEYVDRLKERVRRDYEQNREQMGWSERTRDIAERSLNVALETLKLPNL
Ga0247727_10014433153300031576BiofilmMPTGKFTHEPGTPEYHASEWLMTIHNAYRELDYAISNSPSPAEFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLVLPNA
Ga0247727_1020020123300031576BiofilmMPTGKFASESGTPEYHASDWLMTIHTAYRELDHAISSSPNPEEFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVVETLKLPNK
Ga0247727_1029282523300031576BiofilmEYHASDWLMTIHTAYRELDHAISSSHNPEEFIERLKERVRRDYEQNREQMGWSERTRDIAARSLDIVVETLKLPNK
Ga0247727_1072616823300031576BiofilmTINTELKGKMPTAKFTSEPGTPEYQASEWLMTIHNASREIDFAISQSPNPAEFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLVLPNA
Ga0310904_1128009623300031854SoilAKFSSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV
Ga0310885_1072621913300031943SoilYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNV
Ga0310906_1002507123300032013SoilMPTAKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLRERVRRDYEQNREQMGWTERTRDIAERSLNVVLETLKLPNL
Ga0315912_1152655013300032157SoilMPTGKFTSEPGTPEYQASEWLMTIHNAYREIDQAISQSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVIETLKLPNV
Ga0335039_0013927_3259_35283300034355FreshwaterMSTGKFTSDPGSPEYHASDWLITVHNAYRELDHAIESSPNPEEFIERLRERVRRDYEQNREQMGWSERTRDIAARSLEIVLATLKLPNR
Ga0314795_137290_31_3003300034670SoilMPTAKFSSEPGTPEYQASEWLMTIHNAYREIDQAISKSPDPAAFIERLKERVRRDYEQNREQMGWSERTRDIAERSLNVVLETLKLPNA
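
In the listing above, the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following is a minimal sketch of one way to split a row back into fields. It rests on assumptions that are not stated on this page: that every Sample Taxon ID is exactly 10 digits (as in the rows listed here), that habitat names end in a lowercase letter, and that sequences are uppercase one-letter residues optionally followed by `*`. The names `FamilyRow`, `ROW_PATTERN`, and `parse_family_row` are hypothetical and chosen only for illustration.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str         # residues only, without a trailing '*'
    ends_with_star: bool  # True when the listed sequence ended in '*'
    restricted: bool      # True when the row carried a "(restricted)" prefix


# Assumptions (not stated on the page): taxon IDs are exactly 10 digits,
# habitat names start with an uppercase letter and end with a lowercase one,
# and sequences are uppercase residues optionally followed by '*'.
ROW_PATTERN = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"
    r"(?P<protein_id>\S+?)"
    r"(?P<taxon_id>\d{10})"
    r"(?P<habitat>[A-Z][A-Za-z, ]*?[a-z])"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<star>\*)?$"
)


def parse_family_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into fields; return None if it does not fit."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return FamilyRow(
        protein_id=match.group("protein_id"),
        taxon_id=match.group("taxon_id"),
        habitat=match.group("habitat"),
        sequence=match.group("sequence"),
        ends_with_star=match.group("star") is not None,
        restricted=match.group("restricted") is not None,
    )


if __name__ == "__main__":
    # One row copied from the listing above.
    example = (
        "F14TB_10400525413300001431Soil"
        "MPTAKFTAEPDSPEYVASDWLITIHNAFRAVDYAIDRSPDPQAFVERLRERVRRDYEQNREQMGWSERTRDVASRAL"
    )
    row = parse_family_row(example)
    if row is not None:
        print(row.protein_id, row.taxon_id, row.habitat, len(row.sequence))
```

Because the protein ID is matched lazily and the habitat must begin immediately after the 10 digits, the split falls on the last 10 digits before the habitat name. Rows that do not fit the assumed shape return `None` rather than a guessed split, so they can be checked by hand.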




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice