NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F017915



Overview

Basic Information
Family ID F017915
Family Type Metagenome / Metatranscriptome
Number of Sequences 238
Average Sequence Length 107 residues
Representative Sequence GFIIPAGPPDLSFFLLSRVSAVGNYCCEVPVGQDETVYVYAAKSVKILYDPLRVYRVRGAFEAGSHTDRTYGVSLFRLRNARVEEAVGAKIFKVGETTAPAPR
Number of Associated Samples 175
Number of Associated Scaffolds 238

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 1.26 %
% of genes near scaffold ends (potentially truncated) 97.48 %
% of genes from short scaffolds (< 2000 bps) 94.96 %
Associated GOLD sequencing projects 163
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.35
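
The percentages above can be converted back into approximate gene counts. The sketch below assumes that each percentage is taken over the 238 family sequences listed under Basic Information, which the page does not state explicitly; the function name and dictionary labels are illustrative only.

```python
# Illustrative only: assumes the percentages are relative to the 238 family sequences.
FAMILY_SIZE = 238

def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a percentage of the family into a rounded gene count."""
    return round(total * percent / 100)

# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 1.26,
    "near scaffold end": 97.48,
    "short scaffold (< 2000 bp)": 94.96,
}

counts = {label: percent_to_count(pct) for label, pct in quality.items()}
for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE} genes")
```

Under that assumption, only a handful of genes carry a valid RBS motif, while nearly all sit near scaffold ends, which is consistent with the family being dominated by short, possibly truncated fragments.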

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (99.580 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (18.067 % of family members)
Environment Ontology (ENVO) Unclassified (27.311 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.756 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 21.37%, Coil/Unstructured: 78.63%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.35

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
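
The threshold in the note above can be written as a simple check. This is a minimal sketch: the 0.7 cutoff is taken from the note, while the function name and the returned labels are hypothetical and not part of NMPFamsDB.

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the note above

def model_quality(ptm_score: float, cutoff: float = PTM_CUTOFF) -> str:
    """Label a predicted 3D model by its pTM-score (labels are hypothetical)."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return "low confidence" if ptm_score < cutoff else "higher confidence"

# pTM-score reported for family F017915
print(model_quality(0.35))
```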



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 238 Family Scaffolds
PF00528	BPD_transp_1	3.78
PF04820	Trp_halogenase	2.94
PF01717	Meth_synt_2	2.94
PF00072	Response_reg	2.52
PF13432	TPR_16	2.52
PF07690	MFS_1	1.68
PF01494	FAD_binding_3	1.26
PF00005	ABC_tran	1.26
PF00296	Bac_luciferase	1.26
PF13343	SBP_bac_6	0.84
PF12773	DZR	0.84
PF01174	SNO	0.84
PF01145	Band_7	0.84
PF00378	ECH_1	0.84
PF04392	ABC_sub_bind	0.84
PF03916	NrfD	0.42
PF01011	PQQ	0.42
PF06580	His_kinase	0.42
PF01680	SOR_SNZ	0.42
PF04909	Amidohydro_2	0.42
PF13181	TPR_8	0.42
PF08402	TOBE_2	0.42
PF00166	Cpn10	0.42
PF02738	MoCoBD_1	0.42
PF07993	NAD_binding_4	0.42
PF01266	DAO	0.42
PF12867	DinB_2	0.42
PF05103	DivIVA	0.42
PF05685	Uma2	0.42
PF07969	Amidohydro_3	0.42
PF04280	Tim44	0.42
PF00115	COX1	0.42
PF00211	Guanylate_cyc	0.42
PF03480	DctP	0.42
PF13610	DDE_Tnp_IS240	0.42
PF12695	Abhydrolase_5	0.42
PF02538	Hydantoinase_B	0.42
PF04199	Cyclase	0.42
PF00873	ACR_tran	0.42
PF02627	CMD	0.42
PF13470	PIN_3	0.42
PF03976	PPK2	0.42
PF00929	RNase_T	0.42

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 238 Family Scaffolds
COG0620	Methionine synthase II (cobalamin-independent)	Amino acid transport and metabolism [E]	2.94
COG0654	2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases	Energy production and conversion [C]	2.52
COG2141	Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)	Coenzyme transport and metabolism [H]	1.26
COG0665	Glycine/D-amino acid oxidase (deaminating)	Amino acid transport and metabolism [E]	1.26
COG0578	Glycerol-3-phosphate dehydrogenase	Energy production and conversion [C]	1.26
COG0644	Dehydrogenase (flavoprotein)	Energy production and conversion [C]	1.26
COG0146	N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunit	Amino acid transport and metabolism [E]	0.84
COG2984	ABC-type uncharacterized transport system, periplasmic component	General function prediction only [R]	0.84
COG0311	Pyridoxal 5'-phosphate synthase subunit PdxT (glutamine amidotransferase)	Coenzyme transport and metabolism [H]	0.84
COG0118	Imidazoleglycerol phosphate synthase glutamine amidotransferase subunit HisH	Amino acid transport and metabolism [E]	0.84
COG0599	Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase family	General function prediction only [R]	0.42
COG0234	Co-chaperonin GroES (HSP10)	Posttranslational modification, protein turnover, chaperones [O]	0.42
COG1878	Kynurenine formamidase	Amino acid transport and metabolism [E]	0.42
COG2114	Adenylate cyclase, class 3	Signal transduction mechanisms [T]	0.42
COG2128	Alkylhydroperoxidase family enzyme, contains CxxC motif	Inorganic ion transport and metabolism [P]	0.42
COG0214	Pyridoxal 5'-phosphate synthase subunit PdxS	Coenzyme transport and metabolism [H]	0.42
COG2326	Polyphosphate kinase 2, PPK2 family	Energy production and conversion [C]	0.42
COG2972	Sensor histidine kinase YesM	Signal transduction mechanisms [T]	0.42
COG3275	Sensor histidine kinase, LytS/YehU family	Signal transduction mechanisms [T]	0.42
COG4395	Predicted lipid-binding transport protein, Tim44 family	Lipid transport and metabolism [I]	0.42
COG4636	Endonuclease, Uma2 family (restriction endonuclease fold)	General function prediction only [R]	0.42



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	99.58 %
Unclassified	root	N/A	0.42 %

Associated Scaffolds


Scaffold	Taxonomy	Length
2067725004|GPKC_F5V46DG04IQDDR	All Organisms → cellular organisms → Bacteria	511
2088090013|LWAnNN_GHFF8UE02HOVR1	All Organisms → cellular organisms → Bacteria	519
3300000033|ICChiseqgaiiDRAFT_c2401485	All Organisms → cellular organisms → Bacteria	503
3300000363|ICChiseqgaiiFebDRAFT_14531310	All Organisms → cellular organisms → Bacteria	503
3300000550|F24TB_12543639	All Organisms → cellular organisms → Bacteria	514
3300000858|JGI10213J12805_10081537	All Organisms → cellular organisms → Bacteria	538
3300001431|F14TB_101799878	All Organisms → cellular organisms → Bacteria	560
3300002561|JGI25384J37096_10231283	All Organisms → cellular organisms → Bacteria	540
3300002568|C688J35102_118309515	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Deinococcales → Deinococcaceae → Deinococcus → Deinococcus pimensis	547
3300003319|soilL2_10053040	All Organisms → cellular organisms → Bacteria	1203
3300004268|Ga0066398_10128864	All Organisms → cellular organisms → Bacteria	616
3300004479|Ga0062595_101819480	All Organisms → cellular organisms → Bacteria	579
3300005093|Ga0062594_101631312	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	670
3300005174|Ga0066680_10095489	All Organisms → cellular organisms → Bacteria	1812
3300005174|Ga0066680_10453451	All Organisms → cellular organisms → Bacteria	810
3300005174|Ga0066680_10490266	All Organisms → cellular organisms → Bacteria	773
3300005176|Ga0066679_10234150	All Organisms → cellular organisms → Bacteria	1179
3300005181|Ga0066678_10880426	All Organisms → cellular organisms → Bacteria	587
3300005186|Ga0066676_10925293	All Organisms → cellular organisms → Bacteria	584
3300005332|Ga0066388_104094415	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	743
3300005332|Ga0066388_104740603	All Organisms → cellular organisms → Bacteria	692
3300005332|Ga0066388_107390172	All Organisms → cellular organisms → Bacteria	552
3300005437|Ga0070710_10782452	All Organisms → cellular organisms → Bacteria	680
3300005446|Ga0066686_10024613	All Organisms → cellular organisms → Bacteria	3435
3300005446|Ga0066686_10340943	All Organisms → cellular organisms → Bacteria → Proteobacteria	1022
3300005446|Ga0066686_10404116	All Organisms → cellular organisms → Bacteria	933
3300005468|Ga0070707_101913975	All Organisms → cellular organisms → Bacteria → Synergistetes → Synergistia → Synergistales → Synergistaceae → Jonquetella → Jonquetella anthropi	560
3300005518|Ga0070699_100759034	All Organisms → cellular organisms → Bacteria	887
3300005536|Ga0070697_101590227	All Organisms → cellular organisms → Bacteria	584
3300005546|Ga0070696_100782419	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	784
3300005552|Ga0066701_10368443	All Organisms → cellular organisms → Bacteria	889
3300005552|Ga0066701_10900630	All Organisms → cellular organisms → Bacteria	524
3300005554|Ga0066661_10093212	All Organisms → cellular organisms → Bacteria	1790
3300005555|Ga0066692_10778178	All Organisms → cellular organisms → Bacteria	589
3300005556|Ga0066707_10848705	All Organisms → cellular organisms → Bacteria	562
3300005559|Ga0066700_10900663	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Neomegalonemataceae → Neomegalonema → Neomegalonema perideroedes	588
3300005569|Ga0066705_10575301	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	695
3300005615|Ga0070702_100211301	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	1291
3300005617|Ga0068859_102269093	All Organisms → cellular organisms → Bacteria → Acidobacteria	599
3300005713|Ga0066905_101637246	All Organisms → cellular organisms → Bacteria	590
3300005764|Ga0066903_100726409	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales	1758
3300005764|Ga0066903_100865194	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1629
3300005764|Ga0066903_105160340	All Organisms → cellular organisms → Bacteria	691
3300005764|Ga0066903_105419315	All Organisms → cellular organisms → Bacteria	673
3300005764|Ga0066903_109086509	All Organisms → cellular organisms → Bacteria	503
3300005843|Ga0068860_101177207	All Organisms → cellular organisms → Bacteria	787
3300006028|Ga0070717_11582258	All Organisms → cellular organisms → Bacteria	594
3300006049|Ga0075417_10047957	All Organisms → cellular organisms → Bacteria	1835
3300006049|Ga0075417_10349853	All Organisms → cellular organisms → Bacteria	724
3300006102|Ga0075015_101001285	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium → unclassified Rhizobium → Rhizobium sp. NFR12	511
3300006163|Ga0070715_10794517	All Organisms → cellular organisms → Bacteria	574
3300006196|Ga0075422_10414480	All Organisms → cellular organisms → Bacteria	598
3300006755|Ga0079222_11182213	All Organisms → cellular organisms → Bacteria	682
3300006755|Ga0079222_11880192	All Organisms → cellular organisms → Bacteria	583
3300006796|Ga0066665_10820953	All Organisms → cellular organisms → Bacteria	731
3300006796|Ga0066665_10976226	All Organisms → cellular organisms → Bacteria	652
3300006844|Ga0075428_101162401	All Organisms → cellular organisms → Bacteria	814
3300006844|Ga0075428_101565500	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium → unclassified Rhizobium → Rhizobium sp. NFR12	690
3300006845|Ga0075421_101096228	All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae	895
3300006845|Ga0075421_101504701	All Organisms → cellular organisms → Bacteria	736
3300006846|Ga0075430_100861281	All Organisms → cellular organisms → Bacteria	747
3300006847|Ga0075431_100307633	All Organisms → cellular organisms → Bacteria	1600
3300006847|Ga0075431_100806798	All Organisms → cellular organisms → Bacteria	911
3300006847|Ga0075431_101116304	All Organisms → cellular organisms → Bacteria	753
3300006847|Ga0075431_101240019	All Organisms → cellular organisms → Bacteria	708
3300006853|Ga0075420_100128701	All Organisms → cellular organisms → Bacteria	2228
3300006853|Ga0075420_101418668	All Organisms → cellular organisms → Bacteria	596
3300006853|Ga0075420_101697994	All Organisms → cellular organisms → Bacteria	541
3300006903|Ga0075426_10406693	All Organisms → cellular organisms → Bacteria	1005
3300006904|Ga0075424_102054898	All Organisms → cellular organisms → Bacteria	602
3300006914|Ga0075436_100056955	All Organisms → cellular organisms → Bacteria → Proteobacteria	2699
3300006914|Ga0075436_101371587	All Organisms → cellular organisms → Bacteria	536
3300007004|Ga0079218_13631851	All Organisms → cellular organisms → Bacteria	524
3300007265|Ga0099794_10538028	All Organisms → cellular organisms → Bacteria	616
3300009012|Ga0066710_102555730	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	736
3300009012|Ga0066710_102567016	All Organisms → cellular organisms → Bacteria	734
3300009038|Ga0099829_11112420	All Organisms → cellular organisms → Bacteria	655
3300009090|Ga0099827_10233356	All Organisms → cellular organisms → Bacteria	1541
3300009090|Ga0099827_11624056	All Organisms → cellular organisms → Bacteria	563
3300009090|Ga0099827_11872348	All Organisms → cellular organisms → Bacteria	523
3300009100|Ga0075418_10651572	All Organisms → cellular organisms → Bacteria	1134
3300009100|Ga0075418_11964603	All Organisms → cellular organisms → Bacteria	637
3300009137|Ga0066709_104166159	All Organisms → cellular organisms → Bacteria	526
3300009147|Ga0114129_10533495	All Organisms → cellular organisms → Bacteria	1529
3300009147|Ga0114129_12093408	All Organisms → cellular organisms → Bacteria	683
3300009147|Ga0114129_13433385	All Organisms → cellular organisms → Bacteria	508
3300009156|Ga0111538_10058132	All Organisms → cellular organisms → Bacteria	4937
3300009156|Ga0111538_11300424	All Organisms → cellular organisms → Bacteria	918
3300009156|Ga0111538_13959110	All Organisms → cellular organisms → Bacteria	512
3300009176|Ga0105242_13233820	All Organisms → cellular organisms → Bacteria	506
3300009444|Ga0114945_10320111	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi	916
3300009691|Ga0114944_1033019	All Organisms → cellular organisms → Bacteria	1848
3300009799|Ga0105075_1048644	All Organisms → cellular organisms → Bacteria	542
3300009808|Ga0105071_1071689	All Organisms → cellular organisms → Bacteria	591
3300009814|Ga0105082_1112804	All Organisms → cellular organisms → Bacteria	523
3300009815|Ga0105070_1123443	All Organisms → cellular organisms → Bacteria	522
3300010040|Ga0126308_11121072	All Organisms → cellular organisms → Bacteria	555
3300010042|Ga0126314_10584817	All Organisms → cellular organisms → Bacteria	814
3300010044|Ga0126310_10267745	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Caulobacter → unclassified Caulobacter → Caulobacter sp.	1161
3300010045|Ga0126311_11848822	All Organisms → cellular organisms → Bacteria	512
3300010046|Ga0126384_10599485	All Organisms → cellular organisms → Bacteria	964
3300010046|Ga0126384_11601593	All Organisms → cellular organisms → Bacteria	613
3300010047|Ga0126382_10324224	All Organisms → cellular organisms → Bacteria	1167
3300010047|Ga0126382_10657694	All Organisms → cellular organisms → Bacteria	872
3300010047|Ga0126382_12123334	All Organisms → cellular organisms → Bacteria	538
3300010336|Ga0134071_10178464	All Organisms → cellular organisms → Bacteria	1042
3300010358|Ga0126370_11550316	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	632
3300010359|Ga0126376_12372712	All Organisms → cellular organisms → Bacteria	577
3300010360|Ga0126372_12090853	All Organisms → cellular organisms → Bacteria	614
3300010360|Ga0126372_12994438	All Organisms → cellular organisms → Bacteria	524
3300010360|Ga0126372_13243248	All Organisms → cellular organisms → Bacteria	506
3300010362|Ga0126377_11252328	All Organisms → cellular organisms → Bacteria	813
3300010366|Ga0126379_10968597	All Organisms → cellular organisms → Bacteria	955
3300010366|Ga0126379_11366233	All Organisms → cellular organisms → Bacteria	814
3300010366|Ga0126379_13150977	All Organisms → cellular organisms → Bacteria	552
3300010398|Ga0126383_12660074	All Organisms → cellular organisms → Bacteria	583
3300010398|Ga0126383_12865964	All Organisms → cellular organisms → Bacteria	563
3300011269|Ga0137392_11645985	All Organisms → cellular organisms → Bacteria	500
3300011270|Ga0137391_10352468	All Organisms → cellular organisms → Bacteria	1264
3300011271|Ga0137393_10122998	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2142
3300011271|Ga0137393_11571168	All Organisms → cellular organisms → Bacteria	547
3300011445|Ga0137427_10451702	All Organisms → cellular organisms → Bacteria	530
3300012096|Ga0137389_10548436	All Organisms → cellular organisms → Bacteria	993
3300012096|Ga0137389_11584956	All Organisms → cellular organisms → Bacteria	551
3300012189|Ga0137388_11095785	All Organisms → cellular organisms → Bacteria	733
3300012202|Ga0137363_11795605	All Organisms → cellular organisms → Bacteria	506
3300012203|Ga0137399_10125681	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2021
3300012203|Ga0137399_11764873	All Organisms → cellular organisms → Bacteria	507
3300012205|Ga0137362_10243086	All Organisms → cellular organisms → Bacteria → Proteobacteria	1554
3300012206|Ga0137380_11107736	All Organisms → cellular organisms → Bacteria	674
3300012209|Ga0137379_10736921	All Organisms → cellular organisms → Bacteria	891
3300012210|Ga0137378_11744001	All Organisms → cellular organisms → Bacteria	528
3300012210|Ga0137378_11882243	All Organisms → cellular organisms → Bacteria	500
3300012211|Ga0137377_11397640	All Organisms → cellular organisms → Bacteria	628
3300012356|Ga0137371_10918217	All Organisms → cellular organisms → Bacteria	665
3300012356|Ga0137371_11172692	All Organisms → cellular organisms → Bacteria	574
3300012357|Ga0137384_11085675	All Organisms → cellular organisms → Bacteria	642
3300012359|Ga0137385_10401078	All Organisms → cellular organisms → Bacteria	1169
3300012363|Ga0137390_11440445	All Organisms → cellular organisms → Bacteria	632
3300012685|Ga0137397_10145945	All Organisms → cellular organisms → Bacteria	1752
3300012922|Ga0137394_10417332	All Organisms → cellular organisms → Bacteria	1143
3300012927|Ga0137416_11449791	All Organisms → cellular organisms → Bacteria	623
3300012929|Ga0137404_10701285	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	915
3300012930|Ga0137407_10879595	All Organisms → cellular organisms → Bacteria	847
3300012971|Ga0126369_11482532	All Organisms → cellular organisms → Bacteria	768
3300012976|Ga0134076_10158221	All Organisms → cellular organisms → Bacteria	932
3300014745|Ga0157377_11458489	All Organisms → cellular organisms → Bacteria	542
3300015241|Ga0137418_10066032	All Organisms → cellular organisms → Bacteria	3302
3300015245|Ga0137409_10033180	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhodovibrionaceae → Tistlia → Tistlia consotensis	4990
3300015245|Ga0137409_10130350	All Organisms → cellular organisms → Bacteria	2312
3300015245|Ga0137409_10400395	All Organisms → cellular organisms → Bacteria	1188
3300015245|Ga0137409_11108924	All Organisms → cellular organisms → Bacteria	631
3300015264|Ga0137403_10263351	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium	1630
3300015264|Ga0137403_10519995	All Organisms → cellular organisms → Bacteria	1060
3300015264|Ga0137403_10608735	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	957
3300015264|Ga0137403_10665565	All Organisms → cellular organisms → Bacteria	903
3300015358|Ga0134089_10080929	All Organisms → cellular organisms → Bacteria	1224
3300015371|Ga0132258_10175818	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales	5167
3300015374|Ga0132255_105255157	All Organisms → cellular organisms → Bacteria	548
3300015374|Ga0132255_105760803	All Organisms → cellular organisms → Bacteria	524
3300016341|Ga0182035_11917398	All Organisms → cellular organisms → Bacteria	538
3300016404|Ga0182037_11919148	All Organisms → cellular organisms → Bacteria	530
3300018027|Ga0184605_10157822	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1023
3300018028|Ga0184608_10114006	All Organisms → cellular organisms → Bacteria	1140
3300018061|Ga0184619_10443302	All Organisms → cellular organisms → Bacteria	580
3300018063|Ga0184637_10099872	All Organisms → cellular organisms → Bacteria	1787
3300018063|Ga0184637_10367275	All Organisms → cellular organisms → Bacteria	863
3300018429|Ga0190272_11204485	All Organisms → cellular organisms → Bacteria → Acidobacteria	744
3300018465|Ga0190269_11084339	All Organisms → cellular organisms → Bacteria	616
3300018468|Ga0066662_10215372	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1534
3300018468|Ga0066662_11295458	All Organisms → cellular organisms → Bacteria	746
3300019881|Ga0193707_1139371	Not Available	689
3300020004|Ga0193755_1033862	All Organisms → cellular organisms → Bacteria	1688
3300020004|Ga0193755_1191989	All Organisms → cellular organisms → Bacteria	590
3300021080|Ga0210382_10503649	All Organisms → cellular organisms → Bacteria	536
3300021478|Ga0210402_11211549	All Organisms → cellular organisms → Bacteria	682
3300021478|Ga0210402_11254178	All Organisms → cellular organisms → Bacteria	668
3300022563|Ga0212128_10030101	All Organisms → cellular organisms → Bacteria	3497
3300022563|Ga0212128_10456562	All Organisms → cellular organisms → Bacteria	786
3300025910|Ga0207684_10278180	All Organisms → cellular organisms → Bacteria	1443
3300025911|Ga0207654_10591516	All Organisms → cellular organisms → Bacteria	791
3300026095|Ga0207676_11764952	All Organisms → cellular organisms → Bacteria	617
3300026326|Ga0209801_1320544	All Organisms → cellular organisms → Bacteria	548
3300026328|Ga0209802_1061537	All Organisms → cellular organisms → Bacteria	1808
3300026328|Ga0209802_1233658	All Organisms → cellular organisms → Bacteria	655
3300026328|Ga0209802_1289133	All Organisms → cellular organisms → Bacteria	545
3300026508|Ga0257161_1028683	All Organisms → cellular organisms → Bacteria	1086
3300026515|Ga0257158_1092907	All Organisms → cellular organisms → Bacteria	591
3300026524|Ga0209690_1299069	All Organisms → cellular organisms → Bacteria	510
3300026537|Ga0209157_1027098	All Organisms → cellular organisms → Bacteria	3392
3300026537|Ga0209157_1180277	All Organisms → cellular organisms → Bacteria	917
3300026557|Ga0179587_11014359	All Organisms → cellular organisms → Bacteria	546
3300027384|Ga0209854_1054408	All Organisms → cellular organisms → Bacteria	692
3300027645|Ga0209117_1146349	All Organisms → cellular organisms → Bacteria	619
3300027695|Ga0209966_1112982	All Organisms → cellular organisms → Bacteria	620
3300027717|Ga0209998_10167700	All Organisms → cellular organisms → Bacteria	568
3300027787|Ga0209074_10557273	All Organisms → cellular organisms → Bacteria	505
3300027862|Ga0209701_10593631	All Organisms → cellular organisms → Bacteria	588
3300027873|Ga0209814_10387473	All Organisms → cellular organisms → Bacteria	614
3300027873|Ga0209814_10424828	All Organisms → cellular organisms → Bacteria	585
3300027875|Ga0209283_10958228	All Organisms → cellular organisms → Bacteria	513
3300027907|Ga0207428_10136361	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1876
3300027909|Ga0209382_11444755	All Organisms → cellular organisms → Bacteria	688
3300028381|Ga0268264_12292736	All Organisms → cellular organisms → Bacteria	547
3300028791|Ga0307290_10140907	All Organisms → cellular organisms → Bacteria	883
3300028792|Ga0307504_10215862	All Organisms → cellular organisms → Bacteria	687
3300028799|Ga0307284_10301184	All Organisms → cellular organisms → Bacteria	644
3300028809|Ga0247824_10703368	All Organisms → cellular organisms → Bacteria	616
3300028812|Ga0247825_11314582	All Organisms → cellular organisms → Bacteria	529
3300028878|Ga0307278_10184380	All Organisms → cellular organisms → Bacteria	932
3300030006|Ga0299907_10159881	All Organisms → cellular organisms → Bacteria	1862
3300030620|Ga0302046_11186358	All Organisms → cellular organisms → Bacteria	601
3300031057|Ga0170834_109082764	All Organisms → cellular organisms → Bacteria	725
3300031058|Ga0308189_10398361	All Organisms → cellular organisms → Bacteria	568
3300031092|Ga0308204_10225916	All Organisms → cellular organisms → Bacteria	596
3300031098|Ga0308191_1007265	All Organisms → cellular organisms → Bacteria	941
(restricted) 3300031150|Ga0255311_1154471	All Organisms → cellular organisms → Bacteria	511
(restricted) 3300031197|Ga0255310_10243429	All Organisms → cellular organisms → Bacteria	509
3300031680|Ga0318574_10340682	All Organisms → cellular organisms → Bacteria	874
3300031720|Ga0307469_10276520	All Organisms → cellular organisms → Bacteria → Acidobacteria	1369
3300031720|Ga0307469_10277542	All Organisms → cellular organisms → Bacteria	1366
3300031763|Ga0318537_10368864	All Organisms → cellular organisms → Bacteria	530
3300031858|Ga0310892_10157768	All Organisms → cellular organisms → Bacteria	1323
3300031941|Ga0310912_10307438	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1228
3300031997|Ga0315278_10464811	All Organisms → cellular organisms → Bacteria	1304
3300032002|Ga0307416_101373345	All Organisms → cellular organisms → Bacteria	812
3300032002|Ga0307416_102971350	All Organisms → cellular organisms → Bacteria	567
3300032003|Ga0310897_10171206	All Organisms → cellular organisms → Bacteria	928
3300032005|Ga0307411_10782048	All Organisms → cellular organisms → Bacteria	839
3300032017|Ga0310899_10227866	All Organisms → cellular organisms → Bacteria	837
3300032065|Ga0318513_10338752	All Organisms → cellular organisms → Bacteria	732
3300032089|Ga0318525_10445706	All Organisms → cellular organisms → Bacteria	663
3300032180|Ga0307471_100525572	All Organisms → cellular organisms → Bacteria	1334
3300032180|Ga0307471_102682514	All Organisms → cellular organisms → Bacteria	632
3300032180|Ga0307471_103967161	All Organisms → cellular organisms → Bacteria	523
3300033417|Ga0214471_10954741	All Organisms → cellular organisms → Bacteria	662
3300033551|Ga0247830_10159044	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1657
3300034354|Ga0364943_0234676	All Organisms → cellular organisms → Bacteria	682

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
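
Scaffold identifiers in the table above have the form `<number>|<name>`. The sketch below splits such identifiers and flags short scaffolds, using the 2000 bp cutoff from the Quality Assessment block. Reading the numeric prefix as a dataset identifier is an assumption, the type and function names are hypothetical, and the three sample rows are copied from the table.

```python
from typing import NamedTuple

class Scaffold(NamedTuple):
    dataset_id: str  # numeric prefix; assumed to identify the source dataset
    name: str        # scaffold name within that dataset
    length_bp: int

def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split '<number>|<name>' on the first pipe character."""
    dataset_id, name = identifier.split("|", 1)
    return Scaffold(dataset_id, name, length_bp)

# Sample rows copied from the Associated Scaffolds table.
rows = [
    ("3300005174|Ga0066680_10095489", 1812),
    ("3300005446|Ga0066686_10024613", 3435),
    ("3300011269|Ga0137392_11645985", 500),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
short = [s for s in scaffolds if s.length_bp < 2000]  # "short" per the page's cutoff
print(f"{len(short)} of {len(scaffolds)} sample scaffolds are shorter than 2000 bp")
```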




Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 18.07%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 13.03%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 12.18%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 7.14%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.88%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.20%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.78%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.78%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.10%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.10%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 2.10%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.52%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.52%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 1.68%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 1.68%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.68%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.68%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.26%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.26%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.26%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 1.26%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.84%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.84%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.84%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.84%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.42%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Sediment | 0.42%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.42%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.42%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.42%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.42%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.42%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.42%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.42%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.42%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.42%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.42%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2067725004 | Soil microbial communities from Great Prairies - Kansas Corn soil | Environmental
2088090013 | Freshwater sediment microbial communities from Lake Washington, Seattle, for methane and nitrogen Cycles - SIP 13Cmethane anaerobic no nitrate | Environmental
3300000033 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000858 | Soil microbial communities from Great Prairies - Wisconsin Native Prairie soil | Environmental
3300001431 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300004268 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio | Environmental
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006102 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006196 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 | Host-Associated
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006846 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300009799 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_0_30 | Environmental
3300009808 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 | Environmental
3300009814 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60 | Environmental
3300009815 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 | Environmental
3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental
3300010042 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105B | Environmental
3300010044 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot60 | Environmental
3300010045 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011445 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014745 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaG | Host-Associated
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018465 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 IS | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025911 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes) | Host-Associated
3300026095 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes) | Host-Associated
3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026508 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027384 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_30_40 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027695 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PM (SPAdes) | Host-Associated
3300027717 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes) | Host-Associated
3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028799 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_123 | Environmental
3300028809 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48 | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300030620 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031058 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome) | Environmental
3300031092 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome) | Environmental
3300031098 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_186 (Metagenome Metatranscriptome) | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031680 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031763 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f29 | Environmental
3300031858 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2 | Environmental
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental
3300031997 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0 | Environmental
3300032002 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3 | Host-Associated
3300032003 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D1 | Environmental
3300032005 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1 | Host-Associated
3300032017 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D4 | Environmental
3300032065 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f20 | Environmental
3300032089 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f23 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental
3300033551 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5 | Environmental
3300034354 | Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17 | Environmental





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
GPKC_07340460 | 2067725004 | Soil | DVSQQMKSYAGKVVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFGAKGVKLPFDPLRVYKIRGVFEAGKQVDRTYGLSMFRVRDARVEEAVGAKIFKVGDSPAPAAK
LWAnNN_06949550 | 2088090013 | Freshwater Sediment | QSLSGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEIATGQDEIVYVATARGVTIAYDPLRVYKVRGVFEAGKHVDATYGVSMYRLRNAKVEVAVGAKIFRAGETPPSTKP
ICChiseqgaiiDRAFT_24014851 | 3300000033 | Soil | KSYNGKEVEXQGFIIPAGPPDLSFFLLXRXSAXGNXCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPAXGVXLFRLRQAHVEEA
ICChiseqgaiiFebDRAFT_14531310 | 3300000363 | Soil | KSYNGKEVEXQGFIIPAGPPDLSFFLLXRXSAXGNXCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPAXGVXLFRLRQAHVEEAXGAKIMKIEDSPASAR*
F24TB_12543639 | 3300000550 | Soil | PAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGLFEAGAHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDSPASAR*
JGI10213J12805_10081537 | 3300000858 | Soil | EVQGFIIPAGPPDLSFFLLSRISAIGNYCCELPTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLHQAHVEEAVGAKIMKIEDTPASAR*
F14TB_101799878 | 3300001431 | Soil | GFIIPAGPTDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGVSFFRLRDARVEEAVGARIFKLGDAPPSGR*
JGI25384J37096_10231283 | 3300002561 | Grasslands Soil | KSYGGREVEIQGFIIPAGPPDLSFFLLSRVSATGNYCCEVPVGQDETVYVYAAKGVKILYDPLRVYKVRGAFEAGPYTDRTYGVSLFRVRSARVEEAVGAKIFKMGETTTPGSNK*
C688J35102_118309515 | 3300002568 | Soil | ADSGITQTIKSYDGKEVEIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAANGVKLNYDPLRVYKIRGVFEAGMRSDPAYGPSLFRVRQARVEEAVGVPIFKVGETAPAATAKP*
soilL2_100530402 | 3300003319 | Sugarcane Root And Bulk Soil | KEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGAHTDPANGVSLFRLRQASVEEAVGAKIMKIEDAPASR*
Ga0066398_101288642 | 3300004268 | Tropical Forest Soil | EIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGTKILYDPLRVYKVRGAFEAGLHTDRTYGVSLYRLRDARVEEAVGAKIFKDR*
Ga0062595_1018194801 | 3300004479 | Soil | AAITAYAGREVEIQGFIIPAGPPDLSFFLLGRVSAMGNYCCELPSGQDETVYVYAAKGLSIRYDPLRVYRVRGVFEAGPVPDRAHGVSFYRLRDARVEEAVGARIFRVGDPPPAGR*
Ga0062594_1016313122 | 3300005093 | Soil | DLSFFLLSRVSALGNYCCEVPVGQDETVYVFGAKGVKLPYDPLRVYKIRGIFEAGKQVDRTYGVSMFRVRDAHVEEAIGAKIFKVGESPAPAAKP*
Ga0066680_100954894 | 3300005174 | Soil | SFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRNARVEEAVGAKIFKVGDTPGR*
Ga0066680_104534511 | 3300005174 | Soil | SLRANFEVTQTIKSYSGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPKQ*
Ga0066680_104902661 | 3300005174 | Soil | SFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRSARVEEAVGAKIFKVGDTPGR*
Ga0066679_102341501 | 3300005176 | Soil | LLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRVRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKSP*
Ga0066678_108804261 | 3300005181 | Soil | NFGVTQAISSYNGREVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVTIAYDPLRVYKVHGLFEAGMLSDPAYGPSLFRIRNARVVEAVGAKIFKVGETGK*
Ga0066676_109252931 | 3300005186 | Soil | EVEIHGFIIPAGLPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPNGVKILYDPLRVYKVRGAFEAGPHTDRTYGLSLFRVRNARVEEAVGAKIFKVGETPAPASR*
Ga0066388_1040944151 | 3300005332 | Tropical Forest Soil | FIIPAGPPDLSFFLLARVSAVGNYCCELPSGQDETVYVYAAKGLAISYDPLRVYKVRGVFEAGAVADKANGISFFRIRNARVEEAVGAKIFKVGDPSSGR*
Ga0066388_1047406031 | 3300005332 | Tropical Forest Soil | EIEIQGFIIPAGPPDLSFFLLGRVSATGNYCCELPSGQDETVYVYAAKGLSLRYDPLRVYRVRGVFEAGRHADQTYGISMFRVRNARVEEAVGAKIFKVGDAPPAGR*
Ga0066388_1073901721 | 3300005332 | Tropical Forest Soil | IIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPQGVKILYDPLRIYKIRGVFEAGMRADPTYGPSLFRVRNARVEEAVGARIFKVGEVPASAAKP*
Ga0070710_107824521 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | DLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRVRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0066686_100246135 | 3300005446 | Soil | IPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPNK*
Ga0066686_103409431 | 3300005446 | Soil | MKSHNGKEVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGEPPAPTPDK*
Ga0066686_104041162 | 3300005446 | Soil | IPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKSVKILYDPLRVYRVRGAFEAGSHTDRTYGVSLFRLRNARVEEAVGAKIFKVGETTAPAPH*
Ga0070707_1019139751 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | DSIRANLELTQKIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVFTAKGVKLFYDPLRVYKVRGRFDAGIRHDAAYGVSLFRLHQAQVEEAVGAKVMKSEGNAATAPGR*
Ga0070699_1007590342 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | QGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVFTAKGVKLFYDPLRVYKVRGRFDAGIRHDAAYGVSLFRLHQAQVEEAVGAKVMKSEGNAATAPGR*
Ga0070697_1015902272 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | IRAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0070696_1007824192 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | FIIPAGPPDLSFFLLGRVSATGNYCCEAPSGQDEVVYVYRAGGGSIRYDPLRVYAVRGVFEAGHHVDPRHGVSLFRVRDARVEEAVGATIFRIGD*
Ga0066701_103684432 | 3300005552 | Soil | ANFDVTQTIKSYSGKEVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAARDVKIHYDPLRVYRVRGAFEAGPHTDRVYGVSLFRMRNARVEEAVGAKIFRIGGP*
Ga0066701_1090063013300005552SoilGPPDLSFFLLSRVSAIGNYCCELPVGQDETVYVYAPSGVKIAYDPLRVYKVRGAFEAGLHTDRAYGVSLFRVRNARVEEAVGAKIFKIGETPAPASR*
Ga0066661_1009321233300005554SoilFEVTQTMKSHNGKEVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGEPPAPTPDK*
Ga0066692_1077817813300005555SoilTMKSHNGKEVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYASNRVKILYDPLRVYKVRGAFEAGPHTDRNYGFSLFRVRNARVEEAVGTKIFKVGEPPAPTPDK*
Ga0066707_1084870513300005556SoilVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAGGAKIFKVGETPAPTPNK*
Ga0066700_1090066313300005559SoilADSELTQSIKSYSGKEVEIGGFIVPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVFVFAAKAVKLLYDPLRVYKVRGVFEAGMSSDPANGPSLFRVRQARVEEAVGAPIFKVGETPPAAAAKP*
Ga0066705_1057530113300005569SoilTAYAGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEMPTGQDETVYVFTAKAVKISYDPLRVYRVRGVFEAGKHVDPTYGVSMFRVRDARVDVAVGAKIFRAGEVIR*
Ga0070702_10021130123300005615Corn, Switchgrass And Miscanthus RhizosphereEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVYTAKGVQIFYDPLRVYKVRGRFEAGMHTDPANGVSLFRLRQAHVEEAVGAKIMKVEDTPASAR*
Ga0068859_10226909323300005617Switchgrass RhizosphereAGPPDLSFFLLGRVSAVGNYCCELPSGQDETVYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADRAYGVSFFRLRNARAEEAVGARIFKLGDAPPTGR*
Ga0066905_10163724623300005713Tropical Forest SoilDSIRANFDVTQQIKTYTGREVEIQGFIVPAGPPDLSFFLLSRVSALGNYCCEIPTGQDETVYVFAATGLKLVYDPLRVYRVRGVFEAGKDVDRTYGVSMFRIRGARVEEAVGAKIFKVGETPAPGPKP*
Ga0066903_10072640933300005764Tropical Forest SoilGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPQGAKIQYDPLRVYKVRGAFEAGLHTDRAYGVSLYRLRDARVEEAAGAKIFKVGETPAATPKK*
Ga0066903_10086519423300005764Tropical Forest SoilIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYATNGVKILYDPLRVYKVRGAFEAGPYTDRTYGFSLFRVRNARVEEAVGAKIFKVGETPAPTRDR*
Ga0066903_10516034023300005764Tropical Forest SoilEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTPASAR*
Ga0066903_10541931513300005764Tropical Forest SoilIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYATNGVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRIHNARVEEAVGAKIFKVGETPAPPRDR*
Ga0066903_10908650913300005764Tropical Forest SoilIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPQGVKILYDPLRIYKIRGMFEAGMRADPTYGPSLFRVRNARVEEAVGARIFKVGEVPAPTAKP*
Ga0068860_10117720713300005843Switchgrass RhizosphereIKSYAGKDVEIQGFIVPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFADRGLKLVYDPLRVYRVRGLFEAGKHVDSTYGVSMFRIRGARVEEAVGAKIFKVGESAPAGKP*
Ga0070717_1158225823300006028Corn, Switchgrass And Miscanthus RhizosphereSMRANLGVTQGIRAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVHVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0075417_1004795713300006049Populus RhizosphereEVEIHGFIIPAGPPDLSFFPLSRVSALGNYCCEVPVGQDETVYVFGAKGVKLPFDPLRVYRVRGVFEVGKQVDRAYGVSMFRIRDARVEHAVGAKIFKVGETPAPVSKP*
Ga0075417_1034985323300006049Populus RhizosphereFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVHVYTAKGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKSP*
Ga0075015_10100128513300006102WatershedsHNGKEVEVRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAAKEVNIHYDPLRLYKVRGVFEAGMRSDPAYGPSVFRLRQARVEEAVGAMLFKVGETLPASAAKP*
Ga0070715_1079451713300006163Corn, Switchgrass And Miscanthus RhizosphereGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAKDVHVMYDPLRIYQVRGVFEAGMRSEPATGPSMFRLRQAHVEAAVGAVVFKVGEAGPAKP*
Ga0075422_1041448023300006196Populus RhizosphereQQMKSYAGKVVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFGAKGVKLPFDPLRVYKIRGVFEAGKQVDRTYGVSMFRVRDARVEEAVGAQIFKVGDSPAPAAKP*
Ga0079222_1118221323300006755Agricultural SoilLGFIIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGRTISYDPLRIYKIRGVFEAGLHVDKVYGVSLFRVRNAQMEEAVGAKIFKAGPPSR*
Ga0079222_1188019223300006755Agricultural SoilSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGTSLRYDPVRVYRIRGRFEAGHQVDPSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0066665_1082095323300006796SoilFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPYTDPTYGVSLFRVRNARVEEAGGAKIFKVGETLAPTPNK*
Ga0066665_1097622613300006796SoilAGPPDLAFFLLSRVSAMGNYCCEVLIGQDETVYVFAAKSVNIAYDPLRVYKVRGVFEAGKLVDRTYGVSMFRLREARVEVAVGAKIFKVGETPPRR*
Ga0075428_10116240123300006844Populus RhizosphereFDVTRQIKSYAGREVEIQGFIIPAGPPDLSFFLLSRVSATGNYCCEVPVGQDETVYVFAAGGLKLLYDPLRVYRVRGVFEAGKHVDASYGVSMYRIRDARVEEAVGAKIFKVGDGVAPGGRP*
Ga0075428_10156550013300006844Populus RhizosphereLTQKIKSYNGTEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGAHTDPASGVSLFRLRQAHVEEAVGAKIMKIEDSPASAR*
Ga0075421_10109622813300006845Populus RhizosphereAGPPDLSFFLLSRVSSIGNYCCEAPVGQDETVYVFTAKGVKIAYDPLRVYKVRGLFEAGQQRDTAYGISLFRMRQARVEAAVGAKIMQVEEVTAPPANK*
Ga0075421_10150470123300006845Populus RhizosphereDLSFFLLSRVSSTGNYCCEAPVGQDETVYVFATRGVNLIYDPLRIYKVRGVFEAGLTRDAVHGISLFRMRQAHVEEAVGAKIIKVEEATVPASRQ*
Ga0075430_10086128113300006846Populus RhizosphereQAIGSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAAKGVKIVYDPLRVYKVRGVFEAGMRPDATYGPSLFRMRSARVEEAVGAKIFKVGEAPEK*
Ga0075431_10030763313300006847Populus RhizosphereAGKRVEIRGFIIPAGPPDLSFFLLGRVSATGNYCCEAPSGQDEVVYVYRAGGGSIRYDPLRVYAVRGVFEAGHHVDPRHGVSLFRVRDAHVEEAVGATIFRIGD*
Ga0075431_10080679823300006847Populus RhizosphereGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVATARGVTITYDPLRVYKVRGVFEAGKHVDATYGVSMYRLRNAKVEVATGAKIFRVGETPAR*
Ga0075431_10111630423300006847Populus RhizosphereAGLPDLTFFLLSRVSSIGNYCCEAPVGQDETVYVFAARADQVTYDPLRVYTVRGVFEAGLQKDAVYGISLFRIRQARVEEAVGATIMKVGEPSAPASHP*
Ga0075431_10124001913300006847Populus RhizosphereVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVYTAKGVKLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKNEDTPAAAR*
Ga0075420_10012870143300006853Populus RhizosphereLLSRVSSTGNYCCEAPVGQDETVYVFAKRGVDLIYDPLRVYRVHGVFEAGLVRDAVYGISLFRMRQARVEEAMGAKIMKVEEATR*
Ga0075420_10141866823300006853Populus RhizosphereLKANFEVSQSIRSYNGKEVEVQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVFTAKAVNMAYDPLRIYKVRGIFEAGKHTDKAYGISLYRVRQARVEEAVGAKIMKIEGTATPSGQR*
Ga0075420_10169799413300006853Populus RhizosphereFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGAHTDPASGVSLFRLRQAHVEEAVGAKSMKIEDSPASAR*
Ga0075426_1040669323300006903Populus RhizosphereDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDKVYGVSLFRVRNAQMEEAVGAKIFKMGQPSR*
Ga0075424_10205489823300006904Populus RhizosphereEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGIKISYDPLRVYKVRGAFEAGPHTDRTYGISLFRVRNARVEEAVGAKVFKVGETPAPVSR*
Ga0075436_10005695553300006914Populus RhizosphereGPPDLSFFLLSRVSAIGNYCCELPVGQDETVYVYAPSGVKIAYDPLRVYKVRGAFEAGLHTDRAYGVSLFRVRNARVEEAVGAKIFKVGEPPPPSR*
Ga0075436_10137158713300006914Populus RhizosphereLVDSIRANSALTQEIKSYGGKEIEILGFIIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDKVYGVSLFRVRNAQMEEAVGAKIFKMGQPSR*
Ga0079218_1363185123300007004Agricultural SoilGFIIPAGPPDLSFFLLGRVSATGNYCCEAPSGQDEVVYVYRAGGPSIRYDPLRVYAVRGVFEAGHHVDPRHGVSLFRVRDARVEEAVGAKIFKIGD*
Ga0099794_1053802823300007265Vadose Zone SoilANFEVTQTMKSFNGKEVEIHGFIVPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDPTYGVSLFRVRNARVEEAVGAKIFKVGETPAPTPSK*
Ga0066710_10255573013300009012Grasslands SoilIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPNK
Ga0066710_10256701613300009012Grasslands SoilIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKSVKILYDPLRVYRVRGAFEAGSHTDRTYGVSLFRLRNARVEEAVGAKIFKVGETTAPAPH
Ga0099829_1111242013300009038Vadose Zone SoilIRANFEVTQTIRSYSGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGE*
Ga0099827_1023335623300009090Vadose Zone SoilPDLSFFLLSRVSAIGNYCCELPIGQDETVYVFTAKGVQIFYDPLRVYRVRGRFDAGVHRDAVYGVSLFRLHQAQVEEAVGAKIMKSEGTAASPR*
Ga0099827_1162405613300009090Vadose Zone SoilVEIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAATGVTLRYDPLRVYQVRGTFEAGMRSDPATGPSLFRLRDARVEEAVGAPIFKVGETPPVAAAKP*
Ga0099827_1187234813300009090Vadose Zone SoilYNGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKSVKILYDPLRIYRVRGAFEAGPHTDRAYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPNK*
Ga0075418_1065157233300009100Populus RhizosphereFLLSRVSSIGNYCCEAPVGQDETVYVFATRGVNLIYDPLRIYKVRGVFEAGLTRDAVHGISLFRMRQAHVEEAVGAKIIKVEEATVPASRQ*
Ga0075418_1196460313300009100Populus RhizosphereQKIKSYNGTEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGAHTDPASGVSLFRLRQAHVEEAVGAKIMKIEDSPASAR*
Ga0066709_10416615913300009137Grasslands SoilSGGKEVAIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAATGVTLRYDPLRVYQVRGTFDAGMRSDPATGPSLFRLRQARVEEAVGAPIFRIGETQPAAAAKP*
Ga0114129_1053349523300009147Populus RhizosphereAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVHVYTAKGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKSP*
Ga0114129_1209340813300009147Populus RhizosphereNGKEVEIHGFIIPVGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYASSRVKILYDPLRVYKVRGAFEAGPHTDRAYGFSLFRVRNARVEEAVGAKIFKVGEPPAPTPDK*
Ga0114129_1343338513300009147Populus RhizosphereVEVHGFIIPAGPPDLSFFLLSRVSAVGNYCCEMPVGQDETVYVFSAKGVKIAYDPLRVYKVSGIFEAGKHTDKAYGISLYRVRQARVEEAVGAKIMKVEEVATPPGQR*
Ga0111538_1005813213300009156Populus RhizosphereFFLLSRVSALGNYCCEVPTGQDETVYVFAAKGLKLAYDPLRVYRVRGTFEAGKQVDRTYGVSMFRIREARVEEAVGAKIFRVTP*
Ga0111538_1130042423300009156Populus RhizosphereVTQKIKSYNGKVIEMQGFIIPAGLPDLSFFLLSRVSVLGNYCCELPTGQDETVYVSAAKGVRLFYDPLRVYKVRGLFEAGIQTDPANGVSLFRLRQAHVEEAVGAKIMPMEDTPAAPR*
Ga0111538_1395911013300009156Populus RhizosphereKSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTPAAPR*
Ga0105242_1323382023300009176Miscanthus RhizosphereVEIQGFIVPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFADRGLKLVYDPLRVYRVRGLFEAGKHVDSTYGVSMFRIRGARVEEAVGAKIFKVGESAPAGKP*
Ga0114945_1032011123300009444Thermal SpringsVEIQGFIIPAGPPDLSFFLLSRVSAIGNFCCELPIGQDETVYVFTAKSVKLFYDPLRVYRVRGRFDAGVHRDAVYGVSLFRLHQAQVEEAIGAKIMKMEDTATPAPGR*
Ga0114944_103301913300009691Thermal SpringsKIKSYRGKEVEVQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFTAKGVNLLYDPLRIYKVRGVFEAGRHSDTTYGLSLFRLRQAHVEEAVGAKIFKVGETTAPASNQ*
Ga0105075_104864413300009799Groundwater SandKSYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKIYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASNQ*
Ga0105071_107168923300009808Groundwater SandSIKAELELTQKIKSYSGKEVEMQGFIIPAGLPDLSFFLLSRVSAIGNYCCEAPVGQDETVYVFTAKGVQILYDPLRVYKVRGVFEAGMQTDKAYGVSLFRLRQAHVEEAVGAKIFKVGEPTAPASNQ*
Ga0105082_111280413300009814Groundwater SandVEIHGFIIPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFAPKGVKILYDPLRVYKIRGLFEAGMRADATYGPSLFRVRNARVEEVVGAKIFKVGEAAAPAPKP*
Ga0105070_112344323300009815Groundwater SandFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKIYGVSLFRLRHAQVEEAVGAKIFKVGEPAAPASNQ*
Ga0126308_1112107213300010040Serpentine SoilQTIKWYDGKEVEIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAAKEVKLNYDPLRVYKVRGVFEAGMRSDPAYGPSLFRVRQARVKEVGVPIFRIGETAPAAAAKP*
Ga0126314_1058481713300010042Serpentine SoilMKSRGGKQVEIRGFIIPAGPPDLSFFLLSRVSALGNYCCEFPVGQDETVYVFTAKDVKIRYDPLRVYQVRGVFEAGMYSDPVYGPSLFRIRQARVEEALSATILKAGESPSTSPGIGTR*
Ga0126310_1026774513300010044Serpentine SoilFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETIYVFAANGVKLNYDPLRVYKIRGVFEAGMRSDPAYGPSLFRVRQARVEETAGVPIFKIGETAPAATAKP*
Ga0126311_1184882213300010045Serpentine SoilKEVEIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETIYVFAANGVKLNYDPLRVYKIRGVFEAGMRSDPAYGPSLFRIRQARVEEAVGVPIFKVGETVPAATAKP*
Ga0126384_1059948513300010046Tropical Forest SoilNFDVTQMIKSYNGKEVELHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYATNGVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRIHNARVEEAVGAKIFKVGETPAPPRDR*
Ga0126384_1160159313300010046Tropical Forest SoilNFEVTQTIKSYNGKEVEIHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYALKGAKILYDPLRVYRVRGAFEAGLHEDRAYGFSLFRLRDARVEEAAGAKIFKVGETPGPTPNR*
Ga0126382_1032422433300010047Tropical Forest SoilGNYCCEAPVGQDETVYVFAAKAPHLVYDPLKVYKVRGVFEAGLKTDPVQGISLYRLRQAWVEEAVGAKIMKTEEAGSPPPHK*
Ga0126382_1065769413300010047Tropical Forest SoilIRANLELTQKIQSYNGKEVELQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVFAAKGVQIFYDPLRVYKVRGRFDAGVHRDAVYGVSLFRLRQAQVEEAVGAKIMKIEGTTTTAPGQ*
Ga0126382_1212333413300010047Tropical Forest SoilKEVEIQGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETVYVYAAKGLTIRYDALRVYRVRGVFEAGPAADKAHGVSFFRLRNARVEEAVGARIFKVGDPPGPTGR*
Ga0134071_1017846413300010336Grasslands SoilQTIKSYGGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKSVKILYDPLRVYRVRGAFEAGSHTDRTYGVSLFRLRNARVEEAVGAKIFKVGETTAPAPH*
Ga0126370_1155031623300010358Tropical Forest SoilDGKEIEIQGFIIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGATITYDPLRVYKIRGVFEAGLHVDKVYGVSLFRVRNAQMEEAVGAKIFKVGPPR*
Ga0126376_1237271213300010359Tropical Forest SoilAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGAKILYDPLRVYKVRGAFEAGPHTDRAYGVSLFRLRDARVEEAVGAKIFKVGETPAPTSNR*
Ga0126372_1209085323300010360Tropical Forest SoilGFIIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGATISYDPLRIYKIRGVFEAGLHVDRVYGASLFRVRNAQLEEAVGAKIFKVGQPGR*
Ga0126372_1299443813300010360Tropical Forest SoilRANFEVTQQMKSYAGKDVEIQGFIVPAGPPDLSFFLLSRVSALGNYCCEIPTGQDETVYVFAATGLKLVYDPLRVYRVRGVFEAGKDVDRTYGVSMFRIRGARVEEAVGAKIFKVGETSPAAPKP*
Ga0126372_1324324823300010360Tropical Forest SoilEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASKQ*
Ga0126377_1125232813300010362Tropical Forest SoilVKSYNGKEVEMQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGLFEAGVHTDPANGVSLFRLYQAHVEEAVGAKIMKIEDTPASAR*
Ga0126379_1096859723300010366Tropical Forest SoilGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPQGAKIQYDPLRVYKVRGAFEAGLHTDRAYGVSLYRLRDARVEEAAGAKIFKVGEPPAATPNR*
Ga0126379_1136623313300010366Tropical Forest SoilRANAALTEGIRAYAGREVEIQGFIIPAGPPDLSFFLLGRVSATGNYCCELPSGQDETVYVYAASGLSIRYDPLRVYRVRGVFEAGRVVDQAYGVSMFRVRRASVEEAVGAKIFKVGETPASNR*
Ga0126379_1315097713300010366Tropical Forest SoilGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVFTAKGVQIFYDPLRVYKVRGRFDAGVHRDAVYGVSLFRLRQAQVEEAVGAKIMKIEGTTTTAPGQ*
Ga0126383_1266007413300010398Tropical Forest SoilGLPDLSFFLLSRVSSIGNYCCEAPVGQDETVYVFAASAPRISFDPLKVYKVHGIFEAGLKTDHQYGMSLFRVRQARVEEAVGAKIMKVGETTASPTER*
Ga0126383_1286596413300010398Tropical Forest SoilYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGTQLFYDPLRVYKVRGLFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTPASAR*
Ga0137392_1164598523300011269Vadose Zone SoilKAELEVTQKIKSYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVNILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASNQ*
Ga0137391_1035246823300011270Vadose Zone SoilGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAAKGVKILYDPLRIYRIRGVFEAGMRADAAYGPSLFRVHNARVEEAVGAKMFKVGESPVPASKP*
Ga0137393_1012299813300011271Vadose Zone SoilEVTQTIKSYGGKEVAIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVKILYDPLRVYRIRGVFEAGMRADTTYGPSFFRVHNARVEEVVGAKIFKVGETPAPAAKP*
Ga0137393_1157116813300011271Vadose Zone SoilLSGKEVEIQGFIIPAGPPDLSFFLLSRVSAVGNYCCEIPTGQDETVYVATARGVTIAYDPLRVYKVRGVFEAGKHVDATYGVSMYRVRNAKVEVAAGAKIFRVGETPKP*
Ga0137427_1045170213300011445SoilFIIPAGPPDLSFFLLSRVSVLGNYCCELPTGQDETVYVSTAKEVRLFYDPLRVYKVRGLFEAGVQTDPAHGVSLFRLRQAHVEEAVGVKIIPMEDTPAAPR*
Ga0137389_1054843613300012096Vadose Zone SoilQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRVYKVRGAFEAGPYTDRTYGISLYRVRNARVEEAVGARIFKLGQTTTPGSNK*
Ga0137389_1158495613300012096Vadose Zone SoilQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRVYKVRGAFEAGPYTDRTYGISLYRVRNARVEEAVGARIFKLGETTTPGSNK*
Ga0137388_1109578523300012189Vadose Zone SoilVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTCQDETVYVYTATGTQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTAAPAR*
Ga0137363_1179560523300012202Vadose Zone SoilEVTQKIKSYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFDAGMQTDKTYGVSLFRLRNAQVEEAVGAKIFKVGEPTAPASNQ*
Ga0137399_1012568113300012203Vadose Zone SoilNGKEVEIHGFIIPAGPPDLSFFLLSRVSAIGNYCCEVPVGQDETVYVYAPKGVKILYDPLRIYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGETPAPTPNK*
Ga0137399_1176487313300012203Vadose Zone SoilKSYSGKEVEIRGFIVPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVFVFAAKAVKLLYDPLRVYKVRGVFEAGMSSDPANGSSLFRVRQARVEEAVGAPIFKVGETPPAAAAKP*
Ga0137362_1024308633300012205Vadose Zone SoilGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRNARVEEAVGAKIFKVGDTPGR*
Ga0137380_1110773613300012206Vadose Zone SoilGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGLFEAGAHTDPANGMSLFRLRQAHVEEAVGAKIMKIEDTPASAR*
Ga0137379_1073692113300012209Vadose Zone SoilKEVEIRGFIIPAGPPDLSFFMLSRVSALGNYCCEVPVGQDETVYVFAATSVTLLYDPLRVYQVRGTFEAGMRSDPATGPSLFRLRDARVEEAVGAPIFKVGETPPVAAAKP*
Ga0137378_1174400113300012210Vadose Zone SoilIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYAAKGVQLFYDPLRVYKVRGVFEAGAHTDPANGMSLFRLRQAHVEEAVGAKIMKIEDTPASAR*
Ga0137378_1188224313300012210Vadose Zone SoilELTQKIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPIGQDETVYVFTAKGVQIFYDPLRVYRVRGRFDAGVHRDAVYGVSLFRLHQAQVEEAVGAKIMKSEGTAASPR*
Ga0137377_1139764023300012211Vadose Zone SoilYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASNQ*
Ga0137371_1091821723300012356Vadose Zone SoilLLSRVSAVGNYCCEAHVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTVPASNQ*
Ga0137371_1117269213300012356Vadose Zone SoilSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPIGQDETVYIFTAKGVQIFYDPLRVYRVRGRFDAGVHRDAVYGVSLFRLHQAQVEEAVGAKIMKSEGTAASPR*
Ga0137384_1108567523300012357Vadose Zone SoilSFFLLSRVSGTGNYCCELPSGQDETVHVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0137385_1040107833300012359Vadose Zone SoilLTQKIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPIGQDETVYVFTAKGVQIFYDPLRVYRVRGRFDAGVHRDAVYGVSLFRLHQAQVEEAVGAKIMKSEGTAASPR*
Ga0137390_1144044513300012363Vadose Zone SoilSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTPASAR*
Ga0137397_1014594523300012685Vadose Zone SoilPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYIYTARGLSLRYDPSRVYRIRGRFEAGHHVDRTYGVSFFRVRDARVEEAVGAKIFKVDEPR*
Ga0137394_1041733223300012922Vadose Zone SoilGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYTSNRVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGEPPAPTPDK*
Ga0137416_1144979113300012927Vadose Zone SoilRVSGTGNYCCELPSGQDETVHVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP*
Ga0137404_1070128523300012929Vadose Zone SoilAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDPTYGVSLFRVRNARVEEAVGAKIFKVRETPAPTPSK*
Ga0137407_1087959513300012930Vadose Zone SoilPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYATKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRIRNARVEEAVGAKIFKVEETTAPALHPP*
Ga0126369_1148253223300012971Tropical Forest SoilGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPQGAKILYDPLRVYKVRGAFEAGLHTDRAYGVSLYRLRDARVEEAAGAKIFKVGEPPAATPNR*
Ga0134076_1015822123300012976Grasslands SoilGFIIPAGPPDLSFFLLSRVSAVGNYCCEVPVGQDETVYVYAAKSVKILYDPLRVYRVRGAFEAGSHTDRTYGVSLFRLRNARVEEAVGAKIFKVGETTAPAPR*
Ga0157377_1145848913300014745Miscanthus RhizosphereTVTQEIKSYAGKEVEIQGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGVSFFRLRNARVEEAVGARIFKLGDVPPTGR*
Ga0137418_1006603223300015241Vadose Zone SoilMPIPGTQNEPVASPVEKIKVRVDGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTATGAQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKVEDTPAPAR*
Ga0137409_1003318063300015245Vadose Zone SoilPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVTLRYDPIRVYRIRGVFEAGHHVDRSHGVSFFRVRDARVEEAVGAKIFKQ*
Ga0137409_1013035023300015245Vadose Zone SoilANLEVTQKIKSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTATGAQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKVEDTPAPAR*
Ga0137409_1040039523300015245Vadose Zone SoilANLEVTQKIKSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYATKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRIRNARVEEAVGAKIFKVEETTAPALHPP*
Ga0137409_1110892423300015245Vadose Zone SoilDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAGGVKILYDPLRVYKVRGVFEAGPHTDKSYGVSLFRVRNARVEEAVGAKIFKVGDERR*
Ga0137403_1026335113300015264Vadose Zone SoilPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYTSNRVKILYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGEPPAPTPDK*
Ga0137403_1051999513300015264Vadose Zone SoilLSRVSGTGNYCCELPSGQDETVYIYAARGLSLRYDPSRVYRIRGRFEAGHHVDRTYGVSFFRVRDARVEEAVGAKIFKVDEPR*
Ga0137403_1060873523300015264Vadose Zone SoilPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYKVRGAFEAGPHTDPTYGVSLFRVRNARVEEAVGAKIFKVRETPAPTPSK*
Ga0137403_1066556513300015264Vadose Zone SoilPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVHAARGVKIVYDPLRVYRVQGVFEAGPHTDRTHGVSLFRLRNARVEEAVGARIFKENARP*
Ga0134089_1008092913300015358Grasslands SoilEIQGFIIPAGPPDLSFFLLSRVSAVGNYYYEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPNK*
Ga0132258_1017581873300015371Arabidopsis RhizosphereIQGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETVYVYAAKGLAIRYDPLRVYRVRGVFEAGPVADKANGVSFFRLRNARVEEAVGARIFKVGDAPAPTGR*
Ga0132255_10525515723300015374Arabidopsis RhizosphereGPPDLSFFLLGRVSAVGNYCCELPSGQDETVYVYAAKGLTIRYDPLRVYKVRGVFEAGAAADKAYGVSFFRLRNARVEEAVGARIFKVGDAPPPGR*
Ga0132255_10576080323300015374Arabidopsis RhizosphereGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDKVYGVSLFRVRNAQMEEAEGAKIFKVGPPSR*
Ga0182035_1191739813300016341SoilEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLLYDPLRVYKVRGLFEAGIHTDPANGVSLFRLRQAHVEEAVGAKIMKIEDTPAATR
Ga0182037_1191914823300016404SoilDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRICKIRGVFEAGLHDDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0184605_1015782213300018027Groundwater SedimentIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADKANGVSFFRLRNARVEEAVGARIFKLGDAPPTGR
Ga0184608_1011400623300018028Groundwater SedimentEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPIRVYRIRGVFEAGHHVDRSHGVSFFRVRDARVEEAVGARIFKQ
Ga0184619_1044330223300018061Groundwater SedimentYAGKEVEIQGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADKANGVSFFRLRNARVEEAVGARIFKLGDAPPTGR
Ga0184637_1009987213300018063Groundwater SedimentNIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSSIGNYCCELPAGQDETVYVFTAQGAKLFYDPLRVYKVRGHFEAGLYTDPANGASLFRLRQAQIEEAVGVKIMKIEDTVSSPR
Ga0184637_1036727513300018063Groundwater SedimentNLEVTRTIKAYSGREVEIHGFIIPTGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVKILYDPLRVYKIRGLFEAGMSADATYGPSLFRVHNARVEEVVGAKIFKVGEAAAPAPKP
Ga0190272_1120448523300018429SoilIRANQTVTRDIKSYAGKEVEVHGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGVSFFRLRNARVEEAVGARIFKLGDAPPTGR
Ga0190269_1108433913300018465SoilGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFTAKGVKIFYDPLRIYKVRGVFEAGMHSDKAYGVSLFRVRQAHVEEAVGAKIFKVGESTTPAANQ
Ga0066662_1021537233300018468Grasslands SoilFGVTQAIASYNGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRNARVEEAVGAKIFKVGDTPGR
Ga0066662_1129545813300018468Grasslands SoilFGVTQAIASYNGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRSARVEEAVGAKIFKVGDTPGR
Ga0193707_113937113300019881SoilLLSRVSGTGNYCCELPSGQDETVYVYVAKGISIRYDPVRVYKIRGMFEAGHQADRSYGVSFFRVRNAHVEEAVGARIFKEEKGR
Ga0193755_103386223300020004SoilIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYIYTARGLSLRYDPSRVYRIRGRFEAGHQVDRTYGVSFFRVRDARVEEAVGAKIFKVDETR
Ga0193755_119198923300020004SoilVPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFADKGLKLVYDPLRIYRVRGIFEAGKHVDSTYGVSMFRIRGARVEEAVGAKIFKVGESAPAPKP
Ga0210382_1050364923300021080Groundwater SedimentEVEIQGFIVPAGPPDLSFFLLSRVSAMGNYCCEVPTGQDETVYVFAAKGLKLLYDPLRVYRIRGVFEAGKQVDTTYGVSMFRIRDARVEEAVGAKIFKLGDDTAPAAKP
Ga0210402_1121154913300021478SoilVEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVSVYAAKGLSLRYDPVRVYRIRGVFEAGHQADRSYGVSFFRIRDARVEEAVGARIFKVGETKAPLNP
Ga0210402_1125417813300021478SoilKHRLKKRPQILKLGFIIPAGPPDLSLFLLSRVSALGNYCCEAPVGQDETVYVFTTKPVNITYDPLRIFKIVGVFEAGMRADATYGPSLFRVRDARVEEAVGAKMFKTGDFPASASKP
Ga0212128_1003010113300022563Thermal SpringsSFFLLSRVSALGNYCCEVPVGQDETVYVFTAKGVNLLYDPLRIYKVRGVFEAGRHSDTTYGLSLFRLRQAHVEEAVGAKIFKVGETTAPASNQ
Ga0212128_1045656223300022563Thermal SpringsTQKIRSYNGQEVEIQGFIIPAGAPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFTAKGVQLFYDPLRVYRVRGRFDAGVRHDAVYGVSLFRLHQAQVEEAVGAKIMPTKDTATTAPGP
Ga0207684_1027818043300025910Corn, Switchgrass And Miscanthus RhizosphereDLSFFLLGRVSAVGNYCCELPSGQDETIYVYSAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGISFFRLRNARVEEAVGARIFKLGDAPPTGR
Ga0207654_1059151623300025911Corn RhizosphereDLSFFLLSRVSAMGNYCCEIPVGQDETVYVFGAKGLKLPYDPLRVYKVRGVFEAGKQVDRTYGVSMFRVRDARVEEAIGAKIFKVGESAAPAAKP
Ga0207676_1176495223300026095Switchgrass RhizosphereGKEVEIHGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGVSFFRLRNARVEEAVGARIFKLGDAPPTGR
Ga0209801_132054413300026326SoilNFGVTQAISSYNGREVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVTIAYDPLRVYKVHGLFEAGMLSDPAYGPSLFRIRNARVVEAVGAKIFKVGETGK
Ga0209802_106153743300026328SoilSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRNARVEEAVGAKIFKVGDTPGR
Ga0209802_123365823300026328SoilSLRANFEVTQTIKSYSGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPKQ
Ga0209802_128913313300026328SoilMRANLGVTQGIRAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVHVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0257161_102868313300026508SoilQGIKAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0257158_109290713300026515SoilLGVTQGIKAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0209690_129906913300026524SoilASYNGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRSARVEEAVGAKIFKVGDTPGR
Ga0209157_102709813300026537SoilFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPAPNK
Ga0209157_118027713300026537SoilPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVTIAYDPLRVYKVHGLFEAGMLSDPAYGPSLFRIRNARVVEAVGAKIFKVGETGK
Ga0179587_1101435913300026557Vadose Zone SoilVDSIRANFGVTQAIASYNGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFATKGLKILYDPLRIYKVRGVFEAGMRPDATYGPSLFRIRNARVEEAVGAKIFKVGDTPGR
Ga0209854_105440823300027384Groundwater SandIKSYSGKEVEIQGFIIPAGRPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVFTAKEVKIFYDPLRVYKVRGVFEAGMHRDKAYGVSLFRLRQAQVEEAVGAKIFKLGETTTPAANQ
Ga0209117_114634913300027645Forest SoilPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRIRGVFEAGHQVDPSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0209966_111298223300027695Arabidopsis Thaliana RhizospherePPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVFGAKGLKLPYDPLRVYKVRGVFEAGKQVDRAYGVSMFRVRDARVEEAIGAKIFKVGETAAPAAKP
Ga0209998_1016770023300027717Arabidopsis Thaliana RhizosphereVEIQGFIIPAGLPDLTFFLLSRVSSIGNYCCEAPVGQDETVYVFAARADQVTYDPLRVYKVRGVFEAGLKKDAVYGLSLFRIRQARVEEVIGATIMKVGETTAPPSRQ
Ga0209074_1055727323300027787Agricultural SoilSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGTSLRYDPVRVYRIRGRFEAGHQVDPSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0209701_1059363113300027862Vadose Zone SoilEVTRTIKSYSGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEVPVGQDETVYVFAPKGVKILYDPLRVYRIRGVFEAGMRADTTYGPSFFRVHNARVEEVVGAKIFKVGEIPAPAAK
Ga0209814_1038747323300027873Populus RhizosphereLELTQKIKSYNGTEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYTAKGVQLFYDPLRVYKVRGRFEAGAHTDPASGVSLFRLRQAHVEEAVGAKIMKIEDSPASAR
Ga0209814_1042482823300027873Populus RhizosphereFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVHVYTAKGVSLRYDPVRVYRIRGVFEAGHQVDRSYGVSFFRVRDARVEEAVGARIFKVGETKSP
Ga0209283_1095822813300027875Vadose Zone SoilRSYSGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAPKGVKILYDPLRVYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGE
Ga0207428_1013636143300027907Populus RhizosphereGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFAAKGLKLAYDPLRVYRVRGTFEAGKQVDSTYGVSMFRIRGARVEEAVGAKIFRVTP
Ga0209382_1144475523300027909Populus RhizosphereSRVSSIGNYCCEAPVGQDETVYVFATRGVNLIYDPLRIYKVRGVFEAGLTRDAVHGISLFRMRQAHVEEAVGAKIIKVEEATVPASRQ
Ga0268264_1229273613300028381Switchgrass RhizosphereQGFIVPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFADRGLKLVYDPLRVYRVRGLFEAGKHVDSTYGVSMFRIRGARVEEAVGAKIFKVGESAPAGKP
Ga0307290_1014090713300028791SoilTRSIMAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPIRVYRIRGVFEAGHHVDRSHGVSFFRVRDARVEEAVGARIFKQ
Ga0307504_1021586223300028792SoilANFGLTQGIRAYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYIYTARGLSLRYDPSRVYRIRGRFEAGHQVDRTYGVSFFRVRDARVEEAVGAKIFKVDETR
Ga0307284_1030118423300028799SoilVTQGIKTYNGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGVSLRYDPVRVYRIRGVFEAGHQVDRTYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0247824_1070336823300028809SoilDSMRANFDVTQQIKSYAGKDVEIQGFIVPAGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFAAKGLKLAYDPLRVYRVRGTFEAGKQVDRTYGVSMFRIREARVEEAVGAKIFRVTP
Ga0247825_1131458223300028812SoilGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVFGAKGLKLPYDPLRVYKVRGVFEAGKQVDRAYGVSMFRVRDARVEEAIGAKIFKVGETAAPAAKP
Ga0307278_1018438013300028878SoilPAGPPDLSFFLLSRVSAMGNYCCEVPTGQDETVYVFAAKGLKLLYDPLRVYKIRGVFEAGKQVDTTYGVSMFRIRDARAEEAVGAKIFKVGEGTAPAAKP
Ga0299907_1015988133300030006SoilSFFLLSRVSAMGNYCCEVPVGQDETVYVFTAKGAQIRYDPLRVYKVRGIFEAGLRRDREYGISLFRLRQARVEEAVGAKIMKVEDVTPAPARQ
Ga0302046_1118635823300030620SoilQSYNDKEVEMHGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVSTAKGVQLHYDPLRVYKVRGTFEAGMRRDKEYGVSLFRLRQAQVEEAIGARIMKVEDATPAPTRQ
Ga0170834_10908276423300031057Forest SoilADSGVTETIKSYSGKQVAIQGFIIPAGPPDLSFFLLSRVSALGNYCCEAPVGQDETVYVFTAKAVNITYDPLRIFMIVGVFEAGMRADATYGPSLFRVRNARVEEAVGARMFKTGDFPASASKP
Ga0308189_1039836113300031058SoilAELEVTQKIKSYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAAKGVKILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASNQ
Ga0308204_1022591613300031092SoilKIKSYGGKEVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFAEKGVKILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTVPASNQ
Ga0308191_100726513300031098SoilIPAGPPDLSFFLLSRVSAMGNYCCEAPVGQDETVYVFTAKGVNILYDPLRVYKVRGVFEAGMQTDKTYGVSLFRLRHAQVEEAVGAKIFKVGEPTAPASNQ
(restricted) Ga0255311_115447123300031150Sandy SoilADSMRANLGVTRGIMAYSGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYAARGLSLRYDPVRVYRIRGVFEAGHHVDRLYGVSFFRVRDARVEEAVGAKIFRTP
(restricted) Ga0255310_1024342913300031197Sandy SoilLADSMRANLGVTRGIMAYSGREIEIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYAARGLSLRYDPVRVYRIRGVFEAGYHVDRLYGVSFFRVRDARVEEAVGAKIFRTPG
Ga0318574_1034068223300031680SoilGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0307469_1027652023300031720Hardwood Forest SoilTVTQGIKAYAGKEVEIHGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPAADKAYGVSFFRLRNARVEEAIGARIFKLGDAPPIGR
Ga0307469_1027754213300031720Hardwood Forest SoilIQGFIIPAGPPDLSFFLLSRVSGTGNYCCELPSGQDETVYVYTARGTSLRYDPIRVYRIRGRFEAGHQVDPSYGVSFFRVRDARVEEAVGARIFKVGETKGP
Ga0318537_1036886413300031763SoilFIIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYASKGMTISYDPLRIYKIRGVFEAGLHVDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0310892_1015776813300031858SoilSFFLLSRVSAMGNYCCEIPVGQDETVYVFGAKGLKLPYDPLRVYKVRGVFEAGKQVDRTYGVSMFRVRDARVEEAIGAKIFKVGESAAPAAKP
Ga0310912_1030743833300031941SoilPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFDAGLLVDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0315278_1046481123300031997SedimentRANNQVTQTMQSLSGKEVEIQGFIIPAGPPDLSFFLLSRVSALGNYCCEIATGQDEIVYVATARGVTIAYDPLRVYKVRGVFEAGKHVDATYGVSMYRLRNAKVEVAVGAKIFRAGETPPSTKP
Ga0307416_10137334513300032002RhizosphereKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELPTGQDETVYVYTAKGVQLFYDPLRVYKVCGRFEAGVHTDPANGVSLFRLRQARVEEAVGAKIMKLEDTPASAR
Ga0307416_10297135013300032002RhizosphereRAELELTQKIKSFSGKEVEIQGFIIPAGAPDLSFFLLSRVSSIGNYCCEAPVGQDETVYVFANRGVNLTYDPLRVYRVRGVFEAGLTRDAVHGISLFRMRQARVEEAVGAKIMKTEDTTTPGK
Ga0310897_1017120623300032003SoilSRVSALGNYCCEVPVGQDETVYVFGAKGVKLPYDPLRVYKIRGVFEAGKQVDRTYGVSMFRVRDARVEEAIGAKIFKVGESAAPAAKP
Ga0307411_1078204813300032005RhizosphereEVTQKIQSYNGKEVEIQGFIIPAGPPDLSFFLLSRVSAIGNYCCELSTGQDETVYVYMAKGVQLFYDPLRVYKVRGRFEAGVHTDPANGVSLFRLRQAHVEEAVGAKIMKLEDTPASAR
Ga0310899_1022786623300032017SoilFFLLSRVSAMGNYCCEIPVGQDETVYVFGAKGLKLPYDPLRVYKVRGVFEAGKQVDRTYGVSMFRVRDARVEEAIGAKIFKVGESAAPAAKP
Ga0318513_1033875213300032065SoilFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0318525_1044570623300032089SoilILGFVIPAGPPDLSFFLLSRVSATGNYCCELPSGQDETVYVYAAKGMTISYDPLRIYKIRGVFEAGLHVDRVYGVSLFRVRKAQMEEAVGAKIFRVDQR
Ga0307471_10052557233300032180Hardwood Forest SoilYNGKEVEIHGFIIPAGPPDLSFFLLSRVSAIGNYCCEVPVGQDETVYVYAPRRANIVYDPLRVYKVRGAFEAGPHTDRTYGFSLFRVRNARVEEAVGAKIFKVGETHAPTPDK
Ga0307471_10268251413300032180Hardwood Forest SoilSLRANFEVTQTIKSYSGREVEIQGFIIPAGPPDLSFFLLSRVSAMGNYCCEVPVGQDETVYVYAAKGVKILYDPLRIYRVRGAFEAGPHTDRTYGVSLFRVRNARVEEAVGAKIFKVGETTAPASKK
Ga0307471_10396716113300032180Hardwood Forest SoilGDAIRANQTVTHEVKSYAGKQVEIHGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPAADKAYGVSFFRLRNARVEEAIGARIFKLGDAPPTGR
Ga0214471_1095474113300033417SoilVEIQGFIVPAGAPDLSFFLLSRVSSIGNYCGEAPVGQDETVYVFATRGVNLIYDPLRVYKVCGVFEAGLTRDAVYGISLFRMRQARVEEAVGANIMKVEDATAPTAGK
Ga0247830_1015904413300033551SoilGPPDLSFFLLSRVSALGNYCCEVPTGQDETVYVFAAKGLKLAYDPLRVYRVRGTFEAGKQVDSTYGVSMFRIREARVEEAVGAKIFRVTP
Ga0364943_0234676_1_3393300034354SedimentSYAGKEVEIHGFIIPAGPPDLSFFLLGRVSAVGNYCCELPSGQDETIYVYAAKGMTIRYDPLRVYKVRGVFEAGPVADKAYGVSFFRLRNARVEEAVGARIFKLGDAPPTGR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"