NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F010644




Overview

Basic Information
Family ID F010644
Family Type Metagenome / Metatranscriptome
Number of Sequences 301
Average Sequence Length 153 residues
Representative Sequence MERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTFGPSGGYPTH
Number of Associated Samples 250
Number of Associated Scaffolds 301

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 79.00 %
% of genes near scaffold ends (potentially truncated) 38.87 %
% of genes from short scaffolds (< 2000 bps) 73.09 %
Associated GOLD sequencing projects 224
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.28


Most Common Taxonomy
Group Bacteria (77.076 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (11.960 % of family members)
Environment Ontology (ENVO) Unclassified (33.555 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (38.870 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 27.13%, β-sheet: 16.49%, Coil/Unstructured: 56.38%

Predicted 3D Structure

pTM-score: 0.28

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
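The cutoff stated in the note above can be expressed as a small check. This is only a sketch: the function name `is_low_confidence` and the constant `PTM_THRESHOLD` are hypothetical names chosen for illustration, and the comparison simply mirrors the "pTM < 0.7" wording rather than any code used by the database.

```python
# Cutoff copied from the note above: models with pTM < 0.7 are labeled low confidence.
# The names below are illustrative assumptions, not part of NMPFamsDB.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score falls strictly below the cutoff."""
    return ptm_score < threshold


# Family F010644 reports a pTM-score of 0.28.
print(is_low_confidence(0.28))
```

A score exactly at the cutoff is not flagged here, because the note uses a strict inequality.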



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 301 Family Scaffolds
PF02775 | TPP_enzyme_C | 20.27
PF13419 | HAD_2 | 18.60
PF00378 | ECH_1 | 6.64
PF16113 | ECH_2 | 3.32
PF02776 | TPP_enzyme_N | 2.66
PF08402 | TOBE_2 | 2.66
PF01799 | Fer2_2 | 2.33
PF00132 | Hexapep | 1.99
PF00702 | Hydrolase | 1.99
PF03450 | CO_deh_flav_C | 1.66
PF02738 | MoCoBD_1 | 1.00
PF00528 | BPD_transp_1 | 1.00
PF03328 | HpcH_HpaI | 0.66
PF13242 | Hydrolase_like | 0.66
PF13683 | rve_3 | 0.66
PF02653 | BPD_transp_2 | 0.33
PF13343 | SBP_bac_6 | 0.33
PF03901 | Glyco_transf_22 | 0.33
PF08352 | oligo_HPY | 0.33
PF13751 | DDE_Tnp_1_6 | 0.33
PF12085 | DUF3562 | 0.33
PF13416 | SBP_bac_8 | 0.33
PF00154 | RecA | 0.33
PF00535 | Glycos_transf_2 | 0.33
PF02771 | Acyl-CoA_dh_N | 0.33
PF13439 | Glyco_transf_4 | 0.33
PF03703 | bPH_2 | 0.33
PF04338 | DUF481 | 0.33
PF13458 | Peripla_BP_6 | 0.33
PF00106 | adh_short | 0.33
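The percentages in the table above can be turned back into approximate scaffold counts. The sketch below assumes that each frequency is simply the number of scaffolds carrying the domain divided by the 301 family scaffolds; the page does not state this, so treat the counts as estimates. The helper name `approx_scaffold_count` is hypothetical.

```python
# Total taken from the table header ("% Frequency in 301 Family Scaffolds").
FAMILY_SCAFFOLDS = 301


def approx_scaffold_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage corresponds to.

    Assumption: percent = 100 * count / total, rounded to two decimals.
    """
    return round(percent * total / 100)


# A few rows copied from the Pfam table above.
neighbors = {"TPP_enzyme_C": 20.27, "HAD_2": 18.60, "ECH_1": 6.64, "RecA": 0.33}

for name, pct in neighbors.items():
    print(f"{name}: about {approx_scaffold_count(pct)} scaffolds")
```

Under this assumption the lowest value in the table, 0.33 %, corresponds to a single scaffold.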

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 301 Family Scaffolds
COG0469 | Pyruvate kinase | Carbohydrate transport and metabolism [G] | 0.66
COG2301 | Citrate lyase beta subunit | Carbohydrate transport and metabolism [G] | 0.66
COG3836 | 2-keto-3-deoxy-L-rhamnonate aldolase RhmA | Carbohydrate transport and metabolism [G] | 0.66
COG0468 | RecA/RadA recombinase | Replication, recombination and repair [L] | 0.33
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.33
COG3137 | Putative salt-induced outer membrane protein YdiY | Cell wall/membrane/envelope biogenesis [M] | 0.33
COG3402 | Uncharacterized membrane protein YdbS, contains bPH2 (bacterial pleckstrin homology) domain | Function unknown [S] | 0.33
COG3428 | Uncharacterized membrane protein YdbT, contains bPH2 (bacterial pleckstrin homology) domain | Function unknown [S] | 0.33



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 77.08 %
Unclassified | root | N/A | 22.92 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2162886012|MBSR1b_contig_10793162 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1287
3300000443|F12B_10649567 | Not Available | 1180
3300000550|F24TB_10104085 | All Organisms → cellular organisms → Bacteria | 2890
3300000559|F14TC_100480766 | All Organisms → cellular organisms → Bacteria | 1660
3300000956|JGI10216J12902_101910299 | Not Available | 645
3300002122|C687J26623_10031523 | All Organisms → cellular organisms → Bacteria | 1415
3300003324|soilH2_10032192 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 16741
3300003911|JGI25405J52794_10071039 | Not Available | 759
3300003994|Ga0055435_10013364 | All Organisms → cellular organisms → Bacteria | 1617
3300004009|Ga0055437_10034422 | All Organisms → cellular organisms → Bacteria | 1281
3300004025|Ga0055433_10025624 | Not Available | 1065
3300004052|Ga0055490_10094004 | Not Available | 838
3300004062|Ga0055500_10007407 | All Organisms → cellular organisms → Bacteria | 1714
3300004156|Ga0062589_100562846 | All Organisms → cellular organisms → Bacteria | 979
3300004156|Ga0062589_101002321 | Not Available | 780
3300004157|Ga0062590_100596024 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 969
3300004463|Ga0063356_100066783 | All Organisms → cellular organisms → Bacteria | 3718
3300004463|Ga0063356_102297757 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus normandii | 823
3300005093|Ga0062594_101938353 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus normandii | 627
3300005328|Ga0070676_10036618 | All Organisms → cellular organisms → Bacteria | 2827
3300005331|Ga0070670_100106110 | All Organisms → cellular organisms → Bacteria | 2420
3300005332|Ga0066388_102315289 | Not Available | 972
3300005332|Ga0066388_108481778 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Salinispora → Salinispora pacifica | 512
3300005334|Ga0068869_100051720 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2980
3300005336|Ga0070680_100169569 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1836
3300005337|Ga0070682_100006921 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 6375
3300005339|Ga0070660_100199770 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1622
3300005341|Ga0070691_10037606 | All Organisms → cellular organisms → Bacteria | 2284
3300005341|Ga0070691_10229636 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 987
3300005353|Ga0070669_100503013 | All Organisms → cellular organisms → Bacteria | 1005
3300005356|Ga0070674_101911130 | Not Available | 539
3300005366|Ga0070659_100609878 | All Organisms → cellular organisms → Bacteria | 938
3300005406|Ga0070703_10069698 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1172
3300005440|Ga0070705_101359881 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus amargosae | 591
3300005441|Ga0070700_101685155 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus normandii | 544
3300005444|Ga0070694_100972185 | Not Available | 704
3300005445|Ga0070708_100026332 | All Organisms → cellular organisms → Bacteria | 4976
3300005456|Ga0070678_100946450 | All Organisms → cellular organisms → Bacteria | 789
3300005458|Ga0070681_10355135 | All Organisms → cellular organisms → Bacteria | 1376
3300005459|Ga0068867_100172482 | All Organisms → cellular organisms → Bacteria | 1714
3300005468|Ga0070707_101028058 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 789
3300005471|Ga0070698_101191161 | All Organisms → cellular organisms → Bacteria | 711
3300005529|Ga0070741_10000361 | All Organisms → cellular organisms → Bacteria | 170100
3300005547|Ga0070693_100068208 | All Organisms → cellular organisms → Bacteria | 2087
3300005549|Ga0070704_100061437 | All Organisms → cellular organisms → Bacteria | 2689
3300005549|Ga0070704_100101679 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2167
3300005549|Ga0070704_101772025 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus amargosae | 571
3300005555|Ga0066692_10719928 | Not Available | 617
3300005564|Ga0070664_100958004 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 803
3300005616|Ga0068852_101732125 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Geodermatophilus → Geodermatophilus normandii | 647
3300005713|Ga0066905_100024476 | All Organisms → cellular organisms → Bacteria | 3295
3300005713|Ga0066905_100402283 | All Organisms → cellular organisms → Bacteria | 1111
3300005842|Ga0068858_100076480 | All Organisms → cellular organisms → Bacteria | 3109
3300005843|Ga0068860_100399575 | All Organisms → cellular organisms → Bacteria | 1359
3300005876|Ga0075300_1016827 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → unclassified Micromonospora → Micromonospora sp. ATA51 | 890
3300005876|Ga0075300_1022862 | Not Available | 800
3300005878|Ga0075297_1010649 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 889
3300005878|Ga0075297_1029102 | Not Available | 621
3300005879|Ga0075295_1002815 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1418
3300005937|Ga0081455_10005497 | All Organisms → cellular organisms → Bacteria | 13890
3300006041|Ga0075023_100143103 | All Organisms → cellular organisms → Bacteria | 873
3300006047|Ga0075024_100133672 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1117
3300006172|Ga0075018_10291898 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → Leptolinea | 802
3300006755|Ga0079222_10052573 | All Organisms → cellular organisms → Bacteria | 1902
3300006755|Ga0079222_10447775 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 922
3300006804|Ga0079221_10109892 | All Organisms → cellular organisms → Bacteria | 1372
3300006804|Ga0079221_10325613 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 915
3300006806|Ga0079220_10218434 | All Organisms → cellular organisms → Bacteria | 1115
3300006845|Ga0075421_100552087 | All Organisms → cellular organisms → Bacteria | 1361
3300006847|Ga0075431_100522873 | All Organisms → cellular organisms → Bacteria | 1176
3300006852|Ga0075433_10399341 | All Organisms → cellular organisms → Bacteria | 1213
3300006854|Ga0075425_100105045 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3215
3300006854|Ga0075425_101842434 | All Organisms → cellular organisms → Bacteria | 679
3300006904|Ga0075424_100456118 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1364
3300006954|Ga0079219_10015928 | All Organisms → cellular organisms → Bacteria | 2669
3300007255|Ga0099791_10049135 | All Organisms → cellular organisms → Bacteria | 1886
3300007258|Ga0099793_10379370 | Not Available | 693
3300009088|Ga0099830_10041508 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 3192
3300009088|Ga0099830_10291843 | All Organisms → cellular organisms → Bacteria | 1300
3300009089|Ga0099828_10001811 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 14568
3300009090|Ga0099827_10121571 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2104
3300009090|Ga0099827_10592892 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 955
3300009143|Ga0099792_10225832 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1078
3300009147|Ga0114129_10030580 | All Organisms → cellular organisms → Bacteria | 7619
3300009174|Ga0105241_10006294 | All Organisms → cellular organisms → Bacteria | 8755
3300009553|Ga0105249_10058975 | All Organisms → cellular organisms → Bacteria | 3519
3300009810|Ga0105088_1081304 | Not Available | 581
3300010358|Ga0126370_10447173 | All Organisms → cellular organisms → Bacteria | 1076
3300010359|Ga0126376_10011171 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 5602
3300010360|Ga0126372_10711699 | All Organisms → cellular organisms → Bacteria | 982
3300010362|Ga0126377_10939397 | Not Available | 929
3300010371|Ga0134125_10537448 | All Organisms → cellular organisms → Bacteria | 1294
3300010391|Ga0136847_13501713 | Not Available | 579
3300010397|Ga0134124_10057867 | All Organisms → cellular organisms → Bacteria | 3274
3300010397|Ga0134124_10075614 | All Organisms → cellular organisms → Bacteria | 2884
3300010399|Ga0134127_11182195 | Not Available | 831
3300010400|Ga0134122_10242283 | All Organisms → cellular organisms → Bacteria | 1519
3300010400|Ga0134122_10247652 | All Organisms → cellular organisms → Bacteria | 1504
3300010403|Ga0134123_10055281 | All Organisms → cellular organisms → Bacteria | 3004
3300011119|Ga0105246_10682587 | All Organisms → cellular organisms → Bacteria | 898
3300011270|Ga0137391_10003187 | All Organisms → cellular organisms → Bacteria | 12426
3300011271|Ga0137393_10388413 | All Organisms → cellular organisms → Bacteria | 1194
3300011419|Ga0137446_1022825 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1287
3300011429|Ga0137455_1105360 | Not Available | 828
3300012035|Ga0137445_1079163 | Not Available | 661
3300012040|Ga0137461_1082550 | All Organisms → cellular organisms → Bacteria | 906
3300012096|Ga0137389_10001703 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 12996
3300012096|Ga0137389_10141234 | All Organisms → cellular organisms → Bacteria | 1962
3300012189|Ga0137388_10017442 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 5303
3300012202|Ga0137363_10313737 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1290
3300012202|Ga0137363_11565540 | Not Available | 551
3300012361|Ga0137360_10100248 | All Organisms → cellular organisms → Bacteria | 2214
3300012363|Ga0137390_11444422 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 630
3300012685|Ga0137397_10094926 | All Organisms → cellular organisms → Bacteria | 2181
3300012922|Ga0137394_11452413 | Not Available | 546
3300012923|Ga0137359_10813025 | Not Available | 809
3300012927|Ga0137416_11014930 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 742
3300012927|Ga0137416_11820170 | Not Available | 556
3300012929|Ga0137404_10345103 | All Organisms → cellular organisms → Bacteria | 1302
3300012930|Ga0137407_10066400 | All Organisms → cellular organisms → Bacteria | 2989
3300012931|Ga0153915_10894782 | Not Available | 1032
3300012944|Ga0137410_10468930 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1025
3300012955|Ga0164298_10046836 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2053
3300012958|Ga0164299_10068361 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1734
3300012986|Ga0164304_10197603 | All Organisms → cellular organisms → Bacteria | 1310
3300013100|Ga0157373_10009169 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7317
3300013102|Ga0157371_10956647 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 652
3300013105|Ga0157369_10303430 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1661
3300014497|Ga0182008_10469629 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 687
3300014745|Ga0157377_10573543 | Not Available | 801
3300014873|Ga0180066_1011329 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1533
3300014877|Ga0180074_1029224 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1110
3300014884|Ga0180104_1066173 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division NC10 | 989
3300014885|Ga0180063_1008162 | All Organisms → cellular organisms → Bacteria | 2796
3300015245|Ga0137409_10191824 | All Organisms → cellular organisms → Bacteria | 1845
3300015254|Ga0180089_1105704 | Not Available | 585
3300015264|Ga0137403_10202400 | All Organisms → cellular organisms → Bacteria | 1917
3300015264|Ga0137403_11203799 | Not Available | 603
3300015371|Ga0132258_10729438 | All Organisms → cellular organisms → Bacteria | 2496
3300015371|Ga0132258_13895100 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1015
3300015374|Ga0132255_101560499 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1000
3300017930|Ga0187825_10142069 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 846
3300017936|Ga0187821_10008493 | All Organisms → cellular organisms → Bacteria | 3482
3300017936|Ga0187821_10092374 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1110
3300017939|Ga0187775_10042973 | All Organisms → cellular organisms → Bacteria | 1350
3300017993|Ga0187823_10115904 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 816
3300017997|Ga0184610_1007349 | All Organisms → cellular organisms → Bacteria | 2668
3300018029|Ga0187787_10261863 | Not Available | 635
3300018031|Ga0184634_10142837 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1071
3300018032|Ga0187788_10352588 | Not Available | 608
3300018053|Ga0184626_10034967 | All Organisms → cellular organisms → Bacteria | 2081
3300018054|Ga0184621_10042963 | All Organisms → cellular organisms → Bacteria | 1487
3300018056|Ga0184623_10273239 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 768
3300018056|Ga0184623_10460660 | Not Available | 548
3300018074|Ga0184640_10213700 | All Organisms → cellular organisms → Bacteria | 872
3300018075|Ga0184632_10046115 | All Organisms → cellular organisms → Bacteria | 1876
3300018076|Ga0184609_10300907 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 750
3300018076|Ga0184609_10486907 | Not Available | 563
3300018078|Ga0184612_10046876 | Not Available | 2249
3300018079|Ga0184627_10275060 | All Organisms → cellular organisms → Bacteria | 885
3300018084|Ga0184629_10383815 | Not Available | 739
3300018084|Ga0184629_10478662 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 651
3300018422|Ga0190265_10299892 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1677
3300018422|Ga0190265_11030680 | All Organisms → cellular organisms → Bacteria | 944
3300018422|Ga0190265_11123472 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 906
3300018429|Ga0190272_10028903 | All Organisms → cellular organisms → Bacteria | 2983
3300018429|Ga0190272_10190742 | All Organisms → cellular organisms → Bacteria | 1468
3300019255|Ga0184643_1123413 | Not Available | 733
3300019360|Ga0187894_10298987 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 744
3300019377|Ga0190264_10283353 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 989
3300019458|Ga0187892_10000855 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 75967
3300019458|Ga0187892_10020011 | All Organisms → cellular organisms → Bacteria | 6507
3300019487|Ga0187893_10153491 | All Organisms → cellular organisms → Bacteria | 1868
3300019487|Ga0187893_10356062 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1011
3300019879|Ga0193723_1008023 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3451
3300019881|Ga0193707_1052615 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1295
3300019883|Ga0193725_1034366 | All Organisms → cellular organisms → Bacteria | 1343
3300019999|Ga0193718_1036492 | All Organisms → cellular organisms → Bacteria | 1084
3300020002|Ga0193730_1022148 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1834
3300020003|Ga0193739_1047912 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1101
3300020003|Ga0193739_1078522 | Not Available | 840
3300020004|Ga0193755_1045670 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1435
3300020061|Ga0193716_1204691 | Not Available | 747
3300020082|Ga0206353_10904957 | All Organisms → cellular organisms → Bacteria | 2863
3300021073|Ga0210378_10028108 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2249
3300021073|Ga0210378_10191150 | Not Available | 783
3300021081|Ga0210379_10055555 | All Organisms → cellular organisms → Bacteria | 1586
3300021090|Ga0210377_10008695 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7992
3300021168|Ga0210406_10321201 | All Organisms → cellular organisms → Bacteria | 1255
3300021403|Ga0210397_11467332 | Not Available | 530
3300022756|Ga0222622_10905292 | Not Available | 647
3300025160|Ga0209109_10299989 | Not Available | 767
3300025165|Ga0209108_10139644 | Not Available | 1284
3300025324|Ga0209640_10017854 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 6188
3300025569|Ga0210073_1014015 | All Organisms → cellular organisms → Bacteria | 1555
3300025901|Ga0207688_10017353 | All Organisms → cellular organisms → Bacteria | 3911
3300025903|Ga0207680_10625426 | All Organisms → cellular organisms → Bacteria | 770
3300025907|Ga0207645_10173851 | All Organisms → cellular organisms → Bacteria | 1412
3300025907|Ga0207645_10185695 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1365
3300025908|Ga0207643_10001338 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 14254
3300025910|Ga0207684_10000570 | All Organisms → cellular organisms → Bacteria | 44869
3300025910|Ga0207684_10108007 | All Organisms → cellular organisms → Bacteria | 2381
3300025912|Ga0207707_10502894 | All Organisms → cellular organisms → Bacteria | 1033
3300025912|Ga0207707_10792967 | All Organisms → cellular organisms → Bacteria | 790
3300025912|Ga0207707_11320913 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 579
3300025917|Ga0207660_10601801 | All Organisms → cellular organisms → Bacteria | 895
3300025919|Ga0207657_10001153 | All Organisms → cellular organisms → Bacteria | 28116
3300025922|Ga0207646_10000408 | All Organisms → cellular organisms → Bacteria | 57610
3300025922|Ga0207646_10268432 | Not Available | 1542
3300025927|Ga0207687_10217740 | All Organisms → cellular organisms → Bacteria | 1502
3300025931|Ga0207644_10546081 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 959
3300025942|Ga0207689_10000275 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 46532
3300025960|Ga0207651_11462977 | Not Available | 615
3300025965|Ga0210090_1006731 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1569
3300025986|Ga0207658_10192481 | Not Available | 1696
3300026001|Ga0208000_110283 | Not Available | 598
3300026011|Ga0208532_1005488 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 751
3300026075|Ga0207708_11161586 | Not Available | 674
3300026088|Ga0207641_10700495 | All Organisms → cellular organisms → Bacteria | 997
3300026320|Ga0209131_1031010 | All Organisms → cellular organisms → Bacteria | 3132
3300026345|Ga0257148_1005228 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → unclassified Micromonospora → Micromonospora sp. ATA51 | 917
3300026351|Ga0257170_1001406 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2372
3300026351|Ga0257170_1005644 | Not Available | 1460
3300026358|Ga0257166_1017587 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 927
3300026360|Ga0257173_1000018 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 5191
3300026360|Ga0257173_1004603 | Not Available | 1378
3300026374|Ga0257146_1006650 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1888
3300026377|Ga0257171_1017610 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1195
3300026475|Ga0257147_1000006 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 9428
3300026480|Ga0257177_1018135 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → unclassified Micromonospora → Micromonospora sp. ATA51 | 984
3300026482|Ga0257172_1015562 | Not Available | 1302
3300026490|Ga0257153_1002540 | All Organisms → cellular organisms → Bacteria | 3483
3300026494|Ga0257159_1000080 | All Organisms → cellular organisms → Bacteria | 7609
3300026496|Ga0257157_1022526 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → Leptolinea | 1027
3300026497|Ga0257164_1001754 | All Organisms → cellular organisms → Bacteria | 1953
3300026499|Ga0257181_1013730 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1128
3300026508|Ga0257161_1021624 | All Organisms → cellular organisms → Bacteria | 1232
3300026514|Ga0257168_1016386 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1487
3300026515|Ga0257158_1038178 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → Leptolinea | 863
3300026535|Ga0256867_10080276 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1278
3300026557|Ga0179587_10026030 | All Organisms → cellular organisms → Bacteria | 3202
3300027068|Ga0209898_1018028 | Not Available | 872
3300027650|Ga0256866_1008682 | All Organisms → cellular organisms → Bacteria | 2509
3300027725|Ga0209178_1289533 | Not Available | 600
3300027775|Ga0209177_10178036 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 742
3300027787|Ga0209074_10141608 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 857
3300027787|Ga0209074_10213626 | Not Available | 731
(restricted) 3300027799|Ga0233416_10029887 | All Organisms → cellular organisms → Bacteria | 1817
3300027846|Ga0209180_10002579 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9078
3300027862|Ga0209701_10165063 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1343
3300027862|Ga0209701_10352801 | Not Available | 832
3300027875|Ga0209283_10943083 | Not Available | 519
3300027882|Ga0209590_10048867 | All Organisms → cellular organisms → Bacteria | 2347
3300027894|Ga0209068_10017797 | All Organisms → cellular organisms → Bacteria | 3436
3300027903|Ga0209488_10047685 | All Organisms → cellular organisms → Bacteria | 3150
3300027909|Ga0209382_10349956 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1655
3300027910|Ga0209583_10108468 | All Organisms → cellular organisms → Bacteria | 1081
3300028047|Ga0209526_10017886 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4926
3300028536|Ga0137415_10049613 | All Organisms → cellular organisms → Bacteria | 4076
3300028587|Ga0247828_11026253Not Available541Open in IMG/M
3300028592|Ga0247822_10144825All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1745Open in IMG/M
3300028592|Ga0247822_10595608All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria885Open in IMG/M
3300028596|Ga0247821_10627272Not Available696Open in IMG/M
3300028673|Ga0257175_1018475All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia1136Open in IMG/M
3300028673|Ga0257175_1036818All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium867Open in IMG/M
3300028718|Ga0307307_10199960Not Available632Open in IMG/M
3300028792|Ga0307504_10045296All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1234Open in IMG/M
3300028811|Ga0307292_10494964Not Available524Open in IMG/M
3300028812|Ga0247825_10347040All Organisms → cellular organisms → Bacteria1043Open in IMG/M
3300028812|Ga0247825_11045324Not Available594Open in IMG/M
3300028828|Ga0307312_11138868Not Available516Open in IMG/M
3300030336|Ga0247826_10123354All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1659Open in IMG/M
3300030619|Ga0268386_10033350All Organisms → cellular organisms → Bacteria4075Open in IMG/M
3300030620|Ga0302046_10232850All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1515Open in IMG/M
(restricted) 3300031150|Ga0255311_1002717All Organisms → cellular organisms → Bacteria → Proteobacteria3330Open in IMG/M
(restricted) 3300031150|Ga0255311_1076075Not Available716Open in IMG/M
3300031152|Ga0307501_10157295Not Available622Open in IMG/M
(restricted) 3300031197|Ga0255310_10040363All Organisms → cellular organisms → Bacteria1212Open in IMG/M
3300031229|Ga0299913_10132671All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2451Open in IMG/M
3300031455|Ga0307505_10185193All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria958Open in IMG/M
3300031720|Ga0307469_10213276All Organisms → cellular organisms → Bacteria → Proteobacteria1519Open in IMG/M
3300031720|Ga0307469_10798529All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium865Open in IMG/M
3300031720|Ga0307469_11614412Not Available623Open in IMG/M
3300031740|Ga0307468_100190783All Organisms → cellular organisms → Bacteria1373Open in IMG/M
3300031740|Ga0307468_100406028All Organisms → cellular organisms → Bacteria1042Open in IMG/M
3300031820|Ga0307473_10232981Not Available1116Open in IMG/M
3300031943|Ga0310885_10420810All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria715Open in IMG/M
3300032180|Ga0307471_100157161All Organisms → cellular organisms → Bacteria2199Open in IMG/M
3300032180|Ga0307471_101004231Not Available1001Open in IMG/M
3300032180|Ga0307471_101251330All Organisms → cellular organisms → Bacteria905Open in IMG/M
3300032770|Ga0335085_10000667All Organisms → cellular organisms → Bacteria90272Open in IMG/M
3300033407|Ga0214472_10585978All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1026Open in IMG/M
3300033417|Ga0214471_10023055All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales4979Open in IMG/M
3300033432|Ga0326729_1011214All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium1562Open in IMG/M
3300033475|Ga0310811_11459798Not Available515Open in IMG/M
3300033551|Ga0247830_10062306All Organisms → cellular organisms → Bacteria2500Open in IMG/M
3300033551|Ga0247830_10067539All Organisms → cellular organisms → Bacteria2416Open in IMG/M
3300034090|Ga0326723_0027032All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales2365Open in IMG/M
3300034155|Ga0370498_163109Not Available542Open in IMG/M
3300034817|Ga0373948_0131977Not Available611Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.63%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.97%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.98%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 4.65%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.32%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.32%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.66%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 2.33%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.33%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 2.33%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 2.33%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.99%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.99%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.66%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.66%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.33%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.33%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.33%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.33%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.33%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.00%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.00%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.00%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.00%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.00%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.00%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.00%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.66%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.66%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.66%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.66%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.66%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.66%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.66%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.66%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.66%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.33%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.33%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.33%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.33%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.33%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.33%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.33%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.33%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.33%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.33%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.33%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.33%
Corn, Switchgrass And Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.33%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.33%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.33%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.33%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.33%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.33%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type | IMG/M Link
2162886012Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300000443Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemlyEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002122Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2EnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300003911Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004025Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004052Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005331Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaGHost-AssociatedOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005337Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3L metaGEnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005353Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaGHost-AssociatedOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005366Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-3 metaGHost-AssociatedOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005547Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005564Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaGHost-AssociatedOpen in IMG/M
3300005616Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2Host-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005876Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401EnvironmentalOpen in IMG/M
3300005878Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104EnvironmentalOpen in IMG/M
3300005879Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006172Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009810Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011429Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012040Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT746_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013100Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaGHost-AssociatedOpen in IMG/M
3300013102Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaGHost-AssociatedOpen in IMG/M
3300013105Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaGHost-AssociatedOpen in IMG/M
3300014497Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-129_1 metaGHost-AssociatedOpen in IMG/M
3300014745Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaGHost-AssociatedOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014877Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT366_16_10DEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017993Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018029Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MGEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019255Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300019999Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2EnvironmentalOpen in IMG/M
3300020061Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1EnvironmentalOpen in IMG/M
3300020082Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-4 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025569Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025901Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4 (SPAdes)Host-AssociatedOpen in IMG/M
3300025903Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025908Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025919Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025931Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025960Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025965Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025986Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026001Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 (SPAdes)EnvironmentalOpen in IMG/M
3300026011Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_301 (SPAdes)EnvironmentalOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026345Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-AEnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026374Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-AEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026475Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-AEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026490Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-AEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027068Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027650Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeqEnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028596Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glycerol_Day14EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
MBSR1b_0047.000014602162886012Miscanthus RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
F12B_1064956723300000443SoilMKRFAMTVGLGAGLLLLAGHADAQWRYTDDRGVTKVTQYKIDIPSAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLVQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSTGYGQDGYGTFGTISVR*
F24TB_1010408533300000550SoilMKRFAMTVGLGAGLLLLAGHADAQWRYTDDRGVTKVTQYKIDIPTAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLIQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSSGYGQDGYGTFGNIMVR*
F14TC_10048076623300000559SoilMKRFAMTVGLGAGLLLLAGHADAQWRYTDDRGVTKVTQYKIDIPSAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLIQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSTGYGQDGYGTFGTISVR*
JGI10216J12902_10191029923300000956SoilLLTGHADAQWRYTDDRGVTRVTQYKIDVPSAYRDAAEWIGPVGIGKPALSADQVLAAQRWEAIQRIVNAEAGLIQYRNMTPSPPLRDPGGPNKSMATMCIAGELRVMTSPGSWKVAGACSGGFSTGYGTDGYGSFGGMTIR*
C687J26623_1003152333300002122SoilMKRWATAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPIGIGKPALSADQIRAAQLWDAIQRIIAAEAGLLQFRNVAAPAPPRPESGTAGKPMATMCIAGELRAMTSPGSWRTVGGCAPGFSTGYGTDGYGAVGGFIVR*
soilH2_10032192183300003324Sugarcane Root And Bulk SoilMGQWGMGRWGIVTTLGGALLLLAGQAEAQWRYTDDKGTSRVTQYKIDVPSDFRDGAEWIGPVGPGKPGLSEDQHRAAQRSEANRRIIAAEAGLVRYRNMPAAARPAPDPGGPPKAMATMCIAGQQRVMTSPGSWKVTGSCSSDFSTGYSGGYPSYGGWGGYPTH*
JGI25405J52794_1007103913300003911Tabebuia Heterophylla RhizosphereMKRFATXLGLGAALLVLAGQADAQWRYTDDKGTTRVTQYKIDIPTAHRDAAEWIGPVGVGKPALSADQALTAKRWEAIERLGIAEAGLVQFRNMPPPRALRDPGGSPKSMATMCMAGELRVMTSPGNWKVLGPCAGGFSTGYGSDGYGSFGGIMIR*
Ga0055435_1001336423300003994Natural And Restored WetlandsMKRSAMAVGLSAGLLLLAGPADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPIGIGKPALSADQVRAAQHWDAIRRIIAAEAGLLQFRNAATPAPPRVVSDASGRATTTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGFTTR*
Ga0055437_1003442213300004009Natural And Restored WetlandsMKRSAMAVGLSAGLLLLAGPADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPIGIGKPALSADQVRAAQHGDAIRRIIAAEAGLLQFRNTATPAPPRVVSDASGRATTTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGFTTR*
Ga0055433_1002562413300004025Natural And Restored WetlandsMKRSAMAVGLSAGLLLLAGPADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPIGIGKPALSADQVRAAQHWDAIRRIIAAEAGLLQFRNAATPAPPRVVSDASGRATTTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGPVGGFTTR*
Ga0055490_1009400423300004052Natural And Restored WetlandsMKRMAVAIGLSAGLLLLTGPAGAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPVGIGKPALSADQILRAERWDAIQRIVNAEAGLLLLRNAAAPTPLRDPGPGGKPMATMCIAGELRAMTSPG
Ga0055500_1000740723300004062Natural And Restored WetlandsMKRSAMAVGLGAGLLLLAGPADAQWRYTDDRGVSKVTQYKLDVPAPHRDAAEWIGPIGVGKPALSADQVRAAQHWEAIRRIIAAEAGLLQFQNAATPAPPRAVSDAAGRPLTTMCVAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGFTIR*
Ga0062589_10056284623300004156SoilLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0062589_10100232113300004156SoilMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0062590_10059602423300004157SoilDAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0063356_10006678363300004463Arabidopsis Thaliana RhizosphereMNRLAMAVGFGAGLLLAAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAQPPPLPRVDAGTSDRSNPTMCIAGELRSMTSPG
Ga0063356_10229775713300004463Arabidopsis Thaliana RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0062594_10193835313300005093SoilMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSP
Ga0070676_1003661843300005328Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSW
Ga0070670_10010611043300005331Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSW
Ga0066388_10231528923300005332Tropical Forest SoilMKPILTLVGLAVSLMLLAAQADAQWRYVDKNGVSRVTQYKLDVPAPYRDLAEWVGPIGIGRPALSADQIRAAQLSEANRRIIDAEAEMIQFRNMQPPPKPYVDTTPPRPMASMCIAGEQRIMTSPGSWKTVGSCATGFSTNYGTAGYGTSSYGGFAPR*
Ga0066388_10848177813300005332Tropical Forest SoilRAGVASGHATSDGGTSMGRWGMVVMFGGALLLLAGQADAQWRYTDDKGTSRVTQYKIDIPTELRDGAEWIGPVGPGKPALSEGQLRAAQREDATRRIVAADAGLVRYRNMPAPARPAPDPGGPAKTMATMCIAGQQRVMTSPGSWKVTGTCSSDFSTGYNGGYPTYGGWG
Ga0068869_10005172023300005334Miscanthus RhizosphereMGRWGMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0070680_10016956933300005336Corn RhizosphereMNRLAMAVGFGAGLLLAAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAQPPPLPRVDAGTSDRSNPTMCIAGELRSMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVNVR*
Ga0070682_10000692113300005337Corn RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPA
Ga0070660_10019977023300005339Corn RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIAAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0070691_1003760633300005341Corn, Switchgrass And Miscanthus RhizosphereMERFTMVVALGLGLFLLAGQADAQWRYTDDKGSSKVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRQAQLADAIRRIVTAEAGLVQFRNVPAPARPAAVPDGPGKPMASMCVSGEQRAMTSPGIWKVVGGCSSDFSSGYGTGGYGTFGATGGYPTHY*
Ga0070691_1022963623300005341Corn, Switchgrass And Miscanthus RhizosphereGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0070669_10050301323300005353Switchgrass RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGG
Ga0070674_10191113013300005356Miscanthus RhizosphereLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVITSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0070659_10060987823300005366Corn RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYG
Ga0070703_1006969823300005406Corn, Switchgrass And Miscanthus RhizosphereMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPELYRDLAEWIGPVGIGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPGPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH*
Ga0070705_10135988113300005440Corn, Switchgrass And Miscanthus RhizosphereMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRDGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVKYRNMPAPAKPAPDPGGPSKPMATMCIAGEQRAMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVNVR*
Ga0070700_10168515513300005441Corn, Switchgrass And Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYG
Ga0070694_10097218513300005444Corn, Switchgrass And Miscanthus RhizosphereMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPELYRDLAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPGPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH*
Ga0070708_10002633223300005445Corn, Switchgrass And Miscanthus RhizosphereMKRFAMAATLGTGLLLSAWQVDAQWRYTDDRGTNKVTQYKIDVPASSRDTAVWIGPIGIGNPGLSADQVRAAQLWDAVRRIVAAEAGLLQFKNVQAPTSPRWDSGAAGKPMATMCIAGELRTMTSPGSWKVVGACGAGFSTGYGTDGYGSFGGFSVR*
Ga0070678_10094645013300005456Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKV
Ga0070681_1035513523300005458Corn RhizosphereMNRLAMAVGFGAGLLLAAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAQPPPLPRVDAGTSDRSNPTMCIAGELRSMTSPGTWKVVGGCSTGPSGFSTGYGTDGYGSFGGVNVR*
Ga0068867_10017248243300005459Miscanthus RhizosphereDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0070707_10102805813300005468Corn, Switchgrass And Miscanthus RhizosphereGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH*
Ga0070698_10119116113300005471Corn, Switchgrass And Miscanthus RhizosphereAQWRYTDDKGANRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSVAGPAAPPADPGAGRPLATMCIAGELRNMTSPGIWKVVGGCATGPSGFSTGYGTEGYGSFGGILVR*
Ga0070741_100003611003300005529Surface SoilMVLVVGGALALLAGPAAAQWRYTDDEGKSRVTQYKIDIPKHLRDGAEWIGPVGPGKPGLSAEQKQKAQREEANRRLIAAEAGLIGYRHLPPPARPAPDRGGPDRILATMCIAGQQRVMTSPGSWKVVGSCSSDFSTGYVGGYPTLGGWGGYPTH*
Ga0070693_10006820823300005547Corn, Switchgrass And Miscanthus RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGIDGYGSFGGIMVR*
Ga0070704_10006143723300005549Corn, Switchgrass And Miscanthus RhizosphereMNRFAMAAGLGAGLLLLAGQAEAQWRYTDDKGANRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIRRLVAAEAELLRYKSVAGPAAPPADPGAGSPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGILVR*
Ga0070704_10010167933300005549Corn, Switchgrass And Miscanthus RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGYFGGIMVR*
Ga0070704_10177202513300005549Corn, Switchgrass And Miscanthus RhizosphereMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPEPLRDGAEWIGPVGVGKPGLSADQIRAAELSDAIRRLVAAEAGLVKYRNMPAPAKPAPDPGGPSKPMATMCIAGEQRAMTSPGIWKVVGGCNGDFSTGYGTGGYGTFGAT
Ga0066692_1071992813300005555SoilMERLAMALALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH*
Ga0070664_10095800423300005564Corn RhizosphereMGRWGMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGY
Ga0068852_10173212513300005616Corn RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGW
Ga0066905_10002447643300005713Tropical Forest SoilMKRFAMTVGLGAGLLLLAGQADAQWRYTDDRGVTKVTQYKIDIPTAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLIQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSTGYGQDGYGTFGGIMVR*
Ga0066905_10040228323300005713Tropical Forest SoilGLGAGLLLLTGHADAQWRYTDDRGVTRVTQYKIDVPSAYRDAAEWIGPVGIGKPALSADQILAAQRWEAIQRIVNAEAGLIQYRNMTPPPPMRDPGGPNKSLATMCIAGELRVMTSPGSWKVAGACAGGFSTGYGTDGYGSFGGITIR*
Ga0068858_10007648033300005842Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0068860_10039957523300005843Switchgrass RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGY
Ga0075300_101682713300005876Rice Paddy SoilMQRFAMTVGLGVGLLLLAGQADAQWRYTDDKGATRVTQYKLHVPPPYRDAAEWIGPIGIGKPALSEDRILAEQHWNAIRRIIDAEAGLLQFKSAAAPAPPRVSSGAAGKPMATMCIAGQLRAMT
Ga0075300_102286213300005876Rice Paddy SoilMERFAMSVALGLGLFLVAGQADAQWRYTDDKGTSRVTQYKIDVPESYRDAAEWIGPVGIGKPALSADQIRQAQLADAIRRIVTAEAGLVQFRNVPAPARPAAVPDGPGKPMASMCVSGEQRAMTSPGIWKVVGGCSSDFSSGYGTGGYGTFGATGGYPTHY*
Ga0075297_101064913300005878Rice Paddy SoilQADAQWRYTDDKGATRVTQYKLHVPPPYRDAAEWIGPIGIGKPALSEDRILAEQHWNAIRRIIDAEAGLLQFKSAAAPAPPRVSSGAAGKPMATMCIAGQLRAMTSPGSWKVVGGCSPDFSTGYGTDGYGSVGGFMIR*
Ga0075297_102910213300005878Rice Paddy SoilMERFTMVVALGLGLFLLAGQADAQWRYTDDKGSSKVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRQAQLADAIRRIVTAEAGLVQFRNVPAPARPAAVPDGPGKPMASMCVSGEQRAMTSPGIWKVVGGCSSDFSTGYGPGGYGTFGATGGYPTH*
Ga0075295_100281523300005879Rice Paddy SoilMQRFAMTVGLGVGLLLLAGQADAQWRYTDDKGATRVTQYKLHVPPPYRDAAEWIGPIGIGKPALSEDRILAEQHWNAIRRIIDAEAGLLQFKSAAAPAPPRVSSGAAGKPMATMCIAGQLRAMTSPGSWKVVGGCSPDFSTGYGTDGYGSVGGFMIR*
Ga0081455_1000549783300005937Tabebuia Heterophylla RhizosphereMKRFATTLGLGAALLVLAGQADAQWRYTDDKGTTRVTQYKIDIPTAHRDAAEWIGPVGVGKPALSADQALTAKRWEAIERLGIAEAGLVQFRNMPPPRALRDPGGSPKSMATMCMAGELRVMTSPGNWKVLGPCAGGFSTGYGSDGYGSFGGIMIR*
Ga0075023_10014310323300006041WatershedsMERLAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEAHRDAAEWIGPVGIGKPALSAEQVRAAQLSDAYRRIGRAEAGLVQLRNMPAPARPAPDPGGPTKAMATMCVSGERRVMTSP
Ga0075024_10013367223300006047WatershedsMERLAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEAHRDAAEWIGPVGIGKPALSAEQVRAAQLSDAYRRIGRAEAGLVQLRNMPAPARPAPDPGGPTKAMATMCVSGERRVMTSPGIWKVVGRCSSDFSTGYSTGGYGTFGATGGYPTH*
Ga0075018_1029189813300006172WatershedsMERLAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEAHRDAAEWIGPVGIGKPALSAEQVRAAQLSDAYRRIGRAEAGLVQLRNMPAPARPAPDPGGPTKAMATMCVSGERRVMTSPGIWKVVGRCSSDFSTGYS
Ga0079222_1005257323300006755Agricultural SoilMGQWGMGRWGIVTTLGGALLLLAGQAEAQWRYTDDKGMSRVTQYKIDVPSDFRDGAEWIGPVGPGKPGLSEDQHRAAQRSEANRRIIAAEAGLVRYRNMPAAARPAPDPGGPPKAMATMCIAGQQRVMTSPGSWKVTGSCSSDFSTGYSGGYPSYGSWGGYPTH*
Ga0079222_1044777523300006755Agricultural SoilAMAAALGAGLLLLAGSADAQWRYTDDKGVSRVTQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIATAEAGLVQFRNMPAPVKPAAAPGGPDKPMATMCISGEQRAMTSPGIWKVVGGCNADFSTGYSTGGYGTSGPSGGYPTH*
Ga0079221_1010989223300006804Agricultural SoilMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0079221_1032561323300006804Agricultural SoilMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRDGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVKYRNMPAPAKPAPDPGGPSKPMTTMCIAGQQRAMTSPGIWKVVGGCNGDF
Ga0079220_1021843433300006806Agricultural SoilAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0075421_10055208733300006845Populus RhizosphereMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSYRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAAEAALLQYRSVAAPAPPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCPTGPSGFSTGYGTDGYGSFGGITVR*
Ga0075431_10052287323300006847Populus RhizosphereMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0075433_1039934123300006852Populus RhizosphereMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWMGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0075425_10010504523300006854Populus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAAH*
Ga0075425_10184243413300006854Populus RhizosphereMKRFAMAATLGTGLLLSAWQVDTQWRHTDDRGTSKVTQYKIDVPAPYRDTAEWIGPIGIGNPGLSADQVRAAQLWDAVRRIVAAEAGLLQFKNVQAPTSPRWDSGAAGKPMATMCIAGELRAMTSPGSWKVVGACGAGFSTGYGTDGYGSFGGFSVR*
Ga0075424_10045611813300006904Populus RhizosphereMGRWGTVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGA
Ga0079219_1001592843300006954Agricultural SoilMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGY
Ga0099791_1004913533300007255Vadose Zone SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYPTH*
Ga0099793_1037937023300007258Vadose Zone SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH*
Ga0099830_1004150833300009088Vadose Zone SoilMRPRQEIHMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR*
Ga0099830_1029184323300009088Vadose Zone SoilMKRFAMAVGLGTGLLLSAWQADAQWRYTDDKGTSRVTQYKLDVPAPHRDAAVWIGPTGIGNPALSADQTRAAQLWDAVRRIVAAEAGLLQFQNVQAPTPPRLDSGAAGKPRATMCIAGELRAMTSPGSWTVVGACGAGFSTGYGTDGYGSFGGFTVR*
Ga0099828_10001811163300009089Vadose Zone SoilMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFRNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR*
Ga0099827_1012157133300009090Vadose Zone SoilMERLAMALALGLGLLLSAGQADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPTGIGNPGLSADQIRAAQLWDAVRRIAAAEAGLLQFTNVQAPTPLRWDSGAAGKPMATMCIAGELRAMTSPGTWKVVGACGAGFSTGYGTDGYGSVGGFTVR*
Ga0099827_1059289213300009090Vadose Zone SoilMNRFAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWNAVQRIVAAEAGLLQFRTVAAAPPLPRVDTGASGRPTPTMCIAGELRAMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH*
Ga0099792_1022583223300009143Vadose Zone SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH*
Ga0114129_1003058063300009147Populus RhizosphereLVSVGHAAQQEKHMNRFAMAVGLGAGLLLLAGQADAQWRYSDDKGASRVTQYKLDIPTPYRDAAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSVAGPAAPLADPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGILVR*
Ga0105241_10006294103300009174Corn RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0105249_1005897553300009553Switchgrass RhizosphereMTRFAMAVGLGVGLLLLAGQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0105088_108130423300009810Groundwater SandLGTGLLLLAWPADAQWRYTDDKGASKVTQYKLDVPEPYRDAAVWIGPIGIGNPALSADQIRAAQLSDAIRRIVEAEAGLLQFKNAEAPAPPRMASGSAGKPMATMCIAGELRVMTSPGSWKVVGACDAR
Ga0126370_1044717323300010358Tropical Forest SoilMKPILTLVGLAVSLMLLAAQADAQWRYVDKNGVSRVTQYKLDVPAPYRDLAEWVGPIGIGKPALSADQIRAAQLSEANRRIIDAEAGLIQFRNMQPPPKPYVDTTPSRPMASMCIAGEQRIMTSPGSWKSVGSCAAGFSTDYGTAGYGTSSYGGFAPR*
Ga0126376_1001117143300010359Tropical Forest SoilMKPFLTLVGFAVGLFLLAAEADAQWRYVDKNGVSRVTQYKLDVPAPYRDLAEWVGPIGIGKPALSADQIRAAQLSEANRRIIDAEAGLIQFRNMQPPPKPYVDTTPSRPMASMCIAGEQRIMTSPGSWKTVGSCAAGFSSDYGTAGYGTSSYGGFAPR*
Ga0126372_1071169923300010360Tropical Forest SoilMKPILTLVGLAVSLMLLLAGQADAQWRYVDKNGVSRVTQYKLDVPAPYRDLAEWVGPIGIGKPALSADQIRAAQLSEANRRIIDAEAGLIQFRNMQPPPKPYVDTTPSRPMASMCIAGEQRIMTSPGSWKSVGSCAAGFSTDYGTAGYGTSSYGGFAPR*
Ga0126377_1093939723300010362Tropical Forest SoilMKRFAMTVGLGAGLLLLAGQADAQWRYTDDRGVTKVTQYKIDIPTAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLIQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSTG
Ga0134125_1053744823300010371Terrestrial SoilMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAETALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGIDGYGSFGGIMVR*
Ga0136847_1350171313300010391Freshwater SedimentMNRFAMAVGFGAGLLLLAGQADAQWRYTDDKGTSRVTQYKLDIPTPHRDAAEWIGPVGIGKPELSANQIRAAQRWDAIQRIVAAEAALLQYKSVAAPAPPPADPGAGKPLATMCVAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGIMVR*
Ga0134124_1005786723300010397Terrestrial SoilMTRFAVAVGLGVGLLLLAGQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAETALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGIDGYGSFGGIMVR*
Ga0134124_1007561423300010397Terrestrial SoilMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWRVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0134127_1118219523300010399Terrestrial SoilMTRFAVAVGLGVGLLLLAGQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGIDGYGSFGGIMVR*
Ga0134122_1024228313300010400Terrestrial SoilMNRLAMAVGFGAGLLLVAGQADAQWRYTDDKGARKVTQYKLDVPMPYRDAAEWVGPVGIGKPELSADQIRAAQRWDAVRRLVAAEAGLLQFRNVAAPPPLPRVDAGASDRPAPTMCIAGELRTMTS
Ga0134122_1024765223300010400Terrestrial SoilMNRLAMAVGFGAGLLLVAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPELSADQIRAAQHWEAVRRLVAAEAGLLQFRNVASPPPLPRVDAGTSDRSNPTMCIAGELRSMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGITVR*
Ga0134123_1005528143300010403Terrestrial SoilMTRFAMAVGLGVGLLLLAGQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGIDGYGSFGGIMVR*
Ga0105246_1068258713300011119Miscanthus RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFST
Ga0137391_1000318713300011270Vadose Zone SoilMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR*
Ga0137393_1038841323300011271Vadose Zone SoilMRPRQEIHMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTL
Ga0137446_102282523300011419SoilMKRFAMAVGLSAGLLVLAGQADAQWRYTDDKGTSKVTQYKLDVPTPYRDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAAAPPRVDAGASGRPIPTMCIASELRAMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVLIR*
Ga0137455_110536013300011429SoilMGMRPRQEIHMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVILR*
Ga0137445_107916323300012035SoilMKRFAMAVGLSAGLLVLAGQADAQWRYTDDKGTSKVTQYKLDVPTPYRDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIIAAEAGLLQFKNVAAPAPPRVDAGAAGRPMPTMCIAGELRV
Ga0137461_108255013300012040SoilMKRFAMAVGLSAGLLVLAGQADAQWRYTDDKGTSKVTQYKLDVPTPYRDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNLAAAVAPPRVDAGASGRPIPTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGVMVR*
Ga0137389_1000170333300012096Vadose Zone SoilMRPRQEIHMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR*
Ga0137389_1014123423300012096Vadose Zone SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH*
Ga0137388_1001744273300012189Vadose Zone SoilMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFRNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR*
Ga0137363_1031373723300012202Vadose Zone SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTFGPSGGYPTH*
Ga0137363_1156554013300012202Vadose Zone SoilMERFAMAVALGLGLLLWARQADAQWRYTDDKGASRVTQYKIDVPESSRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSP
Ga0137360_1010024823300012361Vadose Zone SoilMERLAMALALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYRPSGGYPTH*
Ga0137390_1144442213300012363Vadose Zone SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTTEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH*
Ga0137397_1009492623300012685Vadose Zone SoilMTQFAMAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPQRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0137394_1145241313300012922Vadose Zone SoilAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPQRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0137359_1081302513300012923Vadose Zone SoilMERFAMAVALGLGLLLWARQADAQWRYTDDKGASRVTQYKIDVPESSRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTFGPSGGYPTH*
Ga0137416_1101493023300012927Vadose Zone SoilMERLAMAVALGLGLLLSAAQAGAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKTMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH*
Ga0137416_1182017013300012927Vadose Zone SoilMNRFAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPVPYRDAAEWIGPIGIGKPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAPPLPRVDASASNRPIPTMCIAGDLRAMTSPGIWKTVGGCSSG
Ga0137404_1034510323300012929Vadose Zone SoilMTRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0137407_1006640033300012930Vadose Zone SoilLLSVGHAAQQEKHMNRFAMAVGLGAGLLLLAGQAEAQWRYTDDKGASRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSAAGPAAPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGILVR*
Ga0153915_1089478213300012931Freshwater WetlandsMERFAMSVALCLGLFFLAGPATAQWRYTDDKGASKVTQYKIDVPEPHRDAAEWIGPVGVGRPALSADQIRQAQLADAIRRIVTAEAGLVQFQNMPAPARPAAVPAGPSKPMASMCIAGEQRAMTSPGIWKVVGGCSSDFSTGYGPGGYGTFGATGGYPIH*
Ga0137410_1046893013300012944Vadose Zone SoilLLSVGHAAQQEKHMNRFAMAVGLGAGLLLLAGQAEAQWRYTDDKGASRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSAAGPAAPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGT
Ga0164298_1004683623300012955SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYGNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0164299_1006836133300012958SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWG
Ga0164304_1019760323300012986SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0157373_10009169103300013100Corn RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIAAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0157371_1095664723300013102Corn RhizosphereVAVGLGVGLLLLARQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0157369_1030343023300013105Corn RhizosphereMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0182008_1046962923300014497RhizosphereGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0157377_1057354313300014745Miscanthus RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADVRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH*
Ga0180066_101132923300014873SoilMKRFAMAVGLSAGLLVLAGQADAQWRYTDDKGTSKVTQYKLDVPTPYRDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAAAPPRGDAGASGRPIPTMCIGGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGVMVR*
Ga0180074_102922423300014877SoilMKRFAMAVGVGAGLLLVAGQADAQWRYIDDKGTSRVTQYKLDIPTPHRDAAEWIGPVGIGKPALSADQIAAAQRWEAIQRIAAAEAGLLQFKNMVAAPPLPRVDAGVSGRSIPVMCITGELRVMTSPGIWKVVGGCSTGASGFSTGYGTAGYGSFGGILFR*
Ga0180104_106617323300014884SoilMKRFAMAVALGTGLLLAAGQADAQWRYTDDKGASRVTQYKLDIPTPYRDAAEWIGPIGIGKPALSADRIAAAQRWEAIQRIVAAEAGLLQFKHVAAVPAPPRVNAGASSRPMPTMCITGELRVMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGIMVR*
Ga0180063_100816243300014885SoilMKRFAMAVALGTGLLLAAGQADAQWRYTDDKGASRVTQYKLDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKHVAAVPAPPRVNAGASSRPMPTMCITGELRVMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGILVR*
Ga0137409_1019182433300015245Vadose Zone SoilMAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0180089_110570413300015254SoilRLEIHMNRFAMAAGLSAGLLLLAGQADAQWRYTDDNGASKVTQYKLDMPTPYSDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAAAPPRVDAGASGRPIPAMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGVMVR*
Ga0137403_1020240023300015264Vadose Zone SoilMTRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPQRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR*
Ga0137403_1120379913300015264Vadose Zone SoilVGHAAQQEKHMNRFAMAVGLGAGLLLLAGQAEAQWRYTDDKGASRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSAAGPAAPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGILVR*
Ga0132258_1072943833300015371Arabidopsis RhizosphereMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAAH*
Ga0132258_1389510023300015371Arabidopsis RhizosphereMERWAMAVALGAGLLLLAGPADAQWRYIDDTGASRVTQYRIDVPESHRDAAEWIGPVGIGKPALSAEQIRQAQVAEAIRRIVTAEAGLVQFRNMPVPAKPAAVPDGPRKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTYGASGGYPTH*
Ga0132255_10156049923300015374Arabidopsis RhizosphereMERWAMAVALGAGLLLLAGPADAQWRYIDDTGASRVTQYRIDVPESHRDAAEWIGPVGIGKPALSADQIRQAQVAEAIRRIVTAEAGLVQFRNMPVPAKPAAVPDGPRKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTYGASGGYPTH*
Ga0187825_1014206923300017930Freshwater SedimentALGSGLLLLAGSADAQWRYTDDKGVSRVTQYKIDVPEPHRDAAEWVGPVGIGKPALSADQIRQAQLVEAIRRIVTAEAGLVQFRNMPAPVKPAAAPGGPDKPMATMCISGEQRAMTSPGIWKVVGGCNADFSTGYSTGGYGTSGPSGGYPTH
Ga0187821_1000849323300017936Freshwater SedimentMARLAMAVALGAGLLLLAGSADAQWRYTDDKGVSRVTQYKIDVPEPHRDAAEWVGPVGIGKPALSAEQIREAQLVEAIRRIVAAEAGLVQFRNMPASARPAAVPGGPGKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTSGPSGGYPTH
Ga0187821_1009237423300017936Freshwater SedimentMARLAMAAALGAGLLLMAGAADAQWRYADDKGVSRVTQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIATAEAGLVQFRDVPAPVKPAAAPGGPDKPMATMCISGEQRAMTSPGIWKVVGGCNADFSTGYSTGGYGTSGPSGGYPTH
Ga0187775_1004297313300017939Tropical PeatlandMTRLAMALSFGTIVLLAAGPGDAQWRYTDDRGTARVTQYKIDVPTPYRDAAEWVGPVGIGKPALSADQHLAAQRWEAIQRIIAAEAAMLQIRTVPQTPLRADTGGPSQPMTTMCVAGELRAMTSPGSWKVIGPCSAGFSTNYGTNGYGTFGSVTVR
Ga0187823_1011590423300017993Freshwater SedimentAMAAALGAGLLLMAGAADAQWRYADDKGVSRVTQYKIDVPEPHRDAAEWVGPVGIGKPALSAEQIREAQLVEAIRRIVAAEAGLVQFRNMPASARPAAVPGGPGKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTSGPSGGYPTH
Ga0184610_100734923300017997Groundwater SedimentMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAADWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPALPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTEGYGSFGGVILR
Ga0187787_1026186313300018029Tropical PeatlandMTRLAMALSFGTIVLLAAGPGDAQWRYTDDRGTARVTQYKIDVPTPYRDAAEWVGPVGIGKPALSADQHLAAQRWEAIQRIIAAEAAMLQIRTVPQTPLRADTGGPSQPMTTMCVAGELRAMTSPGSWKVIGPCSAGFS
Ga0184634_1014283723300018031Groundwater SedimentMNRFAMAVGLSAGLLALAGQADAQWRYTDDKGASRVTQYKIDITTPYRDAAEWIGPIGIGNPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAPAPPRVDAGAAGRPMPTMCIAGELRVMTSPGIWRVVGGCAPGFSTGYGTDGYGSFGGVILR
Ga0187788_1035258813300018032Tropical PeatlandMTRLAMALSFGTIVLLAAGPGDAQWRYTDDRGTARVTQYKIDVPTPYRDAAEWVGPVGIGKPALSADQHLAAQRWEAIQRIIAAEAAMLQIRTVPQTPLRADTGGPSQPMTTMCVAGQLRAMTSPGSWKVIGPCSAGFSTNYGTNGYGTFGSVTVR
Ga0184626_1003496723300018053Groundwater SedimentMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAADWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPALPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVILR
Ga0184621_1004296313300018054Groundwater SedimentMNRLAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSAAQIRAAQRWDAIQRIVAAEAGLLQFKNVATAPPLPRVDASASDRPIPTMCIAGELRTMTSPGIWKVV
Ga0184623_1027323913300018056Groundwater SedimentMNRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPMPYRDAAEWIGPVGIGKPELSADQIRAAQRWDAIQRIVAAEAGLLRYKSVATPAPPPVDPGAGRPLATMCVAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGTFGGIMVR
Ga0184623_1046066013300018056Groundwater SedimentMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAADWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPALPRVDAGASDRPIPTMCITGELRAMTSPGIWKVV
Ga0184640_1021370013300018074Groundwater SedimentMNRFAMAVGLSAGLLLLAGQAEAQWRYIDDKGASKVTQYKLDVPMPSRDAAEWIGPIGIGKPALSADQIAAAQRWEAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGC
Ga0184632_1004611523300018075Groundwater SedimentMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0184609_1030090713300018076Groundwater SedimentMNRFAMAVGLSAGLLLLARQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAPTAPPRADSSAAGRPVPTMCIAGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVILR
Ga0184609_1048690713300018076Groundwater SedimentMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAADWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPALPRVDAGASDRPIPTMCITGELRAM
Ga0184612_1004687623300018078Groundwater SedimentMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAPAPSRVDSSAAGRPVPTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSFGGVILR
Ga0184627_1027506023300018079Groundwater SedimentMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAADWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPALPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTD
Ga0184629_1038381523300018084Groundwater SedimentMKRFAMAVGLSAGLLVLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAIQRIVAAEAGLLQFKNVVAPAPPRVDSSAAGRPVPTMCIAGELRAMTSPGIW
Ga0184629_1047866223300018084Groundwater SedimentMKRFAMAVRLSAGLLVLAGQADAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0190265_1029989213300018422SoilAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPVGIGKPELSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAAPPPLPRVDASASARPNPTMCIAGELRTMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVTIR
Ga0190265_1103068023300018422SoilMTRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKVDVPLPSRDAAEWIGPIGVGKPGLSADQIAAAQRWNAVQRIVAAEAGLLQFRNVAAAPPLPRVDAGSSDRPNPTMCIAGELRTMTSPGIWKVVGGCSSGPSGFSS
Ga0190265_1112347223300018422SoilMNRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASRVTQYKLDIPTAHRDAAEWIGPVGTGKPELSADQIRNAQRWNAIQRLAAAEAGLLQYKPVAAPAPPPTDQGAGRPLATMCVAGELRNMTSPGIWRVVGSCPTGASGFSSGYGTDGYGSFGGVQVR
Ga0190272_1002890333300018429SoilMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASRVTQYKIDIPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSAGPSGFSTGYGTEGYGSVGGVILR
Ga0190272_1019074223300018429SoilMNRFAMAVGLGAGLLLLAGQADAQWRYSDDKGASRVTQYKLDIPTPYRDAAEWIGPVGIGKPGLSAEQIAAAQRWEATQRLVAAEAGLLQFKNVAAAPAPPRVDPGVSSRSIPTMCIAGEMRSMTSPGIWKVVGGCTSSPSGFSTGYGTDGYGSFGGITVR
Ga0184643_112341323300019255Groundwater SedimentMNRLAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAPPLPRVDASASDRPSPTMCIAGELRTMTSPGIWKVVGGCS
Ga0187894_1029898713300019360Microbial Mat On RocksMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSFRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAADAALLQYKSVAAPAPPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCPTGPSGFSTGYRTDGYGSFGGITVR
Ga0190264_1028335313300019377SoilMKRLAMAVGFGAGLLLVAGQADAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNMAAPPPLPRVDASASARPNPTMCIAGELRTMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGAFGGVTIR
Ga0187892_10000855263300019458Bio-OozeMTRLALTVGLGAGLLLLTGQADAQWRYTDDKGASKVTQYKLDVPAPHRDAAVWIGPTGIGNPALSADQIRGAQLWDAVRRIVAAEAGLLQFQNVQAPAPPRSDPGAAKPMASMCIAGELRAMTSPGSWKVVGACGAGFSTGYGIDGYGSFGGFLIR
Ga0187892_1002001193300019458Bio-OozeMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSYRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAAEAALLQYKSVAAPAPPPADPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGITVR
Ga0187893_1015349123300019487Microbial Mat On RocksMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSFRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAAEAALLQHKAVAAPAPSPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGITVR
Ga0187893_1035606223300019487Microbial Mat On RocksMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSYRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAADAALLQYKSVAAPAPPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCPTGPSGFSTGYRTDGYGSFGGITVR
Ga0193723_100802333300019879SoilMTRLAMAVGLGAGLLLLVGQADAQWRYTDDKGVSRVTQYKIDVPAPQRDAAEWVGPVGIGKPALSEEQLRAARHWEAVQRLIAAETALMQIKPVPTPAPLRVDSSAGGRAMPSMCIAGELRVMTSPGIWKIVGGCPTGPSGFSSGYGSDGYGSFGGITVR
Ga0193707_105261513300019881SoilMERFAMAAALGLGLLLSAGAADAQWRYTDDKGVSRITQYKIDVPEPSRDAAEWMGPVGIGKPALSAEQVRAAQLSDAFRRIGTAEAGLVQYRNMPAPARPAPDPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0193725_103436623300019883SoilMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0193718_103649223300019999SoilMERLAMAIALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEASRDAAEWIGPVGIGKPALSAEQVRAAQLADAFRRIGTAEAGLVQFRNMPAPARPAPDQGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTIGPSGGYPTH
Ga0193730_102214823300020002SoilMERLAMATALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQVRAAQLSDAFRRIGTAEAGLVQYRNMPAPARPAPDPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0193739_104791213300020003SoilFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAPAPPRADSSAAGRPVPTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSFGGVILR
Ga0193739_107852213300020003SoilMNRFAMAVGLSAGLLVSAGQAEAQWRYTDDKGASRVTQYKIDIPMPYRDAAEWIGPIGIGKLALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTEGYGSFGGVILR
Ga0193755_104567023300020004SoilMNRFAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0193716_120469123300020061SoilMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGGSRVKQYKLDIPTPYRDGAEWIGPVGIGKPELSADQVRAAQRWDAIQRLVAAEAALLQYKAVTAPAAPPVDPGAGKPLATMCIAGELRNMTSPGIWKVVGACSTGPSGFSTGYGTDG
Ga0206353_1090495733300020082Corn, Switchgrass And Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0210378_1002810843300021073Groundwater SedimentMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAPAPPRADSSAAGRPVPTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSFGGVILR
Ga0210378_1019115023300021073Groundwater SedimentMNRFAMAVGLSAGLLLLAGQAEAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAIQRIVAAEAGLLQFKNVAPPAPPRADSSAAGRPVPTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSFGGVTLR
Ga0210379_1005555523300021081Groundwater SedimentMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAIQRIVAAEAGLLQFKNVVAPAPPRVDSSAAGRPVPTMCIAGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0210377_1000869553300021090Groundwater SedimentMKRFAIAVGLSAGLLLLAGQADAQWRYTDDKAASRVTQYKLDVPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAIQRIVAAEAGLLQFKNVAAAPAPPRADAGASGRPIPTMCIASELRAMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVLIR
Ga0210406_1032120123300021168SoilMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRDGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVQYRNMPAPAKPAPDPGGPSKPMASMCIAGQQRAMTSPGIWKVVGGCNGDFSTGYGTG
Ga0210397_1146733213300021403SoilMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRAGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVQYRNMPAPAKPAPDPGGPSKPMATMCIAGQQRAMTSPGIWKVVGGCNGD
Ga0222622_1090529213300022756Groundwater SedimentMNRFAMAVGLGAGLLLLAGQAEAQWRYTDDKGANRVTQYKLDIPTPYRDGAEWIGPVGIGKPELSADQIRAAQRWDAIQRLVAAEAELLRYKSVAGPAAPPADPGAGSPLATMCIAGELRNMTSPGIWKVVGGCSTGPSGFSTGYGTEGYGSFGGILVR
Ga0209109_1029998913300025160SoilMKRWATAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPIGIGKPALSADQIRAAQLWDAIQRIIAAEAGLLQFRNVAAPAPPRPESGTAGKPMATMCIAGELRAMTSPGSWKVVGACGAG
Ga0209108_1013964423300025165SoilMKRWATAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPAPYRDAAEWIGPIGIGKPALSADQIRAAQLWDAIQRIIAAEAGLLQFRNVAAPAPPRPESGTAGKPMATMCIAGELRAMTSPGSWRTVGGCAPGFSTGYGTDGYGAVGGFIVR
Ga0209640_1001785443300025324SoilMKRFAMAVALGTGLLLVAGQADAQWRYTDDKGASRVTQYKLDIPTPYRDAAEWIGPVGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAAPAPPRVNAGASSRPIPTMCITGELRVMTSPGIWKVVGACSTGPSGFSTGYGTDGYGSSGGIMVR
Ga0210073_101401523300025569Natural And Restored WetlandsMKRSAMAVGLSAGLLLLAGPADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPIGIGKPALSADQVRAAQHGDAIRRIIAAEAGLLQFRNTATPAPPRVVSDASGRATTTMCIAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSVGGFTTR
Ga0207688_1001735353300025901Corn, Switchgrass And Miscanthus RhizosphereMGRWGMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207680_1062542623300025903Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTS
Ga0207645_1017385123300025907Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWRVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207645_1018569513300025907Miscanthus RhizosphereLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR
Ga0207643_1000133833300025908Miscanthus RhizosphereMGRWGMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207684_10000570123300025910Corn, Switchgrass And Miscanthus RhizosphereMKRFAMAATLGTGLLLSAWQVDAQWRYTDDRGTNKVTQYKIDVPASSRDTAVWIGPIGIGNPGLSADQVRAAQLWDAVRRIVAAEAGLLQFKNVQAPTSPRWDSGAAGKPMATMCIAGELRTMTSPGSWKVVGACGAGFSTGYGTDGYGSFGGFSVR
Ga0207684_1010800743300025910Corn, Switchgrass And Miscanthus RhizosphereMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDLAEWIGPVGIGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPGPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0207707_1050289423300025912Corn RhizosphereMNRLAMAVGFGAGLLLAAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAQPPPLPRVDAGTSDRSNPTMCIAGELRSMT
Ga0207707_1079296723300025912Corn RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAETALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPS
Ga0207707_1132091323300025912Corn RhizosphereGTSRLTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWRVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207660_1060180123300025917Corn RhizosphereMNRLAMAVGFGAGLLLAAGQADAQWRYTDDKGTSKVTQYKLDVPVPYRDAAEWIGPVGIGKPGLSADQIRAAQHWEAVRRLVAAEAGLLQFRNVAQPPLLPRVDAGTSDRSNPTMCIAGELRSMTSPGIWKVVGGCSTGP
Ga0207657_10001153313300025919Corn RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIAAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207646_10000408463300025922Corn, Switchgrass And Miscanthus RhizosphereMKRFAMAATLGTGLLLSAWQVDAQWRYTDDRGTSKVTQYKIDVPASSRDTAEWIGPIGIGNPGLSADQVRAAQLWDAVRRIVAAEAGLLQFKNVQAPTSPRWDSGAAGKPMATMCIAGELRTMTSPGSWKVVGACGAGFSTGYGTDGYGSFGGFSVR
Ga0207646_1026843223300025922Corn, Switchgrass And Miscanthus RhizosphereMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSGY
Ga0207687_1021774013300025927Miscanthus RhizosphereMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAESALMQIKPAPTPAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR
Ga0207644_1054608123300025931Switchgrass RhizosphereMGRWGTVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207689_10000275143300025942Miscanthus RhizosphereMGRWGMVVTVGGAVLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207651_1146297713300025960Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWRV
Ga0210090_100673133300025965Natural And Restored WetlandsMKRSAMAVGLGAGLLLLAGPADAQWRYTDDRGVSKVTQYKLDVPAPHRDAAEWIGPIGVGKPALSADQVRAAQHWEAIRRIIAAEAGLLQFQNAATPAPPRAVSDAAGRPLTTMCVAGELRAMTSPGIWRVVGGCAPGFSTGYGTDGYGSV
Ga0207658_1019248113300025986Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVDAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0208000_11028313300026001Rice Paddy SoilMERFTMVVALGLGLFLLAGQADAQWRYTDDKGSSKVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRQAQLADAIRRIVTAEAGLVQFRNVPAPARPAAVPDGPGKPMASMCVSGEQRAMTSPGIWKVVGGCSSDFSSGYGTGGYGTFGAT
Ga0208532_100548813300026011Rice Paddy SoilMQRFAMTVGLGVGLLLLAGQADAQWRYTDDKGATRVTQYKLHVPPPYRDAAEWIGPIGIGKPALSEDRILAEQHWNAIRRIIDAEAGLLQFKSAAAPAPPRVSSGAAGKPMATMCIAGQLRAMTSPGSWKVVGGCSPDFSTGYGTDGYGSVGGFMIR
Ga0207708_1116158613300026075Corn, Switchgrass And Miscanthus RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0207641_1070049523300026088Switchgrass RhizosphereMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYP
Ga0209131_103101033300026320Grasslands SoilMTRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASKVTQYRIDVPAPHRDAAEWIGPVGIGNPALSEDQLRAARHWEAVQRLIAAETALMQIKPVPSPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWRIVGACPTGPSGFSSGYGTDGYGSFGGIMVR
Ga0257148_100522813300026345SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSGYGTGGYGTSGPL
Ga0257170_100140643300026351SoilVNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0257170_100564423300026351SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257166_101758723300026358SoilMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0257173_100001873300026360SoilMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFRNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0257173_100460323300026360SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFS
Ga0257146_100665023300026374SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257167_101044313300026376SoilDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0257171_101761033300026377SoilHGAIRVAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257147_100000633300026475SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257177_101813523300026480SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGY
Ga0257172_101556213300026482SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGY
Ga0257153_100254063300026490SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSG
Ga0257159_1000080103300026494SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSY
Ga0257157_102252623300026496SoilMERLAMATALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQLRAAQLSDAFRRIGTAEAGLVRYRNMPAPARPAPDPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0257164_100175423300026497SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSPDFSSSYGTGGYGTSGPSGGYPTH
Ga0257181_101373013300026499SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257161_102162413300026508SoilMERLAMATALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQLRAAQLSDAFRRIGTAEAGLVQYRNMPAPARPAPVPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSS
Ga0257168_101638613300026514SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTSGPSGGYPTH
Ga0257158_103817823300026515SoilMERFAMATALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQLRAAQLSDAFRRIGTAEAGLVQYRNMPAPARPAPVPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0256867_1008027613300026535SoilLAGPADAQWSYTDGKGVSRVTQYKLDVPAPYRDAAEWIGPVGIGKPDLSADQILAAQLWDAVQRIIAAEAALLQFRNVAAPAPPRLDAGTSGKPMASMCVAGEQRIMTSPGIWKIVGGCATGPSGFSTGYGTDGYGSFGGITIR
Ga0179587_1002603033300026557Vadose Zone SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTFGPSGGYPTH
Ga0209898_101802813300027068Groundwater SandMMRFAMAIGLGAGLLLSAWQADAQWRYTDDKGMSKVTQYKLDVPEPYRDAAVWIGPTGIGNPALSADQIRAAQLSDAIRRIVEAEAGLLQFKNAEAPAPPRMASGSAGKPMAIMCIAGDLRAMTSPGAWKVVSACDSRFSTGYATDGYGSSFGSFIAR
Ga0256866_100868213300027650SoilMKRLAMAVALGTGLLLLAGPADAQWRYTDGKGVSRVTQYKLDVPAPYRDAAEWIGPVGIGKPALSADQILAAQLWDAVQRIIAAEAALLQFRNVAAPAPPRLDPGTSGKPMASMCVAGEQRIMTSPGIWKVVGGCATGPSGFSTGYGTDGYGSFGGIT
Ga0209178_128953313300027725Agricultural SoilMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRDGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVKYRNMPAPAKPAPDPGGPSKPMTTMCIAGQQRAMTSPGIWKVVGGCNGDFSTG
Ga0209177_1017803623300027775Agricultural SoilMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0209074_1014160813300027787Agricultural SoilTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0209074_1021362613300027787Agricultural SoilMGQWGMGRWGIVTTLGGALLLLAGQAEAQWRYTDDKGMSRVTQYKIDVPSDFRDGAEWIGPVGPGKPGLSEDQHRAAQRSEANRRIIAAEAGLVRYRNMPAAARPAPDPGGPPKAMATMCIAGQQRVMTSPGSWKVTGSCSSDFSTGYSGGYPSYGSWGGYPTH
(restricted) Ga0233416_1002988713300027799SedimentAGLGAGLLLSIGQVDAQWQYTDDTGAGKVTQYKLDVPAPYRDAAVWIGPTGVGNPALSADQVRLAQRWEAVRRLVAAEAELLRYRNAPAPARVAARPDPGPTAKAPTTMCVAGTLQAMVAPGSWRVVGTCAAGFSTGYRTDGYGSYGSFTVR
Ga0209180_1000257923300027846Vadose Zone SoilMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0209701_1016506323300027862Vadose Zone SoilMKRFAMAVGLGTGLLLSAWQADAQWRYTDDKGTSRVTQYKLDVPAPHRDAAVWIGPTGIGNPALSADQTRAAQLWDAVRRIVAAEAGLLQFQNVQAPTPPRLDSGAAGKPRATMCIAGELRAMTSPGSWTVVGACGAGFSTGYGTDGYGSFGGFTVR
Ga0209701_1035280123300027862Vadose Zone SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKTMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH
Ga0209283_1094308313300027875Vadose Zone SoilMNRFAMTVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGG
Ga0209590_1004886713300027882Vadose Zone SoilMKRFAMAAALGTGLLLSAWQADAQWRYTDDKGASKVTQYKLDVPAPHRDAAEWIGPTGIGNPGLSADQIRAAQLWDAVRRIAAAEAGLLQFTNVQAPTPLRWDSGAAGKPMATMCIAGELRAMTSPGTWKVVGACGAGFSTGYGTDGYGSVGGFTVR
Ga0209068_1001779733300027894WatershedsMERLAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEAHRDAAEWIGPVGIGKPALSAEQVRAAQLSDAYRRIGRAEAGLVQLRNMPAPARPAPDPGGPTKAMATMCVSGERRVMTSPGIWKVVGRCSSDFSTGYSTGGYGTFGATGGYPTH
Ga0209488_1004768553300027903Vadose Zone SoilMERFAMAVALGLGLLLWAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFS
Ga0209382_1034995623300027909Populus RhizosphereMNRLAMAVGVGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTSYRDEAEWIGPVGIGKPELSADRIRAAQRWDAIQRLVAAEAALLQYRSVAAPAPPPVDPGAGRPLATMCIAGELRNMTSPGIWKVVGGCPTGPSGFSTGYGTDGYGSFGGITVR
Ga0209583_1010846833300027910WatershedsMERLAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEAHRDAAEWIGPVGIGKPALSAEQVRAAQLSDAYRRIGRAEAGLVQLRNMPAPARPAPDPGGPTKAMATMCVSGERRVM
Ga0209526_1001788653300028047Forest SoilMERLAMAIALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQVRAAQLSDAFRRIGTAEAGLVRYRNMPAPARPAPDPGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0137415_1004961363300028536Vadose Zone SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQVRAAQLSDAFRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKTMATMCVSGEQRVMTSPGIWKVVGGCSSDFSTGYSTGGYGSYGPSGGYPTH
Ga0247828_1102625313300028587SoilMGRWGMVVTVGGALLLLAGKADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0247822_1014482533300028592SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGDPALL
Ga0247822_1059560813300028592SoilWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAETALMQIKPAPTSAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGIMVR
Ga0247821_1062727223300028596SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWMGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTS
Ga0257175_101847513300028673SoilMERLAMAVALGLGLLLSAAQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGVGKPALSADQVRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVG
Ga0257175_103681823300028673SoilMNRFAMAVGLSAGLLLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAAAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVTLR
Ga0307307_1019996013300028718SoilMERLAMAIALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEASRDAAEWIGPVGIGKPALSAEQVRAAQLADAFRRIGTAEAGLVQFRNMPAPARPAPDQGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSD
Ga0307504_1004529613300028792SoilKGVSRVTQYKIDVPEPSRDAAEWIGPVGIGKPALSAEQVRAAQLSDAFRRIGTAEAGLVQYRNMPAPARPAPDPGGSGKPMATMCVSGQQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTTGPSGGYPTH
Ga0307292_1049496413300028811SoilMERLAMAIALGLGLLLSAGAADAQWRYTDDKGVSRVTQYKIDVPEASRDAAEWIGPVGIGKPALSAEQVRAAQLADAFRRIGTAEAGLVQFRNMPAPARPAPDQGGSSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSSSYGTGGYGTIGPSGG
Ga0247825_1034704013300028812SoilMGRWGTVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDVPADLRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWRVTGACNSDF
Ga0247825_1104532413300028812SoilMTRFAMAVGLGVGLLLLARQADAQWRYTDDKGASKVTQYKIDVPAPHRDAAEWIGPVGIGNPALSDDQLRAARHWEAVQRLIAAETALMQIKPAPTSAPLRVDSTAGGRPMPSMCIAGELRVMTSPGIWRIVGTCPTGPSGFSSGYGTDGYGSFGGI
Ga0307312_1113886813300028828SoilLLAGQADAQWRYTDDKGASRVTQYKIDIPTPYRDAAEWIGPIGIGKPALSADQIAAAQRWDAVRRIVAAEAGLLQFKNVAVAPPLPRVDAGASDRPIPTMCITGELRAMTSPGIWKVVGGCSSGPSGFSTGYGTDGYGSFGGVILR
Ga0247826_1012335443300030336SoilWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPARPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0268386_1003335053300030619SoilMKRLAMAVALGTGLLLLAGPADAQWSYTDGKGVSRVTQYKLDVPAPYRDAAEWIGPVGIGKPDLSADQILAAQLWDAVQRIIAAEAALLQFRNVAAPAPPRLDAGTSGKPMASMCVAGEQRIMTSPGIWKIVGGCATGPSGFSTGYGTDGYGSFGGITIR
Ga0302046_1023285033300030620SoilMKRLAMAVALGTGLLLLAGPADAQWRYTDGKGVSRVTQYKLDVPAPYRDAAEWIGPVGIGKPALSADQILAAQLWDAVQRIIAAEAALLQFRNVAAPAPPRLDPGASGKPMASMCVAGEQRIMTSPGIWKVVGGCATGPSGFSTGYGTDGYGSFGGITIR
(restricted) Ga0255311_100271743300031150Sandy SoilMTRFAMAVGLGAGLLLLAGQADAQWRYTDDKGVSRVTQYKIDVPAAHRDAAEWIGPVGIGNPALSADQLRAARRWEAVQRFIAAETALMQLRPVPTPAPVRADPSGGGRPTPTMCIAGELRVMTSPGIWKIVGTCPTGPSGFSSGYGTDGYGSFGGILVR
(restricted) Ga0255311_107607513300031150Sandy SoilMAMALGAGLLLLAGSADAQWRYTDDKGVSRITQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIVTAEAGLVQFRNMPAPAKPAAAPGGPGKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYATGGYGTSGPSGGYPTH
Ga0307501_1015729513300031152SoilMNRLAMAVGLSAGLLLLAGPADAQWRYVDDKGASKVTQYKLDVPMPYRDAAEWIGPIGIGKPALSADQIRAAQRWDAIQRIVAAEAGLLQFKNVAAAPPLPRVDASGSDRPIPTMCIAGELRAMTSPGIWKVVGGCSSGPSGFSSGYGTDGYGFVGGSILVR
(restricted) Ga0255310_1004036323300031197Sandy SoilMGRLAMALGAGLLLLAGSADAQWRYTDDKGVSRITQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIVTAEAGLVQFRNMPAPAKPAAVPGRPDKAMATTCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTSGPSGGYPTH
Ga0299913_1013267153300031229SoilMKRLAMAVALGTGLLLLAGPAHAQWRYTDGKGVSRVTQYKLDVPAPYRDAAEWIGPVGIGKPALSADQILAAQLWDAVQRIIAAEAALLQFRNAAAPAPPRLDPGTSGKPMASMCVAGEQRIMTSPGIWKVVGGCATGPSGFSTGYGTDGYGSFGGITIR
Ga0307505_1018519313300031455SoilQEISMTRLTMAIGLGAGLLLLVGQADAQWRYTDDKGVTRVTQYKIDVPAPQRDAAEWVGPVGIGKPALSEEQLRAARHWEAVQRLIAAETALMQIKPVPTPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWKIVGGCPTGPSGFSSGYGSDGYGSFGGITVR
Ga0307469_1021327613300031720Hardwood Forest SoilMKRFAMAATRGTGLLLSAWQVDTQWRHTDDRGTSKVTQYKIDVPAPYRDTAEWIGPIGIGNPGLSADQVRAAQLWDAVRRIVAAEAGLLQFKNVQAPTSPRWDSGAADKPMATMCIAGELRAMTSPGSW
Ga0307469_1079852913300031720Hardwood Forest SoilMTRLAMAVGLGAGLLLLVGQADAQWRYTDDKGVSRVTQYKIDVPAPQRDAAEWIGPVGIGKPALSEEQLRAARHWEAVQRLIAAETALMQIKPVPTPAPLRVDSSAGGRPMPSMCIAGELRVMTSPGIWKIVGGCPTGPSGFSSGYGSDGYGSFGGITVR
Ga0307469_1161441213300031720Hardwood Forest SoilMNRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASRVTQYKLDVPTAYRDAAEWIGPVGIGKPELSAEQIRTAQRWDAIQRLVAAEAALLQYKPVAASAPPPANQGTGTPMATMCVAGELRSMTSPGIWKVVGGCSTG
Ga0307468_10019078313300031740Hardwood Forest SoilMRRLAMAGALGVGLLLSAGHADAQWRYTDDKGVSRVTQYKIDVPESLRDGAEWIGPVGVGKPGLSADQIRAAQLSDAIRRIVAAEAGLVKYRNMPAPAKPAPDPGGPSKPMATMCIAGQQRAMTSPGIWKVVGGCNGDFST
Ga0307468_10040602823300031740Hardwood Forest SoilMNRLAMAVGFGAGLLLVAGQADAQWRYTDDKGASKVTQYKLDVPMPYRVAAEWVGPVGIGKPELSADQIRAAQRWDAVRRLVAAEAGLLQFRNVAAPPPLPRVDAGASDRPAPTMCIAGELRTMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVNVR
Ga0307473_1023298123300031820Hardwood Forest SoilMERFAMAVALGLGLLLSAGQADAQWRYTDDKGASRVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRAAQLSDALRRIGTAEAGLVQFRNMPAPARPAPDRGGPSKPMATMCVSGEQRVMTSPGIWKVVGGCSSDFSS
Ga0310885_1042081023300031943SoilADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0307471_10015716123300032180Hardwood Forest SoilMNRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASRVTQYKLDVPTAYRDAAEWIGPVGIGKPELSAEQIRTAQRWDAIQRLVAAEAALLQYKPVAASAPPPANQGTGTPMATMCVAGELRSMTSPGIWKVVGGCSTGPSGFSSGYGTDGYGSFGGVTAR
Ga0307471_10100423123300032180Hardwood Forest SoilMKRFAMTVGLGAGLLLLAGHADAQWRYTDDRGVTKVTQYKIDIPTAYRDAAEWIGPVGIGKPALSEDQIRQARRWEAIERLVNAEAGLIQYRNMVPPPPLRDPGGQPRSMATMCIAGELRAMTSPGSWKVVGPCAGGFSTGYGQDGYGTFGTISVR
Ga0307471_10125133023300032180Hardwood Forest SoilMNRLAMAVGFGAGLLLVAGQADAQWRYTDDKGASKVTQYKLDVPMPYRDAAEWIGPVGIGKPELSADQIRAAQRWDAVRRLVAAEAGLLQFRNVAAPPPLPRVDAGASDRPAPTMCIAGELRTMTSPGIWKVVGGCSTGPSGFSTGYGTDGYGSFGGVNVR
Ga0335085_1000066733300032770SoilMKRLAMAVGLGAGLLLVGGQADAQWRYTDDRGASRVTQYKLDVPAPHRDAAEWIGPVGTGKPALSTEQILAAQRWEAIERIVAAEAGLVQFKNLAAPAPPRDPGGGGRPMATMCIAGELRAMTSPGSWKVVGGCPTGFSTDYGTAGFGSFGTVNVR
Ga0214472_1058597823300033407SoilALGTGLLLVAGQADAQWRYTDDKGASRVTQYKLDIPTPYRDAAEWIGPVGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAAPAPPRVNAGASSRPVPTMCITGELRVMTSPGIWKVVGACSTGPSGFSTGYGTDGYGSFGGIMVR
Ga0214471_1002305573300033417SoilMKRFAMAVALGAGLLLVAGQADAQWRYTDDKGASRVTQYKLDIPTPYRDAAEWIGPVGIGKPALSADQIAAAQRWEAIQRIVAAEAGLLQFKNVAAAPAPPRVNAGASSRPIPTMCITGELRVMTSPGIWKVVGACSTGPSGFSTGYGTDGYGSFGGIMVR
Ga0326729_101121433300033432Peat SoilMAVALGAGLLLLAGPANAQWRYTDDKGASRVTQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIVAAEAGLVQFRNMPAPAKPAALPGGPGKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTYGPSGGYPTH
Ga0310811_1145979813300033475SoilMGRWGMVVTVGGALLLLAGQADAQWRYTDDKGTSRVTQYRIDIPADLRDGAEWIGPVGPGKPGLSADQLRAAERDEATRRIVAAEAGLMRYRNMPAPARPAPDPGGAEKAMASMCISGQQRVMT
Ga0247830_1006230643300033551SoilMTRFAMAVGLGAGLLLLAGQADAQWRYTDDKGASRVTQYKIDVPAAHRDAAEWIGPVGIGNPALSADQLRAARHWEAVQRLIAAETALMQLRPVPTPAPVRVDSSVGGRPTPTMCIAGELRVMTSPGIWKIVGSCPTGPSGFSSGYGTDGYGSFGGILVR
Ga0247830_1006753923300033551SoilMGRWGTVVTVGGALLLLAGQAEAQWRYTDDKGTSRVTQYRIDIPADRRDGAEWIGPVGPGKPGLSADQLRAAQRDEATRRIVAAEAGLVRYRNMPAPVRPAPDPGGSEKAMASMCISGQQRVMTSPGSWKVTGACNSDFSTGYSGGYPTYGGWGGYPAH
Ga0326723_0027032_1935_23633300034090Peat SoilGPANAQWRYTDDKGASRVTQYKIDVPEPHRDAAEWIGPVGIGKPALSADQIREAQLVEAIRRIVAAEAGLVQFRNMPAPAKPAALPGGPGKPMATMCISGEQRAMTSPGIWKVVGGCNSDFSTGYSTGGYGTYGPSGGYPTH
Ga0370498_163109_55_5313300034155Untreated Peat SoilMNRFAMAVGLGAGLLLLAGQADAQWRYTDDKGAKVTQYKLDIPTPYRDAAEWIGPVGIGKPELSADQIRAAQRWDAIQRIVAAEAGLLQYKPVTTQALPPVDPGTGRPLATMCVAGELRNMTSPGIWKVVGGCSTGASGFSTGYGTDGYGSFGGIMVR
Ga0373948_0131977_142_6093300034817Rhizosphere SoilMERFTMVVALGLGLFLLAGQADAQWRYTDDKGSSKVTQYKIDVPEPYRDAAEWIGPVGIGKPALSADQIRQAQLADAIRRIVTAEAGLVQFRNVPAPARPAAVPDGPGKPMASMCVSGEQRAMTSPGIWKVVGGCSSDFSSGYGTGGYGTFGATGG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice