NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F042081

Metagenome Family F042081

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F042081
Family Type Metagenome
Number of Sequences 159
Average Sequence Length 226 residues
Representative Sequence MKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRELRALDDDRAARGTLRREGFTAATSGGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGLTNRCVLCHVRAGQTVSGHVVNTHRFARPGAAGDEQ
Number of Associated Samples 134
Number of Associated Scaffolds 159

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 43.04 %
% of genes near scaffold ends (potentially truncated) 39.62 %
% of genes from short scaffolds (< 2000 bps) 69.18 %
Associated GOLD sequencing projects 118
AlphaFold2 3D model prediction Yes
3D model pTM-score0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (59.748 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(11.950 % of family members)
Environment Ontology (ENVO) Unclassified
(32.075 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(44.654 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 28.52%    β-sheet: 8.37%    Coil/Unstructured: 63.12%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 159 Family Scaffolds
PF13561adh_short_C2 12.58
PF00732GMC_oxred_N 8.18
PF14522Cytochrome_C7 7.55
PF02780Transketolase_C 3.14
PF03446NAD_binding_2 3.14
PF14537Cytochrom_c3_2 1.89
PF07992Pyr_redox_2 1.26
PF10604Polyketide_cyc2 1.26
PF02771Acyl-CoA_dh_N 0.63
PF00326Peptidase_S9 0.63
PF05199GMC_oxred_C 0.63
PF00479G6PD_N 0.63
PF01740STAS 0.63
PF07076DUF1344 0.63
PF00939Na_sulph_symp 0.63
PF00484Pro_CA 0.63
PF04366Ysc84 0.63
PF13442Cytochrome_CBB3 0.63
PF13435Cytochrome_C554 0.63
PF05977MFS_3 0.63
PF09917DUF2147 0.63
PF08281Sigma70_r4_2 0.63
PF03364Polyketide_cyc 0.63
PF06508QueC 0.63
PF13432TPR_16 0.63
PF13458Peripla_BP_6 0.63
PF10755DUF2585 0.63
PF08811DUF1800 0.63

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 159 Family Scaffolds
COG2303Choline dehydrogenase or related flavoproteinLipid transport and metabolism [I] 8.81
COG06037-cyano-7-deazaguanine synthase (queuosine biosynthesis)Translation, ribosomal structure and biogenesis [J] 0.63
COG5267Uncharacterized conserved protein, DUF1800 familyFunction unknown [S] 0.63
COG2930Lipid-binding SYLF domain, Ysc84/FYVE familyLipid transport and metabolism [I] 0.63
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 0.63
COG1960Acyl-CoA dehydrogenase related to the alkylation response protein AidBLipid transport and metabolism [I] 0.63
COG1606ATP-utilizing enzyme, PP-loop superfamilyGeneral function prediction only [R] 0.63
COG1055Na+/H+ antiporter NhaD or related arsenite permeaseInorganic ion transport and metabolism [P] 0.63
COG0780NADPH-dependent 7-cyano-7-deazaguanine reductase QueF, C-terminal domain, T-fold superfamilyTranslation, ribosomal structure and biogenesis [J] 0.63
COG0037tRNA(Ile)-lysidine synthase TilS/MesJTranslation, ribosomal structure and biogenesis [J] 0.63
COG0519GMP synthase, PP-ATPase domain/subunitNucleotide transport and metabolism [F] 0.63
COG0482tRNA U34 2-thiouridine synthase MnmA/TrmU, contains the PP-loop ATPase domainTranslation, ribosomal structure and biogenesis [J] 0.63
COG0471Di- and tricarboxylate antiporterCarbohydrate transport and metabolism [G] 0.63
COG0364Glucose-6-phosphate 1-dehydrogenaseCarbohydrate transport and metabolism [G] 0.63
COG0301Adenylyl- and sulfurtransferase ThiI (thiamine and tRNA 4-thiouridine biosynthesis)Translation, ribosomal structure and biogenesis [J] 0.63
COG0288Carbonic anhydraseInorganic ion transport and metabolism [P] 0.63
COG0171NH3-dependent NAD+ synthetaseCoenzyme transport and metabolism [H] 0.63
COG0137Argininosuccinate synthaseAmino acid transport and metabolism [E] 0.63


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms59.75 %
UnclassifiedrootN/A40.25 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000550|F24TB_13401409Not Available594Open in IMG/M
3300000956|JGI10216J12902_107262916Not Available1269Open in IMG/M
3300001431|F14TB_101154681All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium1474Open in IMG/M
3300001431|F14TB_102181854Not Available1037Open in IMG/M
3300003995|Ga0055438_10071848Not Available935Open in IMG/M
3300004024|Ga0055436_10004011All Organisms → cellular organisms → Bacteria → Proteobacteria2847Open in IMG/M
3300004062|Ga0055500_10011511Not Available1459Open in IMG/M
3300004157|Ga0062590_100258357All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1318Open in IMG/M
3300004633|Ga0066395_10067046All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1641Open in IMG/M
3300005213|Ga0068998_10010576All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1349Open in IMG/M
3300005347|Ga0070668_100162841All Organisms → cellular organisms → Bacteria1811Open in IMG/M
3300005445|Ga0070708_100384498All Organisms → cellular organisms → Bacteria1323Open in IMG/M
3300005457|Ga0070662_100142099All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1861Open in IMG/M
3300005459|Ga0068867_100070212All Organisms → cellular organisms → Bacteria → Proteobacteria2618Open in IMG/M
3300005467|Ga0070706_100485484All Organisms → cellular organisms → Bacteria1149Open in IMG/M
3300005468|Ga0070707_100269436All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1656Open in IMG/M
3300005471|Ga0070698_100544831All Organisms → cellular organisms → Bacteria1099Open in IMG/M
3300005471|Ga0070698_101039885Not Available767Open in IMG/M
3300005536|Ga0070697_100495703Not Available1068Open in IMG/M
3300005545|Ga0070695_100769355Not Available769Open in IMG/M
3300005553|Ga0066695_10556665Not Available696Open in IMG/M
3300005713|Ga0066905_100693404Not Available873Open in IMG/M
3300005718|Ga0068866_10164434All Organisms → cellular organisms → Bacteria1297Open in IMG/M
3300005764|Ga0066903_100480819All Organisms → cellular organisms → Bacteria2094Open in IMG/M
3300005829|Ga0074479_10082283All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2541Open in IMG/M
3300005841|Ga0068863_100180280All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2027Open in IMG/M
3300005843|Ga0068860_100047167All Organisms → cellular organisms → Bacteria → Proteobacteria4106Open in IMG/M
3300005843|Ga0068860_100402524Not Available1354Open in IMG/M
3300006049|Ga0075417_10047967All Organisms → cellular organisms → Bacteria1835Open in IMG/M
3300006844|Ga0075428_101104657Not Available837Open in IMG/M
3300006845|Ga0075421_100164584All Organisms → cellular organisms → Bacteria2767Open in IMG/M
3300006846|Ga0075430_100977444Not Available697Open in IMG/M
3300006880|Ga0075429_100246186All Organisms → cellular organisms → Bacteria1565Open in IMG/M
3300006904|Ga0075424_100563323All Organisms → cellular organisms → Bacteria → Proteobacteria1217Open in IMG/M
3300006904|Ga0075424_100862382Not Available966Open in IMG/M
3300006904|Ga0075424_101265420Not Available785Open in IMG/M
3300006969|Ga0075419_10003885All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria9519Open in IMG/M
3300007255|Ga0099791_10004341All Organisms → cellular organisms → Bacteria5815Open in IMG/M
3300007265|Ga0099794_10000571All Organisms → cellular organisms → Bacteria12352Open in IMG/M
3300009094|Ga0111539_10596756Not Available1286Open in IMG/M
3300009100|Ga0075418_12077280Not Available619Open in IMG/M
3300009147|Ga0114129_10237061All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2454Open in IMG/M
3300009148|Ga0105243_10348130Not Available1360Open in IMG/M
3300009156|Ga0111538_10328519All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1936Open in IMG/M
3300009162|Ga0075423_10346876Not Available1557Open in IMG/M
3300009174|Ga0105241_10135367Not Available2000Open in IMG/M
3300009174|Ga0105241_10310448Not Available1356Open in IMG/M
3300009176|Ga0105242_10857940All Organisms → cellular organisms → Bacteria904Open in IMG/M
3300009553|Ga0105249_11391261All Organisms → cellular organisms → Bacteria773Open in IMG/M
3300010359|Ga0126376_10901646Not Available874Open in IMG/M
3300010362|Ga0126377_10296345Not Available1593Open in IMG/M
3300010398|Ga0126383_10374723All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1452Open in IMG/M
3300010398|Ga0126383_11206554All Organisms → cellular organisms → Bacteria846Open in IMG/M
3300010400|Ga0134122_10069417All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2736Open in IMG/M
3300010403|Ga0134123_10008278All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria6992Open in IMG/M
3300012205|Ga0137362_10267002All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1479Open in IMG/M
3300012582|Ga0137358_10177864All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1451Open in IMG/M
3300012685|Ga0137397_10006637All Organisms → cellular organisms → Bacteria → Proteobacteria8029Open in IMG/M
3300012685|Ga0137397_10061885All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2700Open in IMG/M
3300012685|Ga0137397_10143045All Organisms → cellular organisms → Bacteria1770Open in IMG/M
3300012685|Ga0137397_10600027Not Available819Open in IMG/M
3300012922|Ga0137394_10103755All Organisms → cellular organisms → Bacteria2397Open in IMG/M
3300012922|Ga0137394_10117235All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2252Open in IMG/M
3300012929|Ga0137404_10600100Not Available990Open in IMG/M
3300012944|Ga0137410_10019740All Organisms → cellular organisms → Bacteria4632Open in IMG/M
3300012948|Ga0126375_10173229Not Available1387Open in IMG/M
3300013297|Ga0157378_10046305All Organisms → cellular organisms → Bacteria3866Open in IMG/M
3300014269|Ga0075302_1083876Not Available694Open in IMG/M
3300014881|Ga0180094_1020362All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1293Open in IMG/M
3300014884|Ga0180104_1002014All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium4166Open in IMG/M
3300014884|Ga0180104_1212982Not Available576Open in IMG/M
3300015245|Ga0137409_10038223All Organisms → cellular organisms → Bacteria4620Open in IMG/M
3300015264|Ga0137403_10231260All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1767Open in IMG/M
3300015371|Ga0132258_11002078All Organisms → cellular organisms → Bacteria2111Open in IMG/M
3300015373|Ga0132257_101148000Not Available982Open in IMG/M
3300015374|Ga0132255_100295003All Organisms → cellular organisms → Bacteria → Proteobacteria2332Open in IMG/M
3300017961|Ga0187778_10014500All Organisms → cellular organisms → Bacteria4870Open in IMG/M
3300018063|Ga0184637_10043913Not Available2719Open in IMG/M
3300018482|Ga0066669_10209226All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1502Open in IMG/M
3300019487|Ga0187893_10004732All Organisms → cellular organisms → Bacteria → Proteobacteria25470Open in IMG/M
3300019789|Ga0137408_1178921All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2480Open in IMG/M
3300020579|Ga0210407_10456283All Organisms → cellular organisms → Bacteria999Open in IMG/M
3300021081|Ga0210379_10117984All Organisms → cellular organisms → Bacteria1113Open in IMG/M
3300021086|Ga0179596_10204753Not Available962Open in IMG/M
3300021088|Ga0210404_10064868Not Available1759Open in IMG/M
3300021170|Ga0210400_10089470All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2428Open in IMG/M
3300021478|Ga0210402_10639289Not Available985Open in IMG/M
3300021479|Ga0210410_10299049All Organisms → cellular organisms → Bacteria → Proteobacteria1444Open in IMG/M
3300025535|Ga0207423_1005604All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1857Open in IMG/M
3300025549|Ga0210094_1010795Not Available1382Open in IMG/M
3300025899|Ga0207642_10641478Not Available665Open in IMG/M
3300025911|Ga0207654_10378670Not Available980Open in IMG/M
3300025922|Ga0207646_10027499All Organisms → cellular organisms → Bacteria5188Open in IMG/M
3300025923|Ga0207681_10056962All Organisms → cellular organisms → Bacteria2668Open in IMG/M
3300025930|Ga0207701_10160021All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium CSL121993Open in IMG/M
3300025933|Ga0207706_10026531All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium5185Open in IMG/M
3300025934|Ga0207686_10363688All Organisms → cellular organisms → Bacteria1093Open in IMG/M
3300025935|Ga0207709_10206120All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1408Open in IMG/M
3300025938|Ga0207704_10374219Not Available1116Open in IMG/M
3300025961|Ga0207712_10506147All Organisms → cellular organisms → Bacteria1033Open in IMG/M
3300025972|Ga0207668_10046850All Organisms → cellular organisms → Bacteria2957Open in IMG/M
3300026088|Ga0207641_11215751Not Available753Open in IMG/M
3300026118|Ga0207675_100344765All Organisms → cellular organisms → Bacteria1459Open in IMG/M
3300026285|Ga0209438_1003843All Organisms → cellular organisms → Bacteria5099Open in IMG/M
3300026285|Ga0209438_1072146Not Available1124Open in IMG/M
3300026358|Ga0257166_1049371Not Available598Open in IMG/M
3300026360|Ga0257173_1000626All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2471Open in IMG/M
3300026369|Ga0257152_1011487Not Available934Open in IMG/M
3300026446|Ga0257178_1023688Not Available741Open in IMG/M
3300026469|Ga0257169_1026770Not Available845Open in IMG/M
3300026469|Ga0257169_1063106Not Available594Open in IMG/M
3300026475|Ga0257147_1031666Not Available762Open in IMG/M
3300026481|Ga0257155_1014431All Organisms → cellular organisms → Bacteria1091Open in IMG/M
3300026482|Ga0257172_1001504All Organisms → cellular organisms → Bacteria2856Open in IMG/M
3300026482|Ga0257172_1045168Not Available805Open in IMG/M
3300026490|Ga0257153_1008733All Organisms → cellular organisms → Bacteria2014Open in IMG/M
3300026497|Ga0257164_1042241Not Available714Open in IMG/M
3300026507|Ga0257165_1001172All Organisms → cellular organisms → Bacteria3027Open in IMG/M
3300026514|Ga0257168_1050417Not Available912Open in IMG/M
3300027655|Ga0209388_1047621All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1238Open in IMG/M
(restricted) 3300027799|Ga0233416_10063262Not Available1253Open in IMG/M
3300027903|Ga0209488_10478859All Organisms → cellular organisms → Bacteria → Proteobacteria914Open in IMG/M
3300027909|Ga0209382_10304562All Organisms → cellular organisms → Bacteria1796Open in IMG/M
(restricted) 3300028043|Ga0233417_10011195All Organisms → cellular organisms → Bacteria3328Open in IMG/M
3300028381|Ga0268264_10384802All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1344Open in IMG/M
3300028809|Ga0247824_10164593All Organisms → cellular organisms → Bacteria1199Open in IMG/M
3300030336|Ga0247826_11466598Not Available553Open in IMG/M
3300031198|Ga0307500_10215251Not Available579Open in IMG/M
3300031547|Ga0310887_10034882All Organisms → cellular organisms → Bacteria2141Open in IMG/M
3300031562|Ga0310886_10161332Not Available1190Open in IMG/M
3300031720|Ga0307469_10009060All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria4738Open in IMG/M
3300031720|Ga0307469_10144706All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1768Open in IMG/M
3300031720|Ga0307469_10171786All Organisms → cellular organisms → Bacteria1654Open in IMG/M
3300031720|Ga0307469_11280769Not Available695Open in IMG/M
3300031740|Ga0307468_100030428All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2563Open in IMG/M
3300031740|Ga0307468_100044466All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2261Open in IMG/M
3300031740|Ga0307468_100397003Not Available1051Open in IMG/M
3300031820|Ga0307473_10883773Not Available644Open in IMG/M
3300031834|Ga0315290_10582855All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium972Open in IMG/M
3300031873|Ga0315297_10420593Not Available1122Open in IMG/M
3300031943|Ga0310885_10027504All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2191Open in IMG/M
3300031997|Ga0315278_10255712All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1808Open in IMG/M
3300032012|Ga0310902_10147821Not Available1326Open in IMG/M
3300032075|Ga0310890_10007490All Organisms → cellular organisms → Bacteria4989Open in IMG/M
3300032164|Ga0315283_11361387Not Available733Open in IMG/M
3300032174|Ga0307470_10441314All Organisms → cellular organisms → Bacteria933Open in IMG/M
3300032174|Ga0307470_10922724Not Available688Open in IMG/M
3300032179|Ga0310889_10195505All Organisms → cellular organisms → Bacteria932Open in IMG/M
3300032180|Ga0307471_100014948All Organisms → cellular organisms → Bacteria5421Open in IMG/M
3300032180|Ga0307471_100286740All Organisms → cellular organisms → Bacteria1725Open in IMG/M
3300032180|Ga0307471_100620400All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1242Open in IMG/M
3300032180|Ga0307471_103441965Not Available560Open in IMG/M
3300032205|Ga0307472_100222980Not Available1454Open in IMG/M
3300032829|Ga0335070_10723819Not Available929Open in IMG/M
3300033004|Ga0335084_10008559All Organisms → cellular organisms → Bacteria10202Open in IMG/M
3300033433|Ga0326726_10011432All Organisms → cellular organisms → Bacteria → Proteobacteria7825Open in IMG/M
3300033550|Ga0247829_10230980All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1481Open in IMG/M
3300033551|Ga0247830_11145116Not Available621Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil11.95%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.32%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere10.06%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil9.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil6.29%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.03%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment3.77%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.77%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere3.77%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil3.14%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere2.52%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere2.52%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.89%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.89%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.89%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.89%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.89%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.89%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.26%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.26%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.26%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.26%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.26%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.63%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.63%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.63%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.63%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere0.63%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.63%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.63%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.63%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.63%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.63%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004024Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300005213Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D2EnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005457Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005718Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005829Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBCEnvironmentalOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006846Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006969Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009156Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014269Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D1EnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300025535Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025549Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025911Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026369Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-AEnvironmentalOpen in IMG/M
3300026446Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-BEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026475Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-AEnvironmentalOpen in IMG/M
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026490Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-AEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028809Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48EnvironmentalOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300031198Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 14_SEnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031834Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_0EnvironmentalOpen in IMG/M
3300031873Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G15_0EnvironmentalOpen in IMG/M
3300031943Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D2EnvironmentalOpen in IMG/M
3300031997Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0EnvironmentalOpen in IMG/M
3300032012Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032164Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_0EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032179Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
F24TB_1340140913300000550SoilMTWLVCALLAACVALASAGCFLIPSAPRPDPNAPLAGPAPYVEDCQLCHASPVAAHYAMSLHTTKGIRCGQCHTPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDSAARKALRAEGFTAPAPGGGRRFVGDSSSGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARTGHFEIPTPEG
JGI10216J12902_10726291623300000956SoilMATLQRRSGAAWVWGVLTGPLAAALGGCFLLGSPGRPDPTAVSGPVPYVENCQACHAAPVGAQYVQSLHASMGIRCGQCHTPGGHPDFSQPVRDGKCGGCHLPEYQQTIASRHFARRQQRPLDHDRMARVELRRDAFTATTAAGRVFVGDSSSGALGGRLCAACHYDEHRLGRGAVQRVNFCTGCHTDRDAHFPIPIPGNRCVQCHVRAGQTVEGQDVNTHRFTRPGGSS*
F14TB_10115468123300001431SoilMTRSQVSEQRRRAVASAWSMLVGAIAVALGGCFLVGAPERPDRGSVSGPVPYVEDCQACHAAPVGAHYLQSLHASMGVRCGQCHTPGGHPDFRQPVRDAKCGGCHLPEYQQTIGSRHFARRDQRPLDGDRAARARLRRDGFMANTAQGRVFVGDASSGDLGGRLCAACHYDEHRLGRGAVQRVNFCTGCHTDRDAHFPIPIPGNRCVQCHVRAGQTVEGQDVNTHRFTRPGGSS*
F14TB_10218185413300001431SoilGPPSAWPSWPVPPTSATLKKPDTRTALAESARIAPETTVRRRSAICWLMWVPLVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSVHTAKGIRCGQCHTPGDHPDFSQPVQDGKCGGCHQAQYQQTLGSKHFAGRVQRPLDSDRAARASLRRDGFTASTGTKRHFVGDSASAELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGRAGHFPIPTPGLTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0055438_1007184813300003995Natural And Restored WetlandsMASVRTVLADSARMAPETAVNRRPAISWLRWVLLVAPLALASAGCFLAPWSPPRPDPAAPISGPAPYVEGCQTCHAAPVGYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDADRVARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLDAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARVGETV
Ga0055436_1000401153300004024Natural And Restored WetlandsMVAIAVAFGGCFLAPWSPDRPDPAAPVSGPVPYVEDCQACHGTRVGTAHAQSLHVAKGIRCGQCHTPGGHPNFAQPIRDGKCGGCHQPQYQQTIESKHFASRQLRALDDDRAARATLRREGFTAASSGGPHFVGDSSSGDLGGRLCMACHYDEHRLGLARVRRADFCTGCHAGREDHYPMSTPSLPNRCTECHVRAGKTVNGQVVDIHRFAKPGDEGAGR*
Ga0055500_1001151123300004062Natural And Restored WetlandsMAPETAVNRRPAIGRLRWVLLVAPLALASAGCFLAPWSPPRPDPAAPISGPAPYVEGCQACHAAPVGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARAALRREGFTAATARGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARVGETVSGQVVNTHRFARPGAESAGR*
Ga0062590_10025835713300004157SoilLIPSAPRPDPNAPIAGPAPYVEDCQMCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ*
Ga0066395_1006704613300004633Tropical Forest SoilMQVGAVAPSSESRFATSRLSPLLLVLVIALESAGCFLMSRTPPRPHPTESVSGPAPYIEGCQDCHATPVGAHYAQSLHAAMGIRCGQCHAPDGHPNFLRPIEDGKCGGCHQPQYQQTLASKHFATRELRALDGDPAARQALRADGFTAPTATGRRFVGDSSSRELGGRLCAACHYDEHRLGLRPVDRADFCTGCHAGREQHFPAAATPGLANRCMECHVRVGQTVKGQVVNIHLFARPGATSAGR*
Ga0068998_1001057623300005213Natural And Restored WetlandsLASAGCFLAPWSPPRPDPAAPISGPAPYVEGCQACHAAPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFTTGTARGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARVGETVSGQVVNTHRFARPGAESAGR*
Ga0070668_10016284113300005347Switchgrass RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTTPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTEPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0070708_10038449823300005445Corn, Switchgrass And Miscanthus RhizosphereMKRRSATSWLTWGLLAAPIALTITGCFLSPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0070662_10014209913300005457Corn RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0068867_10007021223300005459Miscanthus RhizosphereMVWLARTLLAAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTEPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0070706_10048548413300005467Corn, Switchgrass And Miscanthus RhizosphereMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0070707_10026943623300005468Corn, Switchgrass And Miscanthus RhizospherePRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0070698_10054483123300005471Corn, Switchgrass And Miscanthus RhizosphereMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRELRALDDDRAARGTLRREGFTAATSGGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGLTNRCVLCHVRAGQTVSGHVVNTHRFARPGAAGDEQ*
Ga0070698_10103988523300005471Corn, Switchgrass And Miscanthus RhizospherePRPDPAGSVSGPAPYAEGCQTCHATPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQEILASQHFATRDLRALDDDRAARKALRLEGFTAATAGGRRFVGDPSSGDLGGRLCAACHYDEHRLGLRVVQRADFCTGCHAGRKDHFPSTTAGPANRCMECHVRVGQTVSGQVVNTHRFARP*
Ga0070697_10049570323300005536Corn, Switchgrass And Miscanthus RhizosphereLVAPLALATAGCFLAPGSGPRPDSTGPISGPAPYVEGCQACHEAPVGVHYAQSLHAPKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFVTRVQQALDADQAARNTLRREGFTIATAQGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREEHFPSPTAGLTNRCVECHVRVGKTASGQVVNTHRFARP*
Ga0070695_10076935513300005545Corn, Switchgrass And Miscanthus RhizosphereMTWLVWALLAAPFALASAGCFLIPSAPRPDPNAPIAGPAPYVEDCQMCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ*
Ga0066695_1055666513300005553SoilRPDPAGPVSGPAPYVEGCQTCHATGVGAQYAQSLHTAKSIRCGQCHTPGNHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRVQQALDADQAARNTLRREGFTVATAQGRRFVGDSSSGDLGGRLCAACHYDEHRLDLRAVQRADFCTGCHTGREEHFPSPTPGLTNRCVECHVRVGKTASGQVVNTHRFARP*
Ga0066905_10069340413300005713Tropical Forest SoilVNRGSATGWLKCVLLVAPIALTSVGCFLAPGSPPRPDPVGSVSGPAPYVAGCEDCHATPVGAHYAQSLHTPKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLVSKHFATRDLRGLDDDRAARETLRQEGFTVTTAGRRRFVGDSSSGALGGRLCAACHYDEHRIDLGAVQQANFCTGCHTGRETHFPIPTPGLTNRCVQCHVRVGQTMSGQVVNTHRFARPGETSAGQ*
Ga0068866_1016443423300005718Miscanthus RhizosphereMVWLARTLLAAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQVWHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGK*
Ga0066903_10048081923300005764Tropical Forest SoilVAPSSASRFATSRLSPLLLVLVIALESAGCFLMSRTPARPHPTESVSGPAPYVEGCQDCHATPVGAHYAQSLHAAMGIRCGQCHAPDGHPNFLRPIEDGKCGGCHQPQYQQTLASKHFATRELRALDGDPAARQALRADGFTAPTATGRRFVGDSSSRELGGRLCAACHYDEHRLGLRPVDRADFCTGCHAGREQHFPAAATPGLANRCMECHVRVGQTVKGQVVNIHLFARPGATSAGR
Ga0074479_1008228323300005829Sediment (Intertidal)LPYSLAMELSRFCWVAWLRRANLVVAIAVALGGCFLAPWSPDRPDPAAPVAGPAPYVEDCQACHGARVGGSYAQSLHAAKGIRCGQCHTPGGHPDFAQPIRDGKCGGCHQPQYQQTLESKHFFSRQLRALDDDRAGRATLRREGFTAATPEGRRFVGDSSSGDLGGRLCVACHYDEHRLGLASVRRAEFCARCHAGREDHYPMSTPGSPNRCIECHVRAGKTVSGQVVDVHRFARPGNEGAGR*
Ga0068863_10018028013300005841Switchgrass RhizosphereVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0068860_10004716753300005843Switchgrass RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTTPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0068860_10040252423300005843Switchgrass RhizosphereVAPIVLASAGCFLAPGSSPRPDPAAPISGPAAYVEGCQDCHATPVGAHYAQSLHTTKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLVSKHFATRDLRALDNDRDARQALRLEGFTVATARGRRFVGDSSSGELGGRLCAACHYDEHRLGLGLVQRADFCTGCHAGREMHFPIPTPGLANRCVQCHVRAGQTVSGQVVNTHRFARPGAESAGQ*
Ga0075417_1004796723300006049Populus RhizosphereMWLRISSIRKKPDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGTHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0075428_10110465713300006844Populus RhizosphereTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQSCHATPVGAHYAQSVHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFIAATAAGRRFVGDESSGELGGRLCAACHYDEHRLGLGAVRRADFCAGCHAGRAEHFPTPTPELTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAER*
Ga0075421_10016458423300006845Populus RhizosphereMLVAPIVLASAGCFLAPGSHPRPDPAAPISGPAPYVENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGGAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVGQTESGQVVNTHRFARPGAASAER*
Ga0075430_10097744413300006846Populus RhizosphereRIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGTHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRF
Ga0075429_10024618613300006880Populus RhizosphereMLKKPDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGTHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRFAR
Ga0075424_10056332323300006904Populus RhizosphereVNRYSSIGWVTWALLVAPIALATAGCFLAPGSGSRPDSTGPISGPAPYVEECQSCHEAPVGVHYAQSLHTPKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLVSKHFATRDLRALDDDRAARQALRLEGFTVATAKGRRFVGDSSSGDLGGRLCAACHYDEHRIGLGAVQRADFCTGCHVGREMHFPIDTPGLTNRCVQCHVRVGQTVSGQVVNTHRFARPGAASAGQ*
Ga0075424_10086238213300006904Populus RhizosphereVSTVRLGWLALALAVTLGGCFLASWSPSRPDPAVPVSGPVPYVESCPACHGSRVAAAYAESLHAAKGIQCGQCHTPAGHPDFVQPVSDGKCGGCHQPQYQQSLQSRHFETRQASALDGDRPAREALRRAGFTAATGAGRHFVGDASSGELGGRLCVACHYDEHRLGLATVQRAEFCTGCHADRDGHFPVSSPETPNRCIQCHVRVGETESRQVVDTHRFAPPGSEDPDR*
Ga0075424_10126542013300006904Populus RhizosphereVNRGSAIGWLTWVLVVAPIVLASAGCFLAPGSSPRPDPAGPVSGPAPYVEGCETCHATPVGAHYAQSLHTAKGIRCGQCHTPGGHPDFTQPIRDGKCGGCHQPQYQETLASKHFATRGLRALDDDRAARQALRLEGFTAATAGARRFVGDSSSGKLGGRLCAACHYDEHRLGLGVVQRADFCTGCHAGREMHFPIDTPGLTNRCVPCHVRVGQTVSGQVVNTHRFARPGATSA
Ga0075419_1000388563300006969Populus RhizosphereMLKKPDTRTALAESVRIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0099791_1000434153300007255Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0099794_10000571143300007265Vadose Zone SoilGQGSAGPEASFLGMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0111539_1059675623300009094Populus RhizosphereLPLGELVAARRPDTRIALAESVRIAPEATVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAERAGHFPTPTPELTNRCVECHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0111539_1306305213300009094Populus RhizosphereDPAAPISGPAPYVENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPMRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGGAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVGQTES
Ga0075418_1207728013300009100Populus RhizosphereVSVWLARALLVASAGLALGGCFLLSPRRPDPGAPVSGPAPYVEGCLACHGARVAEQYARSLHTAMGIRCGQCHTPGGHPNFVQPVRDGKCGGCHQPAYQQTLRSKHFTSRLQRVLDDDRAARTTLRREGFMGATADRPHFVGDSTSGELGGRLCAACHYDEHRLGLGTVQRADFCVGCHTDRENHFPVSTPDVTTNRCTECHVRA
Ga0114129_1023706123300009147Populus RhizosphereMSRGDAEIAMIRSQVSEQRRRAVASAWSMLVGAIAVALGGCFLVGAPERPDRGSVSGPVPYVEDCQACHAAPVGAHYLQSLHASMGVRCGQCHTPGGHPDFRQPVRDAKCGGCHLPEYQQTIGSRHFARRDQRPLDGDRAARARLRRDGFMATTAQGRVFVGDASSGDLGGRLCAACHYDEHRLGRDAVQRADFCTGCHTNRTEHFPSPTDGNRCVQCHVRVGQTVKDQNINTHRFAKPGD*
Ga0105243_1034813023300009148Miscanthus RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFPMPGAECK*
Ga0111538_1032851913300009156Populus RhizosphereMWLRISSILKKPDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSAHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTALRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAGRAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0075423_1034687623300009162Populus RhizosphereMSAIRMRGTQQGRPVAAWVRGVLTGPLIVALGGCFLLGSPGRPDPTAVSGPVPYVENCQACHAAPIGAQYVQSLHASMGIRCGQCHTPGGHPDFSQPVRDGKCGGCHLPEYQQTVTSRHFARRQQRPLDHDRMARVELRRDAFTETTAAGRVFVGDSSSGELGGRLCAACHYDEHRLGRGAVQRADFCTGCHTDRDAHFPIPTPGNRCVQCHVRAGQTVDGQDVNTHRFTRPGGSAQ*
Ga0105241_1013536723300009174Corn RhizosphereMAWLVRTLLAAPLVLVTAGCFLTPGSSRPDPTVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0105241_1031044823300009174Corn RhizosphereMRRAFLAVFAASITLASAGCFLIPSAPRPDPNAPIAGPAPYVEDCQMCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ*
Ga0105242_1085794013300009176Miscanthus RhizosphereMAWLVRTLLAAPLVLVTAGCFLTPGSSRPDPTVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGK*
Ga0105249_1139126113300009553Switchgrass RhizosphereMNRHLMVWLARTLLSAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQV
Ga0126376_1090164613300010359Tropical Forest SoilMAAGDLHASTPRRGPSRAILLVVAALLGISGCFLMSRTPGRPDPTQPVSGPAIAAERCQTCHAAPVAEEYANSLHAAMGIRCGQCHAPGGHPNFTRPVRDATCAGCHQPEYQQTIASLHFVDRQPRALDGDRSARVALRAEGFVATTNEGRRFVGDSTSGDLGGRLCAACHYDEHRLGLGAVQRPDFCVECHTGLDRHFASAPPGPANRCAQCH
Ga0126377_1029634523300010362Tropical Forest SoilMRWIVWTLLATPIVLVGSGCFRSPRPDPNAPIAGPAPYVENCQVCHAEPVAAHYAMSLHTAKGIQCGQCHTPGGHPNFTEPVRDGKCGGCHQPQFQETLTSKHFLTRELLALDQDRAARKTLRGEGFTVPVAGGRRFVGDVSSAALGGRLCVACHYDEHRLDLAVVQRADFCSSCHADREQHFAFPTPGFANRCMQCHVRVGETVAGQIVNTHRFARPGTASK*
Ga0126383_1037472323300010398Tropical Forest SoilVVAIALAFGGCFLNPFAPTRPNPEAVSGPVVYVEDCETCHADRVGRQYAQSLHAAMGIHCGQCHAGAGHPDFTQPVRDAKCGGCHTPQYEQTLGSRHFATRVQRSLDADRAARVTLRREGFVAGTATGQHFVGDAVSGELGGRLCAACHYDGHRLGLGAVQGEHFCERCHAGREEHYPIPTPNPTNRCVQCHVRVGETVIGQIVDTHRFAAPGAEGSQK*
Ga0126383_1120655423300010398Tropical Forest SoilMRRPSLMPRLTWMLLAAAVVLASAGCFLVPSAPRPDPNAPIAGPAPYVEDCQSCHDASSPALAHYAQSLHAAKGIRCGQCHTPGNHPAFAEPIQDGKCGGCHQPQYQETLTSKHFVTRELRALNDDPEARKTLRQDGFTAAAPGGGRRFVGDRAAGALGGRLCAACHYDEHRLGLAAVQRADFCTGCHVGRENHFALPTPGFTN
Ga0134122_1006941723300010400Terrestrial SoilAGPVPYVENCQMCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ*
Ga0134123_1000827873300010403Terrestrial SoilMGGRSLMTWLVWALLAAPFALASAGCFLIPSAPRPDPNAPIAGPAPYVEDCQMCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ*
Ga0137362_1026700223300012205Vadose Zone SoilPISGPIPYVEGCQTCYAAPVAAHYAESLHTAKGIRCGQCYTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0137358_1017786423300012582Vadose Zone SoilGPEASFLGMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0137397_1000663723300012685Vadose Zone SoilMAPETAVNRRSAIGRLTWVLLVTPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEGCQTCHAAPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAEREGHFLIPTPGLTNRCVQCHVRVSQTVSGQVVNTHRFARPGAASAGR*
Ga0137397_1006188523300012685Vadose Zone SoilMAPETTVHRRFAIGWRMCVLLVAPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEECQTCHATPVGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFIAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGRGAAQRADFCTGCHAGRAGHFRIPTPGLTNRCVQCHVRVGQTVSGQVVNTHRFARPRAEGAGR*
Ga0137397_1014304533300012685Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFPIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGR*
Ga0137397_1060002723300012685Vadose Zone SoilMNRRSMTAWLVWTLLAAPLVLASAGCFLMPGSTPRPDPQAPVAGPAPYVKNCQLCHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSKHFASRAQLALDSDRAARKTLRGEGFTVPVTGGRHFVGDSSSGELGGRLCVACHYDEHRLDLAVVQRADFCTSCHAGRDAHFAIPTPGVANRCVQCHVRAGQTVSGQVVNTHRFAKPGTEGSGR*
Ga0137394_1010375513300012922Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGR*
Ga0137394_1011723513300012922Vadose Zone SoilMAPETTVHRRFAIGWRMCVLLVAPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEECQTCHATPVGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDDDRAARTALRREGFIAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGRGAAQRADFCTGCHAGRAGHFPIPTPGLTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR*
Ga0137404_1060010013300012929Vadose Zone SoilVVAISVAFGGCFLSPWSPDRPDPAAPVSGPAPYVEDCQGCHAAPVGASYAQSLHAAKGIRCGQCHTPGGHPDFAQPIRDGKCGGCHQPQYEQTLESKHFASRQLRPLDDDRAERATVRREGFIAATPEGRRFVGDSSSGALGGRLCATCHYDEHRLGLASVRRADSCAACHAGREDHYPMSTPALSNRCIQCHVRAGKTVSDQVVDTHRFARPGDEGAGR*
Ga0137410_1001974033300012944Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRRKGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0126375_1017322923300012948Tropical Forest SoilMKWAVWVLRVLPIVLASGGCFLLGSARPDSNAPVTGPAPYVENCQACHATPVAAHYAMSLHTTMGIKCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSKHFTTRQSLALDDNRDARKALRREGFTVPVAGGRRFVGDPASGPLGGRLCAACHYDEHRLDLATVQRADFCTGCHTDRGDHFPDPTPGFTNRCVPCHVREGETVTGQIVNTHRFAKPGAAAGGE*
Ga0157378_1004630533300013297Miscanthus RhizosphereMAWLVRTLLAAPLVLVTAGCFLTPGSSRPDPTVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGMLGGRLCAACHYDEHRLDLVVVQRADFCTSCHADRDAHYPDPTLGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK*
Ga0075302_108387613300014269Natural And Restored WetlandsMAPETAVNRRPAISWLRWVLLVAPLALASAGCFLAPWSPPRPDPAGPVSGPAPYVEGCQTCHAAPVGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDADRVARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARV
Ga0180094_102036223300014881SoilSAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYVQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQATLASEHFASREQRALRGDQAARTALRREGFIAAGAGGRRFVGDSSSVDLGGRLCAACHYDEHRLGLGAVQRAGFCTGCHAGREEHFPAPTPAPTNRCVQCHVRVGQTVSGQVVNTHRFARP*
Ga0180104_100201413300014884SoilAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQGCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQATLASEHFASREQRALRGDQAARTALRREGFIAAGAGGRRFVGDSSSVDLGGRLCAACHYDEHRLGLGAVQRAGFCTGCHAGREEHFPAPTPAPTNRCVQCHVRVGQTVSGQVVNTHRFARP*
Ga0180104_121298213300014884SoilPGSPPRPDSAVAVSGPAPYVEGCQACHAAPLGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPTQDGKCGGCHQPQYQETLASKHFATRDQRVLDADPAARRALRLEGFTVATAEGRRFVGDSSSGELGGRLCAACHYDEHRLGFGAVQRADFCAGCHAGGEGHFPIPTPGLTNRCVQCHVRVGQTVSGQVV
Ga0137409_1003822333300015245Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRRKGFTVATREGQRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK*
Ga0137403_1023126013300015264Vadose Zone SoilVVAISVAFGGCFLSPWSPDRPDPAAPVSGPAPYVEDCQGCHAAPVGASYAQSLHAAKGIRCGQCHTPGGHPDFAQPIRDGKCGGCHQPQYEQTLESKHFASRQLRPLDDDRAERATVRREGFIAATPEGRRFVGDSSSGALGGRLCATCHYDEHRLGLASVRRADSCAACHAGREDHYPMSTPALSNRCIQCHVRAGKTVRDQVVDTHRFARPGDEGAGR*
Ga0132258_1100207823300015371Arabidopsis RhizosphereMAWLRWALLAVPLVLGSAGCFLVPSAPRPDPNAPIAGPAPYVENCQMCHASPVGAHYAMSLHTTKGIRCGQCHTPDGHPNFAQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRSEEFTAPAPGGGRRFVGDSASGALGGRLCVACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDFPTPEVTNRCLQCHVRVGETVTGQVVNSHRFAKPGAESSGQ*
Ga0132257_10114800023300015373Arabidopsis RhizosphereAVPFVLESAGCFLVPSAPRPDPNAPIAGPAPYVENCQMCHASPVGAHYAMSLHTTKGIRCGQCHTPDGHPNFAQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRSEGFTAPAPGGGRRFVGDSASGALGGRLCVACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDFPTPEVTNRCLQCHVRVGETVTGQVVNSHRFAKPGAESSGQ*
Ga0132255_10029500323300015374Arabidopsis RhizosphereMAWLRWALLAVPLVLGSAGCFLVPSAPRPDPNAPIAGPAPYVENCQMCHASPVGAHYAMSLHTTKGIRCGQCHTPDGHPNFAQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDSASGALGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDFPTPEVTNRCLQCHVRVGETVTGQVVNSHRFAKPGAESSGQ*
Ga0187778_1001450023300017961Tropical PeatlandVIRRATIRWGTRVLLVAPIGLAIAGCFLAPSAPRPDPAGSVSGPAPYVEECQTCHAVRIGAHYAQSMHATKGIRCGQCHTPGHHPDFMQPVRDGTCGGCHQPQYQETLASTHFASREQHALDGDRAARTELRRAGFIVAAAKSRRFVGDSSSGELGGRLCSACHYDDHRLGLGAVQRADFCTGCHAGRDTHFPISTPGSTNRCVGCHVRVGQTVSGQVVNTHRFAKPGADGDK
Ga0184637_1004391323300018063Groundwater SedimentMNLTQFISRLTRKSRSKNRRSAIAWATWAMLVAPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEECQSCHATPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETSASKHFASREQRALRDDQAARTTLRREGFTAAAAGGRRFVGDSSSVDLGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREEHFPAPTPDSTNRCVKCHLRVGQTVSGQLVNTHRFARP
Ga0066669_1020922613300018482Grasslands SoilARMGPEATVNRYSSIGWMTWALLVAPIALATAGCFLAPGSGPRPDSTGPISGPAPYAEECQACHEAPVGVHYAQSLHTPKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRVQRVLDADQAARNTLRLEGFTVATAQGRRFVGDSSSGDLGGRLCAACHYDEHRLDLRAVQRADFCTGCHTGREEHFPSPTPGLTNRCVECHVRVGKTASGQVVNTHRFARP
Ga0187893_1000473263300019487Microbial Mat On RocksMPPPAVLAWLAWVVRVVSLLALGGCFLVSRAPDRPDPVESVSGPAVSVERCAACHAAAEQYAKSLHTAKGIRCGQCHTPGGHPNFTQPVEDGKCGGCHQPAYQQTLASQHFAGRQLRSLDGDRAARAMLRREDFVTTTAVGRRFVGDSTSGDLGGRLCAACHYDEHRLGLAAVQRADFCVGCHTDLERHFPIPTPGLTNRCTRCHVRAGTTDSGQPLNTHRFARPGAERAGR
Ga0137408_117892123300019789Vadose Zone SoilGRHRTLVLCGKVLATARMQPETAVNRRSAIGSLTWVLLVTPIALASAGCFLAPWSPGRPDPAGPVAGPAPYVEGCQTCHAAPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRGQRALDDDRAARTALRREDFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREGHFLIPTPGLTNRCVQCHVRVSQTVSGQVVNTHRFARPGAEGAGR
Ga0210407_1045628323300020579SoilMGWLTRVLLVGPLALASAGCFLAPWSPPRPDPAGSVSGPAPYAEGCQTCHATPVGAHYAQSLHAAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQEILASQHFATRDLRALDDERAARKALRLEGFTAATAGGRHFVGDATSRELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGLTNRCVLCHVRTGQTASGQVVNTHRFARPGAAGDSQ
Ga0210379_1011798423300021081Groundwater SedimentMTPPRFVTRLTRKSRSKNRRSAIAWATWAMLVAPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQETAASKHFASREQRALRDDQAARATLRREGFTAAAAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGREEHFPIPTPGSTNRCVQCHVRVGQTVSGQVVNTHRFARP
Ga0179596_1020475313300021086Vadose Zone SoilLASAGCFLASSSPPRPESAGPVSGPTPYVEECETCHAPVGAHYAQSLHTAKGIRCGQCHTPGDHPDFTQPIQDGKCGGCHQPQYQETLTSKHFATRDPRALDHDRAARQALRLEGFTAATAGGRRFVGDSSSGDLGGRLCAACHYDEHRLGLRVVQRGDFCTGCHAGREDHFPSTTADPANRCMECHVRVGQTVSGQVVNTHRFALP
Ga0210404_1006486823300021088SoilMGWLTRVLLVGPLAFASAGCFLAPWSPPRPDPAGSVSGPPPYAEGCQTCHATPVGAHYAQSLHTAKGIRCGQCHTPGNHPNFTQPVRDGTCGGCHQPQYQETLTSKHFATRDQRALDDDRAGRTALRREGFTAATAGARHFVGDASSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGPTNRCVLCHVRAGQTVSGQVVNTHRFARAGAAGHSQ
Ga0210400_1008947023300021170SoilMGWLTRVLLVGPLALASAGCFLAPWSPPRPDPAGSVSGPAPYAGGCQTCHATPVGAHYAQSLHAAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQEILASQHFATRDLRALDDERAARKALRLEGFTAATAGGRHFVGDATSRELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGLTNRCVLCHVRTGQTASGQVVNTHRFARPGAAGDSQ
Ga0210402_1063928923300021478SoilMGWLTRVLLVGPLAFASAGCFLAPWSPPRPDPAGSVSGPAPYAEGCQTCHATPVGAHYAQSLHAAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQEILASQHFATRDLRALDDERAARKALRLEGFTAATAGGRHFVGDATSRELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGLTNRCVLCHVRTGQTASGQVVNTHRFARPGAAGDSQ
Ga0210410_1029904923300021479SoilMGWLTRVLLVGPLAFASAGCFLAPWSPPRPDPAGSVSGPAPYAEGCQTCHATPVGAHYAQSLHTAKGIRCGQCHTPGNHPNFTQPVRDGTCGGCHQPQYQETLTSKHFATRDQRALDDDRAGRTALRREGFTAATAGARHFVGDASSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREVHFPIPTPGPTNRCVLCHVRAGQTVSGQVVNTHRFARAGAAGHSQ
Ga0207423_100560423300025535Natural And Restored WetlandsMAPETAVNRRPAISWLRWVLLVAPLALASAGCFLAPWSPPRPDPAGPVSGPAPYVEGCQTCHAAPVGTHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDADRVARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARVGETVSGQVVNTHRFARPGAESAG
Ga0210094_101079513300025549Natural And Restored WetlandsMAPETAVNRRPAISWLRWVLLVAPLALASAGCFLAPWSPPRPDPAAPISGPAPYVEGCQTCHAAPVGYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFATRDQRALDADRVARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVKRADFCTGCHAGREEHFSSPTPDPTNRCVQCHARVGETVSGQVVNTHRFARPGAESAGR
Ga0207642_1064147813300025899Miscanthus RhizosphereLSERVAMRWLVWALRVVPIVLASGGCFLWSSPPRPDPNAPISGPAPYVEDCQMCHAAPVAAHYAMSLHTAMGIKCGQCHTPGGHPNFAQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNT
Ga0207654_1037867023300025911Corn RhizosphereIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207646_1002749953300025922Corn, Switchgrass And Miscanthus RhizosphereMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0207681_1005696223300025923Switchgrass RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTTPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207701_1016002143300025930Corn, Switchgrass And Miscanthus RhizosphereVAPIVLASAGCFLAPGSSPRPDPAAPISGPAAYVEGCQDCHATPVGAHYAQSLHTTKGIRCGQCHTPGGHPNFTPPIRDGKCGGCHQPQYQETLVSKHFATRDLRALDNDRDARQALRLEGFTVATARGRRFVGDSSSGELGGRLCAACHYDEHRLGLGLVQRADFCTGCHAGREMHFPIPTPGLANRCVQCHVRAGQTVSGQVVNTHRFARPGAESAGQ
Ga0207706_1002653143300025933Corn RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207686_1036368823300025934Miscanthus RhizosphereMVWLARTLLAAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPDPTLGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGK
Ga0207709_1020612013300025935Miscanthus RhizosphereDCQMCHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207704_1037421923300025938Miscanthus RhizosphereHSMAWLVRTLLAAPLVLVTAGCFLTPGSSRPDPTVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207712_1050614713300025961Switchgrass RhizosphereMSGRSLRTWLVWTLLGVSIALASAGCFLVPSAPRPDPTAPIAGPVPYVENCQMCHASPVAAHYAMSLHTTKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRGEGFTAPAPGGGRRFVGDSSSGELGGRLCAACHYDEHRLDLAVVRRDGFCTGCHGARPGHFDFPVPEGTNRCLQCHVRVGETVTGQVVNSHRFARPGAE
Ga0207668_1004685033300025972Switchgrass RhizosphereMAWLVRTLFAAPLVLVTAGCFLTPGSSRPDPTTPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTEPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0207641_1121575113300026088Switchgrass RhizosphereVPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTG
Ga0207675_10034476523300026118Switchgrass RhizosphereMVWLARTLLAAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0209438_100384343300026285Grasslands SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0209438_107214613300026285Grasslands SoilEAGSLGMKLTWGLLAAPVALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCQTCHAVPVAAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQETLGSKHFAGRERRALDDDRAARGTLRREGFTAAVSGGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFPIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGR
Ga0257166_104937113300026358SoilVLLVAPIALASAGCFLAPGSPPRPDSTEPISGPALYVEGCQACHAEPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLTSKHFATRDKRVLDADQTARRALRLEGFTVATAQGRRFVGDSSSGDLGGRLCAACHYDEHRLSLGAVQRADFCTGCHAGRGEHFPSPTPGSTNRCVECHVSV
Ga0257173_100062633300026360SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARATLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257152_101148723300026369SoilATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257178_102368813300026446SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARP
Ga0257169_102677013300026469SoilTTVNRHSTIGWLTWVLLVAPIALASAGCFLAPGSPPRPDSTEPISGPALYVEGCQACHAEPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLTSKHFATRDKRVLDADQTARRALRLEGFTVATAQGRRFVGDSSSGDLGGRLCAACHYDEHRLSLGAVQRADFCTGCHAGRGEHFPSPTPSSTNRCVECHVSVGKTVSGQVVNTHRFARP
Ga0257169_106310613300026469SoilVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDAGR
Ga0257147_103166613300026475SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVN
Ga0257155_101443123300026481SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257172_100150423300026482SoilVAPETTVNRRSAIGWLTWVLLVAPIALASAGCFLASWSPPRPDPAEPVSGPAPYVEGCQTCHAAPIGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFAARDQQALDDDRAARTALRREGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREGHFLIPTPGLTNRCVQCHVRMGQTVRGQVVNTHQFARPGAASAG
Ga0257172_104516813300026482SoilRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257153_100873313300026490SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEH
Ga0257164_104224113300026497SoilLGMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257165_100117223300026507SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQFQETLASKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0257168_105041723300026514SoilSDGPEASFLGMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDRAARGTLRREAFTVATSGGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0209388_104762123300027655Vadose Zone SoilLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
(restricted) Ga0233416_1006326213300027799SedimentMMEPMNRPSILRSRMVRSTPWGLVVVAALALGGCFLAPWSPDRPSPSAPVSGPVPYVEDCQGCHAEPVGGHYAQSLHAAKGVRCGQCHTPGGHPNFTRPVEDGKCGGCHQPQYQQTLHSRHFATRQARRLDDDQPARAALRRAGFVAAGPAGRAFVGDTASGALGGRLCAACHYDEHRLGLRAVRAAPFCTGCHTDRESHYPDPAPGTPNRCVQCHVRTGETVSGQVVDTHRFGLPGAEETGR
Ga0209488_1047885923300027903Vadose Zone SoilMKRRSATSWLTWGLLAAPIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLASKHFAGRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVMLTPSFE
Ga0209382_1030456213300027909Populus RhizosphereMLVAPIVLASAGCFLAPGSHPRPDPAAPISGPAPYVENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGGAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLT
(restricted) Ga0233417_1001119523300028043SedimentMMEPMSRPSILRSRMVRSTPWGLVVVAALALGGCFLAPWSPDRPSPSAPVSGPVPYVEDCQGCHAEPVGGHYAQSLHAAKGVRCGQCHTPGGHPNFTRPVEDGKCGGCHQPQYQQTLHSRHFATRQARRLDDDQPARAALRRAGFVAAGPAGRAFVGDTASGALGGRLCAACHYDEHRLGLRAVRAAPFCTGCHTDRESHYPDPAPGTPNRCVQCHVRTGETVSGQVVDTHRFGLPGAEETGR
Ga0268264_1038480223300028381Switchgrass RhizospherePIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDRAARKTVRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLAVVQRADFCTSCHADRDAHYPNPTPGPANRCVECHVRVGQTVTGQVVNTHRFTMPGAEGTGK
Ga0247824_1016459323300028809SoilMLVAPIVLASAGCFLAPGSHPRPDPAAPISGPAPYVENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGDAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVGQTESGQVVNTHRFARPGAASAER
Ga0247826_1146659813300030336SoilPRPDPAAPISGPAPYVENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGGAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVGQTESG
Ga0307500_1021525113300031198SoilEECQTCHASPVAAHYAMSLHTTKGIRCGQCHTPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMPLDHDPAARKALRTEGFTAPAPGGGRRFVGDSASGELGGRLCAACHYDEHRLDLAVVRRDLFCISCHGPRPGHFDFPTPEVTNRCLQCHVRVGETRTGQVVNSHRFAKPGAEGSGQ
Ga0310887_1003488223300031547SoilMWLRISSILKKPDTRNALAESARIAAETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSAHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPTPTPELTNRCVECHVRVGQTVSGQVVNTHRFARPGAENAGQ
Ga0310886_1016133213300031562SoilMWLRISSILKKLDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSAHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFITATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPTPTPELTNRCVECHVRVGQTVSGQVVNTHRFARPGAENAGQ
Ga0307469_1000906023300031720Hardwood Forest SoilMKRRSATSRLTWGLLAASIALTITGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQSQYQETLGSKHFAGRERRSLDDDRAARGTLRREGFTAATGEGRRFVGDASSGELGGRLCSMCHYDEHRLGLGPVQRTDFCTTCHAGREEHFLIPTPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGR
Ga0307469_1014470623300031720Hardwood Forest SoilMTWLVWTRLAATIALASAGCFLVPSAPRPDPNAPVAGPAPYVENCQLCHATPVAAHYAMSLHTTKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQESLISKHFPTRALMALDHDQAARKALRQEGFTAPAPGGGRRFVGDSSSGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPDHFDVPSPDGTNRCLQCHVRVGETVTGQVVNSHRFANPGAEGK
Ga0307469_1017178623300031720Hardwood Forest SoilMKRRSATSWLTWGLLAAPIALTITGCFLSPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0307469_1128076913300031720Hardwood Forest SoilMGPEATVTRRSSIGWLTWALLVAPIALATAGCFLAPGSGPRPDSTGPISGPAPYVEGCQACHEAPVGVHYAQSLHAPKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLTSKHFVTRVQQALDADQAARNTLRREGFTVATAQGRRFVGDSSSGELGGRLCAACHYDEHRLDLAAVQRADFCTGCHARREEHF
Ga0307468_10003042823300031740Hardwood Forest SoilAPRPDPTAPIAGPVPYVENCQMCHASPVAAHYAMSLHTTKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRGEGFTAPAPGGGRRFVGDSSSGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDFPAPEGTNRCLQCHVRVGETVTGQVVNSHRFARPGAEGSGQ
Ga0307468_10004446623300031740Hardwood Forest SoilMWLRISSILKKPDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSVHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAGRAGHFPIPTPELTNRCVQCHVRVGQTVSGQVVNTHRFARPGAEGAGR
Ga0307468_10039700323300031740Hardwood Forest SoilDAMDVRLSEHRPRSPESWIPGLPYLPYPRAVKTSGLHPMKARATLVVVIAVALGGCFLAPGSPKRPESTAPISGPVPYVEDCQTCHGTQVGVSYAQSLHAAKGIRCGQCHTPGGHPDFTLPIRDGKCGGCHLPQYQQTLESKHFASRQLRALDDDRAARATLRGEGFTGATPRGRRFVGDASSGDLGGRLCVACHYDEHRLGLASVRRADFCTGCHTVREEHYPMPAPDLPNRCTQCHVRAGATVTGQVVDTHRFATPGAGGAGR
Ga0307473_1088377313300031820Hardwood Forest SoilMAWLRWALRAVPIVLGSAGCFLVPSAPRPDPNAPVAGPAPYVENCQMCHASPVAAHYAMSLHTTKGIRCGQCHTPDGHPNFAQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAQGFTAPAPGGGRRFVGDPASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETV
Ga0315290_1058285523300031834SedimentMNPTRFVAWLTRKSRSINRRPAVAWAARVLLVTPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQETAASKHFASREQRALRDDQAARATLRREGFTAAATGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGREEHFPIPTPGSTNRCVQCHVRVGQTVSGQVVNTHRFAGP
Ga0315297_1042059323300031873SedimentMNRRPAVAWATWAMLVTPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQENAASKHFASREQRALRDDQAARATLRREGFTAAATGGRRFAGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGREEHFPIPTPGSTNRCVQCHVRVGQTVSGQVVNTHRFARP
Ga0310885_1002750413300031943SoilVSGPAPYVEGCQTCHATPVGAHYAQSAHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFITATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPTPTPELTNRCVECHVRVGQTVSGQVVNTHRFARPGAENAGQ
Ga0315278_1025571223300031997SedimentMAAETTMNRRFAAAWATWAMLVTPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQETAASKHFASREQRALRDDQAARATLRREGFTAAATGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGREEHFPIPTPGSTNRCVQCHVRVGQTVSGQVVNTHRFARP
Ga0310902_1014782123300032012SoilMSGRSLRTWLVWTLLGASIALASAGCFLIPSAPRPDPNAPLAGPAPYVEDCQLCHASPVAAHYAMSLHTTKGIRCGQCHSPDGHPNFTQPVRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRAEGFTAPAPGGGRRFVGDAASGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDVPTPEGTNRCLECHVRVGETVTGQVVNSHRFAKPGAEGSGQ
Ga0310890_1000749043300032075SoilMWLRISSILKKLDTRTALAESARIAPETTVRRRSAICWLMWVALVAPIALASAGCFLAPWSPRRPDPAGAVSGPAPYVEGCQTCHATPVGAHYAQSAHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFATRDQRALDDDRAARTAIRREGFIAATAAGRRFVGDASSGELGGRLCAACHYDEHRLGLGAVQQADFCAGCHAERAGHFPTPTPELTNRCVECHVRVGQTVSGQVVNTHRFARPGAENAGQ
Ga0315283_1136138713300032164SedimentMAAETTMNRRFAAAWATWAMLVTPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEDCQSCHAAPIGAHYAQSLHTAKGIRCGQCHTPGGHPNFTQPVGDGKCGGCHQPQYQETAASKHFASREQRALRDDQAARATLRREGFTAAAAGRRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCAGCHAGREEHFPIPTPGSTNRCVQCHVRVG
Ga0307470_1044131413300032174Hardwood Forest SoilMVWLARTLLAAPLVLATGGCFLTPGSSPRPDPTAPIAGPAPYIENCQACHASPVAAHYAMSLHTAKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQETLTSRHFATRTLLALDDDGAARKTLRGEGFTVAGAGRRHFVGDSSSGVLGGRLCAACHYDEHRLDLVVVQRADFCTSCHADRDAHYPDPTL
Ga0307470_1092272423300032174Hardwood Forest SoilDSTEPISGPAPYVEGCQACHEAPVGVHYAQSLHAPRGIRCGQCHTPGDHPNFTEPIRDGKCGGCHQPQYQETLASKHFATRDRQALDADQSARSALRLQGFTVPAAEGRRFVGDPSSGDLGGRLCAACHYDQHRLGLGAVQRADFCTGCHAKREEHFPDPTPGLTNRCVECHVRVGKTASGQVVNTHRFARP
Ga0310889_1019550513300032179SoilMSGRSLRTWLVWTLLGASIALASAGCFLIPSAPRPDPNAPLAGPAPYVEDCQLCHASPVAAHYAMSLHTTKGIRCGQCHTPGGHPNFTQPIRDGKCGGCHQPQYQESLVSTHFATRALMALDHDPAARKALRGESFTAPAPGGGRRFVGDSSSGELGGRLCAACHYDEHRLDLAVVRRDGFCTSCHGARPGHFDFP
Ga0307471_10001494833300032180Hardwood Forest SoilMAPIALASAGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTVATREGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPGPANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0307471_10028674023300032180Hardwood Forest SoilMGWLTRVLLVGPLAFASAGCFLAPWSPPRPDPAGSVSGPAPYAEGCQTCHATPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQETLGSKHFGGRELRALDDDRAARETLRREGFTAATSGGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCITCHAGREEHFPIATPGLANRCVQCHVRVSETARGQVVNTHRFARPGAEDTGR
Ga0307471_10062040023300032180Hardwood Forest SoilMGRRVFTVVSRRADPGIRTAPETIGHRRSALSWLTCVLFVAPIALASAGCFLAPWSPPRPDPAGPVSGPAPYVEGCETCHATPVGAHYAQSLHTAKGIRCGQCHTPGGHPDFTRPIRDGACGGCHQPQYQETLTSKHFATRDQRALDDDRAARAALRREGFTAGPAGARRFVGDSSSGDLGGRLCVACHYDEHRVSLGAVERADFCTRCHAAREA
Ga0307471_10344196513300032180Hardwood Forest SoilSPPRPDSAGPVSGPALYVEGCQDCHATPVGAHYAQSLHTAKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASKHFVTRDLRALDDDRAARKALRLEGFTAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVQRADFCTGCHAGREEHFPIPTPGLTNRCVQCHVRMGQTVSG
Ga0307472_10022298023300032205Hardwood Forest SoilVNRRSAISWLTGVLLMAPIALASAGCFLAPWSPPRPDPVASVSGPAPYVEGCETCHAAPVAAHYAQSLHAAKGIRCGQCHTPGGHPNFTQPVRDGKCGGCHQPQYQETLGSKHFADRERRALDDDGTARGTLRREGFTAATSGGRRFVGDASSGELGGRLCATCHYDEHRLGLGPVQRADFCTTCHAGREEHFPSPAPANRCAQCHVRVSETARGQVVNTHRFARPGAEDTGK
Ga0335070_1072381913300032829SoilMTWVLLLAPITLATAGCFLVSGSSPRPDSTGPVSGPAPYVEGCQACHAAPVGEHYAQSLHTPKGIRCGQCHTPGDHPNFTQPVRDGKCGGCHQPQYQETLASKHFATRVQQALDADQAARKALRREGFTVATAQGRRFAGDSSSGQLGGRLCAACHYDDHRLSLGAVQRADFCTGCHAGREEHFPGATPGLANRCVECHVRVGKTASGQVVNTHRFARPGEESAGW
Ga0335084_1000855983300033004SoilMHVVTITAKRQATLPAALCREGFSSGPEDGRTPDDMRAGSWTRLESRPRPTVNRRAAMRWLTWVLLVAPIALASAGCFLAPGSPARPDPAGPVSGPAPYVEGCQTCHAAPVGAHYAQSLHAVKGIRCGQCHTPGDHPNFTQPIRDGKCGGCHQPQYQETLASTHFTTRDRRALDDDRAARTALRQEHFTAAATAGGRRFVGDSSSGELGGRLCAACHYDEHRLGLGAVRRADFCTGCHAGREEHFPILTPGLTNRCVQCHVRAGQTASGQVVNTHRFALPGAEGTGR
Ga0326726_1001143213300033433Peat SoilVIAVALGGCFLAPWSPDRPDPAVPVSGPAPYVEDCQGCHAAPVGGAYAQSLHAAKGIRCGQCHTPGGHPNFAQPIRDGKCGGCHLPQYQQTLESKHFASRQLRPLDDNGAERATVRGKGFTVATPEGRRFVGDSSSGDLGGRLCAACHYDEHRLGLASVRRADFCAGCHAGREDHYPVATAGSPNRCVQCHVRAGKTVTGQVVDTHRFARPGDEGAGR
Ga0247829_1023098023300033550SoilMLVAPIVLASAGCFLAPGSHPRPDPAAPISGPAPYIENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGGAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVGQTESGQVVNTHRFARPGAASAER
Ga0247830_1114511613300033551SoilMLVAPIVLASAGCFLAPGSHPRPDPAAPISGPAPYIENCQVCHAAPVGAHYAQSLHTAKGIQCGQCHTPGGHPHFTQPIRDGKCGGCHQPQYQETLTSKHFVTRDLRGLDGDPAAAKALRREGFTVATAQGRRFAGDQAAGDAGGRLCAACHYDEHRLGLGGVQRADFCVGCHTGREEHFAIPTPGLTNRCVQCHVRVG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.