NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F032986



Overview

Basic Information
Family ID F032986
Family Type Metagenome
Number of Sequences 178
Average Sequence Length 253 residues
Representative Sequence MRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Number of Associated Samples 144
Number of Associated Scaffolds 178

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 48.88 %
% of genes near scaffold ends (potentially truncated) 44.38 %
% of genes from short scaffolds (< 2000 bps) 58.99 %
Associated GOLD sequencing projects 127
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.71
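
The percentages above can be turned back into approximate gene counts. The sketch below assumes each percentage was computed over all 178 family members and rounded to two decimals, which the page does not state explicitly; the helper name `pct_to_count` is illustrative only.

```python
FAMILY_SIZE = 178  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a percentage of family members to the nearest whole count."""
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 48.88,
    "near scaffold end": 44.38,
    "short scaffold (< 2000 bp)": 58.99,
}

counts = {name: pct_to_count(pct) for name, pct in quality.items()}
for name, n in counts.items():
    print(f"{name}: ~{n} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly 87 genes have a valid RBS motif, 79 lie near a scaffold end, and 105 come from short scaffolds.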


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(29.775 % of family members)
Environment Ontology (ENVO) Unclassified
(41.011 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.494 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.10%, β-sheet: 19.06%, Coil/Unstructured: 49.83%

Predicted 3D Structure

pTM-score: 0.71

Structural matches with SCOPe domains

SCOP family	SCOP domain	Representative PDB	TM-score
c.66.1.15: Arylamine N-methyltransferase	d2g72a_	2g72	0.77793
c.66.1.15: Arylamine N-methyltransferase	d2a14a1	2a14	0.77631
c.66.1.0: automated matches	d5h02a_	5h02	0.77611
c.66.1.0: automated matches	d2efja_	2efj	0.77232
c.66.1.5: Glycine N-methyltransferase	d3thra_	3thr	0.76855



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 178 Family Scaffolds
PF04909	Amidohydro_2	32.58
PF01361	Tautomerase	12.92
PF04773	FecR	5.06
PF00180	Iso_dh	4.49
PF13489	Methyltransf_23	3.37
PF13649	Methyltransf_25	2.81
PF12847	Methyltransf_18	2.25
PF01547	SBP_bac_1	2.25
PF00378	ECH_1	1.69
PF08242	Methyltransf_12	1.12
PF05494	MlaC	1.12
PF00296	Bac_luciferase	1.12
PF12773	DZR	1.12
PF13432	TPR_16	1.12
PF00072	Response_reg	0.56
PF13191	AAA_16	0.56
PF00211	Guanylate_cyc	0.56
PF00528	BPD_transp_1	0.56
PF16113	ECH_2	0.56
PF01594	AI-2E_transport	0.56
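
Rows of the Pfam table can be split mechanically because a Pfam accession is always `PF` followed by five digits and the short names shown here contain no whitespace. The following is a minimal sketch, not part of NMPFamsDB; the function name `parse_pfam_row` is hypothetical, and the pattern tolerates the ID and name being either fused or whitespace-separated.

```python
import re

# "PF" + five digits, then a whitespace-free name, then the frequency.
# \s* between ID and name lets the pattern accept both "PF04909Amidohydro_2"
# and a tab- or space-separated layout.
ROW = re.compile(r"^(PF\d{5})\s*(\S+)\s+(\d+(?:\.\d+)?)$")


def parse_pfam_row(line: str) -> tuple[str, str, float]:
    """Return (pfam_id, name, percent_frequency) for one table row."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"not a Pfam row: {line!r}")
    return match.group(1), match.group(2), float(match.group(3))


# Two rows taken from the table above, in fused and tab-separated form.
fused = parse_pfam_row("PF04909Amidohydro_2 32.58")
tabbed = parse_pfam_row("PF01594\tAI-2E_transport\t0.56")
print(fused, tabbed)
```

The table's header line does not match the pattern and raises `ValueError`, so callers would skip it before parsing.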

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 178 Family Scaffolds
COG1942	Phenylpyruvate tautomerase PptA, 4-oxalocrotonate tautomerase family	Secondary metabolites biosynthesis, transport and catabolism [Q]	12.92
COG2141	Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)	Coenzyme transport and metabolism [H]	1.12
COG2854	Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter Mla	Cell wall/membrane/envelope biogenesis [M]	1.12
COG0628	Predicted PurR-regulated permease PerM	General function prediction only [R]	0.56
COG2114	Adenylate cyclase, class 3	Signal transduction mechanisms [T]	0.56



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	100.00 %
Unclassified	root	N/A	0.00 %

Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300002908|JGI25382J43887_10069154	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1909
3300005166|Ga0066674_10059173	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1742
3300005167|Ga0066672_10013874	All Organisms → cellular organisms → Bacteria	4004
3300005172|Ga0066683_10063870	All Organisms → cellular organisms → Bacteria	2193
3300005174|Ga0066680_10342391	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	953
3300005178|Ga0066688_10366396	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales → Cellulomonadaceae → Cellulomonas → Cellulomonas flavigena	933
3300005178|Ga0066688_10679154	All Organisms → cellular organisms → Bacteria	657
3300005181|Ga0066678_10053385	All Organisms → cellular organisms → Bacteria	2311
3300005186|Ga0066676_10030462	All Organisms → cellular organisms → Bacteria	2924
3300005295|Ga0065707_10275779	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1060
3300005332|Ga0066388_100633826	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1686
3300005332|Ga0066388_103915719	All Organisms → cellular organisms → Bacteria	759
3300005445|Ga0070708_100076168	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	3029
3300005446|Ga0066686_10085052	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2011
3300005446|Ga0066686_10110833	All Organisms → cellular organisms → Bacteria	1778
3300005446|Ga0066686_10221226	All Organisms → cellular organisms → Bacteria	1272
3300005447|Ga0066689_10056190	All Organisms → cellular organisms → Bacteria	2131
3300005450|Ga0066682_10009259	All Organisms → cellular organisms → Bacteria → Proteobacteria	5092
3300005450|Ga0066682_10097165	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1846
3300005467|Ga0070706_100145810	All Organisms → cellular organisms → Bacteria → Proteobacteria	2210
3300005468|Ga0070707_100048414	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	4072
3300005471|Ga0070698_100093013	All Organisms → cellular organisms → Bacteria	2995
3300005518|Ga0070699_100031508	All Organisms → cellular organisms → Bacteria → Proteobacteria	4578
3300005536|Ga0070697_100060549	All Organisms → cellular organisms → Bacteria	3086
3300005540|Ga0066697_10103955	All Organisms → cellular organisms → Bacteria	1657
3300005552|Ga0066701_10169860	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1323
3300005552|Ga0066701_10197037	All Organisms → cellular organisms → Bacteria	1233
3300005554|Ga0066661_10214335	All Organisms → cellular organisms → Bacteria	1193
3300005558|Ga0066698_10277803	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria	1159
3300005559|Ga0066700_10078737	All Organisms → cellular organisms → Bacteria	2110
3300005561|Ga0066699_10006994	All Organisms → cellular organisms → Bacteria	5215
3300005568|Ga0066703_10065313	All Organisms → cellular organisms → Bacteria	2072
3300005569|Ga0066705_10100927	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales	1716
3300005574|Ga0066694_10154426	All Organisms → cellular organisms → Bacteria	1089
3300005598|Ga0066706_10101825	All Organisms → cellular organisms → Bacteria	2085
3300005598|Ga0066706_10851869	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales → Cellulomonadaceae → Cellulomonas → Cellulomonas flavigena	714
3300005764|Ga0066903_100110876	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3762
3300006034|Ga0066656_10038019	All Organisms → cellular organisms → Bacteria	2718
3300006034|Ga0066656_10094956	All Organisms → cellular organisms → Bacteria	1802
3300006034|Ga0066656_10306551	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1025
3300006034|Ga0066656_10407018	All Organisms → cellular organisms → Bacteria	881
3300006046|Ga0066652_100452905	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia	1181
3300006755|Ga0079222_10553071	All Organisms → cellular organisms → Bacteria	862
3300006796|Ga0066665_10164954	All Organisms → cellular organisms → Bacteria → Proteobacteria	1692
3300006806|Ga0079220_10804084	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycobacterium → unclassified Mycobacterium → Mycobacterium sp. UNC267MFSha1.1M11	712
3300006852|Ga0075433_10024994	All Organisms → cellular organisms → Bacteria	5047
3300006854|Ga0075425_100082538	All Organisms → cellular organisms → Bacteria	3635
3300006880|Ga0075429_100017634	All Organisms → cellular organisms → Bacteria → Proteobacteria	6173
3300006903|Ga0075426_10379953	All Organisms → cellular organisms → Bacteria → Proteobacteria	1041
3300006904|Ga0075424_100161844	All Organisms → cellular organisms → Bacteria	2372
3300006914|Ga0075436_100426171	All Organisms → cellular organisms → Bacteria	964
3300007076|Ga0075435_100034600	All Organisms → cellular organisms → Bacteria → Proteobacteria	4004
3300009012|Ga0066710_100247177	All Organisms → cellular organisms → Bacteria	2578
3300009012|Ga0066710_100470108	All Organisms → cellular organisms → Bacteria	1889
3300009012|Ga0066710_101062858	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1251
3300009012|Ga0066710_101694723	All Organisms → cellular organisms → Bacteria	962
3300009137|Ga0066709_100743809	All Organisms → cellular organisms → Bacteria	1415
3300009147|Ga0114129_10400097	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1810
3300010046|Ga0126384_10010032	All Organisms → cellular organisms → Bacteria	5875
3300010047|Ga0126382_10601219	All Organisms → cellular organisms → Bacteria	905
3300010335|Ga0134063_10201331	All Organisms → cellular organisms → Bacteria	937
3300010336|Ga0134071_10253300	All Organisms → cellular organisms → Bacteria	877
3300010358|Ga0126370_10113680	All Organisms → cellular organisms → Bacteria	1903
3300010359|Ga0126376_10218772	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1594
3300010362|Ga0126377_10144217	All Organisms → cellular organisms → Bacteria	2230
3300010362|Ga0126377_10266189	All Organisms → cellular organisms → Bacteria	1676
3300010366|Ga0126379_10719242	All Organisms → cellular organisms → Bacteria	1093
3300010366|Ga0126379_11011627	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	936
3300010398|Ga0126383_10704743	All Organisms → cellular organisms → Bacteria	1088
3300010401|Ga0134121_10387167	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1260
3300012202|Ga0137363_10292445	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1335
3300012203|Ga0137399_10228818	All Organisms → cellular organisms → Bacteria	1519
3300012203|Ga0137399_10333187	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1258
3300012204|Ga0137374_10028051	All Organisms → cellular organisms → Bacteria	6215
3300012205|Ga0137362_11048597	All Organisms → cellular organisms → Bacteria	693
3300012206|Ga0137380_10291252	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1464
3300012349|Ga0137387_10054018	All Organisms → cellular organisms → Bacteria	2688
3300012349|Ga0137387_10261757	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1248
3300012351|Ga0137386_10607059	All Organisms → cellular organisms → Bacteria	787
3300012360|Ga0137375_10200638	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria	1888
3300012362|Ga0137361_10917158	All Organisms → cellular organisms → Bacteria	794
3300012582|Ga0137358_10027608	All Organisms → cellular organisms → Bacteria	3702
3300012582|Ga0137358_10181187	All Organisms → cellular organisms → Bacteria	1437
3300012685|Ga0137397_10007463	All Organisms → cellular organisms → Bacteria → Proteobacteria	7602
3300012918|Ga0137396_10004288	All Organisms → cellular organisms → Bacteria → Proteobacteria	8155
3300012922|Ga0137394_10480679	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1056
3300012923|Ga0137359_10090093	All Organisms → cellular organisms → Bacteria	2703
3300012925|Ga0137419_10578501	All Organisms → cellular organisms → Bacteria	900
3300012927|Ga0137416_10030387	All Organisms → cellular organisms → Bacteria	3590
3300012929|Ga0137404_10001034	All Organisms → cellular organisms → Bacteria	18110
3300012929|Ga0137404_10138172	All Organisms → cellular organisms → Bacteria	2013
3300012930|Ga0137407_10016513	All Organisms → cellular organisms → Bacteria	5416
3300012930|Ga0137407_10028863	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	4295
3300012977|Ga0134087_10118162	All Organisms → cellular organisms → Bacteria	1126
3300014154|Ga0134075_10061969	All Organisms → cellular organisms → Bacteria	1555
3300015054|Ga0137420_1024235	All Organisms → cellular organisms → Bacteria	1767
3300015359|Ga0134085_10183414	All Organisms → cellular organisms → Bacteria	897
3300015372|Ga0132256_100109524	All Organisms → cellular organisms → Bacteria	2705
3300015372|Ga0132256_100570937	All Organisms → cellular organisms → Bacteria	1245
3300015372|Ga0132256_102019014	All Organisms → cellular organisms → Bacteria	683
3300015374|Ga0132255_103200098	All Organisms → cellular organisms → Bacteria	698
3300016270|Ga0182036_10352256	All Organisms → cellular organisms → Bacteria → Proteobacteria	1135
3300016319|Ga0182033_10167794	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1714
3300016341|Ga0182035_10778765	All Organisms → cellular organisms → Bacteria	838
3300017656|Ga0134112_10048577	All Organisms → cellular organisms → Bacteria	1533
3300017659|Ga0134083_10096399	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1162
3300017997|Ga0184610_1007141	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2701
3300018031|Ga0184634_10012770	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3030
3300018052|Ga0184638_1012022	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2950
3300018053|Ga0184626_10004041	All Organisms → cellular organisms → Bacteria	5496
3300018063|Ga0184637_10013078	All Organisms → cellular organisms → Bacteria	4988
3300018077|Ga0184633_10001918	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	8943
3300018078|Ga0184612_10030870	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2770
3300018431|Ga0066655_10118429	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1517
3300018433|Ga0066667_10024557	All Organisms → cellular organisms → Bacteria	3277
3300018433|Ga0066667_10576002	All Organisms → cellular organisms → Bacteria	937
3300018468|Ga0066662_10199920	All Organisms → cellular organisms → Bacteria	1579
3300020170|Ga0179594_10131080	All Organisms → cellular organisms → Bacteria	919
3300025910|Ga0207684_10054392	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3396
3300025922|Ga0207646_10070449	All Organisms → cellular organisms → Bacteria	3122
3300026088|Ga0207641_11213123	All Organisms → cellular organisms → Bacteria	754
3300026296|Ga0209235_1020125	All Organisms → cellular organisms → Bacteria	3577
3300026298|Ga0209236_1147073	All Organisms → cellular organisms → Bacteria	1001
3300026307|Ga0209469_1032341	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1748
3300026309|Ga0209055_1075166	All Organisms → cellular organisms → Bacteria	1403
3300026309|Ga0209055_1075175	All Organisms → cellular organisms → Bacteria	1403
3300026313|Ga0209761_1112115	All Organisms → cellular organisms → Bacteria	1341
3300026317|Ga0209154_1036643	All Organisms → cellular organisms → Bacteria	2233
3300026324|Ga0209470_1223993	All Organisms → cellular organisms → Bacteria	772
3300026325|Ga0209152_10011122	All Organisms → cellular organisms → Bacteria	3120
3300026328|Ga0209802_1022054	All Organisms → cellular organisms → Bacteria	3476
3300026331|Ga0209267_1033966	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2413
3300026333|Ga0209158_1228674	All Organisms → cellular organisms → Bacteria	640
3300026334|Ga0209377_1016067	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3963
3300026334|Ga0209377_1148995	All Organisms → cellular organisms → Bacteria	889
3300026524|Ga0209690_1024139	All Organisms → cellular organisms → Bacteria	3021
3300026524|Ga0209690_1122340	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1022
3300026530|Ga0209807_1027609	All Organisms → cellular organisms → Bacteria	2742
3300026532|Ga0209160_1065054	All Organisms → cellular organisms → Bacteria	2010
3300026532|Ga0209160_1117793	All Organisms → cellular organisms → Bacteria	1312
3300026536|Ga0209058_1000235	All Organisms → cellular organisms → Bacteria	45240
3300026536|Ga0209058_1009336	All Organisms → cellular organisms → Bacteria	7117
3300026537|Ga0209157_1103771	All Organisms → cellular organisms → Bacteria	1345
3300026540|Ga0209376_1012162	All Organisms → cellular organisms → Bacteria → Proteobacteria	6251
3300026542|Ga0209805_1002962	All Organisms → cellular organisms → Bacteria	9784
3300027187|Ga0209869_1011808	All Organisms → cellular organisms → Bacteria	957
3300027527|Ga0209684_1007578	All Organisms → cellular organisms → Bacteria	1770
3300027577|Ga0209874_1010161	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2774
3300027873|Ga0209814_10059153	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1604
3300027961|Ga0209853_1017184	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2206
3300028536|Ga0137415_10010205	All Organisms → cellular organisms → Bacteria → Proteobacteria	9321
3300028792|Ga0307504_10096746	All Organisms → cellular organisms → Bacteria	933
3300028878|Ga0307278_10028877	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	2544
(restricted) 3300031197|Ga0255310_10018657	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1787
(restricted) 3300031248|Ga0255312_1012542	All Organisms → cellular organisms → Bacteria	2025
3300031544|Ga0318534_10036005	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2727
3300031561|Ga0318528_10069935	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1806
3300031681|Ga0318572_10007943	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	4879
3300031720|Ga0307469_10031299	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3091
3300031720|Ga0307469_10106170	All Organisms → cellular organisms → Bacteria	1988
3300031723|Ga0318493_10219014	All Organisms → cellular organisms → Bacteria	1009
3300031740|Ga0307468_100115806	All Organisms → cellular organisms → Bacteria	1636
3300031744|Ga0306918_10283695	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1272
3300031744|Ga0306918_10587414	All Organisms → cellular organisms → Bacteria	873
3300031747|Ga0318502_10464241	All Organisms → cellular organisms → Bacteria	757
3300031769|Ga0318526_10006641	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3513
3300031779|Ga0318566_10014776	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	3345
3300031793|Ga0318548_10120970	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1265
3300031820|Ga0307473_10082148	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1655
3300031820|Ga0307473_10112721	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1475
3300031860|Ga0318495_10183610	All Organisms → cellular organisms → Bacteria	942
3300032060|Ga0318505_10136908	All Organisms → cellular organisms → Bacteria	1129
3300032091|Ga0318577_10378400	All Organisms → cellular organisms → Bacteria	677
3300032180|Ga0307471_100354078	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1579
3300032180|Ga0307471_100395861	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1507
3300032180|Ga0307471_100803360	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	1108
3300032180|Ga0307471_103135032	All Organisms → cellular organisms → Bacteria	586
3300032261|Ga0306920_100085870	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria	4638
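
Each scaffold identifier has the form `<number>|<scaffold name>`, and the number before the `|` appears to be the Taxon OID of the sample the scaffold came from (the same numbers are listed under Associated Samples). That would explain how 178 scaffolds map to 144 samples. The sketch below groups scaffolds by that prefix; the helper `split_scaffold_id` is hypothetical, and the interpretation of the prefix is an assumption drawn from this page rather than from IMG/M documentation.

```python
from collections import defaultdict


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split 'taxonOID|scaffoldName', ignoring a leading '(restricted)' flag."""
    cleaned = scaffold_id.strip().removeprefix("(restricted)").strip()
    taxon_oid, scaffold_name = cleaned.split("|", 1)
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300005178|Ga0066688_10366396",
    "3300005178|Ga0066688_10679154",
    "3300005181|Ga0066678_10053385",
    "(restricted) 3300031197|Ga0255310_10018657",
]

# Group scaffold names under the sample (Taxon OID) they belong to.
by_sample = defaultdict(list)
for sid in scaffold_ids:
    oid, name = split_scaffold_id(sid)
    by_sample[oid].append(name)

for oid, names in by_sample.items():
    print(oid, len(names), names)
```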

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	29.78%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	14.61%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	7.30%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	6.18%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	5.06%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	5.06%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	5.06%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	4.49%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	3.93%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	3.93%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	3.37%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	2.25%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	1.69%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere	1.69%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	1.12%
Agricultural Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil	1.12%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	1.12%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.56%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere	0.56%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	0.56%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	0.56%
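
The habitat paths are hierarchical, so the distribution can be rolled up to a coarser level by truncating each path and summing the percentages. The sketch below does this for a four-row subset of the table; the function name `aggregate` and the choice of rows are illustrative, and totals for the full family would need all rows.

```python
from collections import defaultdict

SEP = " → "  # separator used in the ecosystem paths on this page

# Subset of (path, percent) rows copied from the habitat table.
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 29.78),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 14.61),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 5.06),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 3.93),
]


def aggregate(rows: list[tuple[str, float]], depth: int) -> dict[str, float]:
    """Sum percentages over paths truncated to the first `depth` levels."""
    totals: dict[str, float] = defaultdict(float)
    for path, pct in rows:
        key = SEP.join(path.split(SEP)[:depth])
        totals[key] += pct
    # Round to two decimals to hide floating-point noise from the sums.
    return {key: round(total, 2) for key, total in totals.items()}


top_level = aggregate(rows, 1)
second_level = aggregate(rows, 2)
print(top_level)
print(second_level)
```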




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300002908	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm	Environmental
3300005166	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123	Environmental
3300005167	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121	Environmental
3300005172	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132	Environmental
3300005174	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129	Environmental
3300005178	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005186	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125	Environmental
3300005295	Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005446	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135	Environmental
3300005447	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138	Environmental
3300005450	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005468	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG	Environmental
3300005471	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005540	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146	Environmental
3300005552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150	Environmental
3300005554	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110	Environmental
3300005558	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147	Environmental
3300005559	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149	Environmental
3300005561	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148	Environmental
3300005568	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152	Environmental
3300005569	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154	Environmental
3300005574	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143	Environmental
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300006034	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105	Environmental
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010366Tropical forest soil microbial communities from Panama - MetaG Plot_24EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300027187Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027527Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027577Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027873Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031544Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f26EnvironmentalOpen in IMG/M
3300031561Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26EnvironmentalOpen in IMG/M
3300031681Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031723Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f23EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031747Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f22EnvironmentalOpen in IMG/M
3300031769Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f24EnvironmentalOpen in IMG/M
3300031779Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f22EnvironmentalOpen in IMG/M
3300031793Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f21EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031860Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f25EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032091Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
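
The sample rows above run their columns together without delimiters. The following is a minimal sketch of how one might split such a row back into fields, assuming each row is a 10-digit taxon ID, an optional ` (restricted)` flag, the sample name, a category label, and the trailing `Open in IMG/M` link text. The names `SampleRow`, `CATEGORIES` and `parse_sample_row` are hypothetical and not part of NMPFamsDB or IMG/M. Only the two category labels seen in these rows are listed, so the tuple may need extending for other pages.

```python
import re
from typing import NamedTuple, Optional

# Assumption: only the category labels that occur in the rows above.
CATEGORIES = ("Environmental", "Host-Associated")


class SampleRow(NamedTuple):
    taxon_id: str      # 10-digit sample taxon ID
    restricted: bool   # True when the row carries "(restricted)"
    name: str          # sample / project description
    category: str      # one of CATEGORIES


# Hypothetical pattern derived from the visible row layout only.
_SAMPLE_ROW = re.compile(
    r"^(?P<taxon_id>\d{10})"
    r"(?P<restricted> \(restricted\))?"
    r"(?P<name>.+?)"
    r"(?P<category>" + "|".join(re.escape(c) for c in CATEGORIES) + r")"
    r"Open in IMG/M$"
)


def parse_sample_row(line: str) -> Optional[SampleRow]:
    """Split one fused sample row; return None if the line does not fit."""
    m = _SAMPLE_ROW.match(line.strip())
    if m is None:
        return None
    return SampleRow(
        taxon_id=m["taxon_id"],
        restricted=m["restricted"] is not None,
        name=m["name"].strip(),
        category=m["category"],
    )


if __name__ == "__main__":
    print(parse_sample_row(
        "3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M"
    ))
```

Lines that do not follow this layout (headings, blank lines) return `None`, so the function can be mapped over the whole listing and the misses filtered out.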

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature of the restricted sequences below requires a license from the corresponding author(s) of the respective datasets.
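
In the sequence listing that follows, the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) are fused into one string per row. The following is a minimal sketch for separating them, under these assumptions drawn only from the visible rows: sample taxon IDs are 10 digits beginning with `3300`, habitat names end in a lowercase letter, and sequences are uppercase residue letters with an optional trailing `*` (conventionally a stop-codon marker, though the page does not say so). `SequenceRow` and `parse_sequence_row` are hypothetical names, and the row in the usage example is made up for illustration.

```python
import re
from typing import NamedTuple, Optional


class SequenceRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str   # residues only, without the trailing "*"
    has_stop: bool  # True when the row ended with "*"


# Hypothetical pattern: the taxon ID is the last 10 digits before the first
# habitat letter; the sequence is the trailing run of uppercase letters.
_SEQ_ROW = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z].*[a-z])"
    r"(?P<sequence>[A-Z]+)(?P<stop>\*)?$"
)


def parse_sequence_row(line: str) -> Optional[SequenceRow]:
    """Split one fused sequence row; return None if the line does not fit."""
    m = _SEQ_ROW.match(line.strip())
    if m is None:
        return None
    return SequenceRow(
        protein_id=m["protein_id"],
        taxon_id=m["taxon_id"],
        habitat=m["habitat"],
        sequence=m["sequence"],
        has_stop=m["stop"] is not None,
    )


if __name__ == "__main__":
    # Made-up row in the same layout as the listing (not a real entry).
    print(parse_sequence_row(
        "Ga0000001_1000123300000001Tropical Forest SoilMKLVAG*"
    ))
```

Rows whose sequence lacks the trailing `*` still parse, with `has_stop` set to `False`.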

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J43887_1006915423300002908Grasslands SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFDDLARLTRSSTLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGLSVRPSRAPADTRVRGSRRARPAPRRRAPSPEGDSPGARRATRPRSRV*
Ga0066674_1005917323300005166SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLVVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS*
Ga0066672_1001387443300005167SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066683_1006387033300005172SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAGALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0066680_1034239123300005174SoilETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066688_1036639613300005178SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066688_1067915413300005178SoilGISADEITSYLTPASFDLWESLARYRREFQIGARWVEIGGGMGDLASAALDRGYDVLMTDVQDELLATAATRHPKLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLYHARDRKELARGFGHLVALGEPGSLLVIDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFNAEFHERMLRLVLRMPTAAARARP
Ga0066678_1005338523300005181SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066676_1003046223300005186SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPPRRVPSPASDPPGTRRAARRRSRA*
Ga0065707_1027577923300005295Switchgrass RhizosphereMREDRRRREIKDYYLSISADEITSYLTPASFDLWDSLARYRREFRLGTRWLEVGGGMGDLAVAALDHGYDVLMTDVQDELLATAASRHPRLRGRLQRADIFDARDVQALAAQGPFSIVAGLGAVLNHARDREELARGFGHLVSLGEPDSLLVVDLMLSEMFPGHPASIWADIQHTLPGFADLARLTGAERLHVLEAHSLFHRYSPTAAFNADFDERMLRLFLRKTRTARRAAAGAANAGAAVRGSRRAGPGSRRRGPG
Ga0066388_10063382623300005332Tropical Forest SoilVSEAQRRREIKDYYLGISADEITSYLTPASFDLWDSVARYRRELDIADRWLEVGGGVGDLAAAALERGYDVVMTDVQAELLETAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHAELARGFDHLVTLSQSGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRAETRRERA*
Ga0066388_10391571913300005332Tropical Forest SoilRRREIKDYYLGISADEITSYLTPASFDLWDSVARYRRELGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELVEAAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHDELARGFDHLVTLAQPGSLLIVDLMLSEMFSGHPASIWADILHVLPSFADVAFLTRAWRLQVLETHSLYHRYPPTAAYDAEFDERMLRLFLHRAETACPRA*
Ga0070708_10007616823300005445Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRAIRRRLCA*
Ga0066686_1008505223300005446SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPAAVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDSPGARRAIRPRSRA*
Ga0066686_1011083313300005446SoilRGGSIASPSMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0066686_1022122623300005446SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRRELQIGARSLEIGGGMGDLAAAALDRGYDVLMTDVQDELLTTAAARHPRLRGRLQRADIFDPRDVQALAAHGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLGIDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFNAEFHERMLRLVLRMPTAAARSRPGAPVRGSRRARPGSPRRAPSPSSDSTGTRRATRRRLRA*
Ga0066689_1005619023300005447SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVRALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066682_1000925943300005450SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEQKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0066682_1009716513300005450SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLVVVDLLVREMFAGHPASVWADFLHVLPGFGDLARLTRAFGLHVLEAHSLYHRYPPTAACDREFDERMLRLFFHKRSP
Ga0070706_10014581023300005467Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGALVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0070707_10004841423300005468Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRAIRRRLRA*
Ga0070698_10009301323300005471Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA*
Ga0070699_10003150843300005518Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTERARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRAIRRRLRA*
Ga0070697_10006054933300005536Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTERARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRAIRRRLRA*
Ga0066697_1010395513300005540SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066701_1016986023300005552SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFDDLARLTRSSTLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDSPGARRAIRPRSRA*
Ga0066701_1019703723300005552SoilPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGALVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066661_1021433513300005554SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066698_1027780323300005558SoilSRRLSRRPPPLMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLVVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS*
Ga0066700_1007873713300005559SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066699_1000699433300005561SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066703_1006531333300005568SoilRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066705_1010092723300005569SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRESRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066694_1015442613300005574SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAGALERGYDVLMTDVEEKLLETAARRDPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0066706_1010182513300005598SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARP
Ga0066706_1085186913300005598SoilSIASPSMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0066903_10011087643300005764Tropical Forest SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRTLGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELARGFDHLVTLAQPGSLLIVDLMLSEMFPGHPVSIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRAETRCPRA*
Ga0066656_1003801953300006034SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDAQDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGQPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPPRRVPSPASDPPGTRRAARRRSRA*
Ga0066656_1009495623300006034SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFWLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0066656_1030655123300006034SoilSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPAAVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDSPGARRAIRPRSRA*
Ga0066656_1040701813300006034SoilRDDSRRLSRRPPPLMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLVVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS*
Ga0066652_10045290523300006046SoilCFGRTRGGSIASPSMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG*
Ga0079222_1055307113300006755Agricultural SoilMRDARRRQEIKTYYLGISADEITSYFTPATFDLWASLGRYRSEHGIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQPELLDSAGARHPRLRGRLDRSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPNALLIVDVMLREMFPGHPATIWADILHVLPGFDDLARLVRGSGLQVLETH
Ga0066665_1016495423300006796SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLENGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0079220_1080408413300006806Agricultural SoilFDLWASLGRYRSEHGIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQPELLDSAAARHPRLRGRLDRSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPSALLIVDVMLREMFPGHPATIWADIFHVLPGFDDLARLVRASRLQVLEAHSLYHRYPPTPAFDAEFDERMLRLVLHQTPSARGAAEATIRSSRRARPDSRRRAPSPSRDSAGARPATRRR
Ga0075433_1002499463300006852Populus RhizosphereMQEARRRREIKDYYLAISADEITSHLTPASFDLWASLARYRQEFRLGARWLELGGGMGDLAATAVDRGWDVLMTDVQEELLATAAVRHPTLRARLQRADVFDARDVRALAAQGPFAVVAALGAVLNHARDRKELARGFAHLVALTEPGALLVVDLMLSEMFPEHPASIWADMLHVMPSFAELAPLLGSARLHLLEAHSLYHRYPPTAAFDADFDERMLRLFLRKPAAPAAAGPDANPSRLRKKVQMRGDARRLRAKRTPGR*
Ga0075425_10008253823300006854Populus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFQLGVRWLEIGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPTLRGRLQRADIFDSRDVKMLAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLFLRKSPPAARAGAGALVRGSRRARPGSRRRVPSPSSDSTGTRRATRRRLRA*
Ga0075429_10001763443300006880Populus RhizosphereMQEARRRSEIKDYYLAISADEITSHLTPASFDLWASLARYRQEFRLGARWLELGGGMGDLAATAVDRGWDVLMTDVQEELLATAAVRHPTLRARLQRADVFDARDVRALAAQGPFAVVAALGAVLNHARDRKELARGFAHLVALTEPGALLVVDLMLSEMFPGHPASIWADMLHVMPSFAELAPLLGSARLHLLEAHSLYHRYPPTAAFDADFDERMLRLFLRKPAAPAAAGPDANPSRLRKKVQMRGDARRLRAKRTPGR*
Ga0075426_1037995323300006903Populus RhizosphereMRDARRRQEIKAYYLGISADEITSYFTPAMFDLWESLGRYRSEHGIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQSELLDSAAARHPRLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPSALLIVDVMLREMFPGHPATIWADIFHVLPGFDDLARLVRASRLQVLEAHSLYHRYPPTPAFDAEFDERMLRLVLHQTPS
Ga0075424_10016184413300006904Populus RhizosphereMRDARRRQEIKAYYLGISADEITSYFTPAMFDLWESLGRYRSEHGIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQSELLDSAAARHPRLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPSALLIVDVMLREMFAGHPATIWADIFHVLPGFDDLARLVRVSRLQVLEAHSLYHRYPPTPAFDAEFDERMLRLVLHQTPSGRRDSETTIRSSRRARPDSRRRAPSPSRDSAGARPA
Ga0075436_10042617123300006914Populus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFQLGVRWLEIGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPTLRGRLQRADIFDSRDVKMLAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLSGFADLARLIGASRLHVLEMHSLYHRYPPTAVFNADFDER
Ga0075435_10003460013300007076Populus RhizosphereMRDARRRQEIKAYYLGISADEITSYFTPAMFDLWESLGRYRSEHGIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQSELLDSAAARHPRLRGRLERSDVFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPSALLIVDVMLREMFAGHPATIWADIFHVLPGFDDLARLVRVSRLQVLEAHSLYHRYPPTPAFDAEFDERMLRLVLHQTPSGRRDSETTIRSSRRARPDSRRRAPSPSRDSAGARPAT
Ga0066710_10024717743300009012Grasslands SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDAQDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAARSLVRESRRARPVPRRRVPSPASDPPGTRRAARRRSRA
Ga0066710_10047010823300009012Grasslands SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLLVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS
Ga0066710_10106285813300009012Grasslands SoilRSRLMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPPRRVPSPASDPPGTRRAARRRSRA
Ga0066710_10169472323300009012Grasslands SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVRALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0066709_10074380923300009137Grasslands SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVRALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0114129_1040009723300009147Populus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLKTGGGMGDLAAAALDHGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVKMLAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLTRSWQLDVLETHSLYHRYPPTPTYNAEFDERMLRLFLHKREAGRPAPVKASRRARPGSRRPGPSPSRDPGGARRATRRRPRA*
Ga0126384_1001003233300010046Tropical Forest SoilVSEAQRRREIKDYYLGISADEITSYLTPASFDLWDSVARYRRELDIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLETAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELVRGFDHLVTLAQPGSLLIVDLMLSEMFSGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRAETSRPHA*
Ga0126382_1060121913300010047Tropical Forest SoilLTPASFDLWESVARYRRELGIGDRWLEVGGGVGDLAAAALERGYDVVMTDVQGELLETAAARHPRLRGRLERSDVFDPRDIAALAARGPYSIVAALGAVLNHARDHGELARGFDHLVTLAQPGSLLIVDLMLSEMFSGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTAAYEAEFDERMLRLFLHRAETSRPRA*
Ga0134063_1020133113300010335Grasslands SoilAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA*
Ga0134071_1025330013300010336Grasslands SoilYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA*
Ga0126370_1011368023300010358Tropical Forest SoilVSEAQRRREIKDYYLGISADEITSYLTPASFDLWDSVARYRRELDIADRWLEVGGGVGDLAAAALERGYDVVMTDVQAELLETAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHAELARGFDHLVTLSQSGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHPAETRRERA*
Ga0126376_1021877233300010359Tropical Forest SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRTLGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHAELARGFDHLVTLSQSGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYEAEFDERMLRLFLHRAETRRERA*
Ga0126377_1014421713300010362Tropical Forest SoilMRDARRRQEIKAYYLGISADEITSYFTPATFDLWESLGRYRSEHGIGARWLEVGGGIGDLAAAALDRGYDVLMTDVQPELLDTAAARHPRLRGRLERSDIFDPRDVAALASRGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPNALLTVDVMLREMFPGHPPTIWADIFHVLPGFDDLARLVRASRLQVLETHSLYHRYPPTPGFDAEFDERMLRLVLRRPPSDRGAAETTIRSSRRARPDSRRRAPSPSRDSA
Ga0126377_1026618913300010362Tropical Forest SoilEIKAYYLGISADEITSYFTPATFDLWESLGRYRSEHGIGARWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLETAAARHPRLRGRLERSDVFEPRDVAALAARGPYSIVAALGAVLNHARDHGELARGFDHLVTLAQPGSLLIVDLMLSEMFSGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTAAYEAEFDERMLRLFLHRAETSRPRA*
Ga0126379_1071924213300010366Tropical Forest SoilRRREIKDYYLGISADEITSYLTPASFDLWDSVARYRRELDIGDRWLEVGGGVGDLAAAALERGYDVLMTDVQAELLETAAARHPRLRGSLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHAELARGFDHLVALTQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLLLHRAETRRERA*
Ga0126379_1101162723300010366Tropical Forest SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELGIGDRWLEVGGGVGDLAAAALDRGYDVLMTDVEGELLETATARHPRLRGRLERSDVFDPQDVAALAARGPYSIVAALGAVLNHARDHGELARGFDHLVTLAQPGSLLIVDLMLSEMFSGHPASIWADILHVLPSFADLARLTRAWGLHVLETHSLYHRYPPTPACE
Ga0126383_1070474323300010398Tropical Forest SoilVNEVQRRREIKDYYLGICADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVPALAARGPYSIVAALGAVLNHARDHGELARGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWGLHVLETHSLYHRYPPTPVYDAEFDERMLRLFLRRTQTSPRA*
Ga0134121_1038716733300010401Terrestrial SoilMRDARRRQEIKAYYLGISADEITSYFTPATFDLWESLGRYRSEHGIGARWVEVGGGMGDLAAAALNRGYDVLMTDVQPELLDSAAARHPRLRGRLERSDIFDSRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPNALLIVDVMLREMFPGHPATIWADIFHVLPGFDDLARLVRGSGLQVLEAHSLYHRY
Ga0137363_1029244523300012202Vadose Zone SoilMRDAERRREIKNYYLSISPDEITSYLTPATFDLWESLARYRGEFQIGARWLEIGGGMGDLAAVALERGYDVLMTDVQDELLATATTRHPTLRGRLLRADIFDPRDVRALAARGPFSIVAALGAVLNHARDRKDLARGFAHLVALAEPGSLLVVDLMLSEMFPGHPAAIWADILHVLPGFADLARLTGGSPLHVLEAHSLYHRYPPTPTLNAEFDERMLRLFLRKVPPAPRPRASVPVRGSRRARPGSRRRAPNPANGSAGTRRAIRRRFRA*
Ga0137399_1022881823300012203Vadose Zone SoilMRDARRRREIKDYYLGISADEITSYLTPAAFDLWESLAGYRREFQIGARWLEIGGGMGDLAAAALDRGYDVLVTDVQDELLATAATRHPELRGRLRRADVFDPRDVQALAAKGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLVVDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFDAEFDERMLRLVLRMPPTAARARPGAPVRGSRRARPGSPRRAQSPSSDSAGTRRAIRRRFRA*
Ga0137399_1033318713300012203Vadose Zone SoilRDYYLGSSADEITSYLTPASFDLWESVARYRRELGLGATWLEVGGGLGDLAAAALDRGYDVLMTDVQAELLETAAARHPRLRDRLQCSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLGQPGSLLIVDLMLSEMFPAHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFLHKADRRAPARGSRRTRAALPGRRPTAIIAATNQPP*
Ga0137374_1002805143300012204Vadose Zone SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESQNARPAPRRRVPSPASDPPGTRRAARRRSRA*
Ga0137362_1104859713300012205Vadose Zone SoilDRRAGRGQGAVKDAQRRREIRDYYLGSSADEITSYLTPASFDLWESVARYRRELGLGATWLEVGGGLGDLAAAALDRGYDVLMTDVQAELLETAAARHPRLRDRLQCSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLGQPGSLLIVDLMLSEMFPAHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLF
Ga0137380_1029125213300012206Vadose Zone SoilMRDARRRREIRDYYLGISPDEITSHLTPTSFDLWDSVARYRRELQLGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRSPARPPRGAARSPVRESRRARPAPRRRAPSPESDSFGARRAIRPRSRA*
Ga0137387_1005401823300012349Vadose Zone SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLLVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS*
Ga0137387_1026175713300012349Vadose Zone SoilMRDARRRREIRDYYLGISPDEITSHLTPTSFDLWDSVARYRRELQLGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVVRSLVRESRRARPAPPRRVPSPASDPPGTRRAARRRSRA*
Ga0137386_1060705913300012351Vadose Zone SoilVLTSDRQRRREIKDYYLGISADEITSYLTPASFDLWQSVARYRRELGIGARWLEIGGGLGDLAATAFDHGYDVLMTDVQPELLESAAARHPRLAGRLERSDIFDARDVAALGARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPAAVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDS
Ga0137375_1020063813300012360Vadose Zone SoilDEPPSRVMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFAARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESQNARPAPRRRVPSPASDPPGTRRAARRRSRA*
Ga0137361_1091715813300012362Vadose Zone SoilMQERQRRREIQDYYLGISADEITSHLTPASFDLWESVARYRREFQLGRRWLEVGGGMGDLAAEALGRGYDVLMTDVEAGLLETAAHRHPKLQPRLQQADLFHAGDVAAVAARGPFSIVTAVGAVLNHARDHRALARGFDHLVALGDTSSLLVVDVMLSEMFPGHPPSVWADFRHVLPGLDLLARLIRSSGLLVLEAHSLYHRYPPTPTFDQEFDERMLRVFFHKSPRA
Ga0137358_1002760833300012582Vadose Zone SoilVKDAQRRREIRDYYLGSSVDEITSHLTPASFDLWESVARYRRELGLGATWLEIGGGLGDLAAAALDRGYDVLMTDVQAELLETAAARHPRLRDRLQCSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLRQPGSLLIVDLMLSEMFPAHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFLHKADRRAPARGSRRTRAALPGRRPTAIIAATNQPP*
Ga0137358_1018118713300012582Vadose Zone SoilKDYYLGVDADEITSFLTPAACDLWESLAGYRREFQIGARWLEIGGGMGDLAAVALERGYDVLMTDVQDELLATATTRHPTLRGRLLRADIFDPRDVRALAARGPFSIVAALGAVLNHARDRKDLARGFAHLVALAEPGSLLVVDLMLSEMFPGHPAAIWADILHVLPGFADLARLTGGSPLHVLEAHSLYHRYPPTPTFNAEFDERMLRLFLRKVPPAPRPRASVPVRGSRRARPGSRRRAPNPANGSAGTRRAIRRRFRA*
Ga0137397_1000746363300012685Vadose Zone SoilMRDAERRREIKDYYLSISADEITSHLTPASFDLWESVARYRRELEIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRARLQRADIFDARDVAALGARGPFSIVAALGAVLNHARDRNELARGFGHLVALGEPGALLVVDLMLSEMFPGHPVSVWADILHVLPGFADLAQLTSSSGLHVLEAHSLHHRYPPTPAFDAEFDERMLRLFFHKGNSARPRRAQPAAPVKGARPDIRRRPRG*
Ga0137396_1000428833300012918Vadose Zone SoilMRDAQRRREIKDYYLGISADEITSYLTPAAFDLWESLAGYRREFQIGARWLEIGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPELRGRLRRADVFDPRDVQALAAKGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLVVDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFDAEFDERMLRLVLRMPPTAARARPGAPVRGSRRARPGSPRRAQSPSSDSAGTRRAIRRRFRA*
Ga0137394_1048067913300012922Vadose Zone SoilMRERRRRREIQDYYLGISADEISSYLTPAAFDLWESVARYRRELRLGARWLEVGGGMGDLAAVAQERGYDVLMTDVEEKLLDTAARRHPKLRTRLQRADLFDAGDVAAVAARGPFAIVTAVGAVLNHARDQRALARGFDHLVALADASSLLVVDLMLSEMFPGHPSSVWADFLHVLPGLDQLARLIRSSGLQILEAHSLYHRYPPTPTFDQEFDERMLRVFFHKSPRSRG*
Ga0137359_1009009323300012923Vadose Zone SoilMRDAQRRREIKNYYLSISPDEITSYLTPATFDLWESLARYRGEFQIGARWLEIGGGMGDLAAVALERGYDVLMTDVQDELLATATTRHPTLRGRLLRADIFDPRDVRALAARGPFSIVAALGAVLNHARDRKDLARGFAHLVALAEPGSLLVVDLMLSEMFPGHPAAIWADILHVLPGFADLARLTGGSPLHVLEAHSLYHRYPPTPTFNAEFDERMLRLFLRKVPPAPRPRASVPVRGSRRARPGSRRRAPNPANGSAGTRRAIRRRFRA*
Ga0137419_1057850113300012925Vadose Zone SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELAVGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAVRRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFRHLVALGEPRSLLVVDLLVREMFAGHPATVWADFLHVLPGLDQLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRWHYSPRRWVPSALLEQMRADCRAVVFSNLQTERVV
Ga0137416_1003038723300012927Vadose Zone SoilMRDARRRREIKDYYLGISADEITSYLTPAAFDLWESLAGYRREFQIGARWLEIGGGMGDLAAAALARGYDVLMTDVQDELLATAATRHPELRGRLRRADVFDPRDVQALAAKGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLVVDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFDAEFDERMLRLVLRMPPTAARARPGAPVRGSRRARPGSPRRAQSPSSDSAGTRRAIRRRFRA*
Ga0137404_10001034143300012929Vadose Zone SoilMRDAERRREIKDYYLSISADEITSHLTPASFDLWESVARYRRELEIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRARLQRADIFDARDVAALGARGPFSIVAALGAVLNHARDRNELARGFGHLVALGEPGALLVVDLMLSEMFPGHPVSVWADILHVLPGFADLAQLTSSSGLHVLEAHSLHHRYPPTPAFDAEFDERMLRLFFHKGNSARPRRAQPAAPVKGARPDIRRRLRG*
Ga0137404_1013817213300012929Vadose Zone SoilLSISPDEITSYLTPATFDLWESLARYRGEFQIGARWLEIGGGMGDLAAVALERGYDVLMTDVQDELLATATTRHPTLRGRLLRADIFDPRDVRALAARGPFSIVAALGAVLNHARDREDLARGFAHLVALAEPGSLLVVDLMLSEMFPGHPAAIWADILHVLPGFADLARLTRGSPLHVLEAHSLYHRYPPTPTFNAEFDERMLRLFLRKVPPAPRPRASVPVRGSRRARPGSRRRAPNPANGSAGTRRAIRRRFRA*
Ga0137407_1001651333300012930Vadose Zone SoilMRDAERRREIKDYYLSISADEITSHLTPASFDLWESVARYRRELEIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRARLQRADIFDARDVAALGARGPFSIVAALGAVLNHARDRNELARGFGHLVALGEPGALLVVDLMLSEMFPGHPVSVWADILHVLPGFADLAQLTGSSGLHVLEAHSLHHRYPPTPAFDAEFDERMLRLFFHKGNSARPRRAQPAAPVKGARPDIRRRPRG*
Ga0137407_1002886343300012930Vadose Zone SoilMRDAQRRREIKNYYLSISPDEITSYLTPATFDLWESLARYRREFQIGARWLEIGGGMGDLAAVALERGYDVLMTDVQDELLATATTRHPTLRGRLLRADIFDPRDVRALAARGPFSIVAALGAVLNHARDRKDLARGFAHLVALAEPGSLLVVDLMLSEMFPGHPAAIWADILHVLPGFADLARLTGGSPLHVLEAHSLYHRYPPTPTFNAEFDERMLRLFLRKVPPAPRPRASVPVRGSRRARPGSRRRAPNPANGSAGTRRAIRRRFRA*
Ga0134087_1011816213300012977Grasslands SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVRALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSPRRAPSPSSDSTGTRRAIRRRFRA*
Ga0134075_1006196933300014154Grasslands SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGQPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAARSPVRESRRARSAPRRRVPSPASDPPGTRRAARRRSRA*
Ga0137420_102423523300015054Vadose Zone SoilMRDARRRREIKDYYLGISADEITSYLTPAAFDLWESLAGYRREFQIGARWLEIGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPELRGRLRRADVFDPRDVQALAAKGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLVVDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFDAEFDERMLRLVLRMPPTAARARPGAPVRGSRRARPGSPRRAQSPSSDSAGTRRAIRRRFRA*
Ga0134085_1018341413300015359Grasslands SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRAFGLHVLEAHSLYHRYPPTAACDREFDERMLRVFFHKRSPVRQSRRARPALRRPAPSPAGDPRGTRRAARRRARA*
Ga0132256_10010952443300015372Arabidopsis RhizosphereMRDARRRQEIKAYYLAISADEITSYFTPATFDLWESLCRYRSEHGIGARWLEVGGGVGDLAAAALDRGYDVLMTDVQAELLDSAAARHPLLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDRRQLARGFGHLVALTAPSALLVVDVMLREMFPRHPATIWADIFHVLPGFDDLARLVRASRLQVLEAHSLYHRYPPTPAFDAKFDERMLRLVLRRAPSDRGAPETTIRSSRRARPDSRRRAPNPSRDSAGAPPATRRRRRA*
Ga0132256_10057093723300015372Arabidopsis RhizosphereMRDARRRQEIKAYYLGISADEITSSFTPATFDLWESLGRYRSEHRIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQPKLLGSAAARHPRLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRQLARGFGHLVALAAPSALLVVDVMLREMFPGHPATIWADIFHVLPGFDDLARLVRASRLQVLEAHSLYHRYPPTPAFGAEFDERMLRLVLHRATSDHDAPETRIRSSRRARPDSRRRAQNPSRDSAGAP
Ga0132256_10201901413300015372Arabidopsis RhizosphereALERGYDVLMTDVQPELLATAATRHPALSGRLQRADVFDARDAQALAAQGPFAIVAALGAVLNHARDRKELARGFAHLVALTAPGALLVVDLMLSEMFPGHPESIWADMLHVMPSFAELAPLLGDARLHLLEAHSLYHRYPPTAAFDAPFDERMLRLFLRTPPAPARAGPAASARRLPQKVQMQGDARGSRRARPGSQRPAPGPAPDSAGARRASRRRRRA*
Ga0132255_10320009813300015374Arabidopsis RhizosphereMRDARRRQEIKAYYLAISADEITSYFTPATFDLWESLCRYRSEHGIGARWLEVGGGVGDLAAAALDRGYDVLMTDVQAELLDSAAARHPLLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDRRQLARGFGHLVALTAPSALLVVDVMLREMFPRHPATIWADIFHVLPGFDDLARLVRASRLQVLEAHSLYHRYPPTPAFDAEFDERMLR
Ga0182036_1035225623300016270SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGSDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA
Ga0182033_1016779413300016319SoilVEDALGQRAPGLPDHVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRTQTSPRA
Ga0182035_1077876523300016341SoilYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA
Ga0134112_1004857723300017656Grasslands SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDAQDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAARSLVRESRRARPAPRRRVPSPASDPPGTRRAARRRSRA
Ga0134083_1009639923300017659Grasslands SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELAVGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGTRGPFSIVAALGAVLNHARDGKQLAGGFRHLVELGEPRSLLVVDLLVREMFAGHPATVWADFLHVLPGLDQLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPPGPRRRARAGKSSRA
Ga0184610_100714123300017997Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRAFGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPGPQRRVPSPASDPPGTRRAARRRSRA
Ga0184634_1001277033300018031Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELSHGFDHLVVLGAPSSLLVVDLMLSEMFPGYPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSMVRESRRARPAPQRRVPSPASDPPGTRRAARRRSRA
Ga0184638_101202253300018052Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHILPGFGDLARLTRAAGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPTRGVARSLVRESRRARPAPQRRVPSPASDPPGTRRAARRRSRA
Ga0184626_1000404153300018053Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGYPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPGPQRRVPSPASDPPGTRRAARRRSRA
Ga0184637_1001307853300018063Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGYPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTATFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPGPQRRVPSPASDPPGTRRAARRRSRA
Ga0184633_1000191853300018077Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGYPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTATFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPGPRRRVPSPASDPPGTRRAARRRSRA
Ga0184612_1003087033300018078Groundwater SedimentMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGYPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPQRRVPSPASDPPGTRRAARRRSRA
Ga0066655_1011842923300018431Grasslands SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAGALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPSRG
Ga0066667_1002455733300018433Grasslands SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAGALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG
Ga0066667_1057600213300018433Grasslands SoilMREAERRREIKDYYLSISADEITSHLTPASFDLWESVARCRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRAFGLHVLEAHSLYHRYPPTAACDREFDERMLRLFFHKRSPARPPRGAARSPVRESRRARPAPRRRAPTPDGDRPGTRRA
Ga0066662_1019992023300018468Grasslands SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0179594_1013108023300020170Vadose Zone SoilMRDAERRREIKDYYLSISADEITSHLTPASFDLWESVARYRRELEIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRARLQRADIFDARDVAALGARGPFSIVAALGAVLNHARDRNELARGFGHLVALGEPGALLVVDLMLSEMFPGHPVSVWADILHVLPGFADLAQLTSSSGLHVLEAHSLHHRYPPTPAFDAEFDERMLRLFFHKGNSARPRRAPPAAPVK
Ga0207684_1005439233300025910Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTERARAGAPVR
Ga0207646_1007044923300025922Corn, Switchgrass And Miscanthus RhizosphereMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRAIRRRLRA
Ga0207641_1121312323300026088Switchgrass RhizosphereMRDARRRQEIKAYYLGISADEITSYFTPATFDLWESLGRYRSEHGIGARWVEVGGGMGDLAAAALNRGYDVLMTDVQPELLDSAAARHPRLRGRLERSDIFDRRDVAALAARGPFSIVAALGAVLNHARDHRELARGFGHLVALAAPNALLIVDVMLREMFPGHPATIWADIFHVLPGFDDLARLVRGSGLQVLEAHSL
Ga0209235_102012543300026296Grasslands SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPAAVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDSPGARRAIRPRSRA
Ga0209236_114707313300026298Grasslands SoilWRAAIATPRASRATRVGPRPRRSSPGPFWTRCDESPSRVMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDAHDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLIVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAARSPVRESRRARSAPRRRVPSPASDPPGTRRAARRRSRA
Ga0209469_103234123300026307SoilMREADRRREITTYYLGISPDEISSHVAASFDLWESLARYRRELTIGARWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFHHLVELGEPRSLVVVDLLVREMFAGHPASVWADFLHVLPGLDDLARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLRVVLHKSPAGPRRRARGEKSS
Ga0209055_107516613300026309SoilLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA
Ga0209055_107517513300026309SoilLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209761_111211523300026313Grasslands SoilAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPESDSPGARRAIRPRSRA
Ga0209154_103664323300026317SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209470_122399313300026324SoilGRTRGGSIASPSMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG
Ga0209152_1001112223300026325SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209802_102205423300026328SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA
Ga0209267_103396623300026331SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209158_122867413300026333SoilLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNAD
Ga0209377_101606723300026334SoilMRDAERRREIKDYYLGISADEITSYLSPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGALVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209377_114899513300026334SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGTRWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPAAVWADFLHVLPGFDDLARLTRSSALHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGHSVQPPRAPADTRVRGSRRARPAPRRRAPSPES
Ga0209690_102413923300026524SoilVLDALRGARAREGLGLMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAARGAGLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAAFNADFDERMLRLVLRKPPPTARARAGALVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209690_112234023300026524SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESVARYRRELTIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLETAATRHPRLRTRLQRVDIFDARDVEALAARGPFSIVAALGAVLNHARDRQELARGFGHLVALGEPGSLLIVDLMLSEMFPGHPASVWADFLHVLPGFDDLARLTRSSTLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFFHQGLSVRP
Ga0209807_102760953300026530SoilRTRGGSIASPSMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRLLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWR
Ga0209160_106505433300026532SoilSFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209160_111779323300026532SoilSFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA
Ga0209058_100023583300026536SoilMRESRRRREIQDYYLGISADEITSHLTPASFDLWESVARYRRELHLGARWLEVGGGMGDLAAVALERGYDVLMTDVEEKLLETAARRHPKLQTRRLRADLFDAGDVAAVAARGPFSIVTAVGAVLNHARDQRALARGFDHLVALGDASSLLVVDLMLSEMFPGHPPSVWADFLHVLPGLDLLARLIRSSGLLVLEAHSLHHRYPPTPTFDQEFDERMLRVFFHKFPPWRG
Ga0209058_100933623300026536SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPPRRVPSPASDPPGTRRAARRRSRA
Ga0209157_110377113300026537SoilMRDARRRREIRDYYLGISPDEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDAQDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAASSLVRESRRARPAPRRRVPSPASDPPGTRRAARR
Ga0209376_101216233300026540SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLAVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVRALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTAVFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPGSRRRAPSPSSDSTGTRRATRRRLRA
Ga0209805_100296283300026542SoilMRDAERRREIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLGVRWLETGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPRLRGRLQRADIFDSRDVQALASHRPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLARLIGASRLHVLETHSLYHRYPPTTAFNADFDERMLRLVLRKPPPAARARAGAPVRGSRRARPASRRRAPSPSSDSTGTRRATRRRLRA
Ga0209869_101180813300027187Groundwater SandAPARRVMRDTERRQEISDYYLGITADEITSHLTPASFDLWDSVARYRHELGLGARWLEIGGGVGDLAAAALDRGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDAAALAAQGPFSIVAAVGAVLNHARDQSELARGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGLADLALLTRAAGLQVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPRRRVPSPASDPPGTRRAARRLSRA
Ga0209684_100757833300027527Tropical Forest SoilVSEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHRELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFADVARLTRAWRLHVLEMHSLYHRYPPTPAYDAEFDERMLRLFLHRTQTSPRA
Ga0209874_101016113300027577Groundwater SandMRDAERRREIRNYYLGISADEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDAAALAAQGPFSIVAAVGAVLNHARDQSELARGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPRRRVPSPASDPPGTRRAARRRSRA
Ga0209814_1005915323300027873Populus RhizosphereMQEARRRREIKDYYLAISADEITSHLTPASFDLWASLARYRQEFRLGARWLELGGGMGDLAATAVDRGWDVLMTDVQEELLATAAVRHPTLRARLQRADVFDARDVRALAAQGPFAVVAALGAVLNHARDRKELARGFAHLVALTEPGALLVVDLMLSEMFPEHPASIWADMLHVMPSFAELAPLLGSARLHLLEAHSLYHRYPPTAAFDADFDERMLRLFLRKPAAPAAAGPDANPSRLRKKVQMRGDARRLRAKRTPGR
Ga0209853_101718423300027961Groundwater SandMRDAERRREIRNYYLGISADEITSHLTPASFDLWDSVARYRRELQIGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAAQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGVARSLVRESRRARPAPRRRVPSPASDPPGTRRAARRRSRA
Ga0137415_1001020573300028536Vadose Zone SoilMRDAQRRREIKDYYLGISADEITSYLTPAAFDLWESLADYRREFQIGARWLEIGGGMGDLAAAALDRGYDVLMTDVQDELLATAATRHPELRGRLRRADVFDPRDVQALAAKGPFSIVAALGAVLNHARDRKELARGFGHLVALGEPGSLLVVDLMLSEMFPGHPASIWADILHVLPSFADLARLTGASRLQVLESHSLYHRYPPTAAFDAEFDERMLRLVLRMPPTAARARPGAPVRGSRRARPGSPRRAQSPSSDSAGTRRAIRRRFRA
Ga0307504_1009674623300028792SoilMRDAERRREIKNYYLSISADEITSYLTPASFDLWESVARYRRELKIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLATAVTRHPRLRGRLQRADIFDSREVQALAARGPFSIVAALGAVLNHARDRKELARGFGHLLELGEPGSLLIVDLMLSEMFPGHPASVWADMLHVLPGFADLARLARPSRLHVLEAHSLYHRYPPTPTFDAEFDERMLRLFLHKGQPSRRARAIAPV
Ga0307278_1002887733300028878SoilMRDARRRREIRDYYLGISPDEITSHLTPTSFDLWDSVARYRRELQLGARWLEIGGGMGDLAAAALDHGYDVLMTDVQAELLDTAAARHPRLRTRLRRADIFDARDVAALAVQGPFSIVAAVGAVLNHARDQSELAHGFDHLVALGEPSSLLVVDLMLSEMFPGHPASVWADFLHVLPGFGDLARLTRASGLHVLEAHSLYHRYPPTAAFDREFDERMLRLFFHKRIPARPPRGAARSPVRESRRARPAPRRRVPSPASDPPGTRRAARH
(restricted) Ga0255310_1001865723300031197Sandy SoilMRDVERRREIKNYYLSISADEITSYLTPASFDLWESVARYRRELKIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLATAVTRHPRLRGRLQRADIFDSRHVQALAARGPFSIVAALGAVLNHARDRKELARGFGHLLELGEPGSLLIVDLMLSEMFPGHPASVWADMLHVLPGFADLARLARPSRLHVLEAHSLYHRYPPTPTFDAEFDERMLRLFLHKGQPSRRARAIAPVRGSRRARPGSRRRAPSPSSDPAGARRAIRRRPRA
(restricted) Ga0255312_101254223300031248Sandy SoilMRDVERRREIKNYYLSISADEITSYLTPASFDLWESVARYRRELKIGARWLEVGGGMGDLAAAALDRGYDVLMTDVQDELLATAVTRHPRLRGRLQRADIFDSRDVQALAARGPFSIVAALGAVLNHARDRKELARGFGHLLELGEPGSLLIVDLMLSEMFPGHPASVWADMLHVLPGFADLARLARPSRLHVLEAHSLYHRYPPTPTFDAEFDERMLRLFLHKGQPSRRARAIAPVRGSRRARPGSRRRAPSPSSDPAGARRAIRRRPRA
Ga0318534_1003600513300031544SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLH
Ga0318528_1006993523300031561SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRTQTSPRA
Ga0318572_1000794333300031681SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA
Ga0307469_1003129933300031720Hardwood Forest SoilVKDAQRRREIRDYYLGLSADEITSYLTPASFDLWESVARYRRELGLGATWLEVGGGVGDLAAAALDRGYDVLMTDVQAELLETAAARHPRLRDRLQRSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLMQPGSLLIVDLMLSEMFPSHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFLHKTDRRAPARGSRRARAALPGRRPTAIIAATNQPS
Ga0307469_1010617023300031720Hardwood Forest SoilMREADRRREIATYYLGISPDEISSHVAASFDLWESLARYRRELAIGARWLELGGGMGDLAADAIDHGYDVLMTDVQDALLETAARRHPRLSGRVQRADVFDARAVAALGARGPFSIVAALGAVLNHARDGRALARGFRHLVALGETRSLLIVDLLVREMFAGHPARVWADFLHVLPGLDELARLIRASRLQLLEAYSLFHRYPPTPAYDRVFDERMLRVVLLKSPAGAARTVRSSRA
Ga0318493_1021901413300031723SoilMRDTQRRREIKNYYLGISADEITSYLTPASFDLWDSVARYRHELGIGARWLEIGGGLGDLAVTALDRGYDVLMTDVQRELLDGAAARHPRLSGRLQRSDIFDARDVAALRAHGPFSIVAALGAVLNHARDHRELVRGFGHLVALAAPGSLVLVDVMLREMFPGHPATIWADMLHVLPGFDDLARLTRAHELHVLEAHSLYHRYPPTPAFAAEFDERMLRLILRRAPSTSGASAVRSSRGARPVAPRRAPSPSRD
Ga0307468_10011580623300031740Hardwood Forest SoilMRDARRRREIRDYYLGISADEITSYLTPASFDLWESVARYRRELGIGARWLEVGGGVGDLAATALERGYDVMMTDVQSELLEAAAARHPRLRGRLQRSDVFDARDVAALAARGPFSIVAALGAVLNHARDQRELALGFGHLVTLAAPGSLVIVDLMLSEMFPGHPASIWADILHVLPGFADLARLTRSWELDVLETHSLYHRYPPTPAYNAEFDERMLRLFLHRRETGRPAPVRASRRARPGSRRPGPSPSRDPGGARRATRRRPRA
Ga0306918_1028369513300031744SoilNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA
Ga0306918_1058741423300031744SoilMRDTQRRREIKNYYLGISADEITSYLTPASFDLWDSVARYRHELGIGARWLEIGGGLGDLAVTALDRGYDVLMTDVQRELLDGAAARHPRLSGRLQRSDIFDARDVAALRAHGPFSIVAALGAVLNHARDHRELVRGFGHLVALAAPGSLVLVDVMLREMFPGHPATIWADMLHVLPGFDDLARLTRAHELHVLEAHSLYHRYPPTPAFAAEFDERMLRLILRRAPSTSGASAVRSSRGA
Ga0318502_1046424113300031747SoilGCAALSYATEAARLMRDTQRRREIKNYYLGISADEITSYLTPASFDLWDSVARYRHELGIGARWLEIGGGLGDLAVTALDRGYDVLMTDVQRELLDGAAARHPRLSGRLQRSDIFDARDVAALRAHGPFSIVAALGAVLNHARDHRELVRGFGHLVALAAPGSLVLVDVMLREMFPGHPATIWADMLHVLPGFDDLARLTRAHELHVLEAHSLYHRYPPTPAFAAEFDERMLRLILRRAPSTSGASAVRSSR
Ga0318526_1000664133300031769SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLDRSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA
Ga0318566_1001477633300031779SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLDRSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRTQTSPRA
Ga0318548_1012097013300031793SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETH
Ga0307473_1008214833300031820Hardwood Forest SoilDYYLGLSADEITSYLTPASFDLWESVARYRRELGLGATWLEVGGGVGDLAAAALDRGYDVLMTDVQAELLETAAARHPRLRDRLQRSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLMQPGSLLIVDLMLSEMFPSHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFLHKTDRRAPARGSRRARAALPGRRPTAIIAATNQPS
Ga0307473_1011272123300031820Hardwood Forest SoilMRDAGRRREIRDYYLGISADEITSYLTPASFDLWESVARYRRELGIGARWLEVGGGVGDLAATALERGYDVMMTDVQSELLEAAAARHPRLRGRLQRSDVFDARDVAALAARGPFSIVAALGAVLNHARDQRELALGFGHLVTLAAPGSLVIVDLMLSEMFPGHPASIWADILHVLPGFADLARLTQSWELDVLETHSLYHRYPPTPTYNAEFDERMLRLFLHKPEAGRPAPVRASRRARPGSRRPGPSPSRDPGGARRATRRRPRA
Ga0318495_1018361023300031860SoilMRDTQRRREIKNYYLGISADEITSYLTPASFDLWDSVARYRHELGIGARWLEIGGGLGDLAVTALDRGYDVLMTDVQRELLDGAAARHPRLGGRLQRSDIFDARDVAALRAHGPFSIVAALGAVLNHARDHRELVRGFGHLVALAAPGSLVLVDVMLREMFPGHPATIWADMLHVLPGFDDLARLTRAHELHVLEAHSLYHRYPPTPAFAAEFDERMLRLILRRASSTSGASAVRSSRGARPVAPRRAPSPSR
Ga0318505_1013690813300032060SoilMRDTQRRREIKNYYLGISADEITSYLTPASFDLWDSVARYRHELGIGARWLEIGGGLGDLAVTALDRGYDVLMTDVQRELLDGAAARHPRLSGRLQRSDIFDARDVAALRAHGPFSIVAALGAVLNHARDHRELVRGFGHLVALAAPGSLVLVDVMLREMFPGHPATIWADMLHVLPGFDDLARLTRAHELHVLEAHSLYHRYPPTPAFAAEFDERMLRLILRRAPSTSGASAVRSSR
Ga0318577_1037840013300032091SoilSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGDLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLDRSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPAYDAEFDERMLRLFLHRTQTSPRA
Ga0307471_10035407833300032180Hardwood Forest SoilDYYLGIAADEIASYLTPASFDLWESVARYRRELELGATWLEVGGGVGGLAAAALDRGYDVLMTDVQAELLETAATRHPRLRDRLQRSDVFDARDVTALAARGPFSIVAALGAVLNHARDHRELALGFGHLVTLMQPGSLLIVDLMLSEMFPSHPSAVWADMLHVLPGFGDLARLTRAWGLHVLEAHSLYHRYPPTPAFDAEFDERMLRLFLHKTDRRAPARGSRRARAALPGRRPTAIIAATNQPS
Ga0307471_10039586113300032180Hardwood Forest SoilMRETDRRREITTYYLGISPDEISSHVAASFDLWETLARYRRELTIGPRWLELGGGMGDLAAAALDHGYDVLMTDVQDELLETAARRHPRLRARLQRADVFDARAVAAVGARGPFSIVAALGAVLNHARDGKQLARGFRHRVELGEPRSLLVVDLLVREMFAGHPASVWADFLHVLPGLDELARLIRSSGLQLLEAYSLFHRYPPTPAYDRAFDERMLR
Ga0307471_10080336023300032180Hardwood Forest SoilMQERQRRREIQDYYRSISADEITSYFTPVSFDLWESLARYRRELGIGARWLEVGGGLGDLAAAALDHGYDVLMTDVQAELLDAAATRHPRLRPRLGRADIFDARDVAALAARGPFSIVAGLGAVLNHARDPRELARGFKHLVSLGETSSLLVVDLMLSEMFPGHPASVWADFRHLLPSLSALSGLIRSSGLHLIEAHSLHHVYPPTAAFDQEFDERMLRLVLLKSPLSGSPRARRSTV
Ga0307471_10313503213300032180Hardwood Forest SoilIKDYYLGISADEITSYLTPASFDLWESLARYRREFRLDVRWLETGGGMGDLAAAALDRGYDVVMTDVQDELLATAATRHPRLRGRLQRADIFDPRDVQALAAQGPFSIVAALGAVLNHARDRKELARGFGHLVALAEPGSLLIVDLMLSEMFPGHPASIWADILHVLPGFADLSRLIGASRLHVLETHSLYHRYP
Ga0306920_10008587043300032261SoilVNEAQRRREIKDYYLGISADEITSYLTPASFDLWESVARYRRALGIGDRWLEVGGGVGGLAAAALDRGYDVVMTDVQGELLQTAAARHPRLRGRLERSDVFDPRDVAALAARGPYSIVAALGAVLNHARDHGELSQGFDHLVTLAQPGSLLIVDLMLSEMFPGHPASIWADILHVLPSFEDVARLTRAWGLHVLETHSLYHRYPPTPASDAEFDERMLRLFLHRTQTSPRA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice