NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F069025


Overview

Basic Information
Family ID F069025
Family Type Metagenome
Number of Sequences 124
Average Sequence Length 203 residues
Representative Sequence MDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Number of Associated Samples 100
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 58.87 %
% of genes near scaffold ends (potentially truncated) 43.55 %
% of genes from short scaffolds (< 2000 bps) 71.77 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.82
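
The percentages above can be read back as approximate gene counts. The sketch below assumes each percentage is taken over all 124 family members; the page does not state the denominator, so treat that as an assumption. The function name `implied_count` and the dictionary labels are illustrative only.

```python
# Assumption: every percentage is a fraction of the 124 family sequences.
FAMILY_SIZE = 124  # "Number of Sequences" reported under Basic Information

reported_percent = {
    "valid RBS motif": 58.87,
    "near scaffold end": 43.55,
    "short scaffold (< 2000 bps)": 71.77,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of genes matching a reported percentage."""
    return round(percent * total / 100)


counts = {label: implied_count(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, the short-scaffold figure suggests that most members come from fragmented assemblies, which is worth keeping in mind when reading the truncation percentage.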


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil (14.516 % of family members)
Environment Ontology (ENVO) Unclassified (33.065 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (32.258 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 23.14%, β-sheet: 30.13%, Coil/Unstructured: 46.72%

Predicted 3D Structure

pTM-score: 0.82

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
c.51.6.2: XCC0632-like | d2iqia1 | 2iqi | 0.73146
c.51.6.1: NLBH-like | d2i9ia1 | 2i9i | 0.62639
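
As a rough aid to reading these matches, the sketch below ranks the two hits and filters them by TM-score. The 0.5 cutoff is the commonly used rule of thumb for "probably the same fold"; it is an assumption here, not a threshold taken from NMPFamsDB. The split of each row into family, domain, and PDB ID is also assumed from the column headers, and `ScopeMatch` and `likely_same_fold` are illustrative names.

```python
from typing import NamedTuple


class ScopeMatch(NamedTuple):
    scop_family: str
    scop_domain: str
    pdb_id: str
    tm_score: float


# Values copied from the table above; the field boundaries are an assumed reading.
matches = [
    ScopeMatch("c.51.6.2: XCC0632-like", "d2iqia1", "2iqi", 0.73146),
    ScopeMatch("c.51.6.1: NLBH-like", "d2i9ia1", "2i9i", 0.62639),
]

# Assumed rule-of-thumb cutoff, not an NMPFamsDB setting.
SAME_FOLD_THRESHOLD = 0.5


def likely_same_fold(hits, threshold=SAME_FOLD_THRESHOLD):
    """Keep hits whose TM-score exceeds the chosen threshold."""
    return [m for m in hits if m.tm_score > threshold]


best = max(matches, key=lambda m: m.tm_score)

for m in likely_same_fold(matches):
    print(f"{m.pdb_id} ({m.scop_family}): TM-score {m.tm_score:.3f}")
print(f"Best match: {best.scop_domain}")
```

Both hits belong to the same SCOP superfamily prefix (c.51.6), so a stricter threshold changes which single domain is reported, not which fold class is suggested.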



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF01042 | Ribonuc_L-PSP | 12.10
PF01619 | Pro_dh | 9.68
PF03923 | Lipoprotein_16 | 8.87
PF13432 | TPR_16 | 5.65
PF02627 | CMD | 3.23
PF02596 | DUF169 | 3.23
PF13189 | Cytidylate_kin2 | 1.61
PF03404 | Mo-co_dimer | 1.61
PF07885 | Ion_trans_2 | 0.81
PF08240 | ADH_N | 0.81
PF00768 | Peptidase_S11 | 0.81
PF11716 | MDMPI_N | 0.81
PF13649 | Methyltransf_25 | 0.81
PF09720 | Unstab_antitox | 0.81
PF04909 | Amidohydro_2 | 0.81
PF00528 | BPD_transp_1 | 0.81
PF15780 | ASH | 0.81
PF00486 | Trans_reg_C | 0.81
PF01243 | Putative_PNPOx | 0.81
PF00571 | CBS | 0.81
PF04075 | F420H2_quin_red | 0.81
PF01078 | Mg_chelatase | 0.81
PF00072 | Response_reg | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 12.10
COG0506 | Proline dehydrogenase | Amino acid transport and metabolism [E] | 9.68
COG3056 | Uncharacterized lipoprotein YajG | Function unknown [S] | 8.87
COG0599 | Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase family | General function prediction only [R] | 3.23
COG2043 | Uncharacterized conserved protein, DUF169 family | Function unknown [S] | 3.23
COG2128 | Alkylhydroperoxidase family enzyme, contains CxxC motif | Inorganic ion transport and metabolism [P] | 3.23
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000364|INPhiseqgaiiFebDRAFT_101664616 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 864
3300000550|F24TB_10283996 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2518
3300000550|F24TB_10566892 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1456
3300000559|F14TC_100543441 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1305
3300000559|F14TC_101336152 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1020
3300001431|F14TB_100846300 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1271
3300002121|C687J26615_10097111 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 738
3300002122|C687J26623_10101082 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 789
3300002124|C687J26631_10029071 | All Organisms → cellular organisms → Bacteria | 1962
3300005093|Ga0062594_100577806 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 977
3300005180|Ga0066685_10213002 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1327
3300005186|Ga0066676_10695398 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 692
3300005294|Ga0065705_10012638 | All Organisms → cellular organisms → Bacteria | 2426
3300005336|Ga0070680_101616903 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 561
3300005343|Ga0070687_100475204 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 836
3300005345|Ga0070692_10363753 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 902
3300005353|Ga0070669_100450005 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1061
3300005440|Ga0070705_101762536 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 524
3300005444|Ga0070694_101091214 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 665
3300005713|Ga0066905_100197243 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1501
3300005713|Ga0066905_100999411 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 738
3300005764|Ga0066903_100041878 | All Organisms → cellular organisms → Bacteria | 5417
3300005764|Ga0066903_100729513 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1754
3300005829|Ga0074479_10772260 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7156
3300005843|Ga0068860_100516360 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1194
3300005875|Ga0075293_1022474 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 806
3300005878|Ga0075297_1001804 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1578
3300005880|Ga0075298_1024687 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 585
3300006034|Ga0066656_10249033 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1141
3300006049|Ga0075417_10083338 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1430
3300006049|Ga0075417_10178873 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 997
3300006196|Ga0075422_10112820 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1054
3300006844|Ga0075428_100034104 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 5615
3300006865|Ga0073934_10000104 | All Organisms → cellular organisms → Bacteria | 247563
3300006903|Ga0075426_10102524 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2054
3300006904|Ga0075424_102061810 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 601
3300006969|Ga0075419_10154812 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1501
3300007255|Ga0099791_10003778 | All Organisms → cellular organisms → Bacteria | 6188
3300009012|Ga0066710_100001165 | All Organisms → cellular organisms → Bacteria | 18893
3300009100|Ga0075418_10029377 | All Organisms → cellular organisms → Bacteria | 5990
3300009156|Ga0111538_10043075 | All Organisms → cellular organisms → Bacteria | 5809
3300009162|Ga0075423_10174470 | All Organisms → cellular organisms → Bacteria | 2257
3300010047|Ga0126382_10127756 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1700
3300010304|Ga0134088_10528584 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 583
3300010359|Ga0126376_12641795 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 551
3300010362|Ga0126377_11300390 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 799
3300010362|Ga0126377_11981502 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 658
3300010403|Ga0134123_10005634 | All Organisms → cellular organisms → Bacteria | 8267
3300010863|Ga0124850_1010209 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2981
3300012203|Ga0137399_10009481 | All Organisms → cellular organisms → Bacteria | 5749
3300012355|Ga0137369_10200954 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1540
3300012685|Ga0137397_10189534 | All Organisms → cellular organisms → Bacteria | 1528
3300012922|Ga0137394_10314832 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1337
3300012922|Ga0137394_10458671 | All Organisms → cellular organisms → Bacteria | 1084
3300012922|Ga0137394_11026653 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 685
3300012922|Ga0137394_11039950 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 680
3300012929|Ga0137404_10160664 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1877
3300012931|Ga0153915_10998813 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 975
3300012944|Ga0137410_10188758 | All Organisms → cellular organisms → Bacteria | 1590
3300012948|Ga0126375_10011559 | All Organisms → cellular organisms → Bacteria | 3755
3300012948|Ga0126375_10556493 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 867
3300015053|Ga0137405_1043494 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 903
3300015245|Ga0137409_10000695 | All Organisms → cellular organisms → Bacteria | 36855
3300015264|Ga0137403_10000141 | All Organisms → cellular organisms → Bacteria | 86368
3300015371|Ga0132258_10064803 | All Organisms → cellular organisms → Bacteria | 8440
3300015373|Ga0132257_100059905 | All Organisms → cellular organisms → Bacteria | 4270
3300017930|Ga0187825_10143930 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 840
3300017936|Ga0187821_10082630 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1172
3300018058|Ga0187766_10772881 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 669
3300018059|Ga0184615_10045038 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2453
3300018468|Ga0066662_12077785 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 595
3300025159|Ga0209619_10399375 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 706
3300025160|Ga0209109_10017610 | All Organisms → cellular organisms → Bacteria | 3846
3300025164|Ga0209521_10152960 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1424
3300025165|Ga0209108_10042692 | All Organisms → cellular organisms → Bacteria | 2535
3300025167|Ga0209642_10623867 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 584
3300025289|Ga0209002_10063183 | All Organisms → cellular organisms → Bacteria | 2527
3300025310|Ga0209172_10000009 | All Organisms → cellular organisms → Bacteria | 684519
3300025312|Ga0209321_10088559 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1732
3300025313|Ga0209431_10383243 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1088
3300025314|Ga0209323_10412767 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 811
3300025318|Ga0209519_10270019 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 991
3300025319|Ga0209520_10016744 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4461
3300025322|Ga0209641_10333188 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1109
3300025324|Ga0209640_10228305 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1575
3300025325|Ga0209341_10405242 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1104
3300025917|Ga0207660_11124570 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 640
3300025923|Ga0207681_10422649 | All Organisms → cellular organisms → Bacteria | 1080
3300025930|Ga0207701_10207901 | All Organisms → cellular organisms → Bacteria | 1718
3300025999|Ga0208417_107875 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 551
3300026329|Ga0209375_1047809 | All Organisms → cellular organisms → Bacteria | 2155
3300027646|Ga0209466_1108253 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 564
3300027655|Ga0209388_1001311 | All Organisms → cellular organisms → Bacteria | 5819
3300027907|Ga0207428_10219436 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1426
3300027909|Ga0209382_10518472 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1310
3300028381|Ga0268264_10413158 | All Organisms → cellular organisms → Bacteria | 1300
3300028587|Ga0247828_10889952 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 572
3300031720|Ga0307469_10000210 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 22610
3300031720|Ga0307469_10020556 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 3576
3300031720|Ga0307469_10115181 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1929
3300031720|Ga0307469_10158485 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1707
3300031740|Ga0307468_100501604 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 962
3300031740|Ga0307468_100659516 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 866
3300031740|Ga0307468_101507531 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 623
3300031820|Ga0307473_10191948 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1205
3300031820|Ga0307473_10199557 | All Organisms → cellular organisms → Bacteria | 1186
3300031820|Ga0307473_10290643 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1023
3300031949|Ga0214473_10260491 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1991
3300031949|Ga0214473_10993954 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 886
3300031997|Ga0315278_10075042 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3358
3300032174|Ga0307470_10023885 | All Organisms → cellular organisms → Bacteria | 2806
3300032174|Ga0307470_10118977 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1555
3300032180|Ga0307471_100721043 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1162
3300032180|Ga0307471_100722106 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1162
3300032180|Ga0307471_101771252 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 770
3300032205|Ga0307472_100003923 | All Organisms → cellular organisms → Bacteria | 6590
3300032205|Ga0307472_100078254 | All Organisms → cellular organisms → Bacteria | 2187
3300032205|Ga0307472_102584362 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 517
3300032782|Ga0335082_10021861 | All Organisms → cellular organisms → Bacteria | 6874
3300033004|Ga0335084_11185835 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 763
3300033417|Ga0214471_10173749 | All Organisms → cellular organisms → Bacteria | 1773
3300033550|Ga0247829_10972732 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 705
3300033550|Ga0247829_11328015 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 595
3300033551|Ga0247830_11128401 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Nitrosomonadaceae → Nitrosospira → Nitrosospira briensis | 626
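
Each scaffold identifier above has the form `<number>|<scaffold name>`, and the number before the pipe also appears as a Taxon OID in the Associated Samples table. The sketch below splits an identifier on that pipe and counts scaffolds per sample; reading the prefix as the sample's Taxon OID is inferred from that match, not stated on the page. `split_scaffold_id` is an illustrative helper name, and the three identifiers are copied from the table.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first pipe character."""
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    return taxon_oid, scaffold_name


# A few identifiers copied from the Associated Scaffolds table.
example_ids = [
    "3300000550|F24TB_10283996",
    "3300000550|F24TB_10566892",
    "3300000559|F14TC_100543441",
]

# Count how many family scaffolds each sample contributes.
per_sample = Counter(split_scaffold_id(s)[0] for s in example_ids)

for taxon_oid, n in per_sample.items():
    print(f"{taxon_oid}: {n} scaffold(s)")
```

Grouping by the prefix in this way would explain why the page reports 124 scaffolds but only 100 associated samples: several samples contribute more than one scaffold.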




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 14.52%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.29%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 9.68%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.26%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 6.45%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.84%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.84%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.03%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 4.03%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.23%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 2.42%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.42%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.61%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 1.61%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.61%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.61%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.61%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.81%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.81%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.81%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.81%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.81%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.81%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere | 0.81%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.81%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental
3300001431 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemly | Environmental
3300002121 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 | Environmental
3300002122 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2 | Environmental
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005829 | Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.190_CBC | Environmental
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300005875 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 | Environmental
3300005878 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 | Environmental
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006196 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 | Host-Associated
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300010863 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (PacBio error correction) | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017936Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1EnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300025159Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 3EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025164Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025167Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025289Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2EnvironmentalOpen in IMG/M
3300025310Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025312Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 4 - CSP-I_5_4EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025314Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 2EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025319Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1EnvironmentalOpen in IMG/M
3300025322Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300025999Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300027646Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031997Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10166461623300000364SoilMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIXDALAAELXASGFTVLSAQEGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRXGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEXVLELMKXYPELGX*
F24TB_1028399623300000550SoilMHRVWLPCVLLALMLSGCATTDVKLKLPPTGLPTSIPGGNQRQIILTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELRKSGFTVLSAEEGARDTAVTIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRDSIVWTDRCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPAPVRVAGWREAGR*
F24TB_1056689223300000550SoilMRRVWIPGVVLALMLSGCATTDVNLKQPPAGLKTPIPGGNQRQVIVAIPFANERQIKNRCGMQKNGYGTETATAYCVSDPAQWIAAMLAAELKASGFTVLTTAEGSRDSAIKIDGVLLKIFAEPVVGAWSTLIESDLSVKLVATSRTGLRAERTFFVKGDVENVIWTQGIFNDSLERGTRDLLGKVVEAILELMKQYPQLGFARR*
F14TC_10054344123300000559SoilMHRVWLPCVLLALMLSGCATTDVKLKLPPTGLPTSIPGGNQRQIILTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELRKSGFTVLSAEEGARDTAVTIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGARESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
F14TC_10133615223300000559SoilMRRVWIPGVVLALMLSGCATTDVNLKQPPAGLKTPIPGGNQRQVIVAIPFANERQIKNRCGMQKNGYGNETATAYCVSDPAQWIAAMLAAELKASGFTVLTTAEGSRDSALKIDGVLLKIFAEPVVGAWSTLIESDLSVKLVATSRTGLRAERTFFVKGDVENVIWTQGIFNDSLERGTRDLLGKMVEAILELMKQYPQLGFAGR*
F14TB_10084630033300001431SoilMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRPVPTLCGVQKSGYGDETATVYCEDDPARWISDALAAELSASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
C687J26615_1009711113300002121SoilLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQIXNRCGMQKGGYGNETANALCQXEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFSHRHPAVVAWRPEQGR*
C687J26623_1010108213300002122SoilMVTLGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGF
C687J26631_1002907143300002124SoilMVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPA
Ga0062594_10057780613300005093SoilILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVDCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSWSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0066685_1021300223300005180SoilMRRVWLPCVLLGLMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTIPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTMIETDLSVRLVATTRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFTRPLPLRLAGWRETGR*
Ga0066676_1069539813300005186SoilLMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTIPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTMIETDLSVRLVATTRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLWLAGRREAG*
Ga0065705_1001263843300005294Switchgrass RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRPVPTLCGVQKSGYGDETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0070680_10161690313300005336Corn RhizospherePWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAV
Ga0070687_10047520433300005343Switchgrass RhizosphereSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0070692_1036375323300005345Corn, Switchgrass And Miscanthus RhizosphereAVKPPPARRPGGRALRASSSPVYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0070669_10045000513300005353Switchgrass RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWNDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0070705_10176253613300005440Corn, Switchgrass And Miscanthus RhizospherePIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCF
Ga0070694_10109121413300005444Corn, Switchgrass And Miscanthus RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVW
Ga0066905_10019724323300005713Tropical Forest SoilMRRMWIPGAILGLLLSGCATTDVNLKQPPAGLETPIPGGNQRQVIVTIPFTDERQIKNRCGLQKGGYGNETATAYCMGDPAQWIAAMLAAELRASGFTVLTTPEGSRDSALKIDGVLLKIFAEPVVGAWSTLIESDLSVRLVATSRTGLGAERTFFVKGNVENVIWTQGIFNDSVERGTRVLLRKMVEAILELMKQYPQLGFAGR*
Ga0066905_10099941123300005713Tropical Forest SoilMAIQRRVVLLAVLLFAMSGCALVDVKIKAPESGLEAPIPGGKQRQIVVVIPFKDDRANKTKCGVQKGGYGNETASAICEGNPAEWIASFLARELTASGFTVLRSEDGARDSALRVEGILLQIFAEPVVGFWSTTVESDFNVKLLATSKTGLQAERTFFSKGELTNVIWPQGIFNDSVRNGTRDLL
Ga0066903_10004187823300005764Tropical Forest SoilMHRVWLHSVLLALMLSGCATTDVKLKLPPTGLPTAIPGGNQRQIVLRIPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELKKSGFTVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTGKCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPSQLRLAEWREAGR*
Ga0066903_10072951333300005764Tropical Forest SoilMDRPWLPCVLLALMLSGCATTDVKLTLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPAHWITDALAAELRASGFTVLSAPEGARDTAVTIEGSLLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0074479_1077226083300005829Sediment (Intertidal)MTRKRWLWIALLVLVLPGCALVDVNIKAPDSGLKTPIPGGNQRQVIVIVPFADGRPKPDSCGIQKGGYGNETARGICQGNPATWLAEFLARELKTSGFTVLSAEDGKESALKIEGTLLKLFVEPVVGFWTTTIESDLQVKLVATSRIGLRAQRTFFVKGELTSIIWPQGMFNDSLEDGVHRLLRNMVQAILELMKQYPQLGFGRDGGRTRPGGQVGT*
Ga0068860_10051636013300005843Switchgrass RhizosphereRPSLVGGRHQIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0075293_102247423300005875Rice Paddy SoilKPPDSGLKKPIPGGNQRQVIVTIPFADARQITNRCGIQKGGFGNETAIAICQDDPARWIAAFLAAELKASGFTVLPAEDGTRDSALKVEGVLLKIFAEPVVGFWTTTVETDLNVKLVATSRTGLRAERTFFVKGELTSVIWPQGIFNDSLEDGTRQLLAKMVEAILELMKQYPELGFSHHHPAVVAWRPEQGR*
Ga0075297_100180433300005878Rice Paddy SoilMVTRGAGLGLAILLVTLSGCALADVHLKPPDSGLKKPIPGGNQRQVIVTIPFADARQITNRCGIQKGGFGNETAIAICQDDPARWIAAFLAAELKASGFTVLPAEDGTRDSALKVEGVLLKIFAEPVVGFWTTTVETDLNVKLVATSRTGLRAERTFFVKGELTSVIWPQGIFNDSLEDGTRQLLAKMVEAILELMKQYPELGFSHHHPAVVAWRPEQGR*
Ga0075298_102468713300005880Rice Paddy SoilLLVLALPGCALTDVNIKPPDSGLKMPIPGGNQRQVIVTIPFADARQITNRCGIQKGGFGNETAIAICQDDPARWIAAFLAAELKASGFTVLPAEDGTRDSALKVEGVLLKIFAEPVVGFWTTTVETDLNVKLVATSRTGLRAERTFFVKGELTSVIWPQGIFNDSLEDGTRQLLAKMVEAILELMKQYPELGFN
Ga0066656_1024903333300006034SoilMATRGGWVWLAAVLSMLSGCALTDVNLKPPSSGLKAPIPGGNQRQIVVTAPFTDSREIKSRCGVQKGGYGNETAVATCQGEPAQWLADLLASELRASGFTVLTAETGARESALKLDGVLLKIFAEPVVGFWSTNVETDMNVRLVATSKTGLKAERTFFVKGELQSIIWTQGIFNDSVENGTRDLLKKMVEAILDLMNQYPQLGFKR*
Ga0075417_1008333813300006049Populus RhizosphereMHRVWLPSVLLALMLSGCATTDVKLKLPPTGLPTAIPGGSQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAGWIAHVLAAELKKSGFTVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDTCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPTQLRLAEWREAGR*
Ga0075417_1017887323300006049Populus RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0075422_1011282023300006196Populus RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGKQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAQEGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFNESLARGSQDLLAKMVEAVL
Ga0075428_10003410433300006844Populus RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0073934_100001041233300006865Hot Spring SedimentMVTPRGWVPLAVLLAVLAGCALQDVRLKAPPTGLKTPIPGGNQRQVIVTIPFTDARRMKDRCGVQKGGYGNETAAAHCVDDPAQWLATMLAGELKASGFSVLAGPDGARDSALRVEGVLLKIFAEPVVGFWSTTVETDLHVKLVATSRTGLQAERTFYVKGELTSVIWPQGIFNDSLEAGTRDLLAKMVQAILDLMKAYPQLGFDRRPPSLLARQPEAAR*
Ga0075426_1010252423300006903Populus RhizosphereMHRVWLPSVLLALMLSGCATTDVKLKLPPTGLPTAIPGGSQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAGWIAHVLAAELKKSGFIVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDTCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPTQLRLAEWREAGR*
Ga0075424_10206181013300006904Populus RhizosphereMPVRAAVIAVLVLTCTGCALVNVRVKSPESGLETPIPGGNQRQIILTIPFQDSRSSMFRCGVQKGGFGNEIADAVCQGSPADWIPMLLARELEASGFTVLQSEEGARDTALKIEGVVLKIFVEPVVGPWSTTVESDFDVKLVATSRTGLRAERTFFSKGERTSFIWPQSIFDDSVSRGTRDLLSKMVHAILELMKR
Ga0075419_1015481223300006969Populus RhizosphereMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFNESLARGSQDLLAKMVEAVLELMKQYPELG*
Ga0099791_1000377813300007255Vadose Zone SoilMWFPCVLLALMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSLEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGWREAGR*
Ga0066710_100001165153300009012Grasslands SoilMATRGGWVWLAAVLSMLSGCALTDVNLKPPSSGLKAPIPGGNQRQIVVTAPFTDSREIKSRCGVQKGGYGNETAVATCQGEPAQWLADLLASELRASGFTVLTAETGARESALKLDGVLLKIFAEPVVGFWSTNVETDMNVRLVATSKTGLKAERTFFVKGELQSIIWTQGIFNDSVENGTRDLLKKMVEAILDLMNQYPQLGFKR
Ga0075418_1002937733300009100Populus RhizosphereMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFNESLARGSQDLLAKMVEAVLELMKQYPELG*
Ga0111538_1004307513300009156Populus RhizosphereTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0075423_1017447023300009162Populus RhizosphereMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0126382_1012775643300010047Tropical Forest SoilGCATTDVKLTLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPAHWITDALAAELRASGFTVLSARESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0134088_1052858413300010304Grasslands SoilMATRGGWVWLAAVLSMLSGCALTDVNLKPPSSGLKAPIPGGNQRQIVVTAPFTESREIKSRCGVQKGGYGNETAVATCQGEPAQWLADLLASELRASGFTVLTAETGARESALKLDGVLLKIFAEPVVGFWSTNVETDMNVRLVATSKTGLKAERTFFVKGELQSIIWTQGIFNDSVE
Ga0126376_1264179513300010359Tropical Forest SoilGCATTDVKLTLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWITDALAAELRASGFTVLSAPEGARDTAVTIEGSLLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAILELMKQYPE
Ga0126377_1130039023300010362Tropical Forest SoilVLLALTLSGCATTDVKLKLPPTGLPTAIPGGNQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELKKSGFTVLSPEEGAHDTAVRIEGALLTLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTD
Ga0126377_1198150213300010362Tropical Forest SoilMWIPGAILGLLLSGCATTDVNLKQPPAGLETPIPGGNQRQVIVTIPFTDERQIKNRCGLQKGGYGNETATAYCMGDPAQWIAAMLAAELRASGFTVLTTPEGSRDSALKIDGVLLKIFAEPVVGAWSTLIESDLSVRLVATSRTGLGAERTFFVKGNVENVIWTQGIFNDSVERGTRVLLRKMVEAILELMKQYPQLGFAG
Ga0134123_1000563463300010403Terrestrial SoilMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSWSGLRTERTFFVKGGRESIVWTNKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0124850_101020923300010863Tropical Forest SoilMLSGCATTDVKLTLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPAHWITDALAAELRASGFTVLSAPEGARDTAVTIEGSLLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG*
Ga0137399_1000948123300012203Vadose Zone SoilMWLPCVLLALMVSGCATIDVKLKQPPEGLKTPIAGGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGGRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGWREAGRLSVAPWRCWLRSV*
Ga0137369_1020095423300012355Vadose Zone SoilVAAPFADARQITNRCGMQKGGYGNETADALCQGDPAQWIAALLASELKASGFTVLPAEGGARDGALKVEGVLLKIFAEPVVGFWTTSVETDLNVKLVATSQTGLRAERTFFVKGELTSVIWPQGIFNDSVENGTRELLAKMVEAILELMKQYPELGLRYRHPAVVAWRPEQGR*
Ga0137397_1018953413300012685Vadose Zone SoilMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGF
Ga0137394_1031483233300012922Vadose Zone SoilMRRLWIPSAILVLMLSGCATTDVMLTPPASGLKMPIPGGNQRRVVITMPFSDARQITKRCGLQRSGYGDETATAYCEGDPAHWLAAKLASELEASGFTVLSAEQGGRDSALKIEGALLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGRREAGR*
Ga0137394_1045867123300012922Vadose Zone SoilMLSGCALTDVNVKPPSSGLKTPIPGGNQRQIIVTAPFSDSREIKNRCGVQKGGYGNETAVAVCQGEPAQWLADLLAIELKASGFTVLPSDAGARDSALKLDGVLLKIFAEPVVGAWSTNVETDMNVRLVATSKTGLKAERNFFVKGELQSVIWTQGIFNDSVERGAHELLKKMVDAIMELMSQYPQLGFNR*
Ga0137394_1102665313300012922Vadose Zone SoilMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKM
Ga0137394_1103995013300012922Vadose Zone SoilMQRLWIPSAILILMLSGCATTDVMLTPPASGLKTPIPGGNQRQVVITMPFSDARQITKRCGLQRSGYGAETATAYCEGDPTHWLAAKLASELEASGFTVLSADQGGRDSALKIEGALLKIFAEPVIGAWSTTIETDLSVRLGATSRTGLRTERTFFVKGDLETVIWTQGLFNDSLERGTRDLLG
Ga0137404_1016066423300012929Vadose Zone SoilMRDDRREAQAAARGIEDAHPRGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATASCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGWREAGR*
Ga0153915_1099881333300012931Freshwater WetlandsMVTRGGWLGLAVFLVALSGCALEDVHLKPPDSGLKTPIPGGNQRQVIVTIPFADARQITNRCGIQKGGYGNETAIAICQGEPARWIAAFLATELKASGFTVLPTEEGARESALKVEGVLLKIFAEPVVGFWATTVETDLNVKLVATSRTGLHAERTFFVKGELTSIIWPQGIFNDSMEDGTRQLLAKMVEAILALMRQYPELGFSHRHPAVVAWQPEQGR*
Ga0137410_1018875823300012944Vadose Zone SoilMRRLWIPSAILVLMLSGCATTDVMLTPPASGLKMPIPGGNQRQVVITMPFSDARQITKRCGLQRSGYGDETATAYCEGDPAHWLAAKLASELEASGFTVLSAEQGGRDSALKIEGALLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFTR
Ga0126375_1001155923300012948Tropical Forest SoilMLSGCATTDVKLKLPPTGLPTAIPGGNQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELKKSGFTVLSPEEGAHDTAVRIEGALLTLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPTPLRLAEWREAGR*
Ga0126375_1055649313300012948Tropical Forest SoilMWIPGAILGLLLSGCATTDVNLKQPPAGLETPIPGGNQRQVIVTIPFTDERQIKNRCGLQKGGYGNETATAYCMGDPAQWIAAMLAAELRASGFTVLTTPEGSRDSALKIDGVLLKIFAEPVVGAWSTLIESDLSVRLVATSRTGLGAERTFFVKGNVENVIWTQGIFNDSVERGTRVLLRK
Ga0137405_104349413300015053Vadose Zone SoilMRDDRREAQAAARGIEDAHPRGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATASCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGDAG
Ga0137409_10000695273300015245Vadose Zone SoilMRRLWIPSAILVLMLSGCATTDVMLTPPTSGLKMPIPGGNQRRVVITMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGWREAGR*
Ga0137403_10000141193300015264Vadose Zone SoilMRDDRREAQAAARGIEDAHPRGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATASCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFTRPLPLRLAGWREAGR*
Ga0132258_1006480353300015371Arabidopsis RhizosphereMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTVVTIEGALLKLFAEPVLGPFLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFDESLARGTQDLLAKMVEAVLELMKQHPELG*
Ga0132257_10005990523300015373Arabidopsis RhizosphereMLSGCATTDVKLNLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTVVTIEGALLKLFAEPVLGPFLTTIETDLSVRLVATSRSGLRTERTFFVKGARESIVWTDKCFDESLARGTQDLLAKMVEAVLELMKQHPELG*
Ga0187825_1014393013300017930Freshwater SedimentMATRGAWLGLVILFGMLSGCALQDVHLKPPESGLKKSIPGGNQRQVIVTIPFTDARQITNRCGIQKGGYGNETAIAICQDEPARWIAAFLATELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWATTVETDLKVKLVATSRSGLRAERTFFVKGTLTSVIWPQGIFNDSLEDGTRQLLTKMVEAILELMKQYPELGFSHRHPAVVAWQPGQ
Ga0187821_1008263023300017936Freshwater SedimentMATRGAWLGLVILFGMLSGCALQDVHLKPPESGLKKSIPGGNQRQVIVTIPFTDARQITNRCGIQKGGYGNETAIALCQDEPARWIAAFLATELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWATTVETDLKVKLVATSRSGLRAERTFFVKGTLTSVIWPQGIFNDSLEDGTRQLLTKMVEAILELMKQYPELGFSHRHPAVVAWQPGQ
Ga0187766_1077288113300018058Tropical PeatlandLTRNMATQGRPFWLALLVLALPACTFMDVSIKLPDSRPKTPIPGGNQRQIVLIVPFTDARQISDRCGVQKDAFGNETARGICQGRPGQWIAELLARKLRASGFTLLAMEDGARESALKIQGTLLKIFVEPVAGPSSATIESDLEVKLVATSRTGLRAERTFFTKGERTSVAWMQGEERDFNDSLENGVRQLLANMVNAILELVNQYPQLGLEGDRYHAAMSW
Ga0184615_1004503833300018059Groundwater SedimentMVTRGGWLGLAVLLVTLSGCALEDVRLKPPTSGLKKPIPGGNQRQVIVAVPFADARQITNRCGMQKGGYGNETANALCQGEPAEWIATFLAGELKASGFTVLPAEGGARESALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAKRTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0066662_1207778513300018468Grasslands SoilMTTRRGWTFLTVLVFAVSGCALAEVNVKPPEAGLEKPIPGGNQRQVIVAIPFQEARQSTSRCGVQKGGYGNETAQAVCQGNPTQWIAEFLARELRASGFTVLPSEQGARDSALKVEGVLLKIFVEPVVGFWSTTVESDLNVKLVATSKTGLQAERTFFAKGEKTSIIWPQGIFNDSVERGSRDLLTKMVEAILE
Ga0209619_1039937513300025159SoilLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRDLLTKMVEAILDLMKQYPELGVSHRHPAVVAWRPEQGR
Ga0209109_1001761023300025160SoilMVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAIVASRPEQER
Ga0209521_1015296033300025164SoilLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRDLLTKMVEAILDLMKQYPELGVSHRHPAVVAWRPEQGR
Ga0209108_1004269233300025165SoilMVTLGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGVSHRQPAVVAWRPEQGR
Ga0209642_1062386713300025167SoilDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPE
Ga0209002_1006318343300025289SoilMVTLGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0209172_100000092213300025310Hot Spring SedimentMVTPRGWVPLAVLLAVLAGCALQDVRLKAPPTGLKTPIPGGNQRQVIVTIPFTDARRMKDRCGVQKGGYGNETAAAHCVDDPAQWLATMLAGELKASGFSVLAGPDGARDSALRVEGVLLKIFAEPVVGFWSTTVETDLHVKLVATSRTGLQAERTFYVKGELTSVIWPQGIFNDSLEAGTRDLLAKMVQAILDLMKAYPQLGFDRRPPSLLARQPEAAR
Ga0209321_1008855933300025312SoilMVTLGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFSHRHPAVVAWRPEQGR
Ga0209431_1038324323300025313SoilMVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0209323_1041276723300025314SoilLPMVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0209519_102700191 3300025318 Soil MVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAIVASRPEQER
Ga0209520_100167446 3300025319 Soil MVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFSHRHPAVVAWRPEQGR
Ga0209641_103331883 3300025322 Soil LKTPIPGGNQRQVIVAVPFADGRQIANRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0209640_102283052 3300025324 Soil VVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFSHRHPAVVAWRPEQGR
Ga0209341_104052422 3300025325 Soil MVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGFGHHHPAVVAWRPEQGR
Ga0207660_111245701 3300025917 Corn Rhizosphere MKSTKLALALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSWSGLRTERTFFVKGGRESIVWTNKCFNE
Ga0207681_104226492 3300025923 Switchgrass Rhizosphere MDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWNDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Ga0207701_102079011 3300025930 Corn, Switchgrass And Miscanthus Rhizosphere TGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Ga0208417_1078751 3300025999 Rice Paddy Soil ADVHLKPPDSGLKKPIPGGNQRQVIVTIPFADARQITNRCGIQKGGFGNETAIAICQDDPARWIAAFLAAELKASGFTVLPAEDGTRDSALKVEGVLLKIFAEPVVGFWTTTVETDLNVKLVATSRTGLRAERTFFVKGELTSVIWPQGIFNDSLEDGTRQLLAKMVEAILELMKQYPELGFN
Ga0209375_10478092 3300026329 Soil MRRVWLPCVLLGLMVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTIPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTMIETDLSVRLVATTRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFTRPLPLRLAGWRETGR
Ga0209466_11082531 3300027646 Tropical Forest Soil LAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVKLKLPPTGLPTAIPGGNQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAQWIAHVLAAELKKSGFTVLSPEEGAHDTAVRIEGALLTLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGR
Ga0209388_10013116 3300027655 Vadose Zone Soil MVSGCATIDVKLKQPPEGLKTPIPGGNQRQVIVTMPFSDARQITNRCGMQKGGYGNETATAYCQGDPTQWIAAMLAAELKASGFTVLSPEAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLETVIWTQGLFNDSLERGTRDLLGKMVEAILELMKQYPELGFARPLPLRLAGWREAGR
Ga0207428_102194362 3300027907 Populus Rhizosphere VYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAIDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGKQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Ga0209382_105184722 3300027909 Populus Rhizosphere MDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Ga0268264_104131583 3300028381 Switchgrass Rhizosphere SPVYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLELMKQYPELG
Ga0247828_108899521 3300028587 Soil VYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTER
Ga0307469_1000021013 3300031720 Hardwood Forest Soil MDRPWLPCVLLALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCKDDPARWITDALAAELMASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLQLMKQYPELG
Ga0307469_100205562 3300031720 Hardwood Forest Soil MRRVWLPCALLALMLSGCATIDVKLKQPPEGLKTPIPGGHQRQVIVTMPFSDARQITNRCGVQKGGYGNETATAYCEGDPTQWIAAMLAAELKASGFIVLSTDAGSRDTALKIEGVLLKIFAEPVVGAWSTVIETDLSVRLVATSRTGLRAERTFFVKGDLESIIWTQGLFNDSLERGTRDLLGKMVEAILELMKEYPELGFTRPLPLRLAGWREAGR
Ga0307469_101151813 3300031720 Hardwood Forest Soil MRRVWLPCAILVLMLSGCSTTDVKLLPPPAGLTAPIPGGAQRQVIITAPFADARQITNRCGMQRSGYGDETASAYCEGDPAQWIAALLAAELKASGFTVLSTEEGARDSALKIEGVLLKIFAEPVIGPFLTAVETDLSVKLVATSRTGLRAERTFFVKGDREAIVWTQGTFNDSLDRGTRELLGKMVEDILELMKRYPQLGLAR
Ga0307469_101584851 3300031720 Hardwood Forest Soil MAIRGGWVRLGVLLSTLSGCALTDVNVKPPSSGLKTPIPGGSQRQIIVTAPFADSREIKSRCGVQKGGYGNETAVAVCQGEPAQWLADLLAIELKASGFTVLPTDAGARDSALKLDGVLLKVFAEPVVGAWSTNVETDMNVRLVATSKTGLKAERNFFVKGELQSIIWTQGIFNDSVEHGAHDLLKKMVEAILELMNQYPQLGFRR
Ga0307468_1005016043 3300031740 Hardwood Forest Soil LMLSGCSTTDVKLLPPPAGLTAPIPGGAQRQVIITVPFADARQITNRCGMQRSGYGDETASAYCEGDPAQWIAALLAAELKASGFTVLSTEEGARDSALKIEGVLLKLFAEPVIGPWLTAIETDLSVKLVATSRTGLRAERTFFVKGDRESIVWTQGTFNDSLDRGTREILGKMVEDILELMKRYPQLGFAR
Ga0307468_1006595162 3300031740 Hardwood Forest Soil SPVYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLQLMKQYPELG
Ga0307468_1015075311 3300031740 Hardwood Forest Soil SPVYSRAAVWPILAVLGTNEAPSVESGRPSLVGGRHPIAMDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGDETATVYCEDDPARWIADALAAELRASGFTVLSAREGARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTD
Ga0307473_101919482 3300031820 Hardwood Forest Soil MHRVWLPCVLLALMLSGCATTDVKLKLPPTGLPTSIPGGNQRQIILTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELKKSGFTVLSAEEGARDTAVTIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRDSIVWTDRCFNESLARGTRDLLAKMVEAILELMKQYPELGFTRPAPVRVAGWREAGR
Ga0307473_101995572 3300031820 Hardwood Forest Soil MDRPWLPCVLLALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITNALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLL
Ga0307473_102906432 3300031820 Hardwood Forest Soil MHRVWLPSVLLALMLSGCATTDVKLKLPLTGLPTAIPGGSQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAGWIAHVLAAELKKSGFTVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDTCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPTQLRLAEWREAGR
Ga0214473_102604914 3300031949 Soil MTTRRRCLLLWLLVVAFPGCALTDVRLKPPESGVKTPIPGGNERQVIVAVPFSDARQIKDRCGVQKGGYGNETAKALCVGDPAQWVATFLARELTASGFMVLPADGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLEVKLMATSRTGLHAERTFFVKGELTSVIWPQGIFNDSLENGTRALLAKMVEAILELMKQYPELGFRHSHAATVAWQPEQGR
Ga0214473_109939542 3300031949 Soil MVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQGEPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGVSHRHPAVVAWRPEQGR
Ga0315278_100750424 3300031997 Sediment MITRGGWLGLTVLLVALSGCALQDVHLKPPDSGLKTPIPGGNQRQVIVTIPFADARQITNRCGIQKGGFGNETAIAICQGEPVQWIAAFLAAELKASGFTVLPTEEGARESALKVEGVLLKIFAEPVVGFWTTTVESDLNVKLVAMSRTGLRAERTFFVKGELTSIIWPQGIFNDSLEDGTRQLLTKMVEAILELMKQYPELGFSHRYPAVVAWQPGQGR
Ga0307470_100238854 3300032174 Hardwood Forest Soil MDRPWLPCVLLALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCKDDPARWITDALAAELMASGFTVLSAQESARDTAVTIEGALLKLFAEPVRGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLQLMKQYPELG
Ga0307470_101189773 3300032174 Hardwood Forest Soil MRRVWLPCAILVLMLSGCSTTDVKLLPPPAGLTAPIPGGAQRQVIITVPFADARQITNRCGMQRSGYGDETASAYCEGDPAQWIAALLAAELKASGFTVLSTEEGARDSALKIEGVLLKLFAEPVIGPWLTAIETDLSVKLVATSRTGLRAERTFFVKGDRESIVWTQGTFNDSLDRGTREILGKMVEDILELMKRYPQLGFAR
Ga0307471_1007210432 3300032180 Hardwood Forest Soil MHRVWLPSVLLALMLSGCATTDVKLKLPLTGLPTAIPGGSQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAGWIAHVLAAELKKSGFTVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDTCFNESLARGTQDLLAKMVEAILELMKQYPELGFTRPTQLRLAEWREA
Ga0307471_1007221061 3300032180 Hardwood Forest Soil RVWLPCAILVLMLSGCSTTDVKLLPPPAGLTAPIPGGAQRQVIITVPFADARQITNRCGMQRSGYGDETASAYCEGDPAQWIAALLAAELKASGFTVLSTEEGARDSALKIEGVLLKLFAEPVIGPWLTAIETDLSVKLVATSRTGLRAERTFFVKGDRESIVWTQGTFNDSLDRGTREILGKMVEDILELMKRYPQLGFAR
Ga0307471_1017712521 3300032180 Hardwood Forest Soil MTTLRGWMLLTVLAFSGCALAEVHVKPPETGLETPVPGGDQRQVIVVIPFQDARQSTSRCGVQKGGYGNETAQAICQGSPAQWIAESLARELRASGFTVLPSEEGARDSALRVEGALLKVFVEPVVGFWSTTVESDLNVKLVVTSKTGLQAERTFFAKGEKTSVIWPQGIFNDSVERGSRDLLTKMVE
Ga0307472_1000039231 3300032205 Hardwood Forest Soil MHRVWLPSVLLALMLSGCATTDVKLKLPLTGLPTAIPGGSQRQIVLTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPAGWIAHVLAAELKKSGFTVLSPEEGARDTAVRIEGALLKLFAEPVLGPWSTTIETDLSVRLVATSRTGLRTERTFFVKGGRESIVWTDTCFNESLARGTQDLLAKMVEAILELMKQYPEL
Ga0307472_1000782541 3300032205 Hardwood Forest Soil MDRPWLPCVLLALMLSGCATTDVKLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPFLTTIETDLSVRLVATSRSGLRTERTFFVKGERESIVWTDKCFNESLARGTQDLLAKMVEAVLELMK
Ga0307472_1025843621 3300032205 Hardwood Forest Soil PTSIPGGNQRQIILTMPFADARQVPTLCGVQKSGYGDETATVYCEDDPARWIAHVLAAELRKSGFTVLSAEEGARDTAVTIEGALLKLFAEPVLGPWSTTIETDLSVRLMATSRTGLRTERTFFVKGGRDSIVWTDRCFNESLARGTQDLLAKMVEAILELMKQYPELGFT
Ga0335082_100218617 3300032782 Soil MRGKRFWLALLALALSGCALTDVNIKPPDSGLKTPIPGGNQRQVIVVIPFTDARQTTDRCGVQKGGFGNETAVGICQGNPAQWIAEFLARELKASGFSVLSAGEGRDSALKLEGTLLKIFAEPVVGFWSTTVESDLQVKLVATSGTGLRAERTFFVKGELTSVIWPQGIFNDSLEDGVRQMLKRMVEAILELMSQYPQLGFHRDGHRLALGDWQARAWP
Ga0335084_111858351 3300033004 Soil MTTRGKRFWLAFLVLALAGCALTDVNIKPPDSGLKMPIPGGNQRQVIVIAPFADARQITDRCGVQKGGYGNETARGICQGSPAQWLAEFLARELRASGFTVLSAEDGRESALKIEGTLLKFFVEPVVGFWSTTVESDLQVKLVATSRTGLRAQRTFFAKGELTSVIWTQGIFNDSLEDGVRQLLTKMVEAILELMKQYPQLSFER
Ga0214471_101737492 3300033417 Soil MVTRGGWLGLAVLLVSLSGCALEDVRLKPPTSGLKTPIPGGNQRQVIVAVPFADGRQITNRCGMQKGGYGNETANALCQREPAQWIAAFLASELKASGFTVLPAEDGARDSALKVEGVLLKIFAEPVVGFWSTTVETDLNVKLVATSRTGLRAERTFFVKGELTSIIWPQGIFNDSLENGTRELLTKMVEAILDLMKQYPELGVSHRQPAVVAWRPEQGR
Ga0247829_109727322 3300033550 Soil MAMRRGWMLLTALLFAVSGCALAEVNVKPPEAGLEQPIPGGNQRQVIVTIPFQDARQSASRCGVQKGGWGNETAQAVCQGNPTQWIAEFLARELRASGFIVLTSEQGARDTALKVEGVLVKIFVEPVVGFWSTTVESDLNVKLVATTKTGLQAERMFFAKGEKTSVIWPQGIFNDSVERGSRDLLTKMVEAILELLK
Ga0247829_113280151 3300033550 Soil VRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTFCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLQLMKQYPELG
Ga0247830_111284011 3300033551 Soil LALMLSGCATTDVRLKLPSTGLPTPIPGGNQRQIVLTKPFADGRQVPTLCGVQKSGYGVETATVYCEDDPARWITDALAAELRASGFTVLSAQESARDTAVTIEGALLKLFAEPVLGPWLTTIETDLSVRLVATSRSGLRTERTFFVKGGRESIVWTDKCFNESLARGTQDLLAKMVEAVLQLMKQYPELG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice