NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F054691



Overview

Basic Information
Family ID: F054691
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 139
Average Sequence Length: 121 residues
Representative Sequence: MSKRVYVSIDIAVTTAQEDTPVSLRIQVSSATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRLVELLEAGPLVV
Number of Associated Samples: 109
Number of Associated Scaffolds: 139

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 75.37 %
% of genes near scaffold ends (potentially truncated): 79.14 %
% of genes from short scaffolds (< 2000 bps): 89.93 %
Associated GOLD sequencing projects: 107
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.54


Most Common Taxonomy
Group: Unclassified (65.468 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (23.741 % of family members)
Environment Ontology (ENVO): Unclassified (28.058 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (37.410 % of family members)
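The percentages in this overview can be turned back into whole sequence counts by multiplying by the family size of 139. The following is a minimal Python sketch, assuming each percentage is computed over all 139 family members; the helper name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
def members_from_percent(percent: float, family_size: int = 139) -> int:
    """Convert a '% of family members' figure into a whole number of sequences.

    Assumption: the percentage was computed over the full family (139 sequences).
    """
    return round(percent / 100 * family_size)


# Shares copied from the overview of family F054691.
shares = {
    "Taxonomy: Unclassified": 65.468,
    "GOLD: Vadose Zone Soil": 23.741,
    "ENVO: Unclassified": 28.058,
    "EMPO: Plant rhizosphere": 37.410,
}

counts = {label: members_from_percent(pct) for label, pct in shares.items()}

for label, n in counts.items():
    print(f"{label}: {n} of 139 sequences")
```

Under that assumption, the four shares correspond to 91, 33, 39, and 52 sequences respectively.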




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.23%, β-sheet: 0.00%, Coil/Unstructured: 55.77%

Predicted 3D Structure

pTM-score: 0.54

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 139 Family Scaffolds
PF13551 | HTH_29 | 6.47
PF13592 | HTH_33 | 2.16
PF13518 | HTH_28 | 2.16
PF00296 | Bac_luciferase | 2.16
PF00248 | Aldo_ket_red | 0.72
PF05685 | Uma2 | 0.72
PF03328 | HpcH_HpaI | 0.72
PF01904 | DUF72 | 0.72
PF13358 | DDE_3 | 0.72
PF01610 | DDE_Tnp_ISL3 | 0.72
PF07883 | Cupin_2 | 0.72
PF01548 | DEDD_Tnp_IS110 | 0.72
PF13340 | DUF4096 | 0.72
PF00006 | ATP-synt_ab | 0.72
PF03400 | DDE_Tnp_IS1 | 0.72
PF02781 | G6PD_C | 0.72
PF14707 | Sulfatase_C | 0.72
PF01051 | Rep_3 | 0.72
PF01850 | PIN | 0.72
PF04257 | Exonuc_V_gamma | 0.72
PF04986 | Y2_Tnp | 0.72
PF13280 | WYL | 0.72
PF02371 | Transposase_20 | 0.72
PF01609 | DDE_Tnp_1 | 0.72
PF02163 | Peptidase_M50 | 0.72
PF01738 | DLH | 0.72
PF00155 | Aminotran_1_2 | 0.72
PF00491 | Arginase | 0.72
PF01724 | DUF29 | 0.72
PF08028 | Acyl-CoA_dh_2 | 0.72
PF04392 | ABC_sub_bind | 0.72
PF00501 | AMP-binding | 0.72
PF02627 | CMD | 0.72

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 139 Family Scaffolds
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 2.16
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 1.44
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.72
COG5659 | SRSO17 transposase | Mobilome: prophages, transposons [X] | 0.72
COG5527 | Protein involved in initiation of plasmid replication | Mobilome: prophages, transposons [X] | 0.72
COG5433 | Predicted transposase YbfD/YdcC associated with H repeats | Mobilome: prophages, transposons [X] | 0.72
COG5421 | Transposase | Mobilome: prophages, transposons [X] | 0.72
COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 0.72
COG3836 | 2-keto-3-deoxy-L-rhamnonate aldolase RhmA | Carbohydrate transport and metabolism [G] | 0.72
COG3464 | Transposase | Mobilome: prophages, transposons [X] | 0.72
COG3385 | IS4 transposase InsG | Mobilome: prophages, transposons [X] | 0.72
COG3293 | Transposase | Mobilome: prophages, transposons [X] | 0.72
COG3039 | Transposase and inactivated derivatives, IS5 family | Mobilome: prophages, transposons [X] | 0.72
COG0010 | Arginase/agmatinase family enzyme | Amino acid transport and metabolism [E] | 0.72
COG2301 | Citrate lyase beta subunit | Carbohydrate transport and metabolism [G] | 0.72
COG2128 | Alkylhydroperoxidase family enzyme, contains CxxC motif | Inorganic ion transport and metabolism [P] | 0.72
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.72
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 0.72
COG1662 | Transposase and inactivated derivatives, IS1 family | Mobilome: prophages, transposons [X] | 0.72
COG1330 | Scaffold subunit RecC of the DNA repair enzyme RecBCD (exonuclease V) | Replication, recombination and repair [L] | 0.72
COG0599 | Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase family | General function prediction only [R] | 0.72
COG0469 | Pyruvate kinase | Carbohydrate transport and metabolism [G] | 0.72
COG0364 | Glucose-6-phosphate 1-dehydrogenase | Carbohydrate transport and metabolism [G] | 0.72



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 65.47 %
All Organisms | root | All Organisms | 34.53 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000550|F24TB_10085492 | Not Available | 570
3300000955|JGI1027J12803_102217858 | Not Available | 643
3300003994|Ga0055435_10051930 | Not Available | 991
3300004156|Ga0062589_102681676 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Octadecabacter → Octadecabacter arcticus | 518
3300004268|Ga0066398_10068565 | Not Available | 761
3300004633|Ga0066395_10734719 | Not Available | 588
3300005172|Ga0066683_10084621 | All Organisms → cellular organisms → Bacteria | 1913
3300005177|Ga0066690_10339828 | Not Available | 1017
3300005181|Ga0066678_10103117 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae | 1730
3300005181|Ga0066678_10136240 | Not Available | 1523
3300005181|Ga0066678_10931300 | Not Available | 567
3300005289|Ga0065704_10048326 | Not Available | 700
3300005294|Ga0065705_10792394 | Not Available | 613
3300005294|Ga0065705_10980788 | Not Available | 552
3300005294|Ga0065705_11131932 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → delta proteobacterium NaphS2 | 516
3300005445|Ga0070708_101678713 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 591
3300005446|Ga0066686_10995144 | Not Available | 545
3300005536|Ga0070697_100496471 | Not Available | 1067
3300005554|Ga0066661_10725643 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Paraburkholderia → Paraburkholderia tuberum | 583
3300005556|Ga0066707_10698481 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 636
3300005557|Ga0066704_10532009 | All Organisms → cellular organisms → Bacteria | 769
3300005559|Ga0066700_10310070 | Not Available | 1113
3300005559|Ga0066700_10786019 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Octadecabacter → Octadecabacter arcticus | 643
3300005586|Ga0066691_10313504 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 926
3300006797|Ga0066659_11800047 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Octadecabacter → Octadecabacter arcticus | 518
3300006844|Ga0075428_100221313 | All Organisms → cellular organisms → Bacteria | 2044
3300006844|Ga0075428_101658125 | All Organisms → cellular organisms → Bacteria | 668
3300006845|Ga0075421_102550275 | Not Available | 532
3300006852|Ga0075433_11508301 | Not Available | 581
3300006854|Ga0075425_101028832 | Not Available | 939
3300006854|Ga0075425_102755270 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales | 541
3300006904|Ga0075424_100190340 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2177
3300006953|Ga0074063_13956627 | Not Available | 575
3300006969|Ga0075419_11197986 | Not Available | 560
3300009012|Ga0066710_101724159 | All Organisms → cellular organisms → Bacteria | 951
3300009012|Ga0066710_102261264 | Not Available | 792
3300009038|Ga0099829_10977552 | Not Available | 702
3300009089|Ga0099828_10292517 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1464
3300009090|Ga0099827_11403132 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Candidatus Tectomicrobia → Candidatus Entotheonella | 608
3300009090|Ga0099827_12010353 | Not Available | 502
3300009100|Ga0075418_13020377 | Not Available | 513
3300009101|Ga0105247_10791176 | All Organisms → cellular organisms → Bacteria | 723
3300009137|Ga0066709_100172416 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Archangiaceae → Stigmatella → Stigmatella aurantiaca | 2803
3300009444|Ga0114945_10419652 | Not Available | 799
3300009444|Ga0114945_10778349 | Not Available | 586
3300009691|Ga0114944_1035744 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1783
3300009792|Ga0126374_11610694 | Not Available | 537
3300009797|Ga0105080_1037121 | Not Available | 574
3300009812|Ga0105067_1095295 | Not Available | 534
3300009817|Ga0105062_1043131 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Pseudacidovorax → unclassified Pseudacidovorax → Pseudacidovorax sp. RU35E | 813
3300009818|Ga0105072_1128787 | Not Available | 522
3300009819|Ga0105087_1122911 | Not Available | 505
3300010046|Ga0126384_11774903 | Not Available | 585
3300010046|Ga0126384_11895549 | All Organisms → cellular organisms → Bacteria | 568
3300010046|Ga0126384_12361211 | Not Available | 514
3300010047|Ga0126382_11281423 | Not Available | 661
3300010047|Ga0126382_12260249 | Not Available | 525
3300010048|Ga0126373_13183097 | Not Available | 511
3300010087|Ga0127492_1085841 | All Organisms → cellular organisms → Bacteria | 884
3300010304|Ga0134088_10579930 | Not Available | 557
3300010362|Ga0126377_13107730 | Not Available | 536
3300010376|Ga0126381_103458711 | Not Available | 621
3300010398|Ga0126383_11153209 | All Organisms → cellular organisms → Bacteria | 865
3300010398|Ga0126383_13534582 | Not Available | 510
3300011270|Ga0137391_10807808 | Not Available | 772
3300011270|Ga0137391_11561007 | Not Available | 504
3300011439|Ga0137432_1297288 | Not Available | 516
3300012189|Ga0137388_10427496 | Not Available | 1227
3300012189|Ga0137388_12009560 | Not Available | 506
3300012204|Ga0137374_10063331 | All Organisms → cellular organisms → Bacteria | 3695
3300012205|Ga0137362_11753621 | Not Available | 508
3300012206|Ga0137380_11664547 | Not Available | 522
3300012207|Ga0137381_10964131 | All Organisms → cellular organisms → Bacteria | 736
3300012209|Ga0137379_10227832 | All Organisms → cellular organisms → Bacteria | 1778
3300012209|Ga0137379_11526256 | All Organisms → cellular organisms → Bacteria | 568
3300012209|Ga0137379_11665223 | Not Available | 535
3300012211|Ga0137377_10931709 | All Organisms → cellular organisms → Bacteria | 800
3300012351|Ga0137386_11126685 | Not Available | 553
3300012359|Ga0137385_10170463 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidetes Order II. Incertae sedis → Rhodothermaceae → Salinibacter | 1909
3300012361|Ga0137360_10210684 | Not Available | 1578
3300012362|Ga0137361_10117234 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 2342
3300012383|Ga0134033_1022288 | All Organisms → cellular organisms → Bacteria | 545
3300012917|Ga0137395_10184161 | Not Available | 1447
3300012922|Ga0137394_10332256 | Not Available | 1298
3300012922|Ga0137394_11321664 | All Organisms → cellular organisms → Bacteria | 583
3300012923|Ga0137359_10332578 | All Organisms → cellular organisms → Bacteria | 1351
3300012927|Ga0137416_11060638 | Not Available | 726
3300012930|Ga0137407_10613415 | All Organisms → cellular organisms → Bacteria | 1021
3300012930|Ga0137407_11098820 | Not Available | 754
3300012930|Ga0137407_11988518 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 555
3300012944|Ga0137410_10008392 | All Organisms → cellular organisms → Bacteria | 6991
3300012948|Ga0126375_10492147 | All Organisms → cellular organisms → Bacteria | 912
3300012948|Ga0126375_11194051 | Not Available | 633
3300012971|Ga0126369_11356866 | Not Available | 801
3300012971|Ga0126369_12883967 | Not Available | 563
3300012972|Ga0134077_10070787 | Not Available | 1316
3300012987|Ga0164307_11413686 | Not Available | 586
3300015245|Ga0137409_10057724 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3663
3300015264|Ga0137403_11111150 | Not Available | 636
3300015357|Ga0134072_10464754 | Not Available | 513
3300015371|Ga0132258_11002187 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2111
3300018031|Ga0184634_10195353 | All Organisms → cellular organisms → Bacteria | 920
3300018078|Ga0184612_10035849 | Not Available | 2572
3300018082|Ga0184639_10400029 | All Organisms → cellular organisms → Bacteria | 709
3300018468|Ga0066662_12162788 | Not Available | 583
3300018468|Ga0066662_12197139 | Not Available | 579
3300018482|Ga0066669_10735346 | Not Available | 870
3300019255|Ga0184643_1390312 | Not Available | 523
3300019259|Ga0184646_1280353 | Not Available | 505
3300021086|Ga0179596_10726340 | Not Available | 503
3300022563|Ga0212128_10198280 | All Organisms → cellular organisms → Bacteria | 1281
3300022563|Ga0212128_10459415 | Not Available | 783
3300025934|Ga0207686_10597153 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Chitinophagia → Chitinophagales → Chitinophagaceae → Parafilimonas → Parafilimonas terrae | 868
3300027169|Ga0209897_1027630 | Not Available | 807
3300027511|Ga0209843_1033218 | Not Available | 948
3300027882|Ga0209590_10807190 | Not Available | 596
3300028587|Ga0247828_11002572 | Not Available | 546
3300030600|Ga0247659_1216770 | Not Available | 514
3300030993|Ga0308190_1033180 | Not Available | 922
3300031058|Ga0308189_10212041 | Not Available | 709
3300031094|Ga0308199_1132749 | Not Available | 578
3300031114|Ga0308187_10492289 | Not Available | 503
3300031421|Ga0308194_10180666 | Not Available | 671
3300031421|Ga0308194_10364202 | Not Available | 518
3300031668|Ga0318542_10199828 | Not Available | 1008
3300031679|Ga0318561_10853498 | Not Available | 501
3300031720|Ga0307469_12366825 | Not Available | 518
3300031913|Ga0310891_10349538 | Not Available | 530
3300031944|Ga0310884_10936524 | Not Available | 536
3300032013|Ga0310906_10166364 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexia → environmental samples → uncultured Chloroflexia bacterium | 1304
3300032025|Ga0318507_10222012 | All Organisms → cellular organisms → Bacteria | 818
3300032063|Ga0318504_10089531 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae → Fimbriiglobus → Fimbriiglobus ruber | 1364
3300034673|Ga0314798_065201 | Not Available | 712
3300034680|Ga0370541_048690 | Not Available | 551
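Each scaffold identifier in the table above appears to combine two parts joined by a pipe character: a numeric prefix that matches a Taxon OID in the Associated Samples table (for example 3300000550) and a scaffold name. The sketch below splits an identifier on that assumption; the `ScaffoldRef` type and `parse_scaffold_id` function are hypothetical names introduced for illustration, not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str      # numeric prefix, assumed to be the IMG/M Taxon OID
    scaffold_name: str  # remainder after the pipe


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the identifier contains a pipe, a purely numeric prefix,
    and a non-empty scaffold name; anything else raises ValueError.
    """
    taxon_oid, sep, name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, name)


ref = parse_scaffold_id("3300000550|F24TB_10085492")
print(ref.taxon_oid, ref.scaffold_name)
```

Grouping rows by `taxon_oid` would then show which samples contribute more than one scaffold to the family, such as 3300005181 with three entries.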




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 23.74%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 11.51%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.07%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 7.19%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.47%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 5.04%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 5.04%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.32%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.60%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.60%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.60%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.16%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 2.16%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.44%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.44%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.44%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.72%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.72%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.72%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.72%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.72%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.72%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.72%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.72%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300003994 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004268 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio | Environmental
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005289 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006953 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHMB (Metagenome Metatranscriptome) | Environmental
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009797 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_10_20 | Environmental
3300009812 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 | Environmental
3300009817 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 | Environmental
3300009818 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 | Environmental
3300009819 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010087 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011439 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012383 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019255 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome) | Environmental
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300022563 | OV2_combined assembly | Environmental
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300027169 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 (SPAdes) | Environmental
3300027511 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300028587 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3 | Environmental
3300030600 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Dnb12 (Eukaryote Community Metatranscriptome) | Environmental
3300030830 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_368 (Metagenome Metatranscriptome) | Environmental
3300030993Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_185 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031058Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031094Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031114Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031668Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031913Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D4EnvironmentalOpen in IMG/M
3300031944Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1EnvironmentalOpen in IMG/M
3300032013Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3EnvironmentalOpen in IMG/M
3300032025Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f20EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300034673Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034680Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_116 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
F24TB_100854921 3300000550 Soil RTTVKDLHSRLQHAYQRADVRLVQRITGLLDLLVHWVPMVVLCGRWGLSPSYLHAWQQKGLLPGMESLVYGHRRGRPPKLTPQPKKRLGRVPRGCG*
JGI1027J12803_1022178581 3300000955 Soil MSKRVYVSIDIALTTEQEDTPVSLRIQVSAVTVKALHARLHQAYLQDDVRLVRRTTVLIDLLVHHVPMAGLCERWGLSSSCLYDWQRAFLLHGLERLRYRDSGGRRPKLTPRQKNRLVELLEAG
Ga0055435_100519302 3300003994 Natural And Restored Wetlands MRERGYLLQNDCGITQQEDTPVGIRIQVSNATVKALQDRLQQAYRQDDVRLVRRTTVLIDLLVHHVPVVVVGERWGLSPACLYDWQKAFMLRGMDSLLYRHGGGRPEKLTS
Ga0062589_1026816761 3300004156 Soil MSKRVYVSRDISVTTAQEDTPVSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLGASCLYDWQKAFLLHGLESLCYHHSGGRRPKLTPKQKRRLVDLLEAGPLVVGCETACWDAVLIRVL
Ga0066398_100685651 3300004268 Tropical Forest Soil MSTRVYLSKHACQTPHEEESPVCLRIQLSRATVKDLHRRLQYAYQRNDVRLVRRTTVLIDLLVHHVPVAVLCERWDLSPACIYGWQQAFLLRGLDSVVYSHGGGR
Ga0066395_107347191 3300004633 Tropical Forest Soil MGNSGLASRSKRVYVSIDIASTTAQEETPVSLSIPVRAATVKALHAQLPQAYRQAAGRLVRRTTVLLALLVHHVPMAVLGERWGLSSSWLYDWQRAVLRHGLESVRSRHGGSRRPQ*
Ga0066683_100846211 3300005172 Soil MRIAVYHPSDIAVTTAQEDTPVSIRIQVSSATIKALYNRLQQAYLKDDVRLVRRTTVLIDLLVHLVPMALLCERWGLSASCLYDWQRAFLLHGLDSSVYRHGGGRRPKLTPKQ
Ga0066690_103398282 3300005177 Soil MSTRVYLSRDMAVIPDQEDTPVRLSIQGSSATVTAVHTKVHQAYRKDERRVGRRTTVLRALLGPQGPSPVLCARWSLRASGLYDWPRAFLLHGLDRLGSRHRGGRRPQWRPRQKNRLGERREAGPRVVGCETACWDAGLIRGLLWRACGGLDNRP*
Ga0066678_101031171 3300005181 Soil VSIRIQVSSATVKALHTKLQQAYLKDDVRLVRRMTVVIDLLVHHVPVEVLHTRWGLSISCLYHWRQDFLLRGMDSLGYHHSGGRRPKLTPRQKKRVCPTFYTWRFVGLCGSLYRP*
Ga0066678_101362402 3300005181 Soil MAVIPDQEDTPVRLSIQGSSATVTAVHTKVHQAYRKDERRVGRRTTVLRALLGPQGPSPVLCARWSLRASGLYDWPRAFLLHGLDRLGSRHRGGRRPQWRPRQKKRLGERREAGPRVVGCETACWDAGLIRGLLWRACGGLDNRP*
Ga0066678_109313001 3300005181 Soil MSTRVYLSKHTCQITPQEDTPVCIRIQPSRTTAKALQSRLQLAYQRDDVRLVRRITVLLDLLVSHVPVAVLCERWSLSPACVYAWQKAFLLRGLDSLAYSHGGGRQPRLPPRQKQRLVELMDAGPLAVGCETACWNSVLIRVLIWREF
Ga0065704_100483261 3300005289 Switchgrass Rhizosphere MSTRVYLSKHTCQTTHEEETPVCLRIQLSRATVKELHSRLQHAYQRDDMRLVRRITVLLDLLVHQVPVEVLSERWHLSISCLYQWRQAFLLRGMDSLVYHYSGGRRPKLT
Ga0065705_107923941 3300005294 Switchgrass Rhizosphere MSTRVYLSKHTCQTTHEEETPVCLRIQLSRATVKELHSRLQHAYQRDNVRLVRRITVLLDLLAHRVPMAVLCERWGLSPSCLYAWQQAFLLPGMDSLVYSHSGGRPP
Ga0065705_109807882 3300005294 Switchgrass Rhizosphere VGIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLIDLLVHQVPVAVLCERWGLSPSCLYTWQQAFLLRGMDSLVYGHSGGRRPKLSPRQKKRLVELSEAGPLVVG
Ga0065705_111319321 3300005294 Switchgrass Rhizosphere VSIRIQLSRATVKDLYNRLQHAYRHDDVRLVRRITVVIDLLVHHVPVEVLHTRWGLSISCIYPWRRDFLLRGLDSLIYHQSGGRRPKLTPRQKKRLVELLEAGPQVVGCETACW
Ga0070708_1016787131 3300005445 Corn, Switchgrass And Miscanthus Rhizosphere MSKRVYVSRDISVTTAQEDTPVSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYHHSGGRRPKLTPKQKRRLVDLLEAGPLVVGCETACWDAVL
Ga0066686_109951441 3300005446 Soil MSTRVYHSKHTYQTTHEEDTPVCLRIQLSRATVKDLHSRLQHAYRKDDVRLVRRTTVLIDLLVHHVPVAVLCERWGLRPSCLYDWQSAFMLHGMDSLRYRHSGGRPEKLTPSQKKRLVELIEASPLVVGCETACWN
Ga0070697_1004964711 3300005536 Corn, Switchgrass And Miscanthus Rhizosphere MSKRVYLSRDISVIPEQEDTPVSLSIQVSSATVKALHTKVHQAYRKDDRRLVRRTTVLIDMLVHQGLIPVLCARWSLRASCLYAWQRAFLLHGLDRLVSRHSGGRRPQLPPRQKKRLVELSAAGPLVVGGE
Ga0066661_107256431 3300005554 Soil VCIRIQLSRATVKDLHSRLQHAYQHDDVRLARRTTVLIDLLVYQVPVAVLCERWGISPACLYHWQKAFLLRGIDSLVYGHSGGRRPKLSPRQKQRLVEL
Ga0066707_106984811 3300005556 Soil ATVKDLHSRLQHAYQRDDVRLVRRITVLIDLLVHQVPVAVLCERWNLSPSCLYDWQRAFLVRGMESLVYCHGGGRRPKLTPKQKKRLVALIDAGPQVVGFETACWNHIPPNYVVAYLSRSSQAA*
Ga0066704_105320092 3300005557 Soil MSTRVYHPSDIAVTTAQEDTPVSLRIQVSSATIKALHNRLQQAYLKDDVRLVRRTTVWIDLLVHLVPMALLCERWGLSVSCLYDWQRAFRLHGLDSFVYRHSGGRRPKLTPKQKKRLVELSEAGPL
Ga0066700_103100701 3300005559 Soil VRLSIQGSSATVTAVHTKVHQAYRKDERRVGRRTTVLRALLGPQGPSPVLCARWSLRASGLYDWPRAFLLHGLDRLGSRHRGGRRPQWRPRQKKRLGERREAGPRVVGCETACWDAGLIRGLLWRACGGLDNRP*
Ga0066700_107860191 3300005559 Soil MSKRVYVSIAIAVTTAQEDTPVSLRIQVSSTTVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWDLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRLVDLLEAGPLVVGCETACW
Ga0066691_103135041 3300005586 Soil MSTRVYLSKHACQTTHEEDTPVCLRIQLSRATVKDLHSRLQYAYQRNDVRLVRRTTVLIDLLVHHVPVAVLCERWDLSPACLYGWQQAFLLRGLDSVVYSHGGGRRPKLAPKQKRRFVELIDAGPLVV
Ga0066659_118000471 3300006797 Soil MSKRVYVSIAIAVTTAQEDTPVSLRIQVSSTTVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPVAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKPSVDSECEMSFVAQHFH*
Ga0075428_1002213131 3300006844 Populus Rhizosphere MSKGVYVSIDISVPTEQEDTPVSLRIQVSAATVKALHARLQQAYRKDDVRLVRRTTVLIDLLVHHVPMAVLCERWGLSSSCLYGWQRAFLLHGMDSLHYRHSGGRRPKLTPTDSTGFSGKIASH*
Ga0075428_1016581251 3300006844 Populus Rhizosphere MSKRVYVSIPMSVTTAQEDTPMSLRIQSSATTVKALYARLQQAYRKDDVRLVRRTTVLIDLLVHHVPMAGLCERWGLSSSCLYAWQRTFLLHGVDSLNYRQSGGRRPKLTLKQK
Ga0075421_1025502752 3300006845 Populus Rhizosphere MSLRIQVSAATVTALHTRLQQAYCKDDVRLVRRTTVLIDLLVHHVPMAVLCERWDLSSSCLYDWQRAFLLHGMERLHYRHSGGRRPKLTPRQKKRLVELLEA
Ga0075433_115083012 3300006852 Populus Rhizosphere MSTRVYLSKHTCQTTHEEETPVCIRIQLSRATVKELHSRLQHAYQRDDVRLVRRLTVLLDLLVHQVPMEVLSARWGLSPSCLYHWRQAFLLRGMDSLVYRHSGGRRPKLTPRQKKRLVELLE
Ga0075425_1010288321 3300006854 Populus Rhizosphere MSTRVSLSKHICQTSHEEETPVCLRIPRSRPTVKELHRRLQHAYQRDEVRLVRRIPRLLDLLVHQGPVEVLRDRWHLSLSCLYQWRQALLLRGMDSLVYPHSGGRRPTVTPRQKNRLVELVEAGPLVVGCEPAGWTSGLSWVLIWREVGGLYNCPYVCPWLR
Ga0075425_1027552701 3300006854 Populus Rhizosphere VYLSKNTCQTTQEEDTPVCIRIQLSHATVKDLHSRLQHAYQRDDVRLVRRTTVFIDLLVHQVPVAVWCARWGLSPACLSNWQKAFLLRGIDSLVYGHSGGRRPQWSSRQKKRLVELIEAGPLVVG
Ga0075424_1001903401 3300006904 Populus Rhizosphere MSTRVYLSKHTCQITHEEETPVCIRIQLSRATVKELHSRLQHAYQRDDVRLVRRLTVLLDLLVHQVPMEVLSARWGLSPSCLYHWRQAFLLRGMDSLVYRHSGGRRPKLTPRQKKRLVELLE
Ga0075424_1006050052 3300006904 Populus Rhizosphere MSTRVYLSKNPCQTTQEENTPVCIRIQASHATVKALQARLQDAYRRDDVRLVRRISVLLELLTQTASVTALCERWGLSPSCLYDWQKAFMLCGMDSLVYRHSGGRPEK
Ga0074063_139566271 3300006953 Soil KRVYVSLDIVVTTEQEDTPVSLRIQVSSATVKALHARLQQAYLKDDVRVVRRTTVLIDLLVHHVPLAVLCERWGLSASWLYDWQKALLLHGLERLCSRHSGGRRPTLPPKQQRRLGDLLEAGPLVVGCETACWDAVLMRVLIWRELTNIAHADSDETRVDRSQAEASGDVG*
Ga0075419_111979862 3300006969 Populus Rhizosphere MSKGVYVSIDISVPTEQEDTPVSLRIQVSAATVKALHARLQQAYRKDDVRLVRRTTVLIDLLVHHVPMAVLCERWDLSSSCLYDWQRAFLLHGMERWHYRHSGGRR
Ga0066710_1017241591 3300009012 Grasslands Soil MTIRIPLRSTTVKDLHRRLQDAYRKDDVRLVRRPTVLIDLLVHHVPVAVLCERWGLSPSCLYDWQKALLLRGLDSVVSGHGGGRQPKVTPRQKKRLVELIDVDGNQT
Ga0066710_1022612641 3300009012 Grasslands Soil MSTRVYLSKHTCQITPQEDTPVCIRIQPSRTTAKALQSRLQLAYQRDDVRLVRRITVLLDLLVSHVPVAVLCERWSLSPACVYAWQKAFLLRGLDSLAYSHGGGRQPRLPPRQKQRLVELMDAGPLAVGCETACWNSVLIRVLIWRECGVLSNRHDGCTLLHNVGFA
Ga0099829_109775521 3300009038 Vadose Zone Soil MSKRVYVSRDIAVTTEQEDTPVSLRIQVSSATVKALHARLRQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCARWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKCRLVD
Ga0099828_102925171 3300009089 Vadose Zone Soil VCIRIQLSRATVKDLHSRLQHAYRRDDVRLVRRTTVLIDLLVHHVPVAVLRERWGLSPACLYDWQKAFMLRGIDSLVYGHGGGRRPKLSPRQKKRLVELIDAGPLV
Ga0099827_114031321 3300009090 Vadose Zone Soil MSKQVYVSIDIAVTTAQEDTPVSLRIQVSSATVKALHARWQPAYLKDAGRLVRRTMVLIDLLVHHVPMAVLCERWGLRSSCLYDWQRAFLLHGLASLSYRHSGGRRPQLTPRPKKRLVELLE
Ga0099827_120103531 3300009090 Vadose Zone Soil MSNRVYLSIDISVIPEQEDTPVSIRMQVSAATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHQGPIPVLCERWSLSASCLYDWQRALLLHSLDSLVSRHSGGRRPKLTPRQKKRLVE
Ga0075418_130203771 3300009100 Populus Rhizosphere VCIRIQLSRATVKDLYSRLQHAYQRDDVRVVRRTTVLIDLLVHHVPVAGLSERWGLSPACLYNWQKAFLLSGMDSLGYGHGGGRRPKLTPRQKKRLVELLDAGPLVVG
Ga0105247_107911762 3300009101 Switchgrass Rhizosphere MSKRVYVSRDISVTTAQEDTPVSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLGASCLYDWQKAFLLHGLESLCYHHSGGLRPKLTPKQKRRLVDLLEA
Ga0066709_1001724161 3300009137 Grasslands Soil MSKQVYLSKNKVRIMQEEDTPVSIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRITVLIDLLVHQVPVAVLCERWNLSPSCLYDWQRAFLVRGMESVVYCHGGGRRPKLTAKQKKR
Ga0114945_104196521 3300009444 Thermal Springs MSTRVYLSKHTYQTTHEEDTPVCIRIQLSRATVKDLHSRLHHAYQRDDVRLVRRITVLLDLLVHHVPMAVLCERWGLSLACLYDWQKAFLLRGMDSLVYGHGGGRRPKLTPRQKKRLVEL
Ga0114945_107783491 3300009444 Thermal Springs MSTRVYLSKHTCQTPHEEDTPVCIRIQLSRATVKDLHSRLHHAYQRDDVRLVRRITVLLDLLVHHVPMAVLCERWGLSVACLYDWQKAFLLRGMDSLVYGHGGGRRPKLTPRQKKRLVEL
Ga0114944_10357441 3300009691 Thermal Springs VCIRIQLSRATVKDLHSRLHHAYQRDDVRLVRRITVLLDLLVHHVPMAVLCERWGLSVACLYDWQKAFLLRGMDSLVYGHGGGRRPKLTPRQKKRLVEL
Ga0114944_10844343 3300009691 Thermal Springs MRTRVYPSQHICQTTPLEDTPVCIRIQLSRATVKDLHNRLQHAYRHDDVRLVRRTTVLIDLLVHHVSVEALREHWGLSPACIYDWQKAFLLRGIDSLVYAMVAVAARS*
Ga0126374_116106941 3300009792 Tropical Forest Soil MSTRVYLSKHTCQTTPEEETPVCLRIQRSRATVNELHNHLQHAYQRDDVRLVRRITVLLALLVHQGPVAVLRERWSLSPSCLYHWRQAFLLQGRASVGYRHGGGRPEQLPPTQRKRVVALIEAGPLVVGLATACWHAVLIRVLIWREVGVLYNRHYGCTWRHNWGVSFQ
Ga0105080_10371212 3300009797 Groundwater Sand MSKRVYVSIDIAVTTAQEDTPVSLRIQVSSATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRLVELLEAGPLVV
Ga0105067_10952952 3300009812 Groundwater Sand VYLFKNDFGITHQEATPVRIRIQRSRATVKDLHSRLQQAYQKDDVRLVRRTTVVIDRRVHHAPVAVLCERWGLSPACLYAWQQAFLLHGLDSVVSHHGGGRQPKLTPRQQQRLVELME
Ga0105062_10431311 3300009817 Groundwater Sand VCVRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLLDLLVHHVPVEVLSERWGLSTSCLYQWRHAFVLRGMDSLVYHHSGGRRPKLTPRQKKRLVELLEAGPQVVG
Ga0105072_11287871 3300009818 Groundwater Sand MSIRIQRSHATVKALHRHLQQAYRGDDVRLVRRTTVLIDLLVHHVPLVVLCERWGLSPACLYDWQKAFVLRGLDSLVSHHGGGRQPKLPPKQKKR
Ga0105087_11229111 3300009819 Groundwater Sand MSTRVYLSKHTCQTTHEEDTPVCVRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLLDLLVHHVPVEVLSERWGLSTSCLYQWRQAFVLRGMDSLVYHHSGGRRPKLTPRQKKRLVELLEAGPQ
Ga0126384_117527701 3300010046 Tropical Forest Soil MSTRVYLSKHTNQTTQQEDTPVCIRIQLSRATVKDLYSRLQHAYQRDDVRLVRRLTVVIDLLVHQVPMAVLCERWGLSPSCLYDWQKAFLMRGMDSLRYRHSG
Ga0126384_117749031 3300010046 Tropical Forest Soil MSKRVYVSIDILVPTEQEDTPVRLRIQVSAATVKALHARLHQAYLKDDVRLVRRTTVLIDLLVHHVPMAVLCERWGLSSSCLYGWQRAFLLHGMDSLYYRHSGGRRPKLTPRQKKRLV
Ga0126384_118955491 3300010046 Tropical Forest Soil MSKRVYVSIDMAVTMAEEDTPVSLRIQVSAATVKALHTKLQQAYRKDDVRLVRRTTVLIDLLVHHVPMAVLCERWGLSASCLDDWQRAFLLHGLESVRYRHGGGRRPKLTH
Ga0126384_123612112 3300010046 Tropical Forest Soil MSTRVYLSKHTCQTTHEEDPPVCLRIQLSRATVKELPSRLQHAYQRDDVRLVRRITVLLDLLVHQVPVEVLSDRWGLSPSWFYHWRQAFLLRGMDSLVYRQGGGRPEKLTATQRKRLVALIEAGPLVVGLETACW
Ga0126382_112814232 3300010047 Tropical Forest Soil MSTRVYLSQHTYQTTHEEDPPVGLRIQRSRATVQELHGRLQHAYQRDEVRVVRRPTVWLDLLGHRVPVAVLSARWHRSPSWLYQWRQALLRRGMDSLVSHPRGGRRPQWTPRQKKRWVELLEAGPQVVGGATACGTAV
Ga0126382_122602491 3300010047 Tropical Forest Soil MSTRVYLSKHTDQTSHEEETFVCFRIQLSRATVKDLHTRLQHAYQRDDVRLVRRITVLLDLLVHHVPIAVLCERWGLSLACLYDWQKAFLLHGMESCVSHHSGGRRPKLTPRQKKRLVELIDAGSQVVGFETACWNS
Ga0126373_131830971 3300010048 Tropical Forest Soil VYLSKHTYQTTHQEDTPVCIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRITVLLELLVHQVPVEVLSERWGLSSACLYHWRQAFLLRGMDSLVYHHSGGRRPKLTARQKKRLVELL
Ga0127492_10858412 3300010087 Grasslands Soil MSTRVYLSKHTCQTTQQEDTPVCIRIQLSRATVNDLHKRLQHAYQRDDVRLVRRLTVLLDLLVQQVPMAVLCERWGLSLSCLYNWQKALLLRGMDSLVSRHGGGRRP
Ga0134088_105799301 3300010304 Grasslands Soil MQEEDTPVSIRIQLSRATVKDLHSRLQHTYQRDDVRLVRRITVLIDLLVHQVPVAVLCERWNLSPSCLYDWQRAFLVRGMESLVYCHGGGRRPKLTPKQKKRLVELIDAGPQVVGFETAC
Ga0126377_131077301 3300010362 Tropical Forest Soil MSTRVYVSRDITVTTAQEDTPVSIRIQLSAATVKALHAKLHQAYLRDDVRLVRRTTVLIDLLVHHVAMAVLCERWGLSVSCLYDWQRAFLLHGLESVRYRHSGSRRPKLTPKQKQRLVELIEAGPLVV
Ga0126381_1034587111 3300010376 Tropical Forest Soil MSKRVYVSIDISVTTAQEETPVSLRIQVSAATVKALHARLQQAHITDDVRLVRRTTVLLDLLVHHMPMATLCERWGLSPSCLYNWQRAFLLHGMESLHYRHSGGR
Ga0126383_111532091 3300010398 Tropical Forest Soil MSKRAYLAQHNSRTTQSKDTPVSIRIQLSHATVKGLHSRLQHAYERDDVRLVRRITVLSDLLVHHVPITVLCECWGLSPACLYAWQKAFLLRGMDSFVYQHSGGRRPKLTPSQKKRLVE
Ga0126383_135345821 3300010398 Tropical Forest Soil MSTRVYLSQHTDQTTHEEETPVCLRIQLSRATVKDLHCRLQHAYQRDDVRLVRRITVLLDLLVHHVPMAVLCECWGLSLACLYDWQKAFLLRGMESLVYHHSGGRRPKLTPRQKKRLVELIDAGPQVVGFE
Ga0137391_108078081 3300011270 Vadose Zone Soil MSKRVYVSRDIAVTTAQEDTPVSLRIQVSSATVKALHARWQQAYLKDDGRLVRRTTVLIDLLVHHVPLAVLCERWGLSASWLYDGQKAFLLHGLASLCSRHSGGRRPKLTPKQKCRLVDLLEAGPLVVGGETACWDAVLMRVLIWREVGVLSN
Ga0137391_115610072 3300011270 Vadose Zone Soil MSTGVYLSRDISVLPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQKAFLRHGLESLCSRHSGGRRPKLTPKQKCRLVDLLEAGPLVVG
Ga0137432_12972881 3300011439 Soil MSTRVYHPSDIAVTTVQEDTPVSIRIQVSSATVKALHNRLQQAYLKDDVRLVRRTTVLIDLLVHLVPMALLCERWGLSASCLYDWQRAFLLHGLNSLVYRHGGGRRPKLTPKQKK
Ga0137388_104274962 3300012189 Vadose Zone Soil VCIRIQRSRATVKDLHRRLQHAYQCDDVRLVRWTTVLIDLLVHHVPVAVLSERWGLSPACLYGWQQALLLRGMDSVVSRHGGGRRPQLTPRPHSTEFSGG
Ga0137388_120095601 3300012189 Vadose Zone Soil MSQRVYLSRDISVIPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRAFLLHGLDSLVSRHSGGRRPRLTPRQKKRLGELLA
Ga0137374_100633311 3300012204 Vadose Zone Soil MAVTTAQEDTPVSIRIQLSSATVKALHARLRQAYLKDDVRLVRRTTVLIDLLVHHVPMAILCERWGLSSSCLYDWQSAFLLHGLESLHYRHSGGRRPKLTPKQKQRLV
Ga0137362_117536212 3300012205 Vadose Zone Soil DTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPQLTPRQKKRLVGDVW*
Ga0137380_116645472 3300012206 Vadose Zone Soil MPIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLIDLLVHHVPVEVLCTQWGLSPSCISHWRHAFLLHGVDSLVYRHSGGRRPKLTPKQKHRLG
Ga0137381_109641312 3300012207 Vadose Zone Soil MSKQVYLSKNKVRIMQEEDTPVSIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLIDLLVHHVPVAVLCERWGLSPSCLYDWQRAFLVRGMESLVYCHGGGRRPKLTPKQKKRLVALIDAGPQVVGFETACWNHIPPNYVVAYLSRSSQAA*
Ga0137379_102278321 3300012209 Vadose Zone Soil MSKRVYVSIDIAVTTEQEDTPVSLRIQVSSATVKALHARLRQAYLKDDVRLVRRTTVLLDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRR
Ga0137379_115262562 3300012209 Vadose Zone Soil MSKRVYVSIDIAVTTAQEDTPVSLRIQVSSATVKALHARLQQAYLKDEVRLVRRTTVLLDLLVHHVPLAVLCERWGLSASGLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRLVDLLEAGPL
Ga0137379_116652231 3300012209 Vadose Zone Soil VCIRIQLSRATVKELHSRLQHAYQRDDVRLVRRTTVLLDLLVHQVPVEVLRVRWGLSTSCLYHWRQAFLLRGMDSLAYHHSGGRRPKLTPRQHKRLVELLEAG
Ga0137377_109317091 3300012211 Vadose Zone Soil MTIRIPLRSTTVKDLHRRLQDAYRKDDVRLVRHPTVLIDLLVHHVPVAVLCERWGLSPSCLYDWQKALLLRGLDSVVSGHGGGRQPKVTPRQKKRLVELIDVDGNQT*
Ga0137386_111266851 3300012351 Vadose Zone Soil MSKRVYLSRDIAVTPEQEDTPVSISIPVSSATVKAWHTRLPQAYLKDDVRLVRRMTVVIDLLVHHVPVEVLHERWGLSISCIYQWRQDFLLRGMDSLVYHHNGGRRPKLASRQKKRLVELLEAGPQ
Ga0137385_101704631 3300012359 Vadose Zone Soil MSKQVYLSKNNVRIMQEEETPVSIRIQRSRATVKDLHSRHQHAYQRADVRLVRRSIVLIDLLGHQVPVAVLCERWHLSPSCLYDWQRAFLVRGMESVVYCHGGGRRPKLTPKQKKRLVALIDAGLQVV
Ga0137360_102106841 3300012361 Vadose Zone Soil MSNRVYLSIDISVIPEQEDTPVSIRMQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHQGPIPVRCERWSLRASCLYDWQRAFLRHGLDRLVSRHRGGRRPQLRPRQKQRLVELIAAGQLVVRGETACLDAVRLRVLLWRECGGLSNRQDVCPLHHNLGV
Ga0137361_101172343 3300012362 Vadose Zone Soil MSKRVYLSRDIAVLPEQEDTPVSLRIQVSSATVKGLHTKLHQAYLKDDMRLVRRTTVLIDMLVHQGPIPVRCERWSLRASCLYDWQRAFLRHGLDRLVSRHRGGRRPQLSPRQKK
Ga0134033_10222882 3300012383 Grasslands Soil MQEEDTPVSIRIQLSRATVKDLHSRLQHAYQRDDVRLVRRITVLIDLLVHQVPVAVLCERWGLSLSCLYAWQQALLLRGLDSVVYGHSGGRPPKVTPQHKKRFVELSEAGP
Ga0137395_101841611 3300012917 Vadose Zone Soil MSKRVYLSRDISVLPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPQLTPRQKKRLVGDVW*
Ga0137394_103322561 3300012922 Vadose Zone Soil MSKRVYLSRDISVLPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPKLTPRQKKRLVGDVW*
Ga0137394_113216642 3300012922 Vadose Zone Soil MSTRVYVSRDIAVTTAQEDTPVSIRIQLSSATVKALHAKLPQAYLKDDGRLVRRTTVLIDLLVHHVAMAVLCERWGLSVSCLYDWQRACLRHGLESWHSRHSGGRRPQLTPQQKPRLVELSEAGPLVVGGETACW
Ga0137359_103325782 3300012923 Vadose Zone Soil MSNRVYLSRDISVLPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPQLTPRQKKRLVGDVW*
Ga0137416_110606381 3300012927 Vadose Zone Soil MSNRVYLSIDISVIPEQEDTPVSLRMQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHQGPIPVLCERWSLSASCLYDWQRALLLHGLDSLVSRHRGGRRPQLSPRQKKRLVELLEAGPLAVGCETAGWDAVIIRVLI
Ga0137407_106134152 3300012930 Vadose Zone Soil MRTTRLEETPVSIRIQRSHATVKDLYSRLQYAYQRDDVRLVRRTTVLLDLLVHHVPVEVLGVRWGLSISCIYQWRHAFLLQGMASLTYHHGGGRRPKLTPRQKQCLVELMEAGPLVVGCETA
Ga0137407_110988202 3300012930 Vadose Zone Soil MSNRVYLSIDISVIPEQEDTPVSIRMQVSSATVKALHTKLQQAYRKDDRRLVRRPTVLIDMLVHQGPIPVLCERWSLRASCLYDWQRTFLLHGLDSLVSRHSGGRRPKLTPRQKKRLVGDVW*
Ga0137407_119885182 3300012930 Vadose Zone Soil MNIRGYLSQNDFSLTPQEDTPMQIRIQRSRTTVKDLYSRLQQAYQKGEVRLVRRTTVLIDLLVHHVPVAVSCERWGLSPACPYAWQKALLRHGLDSLVSHHGGGRRPKLTPSRRSVWWS*
Ga0137410_100083927 3300012944 Vadose Zone Soil MSKRVYLSRDISVLPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPKLTPRQKKRLVELLEAGPLAVGCETAGWDA*
Ga0126375_104921471 3300012948 Tropical Forest Soil MSTRVYLSKHTYQTTHQEETPVCLRIQLSRATVKELPSRLQPAYQHDDVRWVRRITVLLDLLVYQVPVEVLSARWGLSPSGLSHWRRAFRLRGMESLGYRHGGGRPEKLPPTQRKRWVALLEAGPLVVGVGRPVGTRS*
Ga0126375_111940511 3300012948 Tropical Forest Soil VYVSIDIASTTAQEETPVSLSIPVRAATVKALHAQLPQAYRQAAGRLVRRTTVLLDLLVHHVPMAVLGERWGLSSSCLYDWERAFLLHGLESLRYRHGG
Ga0126369_113568662 3300012971 Tropical Forest Soil MSTRVYLSQHTDQTTHKEETPVCRRIQRSRATIKDLHTRLQHAYQRDEVRLVRRITVLLDLLVHQVPMAGLCERWGLSPACLYAWQQAFLLGGMESVVYHHSGGRRPKLTPRQKKRLVELIDAGPQVVGCETACWNS
Ga0126369_128839671 3300012971 Tropical Forest Soil MNWKFRVVSISEQVYLTIDLSVTTVQEDTPVSIRIQSSSATVQALHTRLQQAYLKDDVRLVRRITVLIDLLVHHVPMAVLSARWGLSASCLYDWQKAFVLHGLDSFVSRHSGGRRPKLTPKQKKR
Ga0134077_100707872 3300012972 Grasslands Soil MPIRIQLSSATVKALHCRLQHAYRKDEVRLVRRITVLIDFLVHHVPVSVLCERWRLSPSCLYDWQKAFMLRGMDSLSYRYSGGRPEKLTPSQKKRLVELM
Ga0164307_114136861 3300012987 Soil MSKRVYVSRDISVTTAQEDTPVSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLGASCLYDWQKAFLLHGLESLCYHHSGGRRPKLTPKQKR
Ga0137409_100577245 3300015245 Vadose Zone Soil MSNRVYLSIDIAVIPEQEDTPVSLRIQVSAATVKALHTKLQQAYLKDDRRLVRRTTVLIDMLVHQGPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVSRHSGGRRPKLTPRQKKRLVELLEAGPLAVGCETAGWDA*
Ga0137403_111111501 3300015264 Vadose Zone Soil VSKRVYLSRDISVIPEQEDTPVSIRIQVSSATVKALHTKLQQAYLKDDMRLVRRTTVLIDMLVHHVPIPVLCERWSLSASCLYDWQRTFLLHGLDSLVYRHKCANS*
Ga0134072_104647541 3300015357 Grasslands Soil MSKRVYLSKNPYQTIQEENTPVSLRIQASNATVKALQIRLQDAYRRDDVRLVRRISVLLALLTQTASVPVLCERWGLSPACLYAWQKAFVLRGMVSLLYRHSGGRPEKVTPRQKKRLVE
Ga0132258_110021873 3300015371 Arabidopsis Rhizosphere MSTRVYLSQHTDQTTHEEDTPVCLRIQLSRATVKDLHTRLRHAYQRDEVRLVRRITVLLDLLVHHVPMAVLCERWGLSLACLYDWQKAFLLRGMESCVYHHSGGRRPKLTPRQKK
Ga0184634_101953532 3300018031 Groundwater Sediment MRERGYLSQNDFGITQQEDTPVGMRIQVSNATVKALQNRLQQAYRKDDVRLVRRTTVLIDLLVHHVPVVVLSERWGLSPACLYDWQKAFMLRGMDSLLSRHGGGRPE
Ga0184612_100358495 3300018078 Groundwater Sediment MSTRVYLSKHTCQTTHEEDTPVCVRIQLSRATVKDLHSRLQHAYQRDDVRLVRRTTVLLDLLVHHVPVEVLSERWGLSTSCLYQWRQAFVLRGMDSLVYHHSGGRRPKLTPRQKKRLVELLE
Ga0184639_104000292 3300018082 Groundwater Sediment MSTRVYLSKHTCQTTHKEDTPVCIRIQLSRATVKDLHRRLQHAYQRDDVRLVRRTTVLLDLLVHQVPVEVLSERWGLSTSCLYQWRQAFLLRGMDSLVYHHSGGRRPKLLPR
Ga0066662_121627881 3300018468 Grasslands Soil MSTRVYLSKHTCQITPQEDTPVCIRIQPSRTTAKALQSRLQLAYQRDDVRLVRRITVLLDLLVSHVPVAVLCERWSLSPACVYAWQKAFLLRGLDSLAYSHGGGRQPRLPPRQKQRLVELMDAGPLAVGCE
Ga0066662_121971391 3300018468 Grasslands Soil MSKRVYLSKASSGTTRYEDTPMRIRIQVSSATVKALQTRLQQAYQHDDVRLVRRTTVLIDRLVYHVPMERLCERWGLSPACLYGWRQAFLLRGMDSLVYRHSGGR
Ga0066669_107353462 3300018482 Grasslands Soil MSKQVYLSKNNVRITQEEGTPVSIRIQLSRATIKDLHSRLQHAYRRNDVRLVRRTTVLIDLLVHHVPVAVLCERWDLSPACIDGGHQAFLLLGLDSWGSSHGGGRRPKLAPKQRRRFVELIAAGPLVVG
Ga0184643_13903121 3300019255 Groundwater Sediment MSKRVYVSIDIAVTTEQEDTPVSLRIQVSAATVKALHTRLQQAYLKDDVRVVRRTTVLIDLLVHHVPMAVLCERWGLSSSCLYDWQRAFLLHGLESVSYRHSGGRRPKLTPRQKKRLVELLE
Ga0184646_12803531 3300019259 Groundwater Sediment MSIKIQVSSATVKALHTRLPQAYHHDDVRLVRRATVLVDLLVQHVPVEVLCEQWGLSPACIYQWRQAFLLRGMDSLVYCHGGGRHPKLTAKQKKRLVELMEAG
Ga0179596_107263401 3300021086 Vadose Zone Soil VSLRIQVSSATVKALHARWQQAYLKDDGRLVRRTTVLIDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRLMDLLEAGPLVVGCETACWDAVLIRVLIW
Ga0212128_101426352 3300022563 Thermal Springs MRTRVYPSQHTCQTTPLEDTPVCIRIQLSRATVKDLHNRLQHAYRHDDVRLVRRTTVLIDLLVHHVSVEALREHWGLSPACIYDWQKAFLLRGIDSLVYAMVAVAARS
Ga0212128_101982801 3300022563 Thermal Springs MSTRVYLSKHTCQTPHEEDTPVCIRIQLSRATVKDLHSRLHHAYQRDDVRLVRRITVLLDLLVHHVPMAVLCERWGLSVACLYDWQKAFLLRGMDSLVYGHGGGRRPKLTPR
Ga0212128_104594151 3300022563 Thermal Springs MSNRVYLSKRSYPTSPQADTPVSIRIQLSRATVKDLHRRLQYAYQHDDVRLVRRTTVLIDLFVHHMSVEVLSERWGLSASCMYGWRQDLLLHGVDSLVYHHGGGRQPKLTPKQRRRLVEL
Ga0207686_105971531 3300025934 Miscanthus Rhizosphere MSKRVYVSRDISVTTAQEDTPVSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLGASCLYDWQKAFLLHGLESLCYHHSGGRRPKLTPKQKRRLVDLLEAGPLVVGCE
Ga0209897_10276301 3300027169 Groundwater Sand VSTRVYLFKNDFGITHQEATPVRIRIQRSRATVKDLHSRLQQAYQKDDVRLVRRTTVVIDRRVHHAPVAVLCERWGLSPACLYAWQQAFLLHGLDSVVSHHGGDDSPS
Ga0209843_10332181 3300027511 Groundwater Sand MSKRVYLSIDISVTPEQEDTPVRISMQVSSATVKALHTRLQQAYLKDDGRLVRRMTVVIDLLVHHVPVEVLHARWALSISCIYQWRQAFLLRGMDSLVSHQSGGRRPKLTP
Ga0209590_108071901 3300027882 Vadose Zone Soil MSTRVYLSKNPYQTTQEENTPVCIRIQASHATVKALQARLQDAYRRDDVRLVRRISVLLELLTQTASVTVLCERWGLSPACLYAWQKAFMLRGMDSLIYRHSGGRPEKLTPRQKKQIPPDLVVKSQHIDIIGVNLRLI
Ga0247828_110025721 3300028587 Soil MSKQVYVSIDMAVIMAQEDTPVSLRIQVSAATVTALHAKLQQAYLKDDVRLVRRTTVLIDLLVHHVPMAVLCERWGLSASCLYDWQRAFLLHGLESLRYCHGGGRRPKLTPKQKQRLVELLEAGPL
Ga0247659_12167701 3300030600 Soil MSKRVYVSIDITVTMVEEDTPVSLRIQVSAATVKALHAKLQQAYRKDDVRLVRRTTVLLDLLVHHGPRAGLCERWGLRASCLYDWQRAFLLHGLERLRYRHGGGRRPQLPPPQKQRLVE
Ga0308205_10658911 3300030830 Soil MSKRVYLSKNDFGITREKDTPVSIRIQLSRATVKDLYSRLQYAYQRDDVRLVRRTTVLIDLLVHRVPVAVLGERWGLSPACLYAWQQAFLLRGLDSLLSHPGGGR
Ga0308190_10331802 3300030993 Soil MSKRVYVSRDIAVTTEQEDTPVSLRIQVSSATVKALHARLRQAYLKDDVRLVRRTTVLIDLLVHHVPLAVLCERWGLSASCLYDWQKAFLLHGLESLCYRHSGGRRPKLTPKQKRRVGDLLEAGPLIVGGETACWDAVLMRVLIWRECGVLSNR
Ga0308189_102120411 3300031058 Soil VYLSKNDFGITREKDTPVSIRIQRSRATVKDLYSRLQYAYQRDDVRLVRRTTVLIDLLVHRVPVAVWGERWGLSPACLYAWQQALLLRGLDSLLSHPGGGRRPKWTPRQKKRLVELVEAGPQVVGCETACGDSVLIRVLIWREFGVL
Ga0308199_11327492 3300031094 Soil MSQWVYLSSHISVTTAQEDTPVNIRIQVSSATVKALHNRLQQAYFKDDVRLVRRTTVLIDLLVHLVPVAVLCERWGLSASCLYDWQRALLWHSLDSLVSRHGGGRRPQLPPKQKKRLVELIEAGPL
Ga0308187_104922891 3300031114 Soil MHIRIQLSRSTVKDLHSRLQHAYQRDDVRLVRRTTVLIDLLVHHVPVEELSAQWGLSVSCMYGWRQAFLLHGMESLVYHHGGGRQPKLTPKQRKRLVELI
Ga0308194_101806662 3300031421 Soil MSKGVYVSRDITVTTAQEDTPVSLRIQVSAATVKAFHARLRQAYLKDDGRLVRRTTVVIDLLVHHVPLAVLCERWGLSASCLYDWQKACLLHGLESLCSRHSGGRRPKLTPKQKRRVGDLLEAGPLIVGCETACWDAVLMRVLI
Ga0308194_103642021 3300031421 Soil MSIKIQVSSATVKALHTRLQQAYQHDDVRLVRRATVLVDLLVHHVPVEVLCEQWGLSPACIYQWRQAFLLRGMDSLVYCHGGGRHPKLTAKQKKRLVELME
Ga0318542_101998281 3300031668 Soil VSLRIQVSAATVKALHARLQQAYLKDDVRLVRRTTVLMDLLVHHVPMAVLCERWGLRSSCLYDWQRAFLLHGMESVPYRHSGGRRPKLTPRQKKRLVE
Ga0318561_108534981 3300031679 Soil MHIRIRLSSATVKALHAKLQQAYRKDDVRLVRRTTVLIDLLVHHVPMAVLCERWGLRASCLYDWQKAFLLHGLESLRYRHSGGRRPQFTPKQK
Ga0307469_123668251 3300031720 Hardwood Forest Soil MSKRVYLSRDISVIPEQEDTPVSLSIQVSSATVKALHTKVHQAYRKDDRRLVRRTTVLIDMLVHQGPIPVLCARWSLRASCLYAWQRAFLLHGLDRLVSRHSGGRRPQLPPRQKKRLVELSEAGPLVVGCETACW
Ga0310891_103495382 3300031913 Soil MSTRVYLSKHTCQTTHEEDTPVCVRIQLSRTTVKDLHSRLQHAYQRDDVRLVRRITVLLDLLVHRVPMAVLCERWGLSPSCIYAWQQAFLLRGMDSLVYGHSGGRPPTLTPRHKKRLV
Ga0310884_109365242 3300031944 Soil MSTRVYVSRDIAVTTAQEDTPVSIRIQLSAATVKALHAKLHQAYLRDDVRLVRRTTVLIDLLGHHVAMAVLCERWGLSVSCLYDWQRAFLLHGLESVRYRHSGGRRPKLTPPSRSNAWLS
Ga0310906_101663641 3300032013 Soil MSTRVYVSRDIAVTTAQEDTPVSIRIQLSAATVKAVHAKLHQAYLRDDVRLVRRTTVLIDLLGHHVAMAVLCERWGLSVSCLYDWQRAFLLHGLESVRYRHSGGRRPKLTPPSRSNAWLS
Ga0318507_102220121 3300032025 Soil MSTRVYLSKYPYPTTPEEDTPMCIRIQLSHATVKALQSRLQHAYQRDDVRLVRRTTVLLDLLVHHVPVEVLCAQWGLSPACLSGWRQAFLLRGMDSLVYRHGGGRRPKLTA
Ga0318504_100895311 3300032063 Soil MSTQVYLSKHTDQTAQQEDTPVCIRIQLSRATVKDMHSRLQHAYQRDDVRLVRRITVLLDRLVHQVPMVVLCERWGLSLACLYDWQKAFLRRGMASLVYGHGGGRRPKLTPRQKQRLVELIEAG
Ga0314798_065201_303_710 3300034673 Soil MSTRVYLSKHTCQTTHEEETPVYLRIQLGRATVKDLHSRLQHAYQRDDVRLVRRITVLLDLLVHQVPMVALCERWGLSLAWLYNWQKAFLLRGMDSLVYGHGGGRRPKLTPRQKKRLVELIEPGPLVVGFETACWN
Ga0370541_048690_2_286 3300034680 Soil MSIKIQVSSATVKALHTRLQQAYQHDDVRLVRRATVLVDLLVHHVPVEVLCEQWGLSPACIYQWRQAFLLRGMDSLVYCHGGGRHPKLTAKQKKR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice