NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F046227

Metagenome / Metatranscriptome Family F046227

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F046227
Family Type Metagenome / Metatranscriptome
Number of Sequences 151
Average Sequence Length 198 residues
Representative Sequence PHDIPSADIVASLPQLREEFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL
Number of Associated Samples 125
Number of Associated Scaffolds 151

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 7.28 %
% of genes near scaffold ends (potentially truncated) 93.38 %
% of genes from short scaffolds (< 2000 bps) 92.72 %
Associated GOLD sequencing projects 120
AlphaFold2 3D model prediction Yes
3D model pTM-score0.28

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (85.430 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(19.205 % of family members)
Environment Ontology (ENVO) Unclassified
(36.424 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(31.126 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 46.41%    β-sheet: 10.55%    Coil/Unstructured: 43.04%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.28
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 151 Family Scaffolds
PF06406StbA 5.96
PF01326PPDK_N 0.66
PF01966HD 0.66
PF01695IstB_IS21 0.66
PF10412TrwB_AAD_bind 0.66
PF12696TraG-D_C 0.66

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 151 Family Scaffolds
COG0574Phosphoenolpyruvate synthase/pyruvate phosphate dikinaseCarbohydrate transport and metabolism [G] 0.66
COG1484DNA replication protein DnaCReplication, recombination and repair [L] 0.66


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A85.43 %
All OrganismsrootAll Organisms14.57 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2032320005|FACEOR_FY84VJD01DH5T9Not Available502Open in IMG/M
2088090014|GPIPI_16903164Not Available1385Open in IMG/M
3300000789|JGI1027J11758_12924165Not Available644Open in IMG/M
3300000793|AF_2010_repII_A001DRAFT_10124504Not Available538Open in IMG/M
3300000955|JGI1027J12803_100479924Not Available1008Open in IMG/M
3300002243|C687J29039_10225907Not Available655Open in IMG/M
3300003319|soilL2_10129140All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium4638Open in IMG/M
3300003324|soilH2_10396649All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1821Open in IMG/M
3300003996|Ga0055467_10078497Not Available906Open in IMG/M
3300004267|Ga0066396_10089696Not Available554Open in IMG/M
3300005332|Ga0066388_100275969All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2323Open in IMG/M
3300005332|Ga0066388_103346475Not Available819Open in IMG/M
3300005332|Ga0066388_104126393Not Available741Open in IMG/M
3300005445|Ga0070708_100947137Not Available808Open in IMG/M
3300005447|Ga0066689_10448041Not Available810Open in IMG/M
3300005451|Ga0066681_10602690Not Available676Open in IMG/M
3300005459|Ga0068867_102367752Not Available505Open in IMG/M
3300005536|Ga0070697_101280234Not Available654Open in IMG/M
3300005553|Ga0066695_10303159Not Available1007Open in IMG/M
3300005713|Ga0066905_101369515Not Available639Open in IMG/M
3300005764|Ga0066903_107065772Not Available582Open in IMG/M
3300005843|Ga0068860_101779260Not Available638Open in IMG/M
3300005983|Ga0081540_1189735Not Available759Open in IMG/M
3300006049|Ga0075417_10093529Not Available1356Open in IMG/M
3300006847|Ga0075431_100891587Not Available859Open in IMG/M
3300006847|Ga0075431_101146222Not Available741Open in IMG/M
3300006854|Ga0075425_101227284Not Available852Open in IMG/M
3300006865|Ga0073934_10131579Not Available1823Open in IMG/M
3300006865|Ga0073934_10414669Not Available824Open in IMG/M
3300006903|Ga0075426_11530197Not Available507Open in IMG/M
3300007076|Ga0075435_100979920Not Available738Open in IMG/M
3300009012|Ga0066710_100361212All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2151Open in IMG/M
3300009012|Ga0066710_102993086Not Available658Open in IMG/M
3300009078|Ga0105106_10609348Not Available782Open in IMG/M
3300009078|Ga0105106_10615059Not Available778Open in IMG/M
3300009137|Ga0066709_101373773Not Available1030Open in IMG/M
3300009137|Ga0066709_101556649Not Available950Open in IMG/M
3300009137|Ga0066709_102489364Not Available699Open in IMG/M
3300009147|Ga0114129_10172078All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2952Open in IMG/M
3300009162|Ga0075423_11289851Not Available780Open in IMG/M
3300009610|Ga0105340_1223950Not Available799Open in IMG/M
3300009807|Ga0105061_1057801Not Available606Open in IMG/M
3300009811|Ga0105084_1068926Not Available641Open in IMG/M
3300009816|Ga0105076_1100898Not Available560Open in IMG/M
3300009818|Ga0105072_1067780Not Available692Open in IMG/M
3300009836|Ga0105068_1117491Not Available530Open in IMG/M
3300010047|Ga0126382_11669691Not Available594Open in IMG/M
3300010048|Ga0126373_12311643Not Available598Open in IMG/M
3300010304|Ga0134088_10641800Not Available530Open in IMG/M
3300010359|Ga0126376_10154276All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1844Open in IMG/M
3300010360|Ga0126372_13022488Not Available522Open in IMG/M
3300010361|Ga0126378_11871268Not Available683Open in IMG/M
3300010361|Ga0126378_12943405Not Available543Open in IMG/M
3300010398|Ga0126383_10862920Not Available990Open in IMG/M
3300010398|Ga0126383_12484227Not Available603Open in IMG/M
3300010399|Ga0134127_11191861Not Available828Open in IMG/M
3300011271|Ga0137393_10352306All Organisms → cellular organisms → Bacteria1258Open in IMG/M
3300011271|Ga0137393_10953800Not Available731Open in IMG/M
3300012034|Ga0137453_1049734Not Available758Open in IMG/M
3300012160|Ga0137349_1020025Not Available1050Open in IMG/M
3300012200|Ga0137382_10870130Not Available650Open in IMG/M
3300012201|Ga0137365_11307499Not Available514Open in IMG/M
3300012206|Ga0137380_10897673Not Available761Open in IMG/M
3300012206|Ga0137380_11137522Not Available664Open in IMG/M
3300012207|Ga0137381_10654351Not Available914Open in IMG/M
3300012207|Ga0137381_10771703Not Available834Open in IMG/M
3300012209|Ga0137379_11221046Not Available659Open in IMG/M
3300012285|Ga0137370_10953849Not Available529Open in IMG/M
3300012349|Ga0137387_10405788Not Available988Open in IMG/M
3300012350|Ga0137372_10694408Not Available737Open in IMG/M
3300012350|Ga0137372_10959927Not Available600Open in IMG/M
3300012351|Ga0137386_10687911Not Available735Open in IMG/M
3300012353|Ga0137367_10319155Not Available1109Open in IMG/M
3300012354|Ga0137366_11120860Not Available540Open in IMG/M
3300012355|Ga0137369_10301346Not Available1189Open in IMG/M
3300012357|Ga0137384_11019886Not Available665Open in IMG/M
3300012358|Ga0137368_10315049Not Available1052Open in IMG/M
3300012358|Ga0137368_10355953Not Available972Open in IMG/M
3300012359|Ga0137385_10665108Not Available871Open in IMG/M
3300012360|Ga0137375_10394456Not Available1212Open in IMG/M
3300012360|Ga0137375_10409044All Organisms → cellular organisms → Bacteria1183Open in IMG/M
3300012361|Ga0137360_10423193Not Available1125Open in IMG/M
3300012532|Ga0137373_10927056Not Available637Open in IMG/M
3300012532|Ga0137373_11014763Not Available600Open in IMG/M
3300012582|Ga0137358_10812147Not Available620Open in IMG/M
3300012929|Ga0137404_11410603Not Available643Open in IMG/M
3300012931|Ga0153915_11489886Not Available791Open in IMG/M
3300012948|Ga0126375_11331469Not Available605Open in IMG/M
3300012948|Ga0126375_12078079Not Available504Open in IMG/M
3300014267|Ga0075313_1072529Not Available826Open in IMG/M
3300014879|Ga0180062_1101931Not Available655Open in IMG/M
3300015372|Ga0132256_103064648Not Available562Open in IMG/M
3300018031|Ga0184634_10205817Not Available897Open in IMG/M
3300018054|Ga0184621_10149163Not Available842Open in IMG/M
3300018061|Ga0184619_10187200Not Available950Open in IMG/M
3300018068|Ga0184636_1281341Not Available587Open in IMG/M
3300018071|Ga0184618_10249441Not Available751Open in IMG/M
3300018072|Ga0184635_10341332Not Available577Open in IMG/M
3300018073|Ga0184624_10006158All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3942Open in IMG/M
3300018075|Ga0184632_10378087Not Available600Open in IMG/M
3300018075|Ga0184632_10489792Not Available505Open in IMG/M
3300018076|Ga0184609_10051480All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1759Open in IMG/M
3300018081|Ga0184625_10017377All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3399Open in IMG/M
3300018081|Ga0184625_10359263Not Available756Open in IMG/M
3300018422|Ga0190265_10167380All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2172Open in IMG/M
3300018429|Ga0190272_12302618Not Available581Open in IMG/M
3300018465|Ga0190269_11298781Not Available581Open in IMG/M
3300018469|Ga0190270_10626415Not Available1053Open in IMG/M
3300020004|Ga0193755_1150647Not Available706Open in IMG/M
3300020021|Ga0193726_1334542Not Available566Open in IMG/M
3300021081|Ga0210379_10481350Not Available551Open in IMG/M
3300021332|Ga0210339_1123235Not Available569Open in IMG/M
3300021560|Ga0126371_10804642Not Available1086Open in IMG/M
3300021560|Ga0126371_12565308Not Available617Open in IMG/M
3300024241|Ga0233392_1036525Not Available539Open in IMG/M
3300025146|Ga0209322_10125091Not Available1160Open in IMG/M
3300025155|Ga0209320_10283237Not Available695Open in IMG/M
3300025160|Ga0209109_10178864Not Available1055Open in IMG/M
3300025160|Ga0209109_10400063Not Available640Open in IMG/M
3300025164|Ga0209521_10112639Not Available1729Open in IMG/M
3300025165|Ga0209108_10391969Not Available682Open in IMG/M
3300025167|Ga0209642_10104335Not Available1640Open in IMG/M
3300025174|Ga0209324_10747173Not Available551Open in IMG/M
3300025310|Ga0209172_10109301Not Available1571Open in IMG/M
3300025318|Ga0209519_10029201All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3012Open in IMG/M
3300025318|Ga0209519_10606418Not Available602Open in IMG/M
3300025322|Ga0209641_10060007All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2928Open in IMG/M
3300025325|Ga0209341_10164822All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1863Open in IMG/M
3300025326|Ga0209342_10024936All Organisms → cellular organisms → Bacteria5811Open in IMG/M
3300027277|Ga0209846_1030314Not Available865Open in IMG/M
3300027379|Ga0209842_1021384All Organisms → cellular organisms → Bacteria1246Open in IMG/M
3300027875|Ga0209283_10327361All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1008Open in IMG/M
3300027957|Ga0209857_1010051All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium1933Open in IMG/M
3300028828|Ga0307312_10949426Not Available570Open in IMG/M
3300030006|Ga0299907_10283579All Organisms → cellular organisms → Bacteria1355Open in IMG/M
3300031576|Ga0247727_10223452Not Available1689Open in IMG/M
3300031720|Ga0307469_10378194Not Available1200Open in IMG/M
3300031912|Ga0306921_11033793Not Available925Open in IMG/M
3300031962|Ga0307479_10676589Not Available1012Open in IMG/M
3300032025|Ga0318507_10527323Not Available514Open in IMG/M
3300032157|Ga0315912_10412748All Organisms → cellular organisms → Bacteria1070Open in IMG/M
3300032180|Ga0307471_100124412All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium2412Open in IMG/M
3300032180|Ga0307471_101399329Not Available860Open in IMG/M
3300032180|Ga0307471_103077152Not Available591Open in IMG/M
3300032180|Ga0307471_103154394Not Available584Open in IMG/M
3300032782|Ga0335082_11215018Not Available622Open in IMG/M
3300033407|Ga0214472_11571066Not Available560Open in IMG/M
3300034114|Ga0364938_090155Not Available583Open in IMG/M
3300034147|Ga0364925_0112758Not Available971Open in IMG/M
3300034148|Ga0364927_0202098Not Available585Open in IMG/M
3300034165|Ga0364942_0090109Not Available990Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil19.21%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment7.95%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil7.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil6.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil5.96%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand5.30%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.30%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.97%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.97%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.31%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil3.31%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil2.65%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment2.65%
Hot Spring SedimentEnvironmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment1.99%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment1.32%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil1.32%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.32%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.66%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands0.66%
EstuarineEnvironmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine0.66%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands0.66%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.66%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.66%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.66%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.66%
Deep Subsurface SedimentEnvironmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment0.66%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm0.66%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere0.66%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.66%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.66%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere0.66%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2032320005Soil microbial communities from sample at FACE Site 5 Oak Ridge CO2-EnvironmentalOpen in IMG/M
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000793Forest soil microbial communities from Amazon forest - 2010 replicate II A001EnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002243Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2EnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300003996Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailB_D2EnvironmentalOpen in IMG/M
3300004267Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBioEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005843Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2Host-AssociatedOpen in IMG/M
3300005983Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S2T1R1Host-AssociatedOpen in IMG/M
3300006049Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006865Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaGEnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009078Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009610Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700EnvironmentalOpen in IMG/M
3300009807Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10EnvironmentalOpen in IMG/M
3300009811Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300009818Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40EnvironmentalOpen in IMG/M
3300009836Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012034Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT526_2EnvironmentalOpen in IMG/M
3300012160Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT630_2EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300014267Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D1EnvironmentalOpen in IMG/M
3300014879Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10DEnvironmentalOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018068Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_b2EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018465Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 ISEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c1EnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021332Metatranscriptome of estuarine sediment microbial communities from the Columbia River estuary, Oregon, United States ? S.384 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300024241Subsurface microbial communities from Mancos shale, Colorado, United States - Mancos A_50_July_PBEnvironmentalOpen in IMG/M
3300025146Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 1EnvironmentalOpen in IMG/M
3300025155Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025164Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025167Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025174Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 3EnvironmentalOpen in IMG/M
3300025310Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025318Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1EnvironmentalOpen in IMG/M
3300025322Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025326Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027277Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes)EnvironmentalOpen in IMG/M
3300027379Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027957Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 (SPAdes)EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032025Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f20EnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300034114Sediment microbial communities from East River floodplain, Colorado, United States - 9_s17EnvironmentalOpen in IMG/M
3300034147Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17EnvironmentalOpen in IMG/M
3300034148Sediment microbial communities from East River floodplain, Colorado, United States - 18_j17EnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FACEORA_9258302032320005SoilPDPKIGPVVILAHDIGKVFTLNSPEEGRPHDIPSADILASLQELREEFDEITARSMILALRHQHAKAEIPLNVPPLTETIFKFIKKADLAASAEESREAAERMKALLPRVIDVFPYIIPELNVNGCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVF
GPIPI_000713302088090014SoilMLPLLPDPRIGPVVVLAHDIGKVYTLSXQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKVEGYLSDGYLFLFKEPVKEELLRHLDTQDAPVFKGQDPVWNEMALALAAXXXXXXXIATKVGEKDAGKKSCLFTIKNSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI
JGI1027J11758_1292416523300000789SoilLNVPPLTESILQFIKKADLAASAEESREAAERMKELLPRVIDIFPYIIPELNVNRCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVAXLXPHLIDRWLQTXIPKIEVL*
AF_2010_repII_A001DRAFT_1012450413300000793Forest SoilPSADIVASLPELREEFDEISARSMILAIRHQHSKAEIPLNAPLLTESILQFIKKADLAASAEESREAAQKMRDLLPRVIEVFPYVIPELNVNGRMGEKAEGYLSDGYLFLFKEPVKEKLLGHIDTQDAPVFKGQDPVWNEMAAALAEAGLITTKAGAKEAGKKSCLFTIKTPQGQEKAI
JGI1027J12803_10047992423300000955SoilMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKVEGYLSDGYLFLFKEPVKEELLRHLDTQDAPVFKGQDPVWNEMALALVEAGLIATKVGEKDAGKKSCLFTIKNSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
C687J29039_1022590713300002243SoilLAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKPEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQ
soilL2_1012914073300003319Sugarcane Root And Bulk SoilVILAHDIGKAFTLNNHGEGQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEMPLNVPPLTEMILQFIKKADIAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLTDGYVFLFKEPVKENLLRHLDTQDAPVFKGQDPVWNEMALALAGASLIATKVGEKDAGKKSCLFTIKTPQGQEKAIAIPVSNLAPHLKERWLQANIPKIEVI*
soilH2_1039664933300003324Sugarcane Root And Bulk SoilPELREDFDEITARSIIIAVRHQHSKADIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPKVTEAFPYILSDLNVNGCMGGKAEGYLSDGYLFLFKESVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLITTKVGEKDAGKKSCLFTIKTFQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0055467_1007849723300003996Natural And Restored WetlandsTLGGDQEGRPHDIPSADIVASLAELQEEFDEITARSMILAIRHQHAKAEMPLNPPPLTETILQFIKKIDLGASAEESREAAEKMRQLIPKVLEFFPYVIPELNVNGCMGGKAEGYLSDGYLYLCKDPVKERLLRMLEVKNAPVFKGQDPVWNEMALALAEAGLITTKAGAREAGKKSCLFTIKVPTGQEKAVAIPVASVAPNLSERWLRANIARIEVL*
Ga0066396_1008969613300004267Tropical Forest SoilAEIPLNAPPLTETLLQFIKKADLAAVAEESRDAAEKMKLLLPKIIEVFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRHVDTQDAPVFKGQDPVWNEMAAALADAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHNLIEQWLQTNSPKIEVL*
Ga0066388_10027596953300005332Tropical Forest SoilLNVPPLTEMILQFIKKADLAASAEESREAAERMKERLPKVIEVFPYVIPELNVNGCMGGKTEGYLSDGYLFLFKEPVKEKLLRHLNSPEAPVFKGQDPVWNEIALGLAGAGLITTKVGERDAGKKSCLFTIKTSQSQEKAIAIPVSNLAPHLKERWLQTNIPKIQVI*
Ga0066388_10334647523300005332Tropical Forest SoilAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMSALVPRVLEVFPYIVLELNVNACMGGKAEGYLCDGYLYLLKEPVKAKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIDRWLQTNIPKIEVL*
Ga0066388_10412639313300005332Tropical Forest SoilPGGLIIHTAKVLRHSEPLLPSLSDPRIGPVVILAHDIGKVFTLSSQQEGQPHDIPSADILTSVPELREEFDEITARSMILAVRHQHSKGEIPLNVPPLTETILKFIKKADLAAAAEESREAAERIKEQLPRVTEVFPYILSELNVNGCRGGKAEGYLSDGYLFLLKEPVKEKLLGEVDTQDAPAFKSQDPVWNEMALALAGAGLITTKLGEKEAGKKSCLFTIKTSQGQEKTIAIPVSNLAPHLKE
Ga0070708_10094713713300005445Corn, Switchgrass And Miscanthus RhizosphereLQFIKKADLNASAEESREAAEKMKELLPRVIEFFPYIISELNVNGCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEMALALTGAGLIATKAHAKEAGKKSCLFTIKTSQGQEKTIAIPVASLAPHLIDRWLQTNIPKIEVL*
Ga0066689_1044804113300005447SoilVVILAHDIGKAFTLSGQAEGQPHDIPSADILASVAELREDFDEITARSMILAVRHQHSKAEMPLHVPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLHHLDTQEAPAFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKERWLQANIPKIEMI*
Ga0066681_1060269013300005451SoilNVPPLTGTILQFIKKSDLAASAEETREAAERMKGQLPKVIEVFPYVISELNVNGCMGGKGEGYLSDGYLFLFKEPVKETFLRHLDSPEAPVFKGQDSVWNEMALALAEAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMV*
Ga0068867_10236775213300005459Miscanthus RhizosphereTILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGVKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTSRWLQSNIPKIEVL*
Ga0070697_10128023413300005536Corn, Switchgrass And Miscanthus RhizosphereSHHLSDAGGLIIHTTKVLKYSEPLLPLLPDPKIGAVVILAHDIGKVFTLNSPEEGRPHDIPSADILASLQELREEFDEITARSMILALRHQHSKAEIPLNVPPLTESILQFIKKADLAASAEESREAAERMKELLPRVIDVFPYIIPELNVNGCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEMALALAGAGLITTKAGAK
Ga0066695_1030315923300005553SoilQEGQPHDIPSADILASVPELREDFDEVTARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPELNVNGCMGGKVEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKDQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMV*
Ga0066905_10136951513300005713Tropical Forest SoilPELREEFDEITARSMILAIGHQHSKAEIPLNAPPLTETIFQFIKKADLSASAEESREAAVKMRTLLPKVIEFFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRYVDTQGAPVFKGQDLVWNEMAAGLAEAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHHLLEQWLQTNSPKIEVL*
Ga0066903_10706577213300005764Tropical Forest SoilDIPSADIVASLPELREEFDAVTARAMILAIRHQHSKAEIPLNAPPLTETLLQFIKKADLAAVAEESRDAAEKMKLLLPKIIEVFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRHVDTQDAPVFKGQDPVWNEMAAALADAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHNLI
Ga0068860_10177926013300005843Switchgrass RhizospherePEPREEFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRSLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL*
Ga0081540_118973523300005983Tabebuia Heterophylla RhizosphereQPHDIPSADILASVPELREEFDEITARSMILAVRHQHSKGEIPLNVPPLTEAILQFIKKADLAASAEESRDAAEKMKERLPRVVELFPYILSELNVNGCMGGKAEGYLSGGYLFLSKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGEKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPHLKERWLQTTIPKIEVI*
Ga0075417_1009352923300006049Populus RhizosphereVGPVVILAHDIGKTFTLGGQAEGQPHDIPSADILASVVELREDFDEITARSMIHAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVVEVFPYMIPELNVNGCMGGKGEGYLSDGHLFLFKEPVKEKLLRHLDTQDAPVFKGHDPVWNEMALALAEAGLITTKIGERDAGKKSCLFTIKTSQAQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0075431_10089158713300006847Populus RhizosphereEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVVEVFPYMIPELNVNGCMGGKGEGYLSDGHLFLFKEPVKEKLLRHLDTQDAPVFKGHDPVWNEMALALAEAGLITTKIGERDAGKKSCLFTIKTSQAQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0075431_10114622213300006847Populus RhizosphereALSMLIAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELMPRVLEVFPYIVPELNVNACMGGKAEGYLCDGYLYLIKEPVKEKLLGSLEVHNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGPEKTIAIPVASLATHLTNRWLQTNIPKIEVL*
Ga0075425_10122728423300006854Populus RhizosphereVVVLAHDIGKVYTLSNQAEGQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESRESAERMKERLPRVIEVFPYMIPELNVNGCMGGKVEGYLSDGYLFLFKEPVKEELLRHLDTQDAPVFKGQDPVWNEMALALVEAGLITTKIGERDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKKRWLQTNIPKIEIL*
Ga0073934_1013157933300006865Hot Spring SedimentLLPDPRIGPVVILANDIGKVLTLGDRGEVRPHDIPSADLLASLAELREEFDEITARSMILAIRHQHSKAEIPLNGPPLTQTILQFIKKADLAASAEESREAAERMRELAPRVIDAFPYIIAELNVNGCMGGKAEGYLSDGYVFLFKEPVKEKLLSYLDTRNAPVFKGQDPVWNEMALALAGAGLITTNIGEKDAGKKSSLFTVKTSQGQEKAIAIPVSNLATHLTERWLQANIPKIEVV*
Ga0073934_1041466913300006865Hot Spring SedimentNAPPLAETIVQFIKKADLAAAAEESREAGEDMKGRAAKILEVFPSIVSELNVNGWMGGKAQGYLSQGYLFLLKEPLKAKLLQHLGTQNAPVFKGQDPVWNEMAIALAGAGMITTKALAKEAGKKSCLFTIKTAQGRERVIAIPVANLAPQMTNKWLQTAVPEIEVL*
Ga0075426_1153019713300006903Populus RhizosphereGPVVILAHDIGKVFTLNSPEEGRPHDIPSADILASLQELREEFDEITARSMILALRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKELLPRVIDIFPYIIPELNVNRCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEM
Ga0075435_10097992023300007076Populus RhizosphereIKKADLAASAEESREAAERMKELLPRVIDIFPYIIPELNVNRCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIDRWLQTTIPKIEVL*
Ga0066710_10036121243300009012Grasslands SoilEPLLPLLSDPRVGSVVILAHDIGKVFTLSNQQEGQPHDIPSADILASVPELREDFDEVTARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPELNVNGCMGGKGEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMSLALVGAGLIATKVGEKDAGKKSCLFTIKTFQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMV
Ga0066710_10299308613300009012Grasslands SoilELLLPTLPDPNIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRGLVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKERLLRNLKIQNAPVFKGQDPVWNEMALALAGAALITTKAGAKEAGKKSCLFTIKTPQGQEKTIA
Ga0105106_1060934813300009078Freshwater SedimentAKVLKHSEPLLPTLPDPRIGPLVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESWEAAEKMKELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKKKLLRGLEVGTAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKRSCLFTIKTPQGQEKAIAMSVTSLAPHLTNRWLQTNIPKIEVL*
Ga0105106_1061505913300009078Freshwater SedimentKAEIPQNAPPLAETTLQFIKKADLAAAAEESREAAGIMKGLAGKVLEAFPLIVSELNVNGWMGGKAQGYLSQGYLFLLKEPLKAKLLQHLGTQNAPVFKGQDPVWNEMAIALGGAGMITTKALAKEAGKKSCLFTIKTSQGRERVIAIPVANLAPQMTNQWLQTAVPEIEVL*
Ga0066709_10137377313300009137Grasslands SoilDPRIGPVIILAHDIGKVFTLSNQREGQPHDIPSADILASVPELREDFDEITARSMILAVRHQHSRAEIPLNVPPLTETILQFIKKADIAAAAEESREAAERMKERVPRVIEAFPYILSELNVNSCMGGKTEGYLSGGYLFLFKEPVKEKLLGRLDSQDAPVFKGQDPVWNEMALALAGADLITTKAGERDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0066709_10155664913300009137Grasslands SoilMLSLLPDLRIGPVVVLAHDIGKAFTLSGQAEGQPHDIPSADILASVGELREDFDEITARSMILAVRHQHSKAEIPLNIPPLTETILQFIKKADFAASAEESREAAERMKQRLPRVIEVFPYIIPDLNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLMRHLDAQDAPVFKGQDAVWNEMALALAEAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSN
Ga0066709_10248936423300009137Grasslands SoilMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPDLNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMV*
Ga0114129_1017207823300009147Populus RhizosphereMLPDPNIGPVVILAHDTGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMKELVPKVLELFPCTVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPIFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLTNKWLQTNIPKIEVL*
Ga0075423_1128985113300009162Populus RhizosphereVVILAHDIGKTFTLGGQAEGQPHDIPSADILASVVELREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERIKERLPKVTEAFPYILSELNVNGCMGGKAEGYLSDGYLFLFKESVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLITTKIGEKDAGKKSCLFTIKTSQSQEKAIAIPVSNLAPHLKQRWLQTNIPKIE
Ga0105340_122395023300009610SoilPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELLPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIGRWLQTNIPKIEVL*
Ga0105061_105780113300009807Groundwater SandIIHTAKVLRHSEPLLPLLPDPRIGPVVILAHDIGKAFTLSGQAEGQPHDIPSADILASVPQLREDFDEITARSMILALRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLIA
Ga0105084_106892613300009811Groundwater SandADLAASAEESREAAERMKERLPRVIEIFPYMIPELNVNGCMGGKAEGYLSDGYVFLFKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALVEAGLITTKIGERDAGKKSCLFTIKTYQGQEKAIAIPVSNLAPHLKKRWLQTNIPKIEVI*
Ga0105076_110089813300009816Groundwater SandDILASVPQLREDFEEITARSMILAVRHQHSKAEIPLNIPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVEAGLITTKIGERDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPH
Ga0105072_106778023300009818Groundwater SandIKKADLAASAEESREAAERMKERLPRVIEVFPYIISELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKKRWLQTNIPKIEVV*
Ga0105068_111749113300009836Groundwater SandILAHDIGKAFTLSGQAEGQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEIPLNIPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLITT
Ga0126382_1166969113300010047Tropical Forest SoilDPRIGSVVILAHDIGKVFTLSNQQEGQPHDIPSANILGSVSELREEFDEITARSMILAVRHQHSKAEIPLNAPPLTEAILQFIKKADLAAVAEESREAAERIKEQLPRVTEVFPYILSELNVNGCRGGKAEGYLSDGYLFLVKEPVKEKLLGEVDTQDAPAFKGQDPVWNEMALALAGAGLIATKVGEKEAGKKSCL
Ga0126373_1231164313300010048Tropical Forest SoilEIPLNAPPLTETLLQFIKKADLAAVAEESRDAAEKMKLLLPKIIEVFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRHVDTQDAPVFKGQDPVWNEMAAALADAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHNLIEQWLQTNSPKIEVL*
Ga0134088_1064180013300010304Grasslands SoilHSEPLLPLLSDPRVGSVVILAHDIGKVFTLSNQQEGQPHDIPSADILASVPELREDFDEVTARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPKVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRQLDTQDAPVFKGQD
Ga0126376_1015427633300010359Tropical Forest SoilSADILGSVSELREEFDETTARSMILAVRHQHSKGEIPLNAPPLTETILQFIKKADLAASAEESRDAAEKMKERLPRVVELFPYILSELNVNGCMGGKAEGHLSGGYLFLSKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGDKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0126372_1302248813300010360Tropical Forest SoilPSADIVASLPELREEFDEITARSVILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMSALVPRVLEVFPYIVLELNVNACMGGKAEGYLCDGYLYLLKEPVKAKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLITTKAGAKEAGKKSCLFTIKTPQ
Ga0126378_1187126813300010361Tropical Forest SoilCETGGLIAHTAKVLVHSEPLLPLLPDPRTGTVVILAHDIGKLLTLNNQEEGRPHDIPSADIVASLPELREEFDEITARSMILAIGHQHSKAEIPLNAPPLTETIFQFIKKADLSASAEESREAAVKMRTLLPKVIEFFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRYVDTQGAPVFKGQDLVWNEMAAGLAEAGLITTRARAKEAGKKSCLFTIKT
Ga0126378_1294340513300010361Tropical Forest SoilADIMASIPELREDFDETTACSMILAVRHQHSKGEIPLNAPPLTETILQFIKKADLAAAAEESRDAAEKMKERLPRVVELFPYILSELNVNGCMGGKAEGHLSGGYLFLSKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGDKEAGKKSCLFTIKTSGGQEKAVAIP
Ga0126383_1086292013300010398Tropical Forest SoilEGQPHDIPSADILGSVSELREEFDEITARSMILAVRHQHSKAEIPLNAPPLTETILQFIKKADLAAAAEESREAAERIKEQLPRVTEVFPYILSELNVNGCRGGKAEGYLSDGYLFLVKEPVKEKLLGEVDTQDAPAFKGQDPVWNEMALALAGAGLIATKVDEKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0126383_1248422713300010398Tropical Forest SoilSSQQEGQPHDIPSADILGSVSELREEFDETTARSMILAVRHQHSKGEIPLNVPPLTETILQFIKKADLAAAAEESRDAAEKMKERVPRVVELFPYILSELNVNGCMGGKAEGYLSGGYLFLSKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGEKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPH
Ga0134127_1119186123300010399Terrestrial SoilFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLYDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTNRWLQSNIFKIEVL*
Ga0137393_1035230613300011271Vadose Zone SoilEFDEITARSMILALRHQHSKAEIPLNVPPLTETILQFIKKADLSASAEESREAAEKMKELFPRVIEVFPYIIPELNVNGCMGGKAEGYLSEGYLFLLKEPVKEKLLRHLDTQNAPVFKGQDPVWNEMALALARAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIDRWLQTNIPKIEVL*
Ga0137393_1095380023300011271Vadose Zone SoilAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMSALVPRVLEVFPYIISELNVNGCMGGKAEGYLSDGYLFLLKEPVKEKLLSRLDTQNAPVFKGQDPVWNEMALALAEAGLITTTARAKEAGRKSCLFTIKTSQGQEKAIAISIPNVAHHLIEKWLQGDISKIEVI*
Ga0137453_104973423300012034SoilSLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPLGQEKTIAIPVANLAPHLTNRWLQSNIPKIEVL*
Ga0137349_102002513300012160SoilILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESWEAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIGRWLQTNIPKIEVL*
Ga0137382_1087013023300012200Vadose Zone SoilTENILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIRSPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL*
Ga0137365_1130749913300012201Vadose Zone SoilAFTLSGQAEVQPHDIPSADILASVAELREDFDEITARSMILAVRHQHSKAEIPLNIPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKAEGHLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLIATKF
Ga0137380_1089767313300012206Vadose Zone SoilEPLLPTLPDPKIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRNLKIQNASVFKGQDPVWNEMALALAGAGLITTRAGAKEAGKKSCLFTIKSPQGQEKTIALPVASLAPHLTNRWLQSNIPKIEVL*
Ga0137380_1113752213300012206Vadose Zone SoilRSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADIAAAAEESREAAERMKERLPRVIEAFPYILSELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLMRHLDAQDAPVFKGQDPVWNEMALALAGAGLIATKIGERDAGKKSCLFTIKTSQGQEKVIAIPVSTLAPHLKERWLQTNIPKIEVI*
Ga0137381_1065435113300012207Vadose Zone SoilQHSKAEILLNILPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYVIPELNVNGCMGGKGEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTFQGQEKAIAIPVSNLAPHLRGRWLQTNIPKIEVI*
Ga0137381_1077170313300012207Vadose Zone SoilSSPGGLVIHTAKALRQSEPLLPLLPDPRIGPVVVLAHDIGKAFTLNGRAEGQPHDIPSADVLASVAELREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEAFPYIIPALNVNGCKGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAKAGLITTKVGERDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0137379_1122104613300012209Vadose Zone SoilAKVLKHSEPLLPTLPDPKIGPVVILAHDIGKIVTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEMMRELLPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLEVQNAPVFKGQDPVWNEMALALAGAGLITTRAGAKEAGKKSCLFTIKSPQ
Ga0137370_1095384913300012285Vadose Zone SoilELREDFDEVTARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPELNVNGCMGGKVEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDEGKKSCLFTIKTFQGQEKAIAIPVSNL
Ga0137387_1040578813300012349Vadose Zone SoilLASVAELREDFDEITARSMILAVRHQHSKAEIPLNIPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKERLLRLLDTQDAPVFKGEDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTFQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137372_1069440813300012350Vadose Zone SoilPLTETVLQFIKKADIAASAEESREAAEKIKERLPRVVELFPYILSELNVNGCMGGKGEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137372_1095992713300012350Vadose Zone SoilHQHSKAEMPLNIPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKAEGHLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLIATKFGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLASHLKARWLQTNIPKVEVI*
Ga0137386_1068791123300012351Vadose Zone SoilPSADILASVAELREDFDEITARSMILAVRHQHSKAEIPLNIPPLTETILQFIKKADLAASAEESREAAERMKQRPPRVIEVFPYIIPDLNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLMRHLDAQDAPVFKGQDAVWNEMALALAEAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQGNIPRIEVI*
Ga0137367_1031915513300012353Vadose Zone SoilVVILAHDIGKILTLSNDREGQPHDIPSADIVASLSELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYTVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPYLIDKWLQTNIPKIEVL*
Ga0137366_1112086013300012354Vadose Zone SoilDIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTTQGQEKTIAIPV
Ga0137369_1030134613300012355Vadose Zone SoilMILAVRHQHSRAEMPLNIPPLTETILQFIKKADLAASAEESREAAEKMKKQLPRVIEVFPYLIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSGGQEKAIAIPVSNLAHHLKERWLQANIPKIEVI*
Ga0137384_1101988613300012357Vadose Zone SoilTAKVLRHSEPMLSLLPDPRIGPVVVLAHDIGKAFTLSGQAEGQPHDIPSADILASVGELREDFDEITARSMILAVRHQHSKAEIPLNIPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAAAGLIATKVGEKDAGKKSCLFTIKNSQG
Ga0137368_1031504913300012358Vadose Zone SoilKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVCNEMALALAGAGLITTKAGAKEEGKKSCLFTIKSPQGQEKTIALPVASLAPHLTNRWLQSNIPKIEVL*
Ga0137368_1035595313300012358Vadose Zone SoilASVAELREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVTEVFPYIIPDLNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPIFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137385_1066510813300012359Vadose Zone SoilMLSLLPDPRIGPVVVLAHDIGKVYTLSNQAEGQPHDIPSADILASVAELREDFDEITARSMILAVRHQHSKAEMPLNVPPLTETILQFIKKAELAASAEESREAAERMKERLPRVTEVFPYIIPELNVNGCMGGKGEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIVIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137375_1039445623300012360Vadose Zone SoilMILAHDIGKAFTLSGQAEGQPHDIPSADILASVAELREDFDEITARSMILAVRHQHSKAEIPLNAPPLTETVLQFIKKADIAASAEESREAAEKIKERLPRVVELFPYILSELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRLLDTQDAPVFKGQDPVWNEMALALAEAGLITTKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137375_1040904413300012360Vadose Zone SoilDEITARSMIIALRHQHSKTEIPLNVPPLTETILQFIKKADLAASAEESREAAERIKERLPKVTEAFPYILSELNVNGCMGGKGEGYLSDGYLFLLKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALVGAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLRERWLQANIPKIEMI*
Ga0137360_1042319323300012361Vadose Zone SoilMGPVVVLAHDIGKVFTLSGQAEGQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEIPLNVPPLTEVILQFTKKADLAASAEESREAAERMKQRLPRVIEVFPYMIPEVNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLHHLDTQDAPVFKGQDPVWNEMALALAEAGLITTKVGENDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKE
Ga0137373_1092705613300012532Vadose Zone SoilIHTAKALRQSEPLLPLLPDPRIGPVVILAHDIGKAFTLSGQAEGQPHDIPSADILASVAELREDFDEITARSMILAVRHQHSKAEIPLNVPPLTETVLQFIKKADIAASAEESREAAEKIKERLPRVVELFPYILSELNVNGCMGGKTEGCLSDGYLFLFKEPVKEKLLGRLDSQDAPVFKGQDPVWNEMALALAGVGLITMKVGERDAGK
Ga0137373_1101476323300012532Vadose Zone SoilADLSASAEESREAAEKMRELVPKVLELFPYTVPELNVNGCMGGKAEGYLSEGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGERDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKERWLQTNIPKVEVI*
Ga0137358_1081214713300012582Vadose Zone SoilGGLIIHTAKVLRHSEPLLPLLPDPRMGPVVVLAHDIGKVFTLSGQAEGQPHDIPSADILASVPQLREDFDEITARSMILAVRHQHSKAEIPQNGPPLTEVSLQFTKKADLAASAEESREAAERMKQRLPRVIEVFPYMIPEVNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLHHLDTQDAPVFKGQDPVWNEMALALAEAGLITT
Ga0137404_1141060313300012929Vadose Zone SoilDILASVAELREDFDEITARSMILAIRHQHSKGEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPFVIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAAAGLIATKVGEKDAGKKSCLFTIKNSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0153915_1148988613300012931Freshwater WetlandsPLTETILQFIKKADLSASAEESREAAEKMRGSVPRVLELFPYIVHELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTRAGAKEAGKKSCLFTIKTPQGQEKAIAVPVANVAHDLIERWLQGNIPKIEVL*
Ga0126375_1133146913300012948Tropical Forest SoilQPHDIPSADILGSVSELREEFDETTARSMILAVRHQHSKGEIPLNVPPLTETILQFIKKADLAASAEESRDAAEKMKERVPRVVELFPYILSELNVNGCMGGKAEGYLSGGYLFLSKEPVKEKLLSHLDTQDAPVFKGQDPVWNEMALALAGAGLIATKVGEKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPHLKERWLQ
Ga0126375_1207807913300012948Tropical Forest SoilRHQHSKAEIPLNAPPLTETILQFIKKADLAAAAEESREAAERIKEQLPRVTEVFPYILSELNVNGCRGGKAEGYLSDGYLFLLKEPVKEKLLGEVDTQDAPAFKGQDPVWNEMALALAGAGLITSKVGEKEAGKKSCLFTIKTSGGQEKAVAIPVSNLAPHLKERWLQ
Ga0075313_107252923300014267Natural And Restored WetlandsLNNQGERQPHDIPSADILASVPELREDFDEITARSMVIALRHQHSKTEMPLNVPPLTETILQFIKKADLAASAEESREAAEKIKERLPEVVELFSYILSELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLGRLDSQDAPVFKGQDPVWNEMALALAGAGLIATKIGEKDAGKKSCLFTIKTSQSQEKAIVIPVSNLAPHLKERWLQTNIPKIEVI*
Ga0180062_110193113300014879SoilTPASRSHHLNESGGLIVHTAKVLSHSEPLLSTLPDPKIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAQPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLQKLKIQNAPVFKGQDPVWNEMALALAGAGLITT
Ga0132256_10306464813300015372Arabidopsis RhizosphereLAIRHQHSKAEIPLNAPPLTETILLFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKERLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEGGKKSCLFTIKTPQGQEKTIAIPVANLAPNLTNRWLQSNIPKIEVL
Ga0184634_1020581713300018031Groundwater SedimentILQFIKKADLSASAEESREAAEKMRELVPKVLELFPCTVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRNLKVQNAPVFKGQDPVWNEMALALARAGLITTKTGGKEAGKKSCLFTIKTPQGQEKSIAIPVASLAPYLIDKWLQTNIPKIEVL
Ga0184621_1014916313300018054Groundwater SedimentPHDIPSADIVASLPQLREEFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL
Ga0184619_1018720013300018061Groundwater SedimentKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKERLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLALHLINRWLQTNIPKIEVL
Ga0184636_128134113300018068Groundwater SedimentKIGPVVILAHDVGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMPAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPRVLELFPYIVRELNVNGCMGGKAEGFLCDGYLYLLKEPLKEKLLRRLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCL
Ga0184618_1024944113300018071Groundwater SedimentLAHDIGKILTLSNDREGQPHDIPSADIVASLPEPREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCIGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL
Ga0184635_1034133213300018072Groundwater SedimentVHTANALKHSESLLSTLPDPRIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPCTVPELNVNGCMGGRAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMAL
Ga0184624_1000615813300018073Groundwater SedimentLLSTLPDPRIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIGRWLQTNIPKIEVL
Ga0184632_1037808713300018075Groundwater SedimentKADLSASAEESREAAEKMRELVPKVLELFPCTVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPTGQEKAVAISVASLAPHLIDRWLRANIITRIEVL
Ga0184632_1048979213300018075Groundwater SedimentVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYIVPELNVNGCMGGKAEGYLSDGYLFLFKEPGKEKLLRHLDTQDAPVFKGQDPVWNEMALALAQAGLITTKVGKKDAGKKSCLFTIKTSQGQEKAIAIPASNLAPHLKERWLQTNIPKIEVV
Ga0184609_1005148013300018076Groundwater SedimentQFIKKADLSASAEESREAAEKMRELVPKVLELFPYTVPELNVNGCMGGKAEGYLCDGYVYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPRLIDRWLQTNIPKIEVL
Ga0184625_1001737713300018081Groundwater SedimentMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIGRWLQTNIPKIEVL
Ga0184625_1035926313300018081Groundwater SedimentGLIVHTAKVLKHSEPLLPTLPDPKIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLLELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTENILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKACAKEAGTKSCLFTIKTPQGQEKTIAIPVASLAPYLIDKWLQTNIP
Ga0190265_1016738013300018422SoilEDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNSCMGGKTEGYLCDGYLYLLKEPVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL
Ga0190272_1230261813300018429SoilLPELREEFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAGKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIVIPVASLAPHLIDRWLQSNIPK
Ga0190269_1129878113300018465SoilMILAVRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKVEGYLSDGYLFLFKEPVKEELLRHLDTQDAPVFKGQDPVWNEMALALAAAGLIATKVGEKDAGKKSCLFTIKNSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI
Ga0190270_1062641513300018469SoilIPSADIVASLPELREEFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRSLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKSPQGQEKTIAIPVASLAPHLTNRWLQSNIPKIEVL
Ga0193755_115064713300020004SoilLLPTLPDPRIGPVVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKAEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTSHGQEKTIAIPVASLAPYLIDKWLQTN
Ga0193726_133454213300020021SoilLRHSEPLLSTLPDSKIGPVVILAHDIGKILTLRNDREGQPHDIPSADIVASLPEPREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKIRELVPKVLELFPYIVPELNVNGCMGGKAEGYLYDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALAL
Ga0210379_1048135013300021081Groundwater SedimentHAIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLIETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIA
Ga0210339_112323513300021332EstuarineKADLAAAAEESREAAEIMKGLAGKVLEAFPLIVSELNVNGWMGGKAQGYLSQGYLFLLKEPLKAKLLQHLGTQNAPVLKGQDPVWNEMAIALAGAGMITTKALAKEAGKKSCLFTIKTSQGRERVIAIPVANLAPQMTSQWLQTAVPEIEVL
Ga0126371_1080464213300021560Tropical Forest SoilQEEGRPHDIPSADIVASLPELREEFDAVTARAMILAIRHQHSKAEIPLNAPPLTETLLQFIKKADLAAVAEESRDAAEKMKLLLPKIIEVFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRHVDTQDAPVFKGQDPVWNEMAAALADAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHHLLEQWLQTNSPKIEVL
Ga0126371_1256530813300021560Tropical Forest SoilHSEPLLPLLPDPRTGTVVILAHDIGKLLTLNNQEEGRPHDIPSADIVASLPELREEFDEITARSMILAIGHQHSKAEIPLNAPPLTETIFQFIKKADLSASAEESREAAVKMRTLLPKVIESFPYIIPELNVNGCMGGKAEGYLSDGYLFLFKEPVKEKLLRYVDTQGAPVFKGQDLVWNEMAAGLAEAGLITTRARAKEAGKKS
Ga0233392_103652513300024241Deep Subsurface SedimentDVGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIQLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPRVLELFPYIVPELNVNGCMGGKAEGFLCDGYLYLLKEPVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKTGAKEA
Ga0209322_1012509123300025146SoilMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209320_1028323713300025155SoilAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADVDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLLKEPVKEKLLRNLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209109_1017886423300025160SoilHDIPSADLVAALPELRADVDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVKEKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKVIALPVASLAPHLTNRWLQTNIPKIEVL
Ga0209109_1040006313300025160SoilKHSEPLLPTLPDPRIGPLVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKT
Ga0209521_1011263923300025164SoilVVILAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209108_1039196923300025165SoilLQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVREKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209642_1010433533300025167SoilFDEITARSMILAIRHQHSKPEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209324_1074717313300025174SoilIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209172_1010930123300025310Hot Spring SedimentDEITARSMILAIRHQHSKAEIPLNGPPLTQTILQFIKKADLAASAEESREAAERMRELAPRVIDAFPYIIAELNVNGCMGGKAEGYLSDGYVFLFKEPVKEKLLSYLDTRNAPVFKGQDPVWNEMALALAGAGLITTNIGEKDAGKKSSLFTVKTSQGQEKAIAIPVSNLATHLTERWLQANIPKIEVV
Ga0209519_1002920113300025318SoilLRADFDEITARSMILAIRHQHSKPEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209519_1060641813300025318SoilLLPTLPNPRIGPVVILAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADVDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVREKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKK
Ga0209641_1006000753300025322SoilLAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKPEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209341_1016482233300025325SoilGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209342_1002493663300025326SoilVVILAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKPEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKAEGYLCDGYLYLMKESVKEKLLRSLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKVIALAVASLAPHLTNRWLQTNIPKIEVL
Ga0209846_103031423300027277Groundwater SandSSPGGLIIHTAKVLRHSEQLLPLLPDPRIGSVVTLAHDIGKVFTLSNQGEGQPHDIPSADILASVPQLREDFEEITARSMILAVRHQHSKAEIPLNIPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAQAGLITTKVGKKDAGKKSCLFTIKTSQGQEKAIAIPASNLAPHLKERWLQTNIPKIEIL
Ga0209842_102138413300027379Groundwater SandGSVVTLAHDIGKVFTLSNQGEGQPHDIPSADILASVPQLREDFEEITARSMILAVRHQHSKAEIPLNIPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAQAGLITTKVGKKDAGKKSCLFTIKTSQGQEKAIAIPASNLAPHLKERWLQTNIPKIEVV
Ga0209283_1032736123300027875Vadose Zone SoilARASSNSKVKKAAKASSAKPTKKNSTQAEKLRAIALPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMSALVPRVLEVFPYIVPELNVNACMGGKAEGYLCDGYLYLLKEPVKEKLLRNLEVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKAIAIPVASLAPHLIDRWLQTNIPKIEVL
Ga0209857_101005143300027957Groundwater SandLRHSEQLLPLLPDPRIGSVVTLAHDIGKVFTLSNQGEGQPHDIPSADILASVPQLREDFEEITARSMILAVRHQHSKAEIPLNIPPLTEMILQFIKKADLAASAEESREAAERMKERLPRVIEVFPYMIPELNVNGCMGGKGEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAEAGLIATKVGEKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLKKRWLQTNIPKIEIL
Ga0307312_1094942613300028828SoilHSKAEIPLNSPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVPELNVNGCMGGKTEGYLCDGYLYLLKEPVKERLLRNLKVQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLALHLINRWLQTNIPKIEVL
Ga0299907_1028357923300030006SoilEGRPHDIPSADIVASLAELREEFDEITARSMILAIRHQHAKAEIPLNAPPLVETILQFIKKADLSASGEESREAAEKMKELIPKVLDLFPYIVPELNVNGCMGGKAEGYLSDGYLYLFKEPVKERLLRILEVKNAPVFKGQDPVWNEMALALAGASLITTKAGAKEAGKKSCLFTIKTPTGQEKAVAIPVAALAPDLIDRWLRANITRIEVL
Ga0247727_1022345223300031576BiofilmPNPKIGPVVILAHDIGKVLTLSNQGEGLSACSAQAGRPHDIPSADLLASLGELREEFDEITARSMILAIRHQHSRAEVPLNAPPLTQTILQFIKKADLAASAEESREAAERMRELAPRVIDAFPYIIPELNVNGCMGGKPEGYLSDGYLFLFKEPVKENLLSHLDTRNAPVFKGQDPVWNEMALALAGAGLITTQVATKDAGKKSCLFTIKTSQGQEKAIAIPVSNLAPHLTERWLQANIPKIEVV
Ga0307469_1037819423300031720Hardwood Forest SoilHTAKVLKHSEPLLSLLPDPKIGPVVILAHDIGKVFTLNSPEEGRPHDIPSADILASLQELREEFDEITARSMILALRHQHSKAEIPLNVPPLTESILQFIKKADLAASAEESREAAERMKELLPRVIDVFPYIIPELNVNGYMGGKAEGYLSDGYLFLLKEPVKEELLNRLETQNGPVFKGQDPVWNEMALALTEAGLITTKIGERDAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIDRWLQTNIPKIEVL
Ga0306921_1103379323300031912SoilVGPVVILAHDIGKVFTLNNPGEGQPHDIPSADILASVPELREDFDEITTRSIILAVRHQHSKTDIPLNAPPQTETILQFIKKADLTASAEESREAAERMKEQLPRVIEVFPYVIPDLNVNACMGGKAEGYLSDGYLFLFKEPVKEKLLSHLNTQDAPVFKGQDPVWNEMALALAEAGLITTKVGEKDAGKKSCLFTIKTSQGQEKSIAIPVSNLAPHLKERWLQTNIPKIEVI
Ga0307479_1067658913300031962Hardwood Forest SoilSMILALRHQHAKAEIPLNVPPLTETIFQFIKKADLAASAEESREAAERMKALLPRVIDVFPYIIPELNVNGCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEIALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIDRWLQTNIPKIEVL
Ga0318507_1052732313300032025SoilLLPDPRVGPVVILAHDIGKVFTLNNPGEGQPHDIPSADILASVPELREDFDEITTRSIILAVRHQHSKTDIPLNAPPQTETILQFIKKADLTASAEESREAAERMKEQLPRVIEVFPYVIPDLNVNACMGGKAEGYLSDGYLFLFKEPVKEKLLSHLNTQDAPVFKGQDP
Ga0315912_1041274813300032157SoilAVTARSIILAIRHQYSKAEIPQNAPPLAETILQFIKKADLAAAAEESREAAEDMRGQAAKVLEVFPSIVSELNVNGWMGGKAQGYLSQGYLFLLKEPLKAKLLQHLGTQNAPVFKGQDPVWNEMAIALAGAGMITTKALAKEAGKKSCLFTIKISQGRERVIAIPVANLAPQMTNQWLQTAVPEIEVL
Ga0307471_10012441253300032180Hardwood Forest SoilLASLQELREEFDEITARSMILALRHQHSKAEIPLNVPPLTESILQFIKKADLAASAEESREAAERMKELLPRVIDVFPYIIPELNVNGYMGGKAEGYLSDGYLFLLKEPVKEELLNRLETQNGPVFKGQDPVWNEMALALTEAGLITTKIGERDAGKKSCLFTIKTPQGQEKTIAIPGASLAPHLIDRWLQTNIPKIEVL
Ga0307471_10139932913300032180Hardwood Forest SoilAEIPLNVPPLTEVILQFIKKADLAASAEESREAAERMKERLPRVVEVFPYMIPELNVNGCMGGKVEGYLSDGYLFLFKEPVKEKLLRHLDTQDAPVFKGQDPVWNEMALALAAAGLIATKVGEKDAGKKSCLFTIKNSQGQEKAIAIPVSNLAPHLKERWLQTNIPKIEVI
Ga0307471_10307715213300032180Hardwood Forest SoilGLIIHTAKVLKHSEPLLSLLPDPKIGPVVILAHDIGKVFTLNSPEEGRPHDIPSADILASLQELREEFDEITARSMILALRHQHSKAEIPLNVPPLTETILQFIKKADLAASAEESREAAERMKELLPRVIDIFPYIIPELNVNRCMGGRAEGYLSDGYLFLLKEPVKEKLLNRLETQNAPVFKGQDPVWNEMALA
Ga0307471_10315439413300032180Hardwood Forest SoilHQHSKAEIPLNAPPLTETIFQFIKKADLNASAEESREAAEKMKELLPRVIEFFPYIISELNVNGCMGGKAEGYLSDGYLFLLKEPVKEKLLSRLDTQNAPVFKGQDPVWNEMAAALADAGLITTRARAKEAGKKSCLFTIKTPQGQEKAIAIPIPNVAHNLIEQWLQKNSPQIEVL
Ga0335082_1121501813300032782SoilSEPLLPLLPNPKMGPAVILAHDIGKVLTLNSPEEGLPHDIPSADILASLQELREEFDEITARSMILALRHQHSKAEIPLNFPRLTETILQFIKKADLSASAEESKEAAETMKELLPRVVKVFPDMIPELNVNGCMGGTAEGYLSDGYLYLLKEPVKEKLLHNLEVQNAPVFKGQDPVWNGMALALAGAGLITTKAGTKEAGKKSCL
Ga0214472_1157106613300033407SoilKIFTFSNDREGQPHDIPSADLVAALPELRADVDEMTARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVREKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIK
Ga0364938_090155_2_5473300034114SedimentVVILAHDIGKIFTFSNDREGQPHDIPSADLVAALPELRADFDEITARSMILAIRHQHSKAEIPLNAPPLTETILQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVKEKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGA
Ga0364925_0112758_2_5203300034147SedimentKAEIPLNAPPLTETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFTIKTPQGQEKTIAIPVASLAPHLIGRWLQTNIPKIEVL
Ga0364927_0202098_3_5843300034148SedimentGPLVILAHDIGKILTLSNDREGQPHDIPSADIVASLPELREDFDEMTAQSMILAIRHQHSKAEIPLNAPPLIETILQFIKKADLSASAEESREAAEKMRELVPKVLELFPYIVAELNVNGCMGGKTEGYLCDGYLYLLKESVKEKLLRNLKIQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLF
Ga0364942_0090109_461_9883300034165SedimentQHAKAEIPLNAPPLTETILQFIKKADLGASAEESQEAGEGMRELVPRVLELFPYIIADLNVNACMGGKAEGYLSDGYLFLLKAPVKEKLLSLLGTQNAPVFKGQDPVWNEMALALAGAGLITTKAGAKEAGKKSCLFAIKTPQGQEKTIAIPVASLAPHLTNRWLQTNVPKIEVL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.