NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F046260

Metagenome Family F046260

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F046260
Family Type Metagenome
Number of Sequences 151
Average Sequence Length 272 residues
Representative Sequence VAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Number of Associated Samples 111
Number of Associated Scaffolds 151

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.66 %
% of genes near scaffold ends (potentially truncated) 96.03 %
% of genes from short scaffolds (< 2000 bps) 85.43 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(34.437 % of family members)
Environment Ontology (ENVO) Unclassified
(52.318 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(62.252 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 48.11%    β-sheet: 6.53%    Coil/Unstructured: 45.36%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 151 Family Scaffolds
PF00723Glyco_hydro_15 1.32
PF00912Transgly 0.66
PF01436NHL 0.66
PF08450SGL 0.66
PF00300His_Phos_1 0.66
PF00892EamA 0.66

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 151 Family Scaffolds
COG3387Glucoamylase (glucan-1,4-alpha-glucosidase), GH15 familyCarbohydrate transport and metabolism [G] 1.32
COG0744Penicillin-binding protein 1B/1F, peptidoglycan transglycosylase/transpeptidaseCell wall/membrane/envelope biogenesis [M] 0.66
COG3386Sugar lactone lactonase YvrECarbohydrate transport and metabolism [G] 0.66
COG3391DNA-binding beta-propeller fold protein YncEGeneral function prediction only [R] 0.66
COG4953Membrane carboxypeptidase/penicillin-binding protein PbpCCell wall/membrane/envelope biogenesis [M] 0.66
COG5009Membrane carboxypeptidase/penicillin-binding proteinCell wall/membrane/envelope biogenesis [M] 0.66


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001686|C688J18823_10154433All Organisms → cellular organisms → Bacteria1572Open in IMG/M
3300001686|C688J18823_10371139All Organisms → cellular organisms → Bacteria929Open in IMG/M
3300004153|Ga0063455_100614870All Organisms → cellular organisms → Bacteria710Open in IMG/M
3300004479|Ga0062595_100731276All Organisms → cellular organisms → Bacteria801Open in IMG/M
3300005171|Ga0066677_10196510All Organisms → cellular organisms → Bacteria1127Open in IMG/M
3300005171|Ga0066677_10409677All Organisms → cellular organisms → Bacteria775Open in IMG/M
3300005171|Ga0066677_10496527All Organisms → cellular organisms → Bacteria701Open in IMG/M
3300005171|Ga0066677_10542920All Organisms → cellular organisms → Bacteria666Open in IMG/M
3300005174|Ga0066680_10287497All Organisms → cellular organisms → Bacteria1048Open in IMG/M
3300005175|Ga0066673_10001597All Organisms → cellular organisms → Bacteria7875Open in IMG/M
3300005176|Ga0066679_10443329All Organisms → cellular organisms → Bacteria850Open in IMG/M
3300005177|Ga0066690_10190800All Organisms → cellular organisms → Bacteria1361Open in IMG/M
3300005177|Ga0066690_10584053All Organisms → cellular organisms → Bacteria746Open in IMG/M
3300005179|Ga0066684_10064975All Organisms → cellular organisms → Bacteria2140Open in IMG/M
3300005179|Ga0066684_10369143All Organisms → cellular organisms → Bacteria961Open in IMG/M
3300005181|Ga0066678_10140842All Organisms → cellular organisms → Bacteria1500Open in IMG/M
3300005184|Ga0066671_10337242All Organisms → cellular organisms → Bacteria954Open in IMG/M
3300005187|Ga0066675_10783241All Organisms → cellular organisms → Bacteria719Open in IMG/M
3300005328|Ga0070676_10495510All Organisms → cellular organisms → Bacteria866Open in IMG/M
3300005334|Ga0068869_100490371All Organisms → cellular organisms → Bacteria1024Open in IMG/M
3300005343|Ga0070687_100223094All Organisms → cellular organisms → Bacteria1156Open in IMG/M
3300005446|Ga0066686_10277266All Organisms → cellular organisms → Bacteria1136Open in IMG/M
3300005447|Ga0066689_10306580All Organisms → cellular organisms → Bacteria985Open in IMG/M
3300005450|Ga0066682_10515266All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Corynebacteriaceae → Corynebacterium → Corynebacterium frankenforstense757Open in IMG/M
3300005451|Ga0066681_10029881All Organisms → cellular organisms → Bacteria2868Open in IMG/M
3300005467|Ga0070706_100084689All Organisms → cellular organisms → Bacteria2937Open in IMG/M
3300005518|Ga0070699_100128539All Organisms → cellular organisms → Bacteria2233Open in IMG/M
3300005530|Ga0070679_100826950All Organisms → cellular organisms → Bacteria869Open in IMG/M
3300005536|Ga0070697_101043524All Organisms → cellular organisms → Bacteria727Open in IMG/M
3300005540|Ga0066697_10152305All Organisms → cellular organisms → Bacteria1366Open in IMG/M
3300005554|Ga0066661_10170084All Organisms → cellular organisms → Bacteria1340Open in IMG/M
3300005556|Ga0066707_10168553All Organisms → cellular organisms → Bacteria1398Open in IMG/M
3300005557|Ga0066704_10049158All Organisms → cellular organisms → Bacteria → Proteobacteria2657Open in IMG/M
3300005558|Ga0066698_10182490All Organisms → cellular organisms → Bacteria1430Open in IMG/M
3300005559|Ga0066700_10232279All Organisms → cellular organisms → Bacteria1284Open in IMG/M
3300005561|Ga0066699_10422304All Organisms → cellular organisms → Bacteria954Open in IMG/M
3300005561|Ga0066699_10571041All Organisms → cellular organisms → Bacteria812Open in IMG/M
3300005561|Ga0066699_10700066All Organisms → cellular organisms → Bacteria722Open in IMG/M
3300005568|Ga0066703_10466520All Organisms → cellular organisms → Bacteria755Open in IMG/M
3300005569|Ga0066705_10407381All Organisms → cellular organisms → Bacteria852Open in IMG/M
3300005569|Ga0066705_10494352All Organisms → cellular organisms → Bacteria766Open in IMG/M
3300005587|Ga0066654_10109917All Organisms → cellular organisms → Bacteria1346Open in IMG/M
3300005587|Ga0066654_10323373All Organisms → cellular organisms → Bacteria833Open in IMG/M
3300005718|Ga0068866_10569870All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300006031|Ga0066651_10233200All Organisms → cellular organisms → Bacteria976Open in IMG/M
3300006032|Ga0066696_10377307All Organisms → cellular organisms → Bacteria925Open in IMG/M
3300006032|Ga0066696_10586839All Organisms → cellular organisms → Bacteria725Open in IMG/M
3300006800|Ga0066660_10048234All Organisms → cellular organisms → Bacteria2761Open in IMG/M
3300006800|Ga0066660_10214084All Organisms → cellular organisms → Bacteria1480Open in IMG/M
3300006800|Ga0066660_10382412All Organisms → cellular organisms → Bacteria1155Open in IMG/M
3300006852|Ga0075433_10243829All Organisms → cellular organisms → Bacteria1595Open in IMG/M
3300006852|Ga0075433_10673565All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300006854|Ga0075425_100440765All Organisms → cellular organisms → Bacteria1500Open in IMG/M
3300006854|Ga0075425_101089887All Organisms → cellular organisms → Bacteria910Open in IMG/M
3300006854|Ga0075425_102301916All Organisms → cellular organisms → Bacteria599Open in IMG/M
3300006904|Ga0075424_101055888All Organisms → cellular organisms → Bacteria866Open in IMG/M
3300006954|Ga0079219_10792880All Organisms → cellular organisms → Bacteria742Open in IMG/M
3300007265|Ga0099794_10081826All Organisms → cellular organisms → Bacteria1593Open in IMG/M
3300009012|Ga0066710_100978112All Organisms → cellular organisms → Bacteria1305Open in IMG/M
3300009012|Ga0066710_100990511All Organisms → cellular organisms → Bacteria1297Open in IMG/M
3300009012|Ga0066710_102184167All Organisms → cellular organisms → Bacteria809Open in IMG/M
3300009038|Ga0099829_10778811All Organisms → cellular organisms → Bacteria795Open in IMG/M
3300009088|Ga0099830_10808093All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300009090|Ga0099827_10020496All Organisms → cellular organisms → Bacteria4577Open in IMG/M
3300009143|Ga0099792_10109318All Organisms → cellular organisms → Bacteria1466Open in IMG/M
3300009143|Ga0099792_10203130All Organisms → cellular organisms → Bacteria1128Open in IMG/M
3300009792|Ga0126374_10205516All Organisms → cellular organisms → Bacteria1250Open in IMG/M
3300010320|Ga0134109_10246259All Organisms → cellular organisms → Bacteria673Open in IMG/M
3300010321|Ga0134067_10096633All Organisms → cellular organisms → Bacteria1006Open in IMG/M
3300010322|Ga0134084_10095901All Organisms → cellular organisms → Bacteria941Open in IMG/M
3300010325|Ga0134064_10100368All Organisms → cellular organisms → Bacteria952Open in IMG/M
3300010337|Ga0134062_10152696All Organisms → cellular organisms → Bacteria1028Open in IMG/M
3300010364|Ga0134066_10022723All Organisms → cellular organisms → Bacteria1409Open in IMG/M
3300012203|Ga0137399_10451364All Organisms → cellular organisms → Bacteria1075Open in IMG/M
3300012203|Ga0137399_10679873All Organisms → cellular organisms → Bacteria866Open in IMG/M
3300012208|Ga0137376_10215078All Organisms → cellular organisms → Bacteria1664Open in IMG/M
3300012212|Ga0150985_108375125All Organisms → cellular organisms → Bacteria2398Open in IMG/M
3300012582|Ga0137358_10316386All Organisms → cellular organisms → Bacteria1059Open in IMG/M
3300012582|Ga0137358_10452352All Organisms → cellular organisms → Bacteria867Open in IMG/M
3300012683|Ga0137398_10428820All Organisms → cellular organisms → Bacteria903Open in IMG/M
3300012685|Ga0137397_10391921All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300012685|Ga0137397_10797980All Organisms → cellular organisms → Bacteria700Open in IMG/M
3300012918|Ga0137396_10034310All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium3383Open in IMG/M
3300012918|Ga0137396_10447350All Organisms → cellular organisms → Bacteria958Open in IMG/M
3300012918|Ga0137396_10665250All Organisms → cellular organisms → Bacteria769Open in IMG/M
3300012922|Ga0137394_10167133All Organisms → cellular organisms → Bacteria1874Open in IMG/M
3300012925|Ga0137419_10097493All Organisms → cellular organisms → Bacteria2028Open in IMG/M
3300012925|Ga0137419_10242692All Organisms → cellular organisms → Bacteria1354Open in IMG/M
3300012927|Ga0137416_10254448All Organisms → cellular organisms → Bacteria1436Open in IMG/M
3300012927|Ga0137416_10814238All Organisms → cellular organisms → Bacteria826Open in IMG/M
3300012930|Ga0137407_11141594All Organisms → cellular organisms → Bacteria739Open in IMG/M
3300012944|Ga0137410_10346620All Organisms → cellular organisms → Bacteria1186Open in IMG/M
3300012944|Ga0137410_10510319All Organisms → cellular organisms → Bacteria984Open in IMG/M
3300012972|Ga0134077_10180201All Organisms → cellular organisms → Bacteria854Open in IMG/M
3300012975|Ga0134110_10083520All Organisms → cellular organisms → Bacteria1280Open in IMG/M
3300012975|Ga0134110_10211382All Organisms → cellular organisms → Bacteria817Open in IMG/M
3300013297|Ga0157378_10333599All Organisms → cellular organisms → Bacteria1477Open in IMG/M
3300015241|Ga0137418_10140670All Organisms → cellular organisms → Bacteria2134Open in IMG/M
3300015264|Ga0137403_10213774All Organisms → cellular organisms → Bacteria1854Open in IMG/M
3300015358|Ga0134089_10107093All Organisms → cellular organisms → Bacteria1076Open in IMG/M
3300015371|Ga0132258_12656718All Organisms → cellular organisms → Bacteria1250Open in IMG/M
3300018433|Ga0066667_10080462All Organisms → cellular organisms → Bacteria2097Open in IMG/M
3300018433|Ga0066667_10306820All Organisms → cellular organisms → Bacteria1238Open in IMG/M
3300018433|Ga0066667_10432642All Organisms → cellular organisms → Bacteria1071Open in IMG/M
3300018468|Ga0066662_10223105All Organisms → cellular organisms → Bacteria1513Open in IMG/M
3300018468|Ga0066662_10697350All Organisms → cellular organisms → Bacteria968Open in IMG/M
3300018468|Ga0066662_10812663All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300018482|Ga0066669_10473784All Organisms → cellular organisms → Bacteria1079Open in IMG/M
3300024330|Ga0137417_1096632All Organisms → cellular organisms → Bacteria862Open in IMG/M
3300025910|Ga0207684_10178123All Organisms → cellular organisms → Bacteria1833Open in IMG/M
3300025918|Ga0207662_10169039All Organisms → cellular organisms → Bacteria1401Open in IMG/M
3300026220|Ga0209855_1042451All Organisms → cellular organisms → Bacteria788Open in IMG/M
3300026276|Ga0209847_1037592All Organisms → cellular organisms → Bacteria971Open in IMG/M
3300026295|Ga0209234_1037548All Organisms → cellular organisms → Bacteria1842Open in IMG/M
3300026295|Ga0209234_1170938All Organisms → cellular organisms → Bacteria758Open in IMG/M
3300026300|Ga0209027_1094504All Organisms → cellular organisms → Bacteria1080Open in IMG/M
3300026309|Ga0209055_1041836All Organisms → cellular organisms → Bacteria2021Open in IMG/M
3300026314|Ga0209268_1032648All Organisms → cellular organisms → Bacteria1771Open in IMG/M
3300026315|Ga0209686_1038429All Organisms → cellular organisms → Bacteria1802Open in IMG/M
3300026316|Ga0209155_1101493All Organisms → cellular organisms → Bacteria1038Open in IMG/M
3300026322|Ga0209687_1050542All Organisms → cellular organisms → Bacteria1347Open in IMG/M
3300026326|Ga0209801_1072930All Organisms → cellular organisms → Bacteria1517Open in IMG/M
3300026328|Ga0209802_1047033All Organisms → cellular organisms → Bacteria2159Open in IMG/M
3300026328|Ga0209802_1095134All Organisms → cellular organisms → Bacteria1351Open in IMG/M
3300026331|Ga0209267_1009768All Organisms → cellular organisms → Bacteria5287Open in IMG/M
3300026332|Ga0209803_1086077All Organisms → cellular organisms → Bacteria1307Open in IMG/M
3300026507|Ga0257165_1025614All Organisms → cellular organisms → Bacteria1014Open in IMG/M
3300026523|Ga0209808_1127835All Organisms → cellular organisms → Bacteria1029Open in IMG/M
3300026523|Ga0209808_1160575All Organisms → cellular organisms → Bacteria842Open in IMG/M
3300026528|Ga0209378_1040638All Organisms → cellular organisms → Bacteria → Proteobacteria2349Open in IMG/M
3300026528|Ga0209378_1182277All Organisms → cellular organisms → Bacteria747Open in IMG/M
3300026532|Ga0209160_1055037All Organisms → cellular organisms → Bacteria2266Open in IMG/M
3300026540|Ga0209376_1023939All Organisms → cellular organisms → Bacteria3972Open in IMG/M
3300026540|Ga0209376_1256308All Organisms → cellular organisms → Bacteria743Open in IMG/M
3300026542|Ga0209805_1032735All Organisms → cellular organisms → Bacteria2657Open in IMG/M
3300026542|Ga0209805_1173799All Organisms → cellular organisms → Bacteria963Open in IMG/M
3300026550|Ga0209474_10170837All Organisms → cellular organisms → Bacteria1404Open in IMG/M
3300026551|Ga0209648_10599558All Organisms → cellular organisms → Bacteria607Open in IMG/M
3300026552|Ga0209577_10033215All Organisms → cellular organisms → Bacteria4469Open in IMG/M
3300027663|Ga0208990_1008280All Organisms → cellular organisms → Bacteria3458Open in IMG/M
3300027669|Ga0208981_1101476All Organisms → cellular organisms → Bacteria734Open in IMG/M
3300027671|Ga0209588_1167605All Organisms → cellular organisms → Bacteria692Open in IMG/M
3300027882|Ga0209590_10171717All Organisms → cellular organisms → Bacteria1357Open in IMG/M
3300027903|Ga0209488_10295166All Organisms → cellular organisms → Bacteria1210Open in IMG/M
3300027903|Ga0209488_10509217All Organisms → cellular organisms → Bacteria881Open in IMG/M
3300028536|Ga0137415_10521700All Organisms → cellular organisms → Bacteria996Open in IMG/M
3300031820|Ga0307473_10051775All Organisms → cellular organisms → Bacteria1950Open in IMG/M
3300031820|Ga0307473_10715024All Organisms → cellular organisms → Bacteria705Open in IMG/M
3300032180|Ga0307471_100202011All Organisms → cellular organisms → Bacteria1990Open in IMG/M
3300032205|Ga0307472_100843909All Organisms → cellular organisms → Bacteria842Open in IMG/M
3300032782|Ga0335082_10250578All Organisms → cellular organisms → Bacteria1654Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil34.44%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil21.85%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil9.27%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil6.62%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil6.62%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.65%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.65%
Permafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost Soil1.32%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil1.32%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.32%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.32%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.66%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.66%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.66%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.66%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.66%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.66%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.66%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.66%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001686Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005343Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005718Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2Host-AssociatedOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009792Tropical forest soil microbial communities from Panama - MetaG Plot_12EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025918Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026220Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-063 (SPAdes)EnvironmentalOpen in IMG/M
3300026276Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil DNA_2013-060 (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026314Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)EnvironmentalOpen in IMG/M
3300026315Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)EnvironmentalOpen in IMG/M
3300026316Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes)EnvironmentalOpen in IMG/M
3300026322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026523Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027663Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C688J18823_1015443333300001686SoilSIAKLEALDLAVEAFGPKPVRELILPPGNCVRWIWATKEQRRSHPSSYCPTDVLPATHPMSFDEAVARSVNSLTTRHAVLLPVLLLERRPDLFRDLSATVPAEERRSLDSPADRALATGLFAQLGETISPEQVPEEVSYSAASVALFRYLKERRERAGLPAERLPDDPTSLLGNSSRATPEQIGSYLHRKLFVNDGSCTLTDTGALLALRRKEGTLRWLAQRWPKLVLAGKTGSSPHDDSAVAAVGICLDARPVVLVGALRPLEGPLPTGLRGSVVLRGLDAYLKELVRLERRPTSALWPAWVEEELAEKQAAAGIAPAPAATPVPATLGAALATKGKP*
C688J18823_1037113913300001686SoilKPMAFDEAVARSVNSLTARHAILLPALLAQRRPDLLREIAAEVSAAERLALDSPADRALAGDLLAQLGAPVPPDAIAPELSYSAASVALFRYLKDRRQRAGLPADRLPEDPTSLLGNSSRATAEQIASYLHRKLFANDGTCALSDTGALLALHRREGTLRWLAARWPKLLFSGKTGSSPHDDSAVAAVGLCLDARPVMLVGALRPVQAPLPDGLQGSVLLRGIDGYLKELSRLERKPGSALLPAWAESGPELAVEAQP*
Ga0063455_10061487013300004153SoilLTTRHAILLPVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALASGLLAQLGETLAPDDVPPDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATSEQIGAYLHRKLFSRDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGSLRPVEGTLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIA
Ga0062595_10073127613300004479SoilAMLLPVLLAQRRPDLMLQVSAEVQDEERAALDSPADRALAGSLLAQLGEAVPPDEIAPELSYSAASVALFRYLKARREEAGLPAQRLPDDPTSLLGNSSRATAEQVGAYLHRKLFAPDGSCTLSDTGALLALHRREGTLRWLAQRFPKLVFAGKTGSSPHDDSAVAAVALCLDARPVVLVAALRPIEGHLPMGLHGSILLRGIDAYLRELGRLQRRAASALLPSWAEGPEPALTAEVKP*
Ga0066677_1019651023300005171SoilRAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALVGIVVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER*
Ga0066677_1040967713300005171SoilAVARSINSLTARHAMLLPPLLAQRRPDLLRRMAAELSAGERAALDSPADRALAGDLLAQLGDAMPPEAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066677_1049652713300005171SoilHPMAFDEAVARSVNSLTARHALMLPVLFAQRRPDLASELSREVSDQERAALDSPADRALSAGLFAQLGETLQPDEVSAELSYSAAAVALFRHLKLRREQAGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRRDGTLRWLAQRWPKLVFAGKTGSSPHDDSAVAAVALCLESRPVVLVAALRALSGALPEGLRGSVLLRGIDGYMRELVRL
Ga0066677_1054292013300005171SoilVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALTSGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVE
Ga0066680_1028749713300005174SoilAFAALGSDGSVLARSGAESALMAVNYGSVAKLEALDLAVEAFGPAAVRDLMLPPGGCLRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGALPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066673_1000159713300005175SoilVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALTSGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAALPIDPPRVESALATKGKP*
Ga0066679_1044332913300005176SoilVAKSVNSLTARHAVLLPVLLSLRRPEVFRDLVSTVPDEERRSLDSPADRALASGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKERRELAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAALPIDPPRVESALATKGKP*
Ga0066690_1019080013300005177SoilDSPADRALTAGLLGQLGEAIAPQDVQPDLAYSAAGLALFRHLKDRRERAGLPALRLPDDPTSLLGNSSRATTEQIGAYLMKRLFAPGSCVLSDTGALLALHRREGTLRWLAARWPKLVFAGKTGTSPHDDSAVAAVGLCLDARPVILVSALRPVGGALPDGLHGSALLRGIDAYLRELARLERRPAPALWPAWAEEELVARQASSFERFDRFDNFDNDDQRRLTLDDRRPIRVETPP*
Ga0066690_1058405313300005177SoilVRDLALPAGDCVRWIWATKDLRRSRPASYCPADVSPAAHAMALDEAVARSINSLTTRHAILLPLLLAQRRPDLLREVSAEVSPEERAALDSPADRALAGGLFAQLGEAVPPDEVPPELSYSAAGVALFRYLKIRREQAGLPAQRLPDDPTSLLGNSSRATAEQIGGYLHRKLFVNDGSCTLSDTGALLALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVVLVAALRPLQDPL
Ga0066684_1006497533300005179SoilRSGAEAALMAVNYGSVAKLEALDLAVEAFGPGAVRELTLPPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066684_1036914313300005179SoilRSGAEAALMAVNYGSVAKLEALDLAVEAFGPGAVRELTLPPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAEISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLIAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQE
Ga0066678_1014084233300005181SoilPLLSRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER*
Ga0066671_1033724213300005184SoilPVNYGSVAKLEALDLAVEAFGPRAVQELTLPPAGCVRWIWATKDLRRSRPASYCPADVTPATHPMSFDEAVARSINSLTTRHAVLLPVLFAQRRPDLLSALMAEVPAEERAALDSPADRALAGNLLGQLGEVVRPEEVPPEISYSAASIALFRHLKMRRERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALWPAWTEEELAAAAQP
Ga0066675_1078324113300005187SoilPSSYCPNDVLPATHPMSLDEAVAKSVNSLTTRHAVLLPVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALTSGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSY
Ga0070676_1049551013300005328Miscanthus RhizosphereEAFGQQAVRELTLPAGACVRWIWTTKDARRTHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMMLPALLSQRRPDLLREVAAEVGSDERAALDSPADRALAGDLFAQLGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLP
Ga0068869_10049037123300005334Miscanthus RhizosphereVRWIWTTKDARRTHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMMLPALLSQRRPDLLREVAAEVGSDERAALDSPADRALAGDLLAQLGAPVPPDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP*
Ga0070687_10022309433300005343Switchgrass RhizosphereAEVGSVERAALDSPADRALAGDLLAQLGAPVPPDAIPEELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP*
Ga0066686_1027726623300005446SoilVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066689_1030658023300005447SoilAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0066682_1051526613300005450SoilEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTIAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRMVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP*
Ga0066681_1002988113300005451SoilRRPDLLRELAAEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTIAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRMVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP*
Ga0070706_10008468913300005467Corn, Switchgrass And Miscanthus RhizosphereEAVRELSLPPGPCVRWTWSTKELRRSHPSRYCPTDVAPATRPMTLDEAVARSVNSMTARHALMLPALLALRRPDLLREVASEVPAEERAALDSPADRAIAGDLLSQLGEAVPPDAVSPELSYSAAAVVLFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLHRKLFANDGTCTLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSALAAVGLCLDSRPVVLVGALRPIEPPLPDGLQGSVLLRGIDAYLKELVRLQRKPGSALLPAWAEPQPALAAEAKP*
Ga0070699_10012853913300005518Corn, Switchgrass And Miscanthus RhizosphereKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRQMAAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATVEQIGAYLHRKLFANDGTCTLSDTGALLAVRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP*
Ga0070679_10082695013300005530Corn RhizosphereAMDEAVARSINSMTARHAILLPALLAQRRPDLLREISAEVQGPERAALDSPADRALAGDLLRQLGESIPPDAIAPELSYSAAGVALFRYLRDRRERAGLPAGRLPEDPTSLLGNSSRATPEEIGAYLHRKLFRNDGTCALSDTGALLALHRKEGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGLCLDARPVVLIAALRPLQAPLPDGLQGSVLLRGIDAYLRELSRLDRKPAPAILPAWADPEAQLPAEANP*
Ga0070697_10104352413300005536Corn, Switchgrass And Miscanthus RhizosphereSLPPGGCVRWIWSTKELRRSHPSHYCPTDVAPATRPMTLDEAVARSVNSMTARHAMMLPALLALRRPDLLREVASEVPAEERAALDSPADRALAGDLLSQLGESVPPDAVAPELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGSCTLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDSRPVVLIGALRPI
Ga0066697_1015230533300005540SoilLAVEAFGPHAVMELPLPPAPCVRWIWATKDQRRARAASYCPTDVVPASHPMPFDEVVARSINSLTARHALLLPILFAQRRPDLLAEISKEVGDEERAALDSPADRALSATLFAQLGQTVQPDEIASDLSYSAASVALFRHLKSRRERGGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADETPAEELTVEAKR*
Ga0066661_1017008423300005554SoilYGSLAKLEALDFAVEAFGPRAVRELTLPPASCVRWIWARRPANYCPTDVAPPSRPMAFDEAVARSVNALTARHALLLPALLWQRRPDLLRTVSATLPVEERDALDSPADRALAADLFAQLGEIVPPDQVGPDLSYTAAGVALFRFLKARREEAGLPAALLPDDPTSLLGNSSRATAEQIGHYLHRKLFSDSSCALSDAGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLVAALRPLQPPLPKGLHGSVLLRGLDAYLRELVRLERRPTSLMLPAWAEENIAEPSAVAAGGIPSFAGDGEKR*
Ga0066707_1016855313300005556SoilALGSDGAVLARSGAEAALMAVNYGSVAKLEALDLAVEAFGPGAVRDLTLPPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066704_1004915813300005557SoilTVRERMLPPGGCVRWIWATKSQRGASPASYCPADVTPPARGMSFDEAVARSINSMTVRHALLLPPLLSRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER*
Ga0066698_1018249013300005558SoilLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066700_1023227913300005559SoilTVRHALLLPPLLSRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER*
Ga0066699_1042230423300005561SoilDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGEPIPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDAALAAVGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPASAVLPAWAEQETQVALEAKP*
Ga0066699_1057104113300005561SoilCVRWIWATKDLRRSRPASYCPADVTPATHPMSFDEAVARSINSLTTRHAVLLPVLFAQRRPDLLSALVAEVPAEERAALDSPADRALAGNLLAQLGEVVRPEEVPPEISYSAASIALFRHLKMRRERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALW
Ga0066699_1070006613300005561SoilRARAASYCPTDVIPAAHPMAFDEAVARSVNSLTARHALMLPVLFAQRRPDLASELSREVSDQERAALDSPADRALSAGLFAQLGETLQPDEVSAELSYSAAAVALFRHLKLRREQAGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRRDGTLRWLAQRWPKLVFAGKTGSSPHDDSAVAAVALCLESRPVVLVAALRALSGALPEGLRGSVLLRGID
Ga0066703_1046652013300005568SoilSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066705_1040738113300005569SoilMLPPGGCVRWIWATKSQRGASPASYCPADVTPPARAMSFDEAVARSINSMTVRHALLLPPLLSRWRPDLLHEIAAEVPSCERAALDSLADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALVGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIE
Ga0066705_1049435213300005569SoilLDLAVESFGPQSVRDLILPPGGCVRWIWATKEQRRSHPSSYCPNDVLPATHPMSLDEAVAKSVNSLTTRHAVLLPVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALTSGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKKSREAAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALR
Ga0066654_1010991733300005587SoilMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0066654_1032337313300005587SoilNYGSVAKLEALDLAVEAFGPAQVRDLMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMAAELSAGERAALDSPADRALAGDLLAQLGDAMPPEAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGS
Ga0068866_1056987013300005718Miscanthus RhizosphereTDVAPATRPMALDEAVARSINSLTARHAMMLPALLSQRRPDLLREVAAEVGSDERAALDSPADRALAGDLLAQLGAPVPTDAIPGELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPAWA
Ga0066651_1023320013300006031SoilPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMTFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTIAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP*
Ga0066696_1037730713300006032SoilFDEAVARSINSLTTRHAVLLPVLFAQRRPDLLSALVAEVPAEERAALDSPADRALAGNLLGQLGEVVRPEEVPPEISYSSASIALFRHLKMRRERAGLPSERLPDDPTSLLGNSSRATVEQIGAYLHRKLFAGDGTCTLSDTGALLALHRREGTLRWLALRWPKLVFSGKTGSSPHDDSAVAAVALCLDARPVVLAAGLRPLQGTLPTGLRGSVLLRGLDAYLHELVKLQRPPAPALWPAWAEEELAAAAQPPETPSLAVEEKR*
Ga0066696_1058683913300006032SoilGASPASYCPADVTPPARAMSFDEAVARSINSMTVRHALLLPPLLSRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPRLVFAGKTGSSPHDDSAVAGIAVCLDERPVVLVAALRPLQAPLPDGLHGSVLLRGLD
Ga0066660_1004823413300006800SoilAFAALGSDGSVLARSGAESALMAVNYGSVAKLEALDLAVEAFGPAAVRDLMLPPGGCLRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP*
Ga0066660_1021408423300006800SoilSALMAVNYGSVAKLEALDLAVEAFGPRTVRDLALPAGDCVRWIWATKDLRRSRPASYCPADVSPAAHAMALDEAVARSINSLTTRHAILLPLLLAQRRPDLLREVSAEVSPEERAALDSPADRALAGGLFAQLGEAVPPDEVPPELSYSAAGVALFRYLKIRREQAGLPAQRLPDDPTSLLGNSSRATAEQIGGYLHRKLFVNDGSCTLSDTGALLALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVVLVAALRPLQDPLPTGLRGSVLLRGMDAYLRELSRLERRLTSAVLPGWTEEGGTPKPPPGLSPMANDVRSGEEKP*
Ga0066660_1038241213300006800SoilVNYGSVAKLEALDFAVEAFGPHAVRDLPLAPAPCVRWIWATKDQRRARAASYCPTDVIPAAHPMAFDEAVARSVNSLTARHALMLPVLFAQRRPDLASELSREVSDQERAALDSPADRALSAGLFAQLGETLQPDAVSAELSYSAAAVALFRHLKLRREQAGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRRDGTLRWLAQRWPKLVFAGKTGSSPHDDSAVAAVALCLESRPVVLVAALRALSGALPEGLRGSVLLRGIDGYMRELVRLQRPPLSALWPEWAGEESAPETPSAEDPRTLRARINEKMKQSEEQLRRKNVGEETQKIQEQLLRDIDKLLELAREQPPPSQGSPPDAPQSK
Ga0075433_1024382913300006852Populus RhizosphereEAVLRAEFPSTSVRTAFAALGAKGAVLARSGAESALMALNYGSVAKLEALDLAVEAFGPEAVRELSLPPGSCVRWIWSTKELRRSHPSKYCPTDVAPPGKPMALDEAVARSVNSMTARHAMMLPALLSQRRPDLWTEMAAEVPAGERAALDSPADRALTGDLLAQLGAPVPPDAVSPELSYSAAGVSLFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGAYIHRKLFVNDGTCTLSDTGALLALHRREGTLRWLAYRWPKIVFSGKTGSSPHDDSALTAVGLCLDSRPVVLVAALRPIQPPLPDGLHGSVLLRGIDAYLKELVRLQRRPTSALLPQWACAADPSQSCPVETVEARP*
Ga0075433_1067356513300006852Populus RhizospherePPGGCVRWIWATKSQRGSSPASYCPSDVTPSARAMSFDEAVARSINSMTVRHALLLPPLLSRWRPDLLQEVASEVPPCERAALDSPADRALASGLFAQLGETLRPEEVAPELSYSTAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACLDERPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVASLALPAWAEDTVAPPPVPETISTAAMPAEKEE
Ga0075425_10044076533300006854Populus RhizosphereRSQPSRYCPTDVAPATRPMAMDEAVARSVNSMTARHAMMLPALLALRRPDLLREVASEVPAEERAALDSPADRALAGDLLSQLGEVVPPDAVAPELSYSAAAVALFRYLKDRREQAGLPAGRLPEDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEPPLPDGLQGSVLLRGIDAYLKELVRLQRRPTSALLPQWACAADPSQSCPVETVEARP*
Ga0075425_10108988713300006854Populus RhizosphereGPCVRWTWSTKELRRSHPSRYCPTDVAPATRPMTLDEAVARSVNSMTARHALMLPALLALRRPDLLREVASEVPAEERAALDSPADRAIAGDLLSQLGEAVPPDAVSPELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLHRKLFANDGTCTLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSALAAVGLCLDSRPVVLVGALRPIEPPLPDGLQGSVLLRGIDAYLKELVRLQRKPGSALLPAWAEPQPALAVEAKP*
Ga0075425_10230191613300006854Populus RhizosphereLLSQRRPDLWTEMAAEVPAGERAALDSPADRALTGDLLAQLGAPVPPDAVSPELSYSAAGVSLFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGAYIHRKLFVNDGTCTLSDTGALLALHRREGTLRWLAYRWPKIVFSGKTGSSPHDDSALTAVGLCLDSRPVVLVAALRPIQPPLPDGLHGSVLLRGIDA
Ga0075424_10105588813300006904Populus RhizosphereALGGNGAVLARSGAEAALMAVNYGSIAKLEALDLAVEAFGPDAVRELLLPPGGCVRWIWSTKDLRRSHPAHYCPTDVAPAARPMAMDEAVARSVNSMTTRHALLLPALFAQRRPDVLREIAAEVPAAERAALDSPADRALAGDLLAQLGQATPPDAIPPELSYSAASVGLFRYLRDRRERAGLPAERLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCVLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSAVAAVALCLDARPVVLIGALRPLQ
Ga0079219_1079288013300006954Agricultural SoilWCPEDVAPATHPMSFDEAVARSVNSMTARHAMLLPVLLAQRRPDLLLQLSAEVPAPERAALDSPADRALSGGLLAQLGEAVPPDGIAPELSYSAASVALFRYLKARREQAGLPAQRLPDDPTSLLGNSSRATAEQVGAYLHRKLFAKDGSCTLSDTGALLALHRREGTLRWLAQRFPKLVFAGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPVEGHLPMGLHGSVLLRGIDAYLRELKRLQRQA
Ga0099794_1008182613300007265Vadose Zone SoilADVTPPARAMSFDEAVARSINSMTVRHALLLPALFSQQRPDLLREVAAEVPPCERAALDSPADRALAGGLLAQLGETVTPDQVAPELSYSAAGVALFRHLRERREAAGLPAGRLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGEGACALTDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIALCLDARPVVLVAALRPLQGPLPDGLHGSVLLRGLDAYLRELRRLDRRVSSAALPAWAEEIDAPAEGRDPISVAATPVEKEER*
Ga0066710_10097811213300009012Grasslands SoilEISKEVGDEERAALDSPADRALSATLFAQLGQTVQPDEIASDLSYSAASIALFRHLKDRREHAGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADEAPAEELTVEAKR
Ga0066710_10099051113300009012Grasslands SoilLMADVAAAISPEERAALDSPADRALAADLFAQLGETIPPGDVAPDLSYSAAGVALFRYLKARRERAGLPSERLPDDPTSLLGNSSRATAEQIGAYLHRRLFAGDGSCALSDTGALIALHRSVGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAALGICLAARPVIMVAALRPVQARLPLGLHGSILLRGLDAYLRELGRLERRPTSLMLPPWATEEEAPPAPAVAAEEKR
Ga0066710_10218416713300009012Grasslands SoilYCPNDVAPATRPMALDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLEPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0099829_1077881123300009038Vadose Zone SoilELSAEERAALDSPADRALAGDLFALVGEAVAPDDVAPDLSYSAAGVALFRFLKARRERAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLFANDGYCALSDTGALIALHRAEGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGICLDARPVLLVAALRPVQGTLPIGLHGSVLLRGLDAYLRELARLERRPTSLVLPAWASEEEAPLTPALAAEEKR*
Ga0099830_1080809313300009088Vadose Zone SoilMAFDEAVARSVNSLTTRHAVMLPLLLALRRPDLLKQMAAEVPPEERAALDSPADRALAGGLFAQLGESVPPDEVAPELSYSAAGIALFRYLKVRRERAGLPAERLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGTCALSDTGALLALRRKEGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLSGVLPTGLRGSVVLRGIDAYLKEMVRLERPPTPALM
Ga0099827_1002049653300009090Vadose Zone SoilLREVSAELSAEERAALDSPADRALAGDLFAQVGEVVAPDDVAPDLSYSAAGVALFRFLKARRERAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLFANDGSCALSDTGALIALHRAEGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGICLDTRPVLLVSALRPVQGSLPTGLHGSMLLRGLDAYLRELARLERRPTSLVLPAWASEEEAPLTPALAAEEKR*
Ga0099792_1010931813300009143Vadose Zone SoilINSMTVRHALLLPPLLSQWRPDLLQEVASEVPPCEREALDSPADRALASGLLAQLGETLTPDQVAPELSYSAAGVALFRHLKARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAACIDERPVVLVAALRPLHAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSVALPAWAEETYAPSPAAETVTTAGVPAEKEER*
Ga0099792_1020313013300009143Vadose Zone SoilVARSINSMTARHAMLLPALLARSRPDLLRELAAEVPPEERASLDSPVDRALAGELLSQVGEAVAPDAVSPELSYSTAAIGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGIDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0126374_1020551633300009792Tropical Forest SoilQRRPDLLAEISQDLSEEERSALDSPADKALTAALFAQLGQTVQPDEIAADLSYSAASVALFRHLARRRQGAGLPSAMLPEDPTSLLGNSSRATAEQIGTYLHHKLFAGDGSCTLSDTGALLGLHRKEGTLRWLAQRWPRLVFSGKTGSSPHDDSAVAAVALCLDARPVVVVAALRALAGSLPDGLRGSVLLRGIDAYLKQLGKIERKPSSALWPSFVVDEAPVEEAKG*
Ga0134109_1024625913300010320Grasslands SoilALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYL
Ga0134067_1009663313300010321Grasslands SoilLTARHAMLLPPLLAQRRPDLLRRMAAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVISGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQLAVEARP*
Ga0134084_1009590123300010322Grasslands SoilSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0134064_1010036813300010325Grasslands SoilVLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0134062_1015269623300010337Grasslands SoilSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTIAAVGLFRYSKARREHAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP*
Ga0134066_1002272333300010364Grasslands SoilIAKLEALDLAVEAFGPAAVRELLLPPGACVRWIWATKDLRRSHPSRYCPTDVAPASRAMGFDEAVARSVNSMTTRHALLLPALLAQRRPDLMAELSAEVPAAERAALDSPADRALAGDLLAQLGEAVPPDAIPPELSYSAAGVALFRYLKDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELVRLQRRPASAVLPPWANPQPPLSAEATP*
Ga0137399_1045136413300012203Vadose Zone SoilMALDEAVARSVNSMTTRHAILLPALLAQRRPDLLREVAKELSPQERAALDSPADRALAGDLLAQLGEALPPDSIPPELSYSAAGVALFRYLRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAEIAAEATP*
Ga0137399_1067987313300012203Vadose Zone SoilTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137376_1021507833300012208Vadose Zone SoilRRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISAGERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKLVFSGKTGSSPHDDAALAAVGLCLDARPVLLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELGRLDRRPGSAILPAWAEQETQVALEAKP*
Ga0150985_10837512523300012212Avena Fatua RhizosphereVRWIWATKDQRRLRPASWCPEDVTKATHAMGFDEAVARSINSMTARHAMLLPVLLAQRRPDLLVHISAEVPAAERAALDSPADRALSGSLLAQLGETVPPDAIAPELSYSAASVALFRYLKERREQAGLPAQRLPDDPTSLLGNSSRATAEQVGAYLHRKLFAHDGTCMLSDTGALLAQRRKEGTLRWLAQRWPKLLFAGKTGSSAHDDSAVAAVGLCLDARPVVLVAALRPLDGPLPIGLRGSVLLRGIDAYLRELGRLQRSPTPALLPEWAQEPDPTVTAEVRP*
Ga0137358_1031638613300012582Vadose Zone SoilYGSVAKLEALDLAVEAFGPGAVRDLALPPGGCVRWIWSTKEQRRSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPADRALAGELLSQVGEAVPPDAVSPELSYSTAAVGLFRYLRDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVLLVAALRPLQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137358_1045235213300012582Vadose Zone SoilRPDLLREVAKELSPQERAALDSPADRALAGDLLAQLGEALPPDSIPPELSYSAAGVALFRYLRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPASALWPAWAEPPAPAEIAAEATP*
Ga0137398_1042882013300012683Vadose Zone SoilEVASEVPPCERAALDSPADRALASGLLAQLGETLRPDQVAAELSYSAAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFSGKTGSSPHDDSAVAGIAACLDERPVVLVAALRPLHAPLPDGLHESVLLRGLDAYLRELRRLDRRVTSVALPAWAEQTYAPSPAAETVTTAGVPAEKEER*
Ga0137397_1039192123300012685Vadose Zone SoilFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPVDRALAGELLSQVGEAVPPDAVSPELSYSTAAIGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGASLPLQLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137397_1079798013300012685Vadose Zone SoilQRRARAASYCPADVVPASQPMAFDESVARSINSLTVRHALLLPVLFAQRRPDLLAEISREVADEERAALDSPADRALTAALFAQLGQTMQPDEIPSDLSYSAASVALFRHLNRRREQSGLPSSMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLAQRWPKLAFSGKTGSSPHDDSAVAAVAICLDARPVVLVSALRALSGPLPEVLR
Ga0137396_1003431013300012918Vadose Zone SoilMSFDEAVARSINSMTVRHALLLPPLLWQWRPDLLQEIVAEMPPCDRAALDSPADRALAGGLLAQLVETVTPDQVAPELSYSAAGVALFRHLRERREAAGLPAGRLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGEGACALTDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIALCLDARPVVLVAALRPLQGPLPDGLHGSVLLRGLDAYLRELRRLDRRVSSAALPAWAEEIDAPAEGRDPISVAATPVEKEER*
Ga0137396_1044735013300012918Vadose Zone SoilVRDLALPPGGCVRWIWSTKELRQSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137396_1066525013300012918Vadose Zone SoilAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMTFDEAVARSINSMTARHAMLLPALLARRRPDLLRKLAAEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPLQAPLPDGLQG
Ga0137394_1016713313300012922Vadose Zone SoilGSVAKLEALDLAVEAFGPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMTFDEAVARSINSMTARHAMLLPALLARRRPDLLRKLAAEVPPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEQETRLAAEANP*
Ga0137419_1009749333300012925Vadose Zone SoilVKAGDVVCKLNTQQIEQLVAQEQLAVESSRANLAVADNALSIQESDNDSALRKATLDVEVSQLEFRQWLEGDVKSKRQALDLAVEAFGPEAVRELMLPPGACVRWIWSTKELRRSHPSRYCPTDVAPATRPMALDEAVARSVNSMTTRHAILLPALLAQRRPDLLREVAKELSPQERAALDSPADRALAGDLLAQLGEALPPDSIPPELSYSAAGVALFRYLRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAEIAAEATP*
Ga0137419_1024269223300012925Vadose Zone SoilLMAVNYGSIAKLEALDLAVEALGPRTVRERTLPPGACVRWIWATKSQRGASPASYCPADVTPAARAMSFDEAVARSINSMTVRHALLLPPLLWQWRPDLLHEIAAGMPPCDRAALDSPADRALTGGLLAQLGETVTPDQVAPELSYSAAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATTEQIGGYLHRKLLAGDGSCTLSDTGALLALHRRVGTLRWLAWRWPKLVFAGKTGSSPHDASAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSAALPPWAEEEIAARAAPEAMREAFSDDATPTGKEER*
Ga0137416_1025444823300012927Vadose Zone SoilAEAALMAVNYGSVAKLEALDLAVEAFGPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPLQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0137416_1081423813300012927Vadose Zone SoilRWIWATKEQRRSHPSSYCPSDVLPATHPMSFDEAVARSVNSLTARHAVLLPVLLSRRRPEVFRDLASTVPDEERRSLDSPADRALASGLLAQLGETLAPDDVPQDLSYSAAGVALFRYLKERRELAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVGVCLDQRPVVLVGALRPLEGALPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEE
Ga0137407_1114159413300012930Vadose Zone SoilYCPADVTPSARAMSFDEAVARSINSMTVRHALLLPPLLSQWRPDLLQEIAAEMPPCERAALDSPADRALASGLFAQVGEILRPDQVAPELSYSAAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGDGSCVLSDTGALLAAHRRVGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAVCLDERPVVLVAALRPLQAPLPDGLHGSALLRGLDAYLRELRRLDRR
Ga0137410_1034662023300012944Vadose Zone SoilSGAESALMAVNYGSVAKLEALDLAVEAFGPEAVRELSLPPGPCVRWTWSTKELRRSQPSRYCPTDVAPATRPMAMDEAVARSVNSMTARHAMMLPALLALRRPDLLREVASEVPAEERAALDSPADRALAGDLLSQLGEVVPPDAVAPELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEPPLPDGLQGSVLLRGIDAYFKELVRLQRKPGSALLPAWAEPQPALAVEAKP*
Ga0137410_1051031913300012944Vadose Zone SoilDLAVEAFGPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPSDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGGYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFTGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP*
Ga0134077_1018020113300012972Grasslands SoilPPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAVGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELGRLDRRPGSAVLPAWAEQEIQV
Ga0134110_1008352013300012975Grasslands SoilAALGSDGAVLARSGAESALMAVNYGSVAKLEALDLAVEAFGPGAVRELTLQPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0134110_1021138213300012975Grasslands SoilARHAVLLPVLLFRRRPEVFRDLASTVPEEERRSLDSPADRALASGLLAQLGETLAPDDVPEDLAYSAAGVALFRYLKERREQAGLPAERLPDDPTSLLGNSSRATPEQIGAYLHRKLFLKDGSCTLSDTGSLLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVAICLDQRPVVLVGALRPLEGSLPLGLRGSVVLRGLDSYLKALVRLDRRPTSALWPAWVEEEIAAKQAALPIDPPRVESALATKGKP*
Ga0157378_1033359913300013297Miscanthus RhizosphereAVEAFGQQAVRELTLPAGACVRWIWTTKDARRTHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMMLPALLSQRRPDLLREVAAEVGREERAALDSPADRALAGDLLAQLGAPVPPDAIPEELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEASP*
Ga0137418_1014067033300015241Vadose Zone SoilESALMAVNYGSVAKLEALDLAVEAFGPEAVRELMLPPGACVRWIWSTKELRRSHPSRYCPTDVAPATRPMALDEAVARSVNSMTTRHAILLPALLAQRRPDLLREVAKELSPQERAALDSPADRALAGDLLAQLGEALPPDSIPPELSYSAAGVALFRYLRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAEIAAEATP*
Ga0137403_1021377413300015264Vadose Zone SoilEALDLAVEAFGPEAVRELSLPPGPCVRWTWSTKELRRSQPSRYCPTDVAPATRPMAMDEAVARSVNSMTARHAMMLPALLALRRPDLLREVASEVPAEERAALDSPADRALAGDLLSQLGEVVPPDAVAPELSYSAAAVALFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATAEQIGGYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPIEAPLPDGLQGSVLLRGIDAYFKELVRLQRKPGSALLPAWAEPQPALAVEAKP*
Ga0134089_1010709323300015358Grasslands SoilAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP*
Ga0132258_1265671813300015371Arabidopsis RhizosphereDLLREVAAEVGREERAALDSPADRALAGDLLAQLGAPVPPDAIPEELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPAWAEADPQAPVEASP*
Ga0066667_1008046213300018433Grasslands SoilYGSVAKIEALVLAVEAFGAAQVRDLMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0066667_1030682013300018433Grasslands SoilHQTRGDSHTAGVTQFAPQSDVSAANPPAAISVGTTDVAPATRPMTFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVRPEERASLDSPADRALGGELLSQVGEAVPPDAVSPELSYTIAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPEGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP
Ga0066667_1043264213300018433Grasslands SoilPLLSRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPRLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER
Ga0066662_1022310533300018468Grasslands SoilRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGALPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP
Ga0066662_1069735023300018468Grasslands SoilPAAHAMALDEAVARSINSLTTRHAILLPLLLAQRRPDLLREVSAEVSPEERAALDSPADRALAGGLFAQLGEAVPPDEVPPELSYSAAGVALFRYLKIRREQAGLPAQRLPDDPTSLLGNSSRATAEQIGGYLHRKLFVNDGSCTLSDTGALLALHRREGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAAGICLDARPVVLVAALRPLQDPLPMGLRGSVLLRGMDAYLRELSRLERRLTSAVLPGWTEEGGTPKPPPGLSPMANDVRSGEEKP
Ga0066662_1081266323300018468Grasslands SoilWQRRPDVLRELSAQVPGDEWAALDSPADRALGGAQLAQLGESVPPDEVAPELSYSAAGIGLFRLLRERREAAGLPAQRLPDDPTSLLGNSSRATTEQIGAYLHRKLFQHDRSCFLSDTGALLALRRKDGTLRYLAQRYPKLIFAGKTGSSPHDDSAVAAIAVCIDARPVVLIAALRPLSGTLPVGLRGSVLLRGLDAYLRELGRLDRRPTSANLPEWAIEPSTEDKG
Ga0066669_1047378413300018482Grasslands SoilRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0137417_109663213300024330Vadose Zone SoilDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPVDRALAGELLSQVGEAVAPDAVSPELSYSTAAIGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPLQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0207684_1017812313300025910Corn, Switchgrass And Miscanthus RhizosphereEAVRELSLPPGPCVRWTWSTKELRRSHPSRYCPTDVAPATRPMTLDEAVARSVNSMTARHALMLPALLALRRPDLLREVASEVPAEERAALDSPADRAIAGDLLSQLGEAVPPDAVSPELSYSAAAVVLFRYLKDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGGYLHRKLFANDGTCTLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSALAAVGLCLDSRPVVLVGALRPIEPPLPDGLQGSVLLRGIDAYLKELVRLQRKPGSALLPAWAEPQPALAAEAKP
Ga0207662_1016903913300025918Switchgrass RhizosphereGQQAVRELTLPAGACVRWIWTTKDARRTHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMMLPALLSQRRPDLLREVAAEVGSDERAALDSPADRALAGDLLAQLGAPVPPDAIPEELSYSAAGVALFRYLKDRRERAGLPSDRLPEDPTSLLGNSSRATAEQIGTYLHRKLFANDGTCALTDTGALLALRRREGTLRWLAQRWPRLVFSGKTGSSPHDDSALAAVGICLDQRPVVLVAALRPLQGTLPDGLHGSLLLRGIDAYLRELTRLDRKPAAALLPSWAQADPQTPVEALP
Ga0209855_104245113300026220Permafrost SoilSVSKLELLNYAVEAFGPEAVRELPLPPGKCLRWIWATKEQRRVHAGSYCPNDVAPADHPMALDEAVARSINSLTTRHAVLLPELLAERRPDLFKLMASQLSAAERSALDSPADRALASNQYASVGMTVAPDAISPSLSYSAAGVALFRTLKQRREQAGLPATSLPDDPTSLLGNDSRATVEQIGAYLHKKLFLGDGSCTLSDTGALLALHRREGTLRWLAQRWPKLIFTGKTGSSPHDDSAVAAVAICLDTRPVVLVAALRP
Ga0209847_103759213300026276Permafrost SoilKLELLNYAVEAFGPEAVRELPLPPGKCLRWIWATKEQRRVHAGSYCPNDVAPADHPMALDEAVARSINSLTTRHAVLLPELLAERRPDLFKLMASQLSAAERSALDSPADRALASNQYASVGMTVAPDAISPSLSYSAAGVALFRTLKQRREQAGLPATSLPDDPTSLLGNDSRATVEQIGAYLHKKLFLGDGSCTLSDTGALLALHRREGTLRWLAQRWPKLIFTGKTGSSPHDDSAVAAVAICLDTRPVVLVAALRPLQGALPQGLRGSVVLRGLDSYLRELSRLERRPNSAELPPWAVLATAVEAVQ
Ga0209234_103754833300026295Grasslands SoilMAVNYGSVAKLEALDLAVEAFGPAAVRDLMLPPGGCLRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209234_117093813300026295Grasslands SoilALDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALE
Ga0209027_109450423300026300Grasslands SoilMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPPWAEQDAQMAVEARP
Ga0209055_104183623300026309SoilYGSLAKLEALDFAVEAFGPRAVRELTLPPASCVRWIWARRPANYCPTDVAPPSRPMAFDEAVARSVNALTARHALLLPALLWQRRPDLLRTVSATLPVEERDALDSPADRALAADLFAQLGEIVPPDQVGPDLSYTAAGVALFRFLKARREEAGLPAALLPDDPTSLLGNSSRATAEQIGHYLHRKLFSDSSCALSDAGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLVAALRPLQPPLPKGLHGSVLLRGLDAYLRELVRLERRPTSLMLPAWAEENIAEPSAVAAGGIPSFAGDGEKR
Ga0209268_103264833300026314SoilLARSGAEAALMAVNYGSVAKLEALDLAVEAFGPGAVRELTLPPGGCVRWIWSTKELRRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209686_103842913300026315SoilVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209155_110149313300026316SoilAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMTFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVSPEERASLDSPADRALGAELLSQVGEAVPPDAVSPELSYTIAAVGLFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRAVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSALLPPWVEEETRLAAEAKP
Ga0209687_105054213300026322SoilLMAVNYGSVAKLEALDLAVEAFGPAQVRDLMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209801_107293033300026326SoilLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER
Ga0209802_104703313300026328SoilRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209802_109513423300026328SoilRWRPDLLHEIAAEVPSCERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIELPVAAPEAISVAATPAEKEER
Ga0209267_100976863300026331SoilMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209803_108607733300026332SoilAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP
Ga0257165_102561413300026507SoilALPPGGCVRWIWSTKEQRRSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0209808_112783513300026523SoilLMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMAAELSAGERAALDSPADRALAGDLLAQLGDAMPPEAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209808_116057513300026523SoilILFAQRRPDLLAEISKEVGDDERAALDSPADRALSATLFAQLGQTVQPDEIASDLSYSAASVALFRHLKSRRERGGLPASMLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLALRWPKLVFTGKTGSSPHDDSAVAAVALCLDARPVVAVSALRALAGPLPEGLRGSVLLRGIDAYLKELSRLERKPSSALWPSFIADETPAEELTVEAKR
Ga0209378_104063813300026528SoilERAALDSPADRALAGGLLAQLGETVAPDQVAPELSYSAAGIALFRYLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLAADGSCTLSDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSALAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRVTSLALPAWAEEIEAPVAAPEAISVAATPAEKEER
Ga0209378_118227713300026528SoilLDEAVARSVNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDAALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPV
Ga0209160_105503743300026532SoilGPAAVRDLMLPPGGCLRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPALLAQRRPDLLRQMVAEVSARERASLDSPADRALAGDLLAQLGGAMPPEAIAPELSYSAAGVALFRYLRDRRELAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSVLRPLQAPLPDGLQGSLLLRGIDGYLRELARLDRKPASALLPTWAEQDAQLAVEARP
Ga0209376_102393953300026540SoilRRSHPSRYCPNDVAPATRPMALDEAVARSINSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0209376_125630813300026540SoilARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEA
Ga0209805_103273543300026542SoilARSGAESALMAVNYGSVAKLEALDLAVEAFGPAQVRDLMLPPGGCVRWTWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSINSLTARHAMLLPPLLAQRRPDLLRRMTAELSAGERAALDSPADRALAGDLLAQLGEAMPPDAIAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGAYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGLCLDARPVVLVSALRPLQAPLPDGLQGSLLLRGIDGYLRELAKLDRKPASAVLPAWAEQDAQLAVEARP
Ga0209805_117379913300026542SoilSDVLPATHPMSFDEAVAKSVNSLTARHAVLLPVLLFRRRPEVFRDLASTVPEEERRSLDSPADRALASGLLAQLGETLAPDDVPEDLAYSAAGVALFRYLKERREQAGLPAERLPDDPTSLLGNSSRATPEQIAAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVGICLDRRPVVLVGALRPLLGALPLGLRGSVVLRGLDSYLKALVRLDRRPASALWPAWVEEELAAKQASAPLEPSRVESALATKGKP
Ga0209474_1017083723300026550SoilRWIWATKEERRSHPSSYCPSDVLPATHPMSFDEAVAKSVNSLTARHAVLLPVLLFRRRPEVFRDLASTVPEEERRSLDSPADRALASGLLAQLGETLAPDDVPEDLAYSAAGVALFRYLKERREQAGLPAERLPDDPTSLLGNSSRATPEQIAAYLHRKLFLKDGSCTLTDTGALLALRRKEGTLRWLAQKWPKLVFAGKTGSSPHDDSAVAAVGICLDQRPVVLVGALRPLLGALPLGLRGSVVLRGLDSYLKALVRLDRRPASALWPAWVEEELAAKQASAPLEPSRVESALATKGKP
Ga0209648_1059955813300026551Grasslands SoilLALRRPDLLKQMAAEVPPEERAALDSPADRALAGGLFAQLGESVPPDEVAPELSYSAAGIALFRYLKVRRERAGLPAERLPDDPTSLLGNSSRATAEQIGAYLHRKLFAGDGTCALSDTGALLALRRKEGTLRWLAQRWPKLVFTGKTGSSPHDDSAVAAVGVCLDARPVVLVAALRPLSGVLPTGLRGSVVLRGIDAYLQE
Ga0209577_1003321563300026552SoilNSLTARHAMMLPALLAQRRPDLLGQMVAGISADERASLDSPADRALAGDLLAQLGETIPPDAVAPELSYSAAGVALFRYLRDRRERAGLPADRLPEDPTSLLGNSSRATAEQIGGYLHRKLFANDGTCALSDTGALLALRRREGTLRWLAARWPKIVFSGKTGSSPHDDSALAAIGLCLDARPVVLVAALRPLQPPLPDGLQGSVLLRGIDAYLRELARLDRRPGSAVLPAWAEQETPVALEAKP
Ga0208990_100828043300027663Forest SoilTRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPVDRALAGELLSQVGEAVPPDAVSPELSYSTAAVGLFRYLRDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0208981_110147613300027669Forest SoilFGPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPVDRALAGELLSQVGEAVPPDAVSPELSYSTAAIGLFRYLRDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKLFANDGTCTLSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVAICLDARPVVLVA
Ga0209588_116760513300027671Vadose Zone SoilALLLPALFSQQRPDLLREVAAEVPPCERAALDSPADRALAGGLLAQLGETVTPDQVAPELSYSAAGVALFRHLRERREAAGLPAGRLPDDPTSLLGNSSRATAEQIGGYLHRKLLAGEGACALTDTGALLALHRRAGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIALCLDARPVVLVAALRPLQGPLPDGLHGSVLLRGLDAYLRELRRLDRRVSSAALPAWAEEID
Ga0209590_1017171733300027882Vadose Zone SoilWARRPASYCPADVAPPSHPMALDEAVSRSVNTLTARHALLLPLLLWQRRPDLLREVSGVLSAEERAALDSPADRALAGDLFAQVGEVVAPDDVAPDLSYSAAGVALFRFLKARRERAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLFANDGSCALSDTGALIALHRAEGTLRWLAQRWPKLVFSGKTGSSPHDDSAVAAVGICLDARPVLLVAALRPVQGSLPTGLHGSMLLRGLDAYLRELARLERRPTSLVLPAWASEEEAPLTPALAAEEKR
Ga0209488_1029516613300027903Vadose Zone SoilDVAPATRPMALDEAVARSVNSMTTRHAILLPALLAQRRPDLLREVAKELSPQERAALDSPADRALAGDLLAQLGEALPPDSIPPELSYSAAGVALFRYLRDRREQAGLPAERLPEDPTSLLGNSSRATPEQIGAYLQRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPRIVFSGKTGSSPHDDSALAAVGLCLDARPVVLIGALRPLHPPLPDGLQGSVLLRGIDAYLRELAKLQRKPSSALLPAWAEPPAPAELAAEVTP
Ga0209488_1050921713300027903Vadose Zone SoilTGGRAACAALGPGGAVLARSGAESALMAVNYGSIAKLEALDLAVEAFGPRTVHERMLPPGGCVRWIWATKSQRGASPASYCPSDVTPSARAMSFDEAVARSINSMTVRHAMLLPPLLSQWRPDLLREVASEVPPCERAALDSPADRALASGLLAQLGETLRPDQVAAELSYSAAGVALFRHLRARRELAGLPAERLPDDPTSLLGNSSRATAEQIGHYLHRKLFAAGGSCSLSDTGALLALHRRVGTLRWLAWRWPKLVFSGKTGSSPHDDSAVAGIAACIDERPVVLAAPLR
Ga0137415_1052170013300028536Vadose Zone SoilAALMAVNYGSVAKLEALDLAVEAFGPGAVRDLALPPGGCVRWIWSTKELRRSHLSRYCPTDVAPATRPMAFDEAVARSINSMTARHAMLLPALLARRRPDLLRELAAEVPPEERASLDSPLDRALAGELLSQVGEAVPPDAVSPELSYTTAAVGLFRYLKDRRERAGLPADRLPDDPTSLLGNSSRATPEQIGAYLHLKVFANDGTCALSDTGALLALHRREGTLRWLAQRWPRIVFSGKTGSSPHDDSAVAAVGICLDARPVVLVAALRPVQAPLPDGLQGSVLLRGMDAYLRELARLERKPGSPLLPPWVEQETQLAAEAKP
Ga0307473_1005177513300031820Hardwood Forest SoilALGPGGAVLARSGAESALMGVNYGSIAKLEALDLAVEAFGPRAVQERVLPPGGCVRWIWATKSQRGASPASYCPADVAPSVRPMSFDEAVARSINSMTVRHTLLLPALLSQSRPDLLQEIAAALLPGEHAALDSPADRALAAGLLAQLGETVTPEQVPPELSYTAAGVALFRHLRARRQLAGLPAERLPDDPTSLLGNSSRATAEQIGGYLHRKLLASDGSCTLSDTGGLLALHRRDGTLRWLAWRWPKLVFAGKTGSSPHDDSAVAGIAVCLDARPVVLVAALRPLQAPLPDGLHGSVLLRGLDAYLRELRRLDRRLTPVALPAWAAETLAPAAAPEAISAGAPVEEER
Ga0307473_1071502413300031820Hardwood Forest SoilDVAPATRPMALDEAVARSVNSMTTRHAMLLPALLLQRRPDLMRVVAAEVTSQERAALDSPADRALAGDLLAQLGEPLPPDAIPAELSYSAAGVALFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATAEQIGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKE
Ga0307471_10020201113300032180Hardwood Forest SoilGPEAVRELLLPPGGCVRWIWSTKDLRRSHPSRYCPTDVAPATRPMALDEAVARSVNSMTTRHAMLLPALLMRRRPDLLREVAAEVTTQERAALDSPADRALAGDLLAQLGEPLPPDAIPAELSYSAAGVALFRYLKDRREQAGLPADRLPEDPTSLLGNSSRATAEQVGSYLHRKLFANDGTCMLSDTGALLALHRKEGTLRWLAQRWPKIVLSGKTGSSPHDDSAVAAVGLCLDARPVVLVAALRPLQAPLPDGLQGSVLLRGIDAYLKELARLQRTPTSALLPPWADAAAELSAEARP
Ga0307472_10084390913300032205Hardwood Forest SoilYGPTDVAPAARPMPFDEAVAKSINSLTARHALLLPVLFAQRRPDLLAEISRQVSEEERAALDSPADKALTASLFAQLGQTVPPEEIASDLSYSAASVALFRELKARREQAGLPSSMLPEDPTSLLGNSSRATVEQIGAYLHRKLFAGDGSCTLSDTGALLGLHRREGTLRWLAQRWPKLIFTGKTGSSPHDDSAVAAVALCLDARPVVVVSALRALSGSLPEGLRGSVLLRGIDAYLKALSRLERKPTSALWPSFVIDEAPAEEAKG
Ga0335082_1025057833300032782SoilCPTDVSPATRPMAMDESVARSINSMTARHALLLPALLLQRRPDLLRQLSAEVTPVERAALDSPADRALAGELLAQLGEAVPPDAVPAELSYSAAGVALFRYLRERREQAGLPASRLPEDPTSLLGNSSRATAEEIGSYLHRKLFANDGTCALSDTGALLALHRKEGTLRWLAARWPKLVFSGKTGSSPHDDSALAAVGICLDARPVVLIAALRPLHPPLPDGLQGSVLLRGIDAYLRELSRLDRRPGP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.