NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F097008

Overview

Basic Information
Family ID F097008
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 239 residues
Representative Sequence VPLPLRELAAVARQSDEALAEKTGAAGNRISRRDALYGLLRREQAIVASDASSSRSEISRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGAARAGTDARLAKVPGSALTRPSLWGTMNVDVRMRLHQIAAHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREVLDESYRTLTA
Number of Associated Samples 79
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 63.46 %
% of genes near scaffold ends (potentially truncated) 40.38 %
% of genes from short scaffolds (< 2000 bps) 49.04 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.71
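
The percentages above can be read back as whole-number counts of family members. The following is a minimal sketch, assuming each percentage is a count divided by the 104 sequences listed under Basic Information; the helper name `implied_count` is hypothetical and not part of NMPFamsDB.

```python
# Family size taken from "Number of Sequences" on this page.
FAMILY_SIZE = 104

# Percentages as reported in the Quality Assessment block.
reported = {
    "valid RBS motif": 63.46,
    "near scaffold end": 40.38,
    "short scaffold (< 2000 bp)": 49.04,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole-number count implied by a rounded percentage.

    Assumption: the percentage was computed as count / total * 100.
    """
    return round(percent / 100 * total)


for label, pct in reported.items():
    n = implied_count(pct)
    # Recompute the percentage from the count to check that it rounds back.
    print(f"{label}: ~{n}/{FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.2f} %)")
```

Under this assumption, the same helper applies to the other per-family frequencies on this page, such as the Pfam and COG neighborhood frequencies.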

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(44.231 % of family members)
Environment Ontology (ENVO) Unclassified
(54.808 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(63.462 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 62.99%, β-sheet: 4.98%, Coil/Unstructured: 32.03%

Predicted 3D Structure

pTM-score: 0.71

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.213.1.2: DinB-like | d2ou6a1 | 2ou6 | 0.76773
a.213.1.1: YfiT-like putative metal-dependent hydrolases | d1rxqa_ | 1rxq | 0.75252
a.213.1.4: Maleylpyruvate isomerase-like | d2nsfa1 | 2nsf | 0.75115
a.213.1.2: DinB-like | d2p1aa1 | 2p1a | 0.69655
a.213.1.2: DinB-like | d2hkva1 | 2hkv | 0.68437
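
The SCOP identifiers above encode class, fold, superfamily and family as dot-separated fields. As a minimal sketch (the values are copied from the table; the helper name `superfamily` is hypothetical), the hits can be grouped by superfamily and ranked by TM-score:

```python
# (SCOP family identifier, SCOP domain, TM-score), copied from the table above.
hits = [
    ("a.213.1.2", "d2ou6a1", 0.76773),
    ("a.213.1.1", "d1rxqa_", 0.75252),
    ("a.213.1.4", "d2nsfa1", 0.75115),
    ("a.213.1.2", "d2p1aa1", 0.69655),
    ("a.213.1.2", "d2hkva1", 0.68437),
]


def superfamily(sccs: str) -> str:
    """Keep the class.fold.superfamily part of a SCOP identifier."""
    return ".".join(sccs.split(".")[:3])


# Collect the distinct superfamilies represented among the structural matches.
superfamilies = {superfamily(sccs) for sccs, _, _ in hits}

# Pick the hit with the highest TM-score.
best = max(hits, key=lambda hit: hit[2])

print(sorted(superfamilies))
print(best)
```

All five matches fall in one superfamily, so the listed families differ only at the last level of the classification.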


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF07883 | Cupin_2 | 30.77
PF04237 | YjbR | 14.42
PF07992 | Pyr_redox_2 | 8.65
PF01250 | Ribosomal_S6 | 6.73
PF11716 | MDMPI_N | 5.77
PF01084 | Ribosomal_S18 | 3.85
PF00436 | SSB | 2.88
PF03401 | TctC | 0.96
PF00496 | SBP_bac_5 | 0.96
PF00326 | Peptidase_S9 | 0.96
PF04075 | F420H2_quin_red | 0.96
PF02775 | TPP_enzyme_C | 0.96
PF01738 | DLH | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG2315 | Predicted DNA-binding protein with ‘double-wing’ structural motif, MmcQ/YjbR family | Transcription [K] | 14.42
COG0360 | Ribosomal protein S6 | Translation, ribosomal structure and biogenesis [J] | 6.73
COG0238 | Ribosomal protein S18 | Translation, ribosomal structure and biogenesis [J] | 3.85
COG0629 | Single-stranded DNA-binding protein | Replication, recombination and repair [L] | 2.88
COG2965 | Primosomal replication protein N | Replication, recombination and repair [L] | 2.88
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 0.96


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002908|JGI25382J43887_10044250 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2419
3300005171|Ga0066677_10014834 | All Organisms → cellular organisms → Bacteria | 3476
3300005172|Ga0066683_10050155 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2459
3300005172|Ga0066683_10084513 | All Organisms → cellular organisms → Bacteria | 1915
3300005174|Ga0066680_10070006 | All Organisms → cellular organisms → Bacteria | 2098
3300005174|Ga0066680_10118272 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1637
3300005176|Ga0066679_10020194 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 3516
3300005177|Ga0066690_10182179 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1392
3300005178|Ga0066688_10020306 | All Organisms → cellular organisms → Bacteria | 3509
3300005180|Ga0066685_10070142 | All Organisms → cellular organisms → Bacteria | 2295
3300005180|Ga0066685_10951355 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Bacillaceae → Geobacillus → unclassified Geobacillus → Geobacillus sp. CAMR5420 | 571
3300005186|Ga0066676_10026703 | All Organisms → cellular organisms → Bacteria | 3090
3300005186|Ga0066676_10068054 | All Organisms → cellular organisms → Bacteria | 2080
3300005186|Ga0066676_10101009 | All Organisms → cellular organisms → Bacteria | 1750
3300005186|Ga0066676_10496068 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Bacillaceae → Geobacillus → unclassified Geobacillus → Geobacillus sp. CAMR5420 | 829
3300005186|Ga0066676_10713133 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Bacillaceae → Geobacillus → unclassified Geobacillus → Geobacillus sp. CAMR5420 | 683
3300005446|Ga0066686_10004211 | All Organisms → cellular organisms → Bacteria | 6686
3300005446|Ga0066686_10021683 | All Organisms → cellular organisms → Bacteria | 3615
3300005446|Ga0066686_10210839 | All Organisms → cellular organisms → Bacteria | 1304
3300005468|Ga0070707_100128429 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2464
3300005518|Ga0070699_100173564 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales | 1911
3300005554|Ga0066661_10107669 | All Organisms → cellular organisms → Bacteria | 1673
3300005554|Ga0066661_10257227 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1080
3300005555|Ga0066692_10098182 | All Organisms → cellular organisms → Bacteria | 1732
3300005556|Ga0066707_10206708 | All Organisms → cellular organisms → Bacteria | 1268
3300005557|Ga0066704_10014229 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 4522
3300005561|Ga0066699_10464113 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 906
3300005566|Ga0066693_10011422 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2513
3300005568|Ga0066703_10017158 | All Organisms → cellular organisms → Bacteria | 3654
3300005568|Ga0066703_10241066 | All Organisms → cellular organisms → Bacteria | 1098
3300005568|Ga0066703_10257673 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1058
3300005575|Ga0066702_10133943 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1451
3300006794|Ga0066658_10368404 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 772
3300009012|Ga0066710_101323864 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1117
3300009012|Ga0066710_101484207 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1047
3300009088|Ga0099830_10569994 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 927
3300009090|Ga0099827_10012342 | All Organisms → cellular organisms → Bacteria | 5590
3300009090|Ga0099827_10936996 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 750
3300009137|Ga0066709_102203050 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 760
3300009147|Ga0114129_10306227 | All Organisms → cellular organisms → Bacteria | 2116
3300010304|Ga0134088_10114258 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1275
3300012199|Ga0137383_10752396 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 712
3300012200|Ga0137382_10046828 | All Organisms → cellular organisms → Bacteria | 2681
3300012203|Ga0137399_10148017 | All Organisms → cellular organisms → Bacteria | 1872
3300012203|Ga0137399_10924151 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 735
3300012206|Ga0137380_10013725 | All Organisms → cellular organisms → Bacteria | 7389
3300012207|Ga0137381_10138133 | All Organisms → cellular organisms → Bacteria | 2091
3300012208|Ga0137376_10075945 | All Organisms → cellular organisms → Bacteria | 2807
3300012208|Ga0137376_10140700 | All Organisms → cellular organisms → Bacteria | 2071
3300012211|Ga0137377_10040980 | All Organisms → cellular organisms → Bacteria | 4243
3300012211|Ga0137377_10058898 | All Organisms → cellular organisms → Bacteria | 3568
3300012358|Ga0137368_10297138 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1094
3300012359|Ga0137385_10005091 | All Organisms → cellular organisms → Bacteria | 11516
3300012918|Ga0137396_10059909 | All Organisms → cellular organisms → Bacteria | 2636
3300012918|Ga0137396_10729412 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 730
3300012925|Ga0137419_10078706 | All Organisms → cellular organisms → Bacteria | 2222
3300012927|Ga0137416_10583365 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 972
3300012944|Ga0137410_10619019 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 896
3300014154|Ga0134075_10335267 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 662
3300015358|Ga0134089_10116601 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1034
3300017657|Ga0134074_1152718 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 808
3300018027|Ga0184605_10002103 | All Organisms → cellular organisms → Bacteria | 6638
3300018061|Ga0184619_10035993 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2108
3300018061|Ga0184619_10122193 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1179
3300018071|Ga0184618_10109754 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1092
3300018433|Ga0066667_10487513 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1013
3300018482|Ga0066669_10634860 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 938
3300019885|Ga0193747_1004145 | All Organisms → cellular organisms → Bacteria | 3558
3300019885|Ga0193747_1004234 | All Organisms → cellular organisms → Bacteria | 3517
3300020001|Ga0193731_1000852 | All Organisms → cellular organisms → Bacteria | 9094
3300021080|Ga0210382_10001362 | All Organisms → cellular organisms → Bacteria | 8214
3300021080|Ga0210382_10001596 | All Organisms → cellular organisms → Bacteria | 7528
3300022534|Ga0224452_1004346 | All Organisms → cellular organisms → Bacteria | 3418
3300025922|Ga0207646_10002468 | All Organisms → cellular organisms → Bacteria | 21845
3300025922|Ga0207646_10785643 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 848
3300026297|Ga0209237_1001282 | All Organisms → cellular organisms → Bacteria | 14868
3300026315|Ga0209686_1020859 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2606
3300026318|Ga0209471_1024996 | All Organisms → cellular organisms → Bacteria | 2972
3300026318|Ga0209471_1056928 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1790
3300026324|Ga0209470_1112620 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1213
3300026328|Ga0209802_1034244 | All Organisms → cellular organisms → Bacteria | 2638
3300026329|Ga0209375_1047679 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2158
3300026331|Ga0209267_1007047 | All Organisms → cellular organisms → Bacteria | 6473
3300026332|Ga0209803_1036191 | All Organisms → cellular organisms → Bacteria | 2284
3300026343|Ga0209159_1016873 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 4200
3300026524|Ga0209690_1038805 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2219
3300026529|Ga0209806_1043245 | All Organisms → cellular organisms → Bacteria | 2160
3300026529|Ga0209806_1297697 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 542
3300026532|Ga0209160_1018447 | All Organisms → cellular organisms → Bacteria | 4756
3300026536|Ga0209058_1013111 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 5762
3300026536|Ga0209058_1208833 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 788
3300026537|Ga0209157_1014338 | All Organisms → cellular organisms → Bacteria | 5148
3300026538|Ga0209056_10088086 | All Organisms → cellular organisms → Bacteria | 2585
3300026550|Ga0209474_10013846 | All Organisms → cellular organisms → Bacteria | 6413
3300028536|Ga0137415_10387315 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1202
3300028784|Ga0307282_10005259 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 4991
3300028814|Ga0307302_10165088 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1075
3300028819|Ga0307296_10217332 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1038
3300028828|Ga0307312_10088140 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1908
3300028881|Ga0307277_10095830 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1255
3300028884|Ga0307308_10010636 | All Organisms → cellular organisms → Bacteria | 4081
3300028885|Ga0307304_10149789 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 972
3300031820|Ga0307473_10219314 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1143
3300032180|Ga0307471_100471626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1399
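
Each scaffold identifier above has two parts joined by a vertical bar. The sketch below is illustrative only: it assumes the part before the bar is the sample's Taxon OID (the same numbers appear in the Associated Samples table) and reuses the 2000 bp cutoff from the Quality Assessment block. The names `Scaffold` and `parse_scaffold` are hypothetical.

```python
from typing import NamedTuple

# Cutoff quoted in the Quality Assessment block ("short scaffolds (< 2000 bps)").
SHORT_SCAFFOLD_BP = 2000


class Scaffold(NamedTuple):
    taxon_oid: str   # assumed: sample Taxon OID, the part before "|"
    name: str        # scaffold name, the part after "|"
    length_bp: int   # scaffold length as listed in the table


def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split a '<taxon OID>|<scaffold name>' identifier into its two parts."""
    taxon_oid, name = identifier.split("|", 1)
    return Scaffold(taxon_oid, name, length_bp)


# Three rows copied from the Associated Scaffolds table.
rows = [
    ("3300002908|JGI25382J43887_10044250", 2419),
    ("3300005180|Ga0066685_10951355", 571),
    ("3300025922|Ga0207646_10002468", 21845),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]

# Names of the scaffolds that fall under the short-scaffold cutoff.
short = [s.name for s in scaffolds if s.length_bp < SHORT_SCAFFOLD_BP]
print(short)
```

Genes on scaffolds under the cutoff are more likely to be truncated, which is why the page reports that fraction as a quality indicator.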



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 44.23%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 20.19%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.62%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.73%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.85%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.85%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.85%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.92%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.92%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.96%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.96%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019885 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300022534 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1 | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental
3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300028881 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25382J43887_100442501 | 3300002908 | Grasslands Soil | LPVRQRRPRCKRNGTRGLTGSRVPPPLRELAAFAGRSDETLSEKVGAAGNRISRRDALYGLLRREQAIVASGESSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGDWSLRDVMRHAMAVELRYAAQVEYSAARAESDPVAIPPGLLPCDRLSPPEPEFAHSRDGLLLDLLELLGKARAGSDVRLAKVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETXIQVEKIVGGVGELRATLRRCCLTRGMHERWSPEKERDVLDESYRRLTA*
Ga0066677_100148345 | 3300005171 | Soil | LYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS*
Ga0066683_100501552 | 3300005172 | Soil | VPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS*
Ga0066683_100845131 | 3300005172 | Soil | RELGTVARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLVAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDQALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCITRGMHERWSSAADRAGLDESYRALTA*
Ga0066680_100700063 | 3300005174 | Soil | VPPPLRELAAFAGRSDETLSEKVGAAGNRISRRDALYGLLRREQAIVASGESSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGDWSLRDVMRHAMAVELRYAAQVEYSAARAESDPVAIPPGLLPCDRLSPPEPEFAHSRDGLLLDLLELLGKARAGSDVRLAKVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETTIQVEKIVGGVGELRATLRRCCLTRGMHERWSPEKERDVLDESYRRLTA*
Ga0066680_101182721 | 3300005174 | Soil | RVRPPKRARVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS*
Ga0066679_100201944 | 3300005176 | Soil | VPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSTTRAETDPIGIPPGLLPCDRLSPPEPEFAHSREGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTA*
Ga0066690_101821794 | 3300005177 | Soil | AAGYRISRRETLYGLLRREQAIVASGESSPRTEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPPGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGTHERWSPEKGRTVLDESYRALTR*
Ga0066688_100203066 | 3300005178 | Soil | VPPPLRELTTVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPPGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKILGGGGELRAILRRCCVTRGMHERW
Ga0066685_100701422 | 3300005180 | Soil | MPPPLAELAAVARLDQEALAGDARSSRGTLLGLLRGEQAIAAYGHPSERSEVSRILDFAQAAYGDVVGLLVGRDDSLLDTARDGEWSLRDLLRHAMAVELRYAAQVEYSATRAESDPVPIPPGLLPCDRLSPTEPAFAASRSGGVIELLELLGNARASSDTRLTKVPDSALTRPSLWGTKLLDVRMRLHQMAVHLTEAAIQIEKIVGGGDELRAIIRRCCITRGMHERWSSAADRAGLDESYRALTA*
Ga0066685_109513551 | 3300005180 | Soil | VALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLIAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDQALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVSGGGGEPRAI
Ga0066676_100267035 | 3300005186 | Soil | LYGLLRREQAIAASAQPSRSQVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS*
Ga0066676_100680544 | 3300005186 | Soil | MPPPLRELGTVARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLIAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDQALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCITRGMHERWSSAADRAGLDESYRALTA*
Ga0066676_101010092 | 3300005186 | Soil | VPLPLRELAAVARQSDEALAEKTGAAGNRISRRDALYGLLRREQAIVASDASSSRSEISRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGAARAGTDARLAKVPGSALTRPSLWGTMNVDVRMRLHQIAAHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREVLDESYRTLTA*
Ga0066676_104960681 | 3300005186 | Soil | MPPPLAELAAVARLDQEALAGDARSSRGTLLGLLRGEQAIAAYGHPSERSEVSRILDFAQAAYGDVVGLLVGRDDSLLDTARDGEWSLRDLLRHAMAVELRYAAQVEYSATRAESDPVPIPPGLLPCDRLSPTEPAFAASRSGGVLELLELLGNARASSDTRLTKVPDSALTRPSLWGTKLLDVRMRLHQMAVHLTEAAIQIEKIVGGGDELRAIIRRCCITRGMHERWSSAADRAGLDESYRALTA*
Ga0066676_107131331 | 3300005186 | Soil | MPPPLRELAAVARLDDEALSQKVGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSLLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATRAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGE
Ga0066686_100042115 | 3300005446 | Soil | MPPPLRELAAVARLDDEALSQKVGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSLLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATRAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGELRSIIRRCGTTRGMHERWSPAAERAVLDESYRALAP*
Ga0066686_1002168363300005446SoilVPLPLRELAAVARQSDEALAEKTGAAGNRISRRDAWYGLLRREQAIVASSASSSRSEVSRILDFAQAAYGDLVGIMVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGATRAATDARLAKVPDSALTRASLWGTMNLDVRMRLHQIAAHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREVLDESYRTLTA*
Ga0066686_1021083923300005446SoilMPPPLRELGTVARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLIAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDQALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCITRGMHERWSSAADRAGLDES
Ga0070707_10012842923300005468Corn, Switchgrass And Miscanthus RhizosphereVPPPLSALAAIARLDDDALVADKRSARRGVLYGLLRREQAISASRQSSSRSEVSRILDFAQAAYGDVVGALVGRDDALLDTARDAEWSLRDVLRHAMAVELRYAAQVEYSATRSERDPVQIPPGLLPCDRLSPPETEFAASRSGGVLELLHLLGNARTSTDARLTKVPDSVLTRPSMWGTQLLDVRLRLHQIAVHLAETAIQIEKMVGRGCELRAIIRRCCIARGLHERWSPAEERAALDDSYTASRR*
Ga0070699_10017356423300005518Corn, Switchgrass And Miscanthus RhizosphereMAIARLDEATLVADQRGTRRDALYGLLRREQAIFAAGQRSPSEVSRILDLAQAAYGDVVGALVGRNDRLLDTARDGDWSLRDVLRHAIAVELRYAAQIEYSATRAESEPIEIPIARLPCDRLSPPEPAFAASRTGGVLEQLRLLGEARVGSDIRLAKLPDSVLTRPSMWGNRPLDVRMRIHQIAVHLTETTIQIDRIVLSGGELRAIIRRICITRGMHERWSPAHGRTALDESYAALSR*
Ga0066661_1010766923300005554SoilVPPPLRELAAVARSGDEALAEKVGAAGYRISRRETLYGLLRREQAIVVSAESSPRTEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPPGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTA*
Ga0066661_1025722723300005554SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAMAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAG
Ga0066692_1009818213300005555SoilMPPPLRELGTVARLSDETLAQKVGASGSRIPRGVALYGLLRREQAMFASGEWRPRSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDILELLGKARAGSDVRLAKVPDSALTRPSLWGTALVDVRLRLHQMAAHLTESAIQTEKIVGTGGEPRAIVRRCCITRGMH
Ga0066707_1020670833300005556SoilVPPPLRELGALARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYDDVVGVLVGRDDGLLDTARDGEWSLRDVLRHAIAVELRYAAQVEYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDIIELLGRARAGSDVRLAKVPDSALTRPSLWGTALVDVRLRLHQMAAHLTESAIQTEKIVGTGGEPRAIVRRCCITRGMHERWSREEERAVLDESYRALLS*
Ga0066704_1001422973300005557SoilVPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPAGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGNARAGGDVRLAKVPDSAFARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTG*
Ga0066699_1046411313300005561SoilVPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDQASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSTTRAETDPIGIPPGLLPCDRLSPPEPEFAHSREGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTA*
Ga0066693_1001142233300005566SoilMPPPLDALAALVRLDDCALATDERSSRQDTLYGVLRREQAIAASAQPARSEVSRILDFAQAALGDVVGVLIGREDRLLDSARDGEWSLRDVFRHAIAVELRYAAQVEYSASRADTDPIQIPPALLPCDRLSPPDGEFGSSRSGGVLELLQLLGTARTSTDARLARLPDAALGRPSLWGKQQIDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAIIRRCCMTRGMHERWSPPRERTVMDDSYRALTA*
Ga0066703_1001715843300005568SoilMDGVRGERRFFMYDLLRREYALAAAKQPPGRSEAIRILDFAQAAYGDVVGLLGGREDALLDSARDGDWSLRDVLRHAIAVELRYGAQVEYSATRAESDPVEIRPDLLPCDRLSPPDPGFSSSRDGGVLDLLELLGKARTTSDVRVARVSDSTLMRPSLWGKARLDVRMRLHQFGVHLVETAIQIEKIVDGCDEARMIIRRCCSMRGLHERWSPADERTILDESYRALSV*
Ga0066703_1024106623300005568SoilMPPPLRELGTVARLSDETLAQKVGASGSRIPRGVALYGLLRREQAMFASGEWRPRSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRVGGILDILELLGKARAGSDVRLAKVPDSALTRPSLWGTALVDVRLRLHQMAAHLTESAIQTEKIIGTGGELR
Ga0066703_1025767313300005568SoilVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPAGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGNARAGGDVRLAKVPDSAFARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKILGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTG*
Ga0066702_1013394333300005575SoilVPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSTTRAETDPIGIPPSLLPCDRLSPPEPEFAHSRDGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAI
Ga0066658_1036840413300006794SoilGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPPGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGKARAGGDVRLARVPESALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGTHERWSPEKGRTVLDESYRALTR*
Ga0066710_10132386423300009012Grasslands SoilVPLPLRELAAVARQSDEALAEKTGAAGNRISRRDAWYGLLRREQAIVASSASSSRSEVSRILDFAQAAYGDLVGIMVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGATRAATDARLAKVPDSALTRASLWGTMNLDVRMRLHQIAGHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREVLDESYRTLTA
Ga0066710_10148420723300009012Grasslands SoilMPPPLRELAAVARLDDEALSQKVGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSLLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATRAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGELRSIIRRCGT
Ga0099830_1056999423300009088Vadose Zone SoilVVARQSDEALARKVGAAGSRGSRRDVLYGLLRREQAIFASAQSPLRSEVSRILDFAQAAYGDVVGVLVGRDDGLLDSSRDGDWTLRDVVRHAIAVELRYAAQVLYSATRSESEPVEIQPSLLPCDRLSPPEPEFARSRDGGVLDLLELLGKALAGTDARLAKVPDSALGRPSLWGTVRLDVRMRVHQMAAHLTESAIQIEKIVGGGGELQAILRRCCITRGLHERWSPANERALLDQSYRDLIA*
Ga0099827_1001234213300009090Vadose Zone SoilVPPPLRELAAVARQSDETLAEKVGAAGDRISRRDALYGLLRREQAIVASRESSSRSEVSRILDFAQAAYGDLVGILVGRDDTVLDSARDGEWSLRDVMRHAMAVELRYAAQVEYSAARAESDPVAIPPGLLPCDRLSPPEPEFAHSRDGLLLDLLELLGKARAGSDVRLAKVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKI
Ga0099827_1093699613300009090Vadose Zone SoilLLRREQAMFASGQSRPRSEVSRILDFAQVAYDDVVGVLVGRDDGLLDTARDGEWSLRDVLRHAIAVELRYAAQVEYSATRAETDPVEIPPSLLPCDRLSPPEPEFAGSRDGGILDILELLGKARAGSDVRLAKVPDSALTRPSLWGTALVDVRLRLHQMAAHLTESAIQTEKIVGTGGELRAIVRRCCITRGMHERWSREKQRTVLDESYRALLS*
Ga0066709_10220305023300009137Grasslands SoilMPPPLRELGTVARLSDETLAQKVGASGSRIPRGVALYGLLRREQAMFASGEWRPRSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDILELLAKARAGSDVRVAKVPDAALTRPSLWGTALVDVRLRPHQMAAHLTERA
Ga0114129_1030622723300009147Populus RhizosphereVASLDDQLATDSPRALYGLLRREQAIFATGQSQPRSEVSRILDFAQAAYADVVGVLVGRDDGLLDTLRDGDWSLRDVVRHAIAVELRYGAQVEYSATRAETEPVGIRPDLLPCDRLSPPEPDFADSRDGGVLELLDLLGRARAITDLRLAKVPDSTLTRPSLWGTTPLDVRMRLHQIAAHLVQSAIQIEKIVGGGSELRMVIRRCCSARGLHERWSSENDRAPLDEAYRALTA*
Ga0134088_1011425813300010304Grasslands SoilGLLRREQAIVASSASSSRSEVSRILDFAQAAYGDLVGIMVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGAARAGTDARLAKVPGSALTRPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREILDESYRTLTA*
Ga0137383_1075239613300012199Vadose Zone SoilPPMPLPLAELAAVARLTNEALAEKVGVAGNRTSRRDALYGLLRREQAMFASGEWRPRSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDILELFGNARAGSDVRPAKVADSALTRPSLWGTALVDVRLRLHQMAAHLTESAIQTEKIIGTGGELRAIVRRCCITRGMHERW
Ga0137382_1004682823300012200Vadose Zone SoilMPPPLDALAAVLRRDNGALAADERSSRRDALYGLLRREQAIAALAQPDRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLREVLRHAMAVELRYAAQVEYSASRADTDPIQIPPAVLPCDRLSPPDGEFGLSRSGGVLELLRLLGNARTSTDTRLARLPDAALGRPSLWGKQQIDVRMRLHQIAVHLTETAIQIEKIVGGGGELRAIVRRCCITRGMHERWSPPQERTVMDDSYRALTT*
Ga0137399_1014801733300012203Vadose Zone SoilVPPPLGELAAVARLDDGARVADKRNSLRNALYGLLRREQAIAASPQSLARSEVSRILDFAQAAYGDVIGVLVGRDDHLLDTARDGEWSLRDVLRHAIAVELRYAAQTDYSAARSDSDPVQIPPDLLPCDRLSPPELGFGASRSGGVIELLQLLGTARASSDARLAKVADSALTRPSMWGMQLLDVRMRLHQIAVHLTETAIQIEKIVGSDGELRAIVRRCCITRGLHERWSSAAERASLDESYRALTA*
Ga0137399_1092415123300012203Vadose Zone SoilEQAIVASGESSSRSEVSRILDFSQTAYGDLVGILVGRDDRLLDSARDGEWSLRDIMRHAIAVELRYAAQVEYSATRAETDPVGIPPGLLPCDRLSPPEPEFANSRHGAIVDLLELLGKARASSDLRLAKVPDSALMRPSLWGAVGLDVRMRLHQIAAHLTESAIQIEKIVGSGGELRAVIRRCGMTRGMHERWSRAEARAVLDESYRTLAP*
Ga0137380_1001372533300012206Vadose Zone SoilMPPPIRELAAVARLTNEGLAERVGLVGNRTSRRDALYGLLRREQAVFASAESRPRSEVSRILDFAQAAYGDLVGVMVGRDDSLLDTGRDGEWSLRDVLRHAMAVKLRYAAQVDYSATRAETDPVEIPPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDSALTRPSFWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCATRGMHERWSREEERTVLDESYRALLS*
Ga0137381_1013813323300012207Vadose Zone SoilMPPPIRELAAVARLTNEGLAERVGLVGNRTSRRDALYGLLRREQAVFASAESRPRSEVSRILDLAQAAYGDLVGVMVGRDDSLLDTGRDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIPPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDSALTRPSFWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCATRGMHERWSREEERTVLDESYRALLS*
Ga0137376_1007594533300012208Vadose Zone SoilMPPPLAELAAVARLDPEALVGRTRSSCRDVLYGLLRREQAIAASAESSSRSEISRILDYAQAAYGDVVGVLVGRDNSLLDTARDAEWSLRDVLRHAMAVELRYAAQVEYSATRSDSDPVPIPPGLLPCDRLSPTEPAFTGSRTGGVLELLQLLGDARASSDARLAKVPDSALTRASMWGTRLLDVRMRLHQIAVHLTETAIQIEKIVGGDGELRAISRRCCITRGLHERWSSATERGTIDDSYRSLAGQLS*
Ga0137376_1014070033300012208Vadose Zone SoilMPPPLDALAAVVRLGDGALAADERSSRRDALYGLLRREQAIAALAQPDRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAVLPCDRLSPPDGEFGLSRSGGVLVLLRILGNARTSTDTRLAKLPDASLGRPSLWGKQEIDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAIVRRCCITRGMHERWSPPQERTVMDDSYRALTT*
Ga0137377_1004098043300012211Vadose Zone SoilMPPPLRELGTVARLSDETLAQKVGASGSRIPRGVALYGLLRREQAMFASGEWRPRSEVSRILDFAQAAYGDLVGVLVGRDDGLLDTARDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDIIELLGRARAGSDVRLAKVPDSALTRPSFWGTARVDVRLRLHQMAAHLIESAIQTEKIVGTGGEPRAIVRRCCITRGMHERWSREEERAVLDESYRALLS*
Ga0137377_1005889833300012211Vadose Zone SoilVPLPLRELAAFARQRDEALAEKIGAAGNRMSRHDALYGLLRREQAIVAAGAPSSRSEVSRILDFAQAGYGDLAGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRRSPPEPEFAHSRDGGILKLLELLGAARAGTDARLAKVPGSALTRPSLWGTMNVDVRMRLHQIAAHLTETAIQVEKIIGYGGELRAILRRCCMTRGMHERWSPAKEREVLDESYRTLTA*
Ga0137368_1029713823300012358Vadose Zone SoilMPPPLRELAAVARLDDEALSQKVGVAGKRISRRDALYGLLRHEQGMFAAGESRPPSEVTRILDFAQAAYGDLVGLLVGREDSLLDIGRDGDWSLRDVLRHAIAVELRYAAQVEYSATRAESDPLEIRPGLLPCDRLSPPEPEFAGSRKGGIPDVLDLLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQVAGHLTESAIQTEKIVGSAGELRSIIRRCAMTRGMHERWSPAAERAVLDKSYR
Ga0137385_10005091143300012359Vadose Zone SoilMPPPIRELAAVARLTNEGLAERVGLVGNRTSRRDALYGLLRREQAVFASAESRPRSEVSRILDFAQAAYGDLVGVMVGRDDSLLDTGRDGEWSLRDVLRHAMAVELRYAAQVDYSATRAETDPVEIPPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDSALTRPSFWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCATRGMHERWSREEERTVLDESYRALLS*
Ga0137396_1005990943300012918Vadose Zone SoilVPPPLGELAAVARLDDGARVADKRNSLRNALYGLLRREQAIAASPQSLARSEVSRILDFAQAAYGDVIGVLVGRDDHLLDTARDGEWSLRDVLRHAIAVELRYAAQTDYSAARSDTDPVQIPLDLLPCDRLSPPELGFGASRSGGVIELLQLLGTARASSDARLAKVADSALTRPSMWGMQLLDVRMRLHQIAVHLTETAIQIEKIVGSDGELRAIVRRCCITRGLHERWSSAAERASLDESYRALTA*
Ga0137396_1072941223300012918Vadose Zone SoilGLLRREQAIVASGESSSRSEVSRILDFSQAAYGDLVGILIGRDDRLLDSARDGEWSLRDIMRHAIAVELRYAAQVEYSATRAETDPVGIPPGLLPCDRLSPPEPEFANSRHGAIVDLLELLGKARASSDQRLAKVPDSALMRPSLWGAVGLDVRMRLHQIAAHLTESAIQIEKIVGSGGELRAVIRRCGMTRGMHERWSRAEERAVLDESYRTLAP*
Ga0137419_1007870643300012925Vadose Zone SoilVPPPLGELAAVARLDDGARVADKRNSLRNALYGLLRREQAIAASPQSLARSEVSRILDFAQAAYGDVIGVLVGRDDHLLDTARDGEWSLRDVLRHAIAVELRYAAQTDYSAARSDTDPVQIPLDLLPCDRLSPPEPSFAMSRSGGVLELLQMLGTARASSDARLAKVADSALTRPSMWGMQLLDVRMRLHQIAVHLSETAIQIEKIVGSDSELRAIVRRCCITRGLHERWSSAAERASLDESYRALTA*
Ga0137416_1058336513300012927Vadose Zone SoilHDALYSLLRREQAIVASGASSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRADSDPVGIPPGLLPCDRRSPPEPEFAHSRDGGILKLLELLGAARAGTDTRLAKVPGSALTRPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGYGGELRAILRRCCMTRGMHERWSPAKEREVLDESYRTLTA*
Ga0137410_1061901913300012944Vadose Zone SoilVPPPLRELGAVARLGDPALVEKVGQAGNRISRRDALYGLLRREQAIFASTDSSPRSEVSRILDFAQGAFGDLVGLLVGRDDTLLDTGRDGDWSLRDALRHAIAVELRYAAQVEYSATRGESDPVEIRQSLLPCDRLSPPDPEFAGSRDGGVVDLLELLGKARAKSDARVTKVPDSTLERPSLWGTQRVDVRTRLHQIAVHLTETAIQIEKIIDDNGEGRAIIRRCCIARGTHERWSPAAERAVLDESYRALGP*
Ga0134075_1033526713300014154Grasslands SoilGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSLLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATQAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGELRSIIRRCGTTRGMHERWSPAAERAV
Ga0134089_1011660123300015358Grasslands SoilMPPPLRELAAVARLDDEALSQKVGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSVLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATRAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGELRSIIRRCGT
Ga0134074_115271813300017657Grasslands SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAMAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDTRLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQL
Ga0184605_10002103103300018027Groundwater SedimentRREQAIAASGESSAPSEVSRILDFAQAAYGDVVGVLVGRDDGLLDTARDGEWSLRDVLRHAMAVELRYAAQIEYSATRSDTDPVPIPPDLLPCDRLSPTEPAFAASRIGGVLELLQLLGQARAGSDTRLARVPAPALTRPSMWGNQLLDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAITRRCCITRGMHERWSPAEERAVLDDSYHRLTA
Ga0184619_1003599323300018061Groundwater SedimentMPPPLAELAKVARLDQEELVGGTRSSRRDALYGLLRREQAIAASGESSAPSEVSRILDFAQAAYGDVVGVLVGRDDGLLDTARDGEWSLRDVLRHAMAVELRYAAQIEYSATRSDTDPVPIPPDLLPCDRLSPTEPAFAASRIGGVLELLQLLGQARAGSDTRLARVPAPALTRPSMWGNQLLDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAITRRCCITRGMHERWSPAEERAVLDDSYHRLTA
Ga0184619_1012219323300018061Groundwater SedimentMPPPLGELAAVARLDDEALAGDKRGSRRDALYGLLRREQAIAPSGESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDDEWSLRDVLRHSMAVGLRYAAQVEYSATRSDTDPVAIPPSLLPCDRLSPPDPEFAASRTSGVLELLQLLGQARAGSDTRLANVLAPALTRPSMWGTQLLDVRMRLHQIAVHLTETAIQIEKIVGGGGELRAIIRRCCVTRGMHERWSPAEERAVLDDSYHRLTA
Ga0184618_1010975423300018071Groundwater SedimentMPPPLEELAAVARLDDEALVGDTRRSRRDVLYGLLRREQAIAASGESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWTLRDVLRHAVAVELRYAAQVEYSATRLDIDPVAIPPGLLPCDRLSPPDPEFAASRTSGVLELLDLLGNARASSDTRLTKVPDSALTRPSLWGTKLLDVRMRLHQIAVHLTETS
Ga0066667_1048751313300018433Grasslands SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPALLPCDRLSPPESEFGLSRSGGVLQLLRLLGTARTSTDARLAKLPDATLGRPSLWGKQQIDVRMRLHQIAVHLTETAVQIEK
Ga0066669_1063486013300018482Grasslands SoilMPPPLDALAALVRLDDGALATDERSSRQDTLYGVLRREQAIAASAQPARSEVSRILDFAQAALGDVVGVLVGREDRLLDSARDGEWSLRDVFRHAIAVELRYAAQVEYSASRADTDPIQIPPALLPCDRLSLPDGEFGSSRSGGVLELLQLLGTARTSTDARLARLPDAALGRPSLWGKQQIDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAIIRRCCMTRGMHERWSPPRERTVMDDSYRALTA
Ga0193747_100414513300019885SoilMPPPLDALAAVVRLDGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGMHERWSTAADRAVLDDSYRALTA
Ga0193747_100423423300019885SoilMPPPLDSLAAVARSDDLPEIADARASGRDALYDLLRREQAIVAPGPSSDRSEVARIVDLAQAAYGDVVGVLVGRHNDLLDTARDGDWSLRDVLRHAMAVELRYAAQIEYSATRSDADPVQIPQGLLPCDRLSPPDPKFVGSRGGGMLELLHLLGNARGDSDTRLAKVPDSTLTRPSLWGTQLLDVRMRLHQIAVHLTETAIQVEKIVGGGGELRAIIRRCCITRGMHERWSSAAERAGLDDLYRALTP
Ga0193731_1000852133300020001SoilMPPPLDALAAVVRLDGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGVHERWSTAADRAVLDDSYRALTA
Ga0210382_10001362113300021080Groundwater SedimentMPPPLAELAKVARLDQEELVGDTRSSRRDALYGLLRREQAIAASGESSAPSEVSRILDFAQAAYGDVVGVLVGRDDGLLDTARDGEWSLRDVLRHAMAVELRYAAQIEYSATRSDTDPVPIPPDLLPCDRLSPTEPAFAASRIGGVLELLQLLGQARAGSDTRLARVPAPALTRPSMWGNQLLDVRMRLHQIAVHLTETAIQIEKIVGSGGELRAITRRCCITRGMHERWSPAEERAVLDDSYHRLTA
Ga0210382_10001596103300021080Groundwater SedimentMPPPLEELAAVARLDDEALVGDTRRSRRDVLYGLLRREQAIAASGESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDGEWTLRDVLRHAVAVELRYAAQVEYSATRLDSDPVAIPPGLLPCDRLSPPDPEFAASRTSGVLELLDLLGNARASSDTRLTKVPDSALTRPSLWGTQLLDVRLRLHQIAVHLTETAIQIEKIVGGGGEVRTVIRRCCITRGMHERWSPAEDRAALDDSYHRLTA
Ga0224452_100434633300022534Groundwater SedimentMPPPLAELAAVARLDNEALVGDTRRSRRDVLYGLLRREQAIAASRESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDAEWSLRDVLRHAMAVELRYAAQVEYSGTRLDIDPVAIPPGLLPCDRLSPPDPEFAASRTGGVLELLDLLGNARASSDTRLTKVPDSALTRPSLWGTKLLDVRLRLHQIAVHLTETAIQIEKIVGGGGEPRAILRRCCITRGMHERWSPVGERAVLDESYRAMSV
Ga0207646_1000246893300025922Corn, Switchgrass And Miscanthus RhizosphereMPPPLGEIAAVARLDDEALGHDARGSRRDVLYGLLRREQAIAASGASPRSEVSRILDFAQAAYGDVVGVLVGRDDKLLDIARDGEWTLRDMLRHAMAVELRYAAQVDYSATRSDGDPVQIPPALLPCDRLSPPEPAFAASRTGKVLELLQLLGEARASSDARLAKVPDSALARPSMWGAQLLDVRIRLHQVAVHLTETAIQIEKIVGSGGELRAIIRRSCITRGMHERWSPGKERAALDESYHALKV
Ga0207646_1078564323300025922Corn, Switchgrass And Miscanthus RhizosphereAIARLDDDALVADKRSARRGVLYGLLRREQAISASRQSSSRSEVSRILDFAQAAYGDVVGALVGRDDALLDTARDAEWSLRDVLRHAMAVELRYAAQVEYSATRSERDPVQIPPGLLPCDRLSPPETEFAASRSGGVLELLHLLGNARTSTDARLTKVPDSVLTRPSMWGTQLLDVRLRLHQIAVHLAETAIQIEKMVGRGCELRAIIRRCCIARGLHERWSPAEERAALDDSYTASRR
Ga0209237_1001282113300026297Grasslands SoilVPPPLRELAAFAGRSDETLSEKVGAAGNRISRRDALYGLLRREQAIVASGESSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGDWSLRDVMRHAMAVELRYAAQVEYSAARAESDPVAIPPGLLPCDRLSPPEPEFAHSRDGLLLDLLELLGKARAGSDVRLAKVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETTIQVEKIVGGVGELRATLRRCCLTRGMHERWSPEKERDVLDESYRRLTA
Ga0209686_102085923300026315SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209471_102499643300026318SoilVPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSTTRAETDPIGIPPGLLPCDRLSPPEPEFAHSREGAVVDLLELLGKARAGGDVRLARVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTA
Ga0209471_105692823300026318SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGREDRLLDSARDSEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209470_111262023300026324SoilMPPPLRELGTVARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLIAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDKALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCITRGMHERWSSAADRAGLDESYRALTA
Ga0209802_103424453300026328SoilVPPPLRELAAFAGRSDETLSEKVGAAGNRISRRDALYGLLRREQAIVASGESSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGDWSLRDVMRHAMAVELRYAAQVEYSAARAESDPVAIPPGLLPCDRLSPPEPEFAHSRDGLLLDLLELLGKARAGSDVRLAKVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQV
Ga0209375_104767933300026329SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPSDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209267_100704783300026331SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDTRLARLPDAVLERPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209803_103619153300026332SoilRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209159_101687363300026343SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGREDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209690_103880523300026524SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAMAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERDTIDDSYRTLAGQLS
Ga0209806_104324523300026529SoilMPPPFSAVGVLAGSSDGALGQKVVMDGVRGERRFFMYDLLRREYALAAAKQPPGRSEAIRILDFAQAAYGDVVGLLGGREDALLDSARDGDWSLRDVLRHAIAVELRYGAQVEYSATRAESDPVEIRPDLLPCDRLSPPDPGFSSSRDGGVLDLLELLGKARTTSDVRVARVSDSTLMRPSLWGKARLDVRMRLHQFGVHLVETAIQIEKIVDGCDEARMIIRRCCSMRGLHERWSPADERTILDESYRALSV
Ga0209806_129769713300026529SoilRDALYSLLRREQAIVASAEFSSRSEVSRILDLAQAAYGDLVGILVGKDDRLLDSARDREWSLRDVMRHAIAVEVRYAAQVEYSATRSESDPIGIPPGLLPCDRLAPPEPEFTRSRDGGVVELLELLGRARAGSDMRLAGVPDSALARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIV
Ga0209160_101844743300026532SoilVPPPLRELAAVARYGDEALAEKVGAAGYRISRRETLYGLLRREQAIVASDEASSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQIEYSATRAETDPIGIPAGLLPCDRLSPPEPEFAHSRDGAVVDLLELLGNARAGGDVRLAKVPDSAFARPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKILGGGGELRAILRRCCVTRGMHERWSPEKGRTVLDESYRALTG
Ga0209058_101311193300026536SoilMPPPLRELGAVARLSDETLAQKVGAAGSRIPLGVALYGLLRREQAMFASGESRPRSEVSRILDFAQAAYGDLIAVLVGRDDSVLDTSRDGEWSLRDLLRHAIAVELRYAAQVDYSATRAESDPVEIRPSLLPCDRLSPPEPEFAGSRDGGILDVLELLGKARAGSDVRLAKVPDQALTRPSVWGMARVDVRLRLHQMAAHLTESAIQTEKIVGGGGEPRAIVRRCCITRGMHERWSSAADRAGLDESYRALTA
Ga0209058_120883313300026536SoilSVARQSDEALAEKTGAAGNRISRRDAWYGLLRREQAIVASSASSSRSEVSRILDFAQAAYGDLVGIMVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRAESDPVGIPPGLLPCDRLSPPEPEFAHSRDGGILKLLELLGATRAATDARLAKVPDSALTRASLWGTMNLDVRMRLHQIAAHLTETAIQVEKIIGYGGEVRAILRRCCITRGMHERWSPAKEREVLDESYRTLTA
Ga0209157_101433843300026537SoilMPPPLRELAAVARLDDEALSQKVGVAGNRISRRDALYGLLRREQGMFATGESRPRSEVSRILDFAQAAYGDLVGLLVGREDSLLDTGRDGDWSLRDLLRHAIAVELRYAAQVEYSATRAESDPVEIRPGLLPCDRLSPPEPEFAGSRDGGIPDVLELLGKARAGSDVRLANVPDTALTRPSLWGTARIDVRMRLHQIAAHLTESTIQIEKIVGSAGELRSIIRRCGTTRGMHERWSPAAERAVLDESYRALAP
Ga0209056_1008808653300026538SoilFDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKQQIDVRMRLHQIAAHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0209474_1001384663300026550SoilVPPPLDHLAAIAVGDTNSSRRDTLYGLLRREQAIAASAQPSRSEVSRILDFAQAAHGDVVGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPAFLPCDRLSPPESEFGLSRSGGVLELLRLLGNARTSTDARLARLPDAVLGRPSLWGKRQIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIVRRCCITRGMHERWSLATERGTIDDSYRTLAGQLS
Ga0137415_1038731523300028536Vadose Zone SoilMPLPLRELAVFARQSDEALAEKIGAAGTRISRHDALYGLLRREQAIVASGASSSRSEVSRILDFAQAAYGDLVGILVGRDDRLLDSARDGEWSLRDVMRHAIAVELRYAAQVEYSATRADSDPVGIPPGLLPCDRRSPPEPEFAHSRDGGILKLLELLGAARAGTDARLAKVPGSALTRPSLWGTMNLDVRMRLHQIAAHLTETAIQVEKIVGYGGELRAILRRCCMTRGMHERWSPAKEREVLDESYRTLTA
Ga0307282_1000525973300028784SoilMPPPLDALAAVVRLDGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGTHEHWSLATVRGTIDDSYRSLAGDLS
Ga0307302_1016508823300028814SoilMPPPLAELAAVARLDNEALVGDTRRSRRDVLYGLLRREQAIAASGESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDAEWSLRDVLRHAMAVELRYAAQVEYSGTRLDIDPVAIPPGLLPCDRLSPPDPEFVASRTGGVLELLQLLGQARAGSDTRLANVLAPALTRPSMWGTQLLDVRMRLHQIAVHLTETAIQIEKIVGGGGELRAILRRCCITRGMHERWSPAEERAVLDDSYHRLTA
Ga0307296_1021733233300028819SoilASRESSARSEVSRILDFAQAAYGDVVGVLVGRDDSLLDTARDADWSLRDVLRHAMAVELRYAAQVEYSATRLDIDPVAIPPGLLPCDRLSPPDPEFAASRIGGVLELLELLGIARAGRDPRLAKVPDSALTRPSLWGTQLLDVRVRLHQIAVHLTETAIQIEKIVGGGGEPRAILRRCCITRGMHERWSPVGERAVLDESYRAMSV
Ga0307312_1008814043300028828SoilGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPPELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGTHEHWSLATVRGTIDDSYRSLAGDLS
Ga0307277_1009583023300028881SoilMPPPLDALAAVVRVDEDALAADERGRRRDALYGLLRREQAIAAAAQPSSRAEVSRILDFAQAAYGDVIGVLIGRDDSLLDSTRDGEWSLRDVVRHAMAVELRYAVQVEYSVTRADADPVQIPAELLPCDRLSPPESEFGPSRNGGVLELLRLLGNARASSDIRLARLPDTSLGRPSLWGKQRVDVRIRLHQIGVHLSETAIQIEKIVGSGGEIRAIIRRCCITRGMHERWSASEERAILDDSYRALTT
Ga0307308_1001063663300028884SoilMPPPLDALAAVVRLDGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGREDRLLDSARDGEWNLRDVSRHAIAVELRYAAQVEYSASRADTDPIQIPPELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGMHERWSIAADRAVLDDSYRALTA
Ga0307304_1014978923300028885SoilMPPPLDALAAVVRLDGDALAADKRSSRRDALYGLLRREQAIAALAQPARSEVSRILDFAQAAYGDVIGVLVGRDDRLLDSARDGEWSLRDVLRHAIAVELRYAAQVEYSASRADTDPIQIPLELLPCDRLSPPESEFGPSRSGGVLQLLRLLGNARTSTDTRLARLPDAALGRPSLWGKQHIDVRMRLHQIAVHLTETAIQMEKIVGSGGELRAIIRRCCITRGVHERWSTAADRAVLDDSYRALTA
Ga0307473_1021931423300031820Hardwood Forest SoilGVLYGLLRREQAISASRQSSSRSEVSRILDFAQAAYGDVVGALVGRDDALLDKARDAEWSLRDVLRHAMAVELRYAAQVEYSATRSERDPVQIPPGLLPCDRLSPPETEFVASRSGGVLELLHLLGNARTGTDARLTKVPDSVLTRPSMWGTQLLDVRLRLHQIAVHLAETAIQIEKMVGSGGELRAIIRRCCIARGLHERWSPAEERAALDDSYTASRR
Ga0307471_10047162623300032180Hardwood Forest SoilVPPPLAALTALTRLDEDAFVGEKRTSRRDPLYGLLRREQVIFAAEPRSPRSEVSRILDLAQASYGDVVGVLVGRLDSLLDTTRDGEWSLRDLLRHAMAVERRYAAQVEYSSTRSNSDPIQIPPSLLPSDRLSPPEPEFAASRNGGVLELLQLLERARAGSDTRLAKVPDSALTRPSMWGTQQIDVRLRLHQIAAHLTETAIQVEKMIGAGGELRAIIRRCCITRGLHERWSPAGERQILDESYRS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice