NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F102996



Overview

Basic Information
Family ID F102996
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 280 residues
Representative Sequence KPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFNDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLAEGKLDLTSVHGLQWAPLSERALGLSPKLFWALVATKPERTLH
Number of Associated Samples 80
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 2.00 %
% of genes near scaffold ends (potentially truncated) 96.04 %
% of genes from short scaffolds (< 2000 bps) 81.19 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Bacteria (96.040 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(35.644 % of family members)
Environment Ontology (ENVO) Unclassified
(46.535 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.475 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Topology & Secondary Structure
Classification Globular
Signal Peptide No
Secondary Structure Distribution α-helix: 45.88%, β-sheet: 5.73%, Coil/Unstructured: 48.39%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 101 Family Scaffolds
PF02142	MGS	27.72
PF00834	Ribul_P_3_epim	10.89
PF01189	Methyltr_RsmB-F	4.95
PF00884	Sulfatase	0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 101 Family Scaffolds
COG0036	Pentose-5-phosphate-3-epimerase	Carbohydrate transport and metabolism [G]	10.89
COG0144	16S rRNA C967 or C1407 C5-methylase, RsmB/RsmF family	Translation, ribosomal structure and biogenesis [J]	4.95



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	96.04 %
Unclassified	root	N/A	3.96 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300002245|JGIcombinedJ26739_100560464	All Organisms → cellular organisms → Bacteria	1020
3300002560|JGI25383J37093_10005683	All Organisms → cellular organisms → Bacteria	3954
3300003321|soilH1_10044325	All Organisms → cellular organisms → Bacteria	1145
3300005167|Ga0066672_10179155	All Organisms → cellular organisms → Bacteria	1342
3300005171|Ga0066677_10371612	All Organisms → cellular organisms → Bacteria	818
3300005171|Ga0066677_10447700	All Organisms → cellular organisms → Bacteria	739
3300005177|Ga0066690_10063173	All Organisms → cellular organisms → Bacteria	2295
3300005181|Ga0066678_10246742	All Organisms → cellular organisms → Bacteria	1154
3300005186|Ga0066676_10544335	All Organisms → cellular organisms → Bacteria	787
3300005447|Ga0066689_10009472	All Organisms → cellular organisms → Bacteria	4347
3300005450|Ga0066682_10491340	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Frankiales → Frankiaceae → Frankia → unclassified Frankia → Frankia sp. EUN1f	778
3300005467|Ga0070706_100154977	All Organisms → cellular organisms → Bacteria	2139
3300005518|Ga0070699_100414716	All Organisms → cellular organisms → Bacteria	1218
3300005536|Ga0070697_101215316	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. ScaeMP-e10	672
3300005546|Ga0070696_100648519	All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae → Silanimonas → Silanimonas lenta	856
3300005559|Ga0066700_10474609	All Organisms → cellular organisms → Bacteria	875
3300005559|Ga0066700_10754773	All Organisms → cellular organisms → Bacteria	660
3300005561|Ga0066699_10143354	All Organisms → cellular organisms → Bacteria	1622
3300005574|Ga0066694_10230115	All Organisms → cellular organisms → Bacteria	879
3300005574|Ga0066694_10287705	All Organisms → cellular organisms → Bacteria	782
3300005576|Ga0066708_10205697	All Organisms → cellular organisms → Bacteria	1237
3300005576|Ga0066708_10567874	All Organisms → cellular organisms → Bacteria	729
3300005576|Ga0066708_10688136	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Frankiales → Frankiaceae → Frankia → Frankia saprophytica	648
3300005598|Ga0066706_10303285	All Organisms → cellular organisms → Bacteria	1257
3300005598|Ga0066706_10818848	All Organisms → cellular organisms → Bacteria	732
3300006032|Ga0066696_10485894	All Organisms → cellular organisms → Bacteria	810
3300006791|Ga0066653_10161122	All Organisms → cellular organisms → Bacteria	1104
3300006796|Ga0066665_10750379	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Frankiales → Frankiaceae → Frankia → unclassified Frankia → Frankia sp. EUN1f	772
3300006797|Ga0066659_10185618	All Organisms → cellular organisms → Bacteria	1510
3300006800|Ga0066660_10308812	All Organisms → cellular organisms → Bacteria	1266
3300006871|Ga0075434_101312540	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycobacterium → unclassified Mycobacterium → Mycobacterium sp. UNCCL9	734
3300007255|Ga0099791_10075073	All Organisms → cellular organisms → Bacteria	1536
3300007258|Ga0099793_10363345	All Organisms → cellular organisms → Bacteria	709
3300009012|Ga0066710_102216885	All Organisms → cellular organisms → Bacteria	801
3300009137|Ga0066709_100707862	All Organisms → cellular organisms → Bacteria	1449
3300009137|Ga0066709_101661548	All Organisms → cellular organisms → Bacteria	909
3300009176|Ga0105242_10316414	All Organisms → cellular organisms → Bacteria	1430
3300010321|Ga0134067_10217173	All Organisms → cellular organisms → Bacteria	709
3300010326|Ga0134065_10082713	All Organisms → cellular organisms → Bacteria	1040
3300010336|Ga0134071_10439278	All Organisms → cellular organisms → Bacteria	668
3300010399|Ga0134127_10982610	All Organisms → cellular organisms → Bacteria	903
3300012202|Ga0137363_10540822	All Organisms → cellular organisms → Bacteria	981
3300012203|Ga0137399_10220525	All Organisms → cellular organisms → Bacteria	1546
3300012205|Ga0137362_10366149	All Organisms → cellular organisms → Bacteria	1248
3300012362|Ga0137361_10145072	All Organisms → cellular organisms → Bacteria	2116
3300012582|Ga0137358_10273406	All Organisms → cellular organisms → Bacteria	1148
3300012683|Ga0137398_10024688	All Organisms → cellular organisms → Bacteria	3316
3300012685|Ga0137397_10332209	All Organisms → cellular organisms → Bacteria	1133
3300012918|Ga0137396_10370688	All Organisms → cellular organisms → Bacteria	1060
3300012918|Ga0137396_10461824	All Organisms → cellular organisms → Bacteria	941
3300012922|Ga0137394_10394060	All Organisms → cellular organisms → Bacteria	1181
3300012922|Ga0137394_10496930	All Organisms → cellular organisms → Bacteria	1036
3300012922|Ga0137394_10883539	All Organisms → cellular organisms → Bacteria	746
3300012922|Ga0137394_10904277	All Organisms → cellular organisms → Bacteria	737
3300012922|Ga0137394_10954874	All Organisms → cellular organisms → Bacteria	714
3300012922|Ga0137394_10971874	All Organisms → cellular organisms → Bacteria	706
3300012923|Ga0137359_10159179	All Organisms → cellular organisms → Bacteria	2014
3300012923|Ga0137359_10194553	All Organisms → cellular organisms → Bacteria	1811
3300012927|Ga0137416_10553090	All Organisms → cellular organisms → Bacteria	997
3300012929|Ga0137404_10478169	All Organisms → cellular organisms → Bacteria	1109
3300012929|Ga0137404_10506583	All Organisms → cellular organisms → Bacteria	1077
3300012977|Ga0134087_10018252	All Organisms → cellular organisms → Bacteria	2534
3300014157|Ga0134078_10053186	All Organisms → cellular organisms → Bacteria	1408
3300015052|Ga0137411_1100020	All Organisms → cellular organisms → Bacteria	1850
3300015054|Ga0137420_1293471	All Organisms → cellular organisms → Bacteria	3690
3300015241|Ga0137418_10053206	All Organisms → cellular organisms → Bacteria	3716
3300015241|Ga0137418_10054279	All Organisms → cellular organisms → Bacteria	3676
3300015241|Ga0137418_10387857	All Organisms → cellular organisms → Bacteria	1143
3300015264|Ga0137403_10067470	All Organisms → cellular organisms → Bacteria	3639
3300015264|Ga0137403_10071038	All Organisms → cellular organisms → Bacteria	3535
3300015264|Ga0137403_10246570	All Organisms → cellular organisms → Bacteria	1699
3300015264|Ga0137403_10469314	All Organisms → cellular organisms → Bacteria	1132
3300017930|Ga0187825_10168507	All Organisms → cellular organisms → Bacteria	779
3300018433|Ga0066667_10779891	All Organisms → cellular organisms → Bacteria	810
3300018433|Ga0066667_10939163	All Organisms → cellular organisms → Bacteria	745
3300025910|Ga0207684_10132028	All Organisms → cellular organisms → Bacteria	2143
3300025912|Ga0207707_10269446	All Organisms → cellular organisms → Bacteria	1476
3300025981|Ga0207640_10431928	All Organisms → cellular organisms → Bacteria	1081
3300026300|Ga0209027_1083062	All Organisms → cellular organisms → Bacteria	1175
3300026301|Ga0209238_1102194	All Organisms → cellular organisms → Bacteria	970
3300026305|Ga0209688_1008777	All Organisms → cellular organisms → Bacteria	1883
3300026312|Ga0209153_1011311	All Organisms → cellular organisms → Bacteria	2817
3300026315|Ga0209686_1049152	All Organisms → cellular organisms → Bacteria	1548
3300026317|Ga0209154_1017256	All Organisms → cellular organisms → Bacteria	3417
3300026318|Ga0209471_1112467	All Organisms → cellular organisms → Bacteria	1166
3300026322|Ga0209687_1013714	Not Available	2679
3300026330|Ga0209473_1200396	All Organisms → cellular organisms → Bacteria	756
3300026335|Ga0209804_1163513	All Organisms → cellular organisms → Bacteria	982
3300026523|Ga0209808_1003692	All Organisms → cellular organisms → Bacteria → Proteobacteria	7840
3300026532|Ga0209160_1197775	All Organisms → cellular organisms → Bacteria	776
3300026547|Ga0209156_10200707	All Organisms → cellular organisms → Bacteria	950
3300026548|Ga0209161_10098169	Not Available	1772
3300026550|Ga0209474_10106541	All Organisms → cellular organisms → Bacteria	1879
3300026552|Ga0209577_10165488	All Organisms → cellular organisms → Bacteria	1726
3300026557|Ga0179587_10144120	Not Available	1480
3300027671|Ga0209588_1078688	All Organisms → cellular organisms → Bacteria	1065
3300027903|Ga0209488_10240931	All Organisms → cellular organisms → Bacteria	1357
3300028536|Ga0137415_10454786	All Organisms → cellular organisms → Bacteria	1086
3300030998|Ga0073996_10134223	All Organisms → cellular organisms → Bacteria	679
3300031720|Ga0307469_10716048	All Organisms → cellular organisms → Bacteria	909
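
Each scaffold identifier in the table above appears to join a numeric prefix and a scaffold name with `|`, and the numeric prefix matches a Taxon OID in the Associated Samples table (for example 3300002245). A minimal Python sketch of splitting an identifier under that assumption follows; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names used only for illustration and are not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Two parts of a scaffold identifier (assumed layout: '<taxon OID>|<scaffold name>')."""
    taxon_oid: str
    scaffold_name: str


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split an identifier such as '3300002245|JGIcombinedJ26739_100560464'."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    # Reject anything that does not follow the assumed layout.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300002245|JGIcombinedJ26739_100560464")
print(ref.taxon_oid, ref.scaffold_name)
```

The taxon OID obtained this way can be used to look up the sample name and habitat type of the scaffold in the Associated Samples table.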




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	35.64%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	33.66%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	7.92%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	4.95%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	4.95%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	2.97%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment	0.99%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.99%
Sugarcane Root And Bulk Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil	0.99%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	0.99%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	0.99%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	0.99%
Corn Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere	0.99%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	0.99%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	0.99%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	0.99%
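
The habitat percentages can be converted back into member counts. The Python sketch below assumes that each percentage is a share of the 101 family members reported in the Overview; `members_from_percent` is a hypothetical helper written for this illustration, not something the database provides.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported in the Overview

# Distribution column of the habitat table, in row order (six distinct values,
# then ten habitats at 0.99 % each).
habitat_percent = [35.64, 33.66, 7.92, 4.95, 4.95, 2.97] + [0.99] * 10


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a percentage of the family into a whole number of members."""
    return round(percent / 100 * family_size)


habitat_counts = [members_from_percent(p) for p in habitat_percent]
print(habitat_counts)
print(sum(habitat_counts))
```

Under that assumption the first row (Vadose Zone Soil, 35.64%) corresponds to 36 members, and the rounded counts for all 16 rows add up to 101.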




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300002560	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm	Environmental
3300003321	Sugarcane bulk soil Sample H1	Environmental
3300005167	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121	Environmental
3300005171	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126	Environmental
3300005177	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005186	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125	Environmental
3300005447	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138	Environmental
3300005450	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005546	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG	Environmental
3300005559	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149	Environmental
3300005561	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148	Environmental
3300005574	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143	Environmental
3300005576	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157	Environmental
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300006032	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145	Environmental
3300006791	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102	Environmental
3300006796	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114	Environmental
3300006797	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108	Environmental
3300006800	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109	Environmental
3300006871	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3	Host-Associated
3300007255	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1	Environmental
3300007258	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3	Environmental
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009176	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG	Host-Associated
3300010321	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015	Environmental
3300010326	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG	Environmental
3300010336	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015	Environmental
3300010399	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3	Environmental
3300011270	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG	Environmental
3300012202	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG	Environmental
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012205	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG	Environmental
3300012362	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG	Environmental
3300012582	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG	Environmental
3300012683	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG	Environmental
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012922	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012927	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012977	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG	Environmental
3300014157	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG	Environmental
3300015052	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)	Environmental
3300015054	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)	Environmental
3300015241	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)	Environmental
3300015264	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)	Environmental
3300017930	Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5	Environmental
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025912	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)	Environmental
3300025981	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)	Host-Associated
3300026300	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)	Environmental
3300026301	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)	Environmental
3300026305	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes)	Environmental
3300026312	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes)	Environmental
3300026315	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes)	Environmental
3300026317	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)	Environmental
3300026318	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)	Environmental
3300026322	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)	Environmental
3300026330	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)	Environmental
3300026335	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)	Environmental
3300026523	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes)	Environmental
3300026532	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)	Environmental
3300026547	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)	Environmental
3300026548	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)	Environmental
3300026550	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)	Environmental
3300026552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)	Environmental
3300026557	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal	Environmental
3300027671	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)	Environmental
3300027903	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)	Environmental
3300028536	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)	Environmental
3300030998	Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-3A (Eukaryote Community Metatranscriptome)	Environmental
3300031720	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515	Environmental





Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10056046423300002245Forest SoilLLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQEKMAEEVTFASLFQHPHPSLDGRTYAQALANELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFSEGKNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAYGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKAGRTLH*
JGI25383J37093_1000568353300002560Grasslands SoilRLARADVAALKILAKPVSQGGVQLNAAAESTMPYPKIPDVRLYAVGGPAPTAKIDEDATFASLFSAPHPSLGGRMYVQAMADELRRRGAFDGAPSPFQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQAMFAANGLALTHRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERALGLSPKEFWALVATKSPRTVH*
soilH1_1004432513300003321Sugarcane Root And Bulk SoilLALDPVTARLFLRCDGERSLGQVLADDGPQALDKLLRLARADVAALKILSRPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPAPQARIEEEASLASLFAAPHPSLGGRTYAQAIADELRRRGAFEGREPFQVLSLGLPMDLPEAQVETNIAKVQQPESYRAIVVNEIGAQLGFSDGKNSGLIRLVTDAATSLAPGGVLIVGDYGDPKGEAAPGSVRFADLLERATQSGLNARVVPLAEVLNLDMNAQALSTTRASLPALQALFAAHGLALTRRAWLRSEIEQMAEGRLDLATVHGLQWAPLSERALGLSPKQYWALVATKPERTLH*
Ga0066672_1017915513300005167SoilYTRAGKALPLDAVTSRLFLRCNGERSLGQVLADAGSTALEPLLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPAPTAKIDEDATFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPCQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLVTQTAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERAMGLSPKEFWALVATKSPRTVH*
Ga0066677_1037161213300005171SoilGLLRLARADVAALKILAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPSPQSKLEEEPTFASLFATPHPSLGGRTYAQAMGDELRRRRAFDGVSGPPQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLVVADYGDPKAEATPNSVSFKDVQTQAARSGLTGRIVPLSEVLGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLS
Ga0066677_1044770013300005171SoilLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLS
Ga0066690_1006317343300005177SoilMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT*
Ga0066678_1024674213300005181SoilPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH*
Ga0066676_1054433513300005186SoilFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSIGFADLLTRAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0066689_1000947253300005447SoilFLRCNGERSLGQVLADAGSTALEPLLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPAPTAKIDEDATFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPCQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQTAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERAMGLSPKEFWALVATKSPRTVH*
Ga0066682_1049134013300005450SoilLYAAGGPAPQQKVAEDATLASLFQDPHPSLDGQTYAQALARELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYHAVVASEIGAQLGFSERKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLHRQAEQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0070706_10015497733300005467Corn, Switchgrass And Miscanthus RhizospherePRGAVFQRKDEDVTIYTRAGKPMPLDAVTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKIEQESTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDDVPPPFQVLSLGLAMELPQARVETNIARVQQAGAYRAVILNEIGSQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEARPDSIGFADLLARAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANDLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALGLAPKDFWALVAVKPPRTIH*
Ga0070699_10041471613300005518Corn, Switchgrass And Miscanthus RhizosphereVPRGAVFQRKDEDVTIYTRAGKAMPLDAVTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKIEQESTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDDVPPPFQVLSLGLAMELPQARVETNIARLQQAGAYSAVILNEIGSQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEARPDSIGFADLLARAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANDLALTRRAWLRSEIEQLARAKLDLSTVHGLQWAPMSERALGLAPKDFWALVAVKPPRTIH*
Ga0070697_10121531613300005536Corn, Switchgrass And Miscanthus RhizosphereLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMEMPQARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADRGDPKAEAMPDSISFADLLTRATQSGLRGRVVPLWEALDLDVNGQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPLSERALGLAPKDYWALVA
Ga0070696_10064851913300005546Corn, Switchgrass And Miscanthus RhizosphereRLARADVAALKILARPVSQGGVQLNQAAESTMPYPEITDVRLYAAGGPAPQQKIDDEPTFAALFASPHPSLGGRTYGQALADELHRRGAFDRPGPFRVLSLGLPLQIEGASIETNIANVQQPESYQAVVVNEIGAQLGFSDGKNSGLIRLVTDAANALAPGGVLIVGDYGDPKADAAPGSVGFADLLERATQSGLTARVVPLAEVLGLDLNAQALSTTRASLPALRALFAKHGLSLTRRAWLRSEIEQLADGKLDLANVHGLQWAPLSERALGLSPKSYWALVAS
Ga0066700_1047460913300005559SoilLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPK
Ga0066700_1075477313300005559SoilAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALATGGVLIVADHGDPKAAATPDSISFADLLTRAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHG
Ga0066699_1014335433300005561SoilSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT*
Ga0066694_1023011513300005574SoilLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0066694_1028770513300005574SoilKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQKVAEDATFASLFQDPHPSLDGQTYAQALARELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYHAVVASEIGAQLGFSERKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQNVGFADLHRQAEQSGLGARVVPLAEVLDLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSER
Ga0066708_1020569713300005576SoilADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQKVAEEATFASLFQDPHPSLDGQTYAQALASELRRRGAFAGSPPFQVLSLGLALELPDARMETNIARVQEPESYHAVVASEIGAQLGFSERKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLHRQAEQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0066708_1056787413300005576SoilAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLS
Ga0066708_1068813613300005576SoilLEEEPTFASLFATPHPSLGGRTYAQAMGDELRRRRAFDGVSGPPQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLVVADYGDPKAEATPNSVSFKDVQTQAARSGLTGRIVPLSEVLGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLS
Ga0066706_1030328523300005598SoilLEALLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH*
Ga0066706_1081884813300005598SoilADVAALKILAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAGELRRRGAFDGVPAPHQVLSLGLAIELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADASGALAPGGVLIVADHGDPKAEATPNSVSFKDLQAQAARSGLSGRIVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIE
Ga0066696_1048589413300006032SoilLRLARADVAALKILAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPSPQSKLEEEPTFASLFATPHPSLGGRTYAQAMGDELRRRRAFDGVSGPPQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLVVADYGDPKAEATPNSVSFKDVQTQAARSGLTGRIVPLSEVLGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLS
Ga0066653_1016112223300006791SoilEALLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH*
Ga0066665_1075037913300006796SoilAAESTMPYPEIPDVRLYAVGGPSPQSKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLVIELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLSGRIVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEELAEGRLDLSTVHGLHWAPLSERALGLSPKQFWA
Ga0066659_1018561833300006797SoilARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLFAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT*
Ga0066660_1030881223300006800SoilPRVPRGAVFQRKDEEVTVYTRAGKALPLDAVTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVYGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAQGT
Ga0075434_10131254013300006871Populus RhizosphereRADVAALKILAKPVSHGGVQLNPAADSTMPYPEISDPRAYSAGAAAPAQKLEEETTFASLFQDPHPSLGGRTYAQAIADELRRRGAFDGPPPSNVLSLGLPLEVAGARVETSIAKLQQPESYQAVVVNEIGAQLGFSEGRNSGLLRLVADTAVALAPGGVLLVADYGDAKAGATPQSISFTDLEKQAALSGLTSRVVPLAEALNLDGNGQALSTTRASFPALQALFAAHGLSLTRRAWLRSEIE
Ga0099791_1007507333300007255Vadose Zone SoilIEQEQTFASLFHDPHPSLGGRTYAQAMAEELRRQGAFKTAQPWQVLSLGLQLEIPGTTVETNIAQVQQPESYRAVVVNEIGAQLGFSEGKNSGLLRLVADAASALAPGGVLIVADFGDARDEATANGIAFTDLAQQAGQSGLEGRVVPLAEVLNLDLNSQALSTTRASLPALQALFAAHGLSLTRRAWLRSEIEALAQGKLDLTTVHGLQWAPLSERALGLSPRQFWALVATKPERTLH*
Ga0099793_1036334513300007258Vadose Zone SoilMPYPEIPDVRLYAAGAPAPQQKMAEEATFASLFQDPHPSLEGRTYAQALANELRRRGAFAGSPPFQVLSLGLALELPEARVETNIARVQEPESYRAVVANEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQKQAAQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAAGKLDLSTVHGLQWAPL
Ga0066710_10221688513300009012Grasslands SoilSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALV
Ga0066709_10070786213300009137Grasslands SoilKLEEEPTFASLFATPHPSLGGRTYAQAMAGELRRRGAFDGVPAPHQVLSLGLVIELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADASGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAVRSGLSGRIVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEELAEGRLDLSTVHGLHWAPLSERALGLSPKQFWALVATKPERTLH*
Ga0066709_10166154823300009137Grasslands SoilAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRAGQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGIALTRRAWLRSEIERLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPARTHQRAHGT*
Ga0105242_1031641413300009176Miscanthus RhizosphereLRLARADVAALKILARPVSQGGVQLNQAAESTMPYPEITDVRLYAAGGPAPQQRIDDEPTFAALFASPHPSLGGRTYGQALADELHRRGAFDRPGPFRVLSLGLPLQIEGATVETNIANVQEPESYQAVVVNEIGAQLGFSDGKNSGLIRLVTDAANALAPGGVLIVGDYGDPKAEAAPGSVGFADLLERAMQSGLTARVVPLAEVLGLDLNAQALSTTRASLPALRALFAKHGLSLTRRAWLRSEIEQLADGKLDLANVHGLQWAPLSERALGLSPKSYWALVASKPGRTLH*
Ga0134067_1021717313300010321Grasslands SoilTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPVPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLSLPRRAWLRSEVEKLDEGKIDLSSVHGLQWAPLSESALGLSPKQFWALVATKPERTLH
Ga0134065_1008271313300010326Grasslands SoilNGKGVIVYTRAGTALPLEEMTSRLFLRCDGERSLGQVLGDAGYAPLEALLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPVPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVAAYGDPKAEATPNSVSFNDLQAQAARSVLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH*
Ga0134071_1043927813300010336Grasslands SoilAAESTMPYPEIPDARLYAVGGPPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKEEATPNSVSFKDLQAQAARSGLTGRIVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEID
Ga0134127_1098261013300010399Terrestrial SoilALKILARPVSQGGVQLNQAAESTMPYPEITDVRLYAAGGPAPQQRIDDEPTFAALFASPHPSLGGRTYGQALADELHRRGAFDRPGPFRVLSLGLPLQIEGATVETNIANVQEPESYQAVVVNEIGAQLGFSDGKNSGLIRLVTDAANALAPGGVLIVGDYGDPKADAAPGSVGFADLLERATQSGLTARVVPLAEVLGLDLNAQALSTTRASLPALRALFAKHGLSLTRRAWLRSEIEQLADGKLDLANVHGLQWAPLSERALGLSPKSYWALVASKPGRTLH*
Ga0137391_1044384613300011270Vadose Zone SoilDALKLDPVTAQLFSRCNGEKSLGQVLGDAGPQALPDFLRLARADVAAVKILAKPASQGGVQLNPAAESTMPYPEIPDARAYAKGGPAPEPKADDETTFASLFEGPHKCLGGKSYAQALAFELNRRGAFKEVRGRPARALSLGIDLTEALRKHVPDVKVDANIALIKESEAFDAIVANEIALQLGFSEGKNSGALALVRDAAAALAPGGVLFIADFGDPKADPTPLSVRFADLSAEATRLDLGARVVPLIEAVSLDANELSLSTTKASFPALRALFAAHGLELGKRAWLRSEIEKLAEGKLDLARVQGLQWAPLSERALGLSPKQFWALVAQKPERVLH*
Ga0137363_1054082213300012202Vadose Zone SoilPLDAVTSSLFLRCNGDRSLGQILGDAGPSALEPLLRLARADVAALKILAKPGSQGGLQRNAAAESTMPYPEIPDVRLFAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAGSYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQTAQSGLRGRVVPLWEVLDLDMNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTI
Ga0137399_1022052533300012203Vadose Zone SoilDEDATFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFKVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARLDSVSFADLMTQAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPGTIHQRAHGT*
Ga0137362_1036614923300012205Vadose Zone SoilVFQREDEVVKVYTRAGKAMPLDAVTSSLFLRCNGDRSLGQVLGDAGPGALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKIGEESTFASLFAAPHPSLGGRTYAQAMADEVRRRGAFDGVPSPFHLLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT*
Ga0137361_1014507213300012362Vadose Zone SoilTRAGIALGLDPLTSRLFLRCDGERSLGQVLGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVSLYAAGAPAPQQKMGEEATFASLFQDPHPSLEGRTYAQALANELRRRGAFAGAPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFSEGKNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPLSVGFGDLQRQAAQCGLSARVVPLAEVLDLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLATVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137358_1027340623300012582Vadose Zone SoilTVFERDAAGRVTIYTRAGSALDLDPMSSRLFLRCDGERTLGQVLGDAGPGALGALLRLARADVAALKVLAKPVSQGGVQLNPAAESTMPYPEIPDPRLYAAGAPPPQQRIEQEQTFASLFHDPHPSLGGRTYAQAMAEELRRQGAFKTPQPWRVLSLGLQLEIPGTTVETNIARVQQPESYRAVVVNEIAEQLGFSEGKNSGLLRLVADAASALAPGGVLIVADFGDAQDEATANGIAFTDLAQQAEQSGLEGRVVPLAEVLNLDLNSQALSTTRASLPALQALFAAHGLSLTRRAWLRSEIEALAEGKLDLTTVHGLQWAPLSERALGLSPRQFWALVATKPERTLH*
Ga0137398_1002468843300012683Vadose Zone SoilLYASGAPAPQQMAEEATFASLFQDPHPSLEGRTYAQALASELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQELESYQAVVANEIGMQLGFSEGKNSGLLRLVSDAASALAAGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137397_1033220923300012685Vadose Zone SoilQLMAEEATFASLFQDPHPSLDGRTYAQALANELRRRGAFAGSPPFQVLSLGIALELPDARVETNIARVQEPESYQAVVANEIGAQLGYSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQKQAAQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAAGKLDLSTVHGLQWAPLSERALGLSPRQFWALIATKPGRTLH*
Ga0137396_1037068823300012918Vadose Zone SoilADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQRMPEEATFASLFQDPHPSLGGQTYAQALASELRRRGAFAGSPPFQVLSLGIALELPDARVETNIARVQEPESYQAVVANEIGAQLGYSEGKNSGLLRLVSDAASALAPGGVMIVADYGDPKADATPQSVGFGDLQRQATQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVERLAEGKLDLSTVLGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137396_1046182423300012918Vadose Zone SoilMPYPEIPDARLYAAGDPAPSQKMAEEATFTSLFQEPHPSLEGRTYAQAMASELRRRGAFAGSPPFQVLSLGLALELPAARVETNIARVQEPESYQAVVANEIGTQLGFCEGRNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSSVHGLQWAPLSERALGLSPRHFWALVATKPGRTLH*
Ga0137394_1039406013300012922Vadose Zone SoilRTLGQVLGDAGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEINDVRLYAAGAPAPQQRMAEEATFASLFQAPHPSLEGRTYAQAMANELRRRGAFAGSPPFQVLSLGLALELPDARVETNIACVQEPESYQAVVANEIGTQLGFSERKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137394_1049693013300012922Vadose Zone SoilGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPPQKMAEEATFASLFQEPHPSLEGRTYAQAMASELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFCEGRNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATRQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSRVHGLQWAPLSERALGLSPRHFWALVATKPGPTLH*
Ga0137394_1088353913300012922Vadose Zone SoilQRIEQEQTFASLFHDPHPSLGGRTYAQAMAEELRRQGAFKTPQPWQVLSLGLQLEIPGTTVETNIARVQQPESYRAVVVNEIAEQLGFSEGKNSGLLRLVADAASALAPGGVLIVADFGDAKDEATPNGIAFTDLAQQAEQSGLEGRVVPLAEVLNLDLNPQALSTTRASLPALQALFAAHGLSLTRRAWLRSEIEALAQGKLDLTTVHGLQWAPLSERALGLSPRQFWALVATKPERTLH*
Ga0137394_1090427713300012922Vadose Zone SoilQRIEQEQTFASLFHDPHPSLGGRTYAQAMAEELRRQGAFKTAQPWQVLSLGLQLEIPGTTVETNIAQVQQPESYRAVVVNEIGAQLGFSEGKNSGLLRLVADAASALAPGGVLIVADFGDARDEATANGIAFTDLAQQAGQSGLEGRVVPLAEVLNLDLNSQALSTTRASLPALQALFAAHGLSLTRRAWLRSEIEALAEGKLDLTTVHGLQWAPLSERALGLSPRQFWALVATKPERTLH*
Ga0137394_1095487413300012922Vadose Zone SoilLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQRMAEEATFASLFQDPHPSLGGQTYAQALASELRRRGAFAGSPPFQVLSLGIALELPDARVETNIARVQEPESYQAVVANEIGPQLGYSEGKNSGLLRLVSDAASALAPGGVMIVADYGHPKADATPQSVGFGDLQKQAAQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAANGLSLP
Ga0137394_1097187413300012922Vadose Zone SoilLEEEPTFAELFRDPHPSLGGRTYAQALGEELRRRGVFQGQGKAPLQVLSLGLGLELAGAQVETNIANMQQPDWYRAIVLNEIGTQLGFSEGKNSGLLRLVSDAAVALAPGGVLIVADFGDPKALATPQSISFADVHKQSTDSGLQGRVVPLAEALNLDANVQALSTTRASFPALQALFAAHGLALTRRAWLRGEIEKLAEGKLDLGIVHGLQWAPLSERALGLSPRQFWALVAA
Ga0137359_1015917913300012923Vadose Zone SoilARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPPQKMAGEATFASLFQDPHPSLGGQTYAQALASELRRRGAFAGSPPFQVLSVGIGLDLPEAQMETNIARVQETESYQAVVADEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKAEATPQSVGFGDLQKQAEKSGLLGARVVPLAEVLNLDVNVQALSTMRASLPALQALFAAHGLSLPRRAWLRSEVESLAGGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137359_1019455313300012923Vadose Zone SoilDVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKVAEETTFASLFQDPHPSLEGRTYAQALANELRRRGAFAGAPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFSEGKNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLSARVVPLAEVLDLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLTTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137416_1055309013300012927Vadose Zone SoilMAEEATFASLFQDPHPSLEGRTYAQALANELRRRGAFAGSPPFQVLSLGLALELPEARVETNIARVQEPESYRAVVANEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQRQATQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVERLAEGKLDLSTVLGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137404_1047816913300012929Vadose Zone SoilQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRAGQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGIALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT*
Ga0137404_1050658323300012929Vadose Zone SoilAEVAALKILAKAVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQKMAEEATFASLFQDPHPSLEGRTYAQAMANELRRRGAFAGSPPFQILSLGLALELPDARVETNIARVQEPESDQAVVANEIVRQLGFSEGKNSGLLRLVSDAAAALVPGGVLIVADYGDPKTDATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDLNMQALSTTRASLPALQGLFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0134087_1001825213300012977Grasslands SoilQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFNDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0134078_1005318613300014157Grasslands SoilKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFNDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLAEGKLDLTSVHGLQWAPLSERALGLSPKLFWALVATKPERTLH*
Ga0137411_110002033300015052Vadose Zone SoilVPRAAVFELGERVTLYTRAGVALELDPLTSRLFLRCDGERSLGQVLGDAGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPQQRMAEEATFASLFQDPHPSLGGQTYAQALASELRRRGAFAGSPPFQVLSLGIALELPDARVETNIARVQEPESYQAVVANEIGPQLGYSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQKQAAQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAANGLSLPRRAWLRSEVEKLAAGKLDLSTVHGLQWAPLSERALGLSPRQFWALIATKPGRTLH*
Ga0137420_129347113300015054Vadose Zone SoilVQLNPAAESTMPYPEIPDARLYAAGDPAPSQKMAEEATFASLFQEPHQSLEGRTYAQAMASELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFCEGRNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSSVHGLQWAPLSERALGLSPRHFWALVATKPGRTLH*
Ga0137418_1005320643300015241Vadose Zone SoilGLDPLTSRLFLRCDGERSLGQVLGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDARLYAAGDPAPSQKMAEEATFASLFQEPHPSLEGRTYAQAMASELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFCEGRNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSSVHGLQWAPLSERALGLSPRHFWALVATKPGRTLH*
Ga0137418_1005427913300015241Vadose Zone SoilTSRLFLRCDGERSLGQVLGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKMAEEATFASLFQDPHPSLEGRTYAQALANELRRRGAFAGSPPFQVLSLGLALELPEARVETNIARVQEPESYRAVVANEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQRQATQSGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVERLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137418_1038785713300015241Vadose Zone SoilGLDPLTSRLFLRCDGERSLGQVLGDAGPQVLGALLGLARAEVAALKILAKAVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKMAEEATFASLFQDPHPSLEGRTYAQAMANELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGRQLGFSEGKNSGLLRLVSDAAAALVPGGVLIVADYGDPKTDATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDLNMQALSTTRASLPALQGLFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137403_1006747013300015264Vadose Zone SoilAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGGPAPPQKMAGEATFASLFQDPHPSLGGQTYAQALASELRRRGAFAGSPPFQVLSVGIGLDLPEAQMETNIARVQEPESYQAVVADEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKAEATPQSVGFGDLQKQAEKSGLLGARVVPLAEVLNLDVNVQALSTMRASLPALQALFAAHGLSLPRRAWLRSEVESLAGGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137403_1007103843300015264Vadose Zone SoilAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKVAEETTFASLFQDPHPSLEGRTYAQALANELRRRGAFAGAPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFSEGKNSGLLRLVSDAAAALAPGGVLIVADYGDPKADATPLSVGFGDLQRQAAQCGLSARVVPLAEVLDLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLATVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0137403_1024657023300015264Vadose Zone SoilMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRAGQNGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGIALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPLSERALGLSPKEFWALVATKSPRTVH
Ga0137403_1046931413300015264Vadose Zone SoilCDGERSLGQVLGDAGPQALGALLGLARAEVAALKILAKAVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKMAEEATFASLFQDPHPSLEGRTYAQAMANELRRRGAFAGSPPFQILSLGLALELPDARVETNIARVQEPESYQAVVANEIGRQLGFSEGKNSGLLRLVSDAAAALVPGGVLIVADYGDPKTDATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDLNMQALSTTRASLPALQGLFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH*
Ga0187825_1016850713300017930Freshwater SedimentATGGPAPEQTIADEPTFASLFADPHPSLGGRTYSQSMAVELRRRGAFAGAPPFRVLSLGLDLEIPEARVETNIAQVQRPESYGAVVVNEVGAQLGFSEGRQSGLLRLVSDAATALAPGGVLIVADCGDPRADATPQSVGFSELQTQATRSGLAGRVVPLAEVLNLDLNPQALSTTRASFPALQALFASHAMSLTRRAWLRSEIEKLAEGKLDLGSVHGLQWAPLSERALGLSPKRFWALVAVKPQRTVH
Ga0066667_1077989113300018433Grasslands SoilPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVA
Ga0066667_1093916313300018433Grasslands SoilPDVRLYAVGGPSPQSKLEEEPTFASLFATPHPSLGGRTYAQAMGDELRRRRAFDGVSGPPQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLVVADYGDPKAEATPNSVSFKDVQTQAARSGLTGRIVPLSEVLGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVA
Ga0207684_1013202833300025910Corn, Switchgrass And Miscanthus RhizosphereVPRGAVFQRKDEDVTIYTRAGKPMPLDAVTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKIEQESTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDDVPPPFQVLSLGLAMELPQARVETNIARVQQAGAYRAVILNEIGSQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEARPDSIGFADLLARAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANDLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALGLAPKDFWALVAVKPPRTIH
Ga0207707_1026944613300025912Corn RhizosphereQGGIQLNPAAESTMPYPEIPDARLYATGGPAPEQTIADEPTFASLFADPHPSLGGRTYSQAMAAELRRRGAFAGAPPFRVLSLGLDLEIPEALVETNVAQVQQPESYRAVVVNEVGAQLGFSEGHQSGLLRLVSDAAIALAPGGVLIVADCGDPKADATPQSVGFSELQTEATRTGLAGRVVPLSEVLNLDLNPQALSTTRASFPALQALFASHGMSLTRRAWLRSEIEQLAEGKLDLGSVHGLQWAPLSERALGLSPKRFWALVAVKPQRTVH
Ga0207640_1043192813300025981Corn RhizosphereLYAAGGPAPQQKIDDEPTFAGLFASPHPSLGGRTYGQALADELRRRGAFDRPGPFRMLSLGLPLQIEGATVETNIANVQEPESYQAVVVNEIGAQLGFSDGKNSGLIRLVTDAANALAPGGVLIVGDYGDPKADAAPGSVGFADLLERATQSGLTARVVPLAEVLGLDLNAQALSTTRASLPALRALFAKHGLSLTRRAWLRSEIEQLADGKLDLANVHGLQWAPLSERALGLSPKSYWALVASKPGRTLH
Ga0209027_108306213300026300Grasslands SoilGKALPLDAVTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALATGGVLIVADHGDPKAAATPDSISFADLLTRAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERAMGLSPKEFWALVATKSPRTVH
Ga0209238_110219423300026301Grasslands SoilLFSAPHPSLGGRMYVQAMADELRRRGAFDGAPSPFQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQAMFAANGLALTHRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERALGLSPKEFWALVATKSPRTVH
Ga0209688_100877713300026305SoilTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0209153_101131143300026312SoilSLGGRTYAQAMGDELRRRRAFDGVSGPPQVLSLGLAIELPDARVETNIARVQQPESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLVVADYGDPKAEATPNSVSFKDVQTQAARSGLTGRIVPLSEVLGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIDKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0209686_104915213300026315SoilRADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVYGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0209154_101725643300026317SoilAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0209471_111246713300026318SoilVAALKILAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGPAPTAKIDEDATFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGAPSPFQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQTAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERAMGLSPKEFWALVATKSPRTVH
Ga0209687_101371413300026322SoilAESTMPYPEIPDVRLYAVGGPAPTAKIDEDATFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPEARVETNIARVQQAESYRAVILNEIGAQLGFSDGKNSGLLRLVADAAEALAPGGVLIVADYGHPKAAARPDSVSFADLMTQTAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLAQGKLDLSTVHGLQWAPLSERAMGLSPKEFWALVATKSPRTVH
Ga0209473_120039613300026330SoilVSQGGVQLNAAAESTMPYPEIPDVRLYAAGGPPPQPKLEEESTFASLFATPHPSLGGRTYAQAMADELRRRGAFEGIPAPHQVLCLGLAMELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRTVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLAEGKLDLSTVHGLQWAP
Ga0209804_116351313300026335SoilLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0209808_1003692103300026523SoilPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0209160_119777513300026532SoilVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALATGGVLIVADHGDPKAAATPDSISFADLLTRAAQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVHGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0209156_1020070713300026547SoilGDRPLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGTFDGVAPPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAEATPDSISFADLLTRAAQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVYGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQR
Ga0209161_1009816933300026548SoilLEALLRLARADVAALKVLAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAVGGAPPQPKLEEEPTFASLFATPHPSLGGRTYAQAMAHELRRRGAFDGVPAPHQVLSLGLAMELPDARVETNIARVQQAESYRAAVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRIVPLSEALCLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLVEGKLDLASVHGLQWAPLSERALGLSPKQFWALVATKPERTLH
Ga0209474_1010654133300026550SoilGVIVYTRAGTALPLDEMTSRLFLRCDGERSLGQVLGDAGYAALEALLRLARADVAALKILAKPVSQGGVQLNAAAESTMPYPEIPDVRLYAAGGPPPQPKLEEESTFASLFATPHPSLGGRTYAQAMADELRRRGAFEGIPAPHQVLCLGLAMELPDARVETNIARVQQAESYRAVVVNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGDPKAEATPNSVSFKDLQAQAARSGLTGRTVPLSEALGLDVNSQALSTTRASLPALQALFASHGLALARRAWLRSEIEKLAEGKLDLSTVHGLQWAPLSERALGLSPKQFWALVATKAERTLH
Ga0209577_1016548833300026552SoilTSSLFLRCNGDRSLGQVLGDAGPSALEPLLRLARADVAALKILAKPVSQGGLQLNAAAESTMPYPEIPDVRLYAVGGPAPQAKMEEDSTFASLFAAPHPSLGGRTYAQAMADELRRRGAFDGVPSPFQVLSLGLAMELPQARVETNIARVQQAESYRAVILNEVGAQLGFSDGKNSGLLRLVTDAAEALAPGGVLIVADHGDPKAGATPDSISFADLLTRATQSGLRGRVVPLWEILDLDVNAQALSTTRASFPALQALFAANGLALTRRAWLRSEIEQLARGKLDLSTVYGLQWAPMSERALSLVPKDFWALVAVKPPRTIHQRAHGT
Ga0179587_1014412013300026557Vadose Zone SoilAGFPRIPRAAVFERGDTVTLYTRAGTALGLDPLTSGLFLRCDGERSLGQVLGDAGPQALGAFLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEINDVRLYAAGAPAPQQRMAEEATFASLFQAPHPSLEGRTYAQAMANELRRRGAFAGSPPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGMQLGFSEGKNSGLLRLVSDAASALVPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQCGLGARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAEGKLDLSTVHGLQWAPLSERALGLSPRQFWALVATKPGRTLH
Ga0209588_107868813300027671Vadose Zone SoilLFLRCDGERSLGQVLGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYASGAPAPQPKMAEEATFASLFQDPHPSLAGRTYAQALADELRRRGAFEGSRPFQVLSLGLALELPDARVETNIARVQEPESYQAVVANEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQSGLGARVVPLAEVLNLDVNAQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAAGKLDLSTVQGLQWAPLSERALGLSPRQFWALVATKPGRAS
Ga0209488_1024093113300027903Vadose Zone SoilLRLARADVAALKVLAKPVSQGGVQLNPAAESTMPYPEISDPRLYAAGAPPPQQRIEQELTFASLFHDPHPSLGGRTYAQAMAEELRRQGAFKTPQPWRVLSLGLQLEIPGTTVETNIAHVQQPESYRAVVVNEIGAELGFSEGKSSGLLRLVADAASALAPGGVLIVADFGDAKDEATPNGIAFTDLAQQAGQSGLEGRVVPLAEVLNLDLNPQALSTTRASLPALQALFAAHGLSLTRRAWLRSEIEALADGGLDLTAVHGLQWAPLSERALGLSPRQFWALVATKPERTLH
Ga0137415_1045478623300028536Vadose Zone SoilTSRLFLRCDGERSLGQVLGDSGPQALGALLRLARADVAALKILAKPVSQGGVQLNPAAESTMPYPEIPDVRLYAAGAPAPQQKMAEEATFASLFQDPHPSLEGRTYAQALANELRRRGAFAGSPPFQVLSLGLALELPEARVETNIARVQEPESYRAVVANEIGTQLGFSEGKNSGLLRLVSDAASALAPGGVLIVADYGDPKADATPQSVGFGDLQRQAAQSGLGARVVPLAEVLNLDVNAQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEKLAAGKLDLSTVQGLQWAPLSERALGLSPRQFWALVATKPGRAS
Ga0073996_1013422313300030998SoilDPHPSLGGRTYAQAMADELRRRGAFQGTTPFQVLSLGLAMQVPDAQVETNVARAQQPESYHAVVVNEIGAQLGFSDGKNSGLLRLVADAALALAPGGVLIVADYGHPKADAVPQGVSFADLQKQAAQSGLSARVVPLAEVLNLDVNVQALSTTRASLPALQALFAAHGLSLPRRAWLRSEVEQLAAGKLDLSSVQGLQWAPLSERALGLSPRRFWALVATKPERTL
Ga0307469_1071604823300031720Hardwood Forest SoilQLKMEEESTFASLFAAPHPSLGGRTYAQAMTDALRRRGAFDGVPAPFQILSLGLTMQQPDARVETNIARVQQPESYRAVVLNEIGAQLGFSDGKNSGLLRLVADAAGALAPGGVLIVADYGASKGEARPDSVSFADLLAQAEQSGLRGRVVPLWEVLDLDVNAQALSTTRASFPALQALFAANGVALTRRAWLRSEIEQIADGKLDLSNVHGLQWAPLSERALGLSPKDFCALVATKS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice