NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F059846


Overview

Basic Information
Family ID F059846
Family Type Metagenome
Number of Sequences 133
Average Sequence Length 161 residues
Representative Sequence PPPNRPARYRPTDPQTLLAQLATLQGEALDRLGRELRNMSRPGDPVTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAGERATMKVRARGGMPADAPSWLEPASNDAPNVTALLIDETQLVMTSGEGETIAGLWTSHPLIVLLARRALQTMS
Number of Associated Samples 108
Number of Associated Scaffolds 133

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 6.02 %
% of genes near scaffold ends (potentially truncated) 91.73 %
% of genes from short scaffolds (< 2000 bps) 81.20 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (29.323 % of family members)
Environment Ontology (ENVO) Unclassified (40.602 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (48.120 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.02%, β-sheet: 19.79%, Coil/Unstructured: 42.19%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
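
The note above applies a simple cutoff to the pTM-score. A minimal Python sketch of that check, assuming the comparison is strict (pTM < 0.7) as written; the function and constant names are hypothetical and not part of NMPFamsDB.

```python
# Cutoff quoted in the note above; treated here as a strict "<" comparison.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# The pTM-score reported for family F059846 is 0.66.
family_is_low_confidence = is_low_confidence(0.66)
```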



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 133 Family Scaffolds
PF00561 | Abhydrolase_1 | 32.33
PF12697 | Abhydrolase_6 | 19.55
PF00246 | Peptidase_M14 | 5.26
PF00440 | TetR_N | 3.76
PF05362 | Lon_C | 2.26
PF01564 | Spermine_synth | 0.75
PF04389 | Peptidase_M28 | 0.75
PF00069 | Pkinase | 0.75
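
The frequencies above are percentages of the 133 family scaffolds, so approximate scaffold counts can be recovered from them. A minimal Python sketch, assuming each percentage is a whole-number count divided by 133 and rounded to two decimals; `percent_to_count` is a hypothetical helper, not an NMPFamsDB function.

```python
# Total number of family scaffolds, taken from the table header.
TOTAL_SCAFFOLDS = 133

# Percent frequencies copied from the Pfam table (subset for illustration).
pfam_percent = {
    "PF00561": 32.33,
    "PF12697": 19.55,
    "PF00246": 5.26,
    "PF00440": 3.76,
    "PF05362": 2.26,
    "PF01564": 0.75,
}


def percent_to_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Convert a percentage of `total` to the nearest whole number of scaffolds."""
    return round(percent * total / 100)


pfam_counts = {pfam_id: percent_to_count(p) for pfam_id, p in pfam_percent.items()}
```

Under that assumption, the most frequent neighbor, PF00561, corresponds to about 43 of the 133 scaffolds.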
PF00069Pkinase 0.75

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 133 Family Scaffolds
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 3.01
COG0466 | ATP-dependent Lon protease, bacterial type | Posttranslational modification, protein turnover, chaperones [O] | 2.26
COG1067 | Predicted ATP-dependent protease | Posttranslational modification, protein turnover, chaperones [O] | 2.26
COG1750 | Predicted archaeal serine protease, S18 family | General function prediction only [R] | 2.26
COG3480 | Predicted secreted protein YlbL, contains PDZ domain | Signal transduction mechanisms [T] | 2.26



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002558|JGI25385J37094_10021512 | All Organisms → cellular organisms → Bacteria | 2301
3300002560|JGI25383J37093_10028163 | All Organisms → cellular organisms → Bacteria | 1883
3300004047|Ga0055499_10061469 | All Organisms → cellular organisms → Bacteria | 626
3300005166|Ga0066674_10408496 | All Organisms → cellular organisms → Bacteria | 628
3300005172|Ga0066683_10098223 | All Organisms → cellular organisms → Bacteria | 1775
3300005176|Ga0066679_10042577 | All Organisms → cellular organisms → Bacteria | 2574
3300005180|Ga0066685_10104665 | All Organisms → cellular organisms → Bacteria | 1893
3300005180|Ga0066685_10210585 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1335
3300005181|Ga0066678_10972102 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Acidovorax → unclassified Acidovorax → Acidovorax sp. CF316 | 552
3300005345|Ga0070692_11171499 | All Organisms → cellular organisms → Bacteria | 546
3300005440|Ga0070705_100530485 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 899
3300005447|Ga0066689_10409622 | All Organisms → cellular organisms → Bacteria | 848
3300005447|Ga0066689_10886385 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Roseomonas → Roseomonas cervicalis | 552
3300005451|Ga0066681_10239306 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1098
3300005540|Ga0066697_10512219 | All Organisms → cellular organisms → Bacteria | 679
3300005549|Ga0070704_100282215 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1376
3300005549|Ga0070704_101507861 | All Organisms → cellular organisms → Bacteria | 618
3300005553|Ga0066695_10579651 | All Organisms → cellular organisms → Bacteria | 677
3300005554|Ga0066661_10408019 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 832
3300005556|Ga0066707_10178537 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1360
3300005558|Ga0066698_10018246 | All Organisms → cellular organisms → Bacteria | 4055
3300005559|Ga0066700_11087913 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Roseomonas → Roseomonas cervicalis | 523
3300005615|Ga0070702_100345586 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1046
3300006032|Ga0066696_10192720 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1296
3300006046|Ga0066652_101459943 | All Organisms → cellular organisms → Bacteria | 637
3300006791|Ga0066653_10030309 | All Organisms → cellular organisms → Bacteria | 2124
3300006791|Ga0066653_10406819 | All Organisms → cellular organisms → Bacteria | 696
3300006797|Ga0066659_10252021 | All Organisms → cellular organisms → Bacteria | 1322
3300006904|Ga0075424_101259614 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 787
3300006914|Ga0075436_100939304 | All Organisms → cellular organisms → Bacteria | 648
3300007004|Ga0079218_12547937 | All Organisms → cellular organisms → Bacteria | 606
3300007076|Ga0075435_100009671 | All Organisms → cellular organisms → Bacteria | 7001
3300007076|Ga0075435_100534390 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1015
3300007076|Ga0075435_101051342 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 711
3300007258|Ga0099793_10429244 | All Organisms → cellular organisms → Bacteria | 652
3300009012|Ga0066710_100862308 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1392
3300009012|Ga0066710_103228093 | All Organisms → cellular organisms → Bacteria | 625
3300009088|Ga0099830_10319137 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1244
3300009137|Ga0066709_100096095 | All Organisms → cellular organisms → Bacteria | 3608
3300009143|Ga0099792_10560799 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 723
3300009597|Ga0105259_1080464 | All Organisms → cellular organisms → Bacteria | 755
3300009609|Ga0105347_1077962 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1223
3300010301|Ga0134070_10017097 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 2333
3300010301|Ga0134070_10298873 | All Organisms → cellular organisms → Bacteria | 613
3300010323|Ga0134086_10395791 | All Organisms → cellular organisms → Bacteria | 554
3300010329|Ga0134111_10064090 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1359
3300010333|Ga0134080_10214900 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 838
3300010336|Ga0134071_10225435 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 929
3300010362|Ga0126377_12553385 | All Organisms → cellular organisms → Bacteria | 586
3300010403|Ga0134123_13434830 | All Organisms → cellular organisms → Bacteria | 512
3300011443|Ga0137457_1015481 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 1924
3300011443|Ga0137457_1088993 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 965
3300011445|Ga0137427_10147007 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 968
3300012173|Ga0137327_1068350 | All Organisms → cellular organisms → Bacteria | 786
3300012198|Ga0137364_10294835 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1204
3300012200|Ga0137382_10003086 | All Organisms → cellular organisms → Bacteria | 7763
3300012200|Ga0137382_10068455 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 2269
3300012200|Ga0137382_10303151 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1115
3300012200|Ga0137382_11065159 | All Organisms → cellular organisms → Bacteria | 579
3300012201|Ga0137365_10198246 | All Organisms → cellular organisms → Bacteria | 1501
3300012203|Ga0137399_10212548 | All Organisms → cellular organisms → Bacteria | 1575
3300012204|Ga0137374_10452684 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1007
3300012208|Ga0137376_10235574 | All Organisms → cellular organisms → Bacteria | 1587
3300012210|Ga0137378_11622540 | All Organisms → cellular organisms → Bacteria | 555
3300012285|Ga0137370_10000671 | All Organisms → cellular organisms → Bacteria | 13405
3300012285|Ga0137370_10961124 | All Organisms → cellular organisms → Bacteria | 527
3300012350|Ga0137372_10176944 | All Organisms → cellular organisms → Bacteria | 1724
3300012351|Ga0137386_10300292 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1156
3300012351|Ga0137386_10505918 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 870
3300012354|Ga0137366_10958996 | All Organisms → cellular organisms → Bacteria | 598
3300012356|Ga0137371_10014913 | All Organisms → cellular organisms → Bacteria | 5973
3300012357|Ga0137384_10055231 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 3286
3300012359|Ga0137385_10824076 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 770
3300012360|Ga0137375_10044875 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 4877
3300012362|Ga0137361_10214443 | All Organisms → cellular organisms → Bacteria | 1745
3300012362|Ga0137361_11926731 | All Organisms → cellular organisms → Bacteria | 507
3300012582|Ga0137358_10296084 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1099
3300012685|Ga0137397_10024168 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia | 4297
3300012685|Ga0137397_10025397 | All Organisms → cellular organisms → Bacteria | 4194
3300012918|Ga0137396_10139858 | All Organisms → cellular organisms → Bacteria | 1757
3300012918|Ga0137396_10190018 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1508
3300012922|Ga0137394_11011326 | All Organisms → cellular organisms → Bacteria | 690
3300012925|Ga0137419_10324079 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1185
3300012927|Ga0137416_10954912 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 764
3300012944|Ga0137410_11604501 | All Organisms → cellular organisms → Bacteria | 570
3300012977|Ga0134087_10109987 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 1163
3300014154|Ga0134075_10300454 | All Organisms → cellular organisms → Bacteria | 699
3300014157|Ga0134078_10335937 | All Organisms → cellular organisms → Bacteria | 660
3300014157|Ga0134078_10596427 | All Organisms → cellular organisms → Bacteria | 528
3300015358|Ga0134089_10235049 | All Organisms → cellular organisms → Bacteria | 745
3300015358|Ga0134089_10263953 | All Organisms → cellular organisms → Bacteria | 707
3300015374|Ga0132255_102902552 | All Organisms → cellular organisms → Bacteria | 732
3300017654|Ga0134069_1094589 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 971
3300017656|Ga0134112_10366224 | All Organisms → cellular organisms → Bacteria | 590
3300018075|Ga0184632_10175497 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 945
3300018079|Ga0184627_10212162 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1022
3300018089|Ga0187774_11239611 | All Organisms → cellular organisms → Bacteria | 537
3300018482|Ga0066669_10525874 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1027
3300018482|Ga0066669_11804554 | All Organisms → cellular organisms → Bacteria | 565
3300020170|Ga0179594_10178070 | All Organisms → cellular organisms → Bacteria | 793
3300021418|Ga0193695_1042234 | All Organisms → cellular organisms → Bacteria | 988
3300024325|Ga0247678_1021729 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 983
3300025942|Ga0207689_10057899 | All Organisms → cellular organisms → Bacteria | 3187
3300026297|Ga0209237_1059104 | All Organisms → cellular organisms → Bacteria | 1871
3300026297|Ga0209237_1076036 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1559
3300026298|Ga0209236_1019584 | All Organisms → cellular organisms → Bacteria | 3804
3300026306|Ga0209468_1001950 | All Organisms → cellular organisms → Bacteria | 8830
3300026306|Ga0209468_1003716 | All Organisms → cellular organisms → Bacteria | 6094
3300026323|Ga0209472_1191722 | All Organisms → cellular organisms → Bacteria | 713
3300026324|Ga0209470_1332438 | All Organisms → cellular organisms → Bacteria | 549
3300026329|Ga0209375_1001004 | All Organisms → cellular organisms → Bacteria | 22264
3300026343|Ga0209159_1232304 | All Organisms → cellular organisms → Bacteria | 565
3300026536|Ga0209058_1022802 | All Organisms → cellular organisms → Bacteria | 4053
3300026540|Ga0209376_1086756 | All Organisms → cellular organisms → Bacteria | 1645
3300026550|Ga0209474_10158257 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1470
3300026552|Ga0209577_10241276 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1367
3300026555|Ga0179593_1043906 | All Organisms → cellular organisms → Bacteria | 2810
3300027886|Ga0209486_11315662 | All Organisms → cellular organisms → Bacteria | 501
3300027903|Ga0209488_10505858 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 885
3300027909|Ga0209382_10231670 | All Organisms → cellular organisms → Bacteria | 2104
3300027909|Ga0209382_10785150 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1015
3300028380|Ga0268265_10623169 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1034
3300028536|Ga0137415_10079168 | All Organisms → cellular organisms → Bacteria | 3146
3300028536|Ga0137415_10614228 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 898
3300028719|Ga0307301_10080117 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1024
3300028784|Ga0307282_10246248 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 858
3300028878|Ga0307278_10511969 | All Organisms → cellular organisms → Bacteria | 524
3300028884|Ga0307308_10021284 | All Organisms → cellular organisms → Bacteria | 2983
3300031720|Ga0307469_10166713 | All Organisms → cellular organisms → Bacteria | 1674
3300031720|Ga0307469_10532722 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1036
3300031908|Ga0310900_10782991 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 770
3300032180|Ga0307471_100395032 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1509
3300034164|Ga0364940_0036957 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1286
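
Each scaffold identifier above appears to join a 10-digit number, which matches the Taxon OID column of the Associated Samples table, to a scaffold name with a `|` character. A minimal Python sketch that splits a few of these rows and flags short scaffolds, assuming that reading of the identifier holds; `Scaffold` and `parse_scaffold` are hypothetical names, and the 2000 bp cutoff is the short-scaffold threshold quoted in the Quality Assessment.

```python
from typing import NamedTuple


class Scaffold(NamedTuple):
    taxon_oid: str      # assumed: the part before "|" is the IMG/M Taxon OID
    scaffold_name: str  # the part after "|"
    length_bp: int


def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split 'taxon_oid|scaffold_name' into its two parts."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return Scaffold(taxon_oid, scaffold_name, length_bp)


# First three rows of the Associated Scaffolds table.
rows = [
    ("3300002558|JGI25385J37094_10021512", 2301),
    ("3300002560|JGI25383J37093_10028163", 1883),
    ("3300004047|Ga0055499_10061469", 626),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]

# Scaffolds under 2000 bp count as "short" in the Quality Assessment.
short_scaffolds = [s for s in scaffolds if s.length_bp < 2000]
```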




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 29.32%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 22.56%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 10.53%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.52%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.26%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 4.51%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.76%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.76%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.26%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.50%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.50%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.75%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.75%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.75%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.75%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.75%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.75%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.75%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.75%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300004047Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009597Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299EnvironmentalOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300010301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015EnvironmentalOpen in IMG/M
3300010323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011443Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2EnvironmentalOpen in IMG/M
3300011445Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2EnvironmentalOpen in IMG/M
3300012173Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT517_2EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300014157Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017654Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018089Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2EnvironmentalOpen in IMG/M
3300024325Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK19EnvironmentalOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028719Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1002151243300002558Grasslands SoilTLQGEALDRLGRELRDISRPGDPVTREVAGSRAVANLLMQLVARAERSVEGVMALELWRPTLPAWRRAGERATMKVRARGGMPADAPSWLEPASEDAPHVTALLIDATQLVMTSGEGETIAGLWTSHPLIVLLARRALQTMS*
JGI25383J37093_1002816333300002560Grasslands SoilLVTRGAASRTPSPARPVRYRPTDPQALLAQLATLQGEALDRLGRELRGLSLPVDPVTKEIAGNRAVANLILQLVARAERRVDGVMAAELWRPTLPAWRRAGERAQLEVALAGELPADAPAWLKQASENAPKVTMLIMDESQLVLTSGEGDAIAGLWSSHPLIVMIGRRALQTMS*
Ga0055499_1006146923300004047Natural And Restored WetlandsLDRLTRDLRGLARPGDPVTREVAGSRAVANLVMQLVARAERRVEGIVASELWRPTLPAWRRAGERATLRIVMRGELPSETPAWLSAAADDAPSATFLVIDESQLIVTSGEGEAIAGLWTSHPLIVMLAARALQTIS*
Ga0066674_1040849623300005166SoilGALEGLVSRGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0066683_1009822333300005172SoilLVSRGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0066679_1004257743300005176SoilEGLVKRGAAARTPAPARPVRYRPTDPQALLAQLATLQGEALDRLGQELRGLSLPVDPITREIAGSRAVANLVVQLVARAERSVQGVMAAELWRPTLPAWRRAGERAQLDVALLGEPPPDAPPWLKHAPENAPNVTMLIIDESQLVLTSGAGDGIAGLWSSHPLILLIGRRALQTLT*
Ga0066685_1010466513300005180SoilLITQLATLQGEALDRLGRALRDFSRPGEPVTREVAGIRAVANLVMQLVARAERSVKGIMAVELWRPTLPAWRRAAERARMTVRVRGGMPADAPAWLEPATDDAPNATILLIDEAQLVMTTGEGEAIAGFWTSHPLIVVLAQRAL*
Ga0066685_1021058513300005180SoilGAASRTPPPDRPARYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVGGSRAVSNLVMQLVARAEGSVEGIMALELWRPTLPAWRRASERAQLKVRMRGGMPDDAPSWLEPAAADAPAATVLVIDDAQVVVTSGDGDAITGLWTSHPLILLLARSALQTLS*
Ga0066678_1097210213300005181SoilVSRGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0070692_1117149913300005345Corn, Switchgrass And Miscanthus RhizosphereAASRTPASVRPARYRPIDPHALLAQLATLQGEALDRLGRELKGLSQQGDPVTREVAGARAVANLIMQLVARAERRVEGIVAVELWRPTLPAWRRAGERAQMDVKVRGEPPSDAPAWLGSAAADAPAATILVTDEAQLVVTAGDGEAIAGLWTSHPLIVMLARRALQTVS*
Ga0070705_10053048523300005440Corn, Switchgrass And Miscanthus RhizosphereGLVTRGAASRTPPPARPARYRPTDPHGLLAQLATAQGEALDRLGRQLRDLSRPGEPFTREVAGSRAVANLVMQLVARARTSVEGVLAAELWRPTLPAWRRAGERAPLDVRMTGDVPADAPPWLKPAAEDALNLRATILVIDESQLILTSGDGESIAGVWSSHPLIVMIARRALQTLP*
Ga0066689_1040962213300005447SoilTRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0066689_1088638513300005447SoilRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPATREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLMVLLARRALQSVL*
Ga0066681_1023930623300005451SoilTRTPPPNRPARYRPIDPQALLALLATRQGEALDRLGQEVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMSVRARGGMPPDAPSWLEPASEGGLEATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRTLS*
Ga0066697_1051221913300005540SoilYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVAGSRAVANLMMQLVARAERSVEGIMALELWRPTLPAWRRASERAQLKVRLRGGMPADAPSWLEPAAADAPAATVLVIDEAHVVITSGEGDAVTGLWTSHPLILLLARRALRTFS*
Ga0070704_10028221513300005549Corn, Switchgrass And Miscanthus RhizosphereRGAATRTAPSVRPARYRPIDPQALLAQLAALQGEALDRLGRELKGLAQKGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADAPAWLSSAADDAPLATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQPVV*
Ga0070704_10150786113300005549Corn, Switchgrass And Miscanthus RhizospherePSVRPARYRPIDPQALLAQLAALQGEALDRLGHELKGLARPGDPVTREVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERADMSVRMRGELPPDTPSWLSAAASDAPSATVLVIDESQLIVTSGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0066695_1057965113300005553SoilRTPPPSRPARYRPTDPQGLLAHLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQLIVTSGDGEAIAGVWSSHPLIVMLGRGALPTLA*
Ga0066661_1040801923300005554SoilPSRPARYRPTDPQGLLAQLAMKQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQLIVTSGDGEAIAGVWSSHPLIVMLGRSALPTLA*
Ga0066707_1017853713300005556SoilQGEALDRLGRELRGLSLPVDPVTKEIVGNRAVANLILQLVARAERRVDGVMAAELWRPTLPAWRRAGERAQLDVALAGDLPVDAPSWLKQAPEDAPKVTMLIMDDSQLVLTSGEGDAIAGLWSSHPLIVMIGRRALRTMS*
Ga0066698_1001824633300005558SoilLEGLVSRGAATRTPPPERPARYRPADPQSLLAHLATLQGEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVEGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA*
Ga0066700_1108791313300005559SoilRANAYGALEGLVSRGAATRTPPPDRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPATREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLMVL
Ga0070702_10034558613300005615Corn, Switchgrass And Miscanthus RhizosphereRGAATRTAPSVRPARYRPIDPQALLAQLAALQGEALDRLGRELKGLAQRGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADAPAWLSSAADDAPLATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQPVL*
Ga0066696_1019272023300006032SoilPTDPQVLLAHLAALQGEALDRLGRELRDFSRPSEPITREVTGSRAVANLLMQLVARAEHRVQGVMAFELWRPTLPAWRRAGERASMDVRVRGEMPPDPPSWLEPAPANAPDATLLLVDEAHLLVASGAGDAIAGLWTSHPLLLMLALRALQTIG*
Ga0066652_10145994313300006046SoilARYRPTDPQVLLAHLAMLQGEALDRLGRELRDFSRPAEPITREVTGSRAVANLLMQLVARAERRVQGVMAFELWRPTLPAWRRAGERASMQVRLRGEMPSDPPSWLEAAPENAPDATLLLVDEAHLLVASGAGDAIAGLWTSHPLLLMLAQRALQTIE*
Ga0066653_1003030933300006791SoilSRGAATRTPPPNRPARYRPIDPQALLALLATRQGEALDRLERDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEGGLEATALLIDDAQLVMTSGDGESIAGLWTSHPLVVLLARRALRTLS*
Ga0066653_1040681913300006791SoilASRAPPPSRPARYRPTDPQGLLAHLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQVIVTSGDGEAIAGVWSSHPLIVMLGRSALPTLA*
Ga0066659_1025202113300006797SoilGLVSRGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0075424_10125961413300006904Populus RhizosphereAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRAGERAEIGVRVRGGMPTDAPAWLEPAGDDAPAATILLIDSAQLVMTTGEGDAIAGLWTSHPLIVLLAQRAL*
Ga0075436_10093930413300006914Populus RhizosphereTDPQALVAQLATLQGEALDRLSRDLRDMSRPGDPVTREVSGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRAGERAEISVRVRGGMPTDAPAWLEPAADDAPAATILLIDAAQLVMTTGEGDAIAGLWTSHPLIVLLAQRAF*
Ga0079218_1254793713300007004Agricultural SoilYSALEGLVTRGAATRTPPTIRPARYRPIDPQALMAHLAALQGEALDRLGRELRGLARPGDPVTKEVAGSRAVANLVMQLVARAERRVEGIVAAELWRPTLPAWRRAAERAQLDVRIRGDLPPDAPAWLTSASGDAPPPAATVLVVDESQLVITSGDGEAIGGLWTSHPLIVMLARRALS*
Ga0075435_10000967113300007076Populus RhizosphereALEGLVTRGAASRTPPPSRPARYRPTDPQGLLAQLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLLMQLVARANQSVAGVMAAELWRPTVPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEDAVMNRATVLVIDESQLILTSGEGEAIAGVWSSHPLIVMLGRRALPTLA
Ga0075435_10053439023300007076Populus RhizosphereRLGRDLRGISRPGEPVTREVSGARAVANLIMQLVARAERSVQGVVALELWRPTLPAWRRAGERAAIKVRVRGGLPADAPAWLEPAPDDAPNATVLLIDDAQLVMTSGEGEATAGLWSSHPLIVLLAQRALQTIA*
Ga0075435_10105134223300007076Populus RhizosphereRYRPTDPQSLLAHLATLQGEALDRLGRQLRDFSRPGEPVVREVAGSRAVANLVMQLVARAERGVEGVMTPELWRPTLPAWRRAAERAQLKVRMRGGMPEDAPSWLEPASDTLDATLLVIDEAHLVITSGDGDAVTGLWTSHPLILVLARRALQTLS*
Ga0099793_1042924423300007258Vadose Zone SoilARPVRYRPTDPQALLAQLATLQGEALDRLGRELRGLSRPADPVTREIGGHRAVGNLILQLVARAERGVQGVMAAELWRPTLPAWRRAGERAQIDVALAGELPADAPPWLKQARADAPQVTMLIMDESQLVLTSGEGDAIAGLWSSHPLIVMIGQRGLQTMS*
Ga0066710_10086230823300009012Grasslands SoilARYRPTDPQGLLAQLAMIQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPAWLKSVGEHAVTNRATVLVVDESQLIVTSGDGEAIAGVWSSHPLIVMLGRSALPTLA
Ga0066710_10322809323300009012Grasslands SoilLVTRGAASRTPSPARPVRYRPTDPQALLARLATLQGEALDRLGRELRGLSLPVDPVTKEIAGNRAVANLILQLVARAERRVDGVMAAELWRPTLPAWRRAGERAQLEVALAGELPADAPAWLKQASENAPKVTMLIMDESQLVMTSGEGDAIAGLWSSHPLIVMIGRRALQTMS
Ga0099830_1031913713300009088Vadose Zone SoilPPPNRPARYRPTDPQTLLAQLATLQGEALDRLGRELRNMSRPGDPVTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAGERATMKVRARGGMPADAPSWLEPASNDAPNVTALLIDETQLVMTSGEGETIAGLWTSHPLIVLLARRALQTMS*
Ga0066709_10009609533300009137Grasslands SoilDRLGRALRDFSRPGEPVTREVAGIRAVANLVMQLVARAERSVKGIMAVELWRPTLPAWRRAAERARMTVRVRGGMPADAPAWLEPATDDAPNATILLIDEAQLVMTTGEGEAIAGFWTSHPLIVVLAQRAL*
Ga0099792_1056079913300009143Vadose Zone SoilGEALDRLGRDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEDGLDATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRTPS*
Ga0105259_108046423300009597SoilVRPARYRPIDPQALLAQLAALQGAALDRLGRELKGLARPGEPVTREVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERAAMSVRMPGAPPPEAPSWLTSATDEAPSATMLVIDESQLIVTSGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0105347_107796213300009609SoilAALQGAALDRLGRELKGLARPGEPVTREVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERADMSVRMRGASPPDVPSWLTSATDEAPSATILVIDESQLIVASGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0134070_1001709743300010301Grasslands SoilRTPPPERPARYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVAGSRAVANLMMQLVARAERSVEGIMALELWRPTLPAWRRASERAQLKVRLRGGMPADAPSWLEPAAADAPAATVLVIDEAHVVITSGEGDAVTGLWTSHPLILLLARRALRTFS*
Ga0134070_1029887313300010301Grasslands SoilLEGLVTRGAASRTPPPSRPARYRPTDPQGLLAHLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQLIVTSGDGEAIAGVWSSHPLIVMLGRSALPTLA*
Ga0134086_1039579113300010323Grasslands SoilTPPPNRPARYRPIDPQALLALLATRQGEALDRLERDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMSVRARGGMPPDAPSWLEPASEGGLEATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRTLS*
Ga0134111_1006409023300010329Grasslands SoilAYSALEGLVSRGAATRTPPPERPARYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVAGSRAVANLMMQLVARAERSVEGIMALELWRPTLPAWRRATERAQLKVRLRGGMPADAPSWLEPAAADAPAATVLVIDEAHVVITAGEGDAVTGLWTSHPLILLLARRALRTFS*
Ga0134080_1021490013300010333Grasslands SoilRTPPPQRPARYRPTDPQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGSRAVANLIMQLVARAERSVQGIMALELWRPTLPAWRRAGDRAEVRVRVRGGMPADAPAWLEPAADDAPAATILLIDAAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0134071_1022543523300010336Grasslands SoilGALEGLVSRGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERGVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGEAIAGLWTSHPLMVLLARRALQSVL*
Ga0126377_1255338513300010362Tropical Forest SoilPTDPQTLLAHLAMLQGEALDRLGRELRGASRSSDPVTREVAGSRAVANLLMQLVARAERSVQGVMALELWRPTLPAWRRAGERATLRVRMRGGMPADAPAWLEPAGPEAPDATLLLIDDTQLLLISGEGDALTGLWSSHPLILLVAQRAMATIE*
Ga0134123_1343483013300010403Terrestrial SoilAASRTPPPARPARYRPTDPHGLLAQLATAQGEALDRLGRQLRDLSRPGEPFTREVAGSRAVANLVMQLVARASTSVEGVLAAELWRPTLPAWRRAGERAQLDVRMTGDVPADAPPWLKPAAEDALNLRATILVIDESQLILTSGDGESIAGVWSSHPLIVMIARRALQTL
Ga0137457_101548133300011443SoilEALDRLGRELRDMSRPGDPVTKEVAGSRAVANLIMQLVARAERRVEGVMAIELWRPTLPAWRRASERAEMDVRVRGGVPQDAPAWLKPAGDDVPNATILVTDESHLILTSGDGETIAGLWTSHPLIVMLARRALQTIA*
Ga0137457_108899323300011443SoilYGALEGLVTRGAASRTPPSVRPARYRPIDPQALLAQLAALQGAALDRLGRELKGLARPGEPVTREVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERADMSVRMRGASPPDVPSWLTSATDEAPSATILVIDESQLIVASGEGEAIAGLWTSHPLIVMLARRALQTIS
Ga0137427_1014700713300011445SoilVRPARYRPIDPQALMAQLAALQGEALDRLGRELKGLARPGEPVTRKVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERAAMSVRMPGAPPPEAPSWLTSATDEAPSATMLVIDESQLIVTSGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0137327_106835023300012173SoilVRPARYRPIDPQALLAQLAALQGAALDRLGRELKGLARPGEPVTREVAGSRAVANLVMQLVARAERRVEGIVALELWRPTLPAWRRAGERADMSVRMRGASPPDVPSWLTSATDEAPSATILVIDESQLIVASGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0137364_1029483523300012198Vadose Zone SoilAATRTPPPQRPARYRPTDPQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGGRAVANLIMQLVARAERSVKGIMALELWRPTLPAWRRAGERAEVSVRVRGGMPAEAPAWLEPAADDAPAATILLIDAAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0137382_1000308613300012200Vadose Zone SoilEALDRLGRDLRGLTRPGDPVTREVSGSRAVANLVQQLVARAERRVEGIMAGELWRPTLPAWRRAAERAQIQLLIRGDLPPDPPSWITPAAEDAPDATILVIDESQLVITAGEAEAIAGLWTSHPLVVQLARRALQTVS*
Ga0137382_1006845533300012200Vadose Zone SoilGLLAQLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQLIVTSGDDEAIAGVWSSHPLIVMLGRSALPTLA*
Ga0137382_1030315113300012200Vadose Zone SoilPARYRPTDPQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGIRAVANLIMQLVARAERSVQGIMALELWRPTLPAWRRAGERAEVSVRVRGGMPADAPAWLEPAADDAPAATILLIDAAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0137382_1106515913300012200Vadose Zone SoilLLAQLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQLIVTSGDGEAIAGVWSSHPLIVMLGLSALPTLA*
Ga0137365_1019824623300012201Vadose Zone SoilLVSRGAATRTPPPLRPARYRPADPQSLLAHLATLQGEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVEGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA*
Ga0137399_1021254813300012203Vadose Zone SoilEVRDLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLRVRAHGGVPPDAPSWLEPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLIVLLARRALQSVS*
Ga0137374_1045268423300012204Vadose Zone SoilALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLIDDAHLVMTSGDGEAVAGLWTSHPLIVLLARRALRGVA*
Ga0137376_1023557433300012208Vadose Zone SoilGRDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEGGLDATALLIDDAQLVITSGEGESIAGLWTSHPLIVLLARRALRTPS*
Ga0137378_1162254013300012210Vadose Zone SoilRANAYGALESLVSRGAATRTPPPQRPARYRPTDPQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGSRAVANLIMQLVARAERSVQGIMALDLWRPTLPAWRRAGERAQVSVRVRGGMPADAPAWLEPAADDAPAATILLIDAAQLVMTTGEGDAIAGLWTSHPLIVLLAQRAL
Ga0137370_1000067193300012285Vadose Zone SoilLVSRGAATRTPPPQRPARYRPADPQSLLAHLATLQGEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVEGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA*
Ga0137370_1096112413300012285Vadose Zone SoilEALDRLSRDLRDMARPGDPVTREVSGSRAVANLIMQLVARAERSVQGIMALELWRPTLPAWRRAGERAEVSVRVRGGMPADAPAWLEPAADDAPAATILLIDAAQLVMTTGEGAGIAGLWTSHPLIVLLAQRAL*
Ga0137372_1017694413300012350Vadose Zone SoilQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGGRAVANLIMQLVARAERSVKGIMALELWRPTLPAWRRAGERAELSVRVRGGMPADAPAWLEPAADDAPAATILLIDVAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0137386_1030029213300012351Vadose Zone SoilGLVAQLATLQGEALDRLSRELRDISRPGDPVTREVSGGRAVANLIMQLVARAERSVKGIMALELWRPTLPAWRRAGERAEVSVRVRGGMPADAPAWLEPAADDAPAATILLIDVAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0137386_1050591813300012351Vadose Zone SoilRPIDPQALLALLATRQGEALDRLGRDVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLALAPDDAPDATVLLIDVTQLVMTSGEGETIAGLWTSHPLMVLLARRALQSVS*
Ga0137366_1095899613300012354Vadose Zone SoilALDRLGRDLRGLTRPGDPVTREVSGSRAVANLVQQLVARAERRVEGIMAGELWRPTLPAWRRAAERAQIQLLIRGDLPPDPPSWITPAAEDAPDATILVIDESQLVITAGEAEAIAGLWTSHPLVVQLARRALQTIS*
Ga0137371_1001491313300012356Vadose Zone SoilYRPTDPPGLVAQLATLQGEALDRLSRELRDMSRPGDPVTRVVSGSRAVANLIMQLVARAERSVQGIMALDLWRPTLPAWRRAGERAQVSVRVRGGMPADAPAWLEPAADDAPAATILLIDAAQLVVTTGEGDAIAGLWTSHPLIVLLAQRAL*
Ga0137384_1005523113300012357Vadose Zone SoilPPPQRPARYRPTDPQGLVAQLATLQGEALDRLSRELRDMSRPGDPVTREVSGGRAVANLIMQLVARAERSVKGIMALELWRPTLPAWRRAGERAEVSVRVRGGMPADAPAWLEPAADDAPAATILLIDVAQLVMTTGEGDGIAGLWTSHPLIVLLAQRAL*
Ga0137385_1082407623300012359Vadose Zone SoilYRPIDPQALLALLATRQGEALDRLGRDVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA*
Ga0137375_1004487543300012360Vadose Zone SoilANAYGALEGLVSRGAATRTPPPNRPARYRPIDPQALVALLATRQGEALDRLGRDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEDGLDATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRSTS*
Ga0137361_1021444313300012362Vadose Zone SoilIDPQALVAQLATLQGEALDRLGRELRDFSRPGDPVTREVAGSRAVANLVMQLVARAERSVTGIMALELWRPTLPAWRRAAERARMAVRVRGGMPADAPAWLEPAPGDAPNATILLIDEGQLVMTTGEGEAIAGFWTSHPLIVVLAQRAL*
Ga0137361_1192673113300012362Vadose Zone SoilRTPPPHRPARYRPIDPQALLAVLATRQGEALDRLGREVRDLSRPGDPATREVAGSRAVANLVTQLVARAERSVEGVMVLELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDGAPDATVLVIDDTQLVMTSGEGETIGGLWTSHPLMVLLARRALQSVL*
Ga0137358_1029608413300012582Vadose Zone SoilRQGEALDRLGREVRDLSRPGDPATREVAGSRAVANLVTQLVARAERTVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLMVLLARRALQSVL*
Ga0137397_1002416833300012685Vadose Zone SoilLLAQLATAQGEALDRLSRQLRDLGRPGEPITREVAGSRAVANLVMQLVARASTSVEGVVAADLWRPTLPAWRRAGERAQLDVRMVGDLPADAPPWLKAAGEDATTQRATVLVIDQSQLILTSGEGESIAGVWSSHPLIVLLARRALQTLP*
Ga0137397_1002539713300012685Vadose Zone SoilTRGAATRTPPPNRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTVPAWRRAAERATLKVRARGGVPPDAPSWLEPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLIVLLARRALQSVS*
Ga0137396_1013985833300012918Vadose Zone SoilEGLVTRGAATRTPPPARPARYRPTDPQALIAQLALVQGEALDRLGRDLRGLSRPGEPVTREVSGSRAVANLMQQLVARAEQRVEGIMAIELWRPTLPAWRRASERAQIRLLIRGGLPPDPPSWLTPAAEGAPDATILVIDESQLVITAGEGEAIAGLWTSHPLIVLLARRALQTVS*
Ga0137396_1019001813300012918Vadose Zone SoilGLLALLATRQGEALDRLGREVRDLSRPGDPVTREVAGARAVANLLTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGMPPDAPSWLEPASDDASEATVLLIDDMQLVMTSGDGETIAGLWTSHPLIVLLARLALRSMA*
Ga0137394_1101132623300012922Vadose Zone SoilYRPIDPQALMAQLAALQGEALDRLDRGLKGLARPGDPVTREVAGSRAVANLVMQLVARAERRVEGIVAPELWRPTLPAWRRAGERAQMSIRMRGELPADAPAWLTPTTGDAPSATVLVVDESQLIVTSGEGEAIAGLWTSHPLIVMLARRALQTIS*
Ga0137419_1032407923300012925Vadose Zone SoilGAATRTPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLIDDAQLVMTSGDGEAVAGLWTSHPLIVLLARRALRGAA*
Ga0137416_1095491223300012927Vadose Zone SoilDRLGREVRDLSRPGDPVTREVAGSRALANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPEAPSWLEPAPADAPDATVLLIDDMQLVLTSGEGETIAGLWTSHPLILLLARRALQTVL*
Ga0137410_1160450113300012944Vadose Zone SoilARPARYRPTDPQALLAQLALVQGEALDRLGRDLRGLSRPGDPVTRAVSGSRAVANLVQQLVARAERRVEGIMAIELWRPTLPAWRRAGERAQIRLLIRGGLPPDPPSWITPAAEGAPDATILVIDESQLVITAGEGEAIAGLWTSHLLIVLLARRALQPVW*
Ga0134087_1010998723300012977Grasslands SoilLVSRGAATRTPPPERPARYRPADPQSLLAHLATLQGEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA*
Ga0134075_1030045423300014154Grasslands SoilPPPDRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPVTREVTGSRAVANLVTQLVARAERGVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGETIAGPWPSHPLMVLLARRALRSVL*
Ga0134078_1033593713300014157Grasslands SoilEALDRLGRDLRDATRPSEPVTREVAGSRAVANLLMQLVARAARSVQGVMALELWRPTLPAWRRAGERATLTVRMRGGMPADPPAWLEPAGTDAPNSTVLLIDDAQLLLISGEGDGLAGLWSSHPLILLIAQRAMPAIL*
Ga0134078_1059642713300014157Grasslands SoilLEGLVKRGAAARTPTPARPVRYRPTDPQALLAQVATLQGEALDRLGQELRGLSLPVDPITREIAGSRAVANLVVQLVARAERSVEGVMAAELWRPTLPAWRRAGERAQLEVALLGEPPPDAPPWLKHAPENAPNVTMLIIDESQLVLTSGAGDGIAGLWSSHPLILLIGRRALQT
Ga0134089_1023504913300015358Grasslands SoilGLVSRGAATRTPPPTRPARYRPTDPQSLLAHLATLQGEALDRLGRELRNLSRPGEPIIREVAGSRAAANLLMQLVARAERSVKGVMALELWRPTLPAWRRAAERAKLAVRLRGEMPADAPAWLEPAAADAPEATVLVIDETHLLVTSGDGEAITGVWTSHPLVLLLAHRALQTFS*
Ga0134089_1026395323300015358Grasslands SoilGYAVARGARLAPANAYGALEGLVYQGAATRTPPPHRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPATREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKLRARGGVPPDAPSWLAPASDDAPDATVLLIDDTQLVMTSGEGDTIAGLWTSHPLMVLLARRALRSVL*
Ga0132255_10290255213300015374Arabidopsis RhizosphereGALEGLVTRGAATRTAPSVRPARYRPIDPQALLAQLAALQGEALDRLGRELKGLAQKGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADPPSWLSSAADDAPVATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQTVL*
Ga0134069_109458923300017654Grasslands SoilLVTRGAASRTPSPARPVRYRPTDPQALLAQLATLQGEALDRLGRELRGLSLPVDPVTKEIVGNRAVANLILQLVARAERRVDGVMAAELWRPTLPAWRRAGERAQLDVALAGDLPVDAPSWLKQAPEDAPKVTMLIMDESQLVLTSGEGDAIAGLWSSHPLVVMIGRRALQTMS
Ga0134112_1036622413300017656Grasslands SoilDRPARYRPIDPQALLALLAARQGEALDRLGREVRDLSRPGDPVTREVTGSRAVANLVTQLVARAERGVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLVDDTQLVMTSGDGETVAGLWTSHPLIVLLARRALRGAA
Ga0184632_1017549713300018075Groundwater SedimentGLVTRGAATRTPPPNRPARYRPIDPQALVALLATRQGEALDRLGRDVRDFSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPPWLEPASEDGLDATALLIDDAQLVMTAGEGESIAGLWTSHPLVVLLARRALRSTS
Ga0184627_1021216223300018079Groundwater SedimentPIDPQGLIAQLATRQGEALDRLNRELKDASHPGDPVTKEVAGSRAVANLILQLVARAERRVEGVMAAELWRPTLPAWRRASERAAVDLRIAGELPADASQSPWLKAAAADVPAVTLLVVDESQLVVTSGEGDGIAGLWSSHPLMIMLARRALQTFA
Ga0187774_1123961123300018089Tropical PeatlandAHLATRQGEALDRLGRELRNAARPSEPITREVAGSRAVANLLMQLVARAERSVQGVMALELWRPTLPAWRRAAERATLTVRMRGGMPADPPAWLVAAGTGAPSSTVLLIDDAQLLLISGEGDGLAGLWSSHPLILLIAQRAMPAIL
Ga0066669_1052587423300018482Grasslands SoilDRLGRQVRDLSRPGDPVTREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATMSVRARGGMPPDAPSWLEPASEGGLEATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRTLS
Ga0066669_1180455413300018482Grasslands SoilSAATRSPRSERPARYRPTDPQVLLAHLAMLQGEALDRLGRELRDFSRPAEPITREVTGSRAVANLLMQLVARAERRVQGVMAFELWRPTLPAWRRAGERASMQVRLRGEMPSDAPSWLEPAPESAPDAILLIVDEVHLLVAAGAGDATSGLWTSHPLLLMLALRALQTIQ
Ga0179594_1017807013300020170Vadose Zone SoilARLARANAYGALEGLVTRAAATRTPPPTRPARYRPTDPQTLLVQLATGQGEALDRLGSELRGFSRPGEPVMREVAGSRAVANLIQQLVARAERRVEGIMAIELWRPTLPAWRRASERAQIELLVRGDLPPDPPTWLKAAGDGVPDATILVTDESQLVTTTGAGEAIAGLWTSHPLLVMLGRRALQTIS
Ga0193695_104223423300021418SoilMRPCFGWALRPATRWPAAPASRAPTRMALLATRQGEALDRLGREVRDLSRPGDPVTREVAGNRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGVPPDAPPWLEPAANDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLIVLLARRALQSKS
Ga0247678_102172913300024325SoilQLAALQGEALDRLGRELKGLAQRGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADAPAWLSSAADDAPVATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQPVL
Ga0207689_1005789933300025942Miscanthus RhizosphereVTRGAATRTAPSVRPARYRPIDPQALLAQLAALQGEALDRLGRELKGLAQKGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADAPAWLSSAADDAPVATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQPVL
Ga0209237_105910413300026297Grasslands SoilVSRGAATRTPPPERPARYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVAGSRAVANLMMQLVARAERSVEGIMALELWRPTLPAWRRASERAQLKVRLRGGMPADAPSWLEPAAADAPAATVLVIDEAHVVITSGEGDAVTGLWTSHPLILLLARRALRTFS
Ga0209237_107603613300026297Grasslands SoilLAQLATLQGEALDRLGRELRDISRPGDPVTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAGERATMKVRARGGMPADAPSWLEPASEDAPHVTALLIDATQLVMTSGEGESIAGLWTSHPLIVLLARRALQTMS
Ga0209236_101958413300026298Grasslands SoilARYRPIDPQALLALLATRQGEALDRLGRDVRDLSRPGDPATREVAGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLKVRARGGMPPDAPSWLAPAPDDAPDATVLLIDDTQLVMTSGEGETIAGLWTSHPLMVLLARRALQSVL
Ga0209468_100195013300026306SoilYSALEGLTARGAASRTPPPERPARYRPTDPQTLLAHLATLQGEALDRLGRDLRDATRPSEPVTREVAGSRAVANLLMQLVARAARSVQGVMALELWRPTLPAWRRAGERATLTVRMRGGMPADPPAWLEPAGTDAPNSTVLLIDDAQLLLISGEGDGLAGLWSSHPLILLIAQRAMPAIL
Ga0209468_100371653300026306SoilVSRGAATRTPPPNRPARYRPIDPQALLALLATRQGEALDRLERDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEGGLEATALLIDDAQLVMTSGDGESIAGLWTSHPLVVLLARRALRTPS
Ga0209472_119172213300026323SoilLVTRGAASRTPPPSRPARYRPTDPQGLLAHLAMMQGEALDRLGRELRDISRPGEPVTREVGGSRAVANLVMQLVARANKSVAGVMAAELWRPTLPAWRRAAERAQLDVRLMGDLPPDAPPWLKSVGEHAVTNRATVLVIDESQVIVTSGDGEAIAGVWSSHPLIVMLGRSALPTLA
Ga0209470_133243813300026324SoilRGAATRTPPPERPARYRPTDPQSLLAHLATLQGEALDRLGRQLRDLSRPGEPVVREVAGSRAVANLMMQLVARAERSVEGIMALELWRPTLPAWRRASERAQLKVRLRGGMPADAPSWLEPAAADAPAATVLVIDEAHVVITSGEGDAVTGLWTSHPLILLLARRALRTFS
Ga0209375_100100413300026329SoilLQGDALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA
Ga0209159_123230423300026343SoilPTDPQALLAQLATLQGEALDRLGRELRGLSLPVDPVTKEIVGNRAVANLILQLVARAERRVDGVMAAELWRPTLPAWRRAGERAQLDVALAGDLPVDAPSWLKQAPEDAPKVTMLIMDESQLVLTSGEGDAIAGLWSSHPLVVMIGRRALQTMS
Ga0209058_102280233300026536SoilLEGLVSRGAATRTPPPERPARYRPADPQSLLAHLATLQGEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA
Ga0209376_108675613300026540SoilEALDRLGQQLRDVSRPGEPVVREVAGSRAVANLVMQLVARAERSVKGIMALELWRPTLPAWRRASERAQVTVRMRGGMPADAPSWLEPAAADSPAVTMLMIDETHVVITSGDGDAVTGLWTSHPLILLLARRALPTVA
Ga0209474_1015825723300026550SoilLVSRGAATRSPRSERPARYRPTDPQVLLAHLAALQGEALDRLGRELRDFSRPSEPITREVTGSRAVANLLMQLVARAEHRVQGVMAFELWRPTLPAWRRAGERASMDVRVRGEMPPDPPSWLEPAPANAPDATLLLVDEAHLLVASGAGDAIAGLWTSHPLLLMLALRALQTIG
Ga0209577_1024127613300026552SoilPAPARPVRYRPTDPQALLAQLATLQGEALDRLGQELRGLSLPVDPITREIAGSRAVANLVVQLVARAERSVEGVMAAELWRPTLPAWRRAGERAQLDVALLGEPPPDAPPWLKHAPENAPNVTMLLIDESQLVLTSGAGDGIAGLWSSHPLILLIGRRALQTLT
Ga0179593_104390623300026555Vadose Zone SoilVVSRGAATRTPLPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSEPGDPVTREVGGSRAVANLVTQLVARAERSVEGVMALELWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDASDATVLLIDDTQLVMTSGDGEAVAGLWTSHPLILLLARRALRGAA
Ga0209486_1131566213300027886Agricultural SoilYSALEGLVTRGAATRTPPTIRPARYRPIDPQALMAHLAALQGEALDRLGRELRGLARPGDPVTKEVAGSRAVANLVMQLVARAERRVEGIVAAELWRPTLPAWRRAAERAQLDVRIRGDLPPDAPAWLTSASGDAPPPAATVLVVDESQLVITSGDGEAIGGLWTSH
Ga0209488_1050585813300027903Vadose Zone SoilGEALDRLGRDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEDGLDATALLIDDAQLVMTSGEGESIAGLWTSHPLVVLLARRALRTPS
Ga0209382_1023167013300027909Populus RhizospherePTRPARYRPTDPQSLLARLSLAQGEALDRLSRELRDISHPGEPVTREVAGSRAVANLVMQLVARASQRVEGVVTAELWRPTQPGWRRAGERAQLDVRMTGDRPADAPGWLKSAGDDASTLRATILVIDESQLILTSGEGEAVAGVWSSHPLIVMLARRALETLP
Ga0209382_1078515023300027909Populus RhizosphereYRPIDPQALLAQLATRQGEALDRLGRELKDLARPGAPVTREVAGSRAVANLVMHLVARAQRRVEGIVALELWRPTLPAWRRAGERADMSVCLRGELPPDAPSWLTAAASDAPTPSATVLVIDESQLIVTSGEGEAIAGLWTSQPLIVMLARRALQTMT
Ga0268265_1062316923300028380Switchgrass RhizosphereVRPARYRPIDPQALLAQLAALQGEALDRLGRELKGLAQRGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPADPPSWLSSAADDAPVATILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALQPVL
Ga0137415_1007916843300028536Vadose Zone SoilLLATRQGEALDRLGREVRDLSRPGDPVTREVAGARAVANLLTQLVARAERSVEGVMALELWRPTLPAWRRAGERATINVRARGGMPPDAPSWLQPASEDAPDATVLLIDDTQLVMTSGDGEAVAGLWTSHPLILLLARRALRGAA
Ga0137415_1061422823300028536Vadose Zone SoilTLQGEALDRLGRELRGLSLPIDPVTKEIAGHRAVANLILQLVARAERRVEGVMAAELWRPTLPAWRRAGERAQLDVSLAGELPVDAPPWLKQAPADKPKVTMLIMDESQLVLTSGEGDATAGLWSSHPLIVTIGRRALQTMS
Ga0307301_1008011723300028719SoilRPARYRPIDPQALLALLATRQGEALDRLGRDVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEDGLEVTALLVDDAQLVMTSGEGESIAGLWTSHPLVLLLARRALRTPS
Ga0307282_1024624813300028784SoilGLVSRGAATRTPPPNRPARYRPIDPQALVALLATRQGEALDRLGREVRDLSRPGDPVTREVAGSRAVANLITQLVARAERSVEGMMALELWRPTLPAWRRAAERASLTVRARGGMPPDAPSWLQPAPEDAPAATVLLIDDTQLVMTTGEGEAIAGLWTSHPLIVLLVRRALQSVV
Ga0307278_1051196913300028878SoilPPPERPARYRPIDPQALLALLATRQGEALDRLGREVRGLSRPGDPVTREVAGGRAVANLVTQLVARAERSVEGVMALDLWRPTLPAWRRAAERATLTVRARGGLPPDAPAWLEPAASDAPDATVLLIDDAQLVMTSGDGEAVAGLWTSHPLIVLLARRALRGAA
Ga0307308_1002128413300028884SoilTPPPNRPARYRPIDPQALLALLATRQGEALDRLGREVRDLSRPGDPMTREVAGSRAVANLVMQLVARAERSVEGVMALELWRPTLPAWRRAAERATMTVRARGGMPPDAPSWLEPASEDGLEVTALLVDDAQLVMTSGEGESIAGLWTSHPLVLLLARRALRTPS
Ga0307469_1016671333300031720Hardwood Forest SoilYRPTDPHGLLAQLATAQGEALDRLGRQLRDLSRPGEPITREVAGSRAVANLVMQLVARASTSVEGVLAAELWRPTLPAWRRAGERAQLDVRMTGDVPADAPPWLKPAAEDALNLRATILVIDESQLILTSGDGESIAGVWSSHPLIVMIARRALQTLP
Ga0307469_1053272213300031720Hardwood Forest SoilATRTPPPARPARYRPTDPQSLLAQLATLQGEALDRLGRELRNLSRPGEPIIREVSGSRAVANLLMQLVARAERSVKGVMALELWRPTLPAWRRAAERAKLEVRVRGGMPADAPAWLEPAAADSPDATVLVIDETHLLVTSGDGEAITGVWTSHPLVLLLAQRALPTFS
Ga0310900_1078299123300031908SoilDRLGRELKGLAQKGDPVTREVAGSRAVANLIMQLVARAERRVEGIVAAELWRPTLPAWRRAGERAQLSVKMRGDLPSDAPAWLSSAADDAPVASILVTDEAQLIVTSGEGEAIAGLWTSHPLIVMLARRALRTVL
Ga0307471_10039503213300032180Hardwood Forest SoilASRTPPPARPARYRPTDPHGLLAQLATAQGEALDRLGRQLRDLSRPGEPFTREVAGSRAVANLVMQLVARARTSVEGVLAAELWRPTLPAWRRAGERAPLDVRMTGDVPADAPPWLKPAAEDALNLRATILVIDESQLILTSGDGESIAGVWSSHPLIVMIARRALQTLP
Ga0364940_0036957_622_10683300034164SedimentVAQLATAQGEALDRLTRALRDASRPVDPVTKEVAGSRAVANLLLQLVARAERRVEGIMDVELWRPTLPAWRRAGERAQVDLRIGGGDLPADAPAWVRIAAADAPDATLLVVDEAQLVVASGEGDHIAGLWSSHPLIVMLARRALQTLG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice