NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F072580

Overview

Basic Information
Family ID F072580
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 163 residues
Representative Sequence MPDNPTYIHKLAEILTEARSPKPIPFFRRRDMEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRDLVRWLERRISDPYVAVEQLRHEAVIGRIVELKAETAARAVKIVLPDPKPSVELPDGVSLQPGLLTVSFDNEQQLLERLFLLARVLATQPQFLSSLSLPR
Number of Associated Samples 101
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 54.55 %
% of genes near scaffold ends (potentially truncated) 41.32 %
% of genes from short scaffolds (< 2000 bps) 80.99 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48
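
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is taken over the 121 family members; the page does not state the denominator explicitly, and the helper name `pct_to_count` is hypothetical.

```python
# Hypothetical helper: convert a reported percentage back into a member count.
# Assumption: every percentage is computed over the 121 sequences of family F072580.
FAMILY_SIZE = 121  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members that pct percent of total represents."""
    return round(pct / 100 * total)


# Values copied from the Quality Assessment block.
quality = {
    "valid RBS motifs": 54.55,
    "near scaffold ends": 41.32,
    "short scaffolds (< 2000 bps)": 80.99,
}

counts = {label: pct_to_count(pct) for label, pct in quality.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly two thirds of the genes carry a valid RBS motif, while most come from short scaffolds, which is worth keeping in mind when judging how complete the family members are.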

Most Common Taxonomy
Group Unclassified (61.983 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (9.091 % of family members)
Environment Ontology (ENVO) Unclassified (33.058 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (41.322 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 48.96%, β-sheet: 9.90%, Coil/Unstructured: 41.15%

Predicted 3D Structure

pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
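
The cutoff in the note above can be expressed as a simple check. This is a minimal sketch: the 0.7 threshold comes from the note, while the function name `is_low_confidence` is hypothetical and is not part of any NMPFamsDB tooling.

```python
# Threshold quoted in the note: models with pTM < 0.7 are flagged as low confidence.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < cutoff


family_ptm = 0.48  # pTM-score reported for family F072580
print(is_low_confidence(family_ptm))
```

The comparison is strict, so a model scoring exactly 0.7 would not be flagged under this reading of the note.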



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF02899 | Phage_int_SAM_1 | 13.22
PF13614 | AAA_31 | 2.48
PF02452 | PemK_toxin | 2.48
PF01656 | CbiA | 1.65
PF00589 | Phage_integrase | 1.65
PF13517 | FG-GAP_3 | 0.83
PF01041 | DegT_DnrJ_EryC1 | 0.83
PF02661 | Fic | 0.83
PF10134 | RPA | 0.83
PF13489 | Methyltransf_23 | 0.83
PF00665 | rve | 0.83
PF01850 | PIN | 0.83
PF00239 | Resolvase | 0.83
PF02604 | PhdYeFM_antitox | 0.83
PF13692 | Glyco_trans_1_4 | 0.83
PF08241 | Methyltransf_11 | 0.83
PF02397 | Bac_transf | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 13.22
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 13.22
COG2337 | mRNA-degrading endonuclease MazF, toxin component of the MazEF toxin-antitoxin module | Defense mechanisms [V] | 2.48
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.83
COG4584 | Transposase | Mobilome: prophages, transposons [X] | 0.83
COG4118 | Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressor | Defense mechanisms [V] | 0.83
COG3316 | Transposase (or an inactivated derivative), DDE domain | Mobilome: prophages, transposons [X] | 0.83
COG2873 | O-acetylhomoserine/O-acetylserine sulfhydrylase, pyridoxal phosphate-dependent | Amino acid transport and metabolism [E] | 0.83
COG2826 | Transposase and inactivated derivatives, IS30 family | Mobilome: prophages, transposons [X] | 0.83
COG2801 | Transposase InsO and inactivated derivatives | Mobilome: prophages, transposons [X] | 0.83
COG0399 | dTDP-4-amino-4,6-dideoxygalactose transaminase | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG2161 | Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM family | Defense mechanisms [V] | 0.83
COG2148 | Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid) | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.83
COG1104 | Cysteine desulfurase/Cysteine sulfinate desulfinase IscS or related enzyme, NifS family | Amino acid transport and metabolism [E] | 0.83
COG0626 | Cystathionine beta-lyase/cystathionine gamma-synthase | Amino acid transport and metabolism [E] | 0.83
COG0520 | Selenocysteine lyase/Cysteine desulfurase | Amino acid transport and metabolism [E] | 0.83
COG0436 | Aspartate/methionine/tyrosine aminotransferase | Amino acid transport and metabolism [E] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 61.98 %
All Organisms | root | All Organisms | 38.02 %


Associated Scaffolds


Scaffold | Taxonomy | Length
2140918013|NODE_298_length_1081_cov_18.859390 | Not Available | 1113
3300000363|ICChiseqgaiiFebDRAFT_14403468 | All Organisms → cellular organisms → Bacteria | 1509
3300000890|JGI11643J12802_10189534 | Not Available | 962
3300001356|JGI12269J14319_10048723 | All Organisms → cellular organisms → Bacteria | 2553
3300002568|C688J35102_120940503 | Not Available | 2659
3300004081|Ga0063454_100810654 | All Organisms → cellular organisms → Bacteria | 725
3300004114|Ga0062593_103378313 | Not Available | 512
3300004152|Ga0062386_100530625 | Not Available | 957
3300004153|Ga0063455_101512544 | Not Available | 524
3300004463|Ga0063356_102954570 | Not Available | 732
3300005338|Ga0068868_101237513 | Not Available | 691
3300005345|Ga0070692_10940018 | Not Available | 600
3300005367|Ga0070667_101805068 | Not Available | 575
3300005434|Ga0070709_11051312 | Not Available | 649
3300005435|Ga0070714_102505554 | Not Available | 501
3300005471|Ga0070698_100993016 | Not Available | 786
3300005524|Ga0070737_10175124 | Not Available | 904
3300005534|Ga0070735_10015341 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5683
3300005539|Ga0068853_100067387 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3110
3300005764|Ga0066903_100601848 | All Organisms → cellular organisms → Bacteria | 1905
3300005764|Ga0066903_103617558 | Not Available | 832
3300005764|Ga0066903_107349383 | Not Available | 569
3300005842|Ga0068858_100087416 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 2899
3300005843|Ga0068860_100311644 | All Organisms → cellular organisms → Bacteria | 1543
3300006052|Ga0075029_100479923 | All Organisms → cellular organisms → Bacteria | 818
3300006162|Ga0075030_100195402 | All Organisms → cellular organisms → Bacteria | 1630
3300006162|Ga0075030_100362352 | Not Available | 1156
3300006175|Ga0070712_101683089 | Not Available | 555
3300006358|Ga0068871_101019984 | Not Available | 771
3300006806|Ga0079220_11488076 | Not Available | 580
3300007004|Ga0079218_11284589 | Not Available | 769
3300009093|Ga0105240_11295859 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Azospirillaceae → Azospirillum → Azospirillum tabaci | 768
3300009094|Ga0111539_12639852 | Not Available | 582
3300009177|Ga0105248_10268457 | Not Available | 1921
3300009521|Ga0116222_1184454 | Not Available | 898
3300009522|Ga0116218_1180209 | Not Available | 957
3300009553|Ga0105249_10173370 | All Organisms → cellular organisms → Bacteria | 2093
3300009700|Ga0116217_10184227 | All Organisms → cellular organisms → Bacteria | 1379
3300009840|Ga0126313_11333813 | Not Available | 593
3300009868|Ga0130016_10004064 | All Organisms → cellular organisms → Bacteria | 27956
3300009868|Ga0130016_10511447 | Not Available | 767
3300009873|Ga0131077_10821513 | Not Available | 810
3300010346|Ga0116239_10320245 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylobacteriaceae → Methylobacterium | 1079
3300010373|Ga0134128_11385575 | Not Available | 774
3300010376|Ga0126381_100810667 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 1344
3300010379|Ga0136449_100201749 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 3782
3300010876|Ga0126361_10693963 | Not Available | 665
3300012212|Ga0150985_112195729 | Not Available | 570
3300012212|Ga0150985_115687693 | Not Available | 528
3300012354|Ga0137366_10344519 | Not Available | 1091
3300012955|Ga0164298_10587143 | Not Available | 762
3300012955|Ga0164298_11579050 | Not Available | 516
3300012958|Ga0164299_10983753 | Not Available | 620
3300012961|Ga0164302_10317706 | All Organisms → cellular organisms → Bacteria | 1025
3300012984|Ga0164309_11657320 | Not Available | 548
3300012985|Ga0164308_10698360 | Not Available | 874
3300012985|Ga0164308_12092665 | Not Available | 527
3300012986|Ga0164304_10764634 | Not Available | 741
3300013296|Ga0157374_11703841 | Not Available | 655
3300013296|Ga0157374_11988968 | Not Available | 607
3300013306|Ga0163162_11546737 | Not Available | 756
3300013307|Ga0157372_10333124 | All Organisms → cellular organisms → Bacteria | 1768
3300014169|Ga0181531_10251438 | Not Available | 1078
3300014325|Ga0163163_11078788 | Not Available | 866
3300014968|Ga0157379_11505293 | Not Available | 655
3300017975|Ga0187782_10901759 | Not Available | 686
3300018060|Ga0187765_10608717 | Not Available | 706
3300018432|Ga0190275_10649731 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylobacteriaceae → Methylobacterium | 1107
3300018468|Ga0066662_10522193 | All Organisms → cellular organisms → Bacteria | 1088
3300018468|Ga0066662_12213066 | Not Available | 577
3300019889|Ga0193743_1032681 | All Organisms → cellular organisms → Bacteria | 2469
3300020152|Ga0196971_1020951 | Not Available | 1551
3300021374|Ga0213881_10009973 | All Organisms → cellular organisms → Bacteria | 3879
3300021374|Ga0213881_10067800 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Sulfopaludibacter → unclassified Candidatus Sulfopaludibacter → Candidatus Sulfopaludibacter sp. SbA4 | 1522
3300021374|Ga0213881_10284847 | Not Available | 736
3300021374|Ga0213881_10359733 | Not Available | 653
3300021384|Ga0213876_10007452 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 5952
3300021388|Ga0213875_10136788 | All Organisms → cellular organisms → Bacteria | 1146
3300021388|Ga0213875_10262656 | Not Available | 815
3300021388|Ga0213875_10459407 | Not Available | 610
3300021432|Ga0210384_10002433 | All Organisms → cellular organisms → Bacteria | 22745
3300021476|Ga0187846_10202335 | Not Available | 832
3300021604|Ga0226835_1050666 | Not Available | 516
3300021976|Ga0193742_1070464 | All Organisms → cellular organisms → Bacteria | 1367
3300023272|Ga0247760_1210507 | Not Available | 502
3300025634|Ga0208589_1048742 | Not Available | 1082
3300025913|Ga0207695_10000765 | All Organisms → cellular organisms → Bacteria | 61406
3300025924|Ga0207694_10433491 | Not Available | 1096
3300025961|Ga0207712_10004003 | All Organisms → cellular organisms → Bacteria | 9303
3300025986|Ga0207658_11091168 | All Organisms → cellular organisms → Bacteria | 729
3300027570|Ga0208043_1093482 | Not Available | 820
3300027680|Ga0207826_1201498 | Not Available | 536
3300027703|Ga0207862_1005582 | All Organisms → cellular organisms → Bacteria | 3689
3300027854|Ga0209517_10008218 | All Organisms → cellular organisms → Bacteria | 12261
3300027886|Ga0209486_10479342 | Not Available | 769
3300027905|Ga0209415_10667591 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 753
3300027911|Ga0209698_11133483 | Not Available | 578
3300027986|Ga0209168_10009834 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5772
3300028381|Ga0268264_10315371 | All Organisms → cellular organisms → Bacteria | 1477
3300029636|Ga0222749_10029333 | Not Available | 2327
3300030706|Ga0310039_10273883 | Not Available | 645
3300031232|Ga0302323_102227674 | Not Available | 624
3300031548|Ga0307408_101696792 | Not Available | 602
3300031824|Ga0307413_10150795 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Granulicella → Granulicella rosea | 1620
3300031852|Ga0307410_11261222 | Not Available | 645
3300031901|Ga0307406_11347090 | Not Available | 624
3300031938|Ga0308175_100032178 | All Organisms → cellular organisms → Bacteria | 4351
3300031938|Ga0308175_100722241 | Not Available | 1084
3300031996|Ga0308176_10096458 | All Organisms → cellular organisms → Bacteria | 2582
3300032074|Ga0308173_11160890 | Not Available | 721
3300032074|Ga0308173_12102526 | Not Available | 533
3300032160|Ga0311301_10742408 | All Organisms → cellular organisms → Bacteria | 1366
3300032805|Ga0335078_10010976 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 13416
3300032805|Ga0335078_10177316 | All Organisms → cellular organisms → Bacteria | 2986
3300032805|Ga0335078_11096991 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 931
3300032805|Ga0335078_11139866 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 907
3300032893|Ga0335069_11945611 | Not Available | 621
3300032897|Ga0335071_10107890 | All Organisms → cellular organisms → Bacteria | 2729
3300032897|Ga0335071_10777314 | Not Available | 906
3300032898|Ga0335072_10506080 | All Organisms → cellular organisms → Bacteria | 1250
3300033513|Ga0316628_100679758 | Not Available | 1349
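
Each scaffold identifier in the table above has the form `<number>|<scaffold name>`. The following is a minimal sketch of splitting such identifiers and counting scaffolds per leading number. It assumes the part before the `|` is the Taxon OID of the source sample, which is inferred from the matching values in the Associated Samples table and not stated on the page. The function name `split_scaffold_id` is hypothetical.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first '|' into its two parts."""
    taxon_oid, sep, name = scaffold_id.partition("|")
    if not sep or not taxon_oid or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, name


# A few identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "2140918013|NODE_298_length_1081_cov_18.859390",
    "3300005764|Ga0066903_100601848",
    "3300005764|Ga0066903_103617558",
    "3300005764|Ga0066903_107349383",
    "3300009868|Ga0130016_10004064",
    "3300009868|Ga0130016_10511447",
]

# Count how many family scaffolds come from each sample (assumed Taxon OID).
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in scaffolds_per_sample.most_common():
    print(taxon_oid, n)
```

Grouping this way gives one plausible reading of why the family has 121 associated scaffolds but only 101 associated samples: some samples contribute more than one scaffold.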




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.09%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 8.26%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 7.44%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.13%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.13%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.31%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.31%
Exposed Rock | Environmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock | 3.31%
Plant Roots | Host-Associated → Plants → Roots → Unclassified → Unclassified → Plant Roots | 3.31%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 3.31%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.48%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.48%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.48%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.48%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 2.48%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 2.48%
Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 2.48%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.65%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.65%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 1.65%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.65%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.65%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.83%
Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.83%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.83%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.83%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.83%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.83%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Sand → Desert → Soil | 0.83%
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 0.83%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.83%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.83%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.83%
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 0.83%
Anaerobic Bioreactor Biomass | Engineered → Bioreactor → Anaerobic → Unclassified → Unclassified → Anaerobic Bioreactor Biomass | 0.83%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 0.83%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2140918013 | Soil microbial communities from Great Prairies - Iowa soil (MSU Assemblies) | Environmental
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300000890 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental
3300001356 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300004081 | Grasslands soil microbial communities from Hopland, California, USA - 2 (version 2) | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300004153 | Grasslands soil microbial communities from Hopland, California, USA (version 2) | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005524 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen10_05102014_R1 | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005539 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300006052 | Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013 | Environmental
3300006162 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009521 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG | Environmental
3300009522 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_5_LS metaG | Environmental
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300009700 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_4_PS metaG | Environmental
3300009840 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot105A | Environmental
3300009868 | Activated sludge microbial diversity in wastewater treatment plant from Tai Wan - Bali plant Bali plant | Engineered
3300009873 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Wenshan plant | Engineered
3300010346 | AD_USMOca | Engineered
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300010876 | Boreal forest soil eukaryotic communities from Alaska, USA - W5-5 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300013307 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaG | Host-Associated
3300014169 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin11_10_metaG | Environmental
3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated
3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated
3300017975 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG | Environmental
3300018060 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MG | Environmental
3300018432 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 T | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental
3300020152 | Soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_10-13C | Environmental
3300021374 | Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R08 | Environmental
3300021384 | Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9 | Host-Associated
3300021388 | Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R8 | Host-Associated
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021604 | Anaerobic ammonium oxidizing microbial communities from anammox membrane bioreactor (MBR) in UC Berkley, California, United States - LAC_MetaG_1 | Engineered
3300021976 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c1 | Environmental
3300023272 | Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L171-409R-4 | Environmental
3300025634 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample F53-2 shallow-092012 (SPAdes) | Environmental
3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated
3300025924 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes) | Host-Associated
3300025961 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes) | Host-Associated
3300025986 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes) | Host-Associated
3300027570 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG (SPAdes) | Environmental
3300027680 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 80 (SPAdes) | Environmental
3300027703 | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 81 (SPAdes) | Environmental
3300027854 | Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes) | Environmental
3300027886 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes) | Environmental
3300027905 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300027986 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes) | Environmental
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300030706 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_6_BS metaG (v2) | Environmental
3300031232 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3 | Environmental
3300031548 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3 | Host-Associated
3300031824 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-2 | Host-Associated
3300031852 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-3 | Host-Associated
3300031901 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-2 | Host-Associated
3300031938 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1 | Environmental
3300031996 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2 | Environmental
3300032074 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R1 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300032897Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5EnvironmentalOpen in IMG/M
3300032898Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.1EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Iowa-Corn-GraphCirc_01313420 | 2140918013 | Soil | MPDHPSWIHKLTEILAEARTPKPIPFFRRRDIEALFGLKKRQALNLMHRIGAIRVSRELAVEKRDLLRWLEQMVEDPSVAVEQRRHERVIDRIVELKAETAARAVKIILPDAKPSVELPDGVSLQPGLLTISFENDQQLLERLFLLARVLATQPQVLSSLSLPR
ICChiseqgaiiFebDRAFT_144034681 | 3300000363 | Soil | MPDHPSWIHKLTEILAEARTPKPIPFFRRRDIEALFGLKKRQALNLMHRIGAIRVSRELAVEKRDLLRWLEQMVEDPSVAVEQRRHERVIDRIVELKAETAARAVKIILPDAKPSVELPDGVSLQPGLLTISFENDQQLLERLFLLARVLATQPQVLSSLSLPR*
JGI11643J12802_101895342 | 3300000890 | Soil | MKEYCTIVQYHGPVAILSYRFPPMPDHPSWIHKLTEILAEARTPKPIPFFRRRDIEALFGLKKRQALNLMHRIGAIRVSRELAVEKRDLLRWLEQMVEDPSVAVEQRRHERVIDRIVELKAETAARAVKIILPDAKPSVELPDGVSLQPGLLTISFENDQQLLERLFLLARVLATQPQVLSSLSLPR*
JGI12269J14319_100487235 | 3300001356 | Peatlands Soil | LCNTADRFYNCLQVSSVPDKPSYIHKIKSILTEAKSPKPIPFFRRRDIEALFGLKRRQAINLMHRIGAVRVSCEIAIPQRDLVSWLEEIGSNPAGAREIRRQERVIGRIVDLKAETAARAVKIVLPESLPAADIPVGVSLQPGVLTVSFSNEQQLLERLFLLARALATKPQLISNLCTPQQ*
C688J35102_1209405031 | 3300002568 | Soil | MPDKPTYIHKLTEILVEARAPKPIPFFRRQDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLIAWLEQRIADPSVAIEQRRHERVIDRIVELRAETAARAVRIVLPDPKPSVELPDGVSLQPGLLTISFVNDQELLQQLFLLARVLATQPQILSSLSLPR*
Ga0063454_1008106542 | 3300004081 | Soil | SPKPIPFFRRCDVEALFGLRRRQAINLMHEIGAVRVSQEIAVPQKDLVAWLEKKVVDPARAREIRRQERVIGRIVELKAETAARAVKIVLPDGNPSMDLPAGVSLHLGVLTISFDNEQQLLERLFLLARTFAANPQMLSSLPKR*
Ga0062593_1033783131 | 3300004114 | Soil | MPDNPTYIHKLAAILTETRTPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLVAWLEQMIEDPSVVAEWQRHERVIDRIVELKAETAARAVKIVLPDPKPSVELPDGVSLQPGLLTISFESDQELLERLFLLARVFA
Ga0062386_1005306252 | 3300004152 | Bog Forest Soil | MPDNPTYIHKLEAILAEARLAKPIPFFRRRDIEALFGLKKRQAINLMHRIGAVRVSRELAVEQRHLVRWLELRIAQPSVAAEWRRHETVIDRIVELKAETAARAVKIVLPDPSPSLDLPEGVSLQPGLLTVSFASEQQLLERLFVLARVLATRPQSLSSLSLPR*
Ga0063455_1015125441 | 3300004153 | Soil | MPDKPTYIHKLTEILVEARAPKPIPFFRRQDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLIAWLEQMLENPSVSIEQRRHERVIDRIIELKAETAARAIKIVLPDPKGPLQLPDGVSLEPGLLTISFDNEQQLLERLFLLARVLATQPQVLSAVSLPR*
Ga0063356_1029545702 | 3300004463 | Arabidopsis Thaliana Rhizosphere | TLLQRPSMPDNPTYIHKLAEILAEARTPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLIAWLEQMIEDPSVAAEWQRHERVIDRIVELKAETAARAVKIVLPDPKPSVELPDGVSLQPGLLTISFESDQELLERLFLLARVFATQPQMLSSLSPSKLASSHY*
Ga0068868_1012375131 | 3300005338 | Miscanthus Rhizosphere | GASMPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP*
Ga0070692_109400181 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | MPNNPSYLHRLTAILAEARSPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVKVSSEIAVPQQDLVSWVERKALDPAGSREIRRQERVIERIVDLKAETAARAVKIVLPDAAPALDLPDGVSLQPGVLSISFNTAEELLERLFLIARSFAANPGLLSKLRRR*
Ga0070667_1018050681 | 3300005367 | Switchgrass Rhizosphere | IPPTKCVILAIGASMPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP*
Ga0070709_110513121 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | ILAEARRPKPIPFFRRGDIEVLFGLRRRQAINLMHEIGAIRVSNEIAVPQQDVVRWLERMASSPARTREIHRQKRVIDRIVELKAETAARAVKIVLPDQPAVDLPEGVSLRPGVLTIVFSTEQQLLERLFQFARALATKRQLLNDLGKS*
Ga0070714_1025055541 | 3300005435 | Agricultural Soil | RKPKPIPFFRRCDMEALFGLKRRQAINLMHEIGAVRVSNEIAVPQDDLVSWLEKKALDPARTQELRRRERVIGRIVELKAETAARTVKIVLPDAAPPVDWPEGVSLQPGVLRIAFQTEEQLLERLFGLVRSFAASPDLLHNLPKA*
Ga0070698_1009930161 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDNPTYIHKLEGILAEARAPKPIPFFRRRDIEALFGLRKRQAVNLMHRIGAVRVSRELAVEQRGLVRWLEQRISDPAVAVEQRRHAAVIGRIVELKAETAARAIKIVLPDRASSLDLPDGVSLQPGLLTLSFDNEQQLLERLFLLARVLATQPELLSSLSLPW*
Ga0070737_101751241 | 3300005524 | Surface Soil | VAVYSLEGSSVPDQPTYIHKLTAILAEARSPKPIPFFRRRDIEALFGLRKRQAVNLMHRIGAVRVSRELAVEQRELVRWLEQMISDPSVEAERRRHGTVIDRIVELKAETAARTVKIVLPDRPRSVDLPEGVVLTPGLLTVSFKGEQQLLERLFLLARALATRPELLSDVTLPC*
Ga0070735_100153413 | 3300005534 | Surface Soil | MPDNPTYLHRLPSILAEAKSPKPIPFFRRCDVEALFGLKRRQAINLMHRIGAVRVSQEIAIPQRELVSWLEQMVSNPATSHEIRRQERVIGRIVDLKAETAARAVKITLPDSAPSADLPAGVSLQPGVLTVTFRDEQELLEQLFLLARLLATKPQLISNLWR*
Ga0068853_1000673876 | 3300005539 | Corn Rhizosphere | MPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP*
Ga0066903_1006018482 | 3300005764 | Tropical Forest Soil | MPGNPTYIHKLEGILAEARLPKPIPFFRRRDVEALFGLKKRQALNLMHRIGAVRVSREMAIPQCDLVRWLEEMVSNPTVARERHRHDTVIGRIVELKAETAARAVKIVLPDGPSVEMPAGISFQPGLLRVAYSRLR*
Ga0066903_1036175581 | 3300005764 | Tropical Forest Soil | KPSYLHKLTAILAEAKRPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAVRVSNEIAVPQEDLVFWLENRARDPARTREIRRQERVIGRIIDLKAETAARAVKIALPDPPRSDDLPAGISLQPGLLMISFSTEEQLLERLFGFARVLANNRQLISHLRKEEA*
Ga0066903_1073493831 | 3300005764 | Tropical Forest Soil | CNTADKIWDNDMASCMPDQPSYLHRVTEILAEARTPKPIPFFRRSDIEALFGLKRRQAINLMHRVGAVRVSHEIAVPQRDLVSWLEKTVLDPAAAREIRRQERVIGRIVDLKAETAARAIKIVLPDAAPSAEFPDGISLQPGLLTVSFDNEQQLLERLFLLARLFATKPELLQNLSVAR*
Ga0068858_1000874162 | 3300005842 | Switchgrass Rhizosphere | MPDNPTYIHKLAEILAEARTPKPIPFFRRQDIEALFGLKKRQAINLMHQIGAVRVSRELAVEQRDLVHWLERMISDPSVAVEQRRHDAVIGRIVELKAETAARAIKIILPDGPSVDLPDGVSLRPGLLTVSFASEQQLLERLFLLARVLATEPQMFSSLSLPR*
Ga0068860_1003116442 | 3300005843 | Switchgrass Rhizosphere | MYYYRVLSTDVLHSCAIPPTKCVILAIGASMPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP*
Ga0075029_1004799232 | 3300006052 | Watersheds | MPDKPTYMHRLSSILAEAKSSKPIPFFRRLDVEALFGLKRRQAINLMHRIGAVRVSHEIAVPQRDLVAWLEKMARDPAGPREIRRQERVIGRIVDLKAEAAARAIKIVLPDGIRPADLPEGVGLEPGLLTVSFENERQLLERLFQLARVFAANPQLLN
Ga0075030_1001954023 | 3300006162 | Watersheds | MPDKPTYMHRLSSILAEAKSSKPIPFFRRLDVEALFGLKRRQAINLMHRIGAVRVSHEIAVPQRDLVAWLEKMARDPAGPREIRRQERVIGRIVDLKAEAAARAIKIVLPDGIRPADLPEGVGLEPGLLTVSFENERQLLERLFQLARVFAANPQLLNELSGAR*
Ga0075030_1003623521 | 3300006162 | Watersheds | MNRAGGMPRIPRGHLGVPKRILHSCALPPSKFAIMAAHPGMPDKPTYLHKLTSILAEAKTPKPIPFFRRSDVEALFGLKRRQAINLMHEIGAVRVSNEIAVPQDDLVAWVEKKALDPARSREIRRQERVIGRIVELKAETAARAVKIVFRDPAPSIDLPAGVSFLPGMLTITFNTEEQLLERLFLLARAFAANPQILNNLPKQ*
Ga0070712_1016830891 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDKPSYLHRVTAILAEARKPKPIPFFRRCDMEALFGLKRRQAINLMHEIGAVRVSNEIAVPQDDLVSWLEKKALDPARTQELRRRERVIGRIVELKAETAARTVKIVLPDAAPPVDWPEGVSLQPGVLRIAFQTEEQLLERLFGLVRSFAASPDLLHNLPKA*
Ga0068871_1010199841 | 3300006358 | Miscanthus Rhizosphere | WRWTSSVPDKPTYLNRLTAILAEAKTPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVAANPARIREIGRQQRVIARIVELKAETAARAVKIVLPDGPPSPDLPAGVSLQPGLLSVAFDTEQQLLERLFLLARLFAADTQTLSKFRRP*
Ga0079220_114880761 | 3300006806 | Agricultural Soil | MPDKPSYLHRVAEILAEAKTPKPIPFFRRRDMEALFGLKRRQAINLMHTIGAVRVSQEIAVPQQDLVSWLEKMALNPARSREIRRRERVIGRIVELKAETAARAVKIVLPDPASTPVDFPPGVSLQPGNLTISFTTGQQLLERLFLLARVFANKPQLLS
Ga0079218_112845892 | 3300007004 | Agricultural Soil | KLDGILAEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVDQRDLIRWLERMIANPSVAAEWHRHERVIGRIVELKAETAARAVKIVLPDRGLSVELPDGVSLQPGELTVAFDNEQQLLERLFLLARALATNPDMLTDSCVSPYGTSRDTSVVAITR*
Ga0105240_112958591 | 3300009093 | Corn Rhizosphere | LPENPTYIHKLLGILAEARTPKPIPFFRRRDIEALFGLRKRQAVNLMHQIGAIRVSRELAVDQKDLVRWLEDMISNPSTAAEHHRHETVIDRIVELKAETAARAIKIVLPDRKPPNGFPKGVSLAPGLLTVSFENEQELL
Ga0111539_126398521 | 3300009094 | Populus Rhizosphere | MPDKPSYLERLPEILQEAKSPKPIPFFRRRDIEALFGLKRRQAIRLMHHIGAIRVSSEIAVEQRDLVRWLERAAQSPAVTREVARRSRVVDRIVELKAETAARARKLVLPDPTPIVDIPDGVSLRPGVLTIAFASEQELLERLFLLARVLASNT
Ga0105248_102684572 | 3300009177 | Switchgrass Rhizosphere | VPDKPTYLNRLTAILAEAKTPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVAANPARIREIGRQQRVIARIVELKAETAARAVKIVLPDGPPSPDLPAGVSLQPGLLSVAFDTEQQLLERLFLLARLFAADTQTLSKFRRP*
Ga0116222_11844542 | 3300009521 | Peatlands Soil | VPSQPTYIHRLRAILAEARPARPIPFFRRRDIQALFGLQKRQAINLMHRIGAVRVSRELALRQPDLVGWIERRISEPSVAIEWRRHETVIGRIVELKAETAARAVRIVLPDRPPSVDLPPGVSLAPGLLTVSFENGPELLEKLFLLARVLATQPHLLDR*
Ga0116218_11802092 | 3300009522 | Peatlands Soil | PIPFFRRRDIEALFGLKRRQAINLMHRIGAVRVSCEIAIPQRDLVSWLEEIGSNPAGAREIRRQERVIGRIVDLKAETAARAVKIVLPESLPAADIPVGVSLQPGVLTVSFSNEQQLLERLFLLARALATKPQLISNLCTPQQ*
Ga0105249_101733702 | 3300009553 | Switchgrass Rhizosphere | MPDNPTYIHKLAEILAEACTPKPIPFFRRQDIEALFGLKKRQAINLMHQIGAVRVSRELAVEQRDLVHWLERMISDPSVAVEQRRHDAVIGRIVELKAETAARAIKIILPDGPSVDLPEGVSLRPGLLTVSFASEQQLLERLFLLARVLATEPQMLSSLSLPR*
Ga0116217_101842272 | 3300009700 | Peatlands Soil | VPDKPSYIHKIKSILTEAKSPKPIPFFRRRDIEALFGLKRRQAINLMHRIGAVRVSCEIAIPQRDLVSWLEEIGSNPAGAREIRRQERVIGRIVDLKAETAARAVKIVLPESLPAADIPVGVSLQPGVLTVSFSNEQQLLERLFLLARALATKPQLISNLCTPQQ*
Ga0126313_113338131 | 3300009840 | Serpentine Soil | IDAVVLHDRAIPYRNGYTLFEASSVPDNPSYIHKLAGILEEARAPKPISFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSREIAIDQRDLIRWLEQMIASPSVAAEWHRHERVIGRIVQLKAETAARAVKIVFPDQKPSVLLPDGVSLEPGLLTVVFEDDQQLLQRLFLLARVLADEPEMLTRMNQPPVN*
Ga0130016_1000406415 | 3300009868 | Wastewater | MPDNPSYIHKLEGILAEVRSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVEQRDLIRWLERTLSDPSTVSEQHRHERVISRIVELKAETAARAVRIVLPDPASSVDFPEGVSLQPGLLTIAFESEQQLLERLFLLARVLATQPQILTNL*
Ga0130016_105114472 | 3300009868 | Wastewater | MPDNPSYIHKLEGILAEARSPKPIPFFRRRDIEDLFGLKKRQAVNLMHRIGEISVSRELAVEQCDLIRWLEQMLSDPSVAIEQRRHERVISRIVELKAETAARAVRIVLPDPAPSVDFPEGVSLQPGLLTVSFEN
Ga0131077_108215131 | 3300009873 | Wastewater | MPNNPTYIHKFAGILAEARTPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKGDLIAWLEQMMEDPSVAIEQRRHERVIDRIVELKAESAARAIKIVLPDPKRTVQLPDGVALEPGLLTISFDNEEQLLERLFLLARVFVSQPQMLSAGTGPR*
Ga0116239_103202452 | 3300010346 | Anaerobic Digestor Sludge | MPDKPTYLHRLTSILAEARKPKPIPFLRRGDIEALFGLKRRQAINLMHRIGAIRVSREIAIPQRDLVAWLERMQADPATAREVRRQERAVGRIVDLKAEAAARAVKIVLPDSPPGADLPEGVSLQPGCLTVFFADGNELLERLFLLSRALATNPQMIGDLRPMGD*
Ga0134128_113855751 | 3300010373 | Terrestrial Soil | MPGYPSYIHKLEGILVEARSSKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSSELAVAQRDLVRWLERMAADPSVAVEQRRHESVIERIVELKAETAARAIKIVLPDRKPSGDLPDGVSLQPGLLSVSFDNEQQLLERLFLLARVLATQPQLLSAANPSR*
Ga0126381_1008106672 | 3300010376 | Tropical Forest Soil | MPDKPTYIHKVTAILAEARKPQPIPFFRRCDIEALFGLRRRQAINLMHAIGAVRVSNEIAVPQEDLVSWLEKMAVSPARIREVRRQERVIGRIVELKAETAARAVKIVLPDAVSSSDWPAGVSFRPGMLSISFETGEQLLERLFLLARAFAANPQLLSNLFKD*
Ga0136449_1002017494 | 3300010379 | Peatlands Soil | MPDKPSYIHRLTSILEEAKSPKPIPFFRRCDIEALFALKRRQAINLMHTIGAIRVSHEIAVPQKDLVSWLEEMALNPARTREIRRRERVIGRIVDLKAETAARAIKIVLPDSSPSADIPPGVSLQPGLLTVSFSNEQQLLERLFLLARLLAAKPHLISDLCKPQS*
Ga0126361_106939631 | 3300010876 | Boreal Forest Soil | VPDQPSYIHKLAAILREARSPKPIPFFRRRDVEALFGLKKRQAINLMHRIGAVLVSRELAVPQRDLVRWLERKIAEPSVSIEQNRHETVIHRIVELKAETAARAVKIVLPEGPPSVDLPAGVSLAPGLLTVSFENGQQLLEKLFLLARVLATKPHLLNS*
Ga0150985_1121957291 | 3300012212 | Avena Fatua Rhizosphere | MPDKPSYIHRLTSILEEARSPKPIPFFRRCDVEALFGLQRRQAINLMHEIGAVRVSQEIAVPQKDLVAWLEKKMVDPARAREIRRQERVIGRIVELKAETAARAVKIVLPDGTPSMDLPAGVSLHLGVLTISFDNEQQLLERLFLLARTFAANPQVLSSLPKR*
Ga0150985_1156876931 | 3300012212 | Avena Fatua Rhizosphere | SYIHKLQGILGEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGVIRVSRELAVDKRDLIAWLEQMIENPSVALEQRRHERVIDRIVELKAEAAARAVKIVLPDPKPAAELPEGVSLEPGLLTVSFGNEQQLLERLFLLARTLATQPDVLASVTVPGQVGQYL*
Ga0137366_103445192 | 3300012354 | Vadose Zone Soil | VPDNPTYIHKLASILVEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSSELALPQRDLVRWLEQMVSNPAVAMERRRHDTVIGRIVELKAETAARAVKIVLPEGPSVDLPDGVSLQPGLLTVSFDNEQQLLERLFLLARVLAAKPQVLSSLSFPH*
Ga0164298_105871431 | 3300012955 | Soil | MPDNPTYIHKLAEILTEARSPKPIPFFRRRDMEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRELVRWLERRISNPYVAVEQLRHEAVIGRIVELKAETAARAIKIVLPDRPPSVDLPEGVSLQPGLLTVSFDNEQELLERLFLLARVLATQPQLLSTLSLSR*
Ga0164298_115790501 | 3300012955 | Soil | PSYIHKLAEILVEARKPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSREIAVEKRDLLRWLEQMIEDPSAAAEWQRHEKVIDRIVELKDETAARALKIVLPDSKPSMELPDGVSLQPGLLTISFKNGQQLLEQLFLLARVLATQPQVLNSLSLPR*
Ga0164299_109837531 | 3300012958 | Soil | MPDNPTYIHKLAEILAEARSPKPIPFFRRRDMEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRELVRWLERRISDPQVAVEQRRHEAVIGRIVELKAETAARAVKIVLPDRAPSVDLPEGVSLQPGLLTVSFD
Ga0164302_103177062 | 3300012961 | Soil | MPDNPTYIHKLAEILAEARSPKPIPFFRRRDIEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRDLVRWLERRISNPYVAVEQLRHEAVIGRIVELKAETAARAVKILLPDPKPSVDLPDGVSLQPGRLSVSFCLQQDLF*
Ga0164309_116573201 | 3300012984 | Soil | MPDNPTYIHKLAEILAEARSPKPIPFFRRRDMEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRDLVRWLERRISDPYVAVEQLRHEAVIGRIVELKAETAARAIKIVLPDRPPSVDLPEGVSLQPGLLTVAF
Ga0164308_106983602 | 3300012985 | Soil | MPDNPTYIHKLAEILTEARSPKPIPFFRRRDMEALFGLKKRQAIHLMHRIGAVRVSRELAVEQRDLVRWLERRISDPYVAVEQLRHEAVIGRIVELKAETAARAVKIVLPDPKPSVELPDGVSLQPGLLTVSFDNEQQLLERLFLLARVLATQPQFLSSLSLPR*
Ga0164308_120926651 | 3300012985 | Soil | MPDHPSYIHKLAEILVEARKPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSREIAVEKRDLLRWLEQMIEDPSAAAEWQRHEKVIDRIVELKAETAARAIKIVLPDSKPSMELPDGVSLQPGLLTISFENDQQLLEQLFLLARVLATQPQVLN
Ga0164304_107646342 | 3300012986 | Soil | MPDHPSYIHKLAEILVEARKPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVEKRDLLRWLEQMIEDPSAAAEWQRHEKVIDRIVELKAETAARAIKIVLPDSKPSMELPDGVSLQPGLLTISFENDQQLLEQLFLLARVLATQPQVLNSLSLPR*
Ga0157374_117038411 | 3300013296 | Miscanthus Rhizosphere | KPIPFFRRRDIEALFGLRKRQAVNLMHQIGAVRVSRELAVDQRDLIRWLEQRMADPSVAAEHSRHETVIERIVELKAETAARAIKIVLPDRKPSVDLPDGVSLHPGLLTVSFENEQELLERLFLLARVFATNPHMLNSLGTHS*
Ga0157374_119889681 | 3300013296 | Miscanthus Rhizosphere | KLAEILAEACTPKPIPFFRRQDIEALFGLKKRQAINLMHQIGAVRVSRELAVEQRDLVHWLERMISDPSVAVEQRRHDAVIGRIVELKAETAARAIKIILPDGPSVDLPDGVSLRPGLLTVSFASEQQLLERLFLLARVLATEPQMLSSLSLPR*
Ga0163162_115467371 | 3300013306 | Switchgrass Rhizosphere | VPDKPTYLNRLTAILAEAKTPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVAANPARIREIGRQQRVIARIVELKAETAARAVKIVLPDGPPSPDLPAGVSLQPGLLSVAFDTEQQLLERLFLLARLFAADTQTLSKFRLP*
Ga0157372_103331244 | 3300013307 | Corn Rhizosphere | PDKPSYLHKLADILLEAKKLKPIPFFRRRDMEALFGLKRRQAVNLMHAIGAVRVSQEIAVAQEDLVVWLETMAASPERAQEVRRQERVIGRIVELKAETAARAVKIVLPNPIPVNNLPEGVSLRPGNLTVSFDSEQQLLERLFLLVRAFAANPLMLSELIDH*
Ga0181531_102514381 | 3300014169 | Bog | LPDNPTYIHKLAGILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVEQRELVRWLEQMISDPSVALEQRRHDAVIDRIVELKAETAARAVKIVLPDRKPSVDLPDGVSFQPGLLTVSFDSEQQLLERLFLLARVLATQPQLLSAVSPSQ*
Ga0163163_110787881 | 3300014325 | Switchgrass Rhizosphere | VPDKPTYLNRLTAILAEAKTPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVAANPARIREIGRQQRVIARIVELKAETAARAVKIVLPDGPSSPDLPAGVSLQPGLLSVAFDTEQQLLERLFLLARLFAADTQTLSKFRRP*
Ga0157379_115052932 | 3300014968 | Switchgrass Rhizosphere | AILAAAKTPQPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVAANPARIREIGRQQRVIARIVELKAETAARAVKIVLPDGPSSPDLPAGVSLQPGLLSVAFDTEQQLLERLFLLARLFAADTQTLSKFRRP*
Ga0187782_109017591 | 3300017975 | Tropical Peatland | MPDKPTYLHRLTSILKEAQTPKPIPFFRRCDIEALFGLKRRQAINLMHAIGAIRVSNEIAVPQKDLVAWLEMKALDPACTREIRRQERVVGRIVELKAETAARAVKIVLPNSPAPADLPAGVSLQPGLLSSSFTSEKELLERLFLLARAFANRPQLLNNLANP
Ga0187765_106087171 | 3300018060 | Tropical Peatland | RLCSSLPRHTAQLCSTADEIYNRSSDLLMPDKPSYLHRVTDILAEARTPKPIPFFRRSDIEALFGLKRRQAINVMHAIGAVRVSQEIAVPQEDLVAWLEKLAADPARVQEIRRQERVIGRIVELRAEAAARARKIVLPDPPPTPSGFPAGVSLQPGTLTISFASEQELLERLFLLVRAFAARPEALSNLDRR
Ga0190275_106497311 | 3300018432 | Soil | MPDNPTYIHKLAEILAAARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVAKRDLVAWLEQMIEDPSVAIELRRHKRVIDHIVELKAETAARAVKIVLPDPKPSVELPDGVSLQPGLLTISFENDQQLLERLFLLARVLAAQPQVLSSLSLPR
Ga0066662_105221931 | 3300018468 | Grasslands Soil | VPEHPSYIHQLAAILAEARSPKPLPFFRRRDIEALFGLQKRQAVNLMHRLGAVRVSRELAVDQRDLVRWLEQKLADPSARAECQRHAKVIDRIVELKAETAARAIKIVLPEGVSSTEIPAGVSFLPGLLMVAFRTPEELLERLFLLGRALATKPQLIETIPLPR
Ga0066662_122130661 | 3300018468 | Grasslands Soil | MPDQPSYVHKLADILAEARAPKPIPFFRRRDIEALFGLKKRQALYLMYRIGVVRVSRELAVEQRDLLRWLEQRIAEPAVVAEQQRHERVIGRIVELKAETAARTIRIVLPEQMVADWPEGVSLGPGLLTVSFDTEQQLLERL
Ga0193743_10326811 | 3300019889 | Soil | MPDNPSYIHKLEGILTEARAPKPIPFFRRRDIEALFGLQKRQAVNLIHRIGAIRVSRELAVDQRDLIAWLEQRIEDPSVANEQRRHERVIDRIVELKAETAARAVKVVLPDPKPLVELPDGVSLAPGLLTISFENEQQLLERLFLLARVLATQPQVLSSASLPR
Ga0196971_10209512 | 3300020152 | Soil | VKAVYSVRGFPVPEQPSYIHKLEGILAEARKPKPIPFFRRRDIESLFGLRKRQTVNLMHRIGAVRVSRELAVEQRDLVRWLEQMVEDPSVAAEWRRHQRVIGRIIDLKAETAARAVKIVLPDPERSVELPDGVSLRPGLLTISFNSEQQLLERLFLLARVLASQTDVLSGLSRSTSSQREADERDLRSHL
Ga0213881_100099732 | 3300021374 | Exposed Rock | LPDNPTYIHKLEGILEEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELALDQRDLIRWLERSISDPCAAIERRRHEAVIERIVEWKAETAARAVKIVLPDRKASADLPEGVSLEPGRLTVSFDNEQQLVERLFLLARVLATQPQLLSGAGLSR
Ga0213881_100678002 | 3300021374 | Exposed Rock | PDNPSYIHKVAGILAEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVEQPDLVQWLEELLSDPSAVAEWQRHEKVIKRIVDMKAETAARAIKIVLPEGAPANDIPAGVSLQPGLLTVAFSSQQELLERLFLLARAFATQPDLLNSIAVPH
Ga0213881_102848471 | 3300021374 | Exposed Rock | MPEHPTYVQKLASILAEARAPKPIPFFRRRDLEALFGLQKRQALYLMHRIGAVRVSRELAVEQRDLIRWLEERIADPSVVAEQQRHERVIDRIVELKAETAARAVKIVLPEQASAAEFPAGVSLQPGLLTVSFETEQQLLERLFLLARVWATQPQLLSSLSPLH
Ga0213881_103597331 | 3300021374 | Exposed Rock | MPDNPTYIHKLAEILAEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVEKRDLIRWLEQMIEDPSVVIEQRRHERVIQRIVELKAETAARAVKIVLPDPKPPVELPDGVSVQPGVLTITFDNDQQLLERLF
Ga0213876_100074523 | 3300021384 | Plant Roots | MCPYRAVEFKFYCTIVHKGHETAILRLGSSVPGNSSYVHKLDGILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVEQRDLVRWLEAMIADPSVALEQRRHASVIDRIVELKAETAARAVKIVLPEQVPSIDLPDGVSLQPGLLTVSFDTDQQLLERLFLLARVLATRPQLLSAVNPSR
Ga0213875_101367882 | 3300021388 | Plant Roots | LPDNPTYIHKLEGILEEARSPKPIPFFRRADIEALFGLRKRQAVNLMHRIGAIRVSRELAVDKRDLIRWLERTISDPSVAAEQRRHEAVIGRIVELKAETAARAVKIVLPDPKPLADLPDGVSLQPGVLTVSFANERQLVERLFLLARVMATQPQILGTLSLPR
Ga0213875_102626561 | 3300021388 | Plant Roots | RDIEALFGLKKRQAVNLMHRIGAVRVSRELAVEQPDLVQWLEELLSDPSAVAEWQRHEKVIKRIVDMKAETAARAIKIVLPEGAPANDIPAGVSLQPGLLTVAFSSQQELLERLFLLARAFATQPDLLNSIAVPH
Ga0213875_104594071 | 3300021388 | Plant Roots | DNPTYIHKLEGILEEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELALDQRDLIRWLERSISDPCAAIERRRHEAVIERIVEWKAETAARAVKIVLPDRKASADLPEGVSLEPGRLTVSFDNEQQLVERLFLLARVLATQPQLLSGAGLSR
Ga0210384_1000243320 | 3300021432 | Soil | MPDKPSYIHRLTSILEEAKTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQQDLVSWLEKRSLDPAREREIHRQELVIGRIVDLKAETAARAIKIVLPDPVPSADMPAGVSLEPGLLSVSFDSEQQLLERLFLLARAFAANPLLISRFARQ
Ga0187846_102023352 | 3300021476 | Biofilm | VIDSPWGLSQYLLEPTDKPTYIHRLTSILAEAKSPKPIPFFRRCDIEALFGVKRRQAINLMHKIGAVRVSNEIAIPQRDLISWLEEISLDPAGAREIRRQERVIDRIVDLKAETAARAIKIVLPDSARLADFPEGVSLQPGLLTVSFESEQQLLERLFLLARVFATKPQLLKGLEIPR
Ga0226835_10506661 | 3300021604 | Anaerobic Bioreactor Biomass | GILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVEQRDLIRWLEQMLSDPSVAIEQRRHERVISRIVELKAETAARAVRIVLPDPAPSVDFPEGVSLQPGLLTISFENEQQLLQRLFLLARVLATQPQILTNL
Ga0193742_10704642 | 3300021976 | Soil | MPDNPSYIHKLEGILTEARAPKPIPFFRRRDIEALFGLQKRQAVNLIHRIGAIRVSRELAVDKRDLIAWLEQMIEDPSVANEQRRHERVIDRIVELKAETAARAVKVVLPDPKPLVELPDGVSLAPGLLTISFENEQQLLERLFLLARVLATQPQVLSSASLPR
Ga0247760_12105071 | 3300023272 | Plant Litter | SMPDHPSYIHKLAGILAEARVPKPIPFFRRRDIEALFGLKKRQAVNLMHQIGAIRVSSELAVDKRDLIRWLEQRIEDPSVAIEHSRHERVIDRIVELKAETAARAIKIVLPDPKPSVELPDGVSLQPGLLTISFESDQQLLERLFLLARVLATQPQVLSAVSLPR
Ga0208589_10487422 | 3300025634 | Arctic Peat Soil | MPDNPTYIHKLAEILAEAHTPKPIPFFRRRDIEALFGLKKRQAINLMHRIGAVHVSRELAVEQRDLVHWLERMISDPSVAVEQRRHDAVIGRIVELKAETAARAIKIILPDRPSVDLPDGVSLRPGLLTVSFASEQQLLERLFLLARVLATQPQMLSSVSLPR
Ga0207695_1000076544 | 3300025913 | Corn Rhizosphere | MYYYRVLSTDVLHSCAIPPTKCVILAIGASMPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP
Ga0207694_104334911 | 3300025924 | Corn Rhizosphere | MPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQII
Ga0207712_1000400310 | 3300025961 | Switchgrass Rhizosphere | MPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP
Ga0207658_110911681 | 3300025986 | Switchgrass Rhizosphere | LEKRALIAGRLEAPFLEIHTNEFGCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVELKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQI
Ga0208043_10934821 | 3300027570 | Peatlands Soil | VPSQPTYIHRLRAILAEARPARPIPFFRRRDIQALFGLQKRQAINLMHRIGAVRVSRELALRQPDLVGWIERRISEPSVAIEWRRHETVIGRIVELKAETAARAVRIVLPDRPPSVDLPPGVSLAPGLLTVSFENGPELLEKLFLLARVLATQPHLLDR
Ga0207826_12014981 | 3300027680 | Tropical Forest Soil | MPDKPKYIQKLAAILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSSEIAVTQRDLVSWLERMASSPSTAQELRRHERVVGRIVELKAETAARAIKIVLPDEAPAAEIPAGVSLEPGLLTVSFANEQQLLERLFLLARAF
Ga0207862_10055824 | 3300027703 | Tropical Forest Soil | MPDKPKYIQKLAAILSEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSSEIAVTQRDLVSWLERMASSPSTAQELRRHERVVGRIVELKAETAARAIKIVLPDEAPAAEIPAGVSLEPGLLTVSFANEQQLLERLFLLARAFATKPQLLNNLRMPH
Ga0209517_100082187 | 3300027854 | Peatlands Soil | VPDKPSYIHKIKSILTEAKSPKPIPFFRRRDIEALFGLKRRQAINLMHRIGAVRVSCEIAIPQRDLVSWLEEIGSNPAGAREIRRQERVIGRIVDLKAETAARAVKIVLPESLPAADIPVGVSLQPGVLTVSFSNEQQLLERLFLLARALATKPQLISNLCTPQQ
Ga0209486_104793422 | 3300027886 | Agricultural Soil | KLDGILAEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELAVDQRDLIRWLERMIANPSVAAEWHRHERVIGRIVELKAETAARAVKIVLPDRGLSVELPDGVSLQPGELTVAFDNEQQLLERLFLLARALATNPDMLTDSCVSPYGTSRDTSVVAITR
Ga0209415_106675912 | 3300027905 | Peatlands Soil | MPDKPSYIHRLTSILEEAKSPKPIPFFRRCDIEALFALKRRQAINLMHTIGAIRVSHEIAVPQKDLVSWLEEMALNPARTREIRRRERVIGRIVDLKAETAARAIKIVLPDSSPSADIPPGVSLQPGLLTVS
Ga0209698_111334831 | 3300027911 | Watersheds | MPDKPTYLHKLTSILAEAKTPKPIPFFRRSDVEALFGLKRRQAINLMHEIGAVRVSNEIAVPQDDLVAWVEKKALDPARSREIRRQERVIGRIVELKAETAARAVKIVFRDPAPSIDLPAGVSFLPGMLTITFN
Ga0209168_100098343 | 3300027986 | Surface Soil | MPDNPTYLHRLPSILAEAKSPKPIPFFRRCDVEALFGLKRRQAINLMHRIGAVRVSQEIAIPQRELVSWLEQMVSNPATSHEIRRQERVIGRIVDLKAETAARAVKITLPDSAPSADLPAGVSLQPGVLTVTFRDEQELLEQLFLLARLLATKPQLISNLWR
Ga0268264_103153712 | 3300028381 | Switchgrass Rhizosphere | MYYYRVLSTDVLHSCAIPPTKCVILAIGASMPDKPSYIHRLTSILTEARTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQEDLVAWLEKKVLDPARTREILRQERVIGRIVDLKAETAARAVKIVLPNPVPSGDLPAGVSLQPGILSVSFNSEEQLLERLFLLARSFAVNPQIISNLARP
Ga0222749_100293334 | 3300029636 | Soil | MPDKPSYIHLLTSILEEVKTPKPIPFFRRCDIEALFGLKRRQAINLMHEIGAVRVSNEIALPQQDLVSWLEKRSLDPAREREIQRQERVIGRIVDLKAETAARAIKIVLPDPVPSADMPAGVSLEPGLLSVSFDSEQQLLERLFLLARAFAANPLLISRFARQ
Ga0310039_102738831 | 3300030706 | Peatlands Soil | YNCLQVSSVPDKPSYIHKIKSILTEAKSPKPIPFFRRRDIEALFGLKRRQAINLMHRIGAVRVSCEIAIPQRDLVSWLEEIGSNPAGAREIRRQERVIGRIVDLKAETAARAVKIVLPESLPAADIPVGVSLQPGVLTVSFSNEQQLLERLFLLARALATKPQLISNLCTPQQ
Ga0302323_1022276741 | 3300031232 | Fen | VPDKPSYIHRLTSILAEAKTPKPIPFLRRCDIEALFGLKRRQAINLMHEIGAVRVSREIAVPQEDLVAWLEKRILDPVCTREIRRQERVIGRIVELKAETAARAVKIILPESAPAADLPTGVSLQPGLLTIIFCDEQQLLQRLFLLARL
Ga0307408_1016967921 | 3300031548 | Rhizosphere | FFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLVRWLEQMLSDPSVVAEWRRHNRVVDRIIELKAETAARAVKIVLPDPTPSLELPDGVSLQPGLLTISFENDRQLLERLFLLARVLATQPQVLSSLSLPDSRSDMARLGQGPNEPH
Ga0307413_101507953 | 3300031824 | Rhizosphere | MPDNPSYIHKLEGILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLVRWLEQMLSDPSVVAEWHRHNRVVDRIIELKAETAARAVKIVLPDPTPSLELPDGVSLQPGLLTISFENDRQLLERLFLLARVLATQPQVLSSLSLPDSRSDMARLGQGPNEPH
Ga0307410_112612221 | 3300031852 | Rhizosphere | MPDNPSYIHKLEGILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLVRWLEQMLSDPSVVAEWHRHNRVVDRIIELKAETAARAVKIVLPDPTPSLELPDGVSLQPGLLTISFENDRQLLERLFLLARVLATQPQVLSSLSLPDSRSDMARL
Ga0307406_113470901 | 3300031901 | Rhizosphere | NPSYIHKLEGILAEARSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLVRWLEQMLSDPSVVAEWHRHNRVVDRIIELKAETAARAVKIVLPDPTPSLELPDGVSLQPGLLTISFENDRQLLERLFLLARVLATQPQVLSSLSLPDSRSDMARLGQGPNEPH
Ga0308175_1000321781 | 3300031938 | Soil | LPENPTYIHKLAGILTEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELALEQRELVRWLEQMICDPSVETEQRRHERVIGRIVELKAETAARAIKITLPEAAPSMGLPAGVWLEPGLLTVSFDDEQQLLERLFLLARVLATEPQVLSNLSVSCQEPR
Ga0308175_1007222413 | 3300031938 | Soil | VPDKPTYLNRLTAILAEAKTPKPIPFFRRGDIEALFGLKRRQAINLMHAIGAIRVSQEIAVRQKDLVIWLEKVSANPGSIREIGRQERVIERIVEMKAETAARAEKIVHPDGPPAPDLPAGVSLQPGLLTVAFDTEQQLLERLFLLTRLFAADTQTLSKLRRP
Ga0308176_100964582 | 3300031996 | Soil | MISDGDILSSRFPLPENPTYIHKLAGILTEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSRELALEQRELVRWLEQMICDPSVETEQRRHERVIGRIVELKAETAARAIKITLPEAAPSMGLPAGVWLEPGLLTVSFDDEQQLLERLFLLARVLATEPQVLSNLSVSCQEP
Ga0308173_111608901 | 3300032074 | Soil | MPENPSYIHKLEGILGEARAPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAIRVSRELAVDKRDLIAWLEQMIENPSVAVEQRRHERVIDRIVELKAEAAARAVKIVLPDPKPAAELPEGVSLEPGLLTVSFGNEQQLLERLFLLARTLATQPHVLASVTVPGQVGQYL
Ga0308173_121025261 | 3300032074 | Soil | RLTAILAEAKSPKPIPFFRRCDVEALFGLRRRQAINLMHEIGAVRVSQEIAVPQEDLVAWLEKKVVDPARAREIRRQERVIGRIIELKAETAARAVKIVLPDGNPSLALPAGVSLHPGLLTISFNNEQQLLERLFLLARTFAANPQMLSSLPKR
Ga0311301_107424082 | 3300032160 | Peatlands Soil | MPDKPSYIHRLTSILEEAKSPKPIPFFRRCDIEALFALKRRQAINLMHTIGAIRVSHEIAVPQKDLVSWLEEMALNPARTREIRRRERVIGRIVDLKAETAARAIKIVLPDSSPSADIPPGVSLQPGLLTVSFSNEQQLLERLFLLARLLAAKPHLISDLCKPQS
Ga0335078_100109766 | 3300032805 | Soil | MTPKPIPFFRRRDIEALFGLKRRQAINLMHAVGAVRVSSEIAVLQLDLVTWLEKMVRNPAREREIRRRERVIGRIVELKAEAAARAVKIVLPDPLPAPSDFPPGVSLQPGILTISFSCEEQLLERLFLLARAFAAKPELLSTWRGGKGAGGSA
Ga0335078_101773164 | 3300032805 | Soil | MPHHPSYIHKLDGILAESRRAKPIPFFRRGDIEALFGLRKRQAVNLMHRIGAIRVSRELAVAQRDLVRWLERAIAEPARAIEVRRHAAAIGRIVELKAETAARAVKIVLPDPAPASGLPHGVSLEPGRLTIVFDDEQQLLERLFLLARRLASEPQWLSRRLPR
Ga0335078_110969911 | 3300032805 | Soil | VPDNPTYIHKLDGILAEARSPKPIPFFRRRDIQALFGLKKRQAVNLMPRIGAVRVSRELALDQHDLVRWLEQRIAEPSVAREWRRHETVIDRIVELKAETAARAVKIALPERAPSVELPAGVSLQPGLLTVSFDHQQQLLERLFLLARVLATQPEFLSSISSSR
Ga0335078_111398662 | 3300032805 | Soil | DKPSYLHRVTEILAEAKTPKPIPFFRRCDIEALFGLKRRQAINLMHAVGAVRVSNEIAVPQQDLVSWLEKMVLSPVRVQEIGRRERVIGRIIELKAETAARAVKIVLPDPPLPPGFPPGVSLQPGTLTISFDTQMELLERLFLLARLFAAKPQLLSTLDKR
Ga0335069_119456112 | 3300032893 | Soil | SSMPDHPTYIHKLEGILAEACSPKPIPFFRRRDIEALFGLKKRQAVNLMHRIGAVRVSSELAVEQRDLVRWLERMISDPSVAVEQRRQEAVIDRIVELKAETAARAIKIVLPDRAPCVALPDGVSLRPGLLTVAFDNEQQLLERLFLLARVLATQPQFLRSLSLLR
Ga0335071_101078903 | 3300032897 | Soil | LPDNPTYIHNLAGILAEARSPKPIPFFRRRDIEALFGLRKRQAINLMHRIGAIRVSRELAVEKRDLIRWLERMISDPSVAAEQRRHDTVIDRIVEFKAETAARAIKIVLPERKPSVELPDGVSLQPGLLTVSFESEEQLIERLFLLARVLATQAQVLFKR
Ga0335071_107773142 | 3300032897 | Soil | MPDKPSYLHRITEILAEAKTPKPIPFFRRCDIEALFGLQRRQAINLMHRIGAVRVSNEIAIPQQDLVSWLETAALDPARMREIRRQKRVIGRIVELKAETAARAVKIVLPDPPPAPVDLPAGVSLQPGTLTISFNSEQQLLERLFLLARAFAANPQLLSNLYKG
Ga0335072_105060802 | 3300032898 | Soil | VPDNPSYIHKLEGILAEARAPKPIPFFRRRDIESLFGLRKRQAVNLMHQIGAIRVSRELAVEQCRLVRWLEQMISDPSTAAEWQRQETVINRIVELKAETAARAIKIVLPDPATPIEIPPGVSLQPGLLTVSFESEHQLLERLFLLARVFATSPQVLSSLSIPHQGQHA
Ga0316628_1006797581 | 3300033513 | Soil | MPDKPTYIHRIPAILVEARSPRPIPFFRRCDIEALFGLKRRQALKLMHRIGAIRVSNEIALDQRDLIDWLERMAEGPTVARESRRRERVIGRIVELKTETAARAVKIVLPDPKPSAEIPDGVSLEPGVLTVTFDHEQQLLERLFQLARLLANKPHLISDLQRQR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice