NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F089069


Overview

Basic Information
Family ID F089069
Family Type Metagenome / Metatranscriptome
Number of Sequences 109
Average Sequence Length 47 residues
Representative Sequence MPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKL
Number of Associated Samples 76
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 45.37 %
% of genes near scaffold ends (potentially truncated) 55.96 %
% of genes from short scaffolds (< 2000 bps) 97.25 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (75.229 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (41.284 % of family members)
Environment Ontology (ENVO) Unclassified (30.275 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (48.624 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 48.61%, β-sheet: 0.00%, Coil/Unstructured: 51.39%

Predicted 3D Structure

Structure Viewer (PDBe Molstar)
Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
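The cutoff in the note above can be written as a one-line check. This is a minimal sketch: the 0.7 threshold and the 0.47 score come from this page, while the function and constant names are hypothetical and not part of NMPFamsDB.

```python
# Cutoff quoted in the note above: models with pTM < 0.7 are treated as low confidence.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_LOW_CONFIDENCE_CUTOFF) -> bool:
    """Return True when a predicted model falls below the pTM cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F089069 on this page.
family_ptm = 0.47
print(is_low_confidence(family_ptm))
```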



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF14235 | DUF4337 | 4.59
PF01068 | DNA_ligase_A_M | 2.75
PF13545 | HTH_Crp_2 | 1.83
PF05532 | CsbD | 1.83
PF03631 | Virul_fac_BrkB | 1.83
PF02518 | HATPase_c | 0.92
PF13649 | Methyltransf_25 | 0.92
PF13419 | HAD_2 | 0.92
PF00856 | SET | 0.92
PF00872 | Transposase_mut | 0.92
PF02735 | Ku | 0.92
PF04392 | ABC_sub_bind | 0.92
PF13565 | HTH_32 | 0.92
PF00239 | Resolvase | 0.92
PF00072 | Response_reg | 0.92
PF09339 | HTH_IclR | 0.92
PF03401 | TctC | 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 2.75
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 2.75
COG1295 | Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase) | Function unknown [S] | 1.83
COG3237 | Uncharacterized conserved protein YjbJ, UPF0337 family | Function unknown [S] | 1.83
COG1273 | Non-homologous end joining protein Ku, dsDNA break repair | Replication, recombination and repair [L] | 0.92
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.92
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.92
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.92
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 0.92
COG3328 | Transposase (or an inactivated derivative) | Mobilome: prophages, transposons [X] | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 75.23 %
All Organisms | root | All Organisms | 24.77 %

Associated Scaffolds


Scaffold | Taxonomy | Length
2088090002|CNA_F7QKVOU01AQZEE | Not Available | 509
3300000956|JGI10216J12902_118069236 | Not Available | 908
3300004114|Ga0062593_101944024 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 651
3300005329|Ga0070683_100359796 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1386
3300005338|Ga0068868_100196457 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1680
3300005339|Ga0070660_101115034 | Not Available | 668
3300005347|Ga0070668_102240422 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. C-145 | 505
3300005455|Ga0070663_101642848 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 573
3300005564|Ga0070664_100476085 | Not Available | 1149
3300006237|Ga0097621_101699267 | Not Available | 601
3300006791|Ga0066653_10384497 | Not Available | 716
3300009094|Ga0111539_10271560 | All Organisms → cellular organisms → Bacteria | 1974
3300009098|Ga0105245_11222360 | Not Available | 799
3300009100|Ga0075418_11153531 | Not Available | 839
3300009147|Ga0114129_10088825 | All Organisms → cellular organisms → Bacteria | 4284
3300009147|Ga0114129_10969601 | Not Available | 1073
3300009174|Ga0105241_10771639 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. C-145 | 883
3300009176|Ga0105242_11484800 | Not Available | 708
3300009553|Ga0105249_11813887 | Not Available | 683
3300009553|Ga0105249_13470568 | Not Available | 507
3300010371|Ga0134125_10985268 | Not Available | 925
3300010399|Ga0134127_11366793 | Not Available | 779
3300011003|Ga0138514_100094962 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 643
3300012212|Ga0150985_102840033 | Not Available | 551
3300012212|Ga0150985_104277738 | Not Available | 886
3300012212|Ga0150985_105700024 | Not Available | 765
3300012212|Ga0150985_105725759 | Not Available | 639
3300012212|Ga0150985_107066612 | Not Available | 776
3300012212|Ga0150985_108543417 | Not Available | 668
3300012212|Ga0150985_110562906 | Not Available | 919
3300012212|Ga0150985_111136145 | Not Available | 1037
3300012212|Ga0150985_123038872 | Not Available | 742
3300012469|Ga0150984_101725658 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici | 1098
3300012469|Ga0150984_108944536 | Not Available | 567
3300012469|Ga0150984_110417873 | All Organisms → cellular organisms → Bacteria | 530
3300012469|Ga0150984_111659467 | Not Available | 772
3300012469|Ga0150984_113613387 | Not Available | 1228
3300012469|Ga0150984_114220268 | Not Available | 658
3300012469|Ga0150984_119414764 | Not Available | 601
3300012939|Ga0162650_100022202 | Not Available | 921
3300012951|Ga0164300_10157025 | Not Available | 1072
3300012951|Ga0164300_10260171 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 886
3300012955|Ga0164298_10825168 | Not Available | 666
3300012955|Ga0164298_11672896 | Not Available | 504
3300012958|Ga0164299_10688633 | Not Available | 713
3300012960|Ga0164301_11612063 | Not Available | 539
3300012961|Ga0164302_10085509 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1695
3300012961|Ga0164302_10691699 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 754
3300012961|Ga0164302_10764476 | Not Available | 724
3300012985|Ga0164308_12261515 | Not Available | 506
3300012987|Ga0164307_11878179 | Not Available | 508
3300012989|Ga0164305_11114308 | Not Available | 679
3300013306|Ga0163162_11010530 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. C-145 | 940
3300013306|Ga0163162_11334697 | Not Available | 815
3300013306|Ga0163162_11945153 | All Organisms → cellular organisms → Bacteria → PVC group → Chlamydiae | 673
3300014326|Ga0157380_12138276 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 623
3300014745|Ga0157377_10214905 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1228
3300015372|Ga0132256_102265312 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Oxalobacteraceae → unclassified Oxalobacteraceae → Oxalobacteraceae bacterium | 647
3300015373|Ga0132257_100761675 | Not Available | 1206
3300015374|Ga0132255_102116032 | Not Available | 857
3300017965|Ga0190266_10717283 | Not Available | 627
3300018054|Ga0184621_10205780 | Not Available | 705
3300018054|Ga0184621_10211533 | Not Available | 694
3300018061|Ga0184619_10026944 | All Organisms → cellular organisms → Bacteria | 2398
3300018067|Ga0184611_1142316 | Not Available | 847
3300018469|Ga0190270_10685293 | Not Available | 1014
3300018469|Ga0190270_10876574 | Not Available | 913
3300018469|Ga0190270_13080859 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 527
3300019269|Ga0184644_1114009 | Not Available | 686
3300019867|Ga0193704_1034800 | Not Available | 1011
3300019886|Ga0193727_1169164 | Not Available | 574
3300020004|Ga0193755_1151545 | Not Available | 703
3300021082|Ga0210380_10374915 | Not Available | 650
3300022756|Ga0222622_10935018 | Not Available | 636
3300025919|Ga0207657_10934212 | Not Available | 667
3300025972|Ga0207668_10895956 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. C-145 | 789
3300026023|Ga0207677_11677799 | Not Available | 589
3300026067|Ga0207678_11410618 | Not Available | 616
3300027873|Ga0209814_10325780 | Not Available | 670
3300027903|Ga0209488_10558602 | Not Available | 834
3300028380|Ga0268265_11041320 | Not Available | 810
3300028711|Ga0307293_10063041 | Not Available | 1160
3300028717|Ga0307298_10037746 | Not Available | 1301
3300028717|Ga0307298_10138966 | Not Available | 702
3300028717|Ga0307298_10232857 | Not Available | 545
3300028720|Ga0307317_10026361 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1797
3300028721|Ga0307315_10042779 | Not Available | 1243
3300028754|Ga0307297_10386217 | Not Available | 516
3300028755|Ga0307316_10040765 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1541
3300028768|Ga0307280_10339072 | Not Available | 553
3300028784|Ga0307282_10109113 | Not Available | 1290
3300028793|Ga0307299_10046552 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1595
3300028796|Ga0307287_10407581 | Not Available | 511
3300028807|Ga0307305_10100989 | Not Available | 1333
3300028810|Ga0307294_10370574 | Not Available | 533
3300028810|Ga0307294_10422648 | Not Available | 504
3300028875|Ga0307289_10074529 | Not Available | 1372
3300028875|Ga0307289_10122410 | All Organisms → cellular organisms → Bacteria | 1066
3300028885|Ga0307304_10342662 | Not Available | 667
3300028885|Ga0307304_10418666 | Not Available | 607
3300028885|Ga0307304_10573131 | Not Available | 521
3300030902|Ga0308202_1058901 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 721
3300030988|Ga0308183_1120836 | Not Available | 618
3300031091|Ga0308201_10126600 | Not Available | 772
3300031092|Ga0308204_10032677 | Not Available | 1165
3300031092|Ga0308204_10116243 | Not Available | 756
3300031092|Ga0308204_10132518 | Not Available | 722
3300031740|Ga0307468_102469155 | Not Available | 508
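Each scaffold identifier in the table above has the form `<number>|<scaffold name>`, and the numeric prefix matches a Taxon OID listed under Associated Samples. The sketch below splits an identifier into those two parts so scaffolds can be joined to their samples. The function and type names are hypothetical, and the all-digits check on the prefix is an assumption based only on the rows shown on this page.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric prefix, matches a Taxon OID in Associated Samples
    scaffold_id: str  # scaffold name within that sample


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts."""
    taxon_oid, sep, scaffold_id = identifier.strip().partition("|")
    # Assumption: the prefix is always all digits, as in every row shown on this page.
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold("3300005329|Ga0070683_100359796")
print(ref.taxon_oid, ref.scaffold_id)
```

Keeping the Taxon OID as a string avoids any risk of reformatting it when it is used as a lookup key.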




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 41.28%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 8.26%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 6.42%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.59%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 4.59%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.67%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.75%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.83%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 1.83%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.83%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.92%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.92%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.92%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.92%
Quercus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Quercus Rhizosphere | 0.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.92%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.92%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.92%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.92%



Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090002 | Quercus rhizosphere microbial communities from Sierra Nevada National Park, Granada, Spain - CNA | Host-Associated
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300005329 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005339 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG | Host-Associated
3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated
3300005455 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG | Host-Associated
3300005564 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG | Host-Associated
3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated
3300006237 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 (version 2) | Host-Associated
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011003 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015 | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012939 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t1i015 | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental
3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300014326 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaG | Host-Associated
3300014745 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaG | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300017965 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 T | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018067 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_coex | Environmental
3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental
3300019269 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5 (Metagenome Metatranscriptome) | Environmental
3300019867 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3m1 | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300021082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300025919 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes) | Host-Associated
3300025972 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes) | Host-Associated
3300026023 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes) | Host-Associated
3300026067 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG (SPAdes) | Host-Associated
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028711 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150 | Environmental
3300028717 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158 | Environmental
3300028720 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_357 | Environmental
3300028721 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_355 | Environmental
3300028754 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_157 | Environmental
3300028755 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_356 | Environmental
3300028768 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119 | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental
3300028796 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028810 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_151 | Environmental
3300028875 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143 | Environmental
3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental
3300030902 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_356 (Metagenome Metatranscriptome) | Environmental
3300030988 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_157 (Metagenome Metatranscriptome) | Environmental
3300031091 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_355 (Metagenome Metatranscriptome) | Environmental
3300031092 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome) | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
CNA_00643210 | 2088090002 | Quercus Rhizosphere | LQAIHRYVIDTPALKTVTEEIRAVVETVWPELISKLPPKP
JGI10216J12902_1180692361 | 3300000956 | Soil | PFMPAELQMIHRYVVETPVLETVTDEIRTVVETLWPELIAKLPPRTPVQD*
Ga0062593_1019440242 | 3300004114 | Soil | RVIDGKLVMPSELEAIHRHVVETPVLKVVTEEIREVVETVWPELISKLPPKP*
Ga0070683_1003597961 | 3300005329 | Corn Rhizosphere | MPAELQAIHRYVVETPVLKVVTEEIRAVVETVWPELISKLPPKPRRDAAGGG*
Ga0068868_1001964573 | 3300005338 | Miscanthus Rhizosphere | MPAELQAIHRYVVETPVLKVVTEEIRAVVETVWPELISKLPPKPRPDATGGG*
Ga0070660_1011150341 | 3300005339 | Corn Rhizosphere | MPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKT*
Ga0070668_1022404222 | 3300005347 | Switchgrass Rhizosphere | IRRYVVETPVLNAVTAEIREVVESVWPELISKLPPRFDRGERN*
Ga0070663_1016428482 | 3300005455 | Corn Rhizosphere | FPYEPRVVGGKLLMPAELQAIRRYVVETPVLNAVTAEIREVVESVWPELISKLPPRFDRGERN*
Ga0070664_1004760852 | 3300005564 | Corn Rhizosphere | MSAELEAIHRYVTETPVLEAVTDEIRAVVETVWPELISKLPPKF*
Ga0068858_1019381441 | 3300005842 | Switchgrass Rhizosphere | YEPRVIGGKMTMPAELQAIHRYVVETPVLENVTEEIRAVVEMVWP*
Ga0097621_1016992673 | 3300006237 | Miscanthus Rhizosphere | LQAIHRYVVETPVLKAVTEEIRAVVETVWPELMSKLPPKPRPDATGGG*
Ga0066653_103844971 | 3300006791 | Soil | MPAELQLIHRYLVETPVLEAVTDEIRAVVETVWPELISKLPPKF*
Ga0111539_102715604 | 3300009094 | Populus Rhizosphere | MPAELQAIHRYVVETPVLEDVTEEIRAAVETVWPELISKLPPRP*
Ga0105245_112223601 | 3300009098 | Miscanthus Rhizosphere | MPAELQTIHRYVVETPVLEALTDEIRAVVETVWLELISKLPPKT*
Ga0075418_111535312 | 3300009100 | Populus Rhizosphere | STPRVVGGKLITSAELHVIHRYVVETPILEEVTEEIRAVVETVWPELISKLPPKS*
Ga0114129_100888256 | 3300009147 | Populus Rhizosphere | MPAELQAIHRYVVETPVLETVTDEIRAVVETVWPELISK
Ga0114129_109696011 | 3300009147 | Populus Rhizosphere | RVVGGKLITSAELHVIHRYVVETPILEEVTEEIRAVVETVWPELISKLPPKS*
Ga0105241_107716391 | 3300009174 | Corn Rhizosphere | VGGKLLMPAELQAIRRYVVETPVLNAVTAEIREVVESVWPELISKLPPRFDRGERN*
Ga0105242_114848002 | 3300009176 | Miscanthus Rhizosphere | MPAELQTIHRYVVETPVLEALTDEIRAVVETVWPELISKLPPKT*
Ga0105249_118138872 | 3300009553 | Switchgrass Rhizosphere | MTPAELQAIHRYVVATSILEEVTEEIRAVVETVWPELISKLPPKP*
Ga0105249_134705681 | 3300009553 | Switchgrass Rhizosphere | QAIHVYVVETPVLEDVTEEIRAAVETFWPELKSKLPPRAG*
Ga0134125_109852681 | 3300010371 | Terrestrial Soil | MPAELQAIRRYVVETPVLNAVTEEIRAVVETVWPELKSKLPPTP*
Ga0134127_113667933 | 3300010399 | Terrestrial Soil | MPAELQTIHRYVVETPVLEALTDEIRAVVETVWPELISKLPPKI*
Ga0138514_1000949622 | 3300011003 | Soil | MMMPAELQAIHRYVLETPVLENVTEEIRAVIETVWPELLSKLPPKA*
Ga0150985_1028400331 | 3300012212 | Avena Fatua Rhizosphere | KLLMPAELQAIHRYVIDTPALKTVTEEMRAVVETVWPELRSKLPPKP*
Ga0150985_1042777382 | 3300012212 | Avena Fatua Rhizosphere | MPAELQVIHRYVIETPVLKAVTEELREVVETVWPELISKLPPKP*
Ga0150985_1057000242 | 3300012212 | Avena Fatua Rhizosphere | MPAELQAIHRYVVETPVLKTVTEEMRAVVETVWPELASKLPPKP*
Ga0150985_1057257591 | 3300012212 | Avena Fatua Rhizosphere | IHRYLVETPVLETVTDEIRAAVETIWPELISKLPPRL*
Ga0150985_1070666121 | 3300012212 | Avena Fatua Rhizosphere | MPAELQAIHRYVVETPVLEAVTDEIRAVVETLWPELISKLPPKS*
Ga0150985_1085434174 | 3300012212 | Avena Fatua Rhizosphere | MPAELQAIHRYVVETRVLEVVTDEVRAAVETIWPELISKLPPKL*
Ga0150985_1105629061 | 3300012212 | Avena Fatua Rhizosphere | MPAELQAIHRYVTETPVLEAVTEDIRAVVETVWPELISKLPPKL*
Ga0150985_1111361452 | 3300012212 | Avena Fatua Rhizosphere | MPAELQAIHRYVIDTPLLKTVTEEIRAVVETVLA*
Ga0150985_1230388722 | 3300012212 | Avena Fatua Rhizosphere | MPAELQTIHRYVVETPVLEAVTDEIRAVVETVWPELIAKLPPKI*
Ga0150984_1017256582 | 3300012469 | Avena Fatua Rhizosphere | RQRFPYEPRVVGGKLLMPAELQAIHRYVVETPVLKTVTEEIREVVETVWPELISKLPPRS
Ga0150984_1089445362 | 3300012469 | Avena Fatua Rhizosphere | QAIHRYVVETPVLEVVTDEIRAVVETVWPELKSKLPPKP*
Ga0150984_1104178731 | 3300012469 | Avena Fatua Rhizosphere | HRYVVETPVLEVVTDEIRAVVETVWPELKSKLPPKP*
Ga0150984_1116594671 | 3300012469 | Avena Fatua Rhizosphere | MPAELQAIHRYVIDTPLLKTVTEEIRAVVETVWPELISKLPPKPRRDAAGGG*
Ga0150984_1136133874 | 3300012469 | Avena Fatua Rhizosphere | RFPYEPRVVGGKLFMPAELQVIHRYVVETPDLKVVTEELRAVVETVWPELISKLPPKS*
Ga0150984_1142202681 | 3300012469 | Avena Fatua Rhizosphere | PRVVGGRLLMPAELQAIRRYVVETPVLKTVTEEIRAVVETVWPELRSKLPPMS*
Ga0150984_1194147642 | 3300012469 | Avena Fatua Rhizosphere | MPAELQAIRRYVVETPVLNAVTEEIRAVVETVWPELISKLPPKLRDVP*
Ga0162650_1000222021 | 3300012939 | Soil | MIHRYVVETPVLDSVTEELREVVEAVWPEMIAKLPPKT*
Ga0164300_101570252 | 3300012951 | Soil | MPAELQAIYRYVVETPVLKTVTEEIRTVVETVWPELRSKLPPKP*
Ga0164300_102601711 | 3300012951 | Soil | MPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKL*
Ga0164298_108251681 | 3300012955 | Soil | MPAELQAIHRYVIETPVLEEVTEEIRAVVETVWPELLSKLPPKP*
Ga0164298_116728962 | 3300012955 | Soil | MPAELQAIHRYVVETLVLKTVTEEIRAVVETVWPELISKLPPKPRPDATGGG*
Ga0164299_106886331 | 3300012958 | Soil | MPAELQAIHRYVVETPVFEEITEEIRAVVETVWPELLSKLPPKP*
Ga0164301_116120632 | 3300012960 | Soil | VGGKLLMPAELQAIHRYVVETPVLENVTEEIRAVVHAVWPEFISKLPPKS*
Ga0164302_100855094 | 3300012961 | Soil | FPYEPRVIGGKLLMPAELQAIHRYVIDTPALKTVTEEIRAVVETVWPELISKLPPKP*
Ga0164302_106916993 | 3300012961 | Soil | MPAELQAIHRYVVETPVLEEVTEEIRAVVETVWPELLSKLPPKR*
Ga0164302_107644762 | 3300012961 | Soil | MPAELQAIYRYVVETPVLKTVTEEIRAVVETVWPELRSKLPPKP*
Ga0164308_122615151 | 3300012985 | Soil | AIRRYVVETPVLAVTEEIRAVVETVWPELISKLPPKLRDVP*
Ga0164307_118781791 | 3300012987 | Soil | MPAELQAIHRYVIETPVLEEVTEEIRAVVETVWPELLSKLPPKR*
Ga0164305_111143082 | 3300012989 | Soil | MPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPEL*
Ga0163162_110105302 | 3300013306 | Switchgrass Rhizosphere | MPAELQAIRRYVVETPVLNAVTAEIREVVESVWPELISKLPPRFDRGERN*
Ga0163162_113346971 | 3300013306 | Switchgrass Rhizosphere | RQRFPYEPRVVGGRLLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELRSKLPPKP
Ga0163162_119451532 | 3300013306 | Switchgrass Rhizosphere | PYESRVVGGRLLMPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKL*
Ga0157380_121382761 | 3300014326 | Switchgrass Rhizosphere | FPYEPRVIGGKLLMPAELQAIHRYVVETPVLKVVTEEIRAVVETVWPELISKLPPKPRPDATGGG*
Ga0157377_102149052 | 3300014745 | Miscanthus Rhizosphere | MPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPRR*
Ga0132256_1022653121 | 3300015372 | Arabidopsis Rhizosphere | FPYEPRVVGGKLLMPAELQAIHRYVVETPVLEDVTEEIRAAVETVWPELISKLPPRP*
Ga0132257_1007616754 | 3300015373 | Arabidopsis Rhizosphere | MPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKR*
Ga0132255_1021160321 | 3300015374 | Arabidopsis Rhizosphere | MPAELQAIHRYVVETPVLETVTDEIRAVVETVWPELISKLPPKL*
Ga0190266_107172831 | 3300017965 | Soil | PAELQTIHRYVLETPVLENVTDEIRAVVKTVWPELMSKLPPKA
Ga0184621_102057802 | 3300018054 | Groundwater Sediment | GRLLMPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKL
Ga0184621_1021153323300018054Groundwater SedimentMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPDLISKLPPKPWD
Ga0184619_1002694413300018061Groundwater SedimentRVVGGKLLMPAELQAIHRYVVETPVLEAVTEEIRAVVETIWPELISKLPPKP
Ga0184611_114231613300018067Groundwater SedimentAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0190270_1068529323300018469SoilVVGGKLMMPAELQAIHRYVIEKPVLENVTEEIRAVVETVWPELM
Ga0190270_1087657413300018469SoilMMPAELQAIHRHVVETPVLETVTEEIRAVIETVWPELISKLPPRAG
Ga0190270_1308085913300018469SoilMPAELQAIHRYVLETPVLEAVTEEMRAVVETVWPEL
Ga0184644_111400913300019269Groundwater SedimentDRRRFPYEPRVVGGKLLMPAQLQAIHRYVVETPILNAVTEEIRAVVETVWPELISKLPPK
Ga0193704_103480013300019867SoilHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0193727_116916423300019886SoilLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPQP
Ga0193755_115154513300020004SoilMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPKP
Ga0210380_1037491523300021082Groundwater SedimentEPRVVGGKLLMPAELQAIHRYVVETPVLEEVTEEIRAVVETVWPELRSKLPPKS
Ga0222622_1093501823300022756Groundwater SedimentDRRRFPYEPRVVGGKLLMPAELQTIHRYVVETPVLKTVTEEIRAVVETVWPELVSKLPPK
Ga0207657_1093421213300025919Corn RhizosphereMPAELQAIHRYVVETPVLKVVTEEIRAVVETVWPELMSKLPPRR
Ga0207668_1089595633300025972Switchgrass RhizosphereQAIRRYVVETPVLNAVTAEIREVVESVWPELISKLPPRFDRGERN
Ga0207677_1167779913300026023Miscanthus RhizosphereRVIGGKLLMPAELQTIHRYVVETPVLEALTDEIRAVVETVWPELISKLPPKT
Ga0207678_1141061813300026067Corn RhizosphereMPAELQAIHRYVVETPVLKVVTEEIREVVETVWPELISKLPPKPRRDAAGGG
Ga0209814_1032578013300027873Populus RhizosphereRVVGGKLITSAELHVIHRYVVETPILEEVTEEIRAVVETVWPELISKLPPKR
Ga0209488_1055860233300027903Vadose Zone SoilMPAELQAIHRYVVETPVLKTVTEEIREVVETVWPELISKLPPKP
Ga0268265_1104132013300028380Switchgrass RhizosphereRFPYEPRVIGGKLLMPAELQTIHRYVVETPVLEALTDEIRAVVETVWLELISKLPPKT
Ga0307293_1006304113300028711SoilQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPLKP
Ga0307298_1003774613300028717SoilGGKLLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPKP
Ga0307298_1013896613300028717SoilMPAELQAIHRYVVETPILNVVTEEIRAAVETVWPELIS
Ga0307298_1023285713300028717SoilMPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKL
Ga0307317_1002636113300028720SoilIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKL
Ga0307315_1004277923300028721SoilKLLMPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPEPISKLPPKP
Ga0307297_1038621723300028754SoilELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0307316_1004076513300028755SoilLMPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0307280_1033907223300028768SoilTGQKGRPQERKSPALPRVVGGRLLMPAELPPVLEAVTEEIRAVVETFWPELISKLPPKL
Ga0307282_1010911323300028784SoilLLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPKP
Ga0307299_1004655223300028793SoilMPAELQAIHRYVVETPILNVVTEEIRAVVETVWPELISKLPPKP
Ga0307287_1040758113300028796SoilPYEPRVVGGRLLMPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0307305_1010098913300028807SoilGKLLMPAELQAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPKP
Ga0307294_1037057413300028810SoilLQAIHRYVVETPALKAVTEEIRAVVETVWPELISKLPPKL
Ga0307294_1042264823300028810SoilIGGKLMMPAELQAIHRYVVETPILENITEEVRAVVETVWPELMSKLPPKA
Ga0307289_1007452923300028875SoilMKGRPQERKSPALPRVVGGRLLMPAELPPVLEAVTEEIRAVVETFWPELISKLPPKL
Ga0307289_1012241013300028875SoilAIHRYVVETPVLEAVTEEIRAVVETVWPELISKLPPQP
Ga0307304_1034266233300028885SoilRLLMPAELQAIHRYVTETPVLEAVTEEIRAVVETVWPELISKLPPKL
Ga0307304_1041866613300028885SoilMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELRSKLPPKP
Ga0307304_1057313113300028885SoilRVVGGKLLMPAELQAIHRYVVETPVLETVTEEIRAVVETVWPELISKLPPKP
Ga0308202_105890123300030902SoilMPAELQAIHRYVRETPVLEVVTEDIRAVVETVWPELISKLPPKL
Ga0308183_112083613300030988SoilMPAELQAIHRYVVETPVLEVVTEEIRAVVETVWPELISKLPPKR
Ga0308201_1012660013300031091SoilMPAELQAIRRYVVETPVLNAVTEEIRAVVETVWPELISKLPPKLRYAP
Ga0308204_1003267713300031092SoilFPYEPRVVGGKLLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPKP
Ga0308204_1011624323300031092SoilRRRFPYEPRVVGGKLLMPAELQAIHRYVVETPVLETVTEEIRAVVETVWPELISKLPPKP
Ga0308204_1013251833300031092SoilFPYEPRVVGGKLLMPAELQAIHRYVVETPVLKTVTEEIRAVVETVWPELISKLPPKL
Ga0307468_10246915513300031740Hardwood Forest SoilIHRYVVEAPILENVTEEIRAVIETVWPELISKLPPKA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice