NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F098796

Metagenome / Metatranscriptome Family F098796

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F098796
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 47 residues
Representative Sequence MTPQPSDWRSVAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH
Number of Associated Samples 57
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 90.29 %
% of genes near scaffold ends (potentially truncated) 13.59 %
% of genes from short scaffolds (< 2000 bps) 70.87 %
Associated GOLD sequencing projects 49
AlphaFold2 3D model prediction Yes
3D model pTM-score0.50

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (56.311 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil
(13.592 % of family members)
Environment Ontology (ENVO) Unclassified
(22.330 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.427 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 51.35%    β-sheet: 0.00%    Coil/Unstructured: 48.65%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF00072Response_reg 11.65
PF13545HTH_Crp_2 9.71
PF00589Phage_integrase 2.91
PF00501AMP-binding 1.94
PF01850PIN 1.94
PF04892VanZ 0.97
PF13185GAF_2 0.97
PF04366Ysc84 0.97
PF00069Pkinase 0.97
PF05985EutC 0.97
PF03631Virul_fac_BrkB 0.97
PF01867Cas_Cas1 0.97
PF00115COX1 0.97
PF00392GntR 0.97
PF12833HTH_18 0.97
PF00027cNMP_binding 0.97
PF01593Amino_oxidase 0.97
PF03626COX4_pro 0.97
PF00892EamA 0.97
PF05598DUF772 0.97
PF04055Radical_SAM 0.97
PF00534Glycos_transf_1 0.97
PF00196GerE 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0515Serine/threonine protein kinaseSignal transduction mechanisms [T] 3.88
COG1295Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase)Function unknown [S] 0.97
COG1518CRISPR-Cas system-associated integrase Cas1Defense mechanisms [V] 0.97
COG2930Lipid-binding SYLF domain, Ysc84/FYVE familyLipid transport and metabolism [I] 0.97
COG3125Heme/copper-type cytochrome/quinol oxidase, subunit 4Energy production and conversion [C] 0.97
COG4302Ethanolamine ammonia-lyase, small subunitAmino acid transport and metabolism [E] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms56.31 %
UnclassifiedrootN/A43.69 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001593|JGI12635J15846_10117169All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1886Open in IMG/M
3300002915|JGI25387J43893_1018090Not Available968Open in IMG/M
3300004139|Ga0058897_11173501Not Available528Open in IMG/M
3300004479|Ga0062595_101358050Not Available644Open in IMG/M
3300005167|Ga0066672_10329752Not Available995Open in IMG/M
3300005175|Ga0066673_10053859All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2039Open in IMG/M
3300005176|Ga0066679_10213578All Organisms → cellular organisms → Bacteria → Acidobacteria1233Open in IMG/M
3300005179|Ga0066684_10111027All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1694Open in IMG/M
3300005179|Ga0066684_11101820Not Available508Open in IMG/M
3300005435|Ga0070714_100013181All Organisms → cellular organisms → Bacteria → Acidobacteria6617Open in IMG/M
3300005435|Ga0070714_100151064All Organisms → cellular organisms → Bacteria2093Open in IMG/M
3300005435|Ga0070714_100275223All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatobacter → unclassified Candidatus Sulfotelmatobacter → Candidatus Sulfotelmatobacter sp. SbA71562Open in IMG/M
3300005435|Ga0070714_100533639Not Available1122Open in IMG/M
3300005435|Ga0070714_100540670All Organisms → cellular organisms → Bacteria1114Open in IMG/M
3300005435|Ga0070714_100891163All Organisms → cellular organisms → Bacteria863Open in IMG/M
3300005436|Ga0070713_100383862All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1309Open in IMG/M
3300005437|Ga0070710_11223049Not Available556Open in IMG/M
3300005437|Ga0070710_11482070Not Available509Open in IMG/M
3300005454|Ga0066687_10532552All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium696Open in IMG/M
3300005534|Ga0070735_10000691All Organisms → cellular organisms → Bacteria → Acidobacteria35020Open in IMG/M
3300005534|Ga0070735_10004585All Organisms → cellular organisms → Bacteria → Acidobacteria11600Open in IMG/M
3300005534|Ga0070735_10004800All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae11310Open in IMG/M
3300005534|Ga0070735_10115882All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1678Open in IMG/M
3300005534|Ga0070735_10332790Not Available913Open in IMG/M
3300005539|Ga0068853_101201561Not Available732Open in IMG/M
3300005560|Ga0066670_10397134Not Available845Open in IMG/M
3300005563|Ga0068855_102095372Not Available570Open in IMG/M
3300005575|Ga0066702_10150397All Organisms → cellular organisms → Bacteria1378Open in IMG/M
3300005587|Ga0066654_10327000Not Available829Open in IMG/M
3300006028|Ga0070717_10131726All Organisms → cellular organisms → Bacteria2151Open in IMG/M
3300006028|Ga0070717_11029347Not Available750Open in IMG/M
3300006032|Ga0066696_10240039All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1168Open in IMG/M
3300006032|Ga0066696_10715292Not Available642Open in IMG/M
3300006893|Ga0073928_10000626All Organisms → cellular organisms → Bacteria79453Open in IMG/M
3300006893|Ga0073928_10000628All Organisms → cellular organisms → Bacteria79319Open in IMG/M
3300006893|Ga0073928_10000968All Organisms → cellular organisms → Bacteria59010Open in IMG/M
3300007819|Ga0104322_131764All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1617Open in IMG/M
3300007819|Ga0104322_138713All Organisms → cellular organisms → Bacteria2116Open in IMG/M
3300007982|Ga0102924_1015227All Organisms → cellular organisms → Bacteria6023Open in IMG/M
3300009093|Ga0105240_10004411All Organisms → cellular organisms → Bacteria21467Open in IMG/M
3300009093|Ga0105240_10034008All Organisms → cellular organisms → Bacteria → Acidobacteria6580Open in IMG/M
3300009093|Ga0105240_10178358All Organisms → cellular organisms → Bacteria2509Open in IMG/M
3300009093|Ga0105240_12669125Not Available516Open in IMG/M
3300009545|Ga0105237_10606522All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1102Open in IMG/M
3300009545|Ga0105237_10850016All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300009551|Ga0105238_10053741All Organisms → cellular organisms → Bacteria4047Open in IMG/M
3300010321|Ga0134067_10410821Not Available544Open in IMG/M
3300010371|Ga0134125_10653674Not Available1161Open in IMG/M
3300010371|Ga0134125_11169939Not Available841Open in IMG/M
3300010373|Ga0134128_12463103All Organisms → cellular organisms → Bacteria → Acidobacteria573Open in IMG/M
3300010373|Ga0134128_12729044Not Available544Open in IMG/M
3300010373|Ga0134128_12806470All Organisms → cellular organisms → Bacteria537Open in IMG/M
3300010396|Ga0134126_10028649All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium7058Open in IMG/M
3300010396|Ga0134126_10081900All Organisms → cellular organisms → Bacteria3993Open in IMG/M
3300010396|Ga0134126_10156408All Organisms → cellular organisms → Bacteria2758Open in IMG/M
3300010396|Ga0134126_11329077Not Available796Open in IMG/M
3300010399|Ga0134127_12371571Not Available610Open in IMG/M
3300011120|Ga0150983_12391720Not Available1876Open in IMG/M
3300011120|Ga0150983_14396585All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium710Open in IMG/M
3300012469|Ga0150984_101425174All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1181Open in IMG/M
3300013104|Ga0157370_10814746All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium849Open in IMG/M
3300013307|Ga0157372_10477225All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1454Open in IMG/M
3300018431|Ga0066655_10259052All Organisms → cellular organisms → Bacteria → Acidobacteria1115Open in IMG/M
3300018433|Ga0066667_12164950Not Available517Open in IMG/M
3300018468|Ga0066662_10916859All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium861Open in IMG/M
3300019888|Ga0193751_1095403Not Available1155Open in IMG/M
3300020579|Ga0210407_10214641All Organisms → cellular organisms → Bacteria1496Open in IMG/M
3300020579|Ga0210407_10510568Not Available939Open in IMG/M
3300021168|Ga0210406_10172062Not Available1809Open in IMG/M
3300021168|Ga0210406_10303672All Organisms → cellular organisms → Bacteria → Acidobacteria1297Open in IMG/M
3300021479|Ga0210410_10361939Not Available1301Open in IMG/M
3300022557|Ga0212123_10000509All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia125835Open in IMG/M
3300022557|Ga0212123_10001130All Organisms → cellular organisms → Bacteria → Acidobacteria77332Open in IMG/M
3300022557|Ga0212123_10001496All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae64602Open in IMG/M
3300022557|Ga0212123_10014025All Organisms → cellular organisms → Bacteria10308Open in IMG/M
3300025898|Ga0207692_10986813Not Available556Open in IMG/M
3300025913|Ga0207695_10044785All Organisms → cellular organisms → Bacteria → Acidobacteria4703Open in IMG/M
3300025913|Ga0207695_10049749All Organisms → cellular organisms → Bacteria4414Open in IMG/M
3300025913|Ga0207695_10058420All Organisms → cellular organisms → Bacteria → Proteobacteria4004Open in IMG/M
3300025913|Ga0207695_10085979All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → unclassified Terriglobales → Acidobacteriales bacterium 13_2_20CM_55_83173Open in IMG/M
3300025915|Ga0207693_10913715Not Available673Open in IMG/M
3300025928|Ga0207700_10244697Not Available1530Open in IMG/M
3300025929|Ga0207664_10158652All Organisms → cellular organisms → Bacteria1928Open in IMG/M
3300025929|Ga0207664_10196109All Organisms → cellular organisms → Bacteria1741Open in IMG/M
3300025929|Ga0207664_10312404Not Available1385Open in IMG/M
3300025929|Ga0207664_10873178Not Available808Open in IMG/M
3300025929|Ga0207664_11080873Not Available717Open in IMG/M
3300025929|Ga0207664_11568277Not Available580Open in IMG/M
3300025929|Ga0207664_11828717Not Available530Open in IMG/M
3300025929|Ga0207664_11905891Not Available517Open in IMG/M
3300026300|Ga0209027_1025595All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2235Open in IMG/M
3300026301|Ga0209238_1100000Not Available984Open in IMG/M
3300026301|Ga0209238_1110328All Organisms → cellular organisms → Bacteria921Open in IMG/M
3300026550|Ga0209474_10247938All Organisms → cellular organisms → Bacteria1099Open in IMG/M
3300026550|Ga0209474_10525410Not Available600Open in IMG/M
3300026552|Ga0209577_10720173Not Available562Open in IMG/M
3300027587|Ga0209220_1134907Not Available641Open in IMG/M
3300027635|Ga0209625_1133959Not Available555Open in IMG/M
3300027986|Ga0209168_10000849All Organisms → cellular organisms → Bacteria → Acidobacteria26881Open in IMG/M
3300027986|Ga0209168_10013589All Organisms → cellular organisms → Bacteria4757Open in IMG/M
3300027986|Ga0209168_10231571Not Available918Open in IMG/M
3300031740|Ga0307468_101670996Not Available598Open in IMG/M
3300034644|Ga0370548_060873Not Available695Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil13.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil12.62%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere10.68%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil9.71%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring7.77%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil7.77%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.77%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.80%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil5.83%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.85%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.94%
Permafrost SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Permafrost Soil1.94%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.94%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.97%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002915Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cmEnvironmentalOpen in IMG/M
3300004139Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF230 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300007819Permafrost core soil microbial communities from Svalbard, Norway - sample 2-1-2 SoapdenovoEnvironmentalOpen in IMG/M
3300007982Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaGEnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300013104Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019888Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c2EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025913Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025928Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027635Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300034644Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_123 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1011716943300001593Forest SoilMTPQPRDWRRVAEEVRDEQDSIKMMALVIELDRLLENEDKSRKRPH*
JGI25387J43893_101809023300002915Grasslands SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRILEREEKARKRSR*
Ga0058897_1117350123300004139Forest SoilGRFPMTPQPSNRRHEAERICYEKDSNKMMELVFELDRLLECEDKSRKRPH*
Ga0062595_10135805023300004479SoilMTPQPSNWRSIAEQICDEKDSNKMMALVFELDRLLERDEKFRKRSH*
Ga0066672_1032975223300005167SoilMTPQPSDWRSVAEQISDEKDSNKMMALVVELDRLLEREEKPRKRSH*
Ga0066673_1005385923300005175SoilMTPQSSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKTRKRSH*
Ga0066679_1021357813300005176SoilMTPQPSDWRSVAEQISDEKDSNKMMALVAELDRLLEREEKARKRSH*
Ga0066684_1011102733300005179SoilMIPMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKTRKRSH*
Ga0066684_1110182013300005179SoilVTPQPRDWRRVAEEVRDEKDSAKMMALVIELDHLLEKEDKARRRPHCRAWMIN*
Ga0070714_100013181103300005435Agricultural SoilMTPQPRDWRRVAEQVRDEKDSSKMMALVIELDRLLENEDKSRKRPH*
Ga0070714_10015106443300005435Agricultural SoilMTPQPRDWRRVAEEARDEKDSVKMMALIIELDSLLEKEDKTRRRPH*
Ga0070714_10027522343300005435Agricultural SoilNQKLKGGLPMTPQPSNWRFVAEQICDEKDSDKMMALVVELDRLLERDEKSRKHQQH*
Ga0070714_10053363933300005435Agricultural SoilMTPQPRDWRRVAEEVRDEKDSTKMMALFVELDRLLENEDKSRKRPH*
Ga0070714_10054067023300005435Agricultural SoilMSPQPGDWRSVAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH*
Ga0070714_10089116323300005435Agricultural SoilMTPQPSDWRSVAEQICDEKDSNKMMALVFELDRLLERDEKFRKRSH*
Ga0070713_10038386223300005436Corn, Switchgrass And Miscanthus RhizosphereMTPQQPSNWRPVAEQICDEKDSTKMMALVIELNRLLERDEKSRKQQQQH*
Ga0070710_1122304913300005437Corn, Switchgrass And Miscanthus RhizosphereMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLEREDKSRQRPH*
Ga0070710_1148207013300005437Corn, Switchgrass And Miscanthus RhizosphereMTPQPSDWRYVAERVCMEKDSNKMIALVMELNRLLEREQSNKRTH*
Ga0066687_1053255213300005454SoilMIPMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRILEREEKARKRSH*
Ga0070735_10000691303300005534Surface SoilMTPQPRDWRCVAEQIRDERDSKKMMDLVVELDRLLESEDKSRRRPH*
Ga0070735_1000458573300005534Surface SoilMTPQPSNWRSVAEQICDEKNSDKMMALVFELDRLLERDEKARKQQQH*
Ga0070735_10004800143300005534Surface SoilMTPQPRDWRLVAEQIRDEKDSNKMMELVVELDCLLERDENARKHPHNLCA*
Ga0070735_1011588233300005534Surface SoilMTPQPRDWRSVAEQIADEKNSDKMMALVFELNRLLERDEKSRKHQQQ*
Ga0070735_1033279023300005534Surface SoilMTPQPRTPQPRDWRYVAEQIRDEKDSKKMMDLVVELDRLLESEDKSRRRPHYQLPKL*
Ga0068853_10120156133300005539Corn RhizosphereSPMTPQPRDWRRVAEQVRDEKDSSKLMVLVIELNRLLENEDKSRKRPH*
Ga0066670_1039713423300005560SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH*
Ga0068855_10209537223300005563Corn RhizosphereMTPQPRDWRRVAEQLRDEKDSSKMMALVIELDRLLENEDKSRKRPH*
Ga0066702_1015039723300005575SoilMTPQPSNWRSLAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH*
Ga0066654_1032700023300005587SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKTRKRSH*
Ga0070717_1013172633300006028Corn, Switchgrass And Miscanthus RhizosphereMTPQPRDWRYVAKQICDERDSNKMMELVVELDRLLEREEKSRKQH*
Ga0070717_1102934723300006028Corn, Switchgrass And Miscanthus RhizosphereMTPQQPSNWRPVAEQICDEKDSTKMMALVIELNRLLERDEKARKQQQQH*
Ga0066696_1024003923300006032SoilVTPQPRDWRRVAEEVRDEKDSVKMMALVIELDRLLEKEDKSRRRPH*
Ga0066696_1071529213300006032SoilLIPMTPQPSGWRSLAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH*
Ga0073928_10000626763300006893Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSNKMMALVIELDHLLECEDKSRKRPH*
Ga0073928_10000628423300006893Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLECEDKSRRRPH*
Ga0073928_10000968573300006893Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSMKMMALVIELDRLLESEDKSRKRSH*
Ga0104322_13176433300007819Permafrost SoilMTPQPRDWRRVAEQVCDEKDSIKMMALVIELDHLLENEDKSRKRPH*
Ga0104322_13871323300007819Permafrost SoilMTPQPNNWRYVAEQICDEKDSIKMMTLVIELDRLLEYEDKSRKPQQH*
Ga0102924_101522773300007982Iron-Sulfur Acid SpringMTPQPSDWRHVAEQIRDEKDSTKMMALVIELDRLLEREDKSRKPQH*
Ga0105240_1000441163300009093Corn RhizosphereMTPQPRTPQPSDWRYVAEQIRDEKDSKKMMDLVAELDRLLESEDKSRRRPH*
Ga0105240_1003400813300009093Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSSKLMVLVIELNRLLENEDKSRKRPH*
Ga0105240_1017835833300009093Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSSKMMALVIELNRLLENEDKSRKRSH*
Ga0105240_1266912513300009093Corn RhizosphereTPQPRDWRRVAEQLRDEKDSSKMMALVIELDRLLERDEKFRKRSH*
Ga0105237_1060652213300009545Corn RhizosphereRTLQPRDWRYVAEQIRDEKDSKKMMDLVVELDRLLESEDKSRRRRY*
Ga0105237_1085001623300009545Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSSKMMALVIELNRLLENEDMSRKRPH*
Ga0105238_1005374133300009551Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSNKLMALVIELARLLESEDKSRKRPH*
Ga0134067_1041082113300010321Grasslands SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRILEREEKARKRSH*
Ga0134125_1065367423300010371Terrestrial SoilMTPQPSDWRSVAEQICDEKNSNKMMALVVELDRLFEREEKARKSSH*
Ga0134125_1116993913300010371Terrestrial SoilMTPQPRDWRCVAEQIRDEKDSKKMMALVVELDRLMESEDKSRRRPH*
Ga0134128_1246310313300010373Terrestrial SoilMTPQPRDWRRVAEQLRDEKDSSKMMALVIELDRLLENEDKSRKR
Ga0134128_1272904413300010373Terrestrial SoilMTPQPRTLQPRDWRYVAEQIRDEKDSKKMMDLVVELDRLLESEDKSRRRRY*
Ga0134128_1280647023300010373Terrestrial SoilMTPQPRDWRRVAEEVRDEKDSTKMMALVIELDRLLENEDR
Ga0134126_1002864983300010396Terrestrial SoilMTPQPRDWRCVAEQIRDEKDSKKMMDLVVELDRLLESEDKSRRRPH*
Ga0134126_1008190013300010396Terrestrial SoilMTPQPSDWRLVAAQVRDEKDPNKMMALVIELDRLLERDEKTRKRSR*
Ga0134126_1015640823300010396Terrestrial SoilMTPQPCNWRSIAEQICDEKDSNKMMALVFELDRLLERDEKFRKRSH*
Ga0134126_1132907723300010396Terrestrial SoilMTPQPRDWRRVAEEVRDEKDSTKMMALVIELDRLLENEDRSRKRPHPGRDLAKAWK*
Ga0134127_1237157123300010399Terrestrial SoilQKQKGGFSMTPQPRTLQPRDWRYVAEQIRDEKDSKKMMDLVVELDRLLESEDKSRRRRY*
Ga0150983_1239172023300011120Forest SoilLYSRNTEGRFPMTPQPRTPQPRDWRYVAEQIRDEEDSKKMMELVVELDRLLESEDKSRKRPH*
Ga0150983_1439658513300011120Forest SoilETRNREGRFPMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLEREDESRKRPH*
Ga0150984_10142517413300012469Avena Fatua RhizosphereMTPQPSDWRSIAAQICDEKDSNKMMALVVELDRLLEREEKARKRSH*
Ga0157370_1081474613300013104Corn RhizosphereKYMTPQPRTPQPSDWRYVAEQIRDEKDSKKMMDLVVELDRLLESDDKSRRRPH*
Ga0157372_1047722523300013307Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSNKLIALVIELDRLLESEDKSRKRPH*
Ga0066655_1025905223300018431Grasslands SoilMTPQPSDWRSVAEQISDEKDSNKMMALVVELDRLLEREEKTRKRSH
Ga0066667_1216495013300018433Grasslands SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKTRKRSH
Ga0066662_1091685913300018468Grasslands SoilMTPQSSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKTRKRSH
Ga0193751_109540313300019888SoilMTPQPRDWRCVAEQICDEKDSIKMMELVVELDRLLEREDKSRKRPH
Ga0210407_1021464143300020579SoilMTPQPSDWRYVAEQICDEKDSNKMMELVVELDRLLEREDKSRKRPH
Ga0210407_1051056813300020579SoilMTPQPRTPQPRDWRYVAEQIRDEEDSKKMMELVVELDRLLESEDKSRKRPH
Ga0210406_1017206213300021168SoilMTPQPRTSQPRDWRYVAEQIRDEKDSKKMMELVVELDRLLESEDKSRKPPH
Ga0210406_1030367213300021168SoilMTPQPSNRRHEAERICYEKDSNKMMELVFELDRLLECED
Ga0210410_1036193923300021479SoilMTPQPRDWRRVAEQVRDEKDSTKMMALVIELDRLLECEDKSRKRPH
Ga0212123_10000509973300022557Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSMKMMALVIELDRLLESEDKSRKRSH
Ga0212123_10001130613300022557Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLECEDKSRRRPH
Ga0212123_10001496343300022557Iron-Sulfur Acid SpringMTPQPRDWRRVAEEVRDEKDSNKMMALVIELDHLLECEDKSRKRPH
Ga0212123_1001402553300022557Iron-Sulfur Acid SpringMTPQPSDWRHVAEQIRDEKDSTKMMALVIELDRLLEREDKSRKPQH
Ga0207692_1098681313300025898Corn, Switchgrass And Miscanthus RhizosphereMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLEREDKSRQRPH
Ga0207695_1004478513300025913Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSSKLMVLVIELNRLLENEDKSRKRPH
Ga0207695_1004974943300025913Corn RhizosphereMTPQPRDWRRVAEQLRDEKDSSKMMALVIELDRLLENEDKSRKRPH
Ga0207695_1005842053300025913Corn RhizosphereMTPQPRDWRRVAEQVRDEKDSSKMMALVIELNRLLENEDMSRKRPH
Ga0207695_1008597943300025913Corn RhizosphereMTPQPRTPQPSDWRYVAEQIRDEKDSKKMMDLVAELDRLLESEDKSRRRPH
Ga0207693_1091371513300025915Corn, Switchgrass And Miscanthus RhizosphereMTPQPSNWRFVAEQICDEKDSDKMMALVVELDRLLER
Ga0207700_1024469733300025928Corn, Switchgrass And Miscanthus RhizosphereMTPQQPSNWRPVAEQICDEKDSTKMMALVIELNRLLERDEKSRKQQQQH
Ga0207664_1015865223300025929Agricultural SoilMSPQPGDWRSVAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH
Ga0207664_1019610923300025929Agricultural SoilMTPQPRDWRRVAEEARDEKDSVKMMALIIELDSLLEKEDKTRRRPH
Ga0207664_1031240423300025929Agricultural SoilMTPQPRDWRRVAEQVRDEKDSNKLIALVIELDRLLESEDKSRKRPH
Ga0207664_1087317813300025929Agricultural SoilMTPQPCNWRSIAEQICDEKDSNKMMALVFELDRLLERDEKFRKRSH
Ga0207664_1108087313300025929Agricultural SoilMTPQPSNWRFVAEQICDEKDSDKMMALVVELDRLLERDEKSRKHQQH
Ga0207664_1156827713300025929Agricultural SoilREIEGRFSMTPQPRDWRRVAEEVRDEKDSTKMMALFVELDRLLENEDKSRKRPH
Ga0207664_1182871713300025929Agricultural SoilMTPQPSDWRSVAEQICDEKDSNKMMALVVELDCLLEREEKARKR
Ga0207664_1190589113300025929Agricultural SoilMTPLQPSNWQTVAEQICDEKNSDKMMALVVELDRLLERDEKSRKHQQH
Ga0209027_102559533300026300Grasslands SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH
Ga0209238_110000013300026301Grasslands SoilMIPMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRILEREEKARKRSH
Ga0209238_111032813300026301Grasslands SoilMTPQPSDWRSLAEQICDEKDSNKMMALVVELDRILEREEKPRQRSR
Ga0209474_1024793813300026550SoilMTPQPSDWRSVAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH
Ga0209474_1052541023300026550SoilMTPQPSDWRSVAEQISDEKDSNKMMALVVELDRLLEREEKPRKRSH
Ga0209577_1072017323300026552SoilMTPQPSNWRSLAEQICDEKDSNKMMALVVELDRLLEREEKARKRSH
Ga0209220_113490713300027587Forest SoilMTPQPRDWRRVAEEVRDEKDSIKMMALVIELDRLLECEDKSRKRPH
Ga0209625_113395923300027635Forest SoilMTPQPSNWRYVAEQICDEKDSNKMMELVFELDRLLEREDKSRKRPH
Ga0209168_10000849303300027986Surface SoilMTPQPRDWRCVAEQIRDERDSKKMMDLVVELDRLLESEDKSRRRPH
Ga0209168_1001358963300027986Surface SoilMTPQPRDWRLVAEQIRDEKDSNKMMELVVELDCLLERDENARKHPHNLCA
Ga0209168_1023157123300027986Surface SoilMTPQPSNWRSVAEQICDEKNSDKMMALVFELDRLLERDEKARKQQQH
Ga0307468_10167099623300031740Hardwood Forest SoilMTPQPNNWRYVAEQICDEKDSNKMMALVIELDRLLEREDKSRKPQQH
Ga0370548_060873_29_1693300034644SoilMTPQPRDWRRVAEEVRDEKDSDKMMALVIELDRLLENEDKSRKRPH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.