NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105587

Metagenome / Metatranscriptome Family F105587

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105587
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 67 residues
Representative Sequence MNIANSRSYPDATRLAELVQKIRVLKLDLSRIIEHSRNSKVGSRLIMASGLLDGALEELSKAASEQSD
Number of Associated Samples 57
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 77.00 %
% of genes near scaffold ends (potentially truncated) 22.00 %
% of genes from short scaffolds (< 2000 bps) 64.00 %
Associated GOLD sequencing projects 48
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (50.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil
(26.000 % of family members)
Environment Ontology (ENVO) Unclassified
(46.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(81.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 56.25%    β-sheet: 0.00%    Coil/Unstructured: 43.75%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF12844HTH_19 8.00
PF00072Response_reg 8.00
PF11941DUF3459 3.00
PF04545Sigma70_r4 3.00
PF07519Tannase 3.00
PF07238PilZ 2.00
PF01381HTH_3 2.00
PF13560HTH_31 2.00
PF03466LysR_substrate 1.00
PF13442Cytochrome_CBB3 1.00
PF00510COX3 1.00
PF04366Ysc84 1.00
PF08240ADH_N 1.00
PF02518HATPase_c 1.00
PF13690CheX 1.00
PF02735Ku 1.00
PF07589PEP-CTERM 1.00
PF03707MHYT 1.00
PF13384HTH_23 1.00
PF00589Phage_integrase 1.00
PF02661Fic 1.00
PF04972BON 1.00
PF13620CarboxypepD_reg 1.00
PF00486Trans_reg_C 1.00
PF04454Linocin_M18 1.00
PF13435Cytochrome_C554 1.00
PF01493GXGXG 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1273Non-homologous end joining protein Ku, dsDNA break repairReplication, recombination and repair [L] 1.00
COG1845Heme/copper-type cytochrome/quinol oxidase, subunit 3Energy production and conversion [C] 1.00
COG2930Lipid-binding SYLF domain, Ysc84/FYVE familyLipid transport and metabolism [I] 1.00
COG3300MHYT domain, NO-binding membrane sensorSignal transduction mechanisms [T] 1.00
COG5001Cyclic di-GMP metabolism protein, combines GGDEF and EAL domains with a 6TM membrane domainSignal transduction mechanisms [T] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms50.00 %
UnclassifiedrootN/A50.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001082|JGI12664J13189_1001811All Organisms → cellular organisms → Bacteria → Acidobacteria2473Open in IMG/M
3300001082|JGI12664J13189_1004508Not Available1091Open in IMG/M
3300001124|JGI12692J13336_1005787Not Available657Open in IMG/M
3300001151|JGI12713J13577_1000040All Organisms → cellular organisms → Bacteria41011Open in IMG/M
3300001175|JGI12649J13570_1002209All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales2979Open in IMG/M
3300001175|JGI12649J13570_1018942Not Available750Open in IMG/M
3300001593|JGI12635J15846_10013383All Organisms → cellular organisms → Bacteria6866Open in IMG/M
3300001593|JGI12635J15846_10017931All Organisms → cellular organisms → Bacteria5865Open in IMG/M
3300001593|JGI12635J15846_10022174Not Available5197Open in IMG/M
3300001593|JGI12635J15846_10046938All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidicapsa → Acidicapsa acidisoli3320Open in IMG/M
3300001593|JGI12635J15846_10065609All Organisms → cellular organisms → Bacteria2705Open in IMG/M
3300002245|JGIcombinedJ26739_100246036All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1677Open in IMG/M
3300002245|JGIcombinedJ26739_100515026Not Available1073Open in IMG/M
3300003505|JGIcombinedJ51221_10047530All Organisms → cellular organisms → Bacteria1620Open in IMG/M
3300003505|JGIcombinedJ51221_10097821All Organisms → cellular organisms → Bacteria1165Open in IMG/M
3300004082|Ga0062384_100357666Not Available927Open in IMG/M
3300004082|Ga0062384_100471404Not Available825Open in IMG/M
3300004082|Ga0062384_100815241Not Available653Open in IMG/M
3300004091|Ga0062387_100001102All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium6725Open in IMG/M
3300004092|Ga0062389_100934538Not Available1051Open in IMG/M
3300004120|Ga0058901_1546133All Organisms → cellular organisms → Bacteria1626Open in IMG/M
3300005591|Ga0070761_10001909All Organisms → cellular organisms → Bacteria12993Open in IMG/M
3300005591|Ga0070761_10437616Not Available801Open in IMG/M
3300005591|Ga0070761_10489998Not Available757Open in IMG/M
3300005591|Ga0070761_10790911Not Available597Open in IMG/M
3300005591|Ga0070761_10955383Not Available543Open in IMG/M
3300005602|Ga0070762_10005165All Organisms → cellular organisms → Bacteria6310Open in IMG/M
3300005602|Ga0070762_10970102Not Available582Open in IMG/M
3300005602|Ga0070762_11289355Not Available507Open in IMG/M
3300005610|Ga0070763_10009668All Organisms → cellular organisms → Bacteria3925Open in IMG/M
3300005610|Ga0070763_10589887Not Available643Open in IMG/M
3300005712|Ga0070764_10132994Not Available1355Open in IMG/M
3300005712|Ga0070764_10682272Not Available632Open in IMG/M
3300006176|Ga0070765_100191626All Organisms → cellular organisms → Bacteria1851Open in IMG/M
3300006176|Ga0070765_100456832Not Available1197Open in IMG/M
3300006176|Ga0070765_101056943Not Available767Open in IMG/M
3300006176|Ga0070765_101875037Not Available562Open in IMG/M
3300009518|Ga0116128_1139304Not Available698Open in IMG/M
3300009547|Ga0116136_1182795Not Available525Open in IMG/M
3300009623|Ga0116133_1006584All Organisms → cellular organisms → Bacteria → Acidobacteria2911Open in IMG/M
3300009623|Ga0116133_1020506All Organisms → cellular organisms → Bacteria1636Open in IMG/M
3300009624|Ga0116105_1000055All Organisms → cellular organisms → Bacteria42358Open in IMG/M
3300009624|Ga0116105_1179744Not Available574Open in IMG/M
3300009762|Ga0116130_1078213Not Available1038Open in IMG/M
3300014489|Ga0182018_10062943All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2237Open in IMG/M
3300014489|Ga0182018_10169549Not Available1237Open in IMG/M
3300017925|Ga0187856_1336312Not Available513Open in IMG/M
3300018013|Ga0187873_1059391All Organisms → cellular organisms → Bacteria1617Open in IMG/M
3300018042|Ga0187871_10021114All Organisms → cellular organisms → Bacteria4204Open in IMG/M
3300019258|Ga0181504_1332771All Organisms → cellular organisms → Bacteria1781Open in IMG/M
3300019260|Ga0181506_1318699Not Available1142Open in IMG/M
3300020580|Ga0210403_10038650All Organisms → cellular organisms → Bacteria3815Open in IMG/M
3300020582|Ga0210395_10884297Not Available664Open in IMG/M
3300020583|Ga0210401_10172043All Organisms → cellular organisms → Bacteria2020Open in IMG/M
3300020583|Ga0210401_10341256All Organisms → cellular organisms → Bacteria1359Open in IMG/M
3300021171|Ga0210405_10000070All Organisms → cellular organisms → Bacteria133501Open in IMG/M
3300021181|Ga0210388_10035419All Organisms → cellular organisms → Bacteria4121Open in IMG/M
3300021181|Ga0210388_10049273Not Available3508Open in IMG/M
3300021181|Ga0210388_10137445All Organisms → cellular organisms → Bacteria2116Open in IMG/M
3300021181|Ga0210388_10223605All Organisms → cellular organisms → Bacteria1650Open in IMG/M
3300021181|Ga0210388_10569496Not Available992Open in IMG/M
3300021181|Ga0210388_10841338All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium793Open in IMG/M
3300021401|Ga0210393_10112232All Organisms → cellular organisms → Bacteria2175Open in IMG/M
3300021401|Ga0210393_10196878All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1628Open in IMG/M
3300021401|Ga0210393_10650395Not Available860Open in IMG/M
3300021401|Ga0210393_11367437Not Available566Open in IMG/M
3300021405|Ga0210387_10001310All Organisms → cellular organisms → Bacteria18706Open in IMG/M
3300021406|Ga0210386_11002937Not Available712Open in IMG/M
3300021407|Ga0210383_10390608Not Available1200Open in IMG/M
3300021432|Ga0210384_10242069All Organisms → cellular organisms → Bacteria1624Open in IMG/M
3300021433|Ga0210391_10014631All Organisms → cellular organisms → Bacteria → Acidobacteria6336Open in IMG/M
3300021433|Ga0210391_10017507All Organisms → cellular organisms → Bacteria → Acidobacteria5784Open in IMG/M
3300021433|Ga0210391_10025041All Organisms → cellular organisms → Bacteria4800Open in IMG/M
3300021477|Ga0210398_10115265Not Available2183Open in IMG/M
3300021559|Ga0210409_10927155Not Available745Open in IMG/M
3300025406|Ga0208035_1075417Not Available510Open in IMG/M
3300025412|Ga0208194_1003640All Organisms → cellular organisms → Bacteria → Acidobacteria2912Open in IMG/M
3300025460|Ga0208562_1109284Not Available529Open in IMG/M
3300027334|Ga0209529_1001229All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3551Open in IMG/M
3300027334|Ga0209529_1016643Not Available1236Open in IMG/M
3300027370|Ga0209010_1004325All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium2988Open in IMG/M
3300027559|Ga0209222_1008954Not Available2063Open in IMG/M
3300027559|Ga0209222_1030109All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1051Open in IMG/M
3300027648|Ga0209420_1045355Not Available1333Open in IMG/M
3300027652|Ga0209007_1042590All Organisms → cellular organisms → Bacteria1159Open in IMG/M
3300027729|Ga0209248_10091279Not Available920Open in IMG/M
3300027795|Ga0209139_10006368All Organisms → cellular organisms → Bacteria4455Open in IMG/M
3300027855|Ga0209693_10289289Not Available800Open in IMG/M
3300027879|Ga0209169_10598888All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia576Open in IMG/M
3300027889|Ga0209380_10160811Not Available1314Open in IMG/M
3300027895|Ga0209624_10021961All Organisms → cellular organisms → Bacteria4042Open in IMG/M
3300027895|Ga0209624_10427304All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium883Open in IMG/M
3300027895|Ga0209624_11045090Not Available528Open in IMG/M
3300028906|Ga0308309_10954989Not Available743Open in IMG/M
3300028906|Ga0308309_11368306Not Available607Open in IMG/M
3300031708|Ga0310686_109357216Not Available725Open in IMG/M
3300034163|Ga0370515_0009015All Organisms → cellular organisms → Bacteria4785Open in IMG/M
3300034163|Ga0370515_0022878All Organisms → cellular organisms → Bacteria2861Open in IMG/M
3300034163|Ga0370515_0051092All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1822Open in IMG/M
3300034163|Ga0370515_0185656Not Available888Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil26.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil24.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil21.00%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland10.00%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil7.00%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland5.00%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil4.00%
PalsaEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Palsa2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001082Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O3EnvironmentalOpen in IMG/M
3300001124Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O3EnvironmentalOpen in IMG/M
3300001151Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O3EnvironmentalOpen in IMG/M
3300001175Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_O3EnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300004082Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3EnvironmentalOpen in IMG/M
3300004091Coassembly of ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004120Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300009518Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_150EnvironmentalOpen in IMG/M
3300009547Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_20_40EnvironmentalOpen in IMG/M
3300009623Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_19_10EnvironmentalOpen in IMG/M
3300009624Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_10EnvironmentalOpen in IMG/M
3300009762Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_40EnvironmentalOpen in IMG/M
3300014489Permafrost microbial communities from Stordalen Mire, Sweden - 812P2M metaGEnvironmentalOpen in IMG/M
3300017925Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_40EnvironmentalOpen in IMG/M
3300018013Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_100EnvironmentalOpen in IMG/M
3300018042Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_10EnvironmentalOpen in IMG/M
3300019258Metatranscriptome of peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_10_metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019260Metatranscriptome of peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_10_metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021406Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021433Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-OEnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300025406Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_11_10 (SPAdes)EnvironmentalOpen in IMG/M
3300025412Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_19_10 (SPAdes)EnvironmentalOpen in IMG/M
3300025460Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_16_150 (SPAdes)EnvironmentalOpen in IMG/M
3300027334Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027370Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O3 (SPAdes)EnvironmentalOpen in IMG/M
3300027559Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O3 (SPAdes)EnvironmentalOpen in IMG/M
3300027648Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300027652Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027729Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027795Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM3 (SPAdes)EnvironmentalOpen in IMG/M
3300027855Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027879Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300027895Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_O1 (SPAdes)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300034163Peat soil microbial communities from wetlands in Alaska, United States - Goldstream_04D_14EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12664J13189_100181123300001082Forest SoilMGNAGPRLNPDSTRLAELIQKIRALKLELSRIIEHSRNSEVGSCLIMASGLLDGAIEELSNAKVEQND*
JGI12664J13189_100450813300001082Forest SoilMSTAGSRLYPAPALLAELIQKIRVLRLDLSRIIEYSRESKVGSRLIMASGLLDGALEELSKAASEQRD*
JGI12692J13336_100578723300001124Forest SoilMSTAGSRSYPDATRLAEIVQKIRVLKLHLSGIIEHSRNSSVVSRLIMASGLLDIAIEE
JGI12713J13577_1000040303300001151Forest SoilMSTAGSRSYPDATRLAEIVQKIRVLKLHLSGIIEHSRNSSVVSRLIMASGLLDIAIEEISKSVSEAKH*
JGI12649J13570_100220923300001175Forest SoilLYPAPALLAELIQKIRVLRLDLSRIIEYSRESKVGSRLIMASGLLDGALEELSKAASEQRD*
JGI12649J13570_101894213300001175Forest SoilGNALRQQRGKHMGTAGSSSYSGSARLAELIQKIRILKSDLSGMINRGGNSQVISRLIMASGLLDGALEELSKAASEQKD*
JGI12635J15846_1001338393300001593Forest SoilMGTAGSSSYSGSARLAELIQKIRILKSDLSGMINRGGNSQVISRLIMASGLLDGALEELSKAASEQKD*
JGI12635J15846_1001793143300001593Forest SoilMSTAGSRLYPDPALLTELIQKIRVLRSDLGKIIEFSRDSKVGSGLIMASGLLDGALEELSKAASQQRD*
JGI12635J15846_1002217473300001593Forest SoilMATAGSRSYSGSTRLAELIQKIRILKSDLSGMINHGGNAKVISRLIMASGLLDGALEELSQAVSEQKE*
JGI12635J15846_1004693813300001593Forest SoilMTISDSRPHPDATHLAELVQKIRVLKLDLRGIMEHNRNSKLISRLIMASGLLDGAIEELSKSVSEAED*
JGI12635J15846_1006560923300001593Forest SoilMNIANSRSYPDATRLAELVQKIRVLKLDLSRIIEHSRNSKVGSRLIMASGLLDGALEELSKAASEQSD*
JGIcombinedJ26739_10024603623300002245Forest SoilMADLRPYPDPTLLYELIQRIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD*
JGIcombinedJ26739_10051502623300002245Forest SoilMDTAGSRLYPDSTRLAELVKKIRVLRLDLSRILQHSRNSHIGSHLIMASGLLDGAIEELSKAQSEQKD*
JGIcombinedJ51221_1004753033300003505Forest SoilMSTAGSRPYPDPALLAELIQKIRVLRLLLSKILEQSRNSKVDSRLIMASGLLDGAIDELSKAESEQKD*
JGIcombinedJ51221_1009782113300003505Forest SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVXEQRD*
Ga0062384_10035766623300004082Bog Forest SoilMSIADSRPYPDATRLAELVQKMRLFKSDLKGIIERAHNSKAISRLIMASGLLDGALEELSKAASEK*
Ga0062384_10047140423300004082Bog Forest SoilMSTAGSRSYPDSSRLAELVQKIRVLKLDLGRIIERRRNSKVGSRLIMASGLLDGALEELSKAMSEQRDEAIDGDV*
Ga0062384_10081524123300004082Bog Forest SoilMSTAASRSYPDPKRLAEVVQKIRVLKLDLSGMIEHSCNSKVGSRLIMASGLLDGALEELSKAGSEQKD*
Ga0062387_10000110223300004091Bog Forest SoilMTNAGSRSQPDSVRLAELVQKIRVFKLDLTRIIEHSRNSKGGSRLIMASGLLDGVLEELAKAESELKD*
Ga0062389_10093453823300004092Bog Forest SoilMGISDSRPYPDATRLAELVQKMRMFKSDLRRIIERGHSSKAVSRLVMASGLLDGVLEELSKAVSELGD*
Ga0058901_154613313300004120Forest SoilRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVWEQRD*
Ga0070761_10001909143300005591SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVSEQRD*
Ga0070761_1043761623300005591SoilMNIADSKPYPDATLLAELVQKIRVVKLDLSRIIEHSRNSKIGSRLIMASGLLDGALDELSNAASEQKE*
Ga0070761_1048999813300005591SoilMSTAGSRLDPHPALLAELIQKIRVLRSDLSRIIEYSRDSKVGSGLIMASGLLDGALEELSKAASEQKG*
Ga0070761_1079091123300005591SoilMSTTGSRLYPDPALLTELIQKIRVLRLDLSRIIEFSRDSKVGSGLIMASGLLDGALEELSKAASQQRD*
Ga0070761_1095538323300005591SoilMTISDSRPHPDATHLGELVQKIRVLKLDLRGIMEHSPNSKVFSRLIMASGLLDGAIEELSKSVSETKD*
Ga0070762_1000516573300005602SoilMGNAGPRLSPDSTRLAELIQKIRALKLDLSRIIEYSRNSEVGSCLIMASGLLDGAIEELSNAKEEQND*
Ga0070762_1097010223300005602SoilMGTAGSSSYSGSIRLAELIQKIRILKSDLSSMINHGGNSKMISRLIMASGLLDGALEELSKAMSEQKD*
Ga0070762_1128935513300005602SoilMGTAGSRSYFGSTRLAELIQKIRILKSDLSGMINHGGNSKVIARLIMAAGLLDGALEELSQADLEQKE*
Ga0070763_1000966843300005610SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVWEQRD*
Ga0070763_1058988723300005610SoilMSTAGSRSYPDATRLAEIVQNIRILKSHLSGIIEHSRNSNVVSRLIMASGLLDIAIEEISKSVSEAKQ*
Ga0070764_1013299413300005712SoilMGSAGSRSYSGSTRLAELIQKIRILKSDLSGMISQGGNSKVISRLIMASGLLDGALEELSQAVSEQKE*
Ga0070764_1068227213300005712SoilMADLRPYPDPTFLYALIQKIRVLRSDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD*
Ga0070765_10019162613300006176SoilADSKPYPDATLLAELVQKIRVVKLDLSRIIEHSRNSKIGSRLIMASGLLDGALDELSNAASEQKE*
Ga0070765_10045683223300006176SoilMNIANSRSYPDATRLAELVQKIRVLKLDLSRIIEHSRDSQVGSRLIMASGLLDGALEELSKAASEQSD*
Ga0070765_10105694313300006176SoilMADLRPYPDPTLLTELVQRIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD*
Ga0070765_10187503713300006176SoilMITEESRSNPDSIHLSQLVQKIRILKSDLSGIIKRSHNSDVASRLIMASGLLDGALEELSKALSGRKD*
Ga0116128_113930413300009518PeatlandSRRYPDATRLAELIQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAASEQRE*
Ga0116136_118279513300009547PeatlandETQTGEGSYKIKLCQRQKGDQTSMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSKAVSEQKD*
Ga0116133_100658483300009623PeatlandQRQKGDQTSMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSKAVSEQKD*
Ga0116133_102050613300009623PeatlandMSTADSRRYPDATRLAELVQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAASEQRE*
Ga0116105_1000055273300009624PeatlandMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSKAVSEQKD*
Ga0116105_117974413300009624PeatlandMSTEESRSNPNSVHLAELVQKIRVLKSDLSGIIKHTGNSEVISRLIMASGLLDGALEELSKALSGRKD*
Ga0116130_107821313300009762PeatlandMSTADSRRYPDATRLAELIQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAALEQRE*
Ga0182018_1006294333300014489PalsaMSTAGSRLYPDPALLAELIQKIRVLRLDLSRIIEYSRDSKVGSRLIMASGLLDGALEELSQAASEQKG*
Ga0182018_1016954933300014489PalsaQEDGRMSIADSRAYPDATLLAELVQKIHVLKLDLSRIIEHSRNSKAGSRLIMASGLLDGAFEELSKAASEQRD*
Ga0187856_133631213300017925PeatlandMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSKAVSEQKD
Ga0187873_105939123300018013PeatlandMSTADSRRYPDATRLAELVQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAALEQRE
Ga0187871_10021114103300018042PeatlandMSTADSRRYPDATRLAELIQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAASEQRE
Ga0181504_133277153300019258PeatlandMSTADSRRYPDATRLAELVQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAASEQRE
Ga0181506_131869943300019260PeatlandYPDATRLAELVQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEVLSKAASEQRE
Ga0210403_1003865023300020580SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVWEQRD
Ga0210395_1088429713300020582SoilPALLAELIQKIRVLRLLLSKILEQSRNSKVDSRLIMASGLLDGAIDELSKAESEQKD
Ga0210401_1017204313300020583SoilMSTAGSRPYPDPALLAELIQKIRVLRLLLSKILEQSRNSKVDSRLIMASGLLDGAIDELSKAESEQKD
Ga0210401_1034125633300020583SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVSEQRD
Ga0210405_100000701233300021171SoilMSTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALGELSKAVWEQRD
Ga0210388_1003541913300021181SoilMGNAGPRLSPDSTRLAELIQKIRALKLDLSRIIEYSRNSEVGSCLIMASGLLDGAIEELSNAKEEQND
Ga0210388_1004927333300021181SoilMSTAGSRLDPHPALLAELIQKIRVLRSDLSRIIEYSRDSKVGSGLIMASGLLDGALEELSKAASEQKG
Ga0210388_1013744513300021181SoilSRSYFGSTRLAELIQKIRILKSDLSGMINHGGNSKVIARLIMAAGLLDGALEELSQADLEQKE
Ga0210388_1022360523300021181SoilMTISDSRPHPDATHLGELVQKIRVLKLDLRGIMEHSPNSKVFSRLIMASGLLDGAIEELSKSVSETKD
Ga0210388_1056949613300021181SoilLVQKIRVVKLDLSRIIEHSRNSKIGSRLIMASGLLDGALDELSNAASEQKQ
Ga0210388_1084133823300021181SoilMADLRPYPDPTLLAELVQKIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD
Ga0210393_1011223223300021401SoilMDTAGSRLYPDSTRLAELVKKIRVLRLDLSRILQHSRNSHIGSHLIMASGLLDGAIEELSKAQSEQKD
Ga0210393_1019687813300021401SoilMADSKPYPDATRLAELVQRIRVLRSDLSGMIEHGRSSKASSRLIMASGLLDGALEELTKAVSEQGETK
Ga0210393_1065039513300021401SoilMSTVDSRPYPDATRLAELVQKIRILKSDLSGIIEHGRNLKAISRLILVSGLLDGALEELSKVASEHR
Ga0210393_1136743723300021401SoilMGTSDPQPYPDAKRIAELVQKIRVLRSDLSGMVEHGRNSNATSRLIMASGLLDEAVQELTKAASEQKG
Ga0210387_10001310243300021405SoilMSTEGSKSYSDPTRLAEFVQKIRVLRLDLSRIIEHSRNSKVGSRLIMASGLLNEPIEELSKAESEQKD
Ga0210386_1100293723300021406SoilQKIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD
Ga0210383_1039060823300021407SoilMATAGSSSYSGSTRLAELIQKIRILKSDLSSMINHGGNSKMISRLIMASGLLDGALEELSKAMSEQKD
Ga0210384_1024206953300021432SoilTAGSRSYPDSTRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVWEQRD
Ga0210391_1001463113300021433SoilMSTAGSGSYPGSTRLAELIQKIRVLKLDLSKILQHSRNSHIGARLIMASGLLDGAIDELSKAESEQKD
Ga0210391_1001750753300021433SoilMNTAGPRLYPDSTRLGELVQKIRMTRSDLSAMIEQSHHSKVISRLIMASGLLDGALEELSKAASEQSG
Ga0210391_1002504133300021433SoilMSTTGSRLYPDPALLTELIQKIRVLRLDLSRIIEFSRDSKVGSGLIMASGLLDGALEELSKAASQQRD
Ga0210398_1011526523300021477SoilMNIADSKPYPDATLLAELVQKIRVVKLDLSRIIEHSRNSKIGSRLIMASGLLDGALDELSNAASEQKE
Ga0210409_1092715513300021559SoilVWSDSKPYPDATRLAELVQKIRLLKSDLSGMIEHSRDSKAISRLIMASGLLDGALEELSKAVSEQRD
Ga0208035_107541713300025406PeatlandMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSK
Ga0208194_100364083300025412PeatlandQRQKGDQTSMVDSRPYPDATRLAELVQRIRVLKSDLSGMIEHGRSSKASSRLIMASGLLDGALEELSKAVSEQKD
Ga0208562_110928413300025460PeatlandPDATRLAELIQRIRVLKLDLSGMIEHGRNSKAISRLIMASGLLDGALEELSKAASEQRE
Ga0209529_100122963300027334Forest SoilMSTAGSRSYPDATRLAEIVQKIRVLKLHLSGIIEHSRNSSVVSRLIMASGLLDIAIEEISKSVSEAKH
Ga0209529_101664313300027334Forest SoilMGNAGPRLNPDSTRLAELIQKIRALKLELSRIIEHSRNSEVGSCLIMASGLLDGAIEELSNAKVEQND
Ga0209010_100432523300027370Forest SoilMSTAGSRLYPAPALLAELIQKIRVLRLDLSRIIEYSRESKVGSRLIMASGLLDGALEELSKAASEQRD
Ga0209222_100895443300027559Forest SoilMATAGSRSYSGSTRLAELIQKIRILKSDLSGMINHGGNAKVISRLIMASGLLDGALEELSQAVSEQKE
Ga0209222_103010923300027559Forest SoilMTISDSRPHPDATHLAELVQKIRVLKLDLRGIMEHNRNSKLISRLIMASGLLDGAIEELSKSVSEAED
Ga0209420_104535533300027648Forest SoilMGTAGSSSYSGSARLAELIQKIRILKSDLSGMINRGGNSQVISRLIMASGLLDGALEELSKAASEQKD
Ga0209007_104259023300027652Forest SoilMSTTGSRLYPDPALLTELIQKIRVLRSDLSRIIEFSRDSKVGSGLIMASGLLDGALEELSKAASQQRD
Ga0209248_1009127923300027729Bog Forest SoilMSIADSRPYPDATRLAELVQKMRLFKSDLKGIIERAHNSKAISRLIMASGLLDGALEELSKAASEK
Ga0209139_1000636853300027795Bog Forest SoilMTNAGSRSQPDSVRLAELVQKIRVFKLDLTRIIEHSRNSKGGSRLIMASGLLDGVLEELAKAESELKD
Ga0209693_1028928913300027855SoilMGTAGSRSYFGSTRLAELIQKIRILKSDLSGMINHGGNSKVIARLIMAAGLLDGALEELSQADLEQKE
Ga0209169_1059888813300027879SoilRGGRMTISDSRPHPDATHLGELVQKIRVLKLDLRGIMEHSPNSKVFSRLIMASGLLDGAIEELSKSVSETKD
Ga0209380_1016081113300027889SoilMATAGSSSYSGSTRLAELIQKIRILKSDLSGMINHGGNSKVIARLIMAAGLLDGALEELSQADLEQKE
Ga0209624_1002196113300027895Forest SoilPTLLAELVQKIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSKAASEQRD
Ga0209624_1042730423300027895Forest SoilMADLRPYPDPTLLTELVQKIRVLRLDLSRIIEHSRSSKVGSRLIMASGLLDGALEELSK
Ga0209624_1104509013300027895Forest SoilMSITGSRSYSGSTRLIEVIEKIRLLKSDLSGIIEYNRNSKVICRLIMASTLLDGALQELSTCVSEQRD
Ga0308309_1095498923300028906SoilADSKPYPDATLLAELVQKIRVVKLDLSRIIEHSRNSKIGSRLIMASGLLDGALDELSNAASEQKE
Ga0308309_1136830613300028906SoilMITEESRSNPDSIHLSQLVQKIRILKSDLSGIIKRSHNSDVASRLIMASGLLDGALEELSKALSGRKD
Ga0310686_10935721623300031708SoilMNIADSKPYPDATLLAELVQKIRVLKLDLSRIIEHSRNSQVGSRLIMASGLLDGALDELSDAASEQKE
Ga0370515_0009015_4422_46283300034163Untreated Peat SoilMSIADSRAYPDATLLAELVQKIHVLKLDLSRIIEHSRNSKAGSRLIMASGLLDGALEELSKAASEQRD
Ga0370515_0022878_1877_20833300034163Untreated Peat SoilMGSAGSRSYSGSTRLAELIQKIRILRADLSGMINHGGNSKVISRLIMAAGLLDGALEELSQAVSEQKD
Ga0370515_0051092_1412_16183300034163Untreated Peat SoilMSTAGSRLYPDPALLAELIQKIRVLRLDLSRIIEYSRDSKVGSRLIMASGLLDGALEELSKAASEQKG
Ga0370515_0185656_489_6953300034163Untreated Peat SoilMSIPDPKPYPDATRLAELVQKLRVFKSDLKAMIERAHNTKIVSRLIMATGLLDGALEELSKAASEERD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.