NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F105205



Overview

Basic Information
Family ID F105205
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 50 residues
Representative Sequence MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFEK
Number of Associated Samples 64
Number of Associated Scaffolds 99
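
The reported average length can be checked against the representative sequence listed above. The snippet below is a minimal sketch of that check; the function name `summarize` is illustrative and not part of NMPFamsDB, and the trailing `*` handling assumes the asterisk seen on many family sequences marks a stop codon rather than a residue.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = "MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFEK"


def summarize(seq: str) -> dict:
    """Return the residue count and per-residue tallies for one sequence.

    A trailing '*' is stripped first (assumed to be a stop marker).
    """
    residues = seq.rstrip("*")
    return {"length": len(residues), "counts": Counter(residues)}


summary = summarize(REPRESENTATIVE)
print(summary["length"])                  # number of residues
print(summary["counts"].most_common(3))   # three most frequent residues
```

Running it on the representative sequence gives a length that matches the 50-residue average reported for the family.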

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 49.00 %
% of genes near scaffold ends (potentially truncated) 38.00 %
% of genes from short scaffolds (< 2000 bps) 75.00 %
Associated GOLD sequencing projects 58
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46


Most Common Taxonomy
Group Archaea (96.000 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine (20.000 % of family members)
Environment Ontology (ENVO) Unclassified (53.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (44.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 24.36%, β-sheet: 7.69%, Coil/Unstructured: 67.95%

Predicted 3D Structure

pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
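
The note applies a fixed cutoff to the pTM-score. As a minimal sketch of that rule (the function name and the treatment of a score exactly at the cutoff are assumptions, not taken from NMPFamsDB):

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above (pTM < 0.7)


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < threshold


# This family's model has pTM-score 0.46.
print(is_low_confidence(0.46))
```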



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 99 Family Scaffolds
PF04019 | DUF359 | 2.02
PF00127 | Copper-bind | 2.02
PF09376 | NurA | 1.01
PF13432 | TPR_16 | 1.01
PF01987 | AIM24 | 1.01
PF13181 | TPR_8 | 1.01
PF10117 | McrBC | 1.01
PF13328 | HD_4 | 1.01
PF00383 | dCMP_cyt_deam_1 | 1.01
PF12705 | PDDEXK_1 | 1.01
PF13361 | UvrD_C | 1.01
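
The frequency values are consistent with whole-scaffold counts divided by the 99 family scaffolds: 2.02 corresponds to 2 of 99 and 1.01 to 1 of 99. The sketch below reproduces that arithmetic. The formula is inferred from the numbers rather than documented on this page, and both function names are illustrative.

```python
TOTAL_SCAFFOLDS = 99  # "99 Family Scaffolds" in the table header


def pct_frequency(count: int, total: int = TOTAL_SCAFFOLDS) -> float:
    """Percentage of scaffolds carrying a domain, rounded to two decimals."""
    return round(100 * count / total, 2)


def implied_count(pct: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Invert a reported percentage to the nearest whole scaffold count."""
    return round(pct * total / 100)


print(pct_frequency(2), pct_frequency(1))
print(implied_count(2.02), implied_count(1.01))
```

Under this reading, each neighboring domain in the table was observed on only one or two scaffolds, so the associations are sparse.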

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 99 Family Scaffolds
COG1909 | Archaeal dephospho-CoA kinase | Coenzyme transport and metabolism [H] | 2.02
COG2013 | AIM24 protein, required for mitochondrial respiration | Energy production and conversion [C] | 1.01



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000167|SI39nov09_120mDRAFT_c1042740 | All Organisms → cellular organisms → Archaea | 973
3300000172|SI34jun09_200mDRAFT_c1048532 | All Organisms → cellular organisms → Archaea | 739
3300000200|SI48aug10_150mDRAFT_c1010496 | All Organisms → cellular organisms → Archaea | 905
3300002919|JGI26061J44794_1067393 | All Organisms → cellular organisms → Archaea | 634
3300003478|JGI26238J51125_1004258 | All Organisms → cellular organisms → Archaea | 4477
3300003492|JGI26245J51145_1005115 | All Organisms → cellular organisms → Archaea | 3639
3300003498|JGI26239J51126_1004534 | All Organisms → cellular organisms → Archaea | 4646
3300003498|JGI26239J51126_1074517 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 602
3300003593|JGI26259J51720_1033483 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 730
3300003594|JGI26258J51719_1007662 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 2318
3300003595|JGI26263J51726_1076971 | All Organisms → cellular organisms → Archaea | 547
3300003894|Ga0063241_1012176 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 4927
3300004109|Ga0008650_1056364 | All Organisms → cellular organisms → Archaea | 1081
3300004110|Ga0008648_10146465 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 650
3300004273|Ga0066608_1157042 | All Organisms → cellular organisms → Archaea | 555
3300004276|Ga0066610_10229887 | All Organisms → cellular organisms → Bacteria | 598
3300004280|Ga0066606_10240782 | All Organisms → cellular organisms → Archaea | 648
3300005838|Ga0008649_10002239 | All Organisms → cellular organisms → Archaea | 14291
3300005838|Ga0008649_10040567 | All Organisms → cellular organisms → Archaea | 2116
3300005838|Ga0008649_10123056 | All Organisms → cellular organisms → Archaea | 1054
3300005969|Ga0066369_10209535 | All Organisms → cellular organisms → Archaea | 636
3300006465|Ga0082250_10044839 | All Organisms → cellular organisms → Archaea | 976
3300006468|Ga0082251_10224349 | All Organisms → cellular organisms → Archaea | 810
3300006900|Ga0066376_10256785 | All Organisms → Viruses → Predicted Viral | 1031
3300008470|Ga0115371_10629300 | All Organisms → cellular organisms → Archaea | 517
3300008470|Ga0115371_10939029 | All Organisms → cellular organisms → Archaea | 846
3300008627|Ga0115656_1133480 | All Organisms → cellular organisms → Archaea | 1074
3300008627|Ga0115656_1185066 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 812
3300009030|Ga0114950_10056812 | All Organisms → cellular organisms → Archaea | 3087
3300009030|Ga0114950_10342155 | All Organisms → cellular organisms → Archaea | 1203
3300009030|Ga0114950_10343778 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1200
3300009030|Ga0114950_10389062 | All Organisms → cellular organisms → Archaea | 1120
3300009030|Ga0114950_10720019 | All Organisms → cellular organisms → Archaea | 789
3300009030|Ga0114950_11278040 | All Organisms → cellular organisms → Archaea | 570
3300009030|Ga0114950_11335726 | All Organisms → cellular organisms → Archaea | 556
3300009030|Ga0114950_11377668 | All Organisms → cellular organisms → Archaea | 546
3300009102|Ga0114948_11134864 | All Organisms → cellular organisms → Archaea | 616
3300009106|Ga0117917_1044583 | All Organisms → cellular organisms → Archaea | 1798
3300009139|Ga0114949_10800427 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon CG15_BIG_FIL_POST_REV_8_21_14_020_37_12 | 753
3300009139|Ga0114949_10814961 | All Organisms → cellular organisms → Archaea | 746
3300009139|Ga0114949_11046736 | All Organisms → cellular organisms → Archaea | 652
3300009139|Ga0114949_11368450 | All Organisms → cellular organisms → Archaea | 565
3300009481|Ga0114932_10041198 | All Organisms → cellular organisms → Archaea | 3009
3300009481|Ga0114932_10538134 | All Organisms → cellular organisms → Archaea | 686
3300009703|Ga0114933_10021440 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → Candidatus Nitrosotenuis chungbukensis | 5036
3300009703|Ga0114933_10374092 | All Organisms → cellular organisms → Archaea | 936
3300009703|Ga0114933_10881631 | All Organisms → cellular organisms → Archaea | 568
3300009725|Ga0123372_103279 | All Organisms → cellular organisms → Archaea | 518
3300009747|Ga0123363_1036313 | All Organisms → cellular organisms → Archaea | 757
3300009748|Ga0123370_1069264 | All Organisms → cellular organisms → Archaea | 845
3300009748|Ga0123370_1115763 | All Organisms → cellular organisms → Archaea | 919
3300009753|Ga0123360_1020110 | All Organisms → cellular organisms → Archaea | 1461
3300009753|Ga0123360_1143840 | All Organisms → cellular organisms → Archaea | 963
3300010135|Ga0123382_1011900 | All Organisms → Viruses → Predicted Viral | 1143
3300011013|Ga0114934_10317940 | All Organisms → cellular organisms → Archaea | 700
3300011112|Ga0114947_10841656 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 674
3300020230|Ga0212167_1283489 | All Organisms → cellular organisms → Archaea | 1104
3300020231|Ga0212168_1292565 | All Organisms → cellular organisms → Archaea | 2012
3300020234|Ga0212227_1238062 | All Organisms → cellular organisms → Archaea | 1020
3300020234|Ga0212227_1351409 | All Organisms → cellular organisms → Archaea | 1448
3300020234|Ga0212227_1375705 | All Organisms → cellular organisms → Archaea | 1276
3300020234|Ga0212227_1426955 | All Organisms → cellular organisms → Archaea | 2345
3300020235|Ga0212228_1112891 | All Organisms → cellular organisms → Archaea | 1298
3300020235|Ga0212228_1192804 | All Organisms → Viruses → Predicted Viral | 2085
3300020235|Ga0212228_1317190 | All Organisms → cellular organisms → Archaea | 1619
3300020235|Ga0212228_1409841 | All Organisms → cellular organisms → Archaea | 1298
3300020235|Ga0212228_1409841 | All Organisms → cellular organisms → Archaea | 1298
3300020235|Ga0212228_1450607 | All Organisms → cellular organisms → Archaea | 1230
(restricted) 3300022888|Ga0233428_1007305 | All Organisms → cellular organisms → Archaea | 7190
(restricted) 3300022888|Ga0233428_1019455 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 3365
(restricted) 3300022888|Ga0233428_1154289 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 789
(restricted) 3300022902|Ga0233429_1094189 | All Organisms → cellular organisms → Archaea | 1228
(restricted) 3300022902|Ga0233429_1101981 | All Organisms → cellular organisms → Archaea | 1158
(restricted) 3300022933|Ga0233427_10025401 | All Organisms → cellular organisms → Archaea | 3529
3300024058|Ga0209997_10327344 | All Organisms → cellular organisms → Archaea | 753
3300024060|Ga0209987_10039528 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis | 2642
(restricted) 3300024256|Ga0233446_1187566 | All Organisms → cellular organisms → Archaea | 547
(restricted) 3300024257|Ga0233442_1154295 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 601
(restricted) 3300024260|Ga0233441_1239007 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 526
(restricted) 3300024299|Ga0233448_1125210 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 677
(restricted) 3300024302|Ga0233449_1018515 | All Organisms → cellular organisms → Archaea | 3321
(restricted) 3300024327|Ga0233434_1265567 | All Organisms → cellular organisms → Archaea | 595
(restricted) 3300024327|Ga0233434_1333776 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 503
3300024344|Ga0209992_10001072 | All Organisms → cellular organisms → Archaea | 31265
3300024344|Ga0209992_10015271 | All Organisms → cellular organisms → Archaea | 4419
3300024344|Ga0209992_10018923 | All Organisms → cellular organisms → Archaea | 3797
3300024344|Ga0209992_10023813 | All Organisms → cellular organisms → Archaea | 3239
3300024431|Ga0209988_10147128 | All Organisms → cellular organisms → Archaea | 1416
3300025547|Ga0209556_1112430 | All Organisms → cellular organisms → Archaea | 581
3300025660|Ga0209045_1160189 | All Organisms → cellular organisms → Archaea | 629
3300025667|Ga0209043_1135513 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 621
3300025672|Ga0209663_1090803 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 940
3300025681|Ga0209263_1015341 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 3073
3300025688|Ga0209140_1147073 | All Organisms → cellular organisms → Archaea | 691
3300026253|Ga0208879_1051094 | All Organisms → cellular organisms → Archaea | 1985
3300028173|Ga0257118_1090562 | All Organisms → cellular organisms → Archaea | 691
3300028174|Ga0257123_1082133 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 755
3300028175|Ga0257117_1062159 | All Organisms → cellular organisms → Archaea | 963
3300028198|Ga0257121_1009843 | All Organisms → cellular organisms → Archaea | 5118
3300028198|Ga0257121_1036666 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 2146

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 20.00%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 17.00%
Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Sediment | 14.00%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 13.00%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 11.00%
Deep Subsurface | Environmental → Aquatic → Marine → Volcanic → Unclassified → Deep Subsurface | 10.00%
Marine | Environmental → Aquatic → Marine → Inlet → Unclassified → Marine | 5.00%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 4.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 3.00%
Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment | 2.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 1.00%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000167 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - 39 11/10/09 120m | Environmental
3300000172 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - 34 06/16/09 200m | Environmental
3300000200 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - 48 08/11/10 150m | Environmental
3300002919 | Marine microbial communities from the Southern Atlantic Ocean, analyzing organic carbon cycling - Bottom_A/KNORR_S2/LV | Environmental
3300003478 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_100m_DNA | Environmental
3300003492 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S4LV_200m_DNA | Environmental
3300003498 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_130m_DNA | Environmental
3300003593 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI074_LV_100m_DNA | Environmental
3300003594 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI074_LV_10m_DNA | Environmental
3300003595 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI074_LV_200m_DNA | Environmental
3300003894 | Marine microbial communities from the northern Gulf of Mexico hypoxic zone - Cultivation independent assessment | Environmental
3300004109 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S2LV_150m_DNA | Environmental
3300004110 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S2LV_100m_DNA | Environmental
3300004273 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_135m | Environmental
3300004276 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_165m | Environmental
3300004280 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_100m | Environmental
3300005838 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S2LV_130m_DNA | Environmental
3300005969 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S7_td_Bottom_ad_4513_LV_A | Environmental
3300006465 | Deep-sea sediment bacterial and archaeal communities from Fram Strait - Hausgarten IX | Environmental
3300006468 | Deep-sea sediment bacterial and archaeal communities from Fram Strait - Combined Assembly of Gp0119454, Gp0119453, Gp0119452, Gp0119451 | Environmental
3300006900 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S15_td_Bottom_ad_5009_LV_A | Environmental
3300008470 | Sediment core microbial communities from Adelie Basin, Antarctica. Combined Assembly of Gp0136540, Gp0136562, Gp0136563 | Environmental
3300008627 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 247m, 2.7-0.2um | Environmental
3300009030 | Deep subsurface microbial communities from Kermadec Trench to uncover new lineages of life (NeLLi) - N075 metaG | Environmental
3300009102 | Deep subsurface microbial communities from Mariana Trench to uncover new lineages of life (NeLLi) - CR04 metaG | Environmental
3300009106 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, May cruise - 295m, 2.7-0.2um, replicate a | Environmental
3300009139 | Deep subsurface microbial communities from Kermadec Trench to uncover new lineages of life (NeLLi) - N074 metaG | Environmental
3300009481 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG | Environmental
3300009703 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 4SBTROV12_W25 metaG | Environmental
3300009725 | Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_229_18m (Metagenome Metatranscriptome) | Environmental
3300009747 | Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_197_2m (Metagenome Metatranscriptome) | Environmental
3300009748 | Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_210_18m (Metagenome Metatranscriptome) | Environmental
3300009753 | Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_190_18m (Metagenome Metatranscriptome) | Environmental
3300010135 | Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_257_18m (Metagenome Metatranscriptome) | Environmental
3300011013 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 4SBTROV10_white metaG | Environmental
3300011112 | Deep subsurface microbial communities from Mariana Trench to uncover new lineages of life (NeLLi) - CR02 metaG | Environmental
3300020230 | Deep-sea sediment microbial communities from the Mariana Trench, Pacific Ocean - CR02 | Environmental
3300020231 | Deep-sea sediment microbial communities from the Mariana Trench, Pacific Ocean - CR04 | Environmental
3300020234 | Deep-sea sediment microbial communities from the Kermadec Trench, Pacific Ocean - N074 | Environmental
3300020235 | Deep-sea sediment microbial communities from the Kermadec Trench, Pacific Ocean - N075 | Environmental
3300022888 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_120_MG | Environmental
3300022902 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_135_MG | Environmental
3300022933 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_100_MG | Environmental
3300024058 | Deep subsurface microbial communities from Mariana Trench to uncover new lineages of life (NeLLi) - CR04 metaG (SPAdes) | Environmental
3300024060 | Deep subsurface microbial communities from Kermadec Trench to uncover new lineages of life (NeLLi) - N074 metaG (SPAdes) | Environmental
3300024256 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_124_October2016_120_MG | Environmental
3300024257 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_123_September2016_150_MG | Environmental
3300024260 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_123_September2016_135_MG | Environmental
3300024299 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_124_October2016_150_MG | Environmental
3300024302 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_124_October2016_200_MG | Environmental
3300024327 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_120_MG | Environmental
3300024344 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG (SPAdes) | Environmental
3300024431 | Deep subsurface microbial communities from Kermadec Trench to uncover new lineages of life (NeLLi) - N075 metaG (SPAdes) | Environmental
3300025547 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_150m_DNA (SPAdes) | Environmental
3300025660 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI073_LV_10m_DNA (SPAdes) | Environmental
3300025667 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S4LV_100m_DNA (SPAdes) | Environmental
3300025672 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - Saanich Inlet SI073_LV_135m_DNA (SPAdes) | Environmental
3300025681 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_100m (SPAdes) | Environmental
3300025688 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI073_LV_120m_DNA (SPAdes) | Environmental
3300026253 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S15_td_Bottom_ad_5009_LV_A (SPAdes) | Environmental
3300028173 | Marine microbial communities from Saanich Inlet, British Columbia, Canada - SI112_150m | Environmental
3300028174 | Marine microbial communities from Saanich Inlet, British Columbia, Canada - SI106_135 | Environmental
3300028175 | Marine microbial communities from Saanich Inlet, British Columbia, Canada - SI112_135m | Environmental
3300028198 | Marine microbial communities from Saanich Inlet, British Columbia, Canada - SI106_100 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
SI39nov09_120mDRAFT_10427403 | 3300000167 | Marine | MRCKICEIPHKNTAKYDLWLEHKICRVCGSIFDLFSWNGNNLGEYWRMEN*
SI34jun09_200mDRAFT_10485323 | 3300000172 | Marine | MRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEYWRIKN*
SI48aug10_150mDRAFT_10104961 | 3300000200 | Marine | KEIRLMRCKICEIPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26061J44794_10673932 | 3300002919 | Marine | KICEITHKNTAKYDLWIENQMCKVCGQILDFFSWNGNNLGEYWRFGK*
JGI26238J51125_10042585 | 3300003478 | Marine | MRCKICXXPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26245J51145_10051155 | 3300003492 | Marine | MRCKICEIPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26239J51126_10045348 | 3300003498 | Marine | MRCKICEIPHKNTGKYDLWLEHKVXRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26239J51126_10745173 | 3300003498 | Marine | ICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH*
JGI26259J51720_10334832 | 3300003593 | Marine | LIPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26258J51719_10076625 | 3300003594 | Marine | RCKICEIPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
JGI26263J51726_10769713 | 3300003595 | Marine | GEKKMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH*
Ga0063241_10121769 | 3300003894 | Marine | MRCKICQITHKNTGKYDLWVEYQVCRICAQMLELFSWNGNNLGEYWGVRN*
Ga0008650_10563641 | 3300004109 | Marine | PHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK*
Ga0008648_101464651 | 3300004110 | Marine | KNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH*
Ga0066608_11570421 | 3300004273 | Marine | KMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH*
Ga0066610_102298873 | 3300004276 | Marine | HKNTAKYDLWLEHQVCRDCGNLFDLFSWNGNNLGAYWRIGN*
Ga0066606_102407823 | 3300004280 | Marine | CKICDITHKNTSKYDLWIENQVCRICGQVLDFFSWNGNNLGDYWRIGK*
Ga0008649_100022398 | 3300005838 | Marine | MRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH*
Ga0008649_100405672 | 3300005838 | Marine | MRCKICDITHKNTSKYDLWIENQVCRVCGQVLDFFSWNGNNLGDYWRIGK*
Ga0008649_101230565 | 3300005838 | Marine | PHKNTSKFDLWVENQVCRTCSQIIDLFSWNGNNLGEYWRFVN*
Ga0066369_102095352 | 3300005969 | Marine | MRCKICEITHKNTAKYDLWIENQMCKVCGQILDFFSWNGNNLGEYWRFGK*
Ga0082250_100448393 | 3300006465 | Sediment | MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFEK*
Ga0082251_102243493 | 3300006468 | Sediment | MRCKICNIPHKNTSKYDLWLENQVCRVCAHILDLFSWNGNNLGEYWRFSN*
Ga0066376_102567854 | 3300006900 | Marine | MRCKICEITHKNTAKYDLWIENQMCRVCGQILDFFSWNGNNLGEYWRFGK*
Ga0115371_106293002 | 3300008470 | Sediment | EKKMRCKICNIPHKNTAKYDLWIENQVCRVCGQILDLFSWNGNNLGEYWRFRN*
Ga0115371_109390293 | 3300008470 | Sediment | MRCKICEIPHKNTSKFDLSIENQVCRTCGQILDLFSWNGNNLGGYWRFGN*
Ga0115656_11334805 | 3300008627 | Marine | MKLTKSGGIKMRCKICEIPHKNTAKFDLWKENQVCRICGQLFDLFSWNGNNLGEYWRIGN
Ga0115656_11850663 | 3300008627 | Marine | MRCKICEIPHKNTRKYDLWIENQVCRVCGHLMDFFSLNGNNLGHYWRIGN*
Ga0114950_100568128 | 3300009030 | Deep Subsurface | MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLEEYWRFSN*
Ga0114950_103421552 | 3300009030 | Deep Subsurface | MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWGSGK*
Ga0114950_103437781 | 3300009030 | Deep Subsurface | CKICNIPHKNTSQHELWIENQVCRICGHILDLFSWNGNNLGEYWRFGK*
Ga0114950_103890624 | 3300009030 | Deep Subsurface | MRCKICDISHKNTGKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFSN*
Ga0114950_107200193 | 3300009030 | Deep Subsurface | MRCKICNIPHKNTSKYDLWLENQVCRVCAHILDLFSWNGNNLGEYWRFGK*
Ga0114950_112780402 | 3300009030 | Deep Subsurface | MRCKICNIPHKNTSKHEFWLEYQVCRICGQILDLFSWNGNNLGEYWRFGK*
Ga0114950_113357261 | 3300009030 | Deep Subsurface | MRCKICEIPHKNTAKYDLWLENQVCRVCAQILDLFSWNGNNLGEYWRFGK*
Ga0114950_113776683 | 3300009030 | Deep Subsurface | MRCKICNTPHKITAKYDLWLENQVCRVFGQILDLFSWNGNNLG
Ga0114948_111348642 | 3300009102 | Deep Subsurface | MRCKICDIPHKNTGKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFSN*
Ga0117917_10445832 | 3300009106 | Marine | MRCKICEIPHKNTAKYDLWIENQVCRICGQLMDFFSLNGNNLGEYWRFGN*
Ga0114949_108004272 | 3300009139 | Deep Subsurface | MRCKICNIPHKNTGKYDLWIENQVCRVCGQILDLFSWNGNNLEEYWRFSN*
Ga0114949_108149613 | 3300009139 | Deep Subsurface | LGYEKMRCKICNTPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFSN*
Ga0114949_110467361 | 3300009139 | Deep Subsurface | MRCKICNIPHKNTAKYDLWIENQVCRVCAQILDLFSWNGNNLGEYWRF
Ga0114949_113684502 | 3300009139 | Deep Subsurface | MRCKICNIPHKNTAKYDLWLENQVCRVCAQILDLFSWNGNNLGEYWRFGK*
Ga0114932_100411986 | 3300009481 | Deep Subsurface | MRCKICNISHKNTGKYDLWLDNQICRVCGQIFDFFSWNGNNLENYWRCEK*
Ga0114932_105381342 | 3300009481 | Deep Subsurface | MRCKICDISHKNTGKYDLWIDNQICRVCGQVLDFFSWNGNNLESYWRYEN*
Ga0114933_100214408 | 3300009703 | Deep Subsurface | MSMRCKICDISHKNTGKYDLWLENQVCRFCGQIIDLFSWNGN
Ga0114933_103740923 | 3300009703 | Deep Subsurface | MRCKICNISHKNTGKYDLWLDNQICRVCGQIFDFFSWNGNNLENYWRNEN*
Ga0114933_108816312 | 3300009703 | Deep Subsurface | MRCKICEITHKNTSKYDLWLENQMCRGCCQLFDIFSWNGNN
Ga0123372_1032792 | 3300009725 | Marine | MRCKICQITHKNTGKYDLWTEYQVCRICGQILDLFTWNGNNLGEYWKKEAIA*
Ga0123363_10363133 | 3300009747 | Marine | MRCKICQITHKNTGKYDLWTEHQVCRVCGQMLDLFSWNGNNLGEYWKKEVIAQ*
Ga0123370_10692643 | 3300009748 | Marine | MRCKICQITHKNTGKYDLWTEHQVCRICGQILDLFTWNGNNLGE
Ga0123370_11157631 | 3300009748 | Marine | RVGQNFGGNKMRCKICQITHKNTGKYDLWTEYQVCRICGQILDLFTWNGNNLGEYWKKEAIA*
Ga0123360_10201103 | 3300009753 | Marine | MRCKICQITHKNTGKYDLWTEHQVCRICGQILDLFTWNGNNLGEYWKKEAIA*
Ga0123360_11438401 | 3300009753 | Marine | MRCKICQITHKNTGKYALWTEHQVCRVCGQMLDLFSWNGNNLGEY
Ga0123382_10119002 | 3300010135 | Marine | MRCKICQITHKNTGKYDLWTEHQVCRVCGQMLDFFSWNGNNLGEYWKKQVIAQ*
Ga0114934_103179401 | 3300011013 | Deep Subsurface | EKMRCKICDISHKNTGKYDLWIDNQICRVCGQVLDFFSWNGNNLESYWRYEN*
Ga0114947_108416562 | 3300011112 | Deep Subsurface | MRCKICNIPHKNTAKYDLWIENQVCRVCGQILDLFSWNGNNLGEYWRFSN*
Ga0212167_12834893 | 3300020230 | Sediment | MRCKICNITHKNTAKYDLWLENQVCRVCAQILDLFSWNGNNLGEYWRFGK
Ga0212168_12925656 | 3300020231 | Sediment | MRCKICNIPHKNTAKYDLWIENQVCRVCAQILDLFSWNGNNLGEYWRFSN
Ga0212227_12380623 | 3300020234 | Sediment | MRCKICEIPHKNTGKYDLWTENQVCRICGQILDLFSWNGNNLREYWRFGK
Ga0212227_13514095 | 3300020234 | Sediment | MRCKICNIPHKNTAKYDLWLENQVCRVCAQILDLFSWNGNNLGEYWRLSN
Ga0212227_13757053 | 3300020234 | Sediment | MRCKICQISHKNTAKYDLWLENQVCRVCGQIMDLFSWNGNNLSEYWRFSN
Ga0212227_14269554 | 3300020234 | Sediment | MRCKICDISHKNTGKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFSN
Ga0212228_11128913 | 3300020235 | Sediment | MRCKICNIPHKNTGKYDLWVENQVCRVCGQILDLFSWNGNNLGEYWRFGK
Ga0212228_11928042 | 3300020235 | Sediment | MRCKICNIPHKNTSKHEFWLEYQVCRICGQILDLFSWNGNNLGEYWRFGK
Ga0212228_13171901 | 3300020235 | Sediment | MRCKICDIPHKNTGRHELWIEQRICRICAQILDLFSWNGNYLQEYWRTLN
Ga0212228_14098413 | 3300020235 | Sediment | MRCKICEIPHKNTAKYDLWLENQVCRVCAQILDLFSWNGNNLGEYWRFGK
Ga0212228_14098415 | 3300020235 | Sediment | CKICNIPHKNTSQHELWIENQVCRICGHILDLFSWNGNNLGEYWRFGK
Ga0212228_14506073 | 3300020235 | Sediment | MRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWGSGK
(restricted) Ga0233428_10073058 | 3300022888 | Seawater | MRCKICEIPHKNTAKYDLWLEHKICRVCGSIFDLFSWNGNNLGEYWRMEN
(restricted) Ga0233428_10194555 | 3300022888 | Seawater | MRCKICEIPHKNTGKYDLWLEHKVCRVCGSLFDLFSWNGNYLVEYWRSGK
(restricted) Ga0233428_11542893 | 3300022888 | Seawater | MRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEYWRIKN
(restricted) Ga0233429_10941892 | 3300022902 | Seawater | VRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEYWRIEN
(restricted) Ga0233429_11019811 | 3300022902 | Seawater | QNLGNEKLRCKICQITHKNTAKYDLWLEHQVCRDCGNLFDLFSWNGNNLGAYWRIGN
(restricted) Ga0233427_1002540163300022933SeawaterMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH
Ga0209997_1032734413300024058Deep SubsurfaceMDKVTTSLENKAGGTKMRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNL
Ga0209987_1003952873300024060Deep SubsurfaceMRCKICNIPHKNTAKYDLWLENQVCRVCGQILDLFSWNGNNLEEYWRFSN
(restricted) Ga0233446_118756613300024256SeawaterKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH
(restricted) Ga0233442_115429513300024257SeawaterMRCKICEIPHKNTAKYDLWLEHKICRVCGSIFDLFSWNGNNLGE
(restricted) Ga0233441_123900713300024260SeawaterMRCKICEIPHKNTAKYDLWLEHKICRVCGSIFDLFSWNGNNLGEYWRME
(restricted) Ga0233448_112521013300024299SeawaterMRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEY
(restricted) Ga0233449_101851583300024302SeawaterFKNSTRKREDEIMRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEYWRIKN
(restricted) Ga0233434_126556723300024327SeawaterMRCKICEIPHKNTAKYDLWIQNQVCRVCGHLLDFFTLNGNNLGEYWRFVN
(restricted) Ga0233434_133377623300024327SeawaterMRCKICEIPHKNTTKYDLWLEHKVCRVCGSLFDLFSWNGNNLGEYWRIEN
Ga0209992_1000107233300024344Deep SubsurfaceMRCKICDISHKNTGKYDLWIDNQICRVCGQVLDFFSWNGNNLESYWRYEN
Ga0209992_1001527163300024344Deep SubsurfaceMRCKICNISHKNTGKYDLWLDNQICRVCGQIFDFFSWNGNNLENYWRCEK
Ga0209992_1001892363300024344Deep SubsurfaceMSMRCKICDISHKNTGKYDLWLENQVCRFCGQIIDLFSWNGNNLEAYWRYDIDVSKM
Ga0209992_1002381353300024344Deep SubsurfaceMRCKICNISHKNTGKYDLWLDNQICRVCGQIFDFFSWNGNNLENYWRNEN
Ga0209988_1014712813300024431Deep SubsurfaceTGKYDLWLENQVCRVCGQILDLFSWNGNNLGEYWRFSN
Ga0209556_111243023300025547MarineMRCKICDITHKNTSKYDLWIENQVCRVCGQVLDFFSWNGNNLGDYWRIGK
Ga0209045_116018913300025660MarineICEIPHKNTAKYDLWLEHKICRVCGSIFDLFSWNGNNLGEYWRMEN
Ga0209043_113551333300025667MarineRKDLGEKKMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH
Ga0209663_109080353300025672MarineMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWR
Ga0209263_1015341113300025681MarineMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRF
Ga0209140_114707333300025688MarineMRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLGEYWRFKH
Ga0208879_105109443300026253MarineMRCKICEITHKNTAKYDLWIENQMCKVCGQILDFFSWNGNNLGEYWRFGK
Ga0257118_109056213300028173MarineTHKNTAKYDLWLEHQVCRDCGNLFDLFSWNGNNLGAYWRIGN
Ga0257123_108213323300028174MarineMRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLEEYWRIEN
Ga0257117_106215913300028175MarineMRCKICEIPHKNTAKYDLWLEHKVCRVCGSLFDLFSWNGNNLGEYWRNKI
Ga0257121_100984393300028198MarineMRCKICEITHKNTSKYDLWLENQMCRVCCQLLDIFSWNGNNLGEYWRFVN
Ga0257121_103666663300028198MarineFGGGKMRCKICDISHKNTGKYDLWLDNQLCRICGQIFDFFSWNGNNLGEYWRFKH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice