NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F099886

Overview

Basic Information
Family ID F099886
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 46 residues
Representative Sequence VLGVKKTISIKGVISFSLFFLNERITNNIIEIIIAINIMDSTIQPTTGEH
Number of Associated Samples 45
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 54.37 %
% of genes near scaffold ends (potentially truncated) 57.28 %
% of genes from short scaffolds (< 2000 bps) 80.58 %
Associated GOLD sequencing projects 38
AlphaFold2 3D model prediction No
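
The percentages above can be read back as whole-gene counts, since the family has 103 sequences. The following is a minimal Python sketch of that arithmetic. The function name `percent_to_count` and the dictionary labels are illustrative and not part of NMPFamsDB, and the sketch assumes each percentage was computed over all 103 family members.

```python
# Assumption: every percentage in the Quality Assessment block is taken
# over the 103 family members reported under Basic Information.
FAMILY_SIZE = 103


def percent_to_count(percent: float, total: int) -> int:
    """Convert a reported percentage of `total` to the nearest whole count."""
    return round(percent * total / 100)


# Values copied from the Quality Assessment block; labels are shortened.
reported = {
    "valid RBS motifs": 54.37,
    "near scaffold ends": 57.28,
    "short scaffolds (< 2000 bps)": 80.58,
}

counts = {label: percent_to_count(p, FAMILY_SIZE) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to 56, 59, and 83 genes respectively.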


Most Common Taxonomy
Group Bacteria (61.165 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine (64.078 % of family members)
Environment Ontology (ENVO) Unclassified (98.058 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (86.408 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution:
α-helix: 40.00%
β-sheet: 16.00%
Coil/Unstructured: 44.00%
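
As a rough illustration, the secondary structure distribution above can be turned into residue counts. The sketch below assumes that the prediction was made on the representative sequence listed in the Overview, which this page does not state explicitly. The function name `residues_per_state` is hypothetical.

```python
# Representative sequence copied from the Overview section.
REPRESENTATIVE = "VLGVKKTISIKGVISFSLFFLNERITNNIIEIIIAINIMDSTIQPTTGEH"

# Percentages copied from the predicted secondary structure distribution.
DISTRIBUTION = {
    "alpha-helix": 40.00,
    "beta-sheet": 16.00,
    "coil/unstructured": 44.00,
}


def residues_per_state(sequence: str, percentages: dict) -> dict:
    """Scale each percentage by the sequence length and round to whole residues."""
    n = len(sequence)
    return {state: round(p * n / 100) for state, p in percentages.items()}


per_state = residues_per_state(REPRESENTATIVE, DISTRIBUTION)
print(len(REPRESENTATIVE), per_state)
# prints: 50 {'alpha-helix': 20, 'beta-sheet': 8, 'coil/unstructured': 22}
```

The three percentages map to whole residue counts on a 50-residue sequence, which is consistent with the assumption but does not prove it.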

Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 103 Family Scaffolds
PF08238  Sel1             9.71
PF00656  Peptidase_C14    5.83
PF00589  Phage_integrase  3.88
PF07661  MORN_2           1.94
PF00149  Metallophos      1.94
PF13156  Mrr_cat_2        0.97
PF14072  DndB             0.97
PF01844  HNH              0.97
PF00565  SNase            0.97
PF02086  MethyltransfD12  0.97
PF08281  Sigma70_r4_2     0.97
PF00271  Helicase_C       0.97
PF00145  DNA_methylase    0.97
PF13087  AAA_12           0.97
PF08487  VIT              0.97
PF13532  2OG-FeII_Oxy_2   0.97
PF08837  DUF1810          0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 103 Family Scaffolds
COG4249  Uncharacterized conserved protein, contains caspase domain  General function prediction only [R]  5.83
COG2849  Antitoxin component YwqK of the YwqJK toxin-antitoxin module  Defense mechanisms [V]  1.94
COG0270  DNA-cytosine methylase  Replication, recombination and repair [L]  0.97
COG0338  DNA-adenine methylase  Replication, recombination and repair [L]  0.97
COG3392  Adenine-specific DNA methylase  Replication, recombination and repair [L]  0.97
COG5579  Uncharacterized conserved protein, DUF1810 family  Function unknown [S]  0.97



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
All Organisms  root  All Organisms  64.08 %
Unclassified   root  N/A            35.92 %


Associated Scaffolds


Scaffold  Taxonomy  Length (bp)
3300000250|LPfeb09P261000mDRAFT_1029157  All Organisms → cellular organisms → Bacteria  838
3300003478|JGI26238J51125_1014870  All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon  2007
3300004276|Ga0066610_10020474  Not Available  2575
3300005399|Ga0066860_10202669  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  674
3300005408|Ga0066848_10107466  Not Available  758
3300006164|Ga0075441_10006126  All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae  5271
3300006164|Ga0075441_10009241  All Organisms → cellular organisms → Bacteria → Proteobacteria  4216
3300006164|Ga0075441_10044404  Not Available  1778
3300006164|Ga0075441_10052612  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1612
3300006164|Ga0075441_10115867  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1022
3300006164|Ga0075441_10123049  Not Available  987
3300006164|Ga0075441_10142304  Not Available  906
3300006164|Ga0075441_10172457  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  811
3300006164|Ga0075441_10303841  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  582
3300006164|Ga0075441_10393741  Not Available  502
3300006190|Ga0075446_10024349  All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Synechococcales → Prochlorococcaceae → Prochlorococcus → unclassified Prochlorococcus → Prochlorococcus sp. MIT 0604  2003
3300006190|Ga0075446_10135646  All Organisms → cellular organisms → Bacteria  707
3300006190|Ga0075446_10144661  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  679
3300006191|Ga0075447_10003221  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  7314
3300006191|Ga0075447_10019669  All Organisms → cellular organisms → Bacteria  2693
3300006191|Ga0075447_10034583  All Organisms → cellular organisms → Bacteria → Proteobacteria  1928
3300006191|Ga0075447_10053769  Not Available  1477
3300006191|Ga0075447_10055292  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1452
3300006191|Ga0075447_10068687  Not Available  1270
3300006191|Ga0075447_10078333  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1170
3300006191|Ga0075447_10084337  Not Available  1117
3300006191|Ga0075447_10114631  Not Available  925
3300006191|Ga0075447_10141769  All Organisms → cellular organisms → Bacteria  811
3300006191|Ga0075447_10202463  Not Available  652
3300006191|Ga0075447_10247208  All Organisms → cellular organisms → Bacteria  578
3300006191|Ga0075447_10278867  Not Available  538
3300006193|Ga0075445_10037788  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1972
3300006193|Ga0075445_10041201  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  1873
3300006193|Ga0075445_10123379  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  948
3300006193|Ga0075445_10162712  All Organisms → cellular organisms → Bacteria  796
3300006352|Ga0075448_10053592  Not Available  1287
3300006947|Ga0075444_10006720  All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → Prevotella buccalis  6554
3300006947|Ga0075444_10026379  All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Arenitalea → Arenitalea lutea  2940
3300006947|Ga0075444_10027337  All Organisms → cellular organisms → Bacteria  2878
3300006947|Ga0075444_10124491  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1104
3300006947|Ga0075444_10144765  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  1000
3300006947|Ga0075444_10267458  Not Available  668
3300006947|Ga0075444_10316582  Not Available  599
3300006947|Ga0075444_10365406  Not Available  547
3300006947|Ga0075444_10389578  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  525
3300006947|Ga0075444_10412814  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  506
3300006947|Ga0075444_10417823  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium  502
3300008253|Ga0105349_10017229  All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon  3059
3300009409|Ga0114993_10567177  All Organisms → cellular organisms → Bacteria  838
3300009420|Ga0114994_10391112  All Organisms → cellular organisms → Bacteria  921
3300009420|Ga0114994_10627189  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  704
3300009472|Ga0115554_1352965  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  578
3300009512|Ga0115003_10523143  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  694
3300009512|Ga0115003_10620319  Not Available  631
3300009705|Ga0115000_10667576  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  643
3300009705|Ga0115000_10897458  Not Available  542
3300009785|Ga0115001_10921854  Not Available  524
3300010883|Ga0133547_11727872  All Organisms → cellular organisms → Bacteria  1161
3300020376|Ga0211682_10092993  Not Available  1212
3300020376|Ga0211682_10120358  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1046
3300020376|Ga0211682_10385434  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Pseudothioglobus → Candidatus Pseudothioglobus singularis → Candidatus Pseudothioglobus singularis PS1  512
3300021084|Ga0206678_10348155  Not Available  705
(restricted) 3300024261|Ga0233439_10041108  Not Available  2765
(restricted) 3300024327|Ga0233434_1019289  All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon  3819
3300027522|Ga0209384_1011353  All Organisms → cellular organisms → Bacteria → Proteobacteria  3136
3300027668|Ga0209482_1037753  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Thiotrichales → Piscirickettsiaceae → unclassified Piscirickettsiaceae → Piscirickettsiaceae bacterium  1877
3300027668|Ga0209482_1059008  Not Available  1362
3300027668|Ga0209482_1061807  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1318
3300027668|Ga0209482_1095484  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  963
3300027668|Ga0209482_1102616  All Organisms → cellular organisms → Bacteria  914
3300027668|Ga0209482_1108450  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  877
3300027672|Ga0209383_1013954  All Organisms → cellular organisms → Bacteria  3660
3300027672|Ga0209383_1127577  Not Available  816
3300027672|Ga0209383_1145530  All Organisms → cellular organisms → Bacteria  741
3300027672|Ga0209383_1171358  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  656
3300027672|Ga0209383_1188149  Not Available  611
3300027687|Ga0209710_1104342  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  1112
3300027699|Ga0209752_1110533  Not Available  817
3300027704|Ga0209816_1008324  All Organisms → cellular organisms → Bacteria → Proteobacteria  6310
3300027704|Ga0209816_1019254  All Organisms → cellular organisms → Bacteria  3670
3300027704|Ga0209816_1025862  Not Available  3001
3300027704|Ga0209816_1062351  All Organisms → cellular organisms → Bacteria  1611
3300027704|Ga0209816_1074122  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  1416
3300027704|Ga0209816_1195609  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  677
3300027714|Ga0209815_1188511  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  641
3300027771|Ga0209279_10237924  Not Available  550
3300027779|Ga0209709_10074227  All Organisms → cellular organisms → Bacteria → Proteobacteria  1866
3300027779|Ga0209709_10333809  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  630
3300027791|Ga0209830_10447011  Not Available  540
3300027813|Ga0209090_10105373  All Organisms → cellular organisms → Bacteria  1524
3300028287|Ga0257126_1034954  Not Available  2218
3300031141|Ga0308021_10303194  Not Available  596
3300031598|Ga0308019_10155866  Not Available  904
3300031598|Ga0308019_10358032  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  532
3300031599|Ga0308007_10322328  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Gammaproteobacteria incertae sedis → Candidatus Thioglobus → unclassified Candidatus Thioglobus → Candidatus Thioglobus sp. NP1  509
3300031627|Ga0302118_10049090  All Organisms → cellular organisms → Bacteria → Proteobacteria  2153
3300031630|Ga0308004_10171642  Not Available  896
3300031630|Ga0308004_10282191  Not Available  647
3300031639|Ga0302117_10209238  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  799
3300031647|Ga0308012_10221041  Not Available  746
3300031696|Ga0307995_1052550  All Organisms → cellular organisms → Bacteria  1697
3300031757|Ga0315328_10221815  Not Available  1106
3300032820|Ga0310342_103052643  Not Available  557

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
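
Each scaffold entry above pairs a numeric IMG/M taxon OID with a scaffold name, separated by a vertical bar, and restricted datasets carry a "(restricted)" prefix. The following is a minimal sketch of how such an entry might be split in Python. The names `ScaffoldRef` and `parse_scaffold` are hypothetical helpers and not part of any NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric IMG/M taxon OID, kept as text
    scaffold: str     # scaffold name within that dataset
    restricted: bool  # True when the entry carries the "(restricted)" prefix


def parse_scaffold(entry: str) -> ScaffoldRef:
    """Split an entry such as '3300000250|LPfeb09P261000mDRAFT_1029157'."""
    entry = entry.strip()
    prefix = "(restricted)"
    restricted = entry.startswith(prefix)
    if restricted:
        entry = entry[len(prefix):].strip()
    taxon_oid, sep, scaffold = entry.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"not a 'taxon OID|scaffold' entry: {entry!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


print(parse_scaffold("3300000250|LPfeb09P261000mDRAFT_1029157"))
print(parse_scaffold("(restricted) 3300024261|Ga0233439_10041108"))
```

Keeping the taxon OID as a string avoids any reformatting of the identifier when it is later matched against the Taxon OID column of the Associated Samples table.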




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Marine  Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine  64.08%
Marine  Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine  18.45%
Marine  Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine  7.77%
Seawater  Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater  1.94%
Seawater  Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater  1.94%
Seawater  Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater  0.97%
Marine  Environmental → Aquatic → Marine → Oceanic → Aphotic Zone → Marine  0.97%
Marine  Environmental → Aquatic → Marine → Inlet → Unclassified → Marine  0.97%
Marine  Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine  0.97%
Methane Seep Mesocosm  Environmental → Aquatic → Marine → Unclassified → Unclassified → Methane Seep Mesocosm  0.97%
Pelagic Marine  Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine  0.97%




Associated Samples

Taxon OID  Sample Name  Habitat Type
3300000250  Marine microbial communities from expanding oxygen minimum zones in Line P, North Pacific Ocean - February 2009 P26 1000m  Environmental
3300003478  Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_100m_DNA  Environmental
3300004276  Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_165m  Environmental
3300005399  Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F14-07SV275  Environmental
3300005408  Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201310SV72  Environmental
3300006164  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG002-DNA  Environmental
3300006190  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG058-DNA  Environmental
3300006191  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG104-DNA  Environmental
3300006193  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG029-DNA  Environmental
3300006352  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG108-DNA  Environmental
3300006947  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG017-DNA  Environmental
3300008253  Methane-oxidizing microbial communities from mesocosms in the Hudson Canyon - EN1B Hudson Canyon  Environmental
3300009409  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150  Environmental
3300009420  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152  Environmental
3300009472  Pelagic marine microbial communities from North Sea - COGITO_mtgs_110404  Environmental
3300009512  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_88  Environmental
3300009705  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128  Environmental
3300009785  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_130  Environmental
3300010883  western Arctic Ocean co-assembly  Environmental
3300020376  Marine microbial communities from Tara Oceans - TARA_B100000795 (ERX555997-ERR599121)  Environmental
3300021084  Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 80m 12015  Environmental
3300024261 (restricted)  Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_123_September2016_100_MG  Environmental
3300024327 (restricted)  Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_120_MG  Environmental
3300027522  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG058-DNA (SPAdes)  Environmental
3300027668  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG104-DNA (SPAdes)  Environmental
3300027672  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG029-DNA (SPAdes)  Environmental
3300027687  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_138 (SPAdes)  Environmental
3300027699  Marine microbial communities from oxygen minimum zone in mesopelagic equatorial Pacific - METZYME_3_250m (SPAdes)  Environmental
3300027704  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG017-DNA (SPAdes)  Environmental
3300027714  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG002-DNA (SPAdes)  Environmental
3300027771  Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG006-DNA (SPAdes)  Environmental
3300027779  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_136 (SPAdes)  Environmental
3300027791  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_130 (SPAdes)  Environmental
3300027813  Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152 (SPAdes)  Environmental
3300028287  Marine microbial communities from Saanich Inlet, British Columbia, Canada - SI060_120m  Environmental
3300031141  Marine microbial communities from water near the shore, Antarctic Ocean - #351  Environmental
3300031598  Marine microbial communities from water near the shore, Antarctic Ocean - #284  Environmental
3300031599  Marine microbial communities from water near the shore, Antarctic Ocean - #71  Environmental
3300031627  Marine microbial communities from Western Arctic Ocean, Canada - AG5_33.1  Environmental
3300031630  Marine microbial communities from water near the shore, Antarctic Ocean - #38  Environmental
3300031639  Marine microbial communities from Western Arctic Ocean, Canada - AG5_32.2  Environmental
3300031647  Marine microbial communities from water near the shore, Antarctic Ocean - #179  Environmental
3300031696  Marine microbial communities from Ellis Fjord, Antarctic Ocean - #262  Environmental
3300031757  Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 200m 32315  Environmental
3300032820  Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - S1503-DNA-20-500_MG  Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID  Sample Taxon ID  Habitat  Sequence
LPfeb09P261000mDRAFT_10291571  3300000250  Marine  NSINGVISFNLFFLNMRITNNIIEIIIAIIIGTSTPQPITGEHK*
JGI26238J51125_10148701  3300003478  Marine  VKKTISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGTSTPQPITGEHK*
Ga0066610_100204742  3300004276  Marine  VLGVKKTISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGTSTPQPITGEHK*
Ga0066860_102026692  3300005399  Marine  MRLGVQKTKSNKGVISFNLFFLNTRTTNNIAEMIIVSANGTSTPHPITGEHR*
Ga0066848_101074663  3300005408  Marine  VQKTKSNKGVISFNLFFLNMRITNNIAEIIIVSANGPSTPQPITGE
Ga0075441_100061267  3300006164  Marine  VKKTISIKGVTSFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR*
Ga0075441_100092415  3300006164  Marine  VLGVKKTISITGVTSFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR*
Ga0075441_100444041  3300006164  Marine  VLGVKKTISIRGVTSFSLFFLNTRITNNIIETIIAINIMDSTIQPT
Ga0075441_100526122  3300006164  Marine  VLGVKKTISIKGVISFSLFFLNDRITTNIIEIIMAINIIDSTIQPTTGEHR*
Ga0075441_101158673  3300006164  Marine  KVLGVKKAISIKGVTSFSLFFLNERITNNTIEIIMAINIIDSTIQPTTGEHR*
Ga0075441_101230491  3300006164  Marine  VLGVKKRISIKGVTSFSLFFLNERITNNIIEIIIAINII
Ga0075441_101423041  3300006164  Marine  VKKAISIKGVTSFSLFFLNERITNNTIEIIMAINI
Ga0075441_101724572  3300006164  Marine  VKKTISIRGVTSFSLFFLNERITNNTIEIIIAINIMLSTIQPTTGEHR*
Ga0075441_103038412  3300006164  Marine  VLGVKNTISIKGVISFSLFFLNERITNNIIEIIIAINIIDSTIQPTTGEHR*
Ga0075441_103937411  3300006164  Marine  VLGVKKIISIKGVTSFSLFFLNERITNNIIEIIIA
Ga0075446_100243491  3300006190  Marine  VKKTISIKGVISFNLFFLNERITNNIIEIIMAINIIDSTIQPTTGEHR*
Ga0075446_101356463  3300006190  Marine  ISIKGVISFSLFFLNERITNKIIEIIMAINIMVSTIQPTTGEHR*
Ga0075446_101446612  3300006190  Marine  VKNTISIKGVISFSLFFLNERITNNIIEIIIAINIIDSTIQPTTGEHR*
Ga0075447_100032215  3300006191  Marine  VKKDISIKGVISFSLFFLNDRITNNTIEIIIAINIIDSTIQPTTGEHR
Ga0075447_100196695  3300006191  Marine  MSIKGVISFSLVFLNNRITTNIIEIIMAINIIDSIIQPTTGEHR*
Ga0075447_100345831  3300006191  Marine  VKKTISITGVTSFSLFFLNERITNNIIEIIIAINI
Ga0075447_100537693  3300006191  Marine  VLGVKKTISIKGVISFSLFFLNERITNNIIEIIMAINIMDSMMQPTTGEHR*
Ga0075447_100552923  3300006191  Marine  VLGVKKTISIKGVTSFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR*
Ga0075447_100686871  3300006191  Marine  VKKAISIKGVTSFSSFFLNERITNNTIEIMMAINIIDSTIQPTTGEHR*
Ga0075447_100783333  3300006191  Marine  VLGVKKTISIKGVISFSLFFLNDRITNNTIEIIIAINIIDSTIQPTTGEHR*
Ga0075447_100843373  3300006191  Marine  VLGVKKTISIKGVTSFSLFLLNERITNNTIEIIIAINIMLSTIQPTTGEHR*
Ga0075447_101146312  3300006191  Marine  LGVKKTISITGVTSFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR*
Ga0075447_101417693  3300006191  Marine  VLGVKKTISINGVISFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR*
Ga0075447_102024632  3300006191  Marine  VLGVKKTISIRGVTSFSLFFLNTRITNNIIETIIAINIMDST
Ga0075447_102472083  3300006191  Marine  RSTKSINGVISLALSFLNARITNNIIETIIAINIGTSTPHPMTGEHK*
Ga0075447_102788671  3300006191  Marine  VKKTISIKGVTSFSLFFLNERITNSIIEIIIAINIIDSTIQPTTGEQR*
Ga0075445_100377884  3300006193  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIETIMAINIMDSTIQPTTGEHR*
Ga0075445_100412013  3300006193  Marine  VLGVKKAISIKGVISFSLFFLNERITNNIIEIIMAINIMVSTIQPTTGEHR*
Ga0075445_101233792  3300006193  Marine  VLGVKKTISIKGVTSFSLFFLNERITNNTIEIIMAINIMDSTIQPTTGEHR*
Ga0075445_101627123  3300006193  Marine  VLGVKKTISINGVISFSLFFLNERITNNIIEIIIAINIMVSTI
Ga0075448_100535922  3300006352  Marine  VKKTISINGVISFNLFFLNERITNNIIEIIMAINIMDSTIHPTTGEHR*
Ga0075444_100067201  3300006947  Marine  VKKTISIKGVTSFSLFFLNERITNNIIEIIIAINIMVSTIQPT
Ga0075444_100263791  3300006947  Marine  VKKTISIKGVISFSLFFLNKRITNNTIEIIIAINIMLSTIQPTTGEHR
Ga0075444_100273375  3300006947  Marine  VLGVKKTISIKGVTSFSLFLLNERITNNTIEIIIAINIM
Ga0075444_101244913  3300006947  Marine  SLFFLNDRITNNTIEIIIAINIIDSTIQPTTGEHR*
Ga0075444_101447651  3300006947  Marine  VLGVKKTISIKGVTSFSLFFLNDRITNNTIEIIMAINIM
Ga0075444_102674581  3300006947  Marine  VKKTISIKGVISFSLFFLNERITNNIIEIIIAINIIDSMIQP
Ga0075444_103165821  3300006947  Marine  VLGVKKTISIKGVISFSLFFLNDRITTNIIEIIMA
Ga0075444_103654061  3300006947  Marine  VLGVKKTISIKGVISFSLFFLNERITNKIIEIIMAINIMVSTIQPTTGEHR*
Ga0075444_103895781  3300006947  Marine  VLGVKKDISIKGVISFSLFFLNDRITNNTIEIIIAINIIDSTI
Ga0075444_104128143  3300006947  Marine  VKKTISIKGVISFSLFFLNERITNNTIEIIMAINI
Ga0075444_104178231  3300006947  Marine  VLGVKKTMSIKGVTSFSLCFLNTRITNNIIETIMAINIMDSTIQPT
Ga0105349_100172292  3300008253  Methane Seep Mesocosm  VLGVKKTISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGASTPQPITGEHK*
Ga0114993_105671771  3300009409  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIQPTTGEHR*
Ga0114994_103911121  3300009420  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIQPT
Ga0114994_106271891  3300009420  Marine  VLGVKKTISIKGVISFSLFFLNERITNNIIEIIIAINIMDSTIQPTTGEH
Ga0115554_13529651  3300009472  Pelagic Marine  MSINGVISFSLFFLKTRNNINIIEIVIAMSIGTSGPQPTTGEQR*
Ga0115003_105231431  3300009512  Marine  VLGVKKTMSIKGVISFNLFFLNERITNNIIEIIMAINIMDSTIHPTTGEH
Ga0115003_106203192  3300009512  Marine  VLGVKKTISIRGVISFSLFFLNDRITTNIIEIIMAINIIDSTI
Ga0115000_106675762  3300009705  Marine  VLGVKKTMSIKGVISFNLFFLNERITNNIIEIIMAINIMDSTIHPTTGEHR*
Ga0115000_108974582  3300009705  Marine  VKKTISIKGVTSLSLFFLNERITNNIIEIIIAINIIDSTIQPTTGEHR
Ga0115001_109218542  3300009785  Marine  VLGVKKTISIKGVISFSLFFLNDRITTNIIEIIMAINIIDS
Ga0133547_117278721  3300010883  Marine  VLGVKKTISIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIQPTTGE
Ga0211682_100929931  3300020376  Marine  VLGVKKTISIKGVTSFSLFLLNERITNNTIEIIIAINIMLS
Ga0211682_101203583  3300020376  Marine  VKKTISIKGVISFSLFFLNERITNNTIEIIIAINIIDSTIQPTTGEHR
Ga0211682_103854341  3300020376  Marine  VLGVKKIISIKGVTSFSLFFLNERITNNIIEIIMAINIMLSTI
Ga0206678_103481552  3300021084  Seawater  MRLGVQKTKSNKGVISFNLFFLNMRITNNIAEIIIVSANGPSTPQPITGEHR
(restricted) Ga0233439_100411082  3300024261  Seawater  VLGVKKTISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGTSTPQPITGEHK
(restricted) Ga0233434_10192893  3300024327  Seawater  VKKTISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGTSTPQPITGEHK
Ga0209384_10113532  3300027522  Marine  VLGVKKTISITGVTSFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR
Ga0209482_10377534  3300027668  Marine  VLGVKKTISINGVISFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR
Ga0209482_10590082  3300027668  Marine  VLGVKKTISIKGVISFSLFFLNERITNNIIEIIMAINIMDSMMQPTTGEHR
Ga0209482_10618071  3300027668  Marine  VISFSLFFLNERITNNIIEIIIAINIMVSTIQPTTGEHR
Ga0209482_10954844  3300027668  Marine  MKKTISIKGVISFSLFFLNKRITNNIIEIIIAINIMLSTIQPT
Ga0209482_11026161  3300027668  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIETIMAINIMDSTIQPTTGEHR
Ga0209482_11084502  3300027668  Marine  ISIKGVTSFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR
Ga0209383_10139546  3300027672  Marine  VLGVKKTISIKGVTSFSLFFLNERITNNTIEIIMAINIMDSTIQPTTGEHR
Ga0209383_11275771  3300027672  Marine  VLGVKKTISIKGVTSFSLFLLNERITNNTIEIIIAINIMVSTIQPTTGEHR
Ga0209383_11455301  3300027672  Marine  GARSTKSINGVISLALSFLNARITNNIIETIIAINIGTSTPHPMTGEHK
Ga0209383_11713582  3300027672  Marine  VKKAISIKGVISFSLFFLNERITNNTIEIMMAINIIDSMIQPTTGEHR
Ga0209383_11881491  3300027672  Marine  VLGVKKTISIKGVISFSLFFLNERITNNIIEIIMAINIMVSTIQPTTG
Ga0209710_11043424  3300027687  Marine  VKKTISIKGVTSFSLFFLNDRITTNIIEIIMAINIIDSTIQPTTGEHR
Ga0209752_11105331  3300027699  Marine  KRLGVQKTKSNKGVISFNLFFLNMRITNNIAEIIIVSANGASTPQPITGEHR
Ga0209816_10083244  3300027704  Marine  VKKTISIKGVTSFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR
Ga0209816_10192542  3300027704  Marine  VKKAISIKGVTSFSLFFLNERITNNTIEIIMAINIIDSTIQPTTGEHR
Ga0209816_10258624  3300027704  Marine  VKKTISIKGVTSFSLFFLNDRITNNTIEIIMAINIMVSTIHPTTGEHR
Ga0209816_10623511  3300027704  Marine  VKKTISIKGVISFSLFFLNERITNNIIEIIIAINIIDSTIQPT
Ga0209816_10741221  3300027704  Marine  VKKTISIKGVISFSLFFLNERITNNTIEIIMAINIIDSTIQPT
Ga0209816_11956092  3300027704  Marine  VKKTISIKGVISFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR
Ga0209815_11885111  3300027714  Marine  AISIKGVTSFSLFFLNERITNNTIEIIMAINIMVSTIQPTTGEHR
Ga0209279_102379241  3300027771  Marine  VLGVKKTISIKGVMSFSLFFLNERITNNIIEIIIAIN
Ga0209709_100742272  3300027779  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIQPTTGEHR
Ga0209709_103338092  3300027779  Marine  MVLGVKKTISIKGVTSFSLFFLNDRITTNIIEIIMAINIIDSTIQPTTGEHR
Ga0209830_104470111  3300027791  Marine  VLGVKKTISIRGVISFSLFFLNDRITTNIIEIIMAINII
Ga0209090_101053733  3300027813  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIQPTTGEH
Ga0257126_10349541  3300028287  Marine  TISIKGVTSFSLFFLNMRITNNIAEIIIAIIIGTSTPQPITGEHK
Ga0308021_103031941  3300031141  Marine  VKKTISIKGVISFNLFFLNERITNNTIEIIIAINIMVSM
Ga0308019_101558663  3300031598  Marine  VKKAISIKGVISFSLFFLNERITNNIIEIIIAINIIVSTIQPTTGEHR
Ga0308019_103580321  3300031598  Marine  VLGVKKAISIKGVISFSLFFLNERITNNIIEIIMA
Ga0308007_103223282  3300031599  Marine  VLGVKKIISIRGVTSFSLFFLNERITNNIIEIIIAINIIDSTIQPTTGEHRXGPAREPKS
Ga0302118_100490902  3300031627  Marine  VLGVKKTMSIKGVTSFSLFFLNTRITNNTIEIIMAINIMDSTIQPTTGEHR
Ga0308004_101716421  3300031630  Marine  VKKTISIKGVISFNLFFLNERITNNTIEIIMAINIMLSTIQPTTGEHR
Ga0308004_102821911  3300031630  Marine  VLGVKKTISIKGVMSFSLFFLNERITNNIIEIIIAINIMVSTIQPTTG
Ga0302117_102092382  3300031639  Marine  VLGVKKTISIRGVISFSLFFLNDRITTNIIEIIMAINIIDSTIQPTTGEHR
Ga0308012_102210411  3300031647  Marine  VKKTISIKGVISFSLFFLNERITNNIIEIIIAINIIVSTIQPTTGEHR
Ga0307995_10525501  3300031696  Marine  ISIKGVTSFSLFFLNTRITNNIIEIIMAINIMDSTIHPTTGEHR
Ga0315328_102218152  3300031757  Seawater  MRLGVKKTISIKGVTSFSLFFLNTRITNNITEIIIAIIIGASTPQPITGEHR
Ga0310342_1030526431  3300032820  Seawater  VQKTKSNKGVISFNLFFLNMRITNNIAEIIIVIANGPSTPQPITGEHR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice