NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F097499

Overview

Basic Information
Family ID F097499
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 62 residues
Representative Sequence MSYYPKTIADKMKKGLENELQQICEIGKVLREDGNSRTEVNYYMSVDEDFISDVLGCYNS
Number of Associated Samples 70
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 65.05 %
% of genes near scaffold ends (potentially truncated) 29.81 %
% of genes from short scaffolds (< 2000 bps) 81.73 %
Associated GOLD sequencing projects 62
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.81

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
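
The percentage fields above read as simple proportions over the family's genes or scaffolds. As an illustration only, the sketch below computes the share of scaffolds shorter than 2000 bp from a list of lengths. The helper name `short_scaffold_percent` and the strict `<` cutoff are assumptions, not NMPFamsDB's documented method, and the five lengths are only the first rows of the Associated Scaffolds table.

```python
def short_scaffold_percent(lengths, cutoff=2000):
    """Return the percentage of scaffolds strictly shorter than `cutoff` bp.

    Hypothetical helper: the strict comparison is an assumption about how
    the "< 2000 bps" criterion is applied.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    short = sum(1 for n in lengths if n < cutoff)
    return 100.0 * short / len(lengths)


# First five scaffold lengths listed for this family (illustrative subset only).
sample_lengths = [657, 2096, 652, 1813, 710]
print(f"{short_scaffold_percent(sample_lengths):.2f} %")
```

Under the same assumption, 85 short scaffolds out of 104 would round to the 81.73 % shown above.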

Most Common Taxonomy
Group Unclassified (67.308 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (25.000 % of family members)
Environment Ontology (ENVO) Unclassified (69.231 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (62.500 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 48.86%, β-sheet: 0.00%, Coil/Unstructured: 51.14%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.81
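
The legend above groups per-residue pLDDT values into four ranges. A minimal sketch of that binning follows. It assumes a fractional score belongs to the first bin whose upper bound it does not exceed, since the page does not state how boundaries are handled, and `plddt_bin` is a hypothetical helper name. The pTM-score is a separate, whole-model value and is not binned here.

```python
def plddt_bin(score):
    """Map a per-residue pLDDT score (0-100) to one of the four legend bins.

    Boundary handling for non-integer scores is an assumption.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


# Made-up scores, used only to show the mapping.
for s in (42.0, 65.5, 88.1, 97.3):
    print(s, plddt_bin(s))
```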

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.123.1.1: Nuclear receptor ligand-binding domain | d1fcya_ | 1fcy | 0.61717
a.104.1.0: automated matches | d7cb9a_ | 7cb9 | 0.61629
a.123.1.0: automated matches | d3ltxa_ | 3ltx | 0.61063
a.123.1.1: Nuclear receptor ligand-binding domain | d1pzla_ | 1pzl | 0.60881
a.123.1.1: Nuclear receptor ligand-binding domain | d3vhva1 | 3vhv | 0.6087



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF06067 | DUF932 | 2.88
PF16363 | GDP_Man_Dehyd | 0.96
PF02599 | CsrA | 0.96
PF00583 | Acetyltransf_1 | 0.96
PF02666 | PS_Dcarbxylase | 0.96
PF13365 | Trypsin_2 | 0.96
PF02945 | Endonuclease_7 | 0.96
PF01844 | HNH | 0.96
PF04542 | Sigma70_r2 | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.96
COG0688 | Phosphatidylserine decarboxylase | Lipid transport and metabolism [I] | 0.96
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.96
COG1551 | sRNA-binding carbon storage regulator CsrA | Signal transduction mechanisms [T] | 0.96
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.96
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 67.31 %
All Organisms | root | All Organisms | 32.69 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002231|KVRMV2_100246129 | Not Available | 657
3300002242|KVWGV2_10343291 | All Organisms → Viruses → Predicted Viral | 2096
3300002242|KVWGV2_10838337 | Not Available | 652
3300003619|JGI26380J51729_10024189 | Not Available | 1813
3300004273|Ga0066608_1106694 | Not Available | 710
3300005593|Ga0066837_10056410 | Not Available | 1483
3300006310|Ga0068471_1055813 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Acidiferrobacterales → Acidiferrobacteraceae → unclassified Acidiferrobacteraceae → Acidiferrobacteraceae bacterium | 5823
3300006411|Ga0099956_1057873 | Not Available | 655
3300006637|Ga0075461_10046440 | All Organisms → Viruses → Predicted Viral | 1414
3300006751|Ga0098040_1026789 | All Organisms → Viruses → Predicted Viral | 1855
3300006751|Ga0098040_1072234 | Not Available | 1055
3300006754|Ga0098044_1275081 | Not Available | 648
3300006789|Ga0098054_1137866 | Not Available | 904
3300006929|Ga0098036_1136575 | Not Available | 751
3300007283|Ga0066366_10245819 | Not Available | 747
3300007283|Ga0066366_10461834 | Not Available | 558
3300007514|Ga0105020_1255640 | All Organisms → Viruses → Predicted Viral | 1161
3300007963|Ga0110931_1023355 | Not Available | 1866
3300007963|Ga0110931_1113455 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 816
3300007963|Ga0110931_1131640 | Not Available | 752
3300007963|Ga0110931_1157476 | Not Available | 681
3300008050|Ga0098052_1103842 | All Organisms → Viruses → Predicted Viral | 1155
3300008217|Ga0114899_1195919 | Not Available | 642
3300008624|Ga0115652_1002656 | All Organisms → cellular organisms → Bacteria | 12221
3300008624|Ga0115652_1055974 | All Organisms → Viruses → Predicted Viral | 1363
3300009103|Ga0117901_1013183 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 7021
3300009370|Ga0118716_1008107 | Not Available | 10350
3300009370|Ga0118716_1031459 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → unclassified Myxococcaceae → Myxococcaceae bacterium | 3624
3300009376|Ga0118722_1261297 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 1001
3300009409|Ga0114993_10385080 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → unclassified Myxococcaceae → Myxococcaceae bacterium | 1056
3300009414|Ga0114909_1113749 | Not Available | 734
3300009418|Ga0114908_1046111 | All Organisms → Viruses → Predicted Viral | 1583
3300009481|Ga0114932_10004737 | All Organisms → cellular organisms → Bacteria | 12474
3300009481|Ga0114932_10009736 | All Organisms → cellular organisms → Bacteria | 7530
3300009481|Ga0114932_10118891 | All Organisms → Viruses → Predicted Viral | 1638
3300009481|Ga0114932_10147422 | All Organisms → Viruses → Predicted Viral | 1447
3300009481|Ga0114932_10250266 | Not Available | 1069
3300009481|Ga0114932_10753667 | Not Available | 565
3300009595|Ga0105214_118385 | Not Available | 555
3300009604|Ga0114901_1143870 | Not Available | 719
3300009605|Ga0114906_1284617 | Not Available | 529
3300009619|Ga0105236_1029010 | Not Available | 674
3300009619|Ga0105236_1054577 | Not Available | 537
3300009620|Ga0114912_1165555 | Not Available | 513
3300009703|Ga0114933_10037861 | All Organisms → Viruses → Predicted Viral | 3605
3300009703|Ga0114933_10168857 | All Organisms → Viruses → Predicted Viral | 1497
3300009786|Ga0114999_10207878 | All Organisms → Viruses → Predicted Viral | 1619
3300009794|Ga0105189_1034314 | Not Available | 511
3300010155|Ga0098047_10392288 | Not Available | 520
3300010934|Ga0137844_1084776 | Not Available | 973
3300011013|Ga0114934_10000107 | All Organisms → cellular organisms → Bacteria | 46815
3300011013|Ga0114934_10005038 | All Organisms → cellular organisms → Bacteria | 8066
3300017775|Ga0181432_1258528 | Not Available | 550
3300020303|Ga0211692_1016403 | All Organisms → Viruses → Predicted Viral | 1005
3300020447|Ga0211691_10031048 | All Organisms → Viruses → Predicted Viral | 1879
3300020447|Ga0211691_10424310 | Not Available | 538
3300020458|Ga0211697_10210764 | All Organisms → cellular organisms → Bacteria | 803
3300020476|Ga0211715_10565920 | Not Available | 558
3300021442|Ga0206685_10110697 | Not Available | 909
3300021791|Ga0226832_10004591 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 4317
3300021791|Ga0226832_10076962 | Not Available | 1187
3300021791|Ga0226832_10171358 | Not Available | 835
3300021791|Ga0226832_10311523 | Not Available | 644
3300021959|Ga0222716_10565822 | Not Available | 627
3300021978|Ga0232646_1299270 | Not Available | 550
(restricted) 3300022933|Ga0233427_10065353 | Not Available | 1857
(restricted) 3300024340|Ga0255042_10164032 | Not Available | 672
3300024344|Ga0209992_10002961 | All Organisms → cellular organisms → Bacteria | 14859
3300024344|Ga0209992_10007304 | All Organisms → cellular organisms → Bacteria | 7522
3300024344|Ga0209992_10072782 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Chloroflexi incertae sedis → SAR202 cluster → SAR202 cluster bacterium | 1584
3300024344|Ga0209992_10218964 | Not Available | 802
(restricted) 3300024518|Ga0255048_10211215 | Not Available | 946
3300025125|Ga0209644_1029851 | Not Available | 1212
3300025133|Ga0208299_1101583 | Not Available | 971
3300025241|Ga0207893_1004544 | Not Available | 1925
3300025251|Ga0208182_1005958 | All Organisms → Viruses → Predicted Viral | 3907
3300025280|Ga0208449_1009339 | All Organisms → Viruses → Predicted Viral | 3555
3300025547|Ga0209556_1092540 | Not Available | 668
3300025630|Ga0208004_1082691 | Not Available | 791
3300025873|Ga0209757_10014272 | All Organisms → Viruses → Predicted Viral | 2152
3300025873|Ga0209757_10196178 | Not Available | 639
3300026134|Ga0208815_1061712 | Not Available | 511
3300028018|Ga0256381_1022309 | Not Available | 1033
3300028022|Ga0256382_1012209 | Not Available | 1676
3300028022|Ga0256382_1023095 | Not Available | 1335
3300028022|Ga0256382_1052895 | Not Available | 947
3300028022|Ga0256382_1061040 | Not Available | 887
3300028022|Ga0256382_1075486 | Not Available | 801
3300028022|Ga0256382_1077432 | Not Available | 791
3300028022|Ga0256382_1121215 | Not Available | 628
3300028022|Ga0256382_1132444 | Not Available | 599
3300031605|Ga0302132_10215869 | Not Available | 918
3300031606|Ga0302119_10387992 | Not Available | 507
3300031701|Ga0302120_10044848 | Not Available | 1881
3300031701|Ga0302120_10221291 | Not Available | 716
3300031701|Ga0302120_10228453 | Not Available | 701
3300031801|Ga0310121_10084045 | Not Available | 2067
3300031802|Ga0310123_10131490 | All Organisms → Viruses → Predicted Viral | 1723
3300031811|Ga0310125_10519599 | Not Available | 564
3300032277|Ga0316202_10453183 | Not Available | 601
3300032278|Ga0310345_10567362 | Not Available | 1090
3300032278|Ga0310345_11246160 | Not Available | 727
3300032820|Ga0310342_101736109 | Not Available | 745

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
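
Each scaffold identifier in the table above appears to join a sample Taxon OID and a scaffold name with a `|` character; the numeric prefixes match the Taxon OIDs in the Associated Samples table. The sketch below splits an identifier on that basis. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration, and the handling of the `(restricted)` prefix is an assumption about how such rows are written.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # sample identifier, e.g. "3300002231"
    scaffold: str     # scaffold name within that sample
    restricted: bool  # True if the row carried a "(restricted)" prefix


def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold>' and note an optional '(restricted)' prefix."""
    text = text.strip()
    restricted = text.startswith("(restricted)")
    if restricted:
        text = text[len("(restricted)"):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"not a '<taxon OID>|<scaffold>' identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


print(parse_scaffold_id("3300002231|KVRMV2_100246129"))
print(parse_scaffold_id("(restricted) 3300022933|Ga0233427_10065353"))
```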




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 25.00%
Deep Subsurface | Environmental → Aquatic → Marine → Volcanic → Unclassified → Deep Subsurface | 13.46%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 8.65%
Seawater | Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater | 8.65%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 6.73%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 5.77%
Marine Oceanic | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine Oceanic | 4.81%
Hydrothermal Vent Fluids | Environmental → Aquatic → Marine → Hydrothermal Vents → Diffuse Flow → Hydrothermal Vent Fluids | 3.85%
Seawater | Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater | 2.88%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 2.88%
Marine Sediment | Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment | 2.88%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.92%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 1.92%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 1.92%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 1.92%
Marine | Environmental → Aquatic → Marine → Oceanic → Photic Zone → Marine | 0.96%
Marine | Environmental → Aquatic → Marine → Oceanic → Aphotic Zone → Marine | 0.96%
Microbial Mat | Environmental → Aquatic → Marine → Coastal → Sediment → Microbial Mat | 0.96%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 0.96%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 0.96%
Hydrothermal Vent Fluids | Environmental → Aquatic → Marine → Hydrothermal Vents → Unclassified → Hydrothermal Vent Fluids | 0.96%
Subsea Pool Microbial Mat | Environmental → Aquatic → Unclassified → Unclassified → Unclassified → Subsea Pool Microbial Mat | 0.96%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300002231 | Marine sediment microbial communities from Santorini caldera mats, Greece - red mat | Environmental
3300002242 | Marine sediment microbial communities from Kolumbo Volcano mats, Greece - white/grey mat | Environmental
3300003619 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI072_LV_165m_DNA | Environmental
3300004273 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI075_LV_DNA_135m | Environmental
3300005593 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201302SV86 | Environmental
3300005969 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S7_td_Bottom_ad_4513_LV_A | Environmental
3300006310 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT229_3_0500m | Environmental
3300006411 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT225_1_0200m | Environmental
3300006637 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_>0.8_DNA | Environmental
3300006751 | Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG | Environmental
3300006754 | Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG | Environmental
3300006789 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG | Environmental
3300006929 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG | Environmental
3300007283 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S7_td_250_ad_252m_LV_B | Environmental
3300007514 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 143m, 2.7-0.2um, replicate a | Environmental
3300007963 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG (version 2) | Environmental
3300008050 | Marine viral communities from the Subarctic Pacific Ocean - 15_ETSP_OMZ_AT15312 metaG | Environmental
3300008217 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_215 | Environmental
3300008624 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 200m, 250-2.7um | Environmental
3300009103 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 143m, 250-2.7um | Environmental
3300009370 | Combined Assembly of Gp0127930, Gp0127931 | Environmental
3300009376 | Combined Assembly of Gp0137079, Gp0137080 | Environmental
3300009409 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150 | Environmental
3300009414 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_906 | Environmental
3300009418 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s17 | Environmental
3300009481 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG | Environmental
3300009595 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3635_2500 | Environmental
3300009604 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s16 | Environmental
3300009605 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_M9 | Environmental
3300009619 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3827_250 | Environmental
3300009620 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_51 | Environmental
3300009703 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 4SBTROV12_W25 metaG | Environmental
3300009786 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_126 | Environmental
3300009794 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3438_5245 | Environmental
3300010155 | Marine viral communities from the Subarctic Pacific Ocean - 12_ETSP_OMZ_AT15267 metaG | Environmental
3300010934 | Microbial mat microbial communities from the Kallisti Limnes subsea pool, Santorini, Greece - 2-BIOTECH-ROV9-P3 | Environmental
3300011013 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 4SBTROV10_white metaG | Environmental
3300017775 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 55 SPOT_SRF_2014-07-17 | Environmental
3300020303 | Marine microbial communities from Tara Oceans - TARA_B100000745 (ERX556095-ERR599124) | Environmental
3300020447 | Marine microbial communities from Tara Oceans - TARA_B100000745 (ERX556090-ERR599159) | Environmental
3300020458 | Marine microbial communities from Tara Oceans - TARA_B100000749 (ERX556123-ERR599000) | Environmental
3300020476 | Marine microbial communities from Tara Oceans - TARA_B100001750 (ERX556108-ERR598958) | Environmental
3300021442 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M2 200m 12015 | Environmental
3300021791 | Hydrothermal fluids microbial communities from Mariana Back-Arc Basin vent fields, Pacific Ocean - Daikoku_FS921 150_kmer | Environmental
3300021959 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_13D | Environmental
3300021978 | Hydrothermal fluids microbial communities from Mariana Back-Arc Basin vent fields, Pacific Ocean - Perseverance_CTD_V16A_01_btl17 _150kmer | Environmental
3300022933 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_100_MG | Environmental
3300024340 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_5 | Environmental
3300024344 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG (SPAdes) | Environmental
3300024518 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_2 | Environmental
3300025125 | Marine viral communities from the Pacific Ocean - ETNP_2_1000 (SPAdes) | Environmental
3300025133 | Marine viral communities from the Subarctic Pacific Ocean - 15_ETSP_OMZ_AT15312 metaG (SPAdes) | Environmental
3300025241 | Marine viral communities from the Deep Pacific Ocean - MSP-121 (SPAdes) | Environmental
3300025251 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_906 (SPAdes) | Environmental
3300025280 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s17 (SPAdes) | Environmental
3300025547 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_150m_DNA (SPAdes) | Environmental
3300025630 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_>0.8_DNA (SPAdes) | Environmental
3300025873 | Marine viral communities from the Pacific Ocean - ETNP_6_1000 (SPAdes) | Environmental
3300026134 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3438_5245 (SPAdes) | Environmental
3300028018 | Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 1600m | Environmental
3300028022 | Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 750m | Environmental
3300031605 | Marine microbial communities from Western Arctic Ocean, Canada - CB9_32.1 | Environmental
3300031606 | Marine microbial communities from Western Arctic Ocean, Canada - AG5_Tmax | Environmental
3300031701 | Marine microbial communities from Western Arctic Ocean, Canada - AG5_Bottom | Environmental
3300031801 | Marine microbial communities from Western Arctic Ocean, Canada - CB27_Tmax_986 | Environmental
3300031802 | Marine microbial communities from Western Arctic Ocean, Canada - CB6_AW_1057 | Environmental
3300031811 | Marine microbial communities from Western Arctic Ocean, Canada - CB11b_Tmax_Bot8 | Environmental
3300032277 | Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 3-month pyrrhotite | Environmental
3300032278 | Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - HC15-DNA-20-500_MG | Environmental
3300032820 | Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - S1503-DNA-20-500_MG | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
KVRMV2_1002461292 | 3300002231 | Marine Sediment | MKKGLTNEIEMYAEISKALREDGNSRTQVRYYMNVDEDFISDVLSCYK*
KVWGV2_103432916 | 3300002242 | Marine Sediment | MSYYPQTIADKMEKGLTNEIELYAAISKALREDGNSKSQVSYYMNVDEDFIPDVLQAYA*
KVWGV2_108383373 | 3300002242 | Marine Sediment | MSYYPQTIASKMKKGLTSEIEMYAEISKALREDGKSRSEVRYYMSVDDDFISDVLGCYK*
JGI26380J51729_100241891 | 3300003619 | Marine | MDFAEKVASNMKKGLSNELRKVCEIGRVLRDVEGLAKSQVNYFMSI
Ga0066608_11066941 | 3300004273 | Marine | MDFAEKVASNMKKGLSNELRKVCEIGRVLRDVEGLSRTEVNYYMSIDEDFIPDVLQAYKGVKKRKPV
Ga0066837_100564105 | 3300005593 | Marine | MMSNQELNSYYPRTVASQMKRGLTNELEQICEIGKVLRDLGNSPREVNYYMSVDEDFISDVLGCYNYKG*
Ga0066369_101465352 | 3300005969 | Marine | MDYAEKVASQMKKGLSNERSKIGEISRVLRDLGLSNVEVNYHMNVDEDFIPDVLSYYK
Ga0068471_105581317 | 3300006310 | Marine | LTASQPPDTIHTHTHEGAKMSYYPQTIADKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA*
Ga0099956_10578732 | 3300006411 | Marine | MTGKNYYPKIVADKMEKGIESELGQIYEISKVLHEDGNSRTQVGYYMNVDEDFIPDVLSCYNYKG*
Ga0075461_100464401 | 3300006637 | Aqueous | MDFAEKVASNMKKGLSNELRKICEIGRVLRDEGLSKTEVNYHMSIDEDFIPDVLQAYKGVKRNRN*
Ga0098040_10267897 | 3300006751 | Marine | MSYYPKTVASKMTKGLENDLQKICEIGKVLRELGKSRTEVNYYMSIDEDFISDVFNCYEG
Ga0098040_10722342 | 3300006751 | Marine | MSYYPKIVSDKMTKGLTNELKQICEIGKVLRDLGNSPREVNYYMSVDEDFISDVLGCYNYKGN*
Ga0098044_12750811 | 3300006754 | Marine | LKQLQSYYPYKVATQMEKGLTNELKQICEIGKVLRDLGNSPREVNYYMSVDEDFISDVLGCYNYKGN*
Ga0098054_11378662 | 3300006789 | Marine | MSYYPKTVASKMKKGLENELQKICEIGKVLREDGNTNTQVSYYLNVDEDFIPDVLSCYKG
Ga0098036_11365751 | 3300006929 | Marine | MSIICYNGGMRNEEGKAMSYYPQTIADKMEKGLVHEQSKIWEIRRVLRNEGLSVTEVDYYMNVDEDFIPDVLQVYA*
Ga0066366_102458192 | 3300007283 | Marine | MRNKEGKAMSYYPQTIADKMEKGLTKEIELYAAISRALREDGNSRSQVSYYMSVDEDFIPDVLQAYA*
Ga0066366_104618341 | 3300007283 | Marine | MSYYPQTVANNMAKGLTTQSEKLFEIGKVLHDLGNSRTEVRYYMSVDEDFIPDVLSCYRGLTNENSLV*
Ga0105020_12556401 | 3300007514 | Marine | MKKGLENELQQICEIGKVLREDGNSRIEVNYYMNVDEDFISDVLGCYNYKG*
Ga0110931_10233551 | 3300007963 | Marine | MISNNYYPKVIANKMEKGIENELGQIYEISKALREDGNSRIQVGYYMNVDED
Ga0110931_11134551 | 3300007963 | Marine | MISDRYYPKAIANKMEKGIESELGQIYEISKVLREDGNSRIQVGYYMNVDEDFISDVLSCYKNIGVENG*
Ga0110931_11316403 | 3300007963 | Marine | MSYYPQTIADKMEKGLTNEIEMYAAISKALREDGNSRSQVSYYMSVDEDFIPDVLQAYA*
Ga0110931_11574761 | 3300007963 | Marine | MSITYYNGGMRNKEGKAMSYYPQTIADKMEKGLVHEQSKIWEIRRVLRNEGLSVTEVDYYMNVDEDFIPDVLQVYA*
Ga0098052_11038423 | 3300008050 | Marine | MSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLGCYK*
Ga0114899_11959193 | 3300008217 | Deep Ocean | MSYYPKIVSDNMKKGLTNELKQICEIGRVLRELGNDRHEVNYYMSVDEDFISDVLGCYNYKG*
Ga0115652_100265615 | 3300008624 | Marine | MSYYPKEIASKMKKGLENELQQICEIGKVLREDGNSRIEVNYYMNVDEDFISDVLGCYNYKG*
Ga0115652_10559742 | 3300008624 | Marine | MSYYPQIVANKMEKGLENELQQICAIGKVLRELGNSRTEVNYYMSVDEDFIPDVLGCYNYHRKG*
Ga0117901_10131836 | 3300009103 | Marine | MSYYPKVVSDKMKKGLETELQKICEIGKVLRELGNSRTEVNYYMSVDEDFIPDVLSCYNYKG*
Ga0118716_100810714 | 3300009370 | Marine | MRFNNYYPKAIANKMKKGLENELQKICEIGKVLREDGNTRIQVNYFLNVDEDFISDVLSCYKG*
Ga0118716_10314593 | 3300009370 | Marine | MMSNQELNSYYPRTVASQMKRGLTNELAQICEIGKVLRDLGNSPREVNYYMSVDEDFIPDVLSCYKG*
Ga0118722_12612972 | 3300009376 | Marine | MSYYPKVIADKMKKGLSNEIRIICEIGRVLREDNLTPVEVNYYMSVDEDFIPDVMDCYKERA*
Ga0114993_103850801 | 3300009409 | Marine | MDYPQEIASKMKKGLSNELRKICEIGRVLRDVEGLSRTEVNYYMSIDEDFISDVLSCYKG
Ga0114909_11137491 | 3300009414 | Deep Ocean | IMSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLGCYK
Ga0114908_10461116 | 3300009418 | Deep Ocean | RFILKIIPNRGWGITRKEIIMSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLGCYK*
Ga0114932_100047372 | 3300009481 | Deep Subsurface | MYNESMKRNNNNERKKVISYYPQTIASKMKKGLTNEIEMYAEISKALREDGKSRSEVRYYMSFDEDFISDVLDCYK*
Ga0114932_100097366 | 3300009481 | Deep Subsurface | MGDMESNNNNNERKKVISYYPQTIASKMQKGLTKEIEIYAEISKALREDGKSRIECRYYMTIDDDFISDVLGCYK*
Ga0114932_101188913 | 3300009481 | Deep Subsurface | MRNKEGEVMSYYPQTIANKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA*
Ga0114932_101474221 | 3300009481 | Deep Subsurface | KMKKGLENELQKICEIGKVLREDGNTNTQVSYYLNVDEDFIPDVLSCYKG*
Ga0114932_102502663 | 3300009481 | Deep Subsurface | MSYYPQIIASKMKKGLTNEIEMYAEISKALREDGKSRSEVRYCMSVDEDFIPDVLQCYKK
Ga0114932_107536671 | 3300009481 | Deep Subsurface | MSSYYPQTIASKMKKGLTNEIEMYAEISKALREDGKSRSEVRYCMNVDEDFIPDVLQCYKK*
Ga0105214_1183851 | 3300009595 | Marine Oceanic | KEGSEMSYYPKTVASKMTKGLKTQSEKLFEIGKVLHDLGNSRIEVRYYMSVDEDFISDVLSCYRG*
Ga0114901_11438701 | 3300009604 | Deep Ocean | MSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLG
Ga0114906_12846172 | 3300009605 | Deep Ocean | RKEIIMSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLGCYK*
Ga0105236_10290103 | 3300009619 | Marine Oceanic | VADKMKKGLSNEIALYCEIGKVLRELGNSRTEVNYYMNVDEDFISDVLGCYSYKG*
Ga0105236_10545771 | 3300009619 | Marine Oceanic | MSYYPQTIADKMEKGLTNEIELYAAISKALREDGNSRSQVSYYMSVDEDFIPDVLQAYA*
Ga0114912_11655551 | 3300009620 | Deep Ocean | MSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFIPDVLDCYK
Ga0114933_1003786110 | 3300009703 | Deep Subsurface | MSYYPKIIASKMKKGLTNEIEMYAEISKALREDGNSRTQVRYYMNVDEDFISDVLSCYK*
Ga0114933_101688572 | 3300009703 | Deep Subsurface | MRNKEGKDMSYYPQTIANKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA*
Ga0114999_102078783 | 3300009786 | Marine | MKNNNKKETKMDYPQEIADQMKKGLSNETRWGEIDLICEIGRVLRESGLSRTEVNYHMSIDEDFIPDVLSCYKGVEK
Ga0105189_10343142 | 3300009794 | Marine Oceanic | MISNNYYPKVIANKMEKGIESELGQIYEISKVLREDGNSRTQVGYYMNVDEDFISDVLSCYKNIGVENG*
Ga0098047_103922881 | 3300010155 | Marine | MSYYPKTIADKMKKGLENELQQICEIGKVLREDGNSRTEVNYYMSVDEDFISDVLGCYNS
Ga0137844_10847761 | 3300010934 | Subsea Pool Microbial Mat | MSYYPQTIADKMEKGLTNEIELYAAISKALREDGNSKSQVSYYMNVDEDFIPDV
Ga0114934_1000010748 | 3300011013 | Deep Subsurface | MSFYYPQMIASKMKKGLTNEIEMYAEISKALREDGNSRTQVNYYMNVDEDFIPDVLACYK
Ga0114934_100050383 | 3300011013 | Deep Subsurface | MIRNNNNNERKKVMSYYPQTIASKMKKGLTSEIEMYAEISKALREDGKSRSEVRYYMSVDDDFISDVLGCYK*
Ga0181432_12585282 | 3300017775 | Seawater | MSMSSYYPKTVASQMEQGLKTESEMITEIGRVLHVLGNSRREITYYMSVDEDFIMDVLGCYNYKG
Ga0211692_10164031 | 3300020303 | Marine | MSYYPQTIANKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA
Ga0211691_100310485 | 3300020447 | Marine | MIRNNSAKKEGSEMSYYPKTVANKMTKGLKTQSEKLFEIGKVLHDLGNSRIEVRYYMSVDEDFISDVLSCYRA
Ga0211691_104243102 | 3300020447 | Marine | MSYYPQIVSDNMKKGLSNEIRQICEIGRVLRELGNSPREVNYYMSVDEDFISDVLGCYNYKG
Ga0211697_102107641 | 3300020458 | Marine | MSYYPKTVANKMTKGLKTQSEKLFEIGKVLHDLGNSRIEVRYYMSVDEDFISDVLSCYRA
Ga0211715_105659202 | 3300020476 | Marine | MSYYPQTIADKMEKGLTNEIELYAAISKALREDGNSRSQVSYYMSVDEDFIPDVLQAYA
Ga0206685_101106971 | 3300021442 | Seawater | MPIIYYNGGMRNKEGKDMSYYPQTIANKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA
Ga0226832_100045913 | 3300021791 | Hydrothermal Vent Fluids | MSYYPQTVANNMAKGLTTQSEKLFEIGKVLHDLGNSRTEVRYYMSVDEDFIPDVLSCYRGLTNENSLV
Ga0226832_100769623 | 3300021791 | Hydrothermal Vent Fluids | MSYYPRTIANKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYAA
Ga0226832_101713582 | 3300021791 | Hydrothermal Vent Fluids | MSYYPKTVASKMAKGLENDLQKICEIGKVLRELGKSRTEVNYYMSMDEDFISDVFNCYEG
Ga0226832_103115232 | 3300021791 | Hydrothermal Vent Fluids | MSYYPKTVASKMTKGLENELQQICEIGKVLRELGNSRTEVNYYMSMDEDFISDVLNCYKG
Ga0222716_105658221 | 3300021959 | Estuarine Water | MDFAEKVASNMKKGLSNEIRKVCEIGRVLRDVEGLAKSQVNYFMSIDEDFIPDVLKAYKGVK
Ga0232646_12992701 | 3300021978 | Hydrothermal Vent Fluids | KTVASKMTKGLENELQKICEIGKVLRELGNSRIEVNYYMSVDEDFISDVLSCYRG
(restricted) Ga0233427_100653531 | 3300022933 | Seawater | MDFAEKVASNMKKGLSNELRKVCEIGRVLRDVEGLAKSQVNYFMSIDEDFIPDVLQAYKGVKKRKP
(restricted) Ga0255042_101640321 | 3300024340 | Seawater | MDFAEKVASNMKKGLSNELRKVCEIGRVLRDVEGLAKSQVNYFMSIDEDFI
Ga0209992_1000296112 | 3300024344 | Deep Subsurface | MSFYYPQMIASKMKKGLTNEIEMYAEISKALREDGNSRTQVNYYMNVDEDFIPDVLQCYK
Ga0209992_1000730414 | 3300024344 | Deep Subsurface | MGDMESNNNNNERKKVISYYPQTIASKMQKGLTKEIEIYAEISKALREDGKSRIECRYYMTIDDDFISDVLGCYK
Ga0209992_100727824 | 3300024344 | Deep Subsurface | MYNESMKRNNNNERKKVISYYPQTIASKMKKGLTNEIEMYAEISKALREDGKSRSEVRYYMSFDEDFISDVLDCYK
Ga0209992_102189641 | 3300024344 | Deep Subsurface | MSSYYPQTIASKMKKGLTNEIEMYAEISKALREDGKSRSEVRYCMNVDEDFIPDVLQCYK
(restricted) Ga0255048_102112153 | 3300024518 | Seawater | MSYYPKTVASKMTKGLENEYFQICEIGKVLRELGNSRTEVNYYMNVDEDFISDVLSCYRG
Ga0209644_10298513 | 3300025125 | Marine | MSYYPQTIASKIKKGLTNENSIICEIGNALREDGNSHIEVNYYMNVDEDFISDVLGCYNYNK
Ga0208299_11015834 | 3300025133 | Marine | EIIMSYYPKTIADKMQKGLTNEIELYAEINKALREDGNSRIEVNYYMNVDEDFISDVLGCYK
Ga0207893_10045445 | 3300025241 | Deep Ocean | MRDYAEKVASQMKKGLSNENSKICEIGRVLRDEGLSHVEVNYHMNVDADFIPDVLESYKG
Ga0208182_10059586 | 3300025251 | Deep Ocean | MSYYPKTVASKMTKGLENDLQKICEIGKVLRELGKSRTEVNYYMSMDEDFISDVFNCYEG
Ga0208449_10093393 | 3300025280 | Deep Ocean | MSYYPKTVASKMTKGLKNQSEKISEIGKVLRELGKSRTEVNYYMSIDEDFISDVLSCYRG
Ga0209556_10925402 | 3300025547 | Marine | MDFAEKVASNMKKGLSNELRKVCEIGRVLRDVEGLSRTEVNYYMSIDEDFIPDVLQAYK
Ga0208004_10826912 | 3300025630 | Aqueous | MDFAEKVASNMKKGLSNELRKICEIGRVLRDEGLSKTEVNYHMSIDEDFIPDVLQAYKGVKRNRN
Ga0209757_100142723 | 3300025873 | Marine | MSYYPKTVASKMRFGLENELQKICEIGKVLQDLGKTNDEINYYMNVDEDFISDVLSCYKG
Ga0209757_101961781 | 3300025873 | Marine | MSYYPAIVANQMKRGLSNEIAQICEIGKVLRDLGNSNREVNYYLNVDEDFIMD
Ga0208815_10617121 | 3300026134 | Marine Oceanic | MEKGIESELGQIYEISKVLREDGNSRTQVGYYMNVDEDFISDVLSCYKNIGVENG
Ga0256381_10223093 | 3300028018 | Seawater | MSYYPKTVASKMAKGLENDLQKICEIGKVLRELGKSRAEVNYYMSMDEDFISDVFNCYEG
Ga0256382_10122093 | 3300028022 | Seawater | MSYYPQTIASKMKKGLTNEISMICEIGKVLREDGNSRIQVSYYMNVDEDFISDVLSCYNN
Ga0256382_10230954 | 3300028022 | Seawater | MSYYPKIVSDNMKKGLTNELKQICEIGRVLRELGNDRHEVNYYMSVDEDFISDVLGCYNYKG
Ga0256382_10528952 | 3300028022 | Seawater | MRFNNYYPKAIANKMKKGLENELQKICEIGKVLREDGNTRIQVNYFLNVDEDFISDVLSCYKG
Ga0256382_10610401 | 3300028022 | Seawater | MSYYPKTVANKMTKGLKTQSEKLFEIGKVLHDLGNSRIEVRYYMSVDEDFISDVL
Ga0256382_10754862 | 3300028022 | Seawater | MKKGLTNEIRMICEIGRVLREDGNSNTQVNYYLNVDEDFISDVLGCYNYKG
Ga0256382_10774322 | 3300028022 | Seawater | MIKKITKNERENQMSYYPKTIADKMKKGLTNEIEMYAEINKALREDGNSRIQVNYYMNVDEDFISDVLGCYNYKG
Ga0256382_11212151 | 3300028022 | Seawater | ERRKMSYYPKTVASKMTKGLENDLQKICEIGKVLRELGKSRTEVNYYMSMDEDFISDVFNCYEG
Ga0256382_11324441 | 3300028022 | Seawater | MSYYPQTIASKMKKGLTNEIAMICEIGKVLREDGNSNTQVSYFLNVDEDFIPDVLGCYNHKG
Ga0302132_102158694 | 3300031605 | Marine | TGNLTKRKSMMDYPQEIASKMKKGLSNELRKICEIGRVLRDVEGLSRTEVNYHMSIDEDFISDVLSCYKGVDSKV
Ga0302119_103879921 | 3300031606 | Marine | MSYYPKIVSDNMKKGLTNELKQICEIGKVLRDLGNSRTEVCYMMSVDEDFISDVLGCYNYKGN
Ga0302120_100448484 | 3300031701 | Marine | MSYYPQIVSDNMKKGLTNELKQICEIGKVLRDLGNSRTEVCYMMSVDEDFISDVLGCYNYKGN
Ga0302120_102212913 | 3300031701 | Marine | MSYYPQTIADKMEKGLTNEIEMYAAISKALREDGNSRSQVSYYMSVDEDFIPDVLQAYA
Ga0302120_102284532 | 3300031701 | Marine | MRDYAEEVASQMKKGLSNENSKICEIGRVLRDEGLSHVEVNYHMNVDEDFISDVLQSYKGVEKRKPVSP
Ga0310121_100840455 | 3300031801 | Marine | MSYYPQTIADKMEKGLTKEIEMYAAISKALREDGNSKSQVNYYMSVDEDFIPDVLQAYA
Ga0310123_101314903 | 3300031802 | Marine | MSYYPKTVANNMAKGLTTQSEKLFEIGKVLHDLGNSRIEVRYYMSVDEDFISDVLSCYRA
Ga0310125_105195992 | 3300031811 | Marine | GKDMSYYPQTIADKMEKGLTKEIEMYAAISKALREDGNSKSQVNYYMSVDEDFIPDVLQAYA
Ga0316202_104531831 | 3300032277 | Microbial Mat | MDFAEKVASNMKKGLSNELRKICEIGRVLRDEGLSKTEVNYHMSIDEDFIPDVLQA
Ga0310345_105673623 | 3300032278 | Seawater | MSYYPQTIADKMEKGLTKEIELYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA
Ga0310345_112461602 | 3300032278 | Seawater | MSYYPQTIADKMEKGLTKEIEMYAAISKALREDGNSKSQVSYYMSVDEDFIPDVLQAYA
Ga0310342_1017361092 | 3300032820 | Seawater | MGYYPKTVASKMTKGLENELQQICEIGKVLRELGNSRTEVNYYMSMDEDFISDVLNCYKG
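
Some sequences in the table end with `*` and others do not. The sketch below counts residues while ignoring that marker. It assumes the trailing `*` is the usual stop-codon symbol of translated gene calls, which the page does not state explicitly, and the helper names `residue_count` and `has_stop_marker` are hypothetical.

```python
def has_stop_marker(sequence):
    """Return True if the sequence ends with '*' (assumed stop-codon marker)."""
    return sequence.strip().endswith("*")


def residue_count(sequence):
    """Count amino-acid residues, ignoring a single trailing '*' if present."""
    seq = sequence.strip()
    if seq.endswith("*"):
        seq = seq[:-1]
    return len(seq)


# Two sequences copied from this family: the representative sequence and the first table row.
representative = "MSYYPKTIADKMKKGLENELQQICEIGKVLREDGNSRTEVNYYMSVDEDFISDVLGCYNS"
first_row = "MKKGLTNEIEMYAEISKALREDGNSRTQVRYYMNVDEDFISDVLSCYK*"

for seq in (representative, first_row):
    print(residue_count(seq), has_stop_marker(seq))
```

Averaging `residue_count` over every row is one way to compare against the reported average sequence length, although whether the database excludes the `*` in its own figure is not stated.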




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"