NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F089426


Overview

Basic Information
Family ID F089426
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 38 residues
Representative Sequence MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSCIF
Number of Associated Samples 60
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 82.57 %
% of genes near scaffold ends (potentially truncated) 20.18 %
% of genes from short scaffolds (< 2000 bps) 79.82 %
Associated GOLD sequencing projects 51
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.40
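
The percentages above can be turned back into approximate gene counts. The sketch below assumes each percentage is computed over the 109 family members listed under Basic Information (one gene per sequence); the helper name `count_from_percent` is hypothetical and not part of NMPFamsDB.

```python
def count_from_percent(percent: float, total: int) -> int:
    """Return the nearest whole-number count for a reported percentage."""
    return round(percent / 100 * total)


# "Number of Sequences" reported under Basic Information.
# Assumption: the quality percentages use this value as their denominator.
TOTAL_GENES = 109

# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 82.57,
    "near scaffold end": 20.18,
    "short scaffold (< 2000 bp)": 79.82,
}

counts = {label: count_from_percent(pct, TOTAL_GENES) for label, pct in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {TOTAL_GENES} genes")
```

Rounding to the nearest integer is used because the page reports percentages to two decimals only.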

Hidden Markov Model (HMM logo rendered with Skylign; image not included)

Most Common Taxonomy
Group Unclassified (64.220 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (41.284 % of family members)
Environment Ontology (ENVO) Unclassified (91.743 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (85.321 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.62%, β-sheet: 0.00%, Coil/Unstructured: 55.38%
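
As a rough cross-check of the transmembrane classification, the representative sequence can be scanned with a sliding-window hydropathy average. This is only a sketch: it uses the Kyte-Doolittle scale with a 19-residue window, which is an assumption made here for illustration and not the predictor used by NMPFamsDB (the page does not name one). A window mean above roughly 1.6 is commonly read as a candidate membrane-spanning segment.

```python
# Kyte-Doolittle hydropathy values per amino acid.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_means(seq: str, window: int) -> list[float]:
    """Mean hydropathy of every full-length window along the sequence."""
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# Representative sequence from the Basic Information block.
REPRESENTATIVE = "MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSCIF"
WINDOW = 19  # assumption: conventional window length for membrane helices

means = window_means(REPRESENTATIVE, WINDOW)
best_start = max(range(len(means)), key=means.__getitem__)  # 0-based index
best_mean = means[best_start]

print(f"most hydrophobic window starts at residue {best_start + 1}, "
      f"mean hydropathy {best_mean:.2f}")
```

The most hydrophobic window lies at the C-terminal end of the peptide, which agrees qualitatively with an alpha-helical transmembrane call, although a scan like this cannot replace a dedicated topology predictor.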

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.40

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
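
The two confidence measures quoted in this section can be expressed as simple checks. The sketch below takes the pTM cutoff (0.7) and the pLDDT ranges directly from this page; the function names are hypothetical, and the handling of fractional pLDDT values (for example 50.5) is an assumption, since the page lists integer ranges only.

```python
PTM_CUTOFF = 0.7  # "low confidence model (pTM < 0.7)" as stated on this page


def is_low_confidence(ptm: float) -> bool:
    """True when a model falls below the pTM cutoff quoted above."""
    return ptm < PTM_CUTOFF


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT value to one of the four ranges shown above.

    Assumption: a fractional value is assigned to the first range whose
    upper bound it does not exceed, so 50.5 lands in "51-70".
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    for upper, label in ((50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")):
        if score <= upper:
            return label
    raise AssertionError("unreachable for scores within 0-100")


print(is_low_confidence(0.40))  # the pTM-score reported for this family
print(plddt_bin(65))
```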



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF00476 | DNA_pol_A | 63.30
PF01612 | DNA_pol_A_exo1 | 7.34
PF13361 | UvrD_C | 1.83
PF11753 | DUF3310 | 0.92
PF01597 | GCV_H | 0.92
PF01636 | APH | 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG0749 | DNA polymerase I, 3'-5' exonuclease and polymerase domains | Replication, recombination and repair [L] | 63.30
COG0509 | Glycine cleavage system protein H (lipoate-binding) | Amino acid transport and metabolism [E] | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 64.22 %
All Organisms | root | All Organisms | 35.78 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000148|SI47jul10_100mDRAFT_c1026121 | All Organisms → Viruses → environmental samples → uncultured virus | 908
3300001450|JGI24006J15134_10033088 | All Organisms → Viruses → environmental samples → uncultured virus | 2248
3300001450|JGI24006J15134_10097405 | Not Available | 1063
3300001683|GBIDBA_10100220 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 1484
3300001743|JGI24515J20084_1005385 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1167
3300006083|Ga0081762_1085097 | All Organisms → Viruses → environmental samples → uncultured virus | 545
3300006164|Ga0075441_10073858 | Not Available | 1326
3300006164|Ga0075441_10104822 | Not Available | 1083
3300006190|Ga0075446_10092993 | Not Available | 890
3300006304|Ga0068504_1343918 | Not Available | 641
3300006308|Ga0068470_1149408 | All Organisms → Viruses → environmental samples → uncultured virus | 1332
3300006308|Ga0068470_1149409 | Not Available | 609
3300006310|Ga0068471_1103868 | Not Available | 5435
3300006310|Ga0068471_1232934 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2980
3300006310|Ga0068471_1417664 | Not Available | 2008
3300006310|Ga0068471_1427911 | Not Available | 1858
3300006310|Ga0068471_1504939 | Not Available | 1658
3300006310|Ga0068471_1645806 | Not Available | 2229
3300006324|Ga0068476_1185367 | Not Available | 825
3300006336|Ga0068502_1314936 | All Organisms → Viruses → environmental samples → uncultured virus | 1947
3300006336|Ga0068502_1424104 | Not Available | 933
3300006336|Ga0068502_1490573 | All Organisms → Viruses → environmental samples → uncultured virus | 1182
3300006338|Ga0068482_1259803 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Siphoviridae | 3027
3300006340|Ga0068503_10242306 | Not Available | 8166
3300006340|Ga0068503_10248234 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3820
3300006340|Ga0068503_10274813 | Not Available | 1455
3300006340|Ga0068503_10282560 | Not Available | 5657
3300006340|Ga0068503_10321056 | All Organisms → cellular organisms → Bacteria | 1613
3300006340|Ga0068503_10432226 | Not Available | 1594
3300006340|Ga0068503_10432227 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3989
3300006340|Ga0068503_10444528 | All Organisms → Viruses → environmental samples → uncultured virus | 1005
3300006340|Ga0068503_10444529 | Not Available | 703
3300006340|Ga0068503_10444591 | Not Available | 1072
3300006340|Ga0068503_10480986 | Not Available | 2652
3300006340|Ga0068503_10498322 | All Organisms → Viruses → environmental samples → uncultured virus | 1580
3300006340|Ga0068503_10553166 | All Organisms → Viruses | 2485
3300006340|Ga0068503_10561405 | All Organisms → Viruses → environmental samples → uncultured virus | 532
3300006340|Ga0068503_10563667 | Not Available | 640
3300006340|Ga0068503_10588638 | Not Available | 795
3300006340|Ga0068503_10594759 | All Organisms → cellular organisms → Bacteria | 746
3300006340|Ga0068503_10749652 | Not Available | 1253
3300006340|Ga0068503_10818114 | Not Available | 796
3300006340|Ga0068503_10851824 | Not Available | 630
3300006753|Ga0098039_1122140 | Not Available | 894
3300006754|Ga0098044_1208947 | All Organisms → Viruses → environmental samples → uncultured virus | 765
3300006902|Ga0066372_10408885 | Not Available | 786
3300006947|Ga0075444_10151188 | Not Available | 973
3300006947|Ga0075444_10202513 | Not Available | 801
3300006947|Ga0075444_10282889 | All Organisms → Viruses → environmental samples → uncultured virus | 644
3300008221|Ga0114916_1122445 | Not Available | 607
3300008470|Ga0115371_11023259 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Autographiviridae → Fussvirus | 632
3300009149|Ga0114918_10104877 | All Organisms → Viruses → environmental samples → uncultured virus | 1750
3300009149|Ga0114918_10133556 | Not Available | 1499
3300009173|Ga0114996_10787433 | Not Available | 689
3300009173|Ga0114996_11286220 | Not Available | 509
3300009173|Ga0114996_11292959 | Not Available | 507
3300009409|Ga0114993_10526251 | All Organisms → Viruses → environmental samples → uncultured virus | 876
3300009409|Ga0114993_11062950 | Not Available | 574
3300009409|Ga0114993_11121657 | Not Available | 556
3300009420|Ga0114994_10386028 | All Organisms → Viruses → environmental samples → uncultured virus | 928
3300009420|Ga0114994_10988418 | Not Available | 544
3300009425|Ga0114997_10005669 | Not Available | 9472
3300009425|Ga0114997_10320309 | Not Available | 853
3300009425|Ga0114997_10425581 | All Organisms → Viruses → environmental samples → uncultured virus | 715
3300009432|Ga0115005_11697290 | Not Available | 519
3300009595|Ga0105214_103750 | All Organisms → Viruses → environmental samples → uncultured virus | 869
3300009622|Ga0105173_1044070 | Not Available | 739
3300009705|Ga0115000_10166108 | Not Available | 1466
3300009705|Ga0115000_10715667 | Not Available | 618
3300009706|Ga0115002_11227318 | Not Available | 506
3300009786|Ga0114999_10596392 | Not Available | 839
3300009786|Ga0114999_11191436 | Not Available | 543
3300010883|Ga0133547_10550971 | Not Available | 2317
3300010883|Ga0133547_11679697 | All Organisms → Viruses → environmental samples → uncultured virus | 1181
3300017775|Ga0181432_1302568 | Not Available | 507
3300020303|Ga0211692_1019205 | Not Available | 913
(restricted) 3300022902|Ga0233429_1044500 | All Organisms → Viruses | 2131
3300024262|Ga0210003_1110501 | All Organisms → Viruses | 1234
(restricted) 3300024520|Ga0255047_10114447 | All Organisms → Viruses → environmental samples → uncultured virus | 1384
3300025039|Ga0207878_110727 | Not Available | 1097
3300025039|Ga0207878_114190 | Not Available | 914
3300025045|Ga0207901_1031403 | Not Available | 720
3300025050|Ga0207892_1010917 | Not Available | 957
3300025050|Ga0207892_1027753 | Not Available | 646
3300025052|Ga0207906_1053455 | Not Available | 538
3300025078|Ga0208668_1004583 | Not Available | 3275
3300025114|Ga0208433_1017664 | Not Available | 2050
3300025168|Ga0209337_1009552 | All Organisms → Viruses | 6217
3300025168|Ga0209337_1043384 | Not Available | 2387
3300025168|Ga0209337_1139814 | All Organisms → Viruses → environmental samples → uncultured virus | 1064
3300025168|Ga0209337_1332903 | Not Available | 530
3300025237|Ga0208031_1040719 | Not Available | 597
3300025897|Ga0209425_10043912 | Not Available | 3063
3300027687|Ga0209710_1124719 | Not Available | 973
3300027714|Ga0209815_1225966 | Not Available | 570
3300027779|Ga0209709_10028262 | All Organisms → Viruses | 3528
3300027801|Ga0209091_10033948 | Not Available | 3093
3300027813|Ga0209090_10507736 | All Organisms → Viruses → environmental samples → uncultured virus | 560
3300027839|Ga0209403_10394877 | All Organisms → Viruses → environmental samples → uncultured virus | 729
3300027839|Ga0209403_10641147 | Not Available | 507
3300027844|Ga0209501_10523802 | All Organisms → Viruses | 675
3300031601|Ga0307992_1065470 | All Organisms → Viruses | 1529
3300031627|Ga0302118_10323915 | Not Available | 707
3300031658|Ga0307984_1067769 | Not Available | 1082
3300031659|Ga0307986_10451781 | Not Available | 502
3300031801|Ga0310121_10178977 | All Organisms → Viruses | 1304
3300031801|Ga0310121_10336621 | Not Available | 874
3300032360|Ga0315334_11670623 | Not Available | 543
3300034629|Ga0326756_027471 | All Organisms → Viruses → environmental samples → uncultured virus | 678

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
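
Each scaffold identifier above pairs a numeric prefix with a scaffold name, separated by `|`, and the prefix matches the Taxon OID used in the Associated Samples table. The sketch below splits a few identifiers copied from the table and counts scaffolds per sample; the function name `split_scaffold_id` and the treatment of the `(restricted)` prefix as a flag are assumptions made for illustration.

```python
from collections import Counter


def split_scaffold_id(identifier: str) -> tuple[str, str, bool]:
    """Split 'TAXON_OID|SCAFFOLD' into its parts.

    Returns (taxon_oid, scaffold_name, restricted), where `restricted`
    records whether the entry carried the '(restricted)' prefix.
    """
    restricted = identifier.startswith("(restricted)")
    cleaned = identifier.replace("(restricted)", "", 1).strip()
    taxon_oid, scaffold_name = cleaned.split("|", 1)
    return taxon_oid, scaffold_name, restricted


# A handful of identifiers copied from the Associated Scaffolds table.
examples = [
    "3300000148|SI47jul10_100mDRAFT_c1026121",
    "3300001450|JGI24006J15134_10033088",
    "3300001450|JGI24006J15134_10097405",
    "(restricted) 3300022902|Ga0233429_1044500",
]

parsed = [split_scaffold_id(entry) for entry in examples]
scaffolds_per_sample = Counter(taxon_oid for taxon_oid, _, _ in parsed)

for taxon_oid, n in scaffolds_per_sample.items():
    print(taxon_oid, n)
```

Counting per Taxon OID in this way is one route to relating the 109 scaffolds to the 60 associated samples.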




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 41.28%
Marine | Environmental → Aquatic → Marine → Oceanic → Aphotic Zone → Marine | 31.19%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 7.34%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 2.75%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 2.75%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 1.83%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 1.83%
Marine Oceanic | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine Oceanic | 1.83%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.83%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 0.92%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 0.92%
Filtered Seawater | Environmental → Aquatic → Marine → Unclassified → Unclassified → Filtered Seawater | 0.92%
Pelagic Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Pelagic Marine | 0.92%
Diffuse Hydrothermal Flow Volcanic Vent | Environmental → Aquatic → Marine → Hydrothermal Vents → Diffuse Flow → Diffuse Hydrothermal Flow Volcanic Vent | 0.92%
Hydrothermal Vent Plume | Environmental → Aquatic → Marine → Hydrothermal Vents → Unclassified → Hydrothermal Vent Plume | 0.92%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 0.92%
Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment | 0.92%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000148 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - 47 07/07/10 100m | Environmental
3300001450 | Marine viral communities from the Pacific Ocean - LP-53 | Environmental
3300001683 | Hydrothermal vent plume microbial communities from Guaymas Basin, Gulf of California - IDBA assembly | Environmental
3300001743 | Marine viral communities from the Pacific Ocean - LP-38 | Environmental
3300006083 | Diffuse hydrothermal flow volcanic vent microbial communities from Axial Seamount, northeast Pacific ocean - Sample FS908_Marker33_DNA | Environmental
3300006164 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG002-DNA | Environmental
3300006190 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG058-DNA | Environmental
3300006304 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT238_1_1000m | Environmental
3300006308 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT229_2_0500m | Environmental
3300006310 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT229_3_0500m | Environmental
3300006324 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT231_1_0500m | Environmental
3300006336 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT238_2_0500m | Environmental
3300006338 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT232_1_0770m | Environmental
3300006340 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT238_2_0770m | Environmental
3300006753 | Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG | Environmental
3300006754 | Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG | Environmental
3300006902 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S15_td_250_ad_251m_LV_A | Environmental
3300006947 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG017-DNA | Environmental
3300008221 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_66 | Environmental
3300008470 | Sediment core microbial communities from Adelie Basin, Antarctica. Combined Assembly of Gp0136540, Gp0136562, Gp0136563 | Environmental
3300009149 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG | Environmental
3300009173 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_134 | Environmental
3300009409 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150 | Environmental
3300009420 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152 | Environmental
3300009425 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_136 | Environmental
3300009432 | Marine eukaryotic phytoplankton communities from Arctic Ocean - Arctic Ocean - Greenland ARC118M Metagenome | Environmental
3300009595 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3635_2500 | Environmental
3300009622 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3321_4155 | Environmental
3300009705 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128 | Environmental
3300009706 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_86 | Environmental
3300009786 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_126 | Environmental
3300010883 | western Arctic Ocean co-assembly | Environmental
3300017775 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 55 SPOT_SRF_2014-07-17 | Environmental
3300020303 | Marine microbial communities from Tara Oceans - TARA_B100000745 (ERX556095-ERR599124) | Environmental
3300022902 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_135_MG | Environmental
3300024262 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG (SPAdes) | Environmental
3300024520 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_1 | Environmental
3300025039 | Marine viral communities from the Pacific Ocean - LP-41 (SPAdes) | Environmental
3300025045 | Marine viral communities from the Pacific Ocean - LP-46 (SPAdes) | Environmental
3300025050 | Marine viral communities from the Pacific Ocean - LP-54 (SPAdes) | Environmental
3300025052 | Marine viral communities from the Pacific Ocean - LP-37 (SPAdes) | Environmental
3300025078 | Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG (SPAdes) | Environmental
3300025114 | Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG (SPAdes) | Environmental
3300025168 | Marine viral communities from the Pacific Ocean - LP-53 (SPAdes) | Environmental
3300025237 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_38 (SPAdes) | Environmental
3300025897 | Pelagic Microbial community sample from North Sea - COGITO 998_met_05 (SPAdes) | Environmental
3300027687 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_138 (SPAdes) | Environmental
3300027714 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG002-DNA (SPAdes) | Environmental
3300027779 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_136 (SPAdes) | Environmental
3300027801 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128 (SPAdes) | Environmental
3300027813 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152 (SPAdes) | Environmental
3300027839 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_86 (SPAdes) | Environmental
3300027844 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_134 (SPAdes) | Environmental
3300031601 | Marine microbial communities from Ellis Fjord, Antarctic Ocean - #133 | Environmental
3300031627 | Marine microbial communities from Western Arctic Ocean, Canada - AG5_33.1 | Environmental
3300031658 | Marine microbial communities from Ellis Fjord, Antarctic Ocean - #78 | Environmental
3300031659 | Marine microbial communities from Ellis Fjord, Antarctic Ocean - #82 | Environmental
3300031801 | Marine microbial communities from Western Arctic Ocean, Canada - CB27_Tmax_986 | Environmental
3300032360 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 500m 34915 | Environmental
3300034629 | Seawater viral communities from Mid-Atlantic Ridge, Atlantic Ocean - 543_2600 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
SI47jul10_100mDRAFT_10261212 | 3300000148 | Marine | MLIYGKTPKDYLELAKAHKKATAIAVIIVIAVLYCIF*
JGI24006J15134_100330883 | 3300001450 | Marine | MLIYGKTPNDYLELAKAHKKETAIAAIIVIAVLYCIF*
JGI24006J15134_100974052 | 3300001450 | Marine | MLIYGKTPNDYLKVAKAHKKQTAIAAIIVIAVLYCIF*
GBIDBA_101002205 | 3300001683 | Hydrothermal Vent Plume | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVVVVLSWIF*
JGI24515J20084_10053854 | 3300001743 | Marine | IYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSWIF*
Ga0081762_10850972 | 3300006083 | Diffuse Hydrothermal Flow Volcanic Vent | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVIAVLSWIF*
Ga0075441_100738583 | 3300006164 | Marine | MLIYGKTPKDYLELAKAHKKGTAVAAIIVIAILYCIS*
Ga0075441_101048221 | 3300006164 | Marine | GVIMLIYGKTPQDYLELAKAHKKETAIAAIIVIAVLYCIF*
Ga0075446_100929933 | 3300006190 | Marine | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVVAVLSWIF*
Ga0068504_13439181 | 3300006304 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSFIF*
Ga0068470_11494083 | 3300006308 | Marine | MLIYGKTLNDYLKLAKAHKKKVAVGLIIVVVVLSWIF*
Ga0068470_11494093 | 3300006308 | Marine | MLIYGKTLNDYLKLAKAHKRKVAVGLIIVVAVLYSIF*
Ga0068471_110386813 | 3300006310 | Marine | MLIYGKTLNDYLKLAKAHKRKVAVGLIIVIAVLSWIF*
Ga0068471_12329341 | 3300006310 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSFI
Ga0068471_14176642 | 3300006310 | Marine | MLIYGKTLNNYLKLAKAHKKKIAVGLIIVVAVLYCIF*
Ga0068471_14279114 | 3300006310 | Marine | MLIYGKTLNDYLKLAKANKKKVAIGLIIVVAVLSWIF*
Ga0068471_15049393 | 3300006310 | Marine | MLIYGKTPNDYLKLAKAHKRKVAVGLIIVVAVLYSIF*
Ga0068471_16458062 | 3300006310 | Marine | MLIYGKTLNDYLKLAKAHKKKVAVGLIIVVAVLSWIF*
Ga0068476_11853672 | 3300006324 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSCIF*
Ga0068502_13149362 | 3300006336 | Marine | MLIYGKTLNDYLKLAKAHKRKVAVGLIIVVAVLSCIF*
Ga0068502_14241042 | 3300006336 | Marine | MLIYGKTPNEYLKLAKAHKKKVAVGLIIVVVVLSWIF*
Ga0068502_14905733 | 3300006336 | Marine | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVIAVLSWIF*
Ga0068482_12598033 | 3300006338 | Marine | MLIYGKTPNDYLKLAKAHQRKIAVGLIIVVAVLYCIF*
Ga0068503_1024230610 | 3300006340 | Marine | MLIYGKTLNDYLELAKAHKKKVAVGLIVVAVLYCIF*
Ga0068503_102482343 | 3300006340 | Marine | MLIYGKTLNDYLKLVKANKKKVAIGLIIVVAVLSWIF*
Ga0068503_102748132 | 3300006340 | Marine | MLIYGRTLNDYLKLAKAHKKKVAVGLIIVVAVLSFIF*
Ga0068503_102825603 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVVVGLIIVVAVLSFIF*
Ga0068503_103210563 | 3300006340 | Marine | MLIYGKTLNDYLKLAKAHKRKVAVGLIIVVAVLSWIF*
Ga0068503_104322262 | 3300006340 | Marine | MLIYGKTPNEYLKLAKAHKKKVAVGLIIVVAVLSWIF*
Ga0068503_104322275 | 3300006340 | Marine | MLIYGKTLNDYLKLAKAHKRKVAIGLIIVVAVLSWIF*
Ga0068503_104445282 | 3300006340 | Marine | MLIYGRTPNDYLKLAKAHKKKVAVGLIIVIAVLSWIF*
Ga0068503_104445292 | 3300006340 | Marine | MLIYGKTLNDYLKLAKAHKKKVVIGLIIVVAVLSWIF*
Ga0068503_104445912 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVVIGLIIVVAVLSFIF*
Ga0068503_104809865 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIKVVAVLSWIF*
Ga0068503_104983221 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVVAVISFIF*
Ga0068503_105531665 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHQKKVAVGLIIVIAVLYCIF*
Ga0068503_105614052 | 3300006340 | Marine | MLIYGRTPNDYLKLAKAHKKKVAIGLIIVIAVLYCIF*
Ga0068503_105636671 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVVAVLSFIF*
Ga0068503_105886381 | 3300006340 | Marine | YGRTPNDYLKLAKAHKRKVAIGLIIVVVVLYCIF*
Ga0068503_105947593 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKRKVAVGLIIVVAVLYWIF*
Ga0068503_107496521 | 3300006340 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLKIVVAVLSWIF*
Ga0068503_108181141 | 3300006340 | Marine | MLIYGKTLDDYLKLAKAHKRKVAVGLIIVVAVLYSIF*
Ga0068503_108518242 | 3300006340 | Marine | MLIYGRTPNDYLKLAKAHKKKVAVGLIIVIAVLYCIF*
Ga0098039_11221402 | 3300006753 | Marine | MLIYGKTPNDYLKLAKAHKRKVAVGLIIVVAVLSWIF*
Ga0098044_12089472 | 3300006754 | Marine | MLIYGKTLNDYLKLAKAHKRKVAAGIIIVIVVLYSIF*
Ga0066372_104088851 | 3300006902 | Marine | MLIYGKTLNDYLKLAKAHKRKVAIGLIIVVAVLYSIF*
Ga0075444_101511882 | 3300006947 | Marine | MLIYGKTPNDYLKLAKAHQKKVAIGLIIVIAVLYCIF*
Ga0075444_102025132 | 3300006947 | Marine | MLIYGKTPNDYLELAKAHKKQTAIAVIIVIAVLYCIF*
Ga0075444_102828892 | 3300006947 | Marine | MLIYGKTPNDYLKLAKAHKKEVAIGLIIVVAVLSWIF*
Ga0114916_11224453 | 3300008221 | Deep Ocean | IILRIKIGVIMLIYGKTPNDYLKVAKAHKKETAIAAIIVIAVLYCIF*
Ga0115371_110232591 | 3300008470 | Sediment | ILEIKLALIMLIYGKTTNDYLLLAKSHKTQTAIAVIIVIAVLYCIF*
Ga0114918_101048773 | 3300009149 | Deep Subsurface | MLIYGKTPNDYLKLAKAHKKETAVIAIIVIAVLYCIF*
Ga0114918_101335563 | 3300009149 | Deep Subsurface | MLIYGKTPNDYLELAKAHKKATAIAAIIVIAVLYCIF*
Ga0114996_107874331 | 3300009173 | Marine | MLIYGKTPKDYLNVAKAHKKETAIAAIIVIAVLYCIF*
Ga0114996_112862201 | 3300009173 | Marine | MLLYNTQIKIGVIMLIYGKTPKDYLELAKAHKKGTAVAAIIVIAILYCIS*
Ga0114996_112929592 | 3300009173 | Marine | IYGKTPKDYLELAKAHKKGTAVAAIIVIAILYCIS*
Ga0114993_105262512 | 3300009409 | Marine | MLIYGKELKDYLELAKVHKKETAIAAIIVIAVLYCIF*
Ga0114993_110629502 | 3300009409 | Marine | MLIYGKTPNDYLKVAKAHKKQTAIAAIIEIAILYCIF*
Ga0114993_111216572 | 3300009409 | Marine | MLIYGKTLKDYLKVVKEHKKETAIVAIIVIAILYCIS*
Ga0114994_103860282 | 3300009420 | Marine | MLIYGKKLKDYLELAKVHKKETAIAAIIVIAVLYCIF*
Ga0114994_109884181 | 3300009420 | Marine | MLIYGKTPNDYLKVAKAHKKETAIAAIIVIAILYCIF*
Ga0114997_1000566916 | 3300009425 | Marine | MLIYGKTPNDYVKIAKAHKKETAIAVIIVIAVLYCIF*
Ga0114997_103203091 | 3300009425 | Marine | MLIYGKTPNDYLKVAKAHKKQTAIAVIVVIAVLYCIF*
Ga0114997_104255811 | 3300009425 | Marine | MLIYGKTPNDYLELAKAHKKATAIAAIIVIAVLYC
Ga0115005_116972901 | 3300009432 | Marine | MLIYGKTPNDYLELAKAHKKATVIAAIIVIAILYCIF*
Ga0105214_1037501 | 3300009595 | Marine Oceanic | MLIYGKTPNDYLKLAKAHKKKVAIGLIIVIAVLSFIF*
Ga0105173_10440701 | 3300009622 | Marine Oceanic | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSWIF*
Ga0115000_101661081 | 3300009705 | Marine | IYGKTPNDYLELAKAHKKETAIAAIIVIAILYCIF*
Ga0115000_107156673 | 3300009705 | Marine | MLIYGKTLNDYLKLAKAHKKQTAIAVIIVIADLYCIF*
Ga0115002_112273182 | 3300009706 | Marine | MLIYGKTPKDYLELAKAHKKGTAVAVIIVIAILYCIS*
Ga0114999_105963922 | 3300009786 | Marine | MLIYGKTPNDYLKLAKAHQKKVAVGLIIVVAVLSWIF*
Ga0114999_111914362 | 3300009786 | Marine | MLIYGKTPNDYLKVAKAHKKETAIAAIIVIAVLYCIF*
Ga0133547_105509712 | 3300010883 | Marine | MLIYGKTPNDYLELAKAHKKATAVAAIIVIAVLYCIF*
Ga0133547_116796973 | 3300010883 | Marine | MLIYGKTLNDYLKLAKAHKKQTAIAVIIVIAVLYCIF*
Ga0181432_13025682 | 3300017775 | Seawater | MLIYGKTPNDYLKLAKSHKKEVAIGLIVVIAILYCIF
Ga0211692_10192052 | 3300020303 | Marine | MLIYGKTPNDYLKLAKAHKKKVVVGLIIVVAVLSFIF
(restricted) Ga0233429_10445004 | 3300022902 | Seawater | MLIYGKTPKDYLELAKAHKKATAIAVIIVIAVLYCIF
Ga0210003_11105013 | 3300024262 | Deep Subsurface | MLIYGKTPNDYLKLAKAHKKETAVIAIIVIAVLYCIF
(restricted) Ga0255047_101144471 | 3300024520 | Seawater | MLIYGKTPKDYLELAKAHKKATAIAAIAVIIVIAVLYCIF
Ga0207878_1107274 | 3300025039 | Marine | MLIYGKTLNDYLELAKAHKKKVAVGLIVVAVLYCIF
Ga0207878_1141901 | 3300025039 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSFIF
Ga0207901_10314033 | 3300025045 | Marine | INKIGVIMLIYGRTPNDYLKLAKAHKKKVAVGLIIVIAVLSWIF
Ga0207892_10109171 | 3300025050 | Marine | MLIYGRTPNDYLKLAKAHKKKVAVGLIIVIAVLSWIF
Ga0207892_10277531 | 3300025050 | Marine | IILTTKIGIIMLIYGKTLNDYLELAKAHKKKVAVGLIVVAVLYCIF
Ga0207906_10534552 | 3300025052 | Marine | MLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSWI
Ga0208668_10045834 | 3300025078 | Marine | MLIYGKTLNNYLKLAKAHKKKIAVGLIIVVAVLYCIF
Ga0208433_10176642 | 3300025114 | Marine | MLIYGKTLNDYLKLAKAHKRKVAVGLIIVVAVLSWIF
Ga0209337_10095526 | 3300025168 | Marine | MLIYGKTPNDYLKVAKAHKKQTAIAAIIVIAVLYCIF
Ga0209337_10433842 | 3300025168 | Marine | MLIYGKTPNDYVKIAKAHKKETAIAVIIVIAVLYCIF
Ga0209337_11398142 | 3300025168 | Marine | MLIYGKTPKDYLNVAKAHKKETAIAAIIVIAVLYCIF
Ga0209337_13329031 | 3300025168 | Marine | TILQIKIGVIMLIYGKTPNDYLELAKAHKKQTAIAVIIVIAVLYCIF
Ga0208031_10407192 | 3300025237 | Deep Ocean | MLIYGKTPNDYLELAKAHKKQTAIAVIIVIAVLYCIF
Ga0209425_100439121 | 3300025897 | Pelagic Marine | MLIYGKTPKDYLELAKAHKKATAIAVIIVIAVLYC
Ga0209710_11247196 | 3300027687 | Marine | MLIYGKTPNDYLELAKAHKKATAIAAIIVIAVLYCIF
Ga0209815_12259662 | 3300027714 | Marine | MLIYGKTPKDYLELAKAHKKGTAVAAIIVIAILYCIS
Ga0209709_100282622 | 3300027779 | Marine | MLIYGKTPNDYLELAKAHKKETAIAAIIVIAVLYCIF
Ga0209091_100339487 | 3300027801 | Marine | MLIYGKTPNDYLKVAKAHKKQTAIAVIVVIAVLYCIF
Ga0209090_105077362 | 3300027813 | Marine | MLIYGKTPNDYLELAKAHKKETAIAAIIVIAVLYC
Ga0209403_103948772 | 3300027839 | Marine | MLIYGKTPKDYLELAKAHKKGTAVAAIIVIAILYC
Ga0209403_106411472 | 3300027839 | Marine | IYGKTPKDYLELAKAHKKGTAVAAIIVIAILYCIS
Ga0209501_105238021 | 3300027844 | Marine | LQIKIGVIMLIYGKTPNDYLKVAKAHKKQTAIAAIIVIAVLYCIF
Ga0307992_10654702 | 3300031601 | Marine | MLIYGKTPKDYLELAKAHKKETAIAAIIVIAVLYCIF
Ga0302118_103239151 | 3300031627 | Marine | MLIYGKTPNDYLELVKEHKKATAIAAIIVIAVLYCIF
Ga0307984_10677692 | 3300031658 | Marine | MLIYGKTPRDYLELAKAHKKETAIAAIIVLAVLYCIF
Ga0307986_104517811 | 3300031659 | Marine | GVIMLIYGKTPNDYLELAKAHKKATAIAAIIVIAVLYCIF
Ga0310121_101789775 | 3300031801 | Marine | CINKIGVIMLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSWIF
Ga0310121_103366211 | 3300031801 | Marine | MLIYGKTLNDYLKLVKANKKKVAIGLIIVIAVLSWIF
Ga0315334_116706233 | 3300032360 | Seawater | IMLIYGKTPNDYLKLAKAHKKKVAVGLIIVVAVLSWIF
Ga0326756_027471_268_381 | 3300034629 | Filtered Seawater | MLIYGKTLNDYLKLAKANKKKVAVGLIIVVAVLSWIF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice