NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F089026


Overview

Basic Information
Family ID F089026
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 39 residues
Representative Sequence MNKKSYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Number of Associated Samples 59
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 68.81 %
% of genes near scaffold ends (potentially truncated) 26.61 %
% of genes from short scaffolds (< 2000 bps) 74.31 %
Associated GOLD sequencing projects 46
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.33
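The percentages above can be read back as whole-number gene counts. The following is a minimal sketch, assuming each percentage is a fraction of the 109 family sequences; the helper name `percent_to_count` is hypothetical and not part of NMPFamsDB.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Recover the whole-number count behind a rounded percentage of `total`."""
    count = round(percent / 100 * total)
    # Guard: the recovered count must reproduce the reported two-decimal figure.
    if abs(100 * count / total - percent) > 0.0051:
        raise ValueError(f"{percent}% is not a whole count out of {total}")
    return count


FAMILY_SIZE = 109  # "Number of Sequences" reported for this family (assumed denominator)

reported = {
    "valid RBS motif": 68.81,
    "near scaffold end": 26.61,
    "short scaffold (< 2000 bp)": 74.31,
}

counts = {label: percent_to_count(pct, FAMILY_SIZE) for label, pct in reported.items()}
for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to 75, 29 and 81 genes respectively.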


Most Common Taxonomy
Group Unclassified (66.972 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine
(56.881 % of family members)
Environment Ontology (ENVO) Unclassified
(99.083 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(86.239 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 55.07%, β-sheet: 0.00%, Coil/Unstructured: 44.93%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.33

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
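The confidence rule stated above can be sketched in a few lines. This is only an illustration: the 0.7 cutoff and the four pLDDT bins come from this page, while the function names and the treatment of fractional scores at bin edges are assumptions.

```python
# Upper bound and label of each pLDDT bin, as listed on the page.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def plddt_bin(score: float) -> str:
    """Return the label of the pLDDT bin that contains `score`."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    for upper, label in PLDDT_BINS:
        # Assumption: a score exactly on an upper bound belongs to the lower bin.
        if score <= upper:
            return label
    raise AssertionError("unreachable: score was range-checked above")


def is_low_confidence_model(ptm: float, threshold: float = 0.7) -> bool:
    """Apply the page's rule: a model with pTM below the threshold is low confidence."""
    return ptm < threshold


print(is_low_confidence_model(0.33))  # pTM-score reported for family F089026
```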


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF01402 | RHH_1 | 1.83
PF01507 | PAPS_reduct | 1.83
PF14319 | Zn_Tnp_IS91 | 0.92
PF00145 | DNA_methylase | 0.92
PF00208 | ELFV_dehydrog | 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 0.92
COG0334 | Glutamate dehydrogenase/leucine dehydrogenase | Amino acid transport and metabolism [E] | 0.92
COG2902 | NAD-specific glutamate dehydrogenase | Amino acid transport and metabolism [E] | 0.92


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 66.97 %
All Organisms | root | All Organisms | 33.03 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002484|JGI25129J35166_1009644 | Not Available | 2517
3300002511|JGI25131J35506_1013998 | Not Available | 1103
3300002511|JGI25131J35506_1058475 | Not Available | 536
3300002514|JGI25133J35611_10012618 | All Organisms → cellular organisms → Archaea | 3604
3300002514|JGI25133J35611_10099015 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 863
3300002518|JGI25134J35505_10013949 | All Organisms → cellular organisms → Archaea | 2589
3300002760|JGI25136J39404_1006799 | Not Available | 1988
3300002760|JGI25136J39404_1011215 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1593
3300002760|JGI25136J39404_1082065 | Not Available | 604
3300002760|JGI25136J39404_1098950 | Not Available | 549
3300006324|Ga0068476_1356078 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 544
3300006336|Ga0068502_1342678 | All Organisms → cellular organisms → Bacteria | 963
3300006340|Ga0068503_10591525 | Not Available | 636
3300006736|Ga0098033_1036177 | Not Available | 1479
3300006736|Ga0098033_1067006 | Not Available | 1041
3300006736|Ga0098033_1074047 | Not Available | 983
3300006736|Ga0098033_1137033 | Not Available | 689
3300006736|Ga0098033_1214916 | Not Available | 531
3300006738|Ga0098035_1122879 | Not Available | 894
3300006751|Ga0098040_1232665 | Not Available | 535
3300006753|Ga0098039_1057052 | All Organisms → Viruses → Predicted Viral | 1361
3300006753|Ga0098039_1114484 | Not Available | 927
3300006789|Ga0098054_1253460 | Not Available | 635
3300006926|Ga0098057_1008164 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 2747
3300006926|Ga0098057_1052532 | Not Available | 999
3300006927|Ga0098034_1167800 | Not Available | 617
3300008220|Ga0114910_1058789 | All Organisms → Viruses → Predicted Viral | 1214
3300009412|Ga0114903_1027063 | Not Available | 1441
3300009414|Ga0114909_1066017 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1039
3300009414|Ga0114909_1070291 | Not Available | 998
3300009418|Ga0114908_1088442 | Not Available | 1049
3300009418|Ga0114908_1131003 | Not Available | 817
3300009602|Ga0114900_1019738 | Not Available | 2467
3300009602|Ga0114900_1039298 | Not Available | 1519
3300009602|Ga0114900_1088460 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 867
3300009613|Ga0105228_124491 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 559
3300009620|Ga0114912_1032434 | Not Available | 1401
3300009622|Ga0105173_1036737 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 794
3300009622|Ga0105173_1099749 | Not Available | 533
3300010155|Ga0098047_10018885 | All Organisms → cellular organisms → Archaea | 2772
3300010155|Ga0098047_10111417 | Not Available | 1066
3300010155|Ga0098047_10267931 | Not Available | 647
3300017775|Ga0181432_1002487 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 4100
3300017775|Ga0181432_1031385 | Not Available | 1425
3300017775|Ga0181432_1107135 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 837
3300017775|Ga0181432_1153170 | Not Available | 709
3300020398|Ga0211637_10003313 | Not Available | 7597
3300020407|Ga0211575_10445654 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 535
3300021791|Ga0226832_10016037 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 2426
(restricted) 3300024520|Ga0255047_10002992 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10823
3300025029|Ga0207900_112515 | Not Available | 716
3300025045|Ga0207901_1020957 | Not Available | 895
3300025050|Ga0207892_1024325 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 684
3300025069|Ga0207887_1003297 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 2503
3300025069|Ga0207887_1004114 | Not Available | 2240
3300025069|Ga0207887_1008429 | All Organisms → cellular organisms → Bacteria | 1584
3300025069|Ga0207887_1043787 | Not Available | 728
3300025078|Ga0208668_1067516 | Not Available | 645
3300025082|Ga0208156_1078735 | Not Available | 618
3300025097|Ga0208010_1034702 | Not Available | 1167
3300025097|Ga0208010_1035318 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1154
3300025109|Ga0208553_1050353 | Not Available | 1031
3300025112|Ga0209349_1001564 | All Organisms → cellular organisms → Archaea | 11297
3300025112|Ga0209349_1006874 | Not Available | 4727
3300025112|Ga0209349_1047003 | All Organisms → Viruses → Predicted Viral | 1366
3300025112|Ga0209349_1126083 | Not Available | 708
3300025118|Ga0208790_1035121 | Not Available | 1640
3300025122|Ga0209434_1048195 | Not Available | 1326
3300025125|Ga0209644_1012574 | Not Available | 1781
3300025125|Ga0209644_1024842 | All Organisms → Viruses → Predicted Viral | 1314
3300025125|Ga0209644_1030548 | Not Available | 1199
3300025125|Ga0209644_1080679 | Not Available | 762
3300025125|Ga0209644_1099080 | Not Available | 689
3300025125|Ga0209644_1108769 | Not Available | 657
3300025125|Ga0209644_1119782 | Not Available | 626
3300025125|Ga0209644_1129725 | Not Available | 601
3300025125|Ga0209644_1174226 | Not Available | 512
3300025131|Ga0209128_1002548 | All Organisms → cellular organisms → Archaea | 11770
3300025141|Ga0209756_1006976 | Not Available | 8184
3300025141|Ga0209756_1078097 | Not Available | 1495
3300025141|Ga0209756_1309920 | Not Available | 553
3300025251|Ga0208182_1001250 | Not Available | 12280
3300025251|Ga0208182_1026231 | All Organisms → Viruses → Predicted Viral | 1378
3300025264|Ga0208029_1004717 | Not Available | 4451
3300025267|Ga0208179_1119580 | Not Available | 500
3300025270|Ga0208813_1009756 | Not Available | 2821
3300025282|Ga0208030_1026653 | Not Available | 1835
3300025286|Ga0208315_1093925 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 720
3300025293|Ga0208934_1024796 | Not Available | 1197
3300025296|Ga0208316_1001320 | Not Available | 15128
3300025873|Ga0209757_10006672 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 2986
3300025873|Ga0209757_10039272 | All Organisms → cellular organisms → Archaea | 1370
3300025873|Ga0209757_10136068 | Not Available | 765
3300025873|Ga0209757_10136264 | Not Available | 765
3300025873|Ga0209757_10272442 | Not Available | 538
3300026103|Ga0208451_1013744 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 859
3300031801|Ga0310121_10017169 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 5373
3300031801|Ga0310121_10033855 | Not Available | 3561
3300031801|Ga0310121_10041248 | All Organisms → cellular organisms → Archaea | 3160
3300031801|Ga0310121_10077367 | Not Available | 2173
3300031801|Ga0310121_10215776 | Not Available | 1160
3300031801|Ga0310121_10369627 | Not Available | 822
3300031802|Ga0310123_10438817 | Not Available | 833
3300031803|Ga0310120_10023072 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 3798
3300031803|Ga0310120_10416382 | Not Available | 686
3300032278|Ga0310345_10004141 | Not Available | 13324
3300032820|Ga0310342_100023083 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 4825
3300032820|Ga0310342_103427463 | Not Available | 524
3300034656|Ga0326748_050919 | Not Available | 584

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 56.88%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 17.43%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 8.26%
Marine Oceanic | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine Oceanic | 3.67%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 3.67%
Seawater | Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater | 2.75%
Marine | Environmental → Aquatic → Marine → Oceanic → Aphotic Zone → Marine | 2.75%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 1.83%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 0.92%
Filtered Seawater | Environmental → Aquatic → Marine → Unclassified → Unclassified → Filtered Seawater | 0.92%
Hydrothermal Vent Fluids | Environmental → Aquatic → Marine → Hydrothermal Vents → Diffuse Flow → Hydrothermal Vent Fluids | 0.92%

Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002484 | Marine viral communities from the Pacific Ocean - ETNP_2_130 | Environmental
3300002511 | Marine viral communities from the Pacific Ocean - ETNP_2_1000 | Environmental
3300002514 | Marine viral communities from the Pacific Ocean - ETNP_6_85 | Environmental
3300002518 | Marine viral communities from the Pacific Ocean - ETNP_6_100 | Environmental
3300002760 | Marine viral communities from the Pacific Ocean - ETNP_6_1000 | Environmental
3300006324 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT231_1_0500m | Environmental
3300006336 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT238_2_0500m | Environmental
3300006340 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT238_2_0770m | Environmental
3300006736 | Marine viral communities from the Subarctic Pacific Ocean - 1_ETSP_OMZ_AT15124 metaG | Environmental
3300006738 | Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG | Environmental
3300006751 | Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG | Environmental
3300006753 | Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG | Environmental
3300006789 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG | Environmental
3300006926 | Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG | Environmental
3300006927 | Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG | Environmental
3300008220 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_908 | Environmental
3300009412 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s2 | Environmental
3300009414 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_906 | Environmental
3300009418 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s17 | Environmental
3300009602 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_231 | Environmental
3300009613 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3737_250 | Environmental
3300009620 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_51 | Environmental
3300009622 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3321_4155 | Environmental
3300010155 | Marine viral communities from the Subarctic Pacific Ocean - 12_ETSP_OMZ_AT15267 metaG | Environmental
3300017775 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 55 SPOT_SRF_2014-07-17 | Environmental
3300020398 | Marine microbial communities from Tara Oceans - TARA_B100000949 (ERX555993-ERR599072) | Environmental
3300020407 | Marine microbial communities from Tara Oceans - TARA_B100001105 (ERX556033-ERR599115) | Environmental
3300021791 | Hydrothermal fluids microbial communities from Mariana Back-Arc Basin vent fields, Pacific Ocean - Daikoku_FS921 150_kmer | Environmental
3300024520 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_1 | Environmental
3300025029 | Marine viral communities from the Pacific Ocean - LP-39 (SPAdes) | Environmental
3300025045 | Marine viral communities from the Pacific Ocean - LP-46 (SPAdes) | Environmental
3300025050 | Marine viral communities from the Pacific Ocean - LP-54 (SPAdes) | Environmental
3300025069 | Marine viral communities from the Pacific Ocean - LP-38 (SPAdes) | Environmental
3300025078 | Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG (SPAdes) | Environmental
3300025082 | Marine viral communities from the Subarctic Pacific Ocean - 1_ETSP_OMZ_AT15124 metaG (SPAdes) | Environmental
3300025097 | Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG (SPAdes) | Environmental
3300025109 | Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG (SPAdes) | Environmental
3300025112 | Marine viral communities from the Pacific Ocean - ETNP_2_130 (SPAdes) | Environmental
3300025118 | Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG (SPAdes) | Environmental
3300025122 | Marine viral communities from the Pacific Ocean - ETNP_2_300 (SPAdes) | Environmental
3300025125 | Marine viral communities from the Pacific Ocean - ETNP_2_1000 (SPAdes) | Environmental
3300025131 | Marine viral communities from the Pacific Ocean - ETNP_6_100 (SPAdes) | Environmental
3300025141 | Marine viral communities from the Pacific Ocean - ETNP_6_85 (SPAdes) | Environmental
3300025251 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_906 (SPAdes) | Environmental
3300025264 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s12 (SPAdes) | Environmental
3300025267 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_Geostar (SPAdes) | Environmental
3300025270 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_904 (SPAdes) | Environmental
3300025282 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_M9 (SPAdes) | Environmental
3300025286 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_215 (SPAdes) | Environmental
3300025293 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s2 (SPAdes) | Environmental
3300025296 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_231 (SPAdes) | Environmental
3300025873 | Marine viral communities from the Pacific Ocean - ETNP_6_1000 (SPAdes) | Environmental
3300026103 | Marine viral communities from the Southern Atlantic ocean transect to study dissolved organic matter and carbon cycling - metaG 3321_4155 (SPAdes) | Environmental
3300031801 | Marine microbial communities from Western Arctic Ocean, Canada - CB27_Tmax_986 | Environmental
3300031802 | Marine microbial communities from Western Arctic Ocean, Canada - CB6_AW_1057 | Environmental
3300031803 | Marine microbial communities from Western Arctic Ocean, Canada - CB27_AW_983 | Environmental
3300032278 | Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - HC15-DNA-20-500_MG | Environmental
3300032820 | Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - S1503-DNA-20-500_MG | Environmental
3300034656 | Seawater viral communities from Mid-Atlantic Ridge, Atlantic Ocean - 502_2477 | Environmental

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25129J35166_10096443 | 3300002484 | Marine | LKHKTPYTIIKNRFITEFIEKYSDDSNRHFIQVMKKWRDLEK*
JGI25131J35506_10139984 | 3300002511 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRQFIKILKQWRDLEK*
JGI25131J35506_10584751 | 3300002511 | Marine | MTKYKTIKNRFITEFIPKYSDKSDRHFIQVLKKWRDLEK*
JGI25133J35611_100126187 | 3300002514 | Marine | MTMPKKSYKDRCITEFNEYSDKSIRHFIQVMKQWRDLKK*
JGI25133J35611_100990153 | 3300002514 | Marine | MTRIKRSYKDRSITEFNEYSDESIRHFIQVKKQWKDLEK*
JGI25134J35505_100139493 | 3300002518 | Marine | MKIKSYKIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
JGI25136J39404_10067997 | 3300002760 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVMKKWRDLEK*
JGI25136J39404_10112154 | 3300002760 | Marine | MTRPTRSYKDRSITEFNEYSDKSIRHFIQVKKQWRDLEK*
JGI25136J39404_10820653 | 3300002760 | Marine | MTRIKRSYKDRSITEFNEYSDKSIRHFIQVKKQWKDLEK*
JGI25136J39404_10989501 | 3300002760 | Marine | MTRIKRSYKDRYITEFNEYSDDSNRHFIQVLKKWRDLEK*NLKKY
Ga0068476_13560781 | 3300006324 | Marine | MTRPYKTIKNRFITEFIEKYSDDSNRHFIQVIKQWRD
Ga0068502_13426781 | 3300006336 | Marine | MTKYKTIKNRFITEFIPKYSDDSNRHFIQVLKKWRDLEK*
Ga0068503_105915251 | 3300006340 | Marine | ETSRWFLMTRIKRSYKDRSITEFNEYSDKSIRHFIQVLKKWRDLEK*
Ga0098033_10361775 | 3300006736 | Marine | MNKKSYKTIKNRFITEFIPKYSDESDRHFIQVLKKWRDLEK*
Ga0098033_10670062 | 3300006736 | Marine | MKIKKSYKDRCITEFNEYSDQSKRHFIQVLKKWRDLQK*
Ga0098033_10740473 | 3300006736 | Marine | MKIKKSYKNRCITEFIKKYSDQSDRQFFQVLKKWRD
Ga0098033_11370332 | 3300006736 | Marine | LKHKTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKKWRTLKNE*
Ga0098033_12149162 | 3300006736 | Marine | MTKYKTIKNRCITEWIPKYSDDSNRQFIQVLKQWRDLEK*
Ga0098035_11228792 | 3300006738 | Marine | MPKKSYKDRCITEFNEYSDKSIRHFIQVMKQWRDLKK*
Ga0098040_12326651 | 3300006751 | Marine | SYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRNLQK*
Ga0098039_10570523 | 3300006753 | Marine | MTRIKRSYKDRSITEFNEYSDESNRHFIQVKKQWRDLEK*
Ga0098039_11144844 | 3300006753 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0098054_12534602 | 3300006789 | Marine | LKHKTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*SVWI
Ga0098057_10081641 | 3300006926 | Marine | MKIKKSYKDRCITEFNEYSDQSKRHFIQVLKKWRDL
Ga0098057_10525321 | 3300006926 | Marine | LKHKTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0098034_11678001 | 3300006927 | Marine | MTRIKKSYKNRCITEFNEYSDESKRHFIQVLKKWRDLEK*
Ga0114910_10587895 | 3300008220 | Deep Ocean | MIKSYKIIKNRFITEFIEKYSDDSNRHFIQVLKQWRDLEK*
Ga0114903_10270636 | 3300009412 | Deep Ocean | MTKYKTIKNRFITEFIEKYSDDSDRHFIKVMKQWRDLEK*
Ga0114909_10660173 | 3300009414 | Deep Ocean | MTKYKTIKNRFISEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0114909_10702914 | 3300009414 | Deep Ocean | MTRIKRSYKDRSITEFNEYSDKSIRHFIQVKKQWR
Ga0114908_10884421 | 3300009418 | Deep Ocean | TRPKRSYKDRYITEFNEYSDDSNRHFIQVLKQWRDLEK*
Ga0114908_11310033 | 3300009418 | Deep Ocean | MTRPYKTIKNRFITEFIPKYSDDSNRHFIQVLKKWRDLEK*N
Ga0114900_10197385 | 3300009602 | Deep Ocean | MTRPYKTIKNRFITEFIPKYSDDSNRHFIQVLKKWRDLEK*
Ga0114900_10392981 | 3300009602 | Deep Ocean | MMIKSYKIIKNRFITEFIEKYSDDSNRHFIQVLKQWRDLEK*
Ga0114900_10884601 | 3300009602 | Deep Ocean | MTRKRSYKDRSITEFNEYSDKSIRHFIQVKKQWRDLEK*
Ga0105228_1244912 | 3300009613 | Marine Oceanic | MTRIKTVYKNHFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0114912_10324344 | 3300009620 | Deep Ocean | MIKSYKIIKNRFITEFIEKYSDDSNRQFIQVLKKWRDLEK*
Ga0105173_10367371 | 3300009622 | Marine Oceanic | MTKYKTIKNRFITEFIEKYSDKSNRHFIQVLKKLRDLEK*
Ga0105173_10997492 | 3300009622 | Marine Oceanic | IGIGEKMTRIKRSYKDRSITEFNEYSDQSIRHFIQVMKQWRTLKNEM*
Ga0098047_100188855 | 3300010155 | Marine | MNKKSYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRTLKND*
Ga0098047_101114172 | 3300010155 | Marine | MKIKTAYKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0098047_102679311 | 3300010155 | Marine | HKTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK*
Ga0181432_10024877 | 3300017775 | Seawater | MTRPKRSYKDRCITEFNEYSDKSIRHFIQVKKQWRDLEK
Ga0181432_10313851 | 3300017775 | Seawater | KMTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0181432_11071354 | 3300017775 | Seawater | MTPKRSYKDRSITEFNEYSDESKRHFIQVLKQWRDLEK
Ga0181432_11531703 | 3300017775 | Seawater | MKIKKSYKNRCITEFNEYSDESKRHFIHVLKKWRDLEK
Ga0211637_1000331316 | 3300020398 | Marine | MTKYKTIKNRFITEFIEKYSDYSNRHFIQVMKQWRDFEK
Ga0211575_104456541 | 3300020407 | Marine | MTRKRSYKDRFITEFIEKYSDKSDRHFIQVMKQWRDLEK
Ga0226832_100160372 | 3300021791 | Hydrothermal Vent Fluids | MTRIKTAYKNHFITEFIEKYSDDSNRQFIQVLKQWRDLEK
(restricted) Ga0255047_1000299222 | 3300024520 | Seawater | MTRPTRSYKDRSITEFNEYSDESIRHFIQVKKQWRTLKND
Ga0207900_1125153 | 3300025029 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKQWRDLEK
Ga0207901_10209572 | 3300025045 | Marine | MKIKKSYKNRCITEFIPKYSDQSDRQFFQVLKKWRDLEK
Ga0207892_10243252 | 3300025050 | Marine | MTRPYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0207887_10032974 | 3300025069 | Marine | MTRIKTAYKNHFITEFIEKYSDDSNRHFIQVLKQWRDLEK
Ga0207887_10041147 | 3300025069 | Marine | MEQVIMTIKRSYKDRSITEFNEYSDKSIRHFIQVLKQWRDLEK
Ga0207887_10084292 | 3300025069 | Marine | MMIKSYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDKEEKE
Ga0207887_10437872 | 3300025069 | Marine | MIKSYKTIKNRFITEFIEKYSDDSNRQFIQVLKKWRDLEK
Ga0208668_10675161 | 3300025078 | Marine | KTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0208156_10787353 | 3300025082 | Marine | YKTIKNHFITEFIEKYSDDSKRHFIQVLKKWRDLEK
Ga0208010_10347021 | 3300025097 | Marine | LKHKTPYTIIKNRFITEFIEKYSDDSNRHFIQVLKK
Ga0208010_10353182 | 3300025097 | Marine | MKIKKSYKDRCITEFNEYSDQSKRHFIQVLKKWRDLQK
Ga0208553_10503535 | 3300025109 | Marine | MNKKSYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0209349_100156418 | 3300025112 | Marine | MTMPKKSYKDRCITEFNEYSDKSIRHFIQVMKQWRDLKK
Ga0209349_100687414 | 3300025112 | Marine | MIKRSYKDRSITEFNEYSDESIRHFIQVLKQWRDLEK
Ga0209349_10470034 | 3300025112 | Marine | MTRIKRSYKDRSITEFNEYSDESIRHFIQVKKQWKDLEK
Ga0209349_11260831 | 3300025112 | Marine | MTRIKRSYKDRSITEFNEYSDESIRHFIQVKKQWRD
Ga0208790_10351216 | 3300025118 | Marine | YKMTRPKRSYKDRSITEFNEYSDKSIRHFIQVLKKWRDLEK
Ga0209434_10481951 | 3300025122 | Marine | MTRKRSYKDRSITEFNEYSDKSIRHFIQVLKQWRDLEK
Ga0209644_10125747 | 3300025125 | Marine | MTRPYKTIKNRFITEFIPKYSDDSNRHFIQVLKKWRDLEK
Ga0209644_10248422 | 3300025125 | Marine | MTRIKTAYKNHFITEFIEKYSDKSIRHFIQVMKQWRDLEK
Ga0209644_10305485 | 3300025125 | Marine | MTTYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0209644_10806793 | 3300025125 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVMKKWRDLEK
Ga0209644_10990804 | 3300025125 | Marine | FLMTRIKRSYKDRSITEFNEYSDKSIRHFIQVKKQWKDLEK
Ga0209644_11087691 | 3300025125 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDL
Ga0209644_11197821 | 3300025125 | Marine | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEKXMNKKKT
Ga0209644_11297251 | 3300025125 | Marine | MKIKTAYKNHFITEFIKKYSDQSDRQFFQVLKKWRDLEK
Ga0209644_11742262 | 3300025125 | Marine | MKIKKSYKNRCITEFNEYSDESKRHFIQVLKQWRDLEK
Ga0209128_100254811 | 3300025131 | Marine | MKIKSYKIIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0209756_100697615 | 3300025141 | Marine | MIKRSYKDRSITEFNEYSDESNRQFIQVKKQWRELKK
Ga0209756_10780973 | 3300025141 | Marine | MKRSYKDRSITEFNEYSDESNRHFIQVKKQWRDLEK
Ga0209756_13099202 | 3300025141 | Marine | MKIKTAYKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEKX
Ga0208182_10012507 | 3300025251 | Deep Ocean | MKRSYKDRSITEFNEYSDESIRHFIQVLKQWRDLEK
Ga0208182_10262313 | 3300025251 | Deep Ocean | MMIKSYKIIKNRFITEFIEKYSDDSNRHFIQVLKQWRDLEK
Ga0208029_100471712 | 3300025264 | Deep Ocean | MTRPKRSYKDRYITEFNEYSDKSIRHFIHVKKQWRDLEK
Ga0208179_11195801 | 3300025267 | Deep Ocean | IKRSYKDRSITEFNEYSDKSIRHFIQVKKQWRDLEK
Ga0208813_10097564 | 3300025270 | Deep Ocean | MMIKRSYKNRFITEFIPKYSDESDRHFIQVLKKWRDLEK
Ga0208030_10266536 | 3300025282 | Deep Ocean | MTRIKRSYKDRSITKFNEYSDKSKRHFIQVKKQWRDLEK
Ga0208315_10939253 | 3300025286 | Deep Ocean | MTRKRSYKDRSITEFNEYSDKSIRHFIQVKKQWRDLEK
Ga0208934_10247963 | 3300025293 | Deep Ocean | MTKYKTIKNRFITEFIEKYSDDSDRHFIKVMKQWRDLEK
Ga0208316_100132017 | 3300025296 | Deep Ocean | MKRSYKDRSITEFNEYSDESIRHFIQVLKKWRDLEK
Ga0209757_100066721 | 3300025873 | Marine | MTRIKTSYKNRCITEFIKKYSDKSNRHIIQVLKQW
Ga0209757_100392721 | 3300025873 | Marine | MIKRSYKNRFITEFIKKYSDKSNRHFIQVLKQWRN
Ga0209757_101360684 | 3300025873 | Marine | MTKYKTIKNRFITEFIEKYSDESIRHFIQVKKQWRDLEK
Ga0209757_101362643 | 3300025873 | Marine | MTKYKTIKNRFITEFIEKYSDYSNRHFIQVLKKWRDLEK
Ga0209757_102724422 | 3300025873 | Marine | MTRIKRSYKDRSITEFNEYSDKSIRHFIQVKKQWKDLEK
Ga0208451_10137442 | 3300026103 | Marine Oceanic | MTKYKTIKNRFITEFIEKYSDKSNRHFIQVLKKLRDLEK
Ga0310121_100171691 | 3300031801 | Marine | MTKYKTIKNHFITEFIEKYSDPSNRHFIQVMKKWR
Ga0310121_1003385510 | 3300031801 | Marine | MKIKKSYKNRCITEFIEKYSDQSDRQFFQVLKKWRDLEK
Ga0310121_100412485 | 3300031801 | Marine | MKIKTSYKNRCITEFNEYSDQSDRQFFQVLKKWRDLEK
Ga0310121_100773673 | 3300031801 | Marine | MKIKSYKTIKNHFITEFIEKYSDPSNRHFIQVLKKWRDLEK
Ga0310121_102157762 | 3300031801 | Marine | MKIKSYKTIKNRFITEFIEKYSDKSDRHFIQVLKKWRDLEK
Ga0310121_103696274 | 3300031801 | Marine | TRPYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK
Ga0310123_104388172 | 3300031802 | Marine | MKIKTSYKNRFITEFNEYSDQSDRQFFQVLKKWRDLEK
Ga0310120_100230728 | 3300031803 | Marine | MTRIKRSYKNRSITEFNEYSDKSIRHFIQVLKQWRDLEK
Ga0310120_104163822 | 3300031803 | Marine | MTKYKTIKNRFITEFIEKYSDPSNRHFIQVLKKWRDLEK
Ga0310345_1000414119 | 3300032278 | Seawater | MTSKRSYKNRFITEFIEKYSDKSDRHFIQVKKQWRDLEK
Ga0310342_1000230836 | 3300032820 | Seawater | MTRKRSYKDRFITEFIEKYSDKSDRHFIQVKKQWRDLEK
Ga0310342_1034274633 | 3300032820 | Seawater | MTKYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEKXPKKYTN
Ga0326748_050919_296_439 | 3300034656 | Filtered Seawater | MTKYKTIKNRFITEFIQKYSDDSNRHFIQVLKKWRDLEKWKNDYSLI
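Several sequences above carry a trailing `*`, sometimes followed by extra residues. The sketch below trims a sequence at the first `*` before measuring its length. It assumes that `*` marks a stop codon and that anything after it is not part of the protein; the page does not state this, and the helper name `trim_at_stop` is hypothetical.

```python
from statistics import mean


def trim_at_stop(seq: str, stop_chars: str = "*") -> str:
    """Return `seq` up to, but excluding, the first stop character."""
    for i, ch in enumerate(seq):
        if ch in stop_chars:
            return seq[:i]
    return seq


# Three sequences copied from this page: two table entries and the representative.
examples = [
    "MTKYKTIKNRFITEFIEKYSDDSNRQFIKILKQWRDLEK*",
    "MTRIKRSYKDRYITEFNEYSDDSNRHFIQVLKKWRDLEK*NLKKY",
    "MNKKSYKTIKNRFITEFIEKYSDDSNRHFIQVLKKWRDLEK",
]

lengths = [len(trim_at_stop(s)) for s in examples]
print(lengths, round(mean(lengths), 1))
```

The three examples come out at 39, 39 and 41 residues, in line with the reported average length of 39. Passing `stop_chars="*X"` would also cut at the `X` seen in a few entries, if one assumes it plays the same role.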


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice