NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F095309


Overview

Basic Information
Family ID F095309
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 63 residues
Representative Sequence MSYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Number of Associated Samples 74
Number of Associated Scaffolds 105
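
As a quick consistency check on the values above, the representative sequence can be measured and written out as a FASTA record. This is a minimal Python sketch: the helper name `to_fasta` and the record identifier `F095309_representative` are illustrative assumptions, not an NMPFamsDB export format.

```python
# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = "MSYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL"


def to_fasta(seq_id: str, sequence: str, width: int = 60) -> str:
    """Return a FASTA record, wrapping the sequence at `width` residues."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + seq_id + "\n" + "\n".join(lines)


# Hypothetical identifier, chosen only for this example.
record = to_fasta("F095309_representative", REPRESENTATIVE)
print(len(REPRESENTATIVE))  # 63 residues
print(record)
```

The representative happens to be exactly as long as the reported family average of 63 residues.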

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 88.57 %
% of genes near scaffold ends (potentially truncated) 16.19 %
% of genes from short scaffolds (< 2000 bps) 69.52 %
Associated GOLD sequencing projects 61
AlphaFold2 3D model prediction No
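
The percentages above can be turned back into approximate member counts. The sketch below assumes each percentage was computed over the 105 family members, which the page does not state explicitly; `implied_count` is a hypothetical helper name.

```python
FAMILY_SIZE = 105  # "Number of Sequences" reported for F095309


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of members matching a rounded percentage."""
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 88.57,
    "near scaffold end": 16.19,
    "short scaffold (< 2000 bp)": 69.52,
}

counts = {label: implied_count(p) for label, p in reported.items()}
for label, n in counts.items():
    # Recompute the percentage from the count to confirm the round trip.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.2f} %)")
```

Under that assumption, the figures correspond to 93, 17 and 73 genes respectively.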


Most Common Taxonomy
Group Unclassified (50.476 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Strait → Unclassified → Seawater (20.952 % of family members)
Environment Ontology (ENVO) Unclassified (64.762 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (92.381 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 85.71%, β-sheet: 0.00%, Coil/Unstructured: 14.29%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 105 Family Scaffolds
PF08291 | Peptidase_M15_3 | 1.90
PF11753 | DUF3310 | 0.95
PF13662 | Toprim_4 | 0.95
PF00145 | DNA_methylase | 0.95
PF00476 | DNA_pol_A | 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 0.95
COG0749 | DNA polymerase I, 3'-5' exonuclease and polymerase domains | Replication, recombination and repair [L] | 0.95



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 50.48 %
All Organisms | root | All Organisms | 49.52 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp) | IMG/M Link
3300000116|DelMOSpr2010_c10024711All Organisms → Viruses → Predicted Viral2874Open in IMG/M
3300000116|DelMOSpr2010_c10028262All Organisms → Viruses → Predicted Viral2647Open in IMG/M
3300000116|DelMOSpr2010_c10032190All Organisms → Viruses → Predicted Viral2440Open in IMG/M
3300000116|DelMOSpr2010_c10146936Not Available813Open in IMG/M
3300000116|DelMOSpr2010_c10162942Not Available750Open in IMG/M
3300000116|DelMOSpr2010_c10171525Not Available721Open in IMG/M
3300000117|DelMOWin2010_c10039005All Organisms → Viruses → Predicted Viral2215Open in IMG/M
3300005941|Ga0070743_10020256All Organisms → Viruses → Predicted Viral2318Open in IMG/M
3300005942|Ga0070742_10181595Not Available587Open in IMG/M
3300006026|Ga0075478_10013374All Organisms → Viruses → Predicted Viral2792Open in IMG/M
3300006752|Ga0098048_1147123Not Available703Open in IMG/M
3300006752|Ga0098048_1154478Not Available684Open in IMG/M
3300006752|Ga0098048_1211946Not Available570Open in IMG/M
3300006789|Ga0098054_1303711Not Available570Open in IMG/M
3300006790|Ga0098074_1063177All Organisms → Viruses → Predicted Viral1018Open in IMG/M
3300006793|Ga0098055_1045964All Organisms → Viruses → Predicted Viral1772Open in IMG/M
3300006793|Ga0098055_1100214All Organisms → Viruses → Predicted Viral1132Open in IMG/M
3300006810|Ga0070754_10036151Not Available2714Open in IMG/M
3300006919|Ga0070746_10147672All Organisms → Viruses → Predicted Viral1147Open in IMG/M
3300006919|Ga0070746_10168695All Organisms → Viruses → Predicted Viral1057Open in IMG/M
3300006922|Ga0098045_1010872All Organisms → Viruses → Predicted Viral2580Open in IMG/M
3300006922|Ga0098045_1059456Not Available935Open in IMG/M
3300006924|Ga0098051_1113096All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales725Open in IMG/M
3300006924|Ga0098051_1152747Not Available610Open in IMG/M
3300007229|Ga0075468_10204528Not Available575Open in IMG/M
3300007229|Ga0075468_10209967Not Available565Open in IMG/M
3300007276|Ga0070747_1083021All Organisms → Viruses → Predicted Viral1194Open in IMG/M
3300007276|Ga0070747_1168299Not Available782Open in IMG/M
3300007344|Ga0070745_1261027Not Available624Open in IMG/M
3300007345|Ga0070752_1115009All Organisms → Viruses → Predicted Viral1137Open in IMG/M
3300007345|Ga0070752_1183132Not Available844Open in IMG/M
3300007539|Ga0099849_1164338Not Available851Open in IMG/M
3300007557|Ga0102821_1014289All Organisms → Viruses → Predicted Viral2178Open in IMG/M
3300007627|Ga0102869_1096916Not Available807Open in IMG/M
3300008012|Ga0075480_10629722Not Available505Open in IMG/M
3300008995|Ga0102888_1041404Not Available882Open in IMG/M
3300009024|Ga0102811_1142162Not Available898Open in IMG/M
3300009086|Ga0102812_10321782Not Available841Open in IMG/M
3300010300|Ga0129351_1312800Not Available593Open in IMG/M
3300011258|Ga0151677_1086868Not Available644Open in IMG/M
3300017697|Ga0180120_10339981Not Available595Open in IMG/M
3300017709|Ga0181387_1006654All Organisms → Viruses → Predicted Viral2267Open in IMG/M
3300017710|Ga0181403_1001890All Organisms → Viruses → Predicted Viral4860Open in IMG/M
3300017710|Ga0181403_1012721All Organisms → Viruses → Predicted Viral1804Open in IMG/M
3300017719|Ga0181390_1070271Not Available986Open in IMG/M
3300017729|Ga0181396_1030255All Organisms → Viruses → Predicted Viral1078Open in IMG/M
3300017730|Ga0181417_1001945All Organisms → cellular organisms → Bacteria → Proteobacteria6024Open in IMG/M
3300017731|Ga0181416_1009307All Organisms → Viruses → Predicted Viral2338Open in IMG/M
3300017731|Ga0181416_1068494Not Available839Open in IMG/M
3300017742|Ga0181399_1018383All Organisms → Viruses → Predicted Viral1970Open in IMG/M
3300017742|Ga0181399_1129838Not Available613Open in IMG/M
3300017744|Ga0181397_1011354All Organisms → Viruses → Predicted Viral2728Open in IMG/M
3300017744|Ga0181397_1017063All Organisms → Viruses → Predicted Viral2166Open in IMG/M
3300017749|Ga0181392_1077007Not Available1007Open in IMG/M
3300017752|Ga0181400_1229085Not Available506Open in IMG/M
3300017763|Ga0181410_1022020All Organisms → Viruses → Predicted Viral2091Open in IMG/M
3300017764|Ga0181385_1145512All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Yersiniaceae720Open in IMG/M
3300017772|Ga0181430_1070775All Organisms → Viruses → Predicted Viral1062Open in IMG/M
3300017779|Ga0181395_1006542All Organisms → Viruses → Predicted Viral4271Open in IMG/M
3300017779|Ga0181395_1139147Not Available767Open in IMG/M
3300017782|Ga0181380_1014282All Organisms → Viruses → Predicted Viral3015Open in IMG/M
3300018416|Ga0181553_10328277Not Available845Open in IMG/M
3300018420|Ga0181563_10252275Not Available1051Open in IMG/M
3300019459|Ga0181562_10559887Not Available538Open in IMG/M
3300020169|Ga0206127_1058336All Organisms → Viruses → Predicted Viral1885Open in IMG/M
3300020169|Ga0206127_1300026Not Available534Open in IMG/M
3300021335|Ga0213867_1089981All Organisms → Viruses → Predicted Viral1111Open in IMG/M
3300021347|Ga0213862_10094106All Organisms → Viruses → Predicted Viral1055Open in IMG/M
3300021356|Ga0213858_10002211Not Available9285Open in IMG/M
3300021356|Ga0213858_10021064All Organisms → Viruses → Predicted Viral3112Open in IMG/M
3300021373|Ga0213865_10048987All Organisms → Viruses → Predicted Viral2341Open in IMG/M
3300021373|Ga0213865_10062422All Organisms → Viruses → Predicted Viral2043Open in IMG/M
3300021373|Ga0213865_10074059All Organisms → Viruses → Predicted Viral1850Open in IMG/M
3300021373|Ga0213865_10080848All Organisms → Viruses → Predicted Viral1757Open in IMG/M
3300021375|Ga0213869_10002393Not Available13003Open in IMG/M
3300021375|Ga0213869_10048750All Organisms → Viruses → Predicted Viral2214Open in IMG/M
3300021375|Ga0213869_10205207Not Available886Open in IMG/M
3300021959|Ga0222716_10651899Not Available566Open in IMG/M
3300022074|Ga0224906_1025385All Organisms → Viruses → Predicted Viral2076Open in IMG/M
3300022074|Ga0224906_1064140All Organisms → Viruses → Predicted Viral1144Open in IMG/M
3300022187|Ga0196899_1011248All Organisms → Viruses → Predicted Viral3520Open in IMG/M
3300022928|Ga0255758_10223568Not Available856Open in IMG/M
3300024346|Ga0244775_10032822All Organisms → Viruses → Predicted Viral4612Open in IMG/M
3300024346|Ga0244775_10572592Not Available918Open in IMG/M
3300025083|Ga0208791_1046738Not Available765Open in IMG/M
3300025098|Ga0208434_1037839All Organisms → Viruses → Predicted Viral1104Open in IMG/M
3300025108|Ga0208793_1150321Not Available616Open in IMG/M
3300025652|Ga0208134_1008977All Organisms → Viruses → Predicted Viral4322Open in IMG/M
3300025674|Ga0208162_1013360All Organisms → Viruses → Predicted Viral3357Open in IMG/M
3300025769|Ga0208767_1169794Not Available769Open in IMG/M
3300025806|Ga0208545_1093068Not Available801Open in IMG/M
3300025870|Ga0209666_1132177All Organisms → Viruses → Predicted Viral1159Open in IMG/M
3300027188|Ga0208921_1014469All Organisms → Viruses → Predicted Viral1235Open in IMG/M
3300027188|Ga0208921_1020328Not Available1018Open in IMG/M
3300027192|Ga0208673_1010384All Organisms → Viruses → Predicted Viral1679Open in IMG/M
3300027416|Ga0207994_1080752Not Available656Open in IMG/M
3300027751|Ga0208304_10340288Not Available519Open in IMG/M
(restricted) 3300027861|Ga0233415_10320285Not Available734Open in IMG/M
3300032212|Ga0316207_10017395All Organisms → Viruses → Predicted Viral4709Open in IMG/M
3300032274|Ga0316203_1018896All Organisms → Viruses → Predicted Viral2033Open in IMG/M
3300032277|Ga0316202_10040940All Organisms → Viruses → Predicted Viral2194Open in IMG/M
3300032277|Ga0316202_10269168Not Available792Open in IMG/M
3300032277|Ga0316202_10292505Not Available758Open in IMG/M
3300032373|Ga0316204_10883512Not Available637Open in IMG/M
3300034375|Ga0348336_166206Not Available634Open in IMG/M
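
The rows in this listing run their columns together, so separating them needs a few assumptions. The Python sketch below assumes every row follows the layout `[(restricted) ]<taxon OID>|<scaffold ID><taxonomy><length>Open in IMG/M`, and that the taxonomy is either `Not Available` or a lineage starting with `All Organisms`. The names `ScaffoldRow` and `parse_row` are hypothetical, and a lineage ending in a digit would be split incorrectly.

```python
import re
from typing import NamedTuple, Optional


class ScaffoldRow(NamedTuple):
    taxon_oid: str
    scaffold: str
    taxonomy: str
    length: int
    restricted: bool


# Assumed layout (not a documented format):
# [(restricted) ]<taxon OID>|<scaffold><taxonomy><length>Open in IMG/M
ROW = re.compile(
    r"^(\(restricted\) )?(\d+)\|(.+?)"
    r"(All Organisms.*?|Not Available)(\d+)Open in IMG/M$"
)


def parse_row(line: str) -> Optional[ScaffoldRow]:
    """Split one fused listing row into its fields, or return None."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    flag, oid, scaffold, taxonomy, length = m.groups()
    return ScaffoldRow(oid, scaffold, taxonomy, int(length), flag is not None)


# Three rows copied verbatim from the listing.
SAMPLE = [
    "3300000116|DelMOSpr2010_c10024711All Organisms → Viruses → Predicted Viral2874Open in IMG/M",
    "3300017730|Ga0181417_1001945All Organisms → cellular organisms → Bacteria → Proteobacteria6024Open in IMG/M",
    "(restricted) 3300027861|Ga0233415_10320285Not Available734Open in IMG/M",
]
rows = [parse_row(s) for s in SAMPLE]

# Example filter: keep scaffolds of at least 2000 bp.
long_scaffolds = [r.scaffold for r in rows if r is not None and r.length >= 2000]
print(long_scaffolds)
```

Returning `None` for unmatched lines keeps the parser safe to run over the whole page, including headings and blank lines.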

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 20.95%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 18.10%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 13.33%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 10.48%
Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 8.57%
Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Marine | 6.67%
Microbial Mat | Environmental → Aquatic → Marine → Coastal → Sediment → Microbial Mat | 5.71%
Estuarine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine | 4.76%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 3.81%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 1.90%
Seawater | Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater | 1.90%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 0.95%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 0.95%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 0.95%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 0.95%
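
Because every habitat shares the prefix Environmental → Aquatic → Marine, the distribution is easier to read when summed at a coarser level of the GOLD path. The sketch below transcribes the percentages from the table above and groups them by the fourth path component. `share_by_level` is a hypothetical helper, and the totals inherit the rounding of the published percentages.

```python
from collections import defaultdict

PREFIX = "Environmental → Aquatic → Marine → "

# (remainder of the GOLD path, % of family members), copied from the table.
HABITATS = [
    ("Strait → Unclassified → Seawater", 20.95),
    ("Coastal → Unclassified → Aqueous", 18.10),
    ("Oceanic → Unclassified → Marine", 13.33),
    ("Coastal → Unclassified → Seawater", 10.48),
    ("Intertidal Zone → Estuary → Estuarine", 8.57),
    ("Neritic Zone → Unclassified → Marine", 6.67),
    ("Coastal → Sediment → Microbial Mat", 5.71),
    ("Unclassified → Unclassified → Estuarine", 4.76),
    ("Intertidal Zone → Salt Marsh → Salt Marsh", 3.81),
    ("Coastal → Unclassified → Freshwater To Marine Saline Gradient", 1.90),
    ("Pelagic → Unclassified → Seawater", 1.90),
    ("Inlet → Unclassified → Seawater", 0.95),
    ("Coastal → Unclassified → Marine", 0.95),
    ("Intertidal Zone → Unclassified → Marine", 0.95),
    ("Unclassified → Unclassified → Estuarine Water", 0.95),
]


def share_by_level(rows, level):
    """Sum percentages grouped by the path component at 0-based index `level`."""
    totals = defaultdict(float)
    for tail, percent in rows:
        parts = (PREFIX + tail).split(" → ")
        totals[parts[level]] += percent
    return {name: round(total, 2) for name, total in totals.items()}


by_subtype = share_by_level(HABITATS, 3)
for name, pct in sorted(by_subtype.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {pct:.2f} %")
```

Grouped this way, coastal samples account for the largest share of the family, ahead of the single Strait category.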




Associated Samples

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300000116Marine microbial communities from Delaware Coast, sample from Delaware MO Spring March 2010EnvironmentalOpen in IMG/M
3300000117Marine microbial communities from Delaware Coast, sample from Delaware MO Winter December 2010EnvironmentalOpen in IMG/M
3300005941Estuarine microbial communities from the Columbia River estuary, USA - metaG S.697EnvironmentalOpen in IMG/M
3300005942Estuarine microbial communities from the Columbia River estuary, USA - metaG S.757EnvironmentalOpen in IMG/M
3300006026Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_D_<0.8_DNAEnvironmentalOpen in IMG/M
3300006752Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaGEnvironmentalOpen in IMG/M
3300006789Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaGEnvironmentalOpen in IMG/M
3300006790Marine viral communities from the Gulf of Mexico - 32_GoM_OMZ_CsCl metaGEnvironmentalOpen in IMG/M
3300006793Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaGEnvironmentalOpen in IMG/M
3300006810Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01EnvironmentalOpen in IMG/M
3300006919Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_21EnvironmentalOpen in IMG/M
3300006922Marine viral communities from the Subarctic Pacific Ocean - 11_ETSP_OMZ_AT15265 metaGEnvironmentalOpen in IMG/M
3300006924Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaGEnvironmentalOpen in IMG/M
3300007229Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_30_<0.8_DNAEnvironmentalOpen in IMG/M
3300007276Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31EnvironmentalOpen in IMG/M
3300007344Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_4EnvironmentalOpen in IMG/M
3300007345Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_30EnvironmentalOpen in IMG/M
3300007539Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaGEnvironmentalOpen in IMG/M
3300007557Estuarine microbial communities from the Columbia River estuary - Ebb tide non-ETM metaG S.715EnvironmentalOpen in IMG/M
3300007627Estuarine microbial communities from the Columbia River estuary - metaG 1546A-02EnvironmentalOpen in IMG/M
3300008012Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_N_<0.8_DNAEnvironmentalOpen in IMG/M
3300008995Estuarine microbial communities from the Columbia River estuary - metaG 1551A-3EnvironmentalOpen in IMG/M
3300009024Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.705EnvironmentalOpen in IMG/M
3300009086Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.713EnvironmentalOpen in IMG/M
3300010300Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_27_0.2_DNAEnvironmentalOpen in IMG/M
3300011258Seawater microbial communities from Japan Sea near Toyama Prefecture, Japan - 2015_1, permeateEnvironmentalOpen in IMG/M
3300017697Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_31_0.2_DNA (version 2)EnvironmentalOpen in IMG/M
3300017709Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 10 SPOT_SRF_2010-04-27EnvironmentalOpen in IMG/M
3300017710Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 26 SPOT_SRF_2011-09-28EnvironmentalOpen in IMG/M
3300017719Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 13 SPOT_SRF_2010-07-21EnvironmentalOpen in IMG/M
3300017729Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 19 SPOT_SRF_2011-01-11EnvironmentalOpen in IMG/M
3300017730Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 40 SPOT_SRF_2013-02-13EnvironmentalOpen in IMG/M
3300017731Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 39 SPOT_SRF_2013-01-16EnvironmentalOpen in IMG/M
3300017742Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 22 SPOT_SRF_2011-05-21EnvironmentalOpen in IMG/M
3300017744Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 20 SPOT_SRF_2011-02-23EnvironmentalOpen in IMG/M
3300017749Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 15 SPOT_SRF_2010-09-15EnvironmentalOpen in IMG/M
3300017752Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 23 SPOT_SRF_2011-06-22EnvironmentalOpen in IMG/M
3300017763Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 33 SPOT_SRF_2012-06-20EnvironmentalOpen in IMG/M
3300017764Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 8 SPOT_SRF_2010-02-11EnvironmentalOpen in IMG/M
3300017772Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 53 SPOT_SRF_2014-04-10EnvironmentalOpen in IMG/M
3300017779Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 18 SPOT_SRF_2010-12-16EnvironmentalOpen in IMG/M
3300017782Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 3 SPOT_SRF_2009-08-19EnvironmentalOpen in IMG/M
3300018416Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011502XT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300018420Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011512CT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300019459Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011511BT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300020169Pelagic subsurface seawater microbial communities from Kabeltonne, Helgoland, North Sea - Helgoland_Spring_Bloom_20160419_1EnvironmentalOpen in IMG/M
3300021335Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO540EnvironmentalOpen in IMG/M
3300021347Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO266EnvironmentalOpen in IMG/M
3300021356Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO245EnvironmentalOpen in IMG/M
3300021373Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO282EnvironmentalOpen in IMG/M
3300021375Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO132EnvironmentalOpen in IMG/M
3300021959Estuarine water microbial communities from San Francisco Bay, California, United States - C33_13DEnvironmentalOpen in IMG/M
3300022074Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 56 SPOT_SRF_2014-09-10 (v2)EnvironmentalOpen in IMG/M
3300022187Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 (v3)EnvironmentalOpen in IMG/M
3300022928Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011513CT metaGEnvironmentalOpen in IMG/M
3300024346Whole water sample coassemblyEnvironmentalOpen in IMG/M
3300025083Marine viral communities from the Subarctic Pacific Ocean - 11_ETSP_OMZ_AT15265 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025098Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025108Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025652Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 (SPAdes)EnvironmentalOpen in IMG/M
3300025674Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG (SPAdes)EnvironmentalOpen in IMG/M
3300025769Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_21 (SPAdes)EnvironmentalOpen in IMG/M
3300025806Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_30_<0.8_DNA (SPAdes)EnvironmentalOpen in IMG/M
3300025870Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_125m_DNA (SPAdes)EnvironmentalOpen in IMG/M
3300027188Estuarine microbial communities from the Columbia River estuary - Ebb tide non-ETM metaG S.709 (SPAdes)EnvironmentalOpen in IMG/M
3300027192Estuarine microbial communities from the Columbia River estuary - Ebb tide non-ETM metaG S.715 (SPAdes)EnvironmentalOpen in IMG/M
3300027416Estuarine microbial communities from the Columbia River estuary, USA - metaG S.757 (SPAdes)EnvironmentalOpen in IMG/M
3300027751Estuarine microbial communities from the Columbia River estuary - Flood tide ETM metaG S.713 (SPAdes)EnvironmentalOpen in IMG/M
3300027861 (restricted)Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_12_MGEnvironmentalOpen in IMG/M
3300032212Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 6-week pyriteEnvironmentalOpen in IMG/M
3300032274Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 6-month pyrrhotite 1EnvironmentalOpen in IMG/M
3300032277Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 3-month pyrrhotiteEnvironmentalOpen in IMG/M
3300032373Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 6-month pyrrhotite 2EnvironmentalOpen in IMG/M
3300034375Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_30 (v4)EnvironmentalOpen in IMG/M





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
DelMOSpr2010_1002471173300000116MarineMNYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGVL*
DelMOSpr2010_1002826233300000116MarineMNYAEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
DelMOSpr2010_1003219073300000116MarineMTYSEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLASKGIL*
DelMOSpr2010_1014693613300000116MarineKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGIL*
DelMOSpr2010_1016294223300000116MarineMNYSEYRKKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVL*
DelMOSpr2010_1017152513300000116MarineMNYSEYRKKLRKLSAKYNEAYKKYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGVIS
DelMOWin2010_10039005103300000117MarineMSYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKMDLRAKHAVHSFDYAVETLTSKGIL*
Ga0070743_1002025683300005941EstuarineMNYAEYRNKLRKLSAKYNESYKQYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGLL*
Ga0070742_1018159523300005942EstuarineMKYSEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGLL*
Ga0075478_1001337423300006026AqueousMNYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGVL*
Ga0098048_114712313300006752MarineRGYIMNYSEYRKKLRKLSAKYNEAYKRLGWGADITRKLRQQKTDLRAKYAVHSFDYAVETLTGKGIL*
Ga0098048_115447833300006752MarineMNYSEYRNKLRMLSAKYNESYKRYGWCADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0098048_121194623300006752MarineMNYSEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
Ga0098054_130371123300006789MarineMNYSEYRKKLRMLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
Ga0098074_106317743300006790MarineMSYSEYRRKLRKLSAKYNEAYKRYGWGADTTRKLRQKKIDLRAKYAVHSFDYAVESLTSKGIL*
Ga0098055_104596463300006793MarineMNYSEYRNKLRKLSAKYNESYKKCGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGIL*
Ga0098055_110021433300006793MarineMNYSEYRKKLRKLSAKYNEAYKRLGWGADITRKLRQQKTDLRAKYAVHSFDYAVETLTGKGIL*
Ga0070754_1003615193300006810AqueousMNYSEYRKKLRKLSAKYNEAYKKYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL*
Ga0070746_1014767243300006919AqueousMSYSEYRKKLRMLSVKYNEAYKRLGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
Ga0070746_1016869513300006919AqueousMSYSEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKMDLRAKYALHSFDYAVESLTSKGIL*
Ga0098045_101087293300006922MarineMNYSEYRNKLRMLTAKYNEAYKRLGWGADITRKLRQQKTDLRAKYAVHSFDYAVETLTGKGIL*
Ga0098045_105945633300006922MarineMNYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGIL*
Ga0098051_111309643300006924MarineMNYSEYRNKLRKLSAKYNESYKRYGWGADTTRNLRQQKIDLRAKYAVHSFDYAVE
Ga0098051_115274733300006924MarineMNYSEYRNKLRKLSAKYNESYKKCGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
Ga0075468_1020452833300007229AqueousRKKLRKLSAKYNESYKRYGWGADTTRKLRQQKMDLRAKYALHSFDYAVESLTSKGVL*
Ga0075468_1020996733300007229AqueousDNNPSNEGFIMSYLEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL*
Ga0070747_108302153300007276AqueousMNYSEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLTSKGCYKINLK*
Ga0070747_116829933300007276AqueousMSYLEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL*
Ga0070745_126102723300007344AqueousMNYSEYRNKLRMLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL*
Ga0070752_111500923300007345AqueousMSYSEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSYDYAVETLTSKGIL*
Ga0070752_118313213300007345AqueousIMTYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLTSKGIL*
Ga0099849_116433833300007539AqueousMTYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL*
Ga0102821_101428983300007557EstuarineMMYSEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKTDLRAKYAIHSFDYAVESLTSKGLL*
Ga0102869_109691613300007627EstuarineMTYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL*
Ga0075480_1062972213300008012AqueousMNYSEYRKKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGVIK*
Ga0102888_104140443300008995EstuarineMKYSEYRNKLRMLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL*
Ga0102811_114216223300009024EstuarineMYSEYRNKLRMLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGIL*
Ga0102812_1032178213300009086EstuarineMTYSEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGVL*
Ga0129351_131280023300010300Freshwater To Marine Saline GradientMSYSEYRKKLRMLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVL*
Ga0151677_108686813300011258MarineMSYSEYRKKLRKLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLSSKGVL*
Ga0180120_1033998113300017697Freshwater To Marine Saline GradientMTYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYALHSFDYAV
Ga0181387_100665483300017709SeawaterMSYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGIL
Ga0181403_100189093300017710SeawaterMSYSEYRNKLRKLSVKYNEAYKQLGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGIL
Ga0181403_101272143300017710SeawaterMSYSEYRNKLRKLTAKYIEAYRKYGWNADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0181390_107027133300017719SeawaterMNYAEYRKKLRVLTVKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0181396_103025533300017729SeawaterMTYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAIHSFDYAVESLTSKGVIS
Ga0181417_1001945123300017730SeawaterMSYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0181416_100930793300017731SeawaterMSYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKMDLRAKYAVHSFDYAVETLTSKGIL
Ga0181416_106849423300017731SeawaterMTYSEYRNKLRKLSAKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0181399_101838323300017742SeawaterMNYAEYRNKLRMLSAKYNEAYKRYGWGADTTRELRQQKIDLRAKYAVHSLDYAVETLTSKGVL
Ga0181399_112983833300017742SeawaterMTYSKYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGVIK
Ga0181397_101135433300017744SeawaterMSYSEYRKKLRMLSVKYNEAYKQLGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGIL
Ga0181397_101706343300017744SeawaterMSYSEYRKKLRMLSVKYNDAYKRLGWGADTTRKLRQQKMDLRAKYAVHSFDYSVEILTGQGIL
Ga0181392_107700723300017749SeawaterMTYSEYRKKLRMLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL
Ga0181400_122908523300017752SeawaterMTYSEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLASKGIL
Ga0181410_102202053300017763SeawaterMSYSEYRKKLRKLSAKYNEAYKRYGWGADTTRQLRQQKMDLRAKYAVHSFDYAVESLTSKGVL
Ga0181385_114551223300017764SeawaterMSYSEYRNKLRKLSAKYNEAYKQYGWGANTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGIL
Ga0181430_107077513300017772SeawaterMNYAEYRNKLRMLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVIK
Ga0181395_1006542103300017779SeawaterMTYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVL
Ga0181395_113914743300017779SeawaterYRGYIMKYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAIHSFDYAVESLTSKGVIS
Ga0181380_101428293300017782SeawaterMSYLEYRKKLRMLTAKYIEAYKKYGWNADTTRKLRQQKIDLRAKYSLHSFDYAVESLTSKGIL
Ga0181553_1032827743300018416Salt MarshMPYSEYRKKLRKLSAKYNEAYKRYGWGADTTRKLRQQKMDLRAKYAVHSFDYAVESLTSKGIL
Ga0181563_1025227563300018420Salt MarshMTYSEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0181562_1055988713300019459Salt MarshMSYSEYRRKLRKLSAKYNEAYKRYGWGADTTRKLRQKKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0206127_105833653300020169SeawaterMNYSEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0206127_130002613300020169SeawaterMNYSEYRNKLRMLSAKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0213867_108998143300021335SeawaterMSYSEYRKKLRMLSVKYNEAYKRLGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0213862_1009410613300021347SeawaterMKYSEYRKKLRMLSAKYNEAYKKYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL
Ga0213858_1000221133300021356SeawaterMTYSEYRKKLRMLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGVL
Ga0213858_1002106483300021356SeawaterMSYSEYRKKLRMLSVKYNEAYKRLGWGADTTRKLRQQKMDLRAKYAVHSFDYAVETLTSKGIL
Ga0213865_1004898753300021373SeawaterMKYSEYRKKLRMLSAKYNESYKKYGWGADTTRKLRQQKTDLRAKYALHSFDYAVESLTSKGVL
Ga0213865_1006242263300021373SeawaterVNYSEYKNKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYALHSFDYAVESLTSKGVL
Ga0213865_1007405953300021373SeawaterMNYSEYRNKLRMLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYALHSFDYAVETLTSKGVL
Ga0213865_1008084823300021373SeawaterMKYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLTSKGVL
Ga0213869_10002393103300021375SeawaterMNYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGVL
Ga0213869_1004875043300021375SeawaterMKYSEYRKKLRKLSAKYNESYKRYGWGADTTRKLRQQKMDLRAKYALHSFDYAVETLTSKGVL
Ga0213869_1020520743300021375SeawaterMNYSEYRNKLRMLSAKYNESYKRYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLTSKGVL
Ga0222716_1065189923300021959Estuarine WaterMTYSEYRKKLRKLSAKYNEAYRKYGWGGDTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGVIS
Ga0224906_102538523300022074SeawaterMTYSEYRNKLRKLTAKYIEAYRKYGWNADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0224906_106414053300022074SeawaterMTYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKMDLRAKYAVHSFDYAVETLTSKGIL
Ga0196899_101124883300022187AqueousMNYSEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGVL
Ga0255758_1022356843300022928Salt MarshEYRRKLRKLSAKYNEAYKRYGWGADTTRKLRQKKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0244775_1003282293300024346EstuarineMNYAEYRNKLRKLSAKYNESYKQYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGLL
Ga0244775_1057259253300024346EstuarineMMYSEYRNKLRMLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGIL
Ga0208791_104673833300025083MarineMNYSEYRKKLRKLSAKYNEAYKRLGWGADITRKLRQQKTDLRAKYAVHSFDYAVETLTGKGIL
Ga0208434_103783933300025098MarineMNYSEYRNKLRMLSAKYNESYKRYGWCADTTRKLRQQKIDLRAKYAVHSFDYAVETLTGKGIL
Ga0208793_115032123300025108MarineMNYSEYRNKLRKLSAKYNESYKKCGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGIL
Ga0208134_100897733300025652AqueousMSYLEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0208162_101336063300025674AqueousMTYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGIL
Ga0208767_116979413300025769AqueousMSYSEYRKKLRMLSVKYNEAYKRLGWGADTTRKLRQQKIDLRAKYAVHSFDYAVETLTSKGI
Ga0208545_109306813300025806AqueousEYRNKLRMLSVKYNEAYKKYGWGADTTRKLRQQKIDLRAKYALHSFDYAVESLTSKGCYKINLK
Ga0209666_113217733300025870MarineMNYAEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGLL
Ga0208921_101446963300027188EstuarineMTYSEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKTDLRAKYAIHSFDYAVESLTSKGLL
Ga0208921_102032843300027188EstuarineMNYAEYRNKLRKLSAKYNESYKQYGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGIL
Ga0208673_101038413300027192EstuarineEYRNKLRKLSAKYNEAYKRHGWGADTTRKLRQQKTDLRAKYAIHSFDYAVESLTSKGLL
Ga0207994_108075213300027416EstuarineMNYAEYRNKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSK
Ga0208304_1034028813300027751EstuarineRMLSAKYNEAYKRYGWGADTTRKLRQQKTDLRAKYALHSFDYAVETLTSKGIL
(restricted) Ga0233415_1032028533300027861SeawaterMNYAEYRNKLRKLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVIS
Ga0316207_1001739553300032212Microbial MatMSYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGIL
Ga0316203_101889643300032274Microbial MatMNYSEYRKKLRKLSAKYNEAYKKYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL
Ga0316202_1004094063300032277Microbial MatMTYSEYRKKLRMLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGVL
Ga0316202_1026916823300032277Microbial MatMNYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKMDLRAKYALHSFDYAVETLTSKGVL
Ga0316202_1029250533300032277Microbial MatMKYSEYRNKLRKLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVL
Ga0316204_1088351243300032373Microbial MatHLCNGGYSSIYRGYIMNYSEYRKKLRKLSAKYNEAYKKYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVESLTSKGIL
Ga0348336_166206_389_5803300034375AqueousMNYSEYRKKLRKLSAKYNESYKRYGWGADTTRKLRQQKTDLRAKYAVHSFDYAVETLTSKGVL
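
The sequence rows above also run their columns together. The Python sketch below separates them under several assumptions: the taxon ID is the last 10 digits before the habitat name, the habitat starts with a capital letter and ends with a lowercase one, and the sequence is the trailing run of uppercase letters. A trailing `*` is treated as a stop marker, as is conventional in translated protein sequences, though the page does not say so. `parse_sequence_row` is a hypothetical name.

```python
import re

# Assumed layout (not a documented format):
# [(restricted) ]<protein ID><10-digit taxon ID><habitat><sequence>[*]
SEQ_ROW = re.compile(
    r"^(?:\(restricted\) )?(\S+?)(\d{10})"
    r"([A-Z][A-Za-z ]*[a-z])([A-Z]+)(\*?)$"
)


def parse_sequence_row(line):
    """Split one fused sequence row into a dict of fields, or return None."""
    m = SEQ_ROW.match(line.strip())
    if m is None:
        return None
    protein_id, taxon_id, habitat, sequence, stop = m.groups()
    return {
        "protein_id": protein_id,
        "taxon_id": taxon_id,
        "habitat": habitat,
        "sequence": sequence,
        "has_stop": stop == "*",
    }


# Two rows copied verbatim from the listing.
first = parse_sequence_row(
    "DelMOSpr2010_1002471173300000116MarineMNYSEYRKKLRMLSAKYNEAYKRYGWGADTTRKLRQQKIDLRAKYAIHSFDYAVETLTSKGVL*"
)
gradient = parse_sequence_row(
    "Ga0129351_131280023300010300Freshwater To Marine Saline GradientMSYSEYRKKLRMLSAKYNESYKKYGWGADTTRKLRQQKIDLRAKYAVHSFDYAVESLTSKGVL*"
)

# Emit a FASTA-style record for each parsed row.
for row in (first, gradient):
    print(f">{row['protein_id']} taxon={row['taxon_id']} habitat={row['habitat']}")
    print(row["sequence"])
```

Rows without a trailing `*` or a leading methionine are plausibly fragments from genes near scaffold ends, which is consistent with the truncation percentage reported in the Quality Assessment.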




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice