NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F092862

Overview

Basic Information
Family ID: F092862
Family Type: Metagenome
Number of Sequences: 107
Average Sequence Length: 38 residues
Representative Sequence: MIEEKCKFCGSTELVYHQYVICDSSCQECGEWQNGETI
Number of Associated Samples: 88
Number of Associated Scaffolds: 107

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 32.08 %
% of genes near scaffold ends (potentially truncated): 37.38 %
% of genes from short scaffolds (< 2000 bps): 74.77 %
Associated GOLD sequencing projects: 75
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.50

Most Common Taxonomy
Group: Unclassified (60.748 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (56.075 % of family members)
Environment Ontology (ENVO): Unclassified (83.178 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline) (79.439 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 21.21%, Coil/Unstructured: 78.79%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.50

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
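
The cutoff in the note above can be written out as a small check. This is a minimal sketch, not NMPFamsDB code: the names `PTM_THRESHOLD` and `model_confidence` are hypothetical, and the "high" label for scores at or above the cutoff is an assumption. Only the 0.7 cutoff and this family's pTM-score of 0.50 come from the page.

```python
# Hypothetical helper illustrating the "pTM < 0.7" rule quoted in the note.
# The cutoff value is taken from the page; everything else is an assumption.
PTM_THRESHOLD = 0.7


def model_confidence(ptm_score: float) -> str:
    """Return "low" when the pTM-score is below the cutoff, otherwise "high"."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return "low" if ptm_score < PTM_THRESHOLD else "high"


if __name__ == "__main__":
    # pTM-score reported for family F092862
    print(model_confidence(0.50))
```

For this family the score of 0.50 falls below the cutoff, which matches the "Low Quality Model" label shown here.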


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 107 Family Scaffolds
PF14743 | DNA_ligase_OB_2 | 40.19
PF04404 | ERF | 3.74
PF13619 | KTSC | 2.80
PF12684 | DUF3799 | 1.87
PF02151 | UVR | 1.87
PF08279 | HTH_11 | 0.93
PF04542 | Sigma70_r2 | 0.93
PF03013 | Pyr_excise | 0.93
PF04466 | Terminase_3 | 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 0.93
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 0.93
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 0.93
COG1783 | Phage terminase large subunit | Mobilome: prophages, transposons [X] | 0.93
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 0.93


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 60.75 %
All Organisms | root | All Organisms | 39.25 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000949|BBAY94_10159369 | Not Available | 611
3300001450|JGI24006J15134_10108972 | Not Available | 979
3300001460|JGI24003J15210_10096152 | All Organisms → cellular organisms → Bacteria | 859
3300002483|JGI25132J35274_1026635 | All Organisms → Viruses → Predicted Viral | 1332
3300002483|JGI25132J35274_1049254 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 913
3300005404|Ga0066856_10139482 | All Organisms → Viruses → Predicted Viral | 1058
3300005432|Ga0066845_10427125 | Not Available | 513
3300005609|Ga0070724_10009378 | All Organisms → Viruses → Predicted Viral | 4740
3300005821|Ga0078746_1044515 | Not Available | 956
3300006735|Ga0098038_1038985 | Not Available | 1748
3300006736|Ga0098033_1066851 | All Organisms → Viruses → Predicted Viral | 1043
3300006736|Ga0098033_1110575 | Not Available | 779
3300006737|Ga0098037_1173443 | Not Available | 716
3300006738|Ga0098035_1140423 | Not Available | 825
3300006738|Ga0098035_1269066 | Not Available | 559
3300006738|Ga0098035_1306124 | Not Available | 517
3300006751|Ga0098040_1070415 | Not Available | 1071
3300006754|Ga0098044_1045570 | All Organisms → cellular organisms → Bacteria | 1882
3300006754|Ga0098044_1100268 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1186
3300006754|Ga0098044_1306213 | Not Available | 607
3300006789|Ga0098054_1014430 | All Organisms → Viruses → Predicted Viral | 3209
3300006789|Ga0098054_1183745 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 765
3300006793|Ga0098055_1288731 | Not Available | 614
3300006923|Ga0098053_1009880 | All Organisms → Viruses → Predicted Viral | 2195
3300006923|Ga0098053_1014400 | Not Available | 1759
3300006928|Ga0098041_1085322 | All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Thorarchaeota → Candidatus Thorarchaeota archaeon | 1019
3300006928|Ga0098041_1215746 | Not Available | 613
3300006929|Ga0098036_1190005 | Not Available | 625
3300006929|Ga0098036_1206040 | Not Available | 597
3300007542|Ga0099846_1098564 | All Organisms → cellular organisms → Bacteria | 1077
3300007954|Ga0105739_1156203 | Not Available | 543
3300007956|Ga0105741_1075383 | Not Available | 825
3300008050|Ga0098052_1152667 | Not Available | 914
3300009104|Ga0117902_1194663 | All Organisms → cellular organisms → Bacteria | 2018
3300009172|Ga0114995_10052707 | All Organisms → Viruses → Predicted Viral | 2319
3300009409|Ga0114993_10157043 | All Organisms → Viruses → Predicted Viral | 1772
3300009445|Ga0115553_1151098 | Not Available | 952
3300009445|Ga0115553_1427133 | Not Available | 501
3300009447|Ga0115560_1129980 | All Organisms → Viruses → Predicted Viral | 1011
3300009488|Ga0114925_10089713 | All Organisms → Viruses → Predicted Viral | 1926
3300009593|Ga0115011_11582516 | Not Available | 582
3300010149|Ga0098049_1021390 | All Organisms → Viruses → Predicted Viral | 2131
3300010149|Ga0098049_1108600 | Not Available | 866
3300010149|Ga0098049_1135134 | Not Available | 765
3300010149|Ga0098049_1202112 | Not Available | 608
3300010150|Ga0098056_1014717 | Not Available | 2830
3300010151|Ga0098061_1272345 | Not Available | 586
3300013099|Ga0164315_11153625 | Not Available | 612
3300013101|Ga0164313_10124509 | Not Available | 2177
3300013101|Ga0164313_10975982 | Not Available | 690
3300013115|Ga0171651_1090050 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 991
3300017705|Ga0181372_1044558 | Not Available | 749
3300017728|Ga0181419_1033650 | Not Available | 1387
3300017739|Ga0181433_1072751 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 853
3300017743|Ga0181402_1164918 | Not Available | 557
3300017757|Ga0181420_1035324 | All Organisms → Viruses → Predicted Viral | 1631
3300017758|Ga0181409_1144742 | Not Available | 696
3300017770|Ga0187217_1043015 | All Organisms → Viruses → Predicted Viral | 1585
3300017772|Ga0181430_1085564 | Not Available | 948
3300019720|Ga0193991_1026207 | Not Available | 682
3300020185|Ga0206131_10100342 | All Organisms → Viruses → Predicted Viral | 1665
3300020187|Ga0206130_10190808 | All Organisms → cellular organisms → Bacteria | 1000
3300020457|Ga0211643_10002396 | Not Available | 11339
3300021068|Ga0206684_1279863 | Not Available | 520
3300021087|Ga0206683_10243838 | Not Available | 931
3300021442|Ga0206685_10003871 | Not Available | 4582
3300022200|Ga0196901_1110282 | Not Available | 950
(restricted) 3300023109|Ga0233432_10040518 | All Organisms → Viruses → Predicted Viral | 3050
(restricted) 3300023210|Ga0233412_10236755 | Not Available | 797
3300024335|Ga0228672_1005773 | All Organisms → Viruses → Predicted Viral | 4448
(restricted) 3300024338|Ga0255043_10016730 | All Organisms → Viruses → Predicted Viral | 1960
3300024432|Ga0209977_10347330 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 708
3300025071|Ga0207896_1002153 | All Organisms → Viruses → Predicted Viral | 3784
3300025084|Ga0208298_1013469 | Not Available | 1941
3300025096|Ga0208011_1100613 | Not Available | 614
3300025098|Ga0208434_1100670 | Not Available | 565
3300025103|Ga0208013_1033416 | All Organisms → Viruses → Predicted Viral | 1459
3300025108|Ga0208793_1170621 | Not Available | 562
3300025109|Ga0208553_1129411 | Not Available | 566
3300025110|Ga0208158_1079232 | Not Available | 784
3300025118|Ga0208790_1064198 | Not Available | 1126
3300025118|Ga0208790_1073601 | Not Available | 1032
3300025118|Ga0208790_1114710 | Not Available | 771
3300025125|Ga0209644_1004418 | All Organisms → Viruses | 2787
3300025128|Ga0208919_1021170 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2445
3300025128|Ga0208919_1164710 | Not Available | 681
3300025132|Ga0209232_1007725 | All Organisms → Viruses → Predicted Viral | 4591
3300025141|Ga0209756_1036637 | Not Available | 2552
3300025141|Ga0209756_1040118 | All Organisms → Viruses → Predicted Viral | 2398
3300025168|Ga0209337_1161352 | Not Available | 957
3300025301|Ga0208450_1079240 | Not Available | 748
3300025626|Ga0209716_1017790 | All Organisms → Viruses → Predicted Viral | 2960
3300026257|Ga0208407_1049861 | Not Available | 1402
3300027752|Ga0209192_10031798 | All Organisms → Viruses → Predicted Viral | 2507
3300027758|Ga0209379_10022326 | All Organisms → Viruses → Predicted Viral | 2634
3300027838|Ga0209089_10498526 | Not Available | 658
(restricted) 3300027856|Ga0255054_10028075 | All Organisms → Viruses → Predicted Viral | 2859
3300027858|Ga0209013_10048456 | All Organisms → Viruses → Predicted Viral | 2993
(restricted) 3300027865|Ga0255052_10064673 | Not Available | 1794
(restricted) 3300027881|Ga0255055_10095927 | Not Available | 1637
3300027906|Ga0209404_10328973 | Not Available | 981
3300029309|Ga0183683_1000056 | Not Available | 63698
3300031566|Ga0307378_10104750 | All Organisms → Viruses → Predicted Viral | 2941
3300031628|Ga0308014_1111201 | Not Available | 633
3300031757|Ga0315328_10760267 | Not Available | 544
3300034374|Ga0348335_028846 | All Organisms → Viruses → Predicted Viral | 2483

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
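
Each scaffold identifier listed under Associated Scaffolds appears to join a numeric prefix and a scaffold name with `|`; the prefix matches the Taxon OID values listed under Associated Samples (for example `3300000949`). The sketch below splits such an identifier under that assumption. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration and are not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple

RESTRICTED_TAG = "(restricted)"  # marker used on this page for restricted datasets


class ScaffoldRef(NamedTuple):
    """Hypothetical container for one scaffold identifier from this page."""
    taxon_oid: str
    scaffold: str
    restricted: bool


def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split "<taxon OID>|<scaffold name>", tolerating a "(restricted)" prefix.

    Assumes the part before "|" is the numeric Taxon OID, as the page suggests.
    """
    text = text.strip()
    restricted = text.startswith(RESTRICTED_TAG)
    if restricted:
        text = text[len(RESTRICTED_TAG):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


if __name__ == "__main__":
    print(parse_scaffold_id("3300000949|BBAY94_10159369"))
    print(parse_scaffold_id("(restricted) 3300023109|Ga0233432_10040518"))
```

Keeping the restricted flag alongside the parsed fields makes it easy to skip datasets that need permission from their corresponding author(s) before use.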



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 56.07%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 7.48%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 4.67%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 3.74%
Pelagic Marine | Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine | 3.74%
Marine Sediment | Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment | 2.80%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 2.80%
Marine Sediment | Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment | 2.80%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 1.87%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 1.87%
Estuary Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water | 1.87%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 1.87%
Seawater | Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater | 1.87%
Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment | 0.93%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 0.93%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 0.93%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 0.93%
Marine | Environmental → Aquatic → Marine → Oil Seeps → Unclassified → Marine | 0.93%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 0.93%
Macroalgal Surface | Host-Associated → Algae → Green Algae → Ectosymbionts → Unclassified → Macroalgal Surface | 0.93%

Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000949 | Macroalgal surface ecosystem from Botany Bay, Sydney, Australia - BBAY94 | Host-Associated
3300001450 | Marine viral communities from the Pacific Ocean - LP-53 | Environmental
3300001460 | Marine viral communities from the Pacific Ocean - LP-28 | Environmental
3300002483 | Marine viral communities from the Pacific Ocean - ETNP_6_30 | Environmental
3300005404 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201406SV205 | Environmental
3300005432 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201310SV78 | Environmental
3300005609 | Marine sediment microbial communities from the Atlantic coast under amendment with organic carbon and nitrate - tdDd00.1 | Environmental
3300005821 | Marine sediment microbial communities from Aarhus Bay station M5, Denmark - 25 cmbsf, PM1 | Environmental
3300006735 | Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaG | Environmental
3300006736 | Marine viral communities from the Subarctic Pacific Ocean - 1_ETSP_OMZ_AT15124 metaG | Environmental
3300006737 | Marine viral communities from the Subarctic Pacific Ocean - 5_ETSP_OMZ_AT15132 metaG | Environmental
3300006738 | Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG | Environmental
3300006751 | Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG | Environmental
3300006754 | Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG | Environmental
3300006789 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG | Environmental
3300006793 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG | Environmental
3300006923 | Marine viral communities from the Subarctic Pacific Ocean - 15B_ETSP_OMZ_AT15312_CsCl metaG | Environmental
3300006928 | Marine viral communities from the Subarctic Pacific Ocean - 8_ETSP_OMZ_AT15162 metaG | Environmental
3300006929 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300007954 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1373B_0.2um | Environmental
3300007956 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1459A_0.2um | Environmental
3300008050 | Marine viral communities from the Subarctic Pacific Ocean - 15_ETSP_OMZ_AT15312 metaG | Environmental
3300009104 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 143m, 2.7-0.2um | Environmental
3300009172 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_154 | Environmental
3300009409 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150 | Environmental
3300009445 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110331 | Environmental
3300009447 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110509 | Environmental
3300009488 | Deep subsurface microbial communities from Indian Ocean to uncover new lineages of life (NeLLi) - Sumatra_00607 metaG | Environmental
3300009593 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT8 Metagenome | Environmental
3300010149 | Marine viral communities from the Subarctic Pacific Ocean - 13B_ETSP_OMZ_AT15268_CsCl metaG | Environmental
3300010150 | Marine viral communities from the Subarctic Pacific Ocean - 17B_ETSP_OMZ_AT15314_CsCl metaG | Environmental
3300010151 | Marine viral communities from the Subarctic Pacific Ocean - 22_ETSP_OMZ_AT15343 metaG | Environmental
3300013099 | Subseafloor sediment microbial communities from Guaymas Basin, Gulf of California, Mexico - Guay6, Core 4569-2, 0-3 cm | Environmental
3300013101 | Subseafloor sediment microbial communities from Guaymas Basin, Gulf of California, Mexico - Guay4, Core 4569-4, 0-3 cm | Environmental
3300013115 | Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, May cruise - 234m, 250-2.7um, replicate a | Environmental
3300017705 | Marine viral communities from the Subarctic Pacific Ocean - Lowphox_08 viral metaG | Environmental
3300017721 | Marine viral communities from the Subarctic Pacific Ocean - Lowphox_09 viral metaG | Environmental
3300017728 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 42 SPOT_SRF_2013-04-24 | Environmental
3300017739 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 56 SPOT_SRF_2014-09-10 | Environmental
3300017743 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 25 SPOT_SRF_2011-08-17 | Environmental
3300017757 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 43 SPOT_SRF_2013-05-22 | Environmental
3300017758 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 32 SPOT_SRF_2012-05-30 | Environmental
3300017770 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 15 SPOT_SRF_2010-09-15 (version 2) | Environmental
3300017772 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 53 SPOT_SRF_2014-04-10 | Environmental
3300019720 | Sediment microbial communities from the Broadkill River, Lewes, Delaware, United States - BLC_2-3_MG | Environmental
3300020185 | Pelagic subsurface seawater microbial communities from Kabeltonne, Helgoland, North Sea - Helgoland_Spring_Bloom_20160517_1 | Environmental
3300020187 | Pelagic subsurface seawater microbial communities from Kabeltonne, Helgoland, North Sea - Helgoland_Spring_Bloom_20160512_1 | Environmental
3300020457 | Marine microbial communities from Tara Oceans - TARA_B100001113 (ERX555941-ERR599014) | Environmental
3300021068 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M2 100m 12015 | Environmental
3300021087 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M2 80m 12015 | Environmental
3300021442 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M2 200m 12015 | Environmental
3300022200 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3) | Environmental
3300023109 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_10_MG | Environmental
3300023210 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_4_MG | Environmental
3300024335 | Seawater microbial communities from Monterey Bay, California, United States - 90D | Environmental
3300024338 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_9 | Environmental
3300024432 | Deep subsurface microbial communities from Indian Ocean to uncover new lineages of life (NeLLi) - Sumatra_00607 metaG (SPAdes) | Environmental
3300025071 | Marine viral communities from the Pacific Ocean - LP-36 (SPAdes) | Environmental
3300025084 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG (SPAdes) | Environmental
3300025096 | Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG (SPAdes) | Environmental
3300025098 | Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG (SPAdes) | Environmental
3300025103 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG (SPAdes) | Environmental
3300025108 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes) | Environmental
3300025109 | Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG (SPAdes) | Environmental
3300025110 | Marine viral communities from the Subarctic Pacific Ocean - 8_ETSP_OMZ_AT15162 metaG (SPAdes) | Environmental
3300025118 | Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG (SPAdes) | Environmental
3300025125 | Marine viral communities from the Pacific Ocean - ETNP_2_1000 (SPAdes) | Environmental
3300025128 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG (SPAdes) | Environmental
3300025132 | Marine viral communities from the Pacific Ocean - ETNP_2_60 (SPAdes) | Environmental
3300025141 | Marine viral communities from the Pacific Ocean - ETNP_6_85 (SPAdes) | Environmental
3300025168 | Marine viral communities from the Pacific Ocean - LP-53 (SPAdes) | Environmental
3300025301 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_908 (SPAdes) | Environmental
3300025626 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_120531 (SPAdes) | Environmental
3300026257 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201406SV69 (SPAdes) | Environmental
3300027752 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_154 (SPAdes) | Environmental
3300027758 | Marine sediment microbial communities from the Atlantic coast under amendment with organic carbon and nitrate - tdDd00.1 (SPAdes) | Environmental
3300027838 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_150 (SPAdes) | Environmental
3300027856 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_23 | Environmental
3300027858 | Oil polluted marine microbial communities from Coal Oil Point, Santa Barbara, California, USA - Sample 2 (SPAdes) | Environmental
3300027865 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_21 | Environmental
3300027881 (restricted) | Seawater microbial communities from Jervis Inlet, British Columbia, Canada - JV7_2_27 | Environmental
3300027906 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT8 Metagenome (SPAdes) | Environmental
3300029309 | Marine viral communities collected during Tara Oceans survey from station TARA_100 - TARA_R100001440 | Environmental
3300031566 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-1 | Environmental
3300031628 | Marine microbial communities from water near the shore, Antarctic Ocean - #229 | Environmental
3300031757 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 200m 32315 | Environmental
3300034374 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 (v4) | Environmental

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
BBAY94_101593692 | 3300000949 | Macroalgal Surface | MNKEKCKFCGSDELVYHQYIINDSSCQECGEWQNGETI*
JGI24006J15134_101089723 | 3300001450 | Marine | MSEEKCKFCDSTELVYHQYVICDSSCQECGKWQNEKTI*
JGI24003J15210_100961523 | 3300001460 | Marine | MNEEKCKFCDSTELVYHQYVICDSSCQECGEWQNEET
JGI25132J35274_10266352 | 3300002483 | Marine | MSKEKCNFCGSTEMVYHQYIICDSICQECGEWQKGEYNYE*
JGI25132J35274_10492541 | 3300002483 | Marine | EIMSKEKCKFCGSTEMVYHQYIICDSMCQECGEWQKGEYNYE*
Ga0066856_101394823 | 3300005404 | Marine | MSKEKCKFCGSTEMVYHQYIICDSMCQECGEWQKGEYNYE*
Ga0066845_104271252 | 3300005432 | Marine | MSKEKCKFCGSTEMVYHQYIIRDSMCQECGEWQKGEYNYE*
Ga0070724_100093783 | 3300005609 | Marine Sediment | MIEEKCKFCGSTELVYHQYVICDSSCQECGEWQNGETI*
Ga0078746_10445152 | 3300005821 | Marine Sediment | MSKEKCKFCGSDELVYHQYVIQDSACQECGEWQNEETI*
Ga0098038_10389854 | 3300006735 | Marine | MEKEKCKFCGSEELVYHQYIICDNSCQECGKWQNGETINK*
Ga0098033_10668512 | 3300006736 | Marine | MENEKCKFCNSTELVYHSYLCDSKCQDCGEWQEGEELNLHLIK*
Ga0098033_11105753 | 3300006736 | Marine | MNKEDCKFCGSEELVYHQYIICDSCCQECGEWQNGEEL*
Ga0098037_11734432 | 3300006737 | Marine | MEETKCKFCGSTELVYHQYTICDSACQECGEWQDGEYIK*
Ga0098035_11404232 | 3300006738 | Marine | MNKEKCKFCGDTALVYHHYICDSKCEECGEWQEGEKYE*
Ga0098035_12690661 | 3300006738 | Marine | EKCKFCDSTALVYHYYICDSTCEECGKWQEGEKI*
Ga0098035_13061243 | 3300006738 | Marine | MENEKCKFCNSTELVYHSYLCDSKCQDCGEWQEGEELNLNLIK
Ga0098040_10704152 | 3300006751 | Marine | MENEKCKFCNSIALVYNYSLCDSTCEECGEWQEGEER*
Ga0098044_10455701 | 3300006754 | Marine | MNKEKCKFCGDTTLVYHYYICDSKCEECGEWQEGEKYE*
Ga0098044_11002683 | 3300006754 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKYEK
Ga0098044_13062133 | 3300006754 | Marine | MIKEKFKFCNSTALVYNYSLCDSKCEECGEWQEGEEIK
Ga0098054_10144304 | 3300006789 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKI*
Ga0098054_11837452 | 3300006789 | Marine | MNKDKCKFCDDTALVYHYYICDSKCEECGEWQEGEKYE*
Ga0098055_12887311 | 3300006793 | Marine | IIMIKEKCKFCNSTALVYNYSLCDSKCEECGEWQEGEEIYVKKIY*
Ga0098053_10098802 | 3300006923 | Marine | MENEKCKFCNSIALVYNYSLCDSTCEECGKWQEGEKI*
Ga0098053_10144005 | 3300006923 | Marine | MNKEKCKFCDSTALVYHYYICDSKCEECGEWQEGEK
Ga0098041_10853224 | 3300006928 | Marine | MNKEKCKFCGSEELVYHQYIICDNSCQECGKWQNGETINK*
Ga0098041_12157463 | 3300006928 | Marine | EKCKFCNSIALVYNYSLCDSKCEECGEWQEGEER*
Ga0098036_11900051 | 3300006929 | Marine | MENEKCTFCNSIALVYNYLLCDSKCEECGEWQEGEER*
Ga0098036_12060402 | 3300006929 | Marine | MNKEKCKFCGDIALVYHYSICDSKCEECGEWQEGEKYE*
Ga0099846_10985643 | 3300007542 | Aqueous | MNEEKCKFCDSTELVYHQYVICDSSCQECGEWQNEETI*
Ga0105739_11562031 | 3300007954 | Estuary Water | TKVRVMIEEKCKFCGNTELVYHQYVICDSSCQECGEWQNGETI*
Ga0105741_10753831 | 3300007956 | Estuary Water | IMIEEKCKFCGSTELVYHQYVICDSSCQECGEWQNGETI*
Ga0098052_11526672 | 3300008050 | Marine | MNKEKCKFCGSKELVYHQYIICDYSCEECGEWQNGEE
Ga0117902_11946636 | 3300009104 | Marine | MENEKCKFCNSIALVYNYSLCDSKCEECGEWQEGEKI*
Ga0114995_100527077 | 3300009172 | Marine | MSEEKCKFCDSTELVYHQYVICDSSCQECGEWQNEETI*
Ga0114993_101570435 | 3300009409 | Marine | MKKCKFCGSKELTYHQYIIMDYSCCECGEWQNGDYINE*
Ga0115553_11510983 | 3300009445 | Pelagic Marine | MSKEKCKFCGSKELVYHQYVICDSSCQECGEWQNGETI*
Ga0115553_14271332 | 3300009445 | Pelagic Marine | MSKEKCKFCGSKELVYHQYVICDSSCQECGEWQNGETI*V
Ga0115560_11299802 | 3300009447 | Pelagic Marine | MSKEKCKFCGSKELVYHQYVICDSSCQECGEWQNG
Ga0114925_100897131 | 3300009488 | Deep Subsurface | MEKVKCKFCGGEELVYHQYIICDHSCEECGEWQNGEYI*
Ga0115011_115825163 | 3300009593 | Marine | MIKEKCKFCNSIALVYNYSLCDSKCEECGEWQEGEEIK*
Ga0098049_10213902 | 3300010149 | Marine | MNKEKCKFCGSTALIYHYYICDSTCEECGKWQEGEKYEKINI*
Ga0098049_11086002 | 3300010149 | Marine | MIKEKCKFCNSTALVYNYSLCDSKCEECGEWQEGEEIK*
Ga0098049_11351342 | 3300010149 | Marine | MNKEKCKFCGDTALVYHYYICDSKCEECGEWQEGEKYE*
Ga0098049_12021124 | 3300010149 | Marine | MIKEKCKFCNSTALVYNYSLCDSKCEECGEWQEGE
Ga0098056_10147174 | 3300010150 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKYEKINI*
Ga0098061_12723452 | 3300010151 | Marine | MIKEKCKFCNSTALVYNYSLCDSKCEECGEWQEGEEIYVKKIY*
Ga0164315_111536253 | 3300013099 | Marine Sediment | IKQLKMENEKCKFCNSTELVYHSYLCDSKCQDCGEWQEGEER*
Ga0164313_101245097 | 3300013101 | Marine Sediment | MIKEKCKFCNSTALVYHHSFHFADSKCEECGEWQEGEEIK*
Ga0164313_109759821 | 3300013101 | Marine Sediment | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDY
Ga0171651_10900505 | 3300013115 | Marine | MENEKCKFCNSIALVYNYSLCDSKCEECGQWQEGEK
Ga0181372_10445582 | 3300017705 | Marine | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYL
Ga0181373_10446524 | 3300017721 | Marine | MEKCKFCGSTELVYHQYVICDSACQECGEWQNGDYITQTKYEH
Ga0181419_10336504 | 3300017728 | Seawater | MIEEKCKFCGSTELVYHQYIICDSSCQDCGEWQNGE
Ga0181433_10727512 | 3300017739 | Seawater | MRYLKKEKMDKCKFCGSTELVYHQYVICDSVCQECGEWQKGEYNYE
Ga0181402_11649182 | 3300017743 | Seawater | CLKHLEIMSKEKCKFCGSTELVYHQYIICDSSCQDCGEWQNGEEI
Ga0181420_10353241 | 3300017757 | Seawater | EIMSKEKCKFCGSTELVYHQYIICDSSCQACGEWQNGEEI
Ga0181409_11447421 | 3300017758 | Seawater | EKCKFCGSTELVYHQYIICDSSCQDCGEWQNGEEI
Ga0187217_10430153 | 3300017770 | Seawater | MNEEKCKFCDSTELVYHQYVIRDSSCQECCEWQNGETI
Ga0181430_10855645 | 3300017772 | Seawater | MENEKCKFCNSIALVYNYSLCDSKCEECGEWQEGEEIK
Ga0193991_10262071 | 3300019720 | Sediment | MNEDKCKFCGSTELVYHQYTICDSSCQECGEWQSGDYI
Ga0206131_101003423 | 3300020185 | Seawater | MNEDKCKFCDSTELVYHQYVICDSSCQECGEWQNEETI
Ga0206130_101908081 | 3300020187 | Seawater | MNKEKCKFCGSDELVYHQYVIQDSACQECGEWQNEETI
Ga0211643_1000239620 | 3300020457 | Marine | MEKEKCKFCGSEELVYHQYIICDNSCQECGKWQNGETINK
Ga0206684_12798633 | 3300021068 | Seawater | MSKCKFCDSTALVYHSYICDSKCEECGEWQEGEDIIV
Ga0206683_102438382 | 3300021087 | Seawater | MNKEKCKFCGDIALVYHYSICDSKCEECGEWQEGEKYE
Ga0206685_1000387110 | 3300021442 | Seawater | MENEKCKFCNSIALVYNYSLCDSKCEECGEWQEGEER
Ga0196901_11102821 | 3300022200 | Aqueous | MNEEKCKFCDSTELVYHQYVICDSSCQECGEWQNEETI
(restricted) Ga0233432_100405183 | 3300023109 | Seawater | MSKCKFCGSKELVYHQYVIRDSSCQECGEWQNGETI
(restricted) Ga0233412_102367553 | 3300023210 | Seawater | EKKCKFCGSTELVYHQYVIRDSSCQECGEWQNGETI
Ga0228672_100577312 | 3300024335 | Seawater | MSKCKFCGSTELVYHQYVICDSSCQECGEWQNGETIRNYE
(restricted) Ga0255043_100167306 | 3300024338 | Seawater | KKCKFCGSTELVYHQYVIRDSSCQECGEWQNGETI
Ga0209977_103473303 | 3300024432 | Deep Subsurface | MEKVKCKFCGGEELVYHQYIICDHSCEECGEWQNGEYI
Ga0207896_10021539 | 3300025071 | Marine | MNEEKCKFCNDTALVYNFYICDSKCEECGEWQEGESYE
Ga0208298_10134696 | 3300025084 | Marine | MIKEKCKFCNSTALVYNYSLCDSKCEECGEWQEGEEIK
Ga0208011_11006132 | 3300025096 | Marine | MNKEKCKFCGDTALVYHYYICDSKCEECGEWQEGEKYE
Ga0208434_11006703 | 3300025098 | Marine | KFCGSTELVYHQYVICDSSCQECGEWQNGETIRNYE
Ga0208013_10334162 | 3300025103 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKI
Ga0208793_11706211 | 3300025108 | Marine | MNKCKFCGSTKLVYHQYIICDSKCQECGEWQNGDYLEDQDLS
Ga0208553_11294112 | 3300025109 | Marine | MNKEECKFCGSEELVYHQYIICDSCCQECGEWQNGEEL
Ga0208158_10792321 | 3300025110 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKYEKINI
Ga0208790_10641983 | 3300025118 | Marine | MNKEKCKFCGSTALVYHYYICDSTCEECGKWQEGEKYE
Ga0208790_10736013 | 3300025118 | Marine | MNKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEDQSL
Ga0208790_11147101 | 3300025118 | Marine | MNKEKCKFCDSTALVYHYYICDSKCEECGEWQEGEKI
Ga0209644_10044187 | 3300025125 | Marine | MSKEKCKFCNSTELIYHYYLCDSKCQDCGEWQEGETI
Ga0208919_10211708 | 3300025128 | Marine | MENEKCTFCNSIALVYNYLLCDSKCEECGEWQEGEER
Ga0208919_11647101 | 3300025128 | Marine | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEGQDL
Ga0209232_100772510 | 3300025132 | Marine | MSKEKCNFCGSTEMVYHQYIICDSICQECGEWQKGEYNYE
Ga0209756_10366375 | 3300025141 | Marine | MNKCKFCDSTALVYHYYICDSKCEECGEWQEGEEL
Ga0209756_10401183 | 3300025141 | Marine | MNKCKFCGSTELVYHQYIICDSSCQDCGEWQNGEEI
Ga0209337_11613522 | 3300025168 | Marine | MSEEKCKFCDSTELVYHQYVICDSSCQECGKWQNEKTIXH
Ga0208450_10792401 | 3300025301 | Deep Ocean | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEDQSLLST
Ga0209716_10177903 | 3300025626 | Pelagic Marine | MSKEKCKFCGSKELVYHQYVICDSSCQECGEWQNGETI
Ga0208407_10498614 | 3300026257 | Marine | MNKEKCKFCDDTTLVYHYYICDSKCEECGEWQEGEKYE
Ga0209192_100317982 | 3300027752 | Marine | MSEEKCKFCDSTELVYHQYVICDSSCQECGEWQNEETI
Ga0209379_100223262 | 3300027758 | Marine Sediment | MIEEKCKFCGSTELVYHQYVICDSSCQECGEWQNGETI
Ga0209089_104985263 | 3300027838 | Marine | MKKCKFCGSKELTYHQYIIMDYSCCECGEWQNGDYINE
(restricted) Ga0255054_100280759 | 3300027856 | Seawater | MNKEKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEDQDLSST
Ga0209013_100484566 | 3300027858 | Marine | MSKCKFCGSKELVYHQYGICDSSCQECGEWQNGETI
(restricted) Ga0255052_100646734 | 3300027865 | Seawater | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEGQDLS
(restricted) Ga0255055_100959275 | 3300027881 | Seawater | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEDQ
Ga0209404_103289734 | 3300027906 | Marine | MIKEKCKFCNSIALVYNYSLCDSKCEECSEWQEGEEIK
Ga0183683_100005660 | 3300029309 | Marine | MEETKCKFCGSTELVYHQYTICDSSCQECGEWQDGEYIK
Ga0307378_101047505 | 3300031566 | Soil | MNEEKCKFCNDTALVYNSYICDSKCEECGEWQEGESYE
Ga0308014_11112013 | 3300031628 | Marine | MNKCKFCGSKELTYHQYVIMDYSCDECGEWQNGDYINE
Ga0315328_107602671 | 3300031757 | Seawater | MSKCKFCGSTELVYHQYIICDSKCQECGEWQNGDYLEDQD
Ga0348335_028846_2376_2483 | 3300034374 | Aqueous | SECKFCGSTELVYHQYVIRDSSCQDCGEWQNGETI


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"