NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F087052




Overview

Basic Information
Family ID F087052
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 46 residues
Representative Sequence YLNPSGAGGASREDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRVF
Number of Associated Samples 85
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.09 %
% of genes from short scaffolds (< 2000 bps) 94.55 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
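The percentages in this block can be translated back into member counts. The sketch below is illustrative only: the helper name `implied_count` is hypothetical, and it assumes each percentage was computed against the 110 family sequences and then rounded to two decimals.

```python
# Family size as reported under "Basic Information".
FAMILY_SIZE = 110

# Percentages copied from the "Quality Assessment" block.
reported = {
    "genes near scaffold ends": 99.09,
    "genes from short scaffolds": 94.55,
    "valid RBS motifs": 0.00,
}


def implied_count(percent, total=FAMILY_SIZE):
    """Return the whole number of members implied by a rounded percentage.

    Assumption: the percentage is count / total * 100, rounded for display.
    """
    return round(percent * total / 100)


counts = {label: implied_count(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under these assumptions, 99.09 % corresponds to 109 of the 110 genes and 94.55 % to 104, which suggests that nearly every member of the family may be a truncated fragment.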
Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (74.545 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Strait → Unclassified → Seawater (31.818 % of family members)
Environment Ontology (ENVO) Unclassified (86.364 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (93.636 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.25%, β-sheet: 4.17%, Coil/Unstructured: 64.58%
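The three secondary-structure percentages are consistent with a per-residue prediction over the 48-residue representative sequence. The sketch below checks that arithmetic. The residue counts (15, 2 and 31) are not given on the page; they are assumed values chosen because they reproduce the reported figures, and the page does not confirm that the prediction was run on the representative sequence.

```python
# Representative sequence copied from "Basic Information".
representative = "YLNPSGAGGASREDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRVF"
length = len(representative)

# ASSUMPTION: per-state residue counts inferred from the percentages,
# not reported by the database.
assumed_counts = {
    "alpha-helix": 15,
    "beta-sheet": 2,
    "coil/unstructured": 31,
}

# Percentages as displayed on the page.
reported = {
    "alpha-helix": 31.25,
    "beta-sheet": 4.17,
    "coil/unstructured": 64.58,
}

# Convert each assumed count into a percentage of the sequence length.
computed = {
    state: round(100 * n / length, 2) for state, n in assumed_counts.items()
}

for state in reported:
    print(f"{state}: computed {computed[state]} %, reported {reported[state]} %")
```

The assumed counts add up to the full sequence length, so every residue is assigned exactly one state in this reading.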

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 110 Family Scaffolds
PF00166 | Cpn10 | 21.82
PF04820 | Trp_halogenase | 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 110 Family Scaffolds
COG0234 | Co-chaperonin GroES (HSP10) | Posttranslational modification, protein turnover, chaperones [O] | 21.82



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 74.55 %
All Organisms | root | All Organisms | 25.45 %


Associated Scaffolds


Scaffold | Taxonomy | Length | IMG/M Link
3300002483|JGI25132J35274_1013793All Organisms → cellular organisms → Bacteria1970Open in IMG/M
3300002488|JGI25128J35275_1026335All Organisms → cellular organisms → Bacteria1389Open in IMG/M
3300006029|Ga0075466_1045158All Organisms → cellular organisms → Bacteria1319Open in IMG/M
3300006735|Ga0098038_1046111Not Available1585Open in IMG/M
3300006737|Ga0098037_1080885Not Available1140Open in IMG/M
3300006737|Ga0098037_1089185Not Available1077Open in IMG/M
3300006737|Ga0098037_1102093Not Available993Open in IMG/M
3300006751|Ga0098040_1064457Not Available1126Open in IMG/M
3300006751|Ga0098040_1120723Not Available783Open in IMG/M
3300006753|Ga0098039_1091524Not Available1049Open in IMG/M
3300006753|Ga0098039_1106611Not Available965Open in IMG/M
3300006753|Ga0098039_1112907Not Available934Open in IMG/M
3300006793|Ga0098055_1290951Not Available611Open in IMG/M
3300006923|Ga0098053_1027623Not Available1211Open in IMG/M
3300006929|Ga0098036_1105696Not Available865Open in IMG/M
3300006929|Ga0098036_1146658Not Available721Open in IMG/M
3300006990|Ga0098046_1036195Not Available1190Open in IMG/M
3300007236|Ga0075463_10059599Not Available1233Open in IMG/M
3300008220|Ga0114910_1146686Not Available674Open in IMG/M
3300009370|Ga0118716_1243737Not Available795Open in IMG/M
3300009418|Ga0114908_1052125Not Available1465Open in IMG/M
3300009426|Ga0115547_1136976Not Available788Open in IMG/M
3300009550|Ga0115013_10467985Not Available817Open in IMG/M
3300009753|Ga0123360_1087034Not Available568Open in IMG/M
3300010148|Ga0098043_1036764Not Available1527Open in IMG/M
3300010153|Ga0098059_1021648Not Available2628Open in IMG/M
3300011013|Ga0114934_10188347Not Available960Open in IMG/M
3300011251|Ga0151676_1008009Not Available612Open in IMG/M
3300013674|Ga0117783_101392All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1499Open in IMG/M
3300017706|Ga0181377_1025923All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1246Open in IMG/M
3300017709|Ga0181387_1051884Not Available817Open in IMG/M
3300017710|Ga0181403_1050495Not Available870Open in IMG/M
3300017713|Ga0181391_1037172Not Available1174Open in IMG/M
3300017713|Ga0181391_1050505Not Available982Open in IMG/M
3300017717|Ga0181404_1049452Not Available1061Open in IMG/M
3300017720|Ga0181383_1009129All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria2658Open in IMG/M
3300017720|Ga0181383_1026302Not Available1567Open in IMG/M
3300017725|Ga0181398_1042235Not Available1111Open in IMG/M
3300017726|Ga0181381_1052007Not Available898Open in IMG/M
3300017726|Ga0181381_1061498Not Available813Open in IMG/M
3300017731|Ga0181416_1032829All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1219Open in IMG/M
3300017731|Ga0181416_1056967Not Available923Open in IMG/M
3300017732|Ga0181415_1031028All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1232Open in IMG/M
3300017732|Ga0181415_1071781Not Available782Open in IMG/M
3300017733|Ga0181426_1024066All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1196Open in IMG/M
3300017737|Ga0187218_1078644Not Available802Open in IMG/M
3300017740|Ga0181418_1036180All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1252Open in IMG/M
3300017742|Ga0181399_1042509Not Available1204Open in IMG/M
3300017742|Ga0181399_1044080Not Available1178Open in IMG/M
3300017745|Ga0181427_1055124Not Available981Open in IMG/M
3300017755|Ga0181411_1041484All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1436Open in IMG/M
3300017756|Ga0181382_1112341Not Available731Open in IMG/M
3300017759|Ga0181414_1047534Not Available1150Open in IMG/M
3300017759|Ga0181414_1081021Not Available859Open in IMG/M
3300017759|Ga0181414_1177271Not Available554Open in IMG/M
3300017760|Ga0181408_1049517Not Available1126Open in IMG/M
3300017763|Ga0181410_1057081Not Available1186Open in IMG/M
3300017763|Ga0181410_1059231All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1160Open in IMG/M
3300017764|Ga0181385_1178557Not Available642Open in IMG/M
3300017767|Ga0181406_1064380Not Available1126Open in IMG/M
3300017773|Ga0181386_1062996All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria1181Open in IMG/M
3300017951|Ga0181577_10347354Not Available954Open in IMG/M
3300017956|Ga0181580_10325199Not Available1041Open in IMG/M
3300017968|Ga0181587_10269881Not Available1156Open in IMG/M
3300018416|Ga0181553_10356872Not Available802Open in IMG/M
3300018418|Ga0181567_10431517Not Available869Open in IMG/M
3300018428|Ga0181568_10420419Not Available1074Open in IMG/M
3300019025|Ga0193545_10026457Not Available1135Open in IMG/M
3300020410|Ga0211699_10138859Not Available913Open in IMG/M
3300020411|Ga0211587_10058219All Organisms → Viruses → Predicted Viral1745Open in IMG/M
3300020429|Ga0211581_10213694Not Available782Open in IMG/M
3300020436|Ga0211708_10123846Not Available1020Open in IMG/M
3300021365|Ga0206123_10162186Not Available1017Open in IMG/M
3300022069|Ga0212026_1032634Not Available767Open in IMG/M
3300022074|Ga0224906_1024107Not Available2145Open in IMG/M
3300022074|Ga0224906_1052427All Organisms → Viruses1304Open in IMG/M
3300022074|Ga0224906_1090414Not Available915Open in IMG/M
3300022169|Ga0196903_1004714All Organisms → Viruses → Predicted Viral1788Open in IMG/M
3300023702|Ga0232119_1012314All Organisms → Viruses1322Open in IMG/M
(restricted) 3300024062|Ga0255039_10168580Not Available904Open in IMG/M
3300024344|Ga0209992_10115876Not Available1189Open in IMG/M
3300024359|Ga0228628_1015268All Organisms → Viruses1962Open in IMG/M
3300025083|Ga0208791_1044789Not Available787Open in IMG/M
3300025096|Ga0208011_1025132All Organisms → Viruses1495Open in IMG/M
3300025099|Ga0208669_1041530Not Available1080Open in IMG/M
3300025102|Ga0208666_1056321Not Available1080Open in IMG/M
3300025102|Ga0208666_1084585Not Available810Open in IMG/M
3300025108|Ga0208793_1072486Not Available1009Open in IMG/M
3300025108|Ga0208793_1192570Not Available515Open in IMG/M
3300025127|Ga0209348_1041876All Organisms → Viruses1585Open in IMG/M
3300025127|Ga0209348_1061118All Organisms → Viruses1241Open in IMG/M
3300025127|Ga0209348_1076789Not Available1073Open in IMG/M
3300025127|Ga0209348_1083768Not Available1013Open in IMG/M
3300025127|Ga0209348_1097177Not Available920Open in IMG/M
3300025128|Ga0208919_1037672All Organisms → Viruses1715Open in IMG/M
3300025132|Ga0209232_1085656Not Available1084Open in IMG/M
3300025270|Ga0208813_1019990Not Available1697Open in IMG/M
3300025577|Ga0209304_1021026All Organisms → Viruses2077Open in IMG/M
3300025853|Ga0208645_1176761Not Available781Open in IMG/M
3300025886|Ga0209632_10324663Not Available758Open in IMG/M
3300025890|Ga0209631_10074264All Organisms → Viruses2060Open in IMG/M
3300027742|Ga0209121_10185520Not Available760Open in IMG/M
3300027852|Ga0209345_10391446Not Available821Open in IMG/M
3300028022|Ga0256382_1014754All Organisms → Viruses1571Open in IMG/M
3300028022|Ga0256382_1029920All Organisms → Viruses1208Open in IMG/M
3300028022|Ga0256382_1030725All Organisms → Viruses1195Open in IMG/M
3300028134|Ga0256411_1240301Not Available561Open in IMG/M
3300029319|Ga0183748_1068368Not Available924Open in IMG/M
3300029448|Ga0183755_1017987All Organisms → Viruses2435Open in IMG/M
3300032011|Ga0315316_10608930Not Available912Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
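Each row of the scaffold listing runs its fields together without separators. The sketch below shows one way to split such a row back into fields. The function name `parse_scaffold_row` and the pattern are illustrative assumptions based on the rows shown here (a numeric taxon OID, a `|`, a scaffold name ending in `_<digits>`, a taxonomy string without digits, the length, and the link text); they are not part of NMPFamsDB or IMG/M.

```python
import re

# Assumed row layout:
# [(restricted) ]<taxon OID>|<scaffold name><taxonomy><length>Open in IMG/M
SCAFFOLD_ROW = re.compile(
    r"^(?P<restricted>\(restricted\) )?"
    r"(?P<taxon_oid>\d+)\|"
    r"(?P<scaffold>[A-Za-z0-9]+_\d+)"
    r"(?P<taxonomy>\D+?)"          # taxonomy strings here contain no digits
    r"(?P<length>\d+)"
    r"Open in IMG/M$"
)


def parse_scaffold_row(line):
    """Split one flattened scaffold row into a dict of typed fields."""
    match = SCAFFOLD_ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised scaffold row: {line!r}")
    return {
        "restricted": match.group("restricted") is not None,
        "taxon_oid": match.group("taxon_oid"),
        "scaffold": match.group("scaffold"),
        "taxonomy": match.group("taxonomy"),
        "length": int(match.group("length")),
    }


# Two rows copied from the listing above.
rows = [
    "3300002483|JGI25132J35274_1013793All Organisms → cellular organisms → Bacteria1970Open in IMG/M",
    "(restricted) 3300024062|Ga0255039_10168580Not Available904Open in IMG/M",
]
parsed = [parse_scaffold_row(row) for row in rows]

for record in parsed:
    print(record)
```

The pattern relies on the taxonomy text never containing a digit, which holds for the rows in this family but would need revisiting for lineages whose names include numbers.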




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 31.82%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 31.82%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 6.36%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 5.45%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 4.55%
Seawater | Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater | 3.64%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 2.73%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 2.73%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 1.82%
Pelagic Marine | Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine | 1.82%
Pelagic Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Pelagic Marine | 1.82%
Marine | Environmental → Aquatic → Marine → Oil Seeps → Unclassified → Marine | 1.82%
Deep Subsurface | Environmental → Aquatic → Marine → Volcanic → Unclassified → Deep Subsurface | 1.82%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 0.91%
Coral Tissue | Host-Associated → Invertebrates → Cnidaria → Unclassified → Unclassified → Coral Tissue | 0.91%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type | IMG/M Link
3300002483Marine viral communities from the Pacific Ocean - ETNP_6_30EnvironmentalOpen in IMG/M
3300002488Marine viral communities from the Pacific Ocean - ETNP_2_60EnvironmentalOpen in IMG/M
3300006029Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_20_<0.8_DNAEnvironmentalOpen in IMG/M
3300006735Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaGEnvironmentalOpen in IMG/M
3300006737Marine viral communities from the Subarctic Pacific Ocean - 5_ETSP_OMZ_AT15132 metaGEnvironmentalOpen in IMG/M
3300006751Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaGEnvironmentalOpen in IMG/M
3300006753Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaGEnvironmentalOpen in IMG/M
3300006793Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaGEnvironmentalOpen in IMG/M
3300006923Marine viral communities from the Subarctic Pacific Ocean - 15B_ETSP_OMZ_AT15312_CsCl metaGEnvironmentalOpen in IMG/M
3300006929Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaGEnvironmentalOpen in IMG/M
3300006990Marine viral communities from the Subarctic Pacific Ocean - 11B_ETSP_OMZ_AT15265_CsCl metaGEnvironmentalOpen in IMG/M
3300007236Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_30_>0.8_DNAEnvironmentalOpen in IMG/M
3300008220Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_908EnvironmentalOpen in IMG/M
3300009370Combined Assembly of Gp0127930, Gp0127931EnvironmentalOpen in IMG/M
3300009418Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_s17EnvironmentalOpen in IMG/M
3300009426Pelagic marine microbial communities from North Sea - COGITO_mtgs_100420EnvironmentalOpen in IMG/M
3300009550Marine eukaryotic phytoplankton communities from Atlantic Ocean - South Atlantic ANT15 MetagenomeEnvironmentalOpen in IMG/M
3300009753Marine microbial and viral communities from Louisana Shelf, Gulf of Mexico - GoM_2015_C6C_190_18m (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010148Marine viral communities from the Subarctic Pacific Ocean - 9B_ETSP_OMZ_AT15188_CsCl metaGEnvironmentalOpen in IMG/M
3300010153Marine viral communities from the Subarctic Pacific Ocean - 20_ETSP_OMZ_AT15318 metaGEnvironmentalOpen in IMG/M
3300011013Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 4SBTROV10_white metaGEnvironmentalOpen in IMG/M
3300011251Seawater microbial communities from Japan Sea near Toyama Prefecture, Japan - 2015_1, 0.2EnvironmentalOpen in IMG/M
3300013674Coral viral communities from the Great Barrier Reef, Australia - Pocillopora damicornis (fresh isolate) - PDam_NLN_DNA_SISPAHost-AssociatedOpen in IMG/M
3300017706Marine viral communities from the Subarctic Pacific Ocean - Lowphox_13 viral metaGEnvironmentalOpen in IMG/M
3300017709Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 10 SPOT_SRF_2010-04-27EnvironmentalOpen in IMG/M
3300017710Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 26 SPOT_SRF_2011-09-28EnvironmentalOpen in IMG/M
3300017713Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 14 SPOT_SRF_2010-08-11EnvironmentalOpen in IMG/M
3300017717Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 27 SPOT_SRF_2011-10-25EnvironmentalOpen in IMG/M
3300017720Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 6 SPOT_SRF_2009-12-23EnvironmentalOpen in IMG/M
3300017725Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 21 SPOT_SRF_2011-04-29EnvironmentalOpen in IMG/M
3300017726Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 4 SPOT_SRF_2009-09-24EnvironmentalOpen in IMG/M
3300017731Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 39 SPOT_SRF_2013-01-16EnvironmentalOpen in IMG/M
3300017732Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 38 SPOT_SRF_2012-12-11EnvironmentalOpen in IMG/M
3300017733Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 49 SPOT_SRF_2013-12-23EnvironmentalOpen in IMG/M
3300017737Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 14 SPOT_SRF_2010-08-11 (version 2)EnvironmentalOpen in IMG/M
3300017740Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 41 SPOT_SRF_2013-03-13EnvironmentalOpen in IMG/M
3300017742Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 22 SPOT_SRF_2011-05-21EnvironmentalOpen in IMG/M
3300017745Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 50 SPOT_SRF_2014-01-15EnvironmentalOpen in IMG/M
3300017755Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 34 SPOT_SRF_2012-07-09EnvironmentalOpen in IMG/M
3300017756Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 5 SPOT_SRF_2009-10-22EnvironmentalOpen in IMG/M
3300017759Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 37 SPOT_SRF_2012-11-28EnvironmentalOpen in IMG/M
3300017760Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 31 SPOT_SRF_2012-02-16EnvironmentalOpen in IMG/M
3300017763Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 33 SPOT_SRF_2012-06-20EnvironmentalOpen in IMG/M
3300017764Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 8 SPOT_SRF_2010-02-11EnvironmentalOpen in IMG/M
3300017767Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 29 SPOT_SRF_2011-12-20EnvironmentalOpen in IMG/M
3300017773Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 9 SPOT_SRF_2010-03-24EnvironmentalOpen in IMG/M
3300017951Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 101413BT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300017956Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 071403BT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300017968Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 071409AT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300018416Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011502XT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300018418Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 101403AT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300018428Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 101404AT metaG (megahit assembly)EnvironmentalOpen in IMG/M
3300019025Metatranscriptome of marine prokaryotic communities collected during Tara Oceans survey from station TARA_151 - TARA_B100001565 (ERX1399745-ERR1328126)EnvironmentalOpen in IMG/M
3300020410Marine microbial communities from Tara Oceans - TARA_B100000519 (ERX555959-ERR599148)EnvironmentalOpen in IMG/M
3300020411Marine microbial communities from Tara Oceans - TARA_B100000131 (ERX556098-ERR599130)EnvironmentalOpen in IMG/M
3300020429Marine microbial communities from Tara Oceans - TARA_B100000614 (ERX556134-ERR599032)EnvironmentalOpen in IMG/M
3300020436Marine microbial communities from Tara Oceans - TARA_B100000424 (ERX556009-ERR598984)EnvironmentalOpen in IMG/M
3300021365Pelagic subsurface seawater microbial communities from Kabeltonne, Helgoland, North Sea - Helgoland_Spring_Bloom_20160316_1EnvironmentalOpen in IMG/M
3300022069Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_30 (v2)EnvironmentalOpen in IMG/M
3300022074Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA ? 56 SPOT_SRF_2014-09-10 (v2)EnvironmentalOpen in IMG/M
3300022169Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG (v3)EnvironmentalOpen in IMG/M
3300023702Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - 82R (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024062 (restricted)Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_1EnvironmentalOpen in IMG/M
3300024344Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300024359Seawater microbial communities from Monterey Bay, California, United States - 34DEnvironmentalOpen in IMG/M
3300025083Marine viral communities from the Subarctic Pacific Ocean - 11_ETSP_OMZ_AT15265 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025096Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025099Marine viral communities from the Subarctic Pacific Ocean - 21_ETSP_OMZ_AT15319 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025102Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025108Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025127Marine viral communities from the Pacific Ocean - ETNP_2_30 (SPAdes)EnvironmentalOpen in IMG/M
3300025128Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025132Marine viral communities from the Pacific Ocean - ETNP_2_60 (SPAdes)EnvironmentalOpen in IMG/M
3300025270Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG DeepMed_904 (SPAdes)EnvironmentalOpen in IMG/M
3300025577Pelagic marine microbial communities from North Sea - COGITO_mtgs_100423 (SPAdes)EnvironmentalOpen in IMG/M
3300025853Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 (SPAdes)EnvironmentalOpen in IMG/M
3300025886Pelagic Microbial community sample from North Sea - COGITO 998_met_10 (SPAdes)EnvironmentalOpen in IMG/M
3300025890Pelagic Microbial community sample from North Sea - COGITO 998_met_08 (SPAdes)EnvironmentalOpen in IMG/M
3300027742Oil polluted marine microbial communities from Coal Oil Point, Santa Barbara, California, USA - Sample 3 (SPAdes)EnvironmentalOpen in IMG/M
3300027852Oil polluted marine microbial communities from Coal Oil Point, Santa Barbara, California, USA - Sample 7 (SPAdes)EnvironmentalOpen in IMG/M
3300028022Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 750mEnvironmentalOpen in IMG/M
3300028134Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - WCR_12 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300029319Marine viral communities collected during Tara Oceans survey from station TARA_032 - TARA_A100001516EnvironmentalOpen in IMG/M
3300029448Marine viral communities collected during Tara Oceans survey from station TARA_023 - TARA_E500000082EnvironmentalOpen in IMG/M
3300032011Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 60m 3416EnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25132J35274_101379333300002483MarineMFLVSSKPTLSALIHLCLKPLGAGGASREDGGTHTFGLSDLDLVWFAREVVFYHLISYA*
JGI25128J35275_102633513300002488MarinePSGAGGASREDGGTHTFGLSDFDRVWLAREVLLFSFSITPPSC*
Ga0075466_104515833300006029AqueousHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFSFFSILRSFLRVF*
Ga0098038_104611113300006735MarineAGGASRDEGGTHTFGLSDFDRVWLARDVLFSFSILRSSRVF*
Ga0098037_108088533300006737MarineSGAGGASRDEGGLYTFGLSVFDRVTAAREVFSFVILCSSREF*
Ga0098037_108918523300006737MarineAGGASREDGGTHTLGLSDFDRVWFAREVFLSFSMLRSFRVF*
Ga0098037_110209323300006737MarineAGGASREDGGTHTLGLSDFVRVWLAREVFLSFSILRSFRVF*
Ga0098040_106445713300006751MarineSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLSFSILRLLRDF*
Ga0098040_112072323300006751MarineSRDDGGTHTFGLSDFDRVWFARDLSFVSILRSSRVF*
Ga0098039_109152423300006753MarineLVSWKPTLSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLSFSILRLLRDF*
Ga0098039_110661123300006753MarineSGAGGASRDDGGTHTFGLSDFDRVWFAREVLLFFSITPPSCV*
Ga0098039_111290713300006753MarineSGAGGASRDDGGTHTFGLSDFDRVWFAREVLLFFSITPPS*
Ga0098055_129095123300006793MarineSREDGGTHTFGLSDFDRVWFARDLSFVSILRSSRVF*
Ga0098053_102762313300006923MarineLSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLSFSILRLLRDF*
Ga0098036_110569613300006929MarineGASRDDGGTHTLGLSDFDRVWFAREVFLFSFSILRSFRVF*
Ga0098036_114665813300006929MarineGAGGASREDGGTHTLGLSDFDRVLFAREVFLSFVMLRSFRVF*
Ga0098046_103619523300006990MarineTSALIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVLFSFSILRSFRVF*
Ga0075463_1005959933300007236AqueousLSALIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWFAREVFFSFSILRSFRGF*
Ga0114910_114668623300008220Deep OceanALIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWFAREVLFSFSILRSFRVF*
Ga0118716_124373723300009370MarineLSDRNHLCLNPSGAGGASREDGGAQTFGLSVLDLVELAREVSFSFFFICS*
Ga0114908_105212533300009418Deep OceanAGGASREDGGTHTLGLSDFDRVWLAREVFLFSFSILRSFRVF*
Ga0115547_113697623300009426Pelagic MarineYLNPSGAGGASKDDGGTHTFGLSDLDLDWFAREVFLFSFSICLYLLRDF*
Ga0115013_1046798523300009550MarineLIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWFAREVFLSFSILRSFRVF*
Ga0123360_108703423300009753MarineNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFLSFSMLRSFRVF*
Ga0098043_103676433300010148MarineGAGGASREDGGTHTLGLSDFDRVWFAREVLFSFSMLRSFRVF*
Ga0098059_102164853300010153MarineNPSGAGGASREDGGTHTFGLSDFDRVWFARDLSFVSILRSSRVF*
Ga0114934_1018834723300011013Deep SubsurfaceYLNPSGAGGASREDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRVF*
Ga0151676_100800923300011251MarineLNPSGAGGASREDGGTHTFGLSDFDRVWLAREVFFSSFVILRSFRVF*
Ga0117783_10139233300013674Coral TissueSADIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWFAREVLFSFSMLRSFRVF*
Ga0181377_102592333300017706MarineAGGASRDDGGTHTFGLSDFDRVWLARDVFVSFSILRSSRVF
Ga0181387_105188413300017709SeawaterTSALIHLYLNPSGAGGASRDDGGTHTLGRSDFDRVWLAREVFLFSFSILRSFRVF
Ga0181403_105049513300017710SeawaterAGGASREDGGTHTLGLSDFVRVWLAREVFLSFSILRSFRVF
Ga0181391_103717223300017713SeawaterTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181391_105050523300017713SeawaterIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFFSSFVMLRSFRVF
Ga0181404_104945213300017717SeawaterGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181383_100912913300017720SeawaterTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFVSFSILRSSRVF
Ga0181383_102630243300017720SeawaterLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFARDVFFSFSILRSSRVF
Ga0181398_104223523300017725SeawaterSGAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181381_105200723300017726SeawaterTTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFLFSFSILRSFRGF
Ga0181381_106149823300017726SeawaterAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181416_103282933300017731SeawaterIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRDF
Ga0181416_105696713300017731SeawaterLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181415_103102833300017732SeawaterSRDDGGTHTFGLSDFDRVWLARDVFVSFSILRSSRVF
Ga0181415_107178123300017732SeawaterAGGASREDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRDF
Ga0181426_102406613300017733SeawaterEDGGTHTLGRSDFDRVWFAREVFLFSFSILRSFRGF
Ga0187218_107864423300017737SeawaterASRDDGGTHTLGLSDFDRVSFAREVFSFFSMLRSFRDF
Ga0181418_103618013300017740SeawaterYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRDF
Ga0181399_104250913300017742SeawaterALIHLYLNPSGAGGASREDGGTHTLGLSDFVRVWLAREVFLSFSILRSFRVF
Ga0181399_104408023300017742SeawaterPTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181427_105512423300017745SeawaterASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181411_104148433300017755SeawaterLNPSGAGGASRDDGGTHTLGLSDFDRVSFAREVFSFFSMLRSFRDF
Ga0181382_111234123300017756SeawaterGAGGASREDGGTHTLGRSDFDRVWFAREVFLFSFSILRSFRGF
Ga0181414_104753413300017759SeawaterLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFVSFSILRSSRVF
Ga0181414_108102123300017759SeawaterLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0181414_117727123300017759SeawaterASREDGGTHTLGLSDFDRVWLAREVFLSFSILRSFRVF
Ga0181408_104951723300017760SeawaterSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLARDVFVSFSILRSSRVF
Ga0181410_105708113300017763SeawaterALIHLYLNPSGAGGASRDDGGTHTLGLSDFDRVSFAREVFLSFSILRSFRVF
Ga0181410_105923133300017763SeawaterSREDGGTHTLGLSDFDRVWLAREVFFSSFVILRSFRVF
Ga0181385_117855713300017764SeawaterGGASREDGGTHTFGLSDFDRVWFAREVLLSCFFIRLYLLREF
Ga0181406_106438023300017767SeawaterAGGASREDGGTHTLGLSDFDRVWLAREVFFSFSMLRSFRGF
Ga0181386_106299633300017773SeawaterRDDGGTHTFGLSDFDRVWFAREVFLFSFSILRSFRDF
Ga0181577_1034735423300017951Salt MarshNPSGAGGASREDGGTHTFGLSDLDRVSFAREVLFSFSMLRSFRVF
Ga0181580_1032519913300017956Salt MarshALIHLYLNPSGAGGASREDGGTHTFGRSDFDRVWSAREVLFSFSMLRSFLRVF
Ga0181587_1026988123300017968Salt MarshSALIHLYLNPSGAGGASREDGGTHTFGLSDFDRVWFAREVLFSFSMLRSFLRVF
Ga0181553_1035687213300018416Salt MarshNPSGAGGASREDGGTHTFGLSDFDRVWSAREVLFSFSMLRSFLRVF
Ga0181567_1043151723300018418Salt MarshASREDGGTHTFGLSDLDRVSFAREVLFSFSMLRSFRVF
Ga0181568_1042041923300018428Salt MarshYLNPSGAGGASREDGGTHTFGLSDLDRVSFAREVLFSFSMLRSFRVF
Ga0193545_1002645723300019025MarineAGGASREDGGTHTFGLSDFDRVWFAREVLFSFSILRSFRVF
Ga0211699_1013885923300020410MarineLIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFLSFSMLRSFRVF
Ga0211587_1005821953300020411MarineASREDGGTHTLGLSDFDRVWFAREVFLSFSMLRSFRVF
Ga0211581_1021369423300020429MarineREDGGTHTLGLSDFDRVWFAREVLFSFSMLRSFRVF
Ga0211708_1012384613300020436MarineGASREDGGTHTLGLSDFDRVWFAREVFLSFTMLRSFRVF
Ga0206123_1016218623300021365SeawaterPSGAGGASREDGGTHTLGLSDFDRVWFAREVFFSFSILRSFRGF
Ga0212026_103263423300022069AqueousAGGASREDGGTHTFGLSDFDRVWSAREVLFSFSMLRSFLRVF
Ga0224906_102410713300022074SeawaterTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFSFFSILRSFLRVF
Ga0224906_105242713300022074SeawaterLYLNPSGAGGASRDDGGTHTLGLSDFDRVWFAREVFLLFSILRSFRVF
Ga0224906_109041413300022074SeawaterREDGGTHTLVLSDFDRVWLAREVFSFFSILRSFRVF
Ga0196903_100471413300022169AqueousALIHLYLNPSGAGGASREDGGTHTFGLSDFDRVWFAREVLFSFSMLRSFLRVF
Ga0232119_101231413300023702SeawaterTSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFLFSFSILRSFRGF
(restricted) Ga0255039_1016858023300024062SeawaterGGASRDDGGTHTFGRSDFDRVWFAREVFLFSFSILRSFRGF
Ga0209992_1011587613300024344Deep SubsurfaceLIHLYLNPSGAGGASRDDGGTHTLGLSDFDRVSFAREVFLFSFSMLRSFRVF
Ga0228628_101526813300024359SeawaterGGASRDDGGTHTFGLSDFDRVWLAREVFLFSFSILRSFRGF
Ga0208791_104478913300025083MarineLNPSGAGGASREDGGTHTLGLSDFDRVLFAREVFLSFVMLRSFRVF
Ga0208011_102513233300025096MarineASRDDGGTHTFGLSDFDRVWFAREVFLSFSILRLLRDF
Ga0208669_104153013300025099MarineLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFFSSFVILRSFRVF
Ga0208666_105632123300025102MarineALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFSFFSILRSFLRVF
Ga0208666_108458513300025102MarineTLSALIHLYLNPSGAGGASRDDGGTHTLGLSDFDRVWFAREVFLLFSILRSFRVF
Ga0208793_107248613300025108MarineLVSWKPTLSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVFLSFSILRLLRD
Ga0208793_119257023300025108MarineEDGGTHTLGLSDFDRVWFAREVLFSFSILRSFREF
Ga0209348_104187613300025127MarineLSALIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFLSFSILRSFRVF
Ga0209348_106111833300025127MarineHLYLNPSGAGGASRDDGGTHTLGLSDFDRVWFAREVFFSSFFMLRSFRVF
Ga0209348_107678913300025127MarineHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFLSFSMLRSFRVF
Ga0209348_108376823300025127MarineHLYLNPSGAGGASREDGGTHTLGLSDFDRVSFAREVLFSFSMLRSFRVF
Ga0209348_109717713300025127MarineTLSALIHLYLNPSGAGGASRDDGGTHTLGLSDFDRVWFAREVFLSFSILRSFRVF
Ga0208919_103767243300025128MarineLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFFSSFVMLRSFRVF
Ga0209232_108565613300025132MarinePSGAGGASREDGGTHTLGLSDFDRVWFAREVFFWSFFMLRSFRVF
Ga0208813_101999043300025270Deep OceanLSALIHLYLNPSGAGGASRDDGGTHTFGLSDFDRVWFAREVLLFFSITPPSCV
Ga0209304_102102663300025577Pelagic MarineSALIHLYLNPSGAGGASREDGGTHTLGRSDFDRVWFAREVFLFSFSILRSFLRVF
Ga0208645_117676113300025853AqueousLYLNPSGAGGASREDGGTHTLGRSDFDRVWFAREVFLFSFSILRSFRGF
Ga0209632_1032466313300025886Pelagic MarineREDGGTHTLGRSDFDRVWFAREVFLFSFSILRSFRGF
Ga0209631_1007426413300025890Pelagic MarineALIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWFAREVFFSFSILRSFRGF
Ga0209121_1018552013300027742MarineLNPSGAGGASRDDGGTHTFGLSDFDRVWLAREVFLFSFSILRSFRGF
Ga0209345_1039144613300027852MarineLIHLYLNPSGAGGASREDGGTHTLGLSDFDRVWLAREVFFSSFVMLRSFRVF
Ga0256382_101475413300028022SeawaterTGGASKDDGGTQTFGLSDFDLDWFAREVFLFSFSIRLYLLRDV
Ga0256382_102992013300028022SeawaterPTLSALIHLYLNPSGAGGASREDGGTHTFGLSDFDRVWLAREVFLFSFSILRSFRVF
Ga0256382_103072533300028022SeawaterPSGAGGASRDEGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
Ga0256411_124030123300028134SeawaterRDDGGTHTFGLSDFDRVWLAREVFLFSFSIFRSFRGF
Ga0183748_106836813300029319MarineASREDGGTHTLGLSDFDRVWFAREVLFSFSMLRSFRVF
Ga0183755_101798763300029448MarineWKPTTSALIHLYLNPSGAGGASKDDGGTQTFGLSDFDLDWFAREVFLFSFSIRLYLLRDV
Ga0315316_1060893023300032011SeawaterGGASRDDGGTHTFGLSDFDRVWLARDVFFSFSILRSSRVF
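The sequence rows above also run their fields together: protein ID, 10-digit sample taxon ID, habitat and amino-acid sequence follow one another with no delimiter. The sketch below separates them. It is an illustration under stated assumptions, not an NMPFamsDB tool: `parse_sequence_row` is a hypothetical name, the habitat list is limited to the labels that occur in this family, every taxon ID is assumed to be exactly 10 digits, and a trailing `*` is treated as part of the sequence (it most likely marks a stop codon).

```python
import re

# Habitat labels that occur in this family's sequence listing.
HABITATS = [
    "Pelagic Marine", "Deep Subsurface", "Deep Ocean", "Coral Tissue",
    "Salt Marsh", "Seawater", "Aqueous", "Marine",
]

# Assumed row layout:
# [(restricted) ]<protein ID><10-digit taxon ID><habitat><sequence>[*]
SEQUENCE_ROW = re.compile(
    r"^(?P<restricted>\(restricted\) )?"
    r"(?P<protein_id>[A-Za-z0-9]+_\d+?)"   # lazy, so the taxon ID is not absorbed
    r"(?P<taxon_oid>\d{10})"
    r"(?P<habitat>" + "|".join(re.escape(h) for h in HABITATS) + r")"
    r"(?P<sequence>[A-Z]+\*?)$"
)


def parse_sequence_row(line):
    """Split one flattened sequence row into its four fields."""
    match = SEQUENCE_ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised sequence row: {line!r}")
    fields = match.groupdict()
    fields["restricted"] = fields["restricted"] is not None
    return fields


# Two rows copied from the listing above.
rows = [
    "JGI25132J35274_101379333300002483MarineMFLVSSKPTLSALIHLCLKPLGAGGASREDGGTHTFGLSDLDLVWFAREVVFYHLISYA*",
    "(restricted) Ga0255039_1016858023300024062SeawaterGGASRDDGGTHTFGRSDFDRVWFAREVFLFSFSILRSFRGF",
]
parsed = [parse_sequence_row(row) for row in rows]

for record in parsed:
    print(record["protein_id"], record["taxon_oid"], record["habitat"])
```

Because the habitat label must directly follow the taxon ID, the lazy match on the protein ID resolves to the last 10 digits before the habitat, so the remaining digits stay with the protein ID.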




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice