NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F105507



Overview

Basic Information
Family ID: F105507
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 100
Average Sequence Length: 63 residues
Representative Sequence: MAQCVKCNKPYSPARKLLGITVCLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Number of Associated Samples: 79
Number of Associated Scaffolds: 100
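
The representative sequence can be checked against the reported average length with a few lines of Python. This is a minimal sketch, not part of the database: the function name `summarize` and the choice to report cysteine positions are illustrative assumptions; only the sequence string comes from the record above.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block above.
REPRESENTATIVE = "MAQCVKCNKPYSPARKLLGITVCLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT"


def summarize(seq: str) -> dict:
    """Return the length, cysteine count and 1-based cysteine positions.

    Hypothetical helper for illustration; it makes no structural claim.
    """
    counts = Counter(seq)
    positions = [i + 1 for i, residue in enumerate(seq) if residue == "C"]
    return {
        "length": len(seq),
        "cysteines": counts["C"],
        "cysteine_positions": positions,
    }


summary = summarize(REPRESENTATIVE)
print(summary)
```

The length of the representative matches the family's reported average of 63 residues.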

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 92.00 %
% of genes near scaffold ends (potentially truncated): 18.00 %
% of genes from short scaffolds (< 2000 bp): 67.00 %
Associated GOLD sequencing projects: 69
AlphaFold2 3D model prediction: No


Most Common Taxonomy
Group: Unclassified (58.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (32.000 % of family members)
Environment Ontology (ENVO): Unclassified (73.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline) (81.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution:
α-helix: 19.05%
β-sheet: 19.05%
Coil/Unstructured: 61.90%
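
The secondary structure percentages can be converted back into residue counts. The sketch below assumes the prediction was made on a 63-residue sequence (the length of the representative); the page does not state which sequence was used, so treat that as an assumption. The names `SEQ_LENGTH` and `residues_from_percentages` are illustrative.

```python
# Assumed length of the sequence the prediction refers to (the representative
# sequence is 63 residues long; the page does not confirm this is the input).
SEQ_LENGTH = 63

# Percentages as reported in the Structure & Topology section.
PERCENTAGES = {
    "alpha_helix": 19.05,
    "beta_sheet": 19.05,
    "coil_unstructured": 61.90,
}


def residues_from_percentages(percentages: dict, length: int) -> dict:
    """Convert rounded percentages into whole-residue counts."""
    return {name: round(pct / 100 * length) for name, pct in percentages.items()}


residue_counts = residues_from_percentages(PERCENTAGES, SEQ_LENGTH)
print(residue_counts)
print("total residues:", sum(residue_counts.values()))
```

Under that assumption, the three classes add up to the full sequence length, which is consistent with the percentages summing to 100%.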

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF13203 | DUF2201_N | 8.00
PF00004 | AAA | 6.00
PF07728 | AAA_5 | 2.00
PF01870 | Hjc | 1.00
PF08279 | HTH_11 | 1.00
PF03237 | Terminase_6N | 1.00
PF12705 | PDDEXK_1 | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG1591 | Holliday junction resolvase Hjc, archaeal type | Replication, recombination and repair [L] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 58.00 %
All Organisms | root | All Organisms | 42.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000947|BBAY92_10063621 | Not Available | 995
3300000947|BBAY92_10090746 | Not Available | 815
3300001460|JGI24003J15210_10018030 | All Organisms → Viruses → Predicted Viral | 2727
3300001460|JGI24003J15210_10030592 | All Organisms → Viruses → Predicted Viral | 1963
3300001472|JGI24004J15324_10001596 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 9012
3300002242|KVWGV2_10870551 | Not Available | 667
3300004448|Ga0065861_1072882 | Not Available | 680
3300004461|Ga0066223_1088608 | Not Available | 1304
3300004461|Ga0066223_1088609 | Not Available | 739
3300006026|Ga0075478_10123032 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 819
3300006191|Ga0075447_10002362 | Not Available | 8611
3300006352|Ga0075448_10265183 | Not Available | 521
3300006735|Ga0098038_1035262 | All Organisms → Viruses → Predicted Viral | 1854
3300006735|Ga0098038_1037117 | All Organisms → Viruses → Predicted Viral | 1798
3300006735|Ga0098038_1061745 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1338
3300006749|Ga0098042_1076642 | Not Available | 870
3300006751|Ga0098040_1002850 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 6890
3300006810|Ga0070754_10438265 | Not Available | 568
3300006928|Ga0098041_1134168 | Not Available | 798
3300006928|Ga0098041_1141471 | Not Available | 775
3300006929|Ga0098036_1148904 | Not Available | 715
3300006947|Ga0075444_10263915 | Not Available | 674
3300007276|Ga0070747_1189593 | Not Available | 728
3300007539|Ga0099849_1023104 | Not Available | 2698
3300007540|Ga0099847_1004140 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 4944
3300008221|Ga0114916_1052286 | Not Available | 1134
3300008221|Ga0114916_1080981 | All Organisms → Viruses | 823
3300009076|Ga0115550_1018982 | All Organisms → Viruses → Predicted Viral | 3323
3300009172|Ga0114995_10078740 | Not Available | 1856
3300009172|Ga0114995_10539437 | Not Available | 638
3300009193|Ga0115551_1015185 | All Organisms → Viruses → Predicted Viral | 4118
3300009420|Ga0114994_10091269 | Not Available | 2075
3300009428|Ga0114915_1032047 | Not Available | 1791
3300009428|Ga0114915_1055964 | Not Available | 1256
3300009428|Ga0114915_1114927 | Not Available | 788
3300009428|Ga0114915_1181105 | All Organisms → Viruses | 586
3300009481|Ga0114932_10019142 | All Organisms → Viruses → Predicted Viral | 4784
3300009526|Ga0115004_10011971 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 6359
3300009705|Ga0115000_10737050 | Not Available | 607
3300009705|Ga0115000_10958267 | Not Available | 522
3300009785|Ga0115001_10161191 | Not Available | 1460
3300010430|Ga0118733_103340809 | Not Available | 873
3300010883|Ga0133547_10648125 | Not Available | 2102
3300010883|Ga0133547_11138935 | Not Available | 1496
3300011118|Ga0114922_11069752 | All Organisms → Viruses | 654
3300011261|Ga0151661_1331446 | Not Available | 554
3300016771|Ga0182082_1103246 | Not Available | 705
3300017728|Ga0181419_1024507 | All Organisms → Viruses → Predicted Viral | 1676
3300017738|Ga0181428_1065897 | Not Available | 845
3300017951|Ga0181577_10605768 | Not Available | 675
3300020281|Ga0211483_10024656 | All Organisms → Viruses → Predicted Viral | 1991
3300020287|Ga0211471_1007380 | All Organisms → Viruses → Predicted Viral | 1377
3300020339|Ga0211605_1004088 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4448
3300020409|Ga0211472_10012110 | All Organisms → Viruses → Predicted Viral | 3359
3300020431|Ga0211554_10149334 | Not Available | 1152
3300020436|Ga0211708_10007045 | All Organisms → Viruses → Predicted Viral | 4202
3300020452|Ga0211545_10269745 | Not Available | 780
3300020460|Ga0211486_10003770 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 9097
3300020595|Ga0206126_10054714 | All Organisms → Viruses → Predicted Viral | 2147
3300021084|Ga0206678_10403775 | Not Available | 642
3300021368|Ga0213860_10004407 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 5761
3300021958|Ga0222718_10025000 | Not Available | 4076
3300022221|Ga0224506_10083026 | All Organisms → Viruses → Predicted Viral | 1560
3300022934|Ga0255781_10048271 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2534
(restricted) 3300023210|Ga0233412_10136037 | All Organisms → Viruses → Predicted Viral | 1046
3300024344|Ga0209992_10000642 | Not Available | 44242
3300025079|Ga0207890_1013684 | All Organisms → Viruses → Predicted Viral | 1659
3300025086|Ga0208157_1016871 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 2284
3300025108|Ga0208793_1018561 | All Organisms → Viruses → Predicted Viral | 2496
3300025120|Ga0209535_1002347 | Not Available | 12358
3300025120|Ga0209535_1056737 | All Organisms → Viruses → Predicted Viral | 1628
3300025128|Ga0208919_1044413 | All Organisms → Viruses → Predicted Viral | 1546
3300025168|Ga0209337_1017470 | All Organisms → Viruses → Predicted Viral | 4282
3300025168|Ga0209337_1250230 | All Organisms → Viruses | 679
3300025266|Ga0208032_1000418 | Not Available | 24735
3300025266|Ga0208032_1001587 | Not Available | 9618
3300025266|Ga0208032_1006266 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 4121
3300025266|Ga0208032_1018667 | Not Available | 2044
3300025266|Ga0208032_1029394 | Not Available | 1473
3300025276|Ga0208814_1035099 | Not Available | 1560
3300025610|Ga0208149_1098073 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Microgenomates group → Candidatus Woesebacteria → Candidatus Woesebacteria bacterium | 705
3300027668|Ga0209482_1051963 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1492
3300027687|Ga0209710_1010170 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 5409
3300027687|Ga0209710_1043621 | Not Available | 2086
3300027780|Ga0209502_10432100 | Not Available | 531
3300028022|Ga0256382_1110989 | Not Available | 659
3300028125|Ga0256368_1016417 | Not Available | 1289
3300028448|Ga0256383_111882 | Not Available | 702
3300029318|Ga0185543_1040920 | Not Available | 1014
3300029448|Ga0183755_1096847 | Not Available | 590
3300031519|Ga0307488_10000214 | Not Available | 44582
3300031539|Ga0307380_10207385 | All Organisms → Viruses | 1887
3300031565|Ga0307379_11038701 | Not Available | 694
3300031565|Ga0307379_11319805 | Not Available | 588
3300031566|Ga0307378_10527633 | All Organisms → Viruses → Predicted Viral | 1052
3300031628|Ga0308014_1160129 | Not Available | 509
3300031660|Ga0307994_1023365 | Not Available | 2696
3300031676|Ga0302136_1036179 | Not Available | 1754
3300032011|Ga0315316_10152099 | All Organisms → Viruses → Predicted Viral | 1920
3300033742|Ga0314858_116452 | Not Available | 682

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 32.00%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 14.00%
Deep Ocean | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean | 12.00%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 6.00%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 4.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 3.00%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 3.00%
Seawater | Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater | 3.00%
Sea-Ice Brine | Environmental → Aquatic → Marine → Coastal → Unclassified → Sea-Ice Brine | 2.00%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 2.00%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 2.00%
Pelagic Marine | Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine | 2.00%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 2.00%
Deep Subsurface | Environmental → Aquatic → Marine → Volcanic → Unclassified → Deep Subsurface | 2.00%
Macroalgal Surface | Host-Associated → Algae → Green Algae → Ectosymbionts → Unclassified → Macroalgal Surface | 2.00%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 1.00%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.00%
Marine Sediment | Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment | 1.00%
Marine | Environmental → Aquatic → Marine → Coastal → Sediment → Marine | 1.00%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 1.00%
Sackhole Brine | Environmental → Aquatic → Marine → Coastal → Unclassified → Sackhole Brine | 1.00%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 1.00%
Marine Sediment | Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment | 1.00%
Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 1.00%
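
The habitat distribution can be rolled up to a coarser level of the ecosystem path, similar in spirit to choosing a taxonomy level on the page. The sketch below is illustrative only: the function `aggregate` and the idea of truncating each path to a fixed depth are assumptions, not the database's own method. The paths and percentages are copied from the habitat table above.

```python
from collections import defaultdict

# (ecosystem path, % of family members), copied from the habitat table.
HABITAT_ROWS = [
    ("Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine", 32.0),
    ("Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine", 14.0),
    ("Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean", 12.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous", 6.0),
    ("Environmental → Terrestrial → Soil → Clay → Unclassified → Soil", 4.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Marine", 3.0),
    ("Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh", 3.0),
    ("Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater", 3.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Sea-Ice Brine", 2.0),
    ("Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater", 2.0),
    ("Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine", 2.0),
    ("Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine", 2.0),
    ("Environmental → Aquatic → Marine → Strait → Unclassified → Seawater", 2.0),
    ("Environmental → Aquatic → Marine → Volcanic → Unclassified → Deep Subsurface", 2.0),
    ("Host-Associated → Algae → Green Algae → Ectosymbionts → Unclassified → Macroalgal Surface", 2.0),
    ("Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface", 1.0),
    ("Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater", 1.0),
    ("Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment", 1.0),
    ("Environmental → Aquatic → Marine → Coastal → Sediment → Marine", 1.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater", 1.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Sackhole Brine", 1.0),
    ("Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water", 1.0),
    ("Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment", 1.0),
    ("Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment", 1.0),
]


def aggregate(rows, depth):
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals = defaultdict(float)
    for path, pct in rows:
        key = " → ".join(path.split(" → ")[:depth])
        totals[key] += pct
    return dict(totals)


total_pct = sum(pct for _, pct in HABITAT_ROWS)
by_level3 = aggregate(HABITAT_ROWS, 3)
for key, pct in sorted(by_level3.items(), key=lambda item: -item[1]):
    print(f"{pct:6.2f}%  {key}")
```

Truncating to three levels groups the rows into marine, soil and algae-associated samples, which makes the marine dominance of this family easier to see than the 24-row table does.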




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000947 | Macroalgal surface ecosystem from Botany Bay, Sydney, Australia - BBAY92 | Host-Associated
3300001460 | Marine viral communities from the Pacific Ocean - LP-28 | Environmental
3300001472 | Marine viral communities from the Pacific Ocean - LP-32 | Environmental
3300002242 | Marine sediment microbial communities from Kolumbo Volcano mats, Greece - white/grey mat | Environmental
3300004448 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300004461 | Marine viral communities from Newfoundland, Canada BC-2 | Environmental
3300006026 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_D_<0.8_DNA | Environmental
3300006191 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG104-DNA | Environmental
3300006352 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG108-DNA | Environmental
3300006735 | Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaG | Environmental
3300006749 | Marine viral communities from the Subarctic Pacific Ocean - 9_ETSP_OMZ_AT15188 metaG | Environmental
3300006751 | Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG | Environmental
3300006810 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 | Environmental
3300006928 | Marine viral communities from the Subarctic Pacific Ocean - 8_ETSP_OMZ_AT15162 metaG | Environmental
3300006929 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG | Environmental
3300006947 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG017-DNA | Environmental
3300007276 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 | Environmental
3300007539 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG | Environmental
3300007540 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG | Environmental
3300008221 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_66 | Environmental
3300009076 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_100511 | Environmental
3300009172 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_154 | Environmental
3300009193 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110321 | Environmental
3300009420 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB2_152 | Environmental
3300009428 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_55 | Environmental
3300009481 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG | Environmental
3300009526 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_90 | Environmental
3300009705 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_128 | Environmental
3300009785 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB8_130 | Environmental
3300010430 | Marine sediment microbial communities from Gulf of Thailand under amendment with organic carbon and nitrate - JGI co-assembly of 8 samples | Environmental
3300010883 | western Arctic Ocean co-assembly | Environmental
3300011118 | Deep subsurface microbial communities from Aarhus Bay to uncover new lineages of life (NeLLi) - Aarhus_00045 metaG | Environmental
3300011261 | Marine sediment microbial communities from Japan Sea near Toyama Prefecture, Japan - 2015_4, 0.02 | Environmental
3300016771 | Metatranscriptome of coastal salt marsh microbial communities from the Groves Creek Marsh, Georgia, USA - 071412BT metaT (Metagenome Metatranscriptome) | Environmental
3300017728 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 42 SPOT_SRF_2013-04-24 | Environmental
3300017738 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 51 SPOT_SRF_2014-02-12 | Environmental
3300017951 | Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 101413BT metaG (megahit assembly) | Environmental
3300020281 | Marine microbial communities from Tara Oceans - TARA_A100001035 (ERX556022-ERR599116) | Environmental
3300020287 | Marine microbial communities from Tara Oceans - TARA_A100001403 (ERX556018-ERR598969) | Environmental
3300020339 | Marine microbial communities from Tara Oceans - TARA_B100000674 (ERX555929-ERR599080) | Environmental
3300020409 | Marine microbial communities from Tara Oceans - TARA_A100001403 (ERX555912-ERR599106) | Environmental
3300020431 | Marine microbial communities from Tara Oceans - TARA_B100001142 (ERX556101-ERR598983) | Environmental
3300020436 | Marine microbial communities from Tara Oceans - TARA_B100000424 (ERX556009-ERR598984) | Environmental
3300020452 | Marine microbial communities from Tara Oceans - TARA_B100001173 (ERX556054-ERR599078) | Environmental
3300020460 | Marine microbial communities from Tara Oceans - TARA_A100001037 (ERX555931-ERR599097) | Environmental
3300020595 | Pelagic subsurface seawater microbial communities from Kabeltonne, Helgoland, North Sea - Helgoland_Spring_Bloom_20160412_1 | Environmental
3300021084 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 80m 12015 | Environmental
3300021368 | Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO550 | Environmental
3300021958 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_27D | Environmental
3300022221 | Sediment microbial communities from San Francisco Bay, California, United States - SF_Jan12_sed_USGS_8_1 | Environmental
3300022934 | Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 101413BT metaG | Environmental
3300023210 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_4_MG | Environmental
3300024344 | Deep subsurface microbial communities from Kolumbo volcano to uncover new lineages of life (NeLLi) - 2SBTROV12_ACTIVE470 metaG (SPAdes) | Environmental
3300025079 | Marine viral communities from the Pacific Ocean - LP-48 (SPAdes) | Environmental
3300025086 | Marine viral communities from the Subarctic Pacific Ocean - 5_ETSP_OMZ_AT15132 metaG (SPAdes) | Environmental
3300025108 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes) | Environmental
3300025120 | Marine viral communities from the Pacific Ocean - LP-28 (SPAdes) | Environmental
3300025128 | Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG (SPAdes) | Environmental
3300025168 | Marine viral communities from the Pacific Ocean - LP-53 (SPAdes) | Environmental
3300025266 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_66 (SPAdes) | Environmental
3300025276 | Marine viral communities from the Global Malaspina Expedition - Malaspina viral metaG Antarct_55 (SPAdes) | Environmental
3300025610 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_D_<0.8_DNA (SPAdes) | Environmental
3300027668 | Marine microbial communities from the West Antarctic Peninsula - Coastal water metaG104-DNA (SPAdes) | Environmental
3300027687 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_138 (SPAdes) | Environmental
3300027780 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB11_90 (SPAdes) | Environmental
3300028022 | Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 750m | Environmental
3300028125 | Sea-ice brine viral communities from Beaufort Sea near Barrow, Alaska, United States - SB | Environmental
3300028448 | Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 300m | Environmental
3300029318 | Marine giant viral communities collected during Tara Oceans survey from station TARA_038 - TARA_Y100000289 | Environmental
3300029448 | Marine viral communities collected during Tara Oceans survey from station TARA_023 - TARA_E500000082 | Environmental
3300031519 | Sea-ice brine microbial communities from Beaufort Sea near Barrow, Alaska, United States - SB 0.2 | Environmental
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental
3300031565 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-2 | Environmental
3300031566 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-1 | Environmental
3300031628 | Marine microbial communities from water near the shore, Antarctic Ocean - #229 | Environmental
3300031660 | Marine microbial communities from Ellis Fjord, Antarctic Ocean - #261 | Environmental
3300031676 | Marine microbial communities from Western Arctic Ocean, Canada - CBN3_20m | Environmental
3300032011 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 60m 3416 | Environmental
3300033742 | Sea-ice brine viral communities from Beaufort Sea near Barrow, Alaska, United States - 2018 seawater | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
BBAY92_100636211 | 3300000947 | Macroalgal Surface | MANCIICNSSFPTARKQLGMDTCLECGERVAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTT*
BBAY92_100907461 | 3300000947 | Macroalgal Surface | MATCRLCNKPYSPARKRLGIDTCFECGELEAKQVRHTIAPLNKSNYMVVSPDELKQLNPKRTT*
JGI24003J15210_100180301 | 3300001460 | Marine | HDYIRNLYTQENGMARCIQCDKPYSPARYLLGKLTCLTCGEQAARQTKHTVAPLNKSNYMLLLPEELKQLNPKRTT*
JGI24003J15210_100305925 | 3300001460 | Marine | MALCRMCSKPYSPARKRLGIDTCLVCGESAAKQVKHTVAPLNKSNYMLLLPEELKQLNPKRTT*
JGI24004J15324_100015964 | 3300001472 | Marine | MARCIQCDKPYSPARYLLGKLTCLTCGEQAARQTKHTVAPLNKSNYMLLLPEELKQLNPKRTT*
KVWGV2_108705511 | 3300002242 | Marine Sediment | MANCIFCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0065861_10728821 | 3300004448 | Marine | MANCIRCDKYYSEARYQLGRLTCLVCGESAASEVKHTVAPLNKSNYMLLSREELKQLNPKRTT*
Ga0066223_10886084 | 3300004461 | Marine | MTKCVKCNKSYSPARRLLGITVCLLCGELEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0066223_10886092 | 3300004461 | Marine | MPRCVKCGKPYSPARRLLGITVCLLCGELEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0075478_101230322 | 3300006026 | Aqueous | MSQTICTKCKKPYSPERAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLNPKRTT*
Ga0075447_100023625 | 3300006191 | Marine | MAQCVKCNKLYSPARKLLGITVCLLCGEIEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0075448_102651832 | 3300006352 | Marine | MAQCVKCNKPYSPARKLLGITVCLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0098038_10352624 | 3300006735 | Marine | MALCRMCNTSFPTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0098038_10371174 | 3300006735 | Marine | MANCIICNTSFPTARKRLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0098038_10617451 | 3300006735 | Marine | MATCRLCNKPYSPARKRLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSKDELKQLNPKRTT*
Ga0098042_10766421 | 3300006749 | Marine | MANCIICNTSFPTARKQLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0098040_10028501 | 3300006751 | Marine | MASCRLCNKLYSPARKRLGFDTCMQCGQLAAKEIKHTVAPLNKSNYMLLSAEELKQLNPKRTT*
Ga0070754_104382653 | 3300006810 | Aqueous | MANCIKCDKYYSEARYQLGRLTCLVCGESAASEVKHTVAPLNKSNYMLLSREELKQLNPKRTT*
Ga0098041_11341681 | 3300006928 | Marine | MALCRMCNTSFPTARKQLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0098041_11414712 | 3300006928 | Marine | MATCRLCNKPYSPARKSLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSKYELKQLNPKRTT*
Ga0098036_11489042 | 3300006929 | Marine | MATCRLCNKPYSPARKSLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSKDELKQLNPKRTT*
Ga0075444_102639152 | 3300006947 | Marine | MAKCVKCSKLYSSARRLLGITICLVCGESEARSVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0070747_11895931 | 3300007276 | Aqueous | MAYCRMCNKPYSPARQRLGISTCMVCGDAAARQVKHTVAPLNKSNYMLLSTEELKQLNPKRTT*
Ga0099849_10231043 | 3300007539 | Aqueous | MSQTLCNKCKKPYSPARAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLNPKRTT*
Ga0099847_10041406 | 3300007540 | Aqueous | MALCRMCSKPYSPARKQLGIDTCLVCGESAAKQVKHTVAPLNKSNYMLLLPEELKQLNPKRTT*
Ga0114916_10522862 | 3300008221 | Deep Ocean | MAQCVKCNKLYSPARKLLGIDVCLVCGEFAASQVKHTVAPLNKSNYMLMSREDLKQLN
Ga0114916_10809811 | 3300008221 | Deep Ocean | MAQCVKCNKPYSPARKLLGITICLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQ
Ga0115550_10189826 | 3300009076 | Pelagic Marine | MTRCIKCNKPYSPARRLLGRLTCLVCGESAASEVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114995_100787401 | 3300009172 | Marine | MTKCTKCGKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114995_105394372 | 3300009172 | Marine | MAQCVKCNKSYSPARRLLGITICLLCGELEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0115551_10151857 | 3300009193 | Pelagic Marine | MPRCIKCNKPYSPARRLLGRLTCLVCGESAASEVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114994_100912692 | 3300009420 | Marine | MAKCVKCSKLYSSARRLLGITVCLLCGEAEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114915_10320472 | 3300009428 | Deep Ocean | MENDMAQCVKCNKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114915_10559642 | 3300009428 | Deep Ocean | MAQCVKCNKPYSPARKLLGITICLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114915_11149271 | 3300009428 | Deep Ocean | MAKCVKCSKLYSSARRLLGITICLVCGESEARSVKHTVAPLNKSNYMLMSREDLKQ
Ga0114915_11811051 | 3300009428 | Deep Ocean | MAQCVKCNKSYSPARKLLGITVCLICGEQEARAVKHTIAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114932_100191425 | 3300009481 | Deep Subsurface | MALCRMCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT*
Ga0115004_100119711 | 3300009526 | Marine | YSPARKLLGITICLVCGESEARTVKHTIAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0115000_107370501 | 3300009705 | Marine | MAKCVKCSKLYSSARRLLGITVCLLCGEAEARAVKHTVAPLNKSNYMLMSREDLK
Ga0115000_109582671 | 3300009705 | Marine | MTKCTKCGKPYSSARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0115001_101611911 | 3300009785 | Marine | MTKCTKCGKPYSPARRLLGITLCLLCGESEARDVKHTVAPLNKSNYMLMSREDLKKLNPKRTT*
Ga0118733_1033408092 | 3300010430 | Marine Sediment | MTRCVKCNKPYSPARKLLGITVCLLCGEAEARAVKHTIAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0133547_106481254 | 3300010883 | Marine | MAQCVKCNKSYSPARRLLGITICLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0133547_111389353 | 3300010883 | Marine | MNRCVKCNKSYSPARKLLGITICLVCGESEARTVKHTIAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0114922_110697522 | 3300011118 | Deep Subsurface | MAQCVKCNKSYSPARTLLGITICLVCGESEARTVKHTIAPLNKSNYMLMSREDLKQLNPKRTT*
Ga0151661_13314461 | 3300011261 | Marine | MATCRLCNKPYSPARKRLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSPDELKQLNPKRT
Ga0182082_11032461 | 3300016771 | Salt Marsh | MSQTLCNKCKKPYSPARAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLN
Ga0181419_10245071 | 3300017728 | Seawater | MALCRMCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLM
Ga0181428_10658971 | 3300017738 | Seawater | MALCRMCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNP
Ga0181577_106057682 | 3300017951 | Salt Marsh | MSQTLCNKCKKPYSPARAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLNPKRTT
Ga0211483_100246563 | 3300020281 | Marine | MANCIICNTSFPTARKQLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTI
Ga0211471_10073803 | 3300020287 | Marine | MANCIICNTSFPTARKQLGMDTCLECGERVAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTI
Ga0211605_10040884 | 3300020339 | Marine | MANCIICNSSFPTARKQLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTI
Ga0211472_100121101 | 3300020409 | Marine | MANCIICNTPFPTARKQLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTI
Ga0211554_101493342 | 3300020431 | Marine | MANCIICNTSFPTARKRLGMDTCLECGERAAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTT
Ga0211708_100070453 | 3300020436 | Marine | MATCRLCNKPYSPARKRLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSKDELKQLNPKRTT
Ga0211545_102697452 | 3300020452 | Marine | MATCRLCNKPYSPARKQLGIDTCFKCGELEAKQVRHTIAPLNKSNYMVVSPDELKQLNPKRTT
Ga0211486_100037701 | 3300020460 | Marine | MANCIICNTPFPTARKQLGMDTCLECGERVAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTI
Ga0206126_100547145 | 3300020595 | Seawater | MALCRMCSKPYSPARKQLGIDTCLVCGESAAKQVKHTVAPLNKSNYMLLLPEELKQLNPKRTT
Ga0206678_104037751 | 3300021084 | Seawater | MAYCRMCNKPYSPARQRLGILTCMVCGDAAARQVKHTVAPLNKSNYMLLSAEELKQL
Ga0213860_100044072 | 3300021368 | Seawater | MSQTLCNKCKKPYSPARAKLGITTCLTCGEADANKVRHTVAPLNKSNYMLLSPIELKQLNPKRTT
Ga0222718_100250006 | 3300021958 | Estuarine Water | MPRCIKCNKPYSPARRLLGRLTCLVCGESAASEVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0224506_100830263 | 3300022221 | Sediment | MTRCVKCGKPYSPARRLLGRLTCLVCGESAASEVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0255781_100482713 | 3300022934 | Salt Marsh | MPACIKCKKPYSPARAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLNPKRTT
(restricted) Ga0233412_101360371 | 3300023210 | Seawater | MANCIRCDKYYSEARYQLGRLTCLVCGESAASEGKHTVAPLNKSNYMLLSREELKQLNPKRTT
Ga0209992_100006425 | 3300024344 | Deep Subsurface | MALCRMCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT
Ga0207890_10136842 | 3300025079 | Marine | MTRCVKCNKPYSPARRLLGITVCLVCGESEARAVKHTIAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208157_10168714 | 3300025086 | Marine | MALCRMCNTSFPTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT
Ga0208793_10185612 | 3300025108 | Marine | MASCRLCNKLYSPARKRLGFDTCMQCGQLAAKEIKHTVAPLNKSNYMLLSADVPNLDLGDIAMPELKQLNPKRTT
Ga0209535_100234714 | 3300025120 | Marine | MARCIQCDKPYSPARYLLGKLTCLTCGEQAARQTKHTVAPLNKSNYMLLLPEELKQLNPKRTT
Ga0209535_10567371 | 3300025120 | Marine | MALCRMCSKPYSPARKRLGIDTCLVCGESAAKQVKHTVAPLNKSNYMLLLPEELKQLNPKRTT
Ga0208919_10444134 | 3300025128 | Marine | MATCRLCNKPYSPARKSLGIDTCFECGELEAKQVRHTIAPLNKSNYMVLSKDELKQLNPKRTT
Ga0209337_10174702 | 3300025168 | Marine | MAYCRMCNKPYSPARQRLGISTCMVCGDAAARQVKHTVAPLNKSNYMLLSTEELKQLNPKRTT
Ga0209337_12502301 | 3300025168 | Marine | MAQCVKCSKPYSPARRLLGITVCLLCGEAEARAVKHTIAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208032_10004182 | 3300025266 | Deep Ocean | MAQCVKCNKLYSPARKLLGIDVCLVCGEFAASQVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208032_10015872 | 3300025266 | Deep Ocean | MAQCVKCNKLYSPARKLLGITVCLLCGEIEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208032_10062666 | 3300025266 | Deep Ocean | MAQCVKCNKPYSPARKLLGITVCLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208032_10186671 | 3300025266 | Deep Ocean | MAKCVKCSKLYSSARRLLGITICLVCGESEARSVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208032_10293941 | 3300025266 | Deep Ocean | MAQCVKCNKPYSPARKLLGITICLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208814_10350992 | 3300025276 | Deep Ocean | MENDMAQCVKCNKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0208149_10980732 | 3300025610 | Aqueous | MSQTICTKCKKPYSPERAKLGITTCLTCGEADATKVIHTVAPLNKSNYMLLSPIELKQLNPKRTT
Ga0209482_10519634 | 3300027668 | Marine | FGLATQPIIYRLLENGMAQCVKCNKLYSPARKLLGITVCLLCGEIEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0209710_10101709 | 3300027687 | Marine | MAKCVKCSKLYSSARRLLGITVCLLCGEAEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0209710_10436212 | 3300027687 | Marine | MTKCTKCGKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0209502_104321002 | 3300027780 | Marine | MAQCVKCNKSYSPARRLLGITICLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0256382_11109891 | 3300028022 | Seawater | MALCRICNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT
Ga0256368_10164171 | 3300028125 | Sea-Ice Brine | MAQCVKCNKSYSPARRLLGITICLLCGELEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0256383_1118821 | 3300028448 | Seawater | MALCRMCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPK
Ga0185543_10409202 | 3300029318 | Marine | MANCIICNTSFPTARKQLGMDTCLECGERVAKQVKHTIAPLNKSNYMLLMPEELKQLNPKRTT
Ga0183755_10968472 | 3300029448 | Marine | MANCIFCNAPFSTARKRLGMDTCLECGERAAKQVKHTVAPLNKSNYMLLMPEELKQLNPKRTT
Ga0307488_1000021436 | 3300031519 | Sackhole Brine | MANCIKCDKYYSEARYQLGRLTCLVCGESAASEVKHTVAPLNKSNYMLLSREELKQLNPKRTT
Ga0307380_102073852 | 3300031539 | Soil | MAQCVKCNKSYSLARRLLGITVCLLCGELEARAVKHTIAPLNKSNYMLVSREDLKQLNPKRTT
Ga0307379_110387011 | 3300031565 | Soil | MANCIRCDKYYSEARYQLGRLTCLVCGESAASEVKHTVAPLNKSNYMLLSREELKQLNPKRTT
Ga0307379_113198051 | 3300031565 | Soil | NDMAQCVKCNKSYSPARRLLGITVCLLCGELEARAVKHTVAPLNKSNYMLVSREDLKQLNPKRTT
Ga0307378_105276332 | 3300031566 | Soil | MAQCVKCNKSYSLARRLLGITVCLLCGELEARAVKHTVAPLNKSNYMLVSREDLKQLNPKRTT
Ga0308014_11601291 | 3300031628 | Marine | VRCNKPYSPARRKLGITVCLVCGELEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0307994_10233653 | 3300031660 | Marine | MAQCVKCNKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT
Ga0302136_10361791 | 3300031676 | Marine | MTKCTKCGKPYSPARRLLGITVCLLCGESEARAVKHTVAPLNKSNYMLMSREDLKQ
Ga0315316_101520994 | 3300032011 | Seawater | MATCRLCNKPYSPARKQLGIDTCFKCGELEAKQVRHTIAPLNKSNYMVLSKDELKQLNPKRTT
Ga0314858_116452_449_640 | 3300033742 | Sea-Ice Brine | MTRCVKCNKPYSPARKLLGITVCLLCGEAEARAVKHTIAPLNKSNYMLMSREDLKQLNPKRTT
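
Members of equal length can be compared position by position to get a rough sense of how conserved the family is. The sketch below is a simple ungapped identity calculation, not the alignment method used by the database; the function names are illustrative. The two sequences are the family representative and the member from sample 3300006191 (scaffold Ga0075447_10002362), with the trailing `*` stop marker removed.

```python
# Family representative (from the Basic Information block).
REPRESENTATIVE = "MAQCVKCNKPYSPARKLLGITVCLVCGESEARDVKHTVAPLNKSNYMLMSREDLKQLNPKRTT"
# Member from sample 3300006191, trailing '*' removed.
MEMBER = "MAQCVKCNKLYSPARKLLGITVCLLCGEIEARAVKHTVAPLNKSNYMLMSREDLKQLNPKRTT"


def mismatch_positions(a: str, b: str) -> list:
    """Return 1-based positions where two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("ungapped comparison needs sequences of equal length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity over the full length of both sequences."""
    return 100.0 * (len(a) - len(mismatch_positions(a, b))) / len(a)


mismatches = mismatch_positions(REPRESENTATIVE, MEMBER)
identity = percent_identity(REPRESENTATIVE, MEMBER)
print("differing positions:", mismatches)
print(f"identity: {identity:.2f}%")
```

Sequences of different length, such as the truncated members or those with N-terminal extensions, would need a real alignment first; this helper deliberately rejects them.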


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"