NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F073654

Overview

Basic Information
Family ID F073654
Family Type Metagenome / Metatranscriptome
Number of Sequences 120
Average Sequence Length 64 residues
Representative Sequence MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA
Number of Associated Samples 98
Number of Associated Scaffolds 120

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 57.14 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 2.50 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63

Hidden Markov Model
[Figure: HMM logo, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (95.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine
(40.833 % of family members)
Environment Ontology (ENVO) Unclassified
(85.833 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(93.333 % of family members)




Multiple Sequence Alignments

[Interactive alignment viewer]

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 30.11%, β-sheet: 25.81%, Coil/Unstructured: 44.09%

Predicted 3D Structure

[Figure: 3D structure model coloured by per-residue confidence (pLDDT), shown in PDBe Molstar]
pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
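
The low-confidence rule in the note above is a simple threshold on the pTM score. A minimal Python sketch, assuming only the 0.7 cut-off stated in the note; the function name and the label returned for scores at or above the cut-off are hypothetical:

```python
def classify_model(ptm: float, low_threshold: float = 0.7) -> str:
    """Label a predicted model by its pTM score."""
    # The 0.7 cut-off comes from the note above. The label used for scores
    # at or above the cut-off is a hypothetical name chosen for this sketch.
    if ptm < low_threshold:
        return "low confidence"
    return "not low confidence"


# pTM-score reported for family F073654
print(classify_model(0.63))
```

With the reported pTM-score of 0.63, the family falls below the cut-off, which matches the "Low Quality Model" label on this page.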



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 120 Family Scaffolds
PF06147 | DUF968 | 46.67
PF03592 | Terminase_2 | 10.00
PF03237 | Terminase_6N | 3.33
PF16786 | RecA_dep_nuc | 2.50
PF12236 | Head-tail_con | 2.50
PF12844 | HTH_19 | 0.83
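
The frequencies in the Pfam table are percentages of the 120 family scaffolds. A short Python sketch that converts them back to approximate scaffold counts, assuming each percentage is a count divided by 120 and rounded to two decimals (the page does not state this explicitly; the names below are hypothetical):

```python
FAMILY_SCAFFOLDS = 120  # "120 Family Scaffolds" in the table header

# Percent frequencies copied from the Pfam table above
pfam_frequency_pct = {
    "PF06147": 46.67,
    "PF03592": 10.00,
    "PF03237": 3.33,
    "PF16786": 2.50,
    "PF12236": 2.50,
    "PF12844": 0.83,
}


def approx_scaffold_count(pct: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Nearest whole number of scaffolds for a percent frequency (assumed count/total)."""
    return round(pct / 100 * total)


pfam_counts = {pfam: approx_scaffold_count(pct) for pfam, pct in pfam_frequency_pct.items()}
for pfam, count in pfam_counts.items():
    print(pfam, count)
```

Under that assumption, DUF968 (46.67 %) corresponds to 56 of the 120 scaffolds.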

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 120 Family Scaffolds
COG3728 | Phage terminase, small subunit | Mobilome: prophages, transposons [X] | 10.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 95.00 %
All Organisms | root | All Organisms | 5.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300005239|Ga0073579_1191420 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium TMED228 | 35504
3300005404|Ga0066856_10019853 | All Organisms → cellular organisms → Bacteria | 2903
3300006916|Ga0070750_10003017 | All Organisms → cellular organisms → Bacteria | 9366
3300025052|Ga0207906_1004906 | Not Available | 1972
3300025078|Ga0208668_1001688 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium TMED228 | 5676
3300025110|Ga0208158_1080208 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium TMED228 | 778
3300026292|Ga0208277_1154280 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium TMED228 | 769

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 40.83%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 18.33%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 11.67%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 10.83%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 3.33%
Surface Seawater | Environmental → Aquatic → Marine → Oceanic → Photic Zone → Surface Seawater | 2.50%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.67%
Microbial Mat | Environmental → Aquatic → Marine → Coastal → Sediment → Microbial Mat | 1.67%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 1.67%
Seawater | Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater | 0.83%
Seawater | Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater | 0.83%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 0.83%
Marine | Environmental → Aquatic → Marine → Oceanic → Aphotic Zone → Marine | 0.83%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 0.83%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 0.83%
Marine | Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Marine | 0.83%
Hydrothermal Vent Plume | Environmental → Aquatic → Marine → Hydrothermal Vents → Unclassified → Hydrothermal Vent Plume | 0.83%
Macroalgal Surface | Host-Associated → Algae → Green Algae → Ectosymbionts → Unclassified → Macroalgal Surface | 0.83%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000116 | Marine microbial communities from Delaware Coast, sample from Delaware MO Spring March 2010 | Environmental
3300000949 | Macroalgal surface ecosystem from Botany Bay, Sydney, Australia - BBAY94 | Host-Associated
3300001589 | Marine viral communities from the Pacific Ocean - LP-40 | Environmental
3300001683 | Hydrothermal vent plume microbial communities from Guaymas Basin, Gulf of California - IDBA assembly | Environmental
3300003586 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI074_LV_135m_DNA | Environmental
3300003588 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI072_LV_100m_DNA | Environmental
3300005239 | Environmental Genome Shotgun Sequencing: Ocean Microbial Populations from the Gulf of Maine | Environmental
3300005404 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201406SV205 | Environmental
3300005428 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F10-02SV253 | Environmental
3300005432 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201310SV78 | Environmental
3300005433 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201306PF45B | Environmental
3300005510 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201306SV45 | Environmental
3300005523 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F12-01SV265 | Environmental
3300006024 | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Knorr_S15_td_DCM_ad_63m_LV_B | Environmental
3300006027 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_30_<0.8_DNA | Environmental
3300006090 | Marine microbial communities from the Eastern Tropical South Pacific Oxygen Minumum Zone, cruise NBP1315, 2013 - sample NBP124 | Environmental
3300006315 | Marine microbial communities from North Pacific Subtropical Gyre, Station ALOHA - HOT233_1_0770m | Environmental
3300006735 | Marine viral communities from the Subarctic Pacific Ocean - 5B_ETSP_OMZ_AT15132_CsCl metaG | Environmental
3300006738 | Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG | Environmental
3300006752 | Marine viral communities from the Subarctic Pacific Ocean - 13_ETSP_OMZ_AT15268 metaG | Environmental
3300006789 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG | Environmental
3300006793 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG | Environmental
3300006916 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_24 | Environmental
3300006924 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG | Environmental
3300006925 | Marine viral communities from the Subarctic Pacific Ocean - 14_ETSP_OMZ_AT15311 metaG | Environmental
3300006927 | Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG | Environmental
3300007236 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_30_>0.8_DNA | Environmental
3300007276 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 | Environmental
3300007344 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_4 | Environmental
3300007346 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 | Environmental
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental
3300007540 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG | Environmental
3300009173 | Marine microbial communities from western Arctic Ocean - ArcticOcean_MG_CB4_134 | Environmental
3300009436 | Marine eukaryotic phytoplankton communities from Arctic Ocean - Fram Strait ARC3M Metagenome | Environmental
3300009790 | Marine eukaryotic phytoplankton communities from Atlantic Ocean - Tropical Atlantic ANT10 Metagenome | Environmental
3300010153 | Marine viral communities from the Subarctic Pacific Ocean - 20_ETSP_OMZ_AT15318 metaG | Environmental
3300012920 | Marine microbial communities from the Costa Rica Dome - CRUD Field 142mm St8 metaG | Environmental
3300012953 | Marine eukaryotic phytoplankton communities from the Atlantic Ocean - Atlantic ANT 2 Metagenome | Environmental
3300017714 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 35 SPOT_SRF_2012-08-15 | Environmental
3300017719 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 13 SPOT_SRF_2010-07-21 | Environmental
3300017720 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 6 SPOT_SRF_2009-12-23 | Environmental
3300017724 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 11 SPOT_SRF_2010-05-17 | Environmental
3300017728 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 42 SPOT_SRF_2013-04-24 | Environmental
3300017731 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 39 SPOT_SRF_2013-01-16 | Environmental
3300017746 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 12 SPOT_SRF_2010-06-29 | Environmental
3300017752 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 23 SPOT_SRF_2011-06-22 | Environmental
3300017755 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 34 SPOT_SRF_2012-07-09 | Environmental
3300017762 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 45 SPOT_SRF_2013-07-18 | Environmental
3300017763 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 33 SPOT_SRF_2012-06-20 | Environmental
3300017764 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 8 SPOT_SRF_2010-02-11 | Environmental
3300017768 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 6 SPOT_SRF_2009-12-23 (version 2) | Environmental
3300017773 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 9 SPOT_SRF_2010-03-24 | Environmental
3300017775 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 55 SPOT_SRF_2014-07-17 | Environmental
3300017781 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 46 SPOT_SRF_2013-08-14 | Environmental
3300017782 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 3 SPOT_SRF_2009-08-19 | Environmental
3300020294 | Marine microbial communities from Tara Oceans - TARA_E500000331 (ERX556124-ERR599153) | Environmental
3300020345 | Marine microbial communities from Tara Oceans - TARA_B100000427 (ERX556079-ERR599137) | Environmental
3300020395 | Marine microbial communities from Tara Oceans - TARA_B100000427 (ERX555987-ERR599133) | Environmental
3300020401 | Marine microbial communities from Tara Oceans - TARA_B100000212 (ERX555985-ERR599139) | Environmental
3300020416 | Marine microbial communities from Tara Oceans - TARA_B100001109 (ERX556137-ERR599039) | Environmental
3300020421 | Marine microbial communities from Tara Oceans - TARA_B100000902 (ERX556005-ERR599007) | Environmental
3300020439 | Marine microbial communities from Tara Oceans - TARA_B100001939 (ERX556062-ERR599029) | Environmental
3300020440 | Marine microbial communities from Tara Oceans - TARA_E500000178 (ERX555952-ERR599043) | Environmental
3300020457 | Marine microbial communities from Tara Oceans - TARA_B100001113 (ERX555941-ERR599014) | Environmental
3300020473 | Marine microbial communities from Tara Oceans - TARA_B100000700 (ERX555932-ERR598948) | Environmental
3300021068 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M2 100m 12015 | Environmental
3300021959 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_13D | Environmental
3300022065 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_24 (v2) | Environmental
3300023702 | Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - 82R (Metagenome Metatranscriptome) | Environmental
3300024255 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_123_September2016_10_MG | Environmental
3300024327 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_120_MG | Environmental
3300025045 | Marine viral communities from the Pacific Ocean - LP-46 (SPAdes) | Environmental
3300025052 | Marine viral communities from the Pacific Ocean - LP-37 (SPAdes) | Environmental
3300025071 | Marine viral communities from the Pacific Ocean - LP-36 (SPAdes) | Environmental
3300025078 | Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG (SPAdes) | Environmental
3300025084 | Marine viral communities from the Subarctic Pacific Ocean - 14B_ETSP_OMZ_AT15311_CsCl metaG (SPAdes) | Environmental
3300025085 | Marine viral communities from the Subarctic Pacific Ocean - 14_ETSP_OMZ_AT15311 metaG (SPAdes) | Environmental
3300025086 | Marine viral communities from the Subarctic Pacific Ocean - 5_ETSP_OMZ_AT15132 metaG (SPAdes) | Environmental
3300025097 | Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG (SPAdes) | Environmental
3300025103 | Marine viral communities from the Subarctic Pacific Ocean - 16_ETSP_OMZ_AT15313 metaG (SPAdes) | Environmental
3300025108 | Marine viral communities from the Subarctic Pacific Ocean - 17_ETSP_OMZ_AT15314 metaG (SPAdes) | Environmental
3300025110 | Marine viral communities from the Subarctic Pacific Ocean - 8_ETSP_OMZ_AT15162 metaG (SPAdes) | Environmental
3300025151 | Marine viral communities from the Pacific Ocean - ETNP_6_30 (SPAdes) | Environmental
3300025759 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_24 (SPAdes) | Environmental
3300026136 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201306PF45B (SPAdes) | Environmental
3300026258 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201310SV78 (SPAdes) | Environmental
3300026266 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F10-02SV257 (SPAdes) | Environmental
3300026270 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F12-01SV265 (SPAdes) | Environmental
3300026292 | Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201406SV205 (SPAdes) | Environmental
3300026447 | Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - 125R_r (Metagenome Metatranscriptome) | Environmental
3300026465 | Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - 48R (Metagenome Metatranscriptome) | Environmental
3300026513 | Metatranscriptome of seawater microbial communities from Monterey Bay, California, United States - 51R (Metagenome Metatranscriptome) | Environmental
3300027833 | Marine eukaryotic phytoplankton communities from Arctic Ocean - Fram Strait ARC3M Metagenome (SPAdes) | Environmental
3300029448 | Marine viral communities collected during Tara Oceans survey from station TARA_023 - TARA_E500000082 | Environmental
3300032006 | Marine microbial communities from station ALOHA, North Pacific Subtropical Gyre - HC15-DNA-20-200_MG | Environmental
3300032254 | Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 3-month chalcopyrite | Environmental
3300032277 | Microbial mat bacterial communities from mineral coupon in-situ incubated in ocean water Damariscotta River, Maine, United States - 3-month pyrrhotite | Environmental
3300034374 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 (v4) | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
DelMOSpr2010_100372825 | 3300000116 | Marine | MKYVVRIWLNDTMKKEIYFEADNDIIAMQKASAAIPDGCRATYEEINEETYKQETKIKTETINEEDINNF*
BBAY94_101589092 | 3300000949 | Macroalgal Surface | MKYKLRIWSNDSIKKEIYFEADSDIIAMQKTSAAVPDGCRATYEEIDEESYKKETQIKSEAINEEDLKTF*
JGI24005J15628_100098715 | 3300001589 | Marine | MYYRVIIWNGESFKKEIMFSADNEVIAMQKASAATPDGCRANYESITKEEYDAQSRNKEV
GBIDBA_100765561 | 3300001683 | Hydrothermal Vent Plume | MFYLVKIWDNSDCFLKKEILYQSDNDVTAMQKASAATPDGCRATYESIDKEKYEKEKTQKTEETETVIEQDIAS*
JGI26261J51718_10081127 | 3300003586 | Marine | NGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYENGYKKTEAQEQAEA*
JGI26247J51722_10066932 | 3300003588 | Marine | MYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYENGYKKTEAQEQAEA*
Ga0073579_119142013 | 3300005239 | Marine | MHYRVRIWNGELFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA*
Ga0066856_100198533 | 3300005404 | Marine | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEANQTKTETKAEAEA*
Ga0066856_100655172 | 3300005404 | Marine | MNYLVKIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVDIVGENNA*
Ga0066856_100706063 | 3300005404 | Marine | MNYLVKIWNRADSHFKKEILFNADNDVIAMQKASAATPDGCRATYEEINKEKYEKEKEQAIKTEEN*
Ga0066856_101738411 | 3300005404 | Marine | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAKV*
Ga0066863_103607412 | 3300005428 | Marine | MYYLVKIWDNEQGFILKKEILYQAENDVSAMQKASAATPDGCRSTYESINKEEYEKTKETKKAEEAA*
Ga0066845_100457934 | 3300005432 | Marine | MYYKVNIWKNDELKKVIYYKADNDIIAMQKASAAVPDGCRATYEEINEKIYFEETETKTEAKAEAEA*
Ga0066830_100545852 | 3300005433 | Marine | MNYLVKIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVDIVGGNNA*
Ga0066825_103887061 | 3300005510 | Marine | MYYKVRIWGNDILKKEIYYKAENDIIAMQKASAAVPDGCRATYEEINEKIYFEETETKT*
Ga0066865_100096606 | 3300005523 | Marine | MNYLVKIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVTITGENNA*
Ga0066865_102703812 | 3300005523 | Marine | MYYLVKIWGNDQLKKEILYEADNDVIAMQKASAATPDGCRATYEETNKEEYEKIYQTKTETEAEAQT*
Ga0066371_100937632 | 3300006024 | Marine | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAEV*
Ga0075462_100065544 | 3300006027 | Aqueous | MYYRVRIWNGESFKKEIMYSAENEVIAMQKASAAIPDGCRANYESITKEEYEKSYESKTEE*
Ga0082015_10324463 | 3300006090 | Marine | MDYLVKIWDNQEGFVLKKEILYQADNDVNAMQKASAATPDGCRSTYETIDKEKYEKEKRERAEEEVAA*
Ga0068487_10291511 | 3300006315 | Marine | MNYLVKIWNYTDSNFKKEILFSADNDVIAMQKASAATPDGCRSTYEEINKEDYEKEKAKKDIEDK
Ga0098038_11595052 | 3300006735 | Marine | MYYLVKIWNNDTLKKEILFKADNDVIAMQKASAATPDGCRANYESINKEEYEKTYQTKTETETEAQT*
Ga0098035_11194164 | 3300006738 | Marine | MYYLVKIWDNEQGFILKKEILYQAENDVSAMQKASAATPDGCRSTYETIDKEKYEKEKRERAEEEVAA*
Ga0098048_10560154 | 3300006752 | Marine | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAEV*
Ga0098054_10075743 | 3300006789 | Marine | MYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA*
Ga0098055_11710833 | 3300006793 | Marine | MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTETEA*
Ga0070750_1000301715 | 3300006916 | Aqueous | MYYLVKIWGNDQLKKEILYEADNDVIAMQKASAATPDGCRATYEETNKEEYEKIYQTKTETEAQT*
Ga0070750_101153345 | 3300006916 | Aqueous | MYYRVRIWNGELFKKEIMYEAENEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA*
Ga0098051_10135451 | 3300006924 | Marine | MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA*
Ga0098050_10819581 | 3300006925 | Marine | YKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAQV*
Ga0098050_11055941 | 3300006925 | Marine | NKRGQMYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA*
Ga0098034_10998131 | 3300006927 | Marine | MYYLVKIWDNEQGFILKKEILYQAENDVNAMQKASAATPDGCRSTYETIDKEKYEKEKRERAEEEVAA*
Ga0075463_100627704 | 3300007236 | Aqueous | NGELFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA*
Ga0075463_101086071 | 3300007236 | Aqueous | MYYRVRIWNGESFKKEIMYSAENEVIAMQKASAAIPDGCRANYESITKE
Ga0070747_10170962 | 3300007276 | Aqueous | MYYRVRIWNGESFKKEIMYSAENEVIAMQKASAAIPDGCRANYESITKEEYEKSYEDKTKEQAEA*
Ga0070745_12835972 | 3300007344 | Aqueous | MKKEIYFEADNDIIAMQKASAAIPDGCRATYEEINEETYKQETKIKTKTINEEDINNF*
Ga0070753_11131404 | 3300007346 | Aqueous | MKKEIYFEADNDIIAMQKASAAIPDGCRATYEEINEETYKQETKIKTETINEEDINNF*
Ga0099851_10380543 | 3300007538 | Aqueous | MKKEIYFEANNDIIAMQKASAAIPDGCRATYEEINEETYKQETKIKTETINEEDINNF*
Ga0099847_10215664 | 3300007540 | Aqueous | MYYRVKIWNGESFKKEIIYSAENEVIAMQKASAAIPDGCRANYESITKEEYEKSYESKTEE*
Ga0114996_111130531 | 3300009173 | Marine | MFYLVKIWDNSDCFLKKEILYQSDNDVNAMQKASAATPDGCRATYESIDKEKYEKEKTKKTEEAETVIEQDIAS*
Ga0115008_105375711 | 3300009436 | Marine | MYYRVRIWNGESFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYDAQSRNQEV
Ga0115012_101783524 | 3300009790 | Marine | KVNVWKNDELKKEIYYKADNDIIAMQKASAAIPDGCRATYEEIDEKIYIQETETKIKTETEAQAEV*
Ga0115012_110137843 | 3300009790 | Marine | DILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEETYQETQQTKTEAKAEAEA*
Ga0098059_13272592 | 3300010153 | Marine | MNYLVKIWNHTDSVFKKEILFYASNDVIAMQKASAATPDGCRATYEEINKEDYEKAKAKEKQEEQ
Ga0160423_101571492 | 3300012920 | Surface Seawater | MYYLVKIWGNDQLKKEILYEADNDVIAMQKASAATPDGCRATYEETNKEEYEKIYQTKAETEAEAQT*
Ga0160423_103449503 | 3300012920 | Surface Seawater | MYYKVRIWGNDILKKEIYYKAENDIIAMQKASAAVPDGCRATYESIEKEEYDAQSRKQEV
Ga0160423_105674061 | 3300012920 | Surface Seawater | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYESIEKEEYDAQSRKQEV
Ga0163179_1001074212 | 3300012953 | Seawater | MYYRVKIWNGESFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYDAQSRNKEV
Ga0181412_10933291 | 3300017714 | Seawater | MYYKVTIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181390_11513382 | 3300017719 | Seawater | MYYKVRIWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181383_10267694 | 3300017720 | Seawater | IWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181388_10247421 | 3300017724 | Seawater | MYYKVTIWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYEPITKEEYDAQSRNQE
Ga0181419_11123013 | 3300017728 | Seawater | MYYKVTIWNGESFKKEIMYSAENEVISMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181416_10609703 | 3300017731 | Seawater | MYYLVKIWNNYGNELKKEILYQAENDVIAMQKASAATPDGCRATYESINKEEYEKTKETQKAEEAA
Ga0181389_11746051 | 3300017746 | Seawater | IWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181400_11099921 | 3300017752 | Seawater | KEIMFSADNEVIAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181411_10060928 | 3300017755 | Seawater | MYYKVTIWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYESINKEEYDAQSRNQEV
Ga0181422_10979263 | 3300017762 | Seawater | MYYKVRIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYESINKEEYDAQSRNQEV
Ga0181410_10231385 | 3300017763 | Seawater | MYYKVTIWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181385_10175335 | 3300017764 | Seawater | MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYENGYKKTEAKEQA
Ga0181385_10582092 | 3300017764 | Seawater | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEANQTKTETKAEAEA
Ga0181385_11276692 | 3300017764 | Seawater | MYYLVKIWNNYGNELKKEILYQAENDVSAMQKASAATPDGCRATYESIDKEQYEKTKEAEKAEEAA
Ga0187220_10337192 | 3300017768 | Seawater | MYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRANYESINKEEYENGYKKTEAEEQTEA
Ga0187220_11054821 | 3300017768 | Seawater | YMYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEANQTKTETKAEAEA
Ga0187220_11113562 | 3300017768 | Seawater | MNYLVKIWNHSDSDFKKEILFNADNDVIAMQKASAATPDGCRATYEEIDKEKYEKEKAIKTKEDQES
Ga0181386_10930962 | 3300017773 | Seawater | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQ
Ga0181386_10947901 | 3300017773 | Seawater | MYYKVRIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYEPITKEEYDAQSRNQEV
Ga0181432_12495132 | 3300017775 | Seawater | MFYLVKIWDNNDCFLKKEILYQSDNDVNAMQKASAATPDGCRATYESIDKEKYEKEKTKKTEEAETVIEQDIAS
Ga0181423_11197463 | 3300017781 | Seawater | MYYKVTIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYESINKEEYDAQSRNEEV
Ga0181380_10373485 | 3300017782 | Seawater | IWNGESFKKEIMFSAENEVLAMQKASAATPDGCRANYEPITKEEYDAQSRNEEV
Ga0211520_10819292 | 3300020294 | Marine | MYYLVKIWNQDQFKKEILFEADNDVVAMQKASAATPDGCRSNYESINKEEYENGYKKTEAEEQAEA
Ga0211706_10352412 | 3300020345 | Marine | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAEV
Ga0211705_101511351 | 3300020395 | Marine | LIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEETYKEAKQTKIQTETEAQAEV
Ga0211617_100881484 | 3300020401 | Marine | MYYKVNVWKNDELKKEIYYKADNDIIAMQKASAAIPDGCRATYEEINEETYKKETETKTTEKTEEAQIAI
Ga0211644_104374243 | 3300020416 | Marine | GTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYESIEKEEYDAQSRKQEV
Ga0211653_101784863 | 3300020421 | Marine | MYYLVKIWNKDQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKIYQTETKAEAEA
Ga0211558_104967061 | 3300020439 | Marine | MYYKVNIWKNDSLKKVIYYKADNDIIAMQKASAAIPDGCRATYEEIDEKIYIKETETKTE
Ga0211518_103627372 | 3300020440 | Marine | MYYLVKIWNNDQFKKEILFEADNDVIAMQKASAATPDGCRANYESINKEEYENGYKKTEAEEQAEA
Ga0211643_104296242 | 3300020457 | Marine | MYYKVYVWKNDELKKEIYYKADNDIIAMQKASAAVPDGCRATYESIREEEYNAQSRKQEV
Ga0211625_100676233 | 3300020473 | Marine | MFYVVKVWGNEILKKEILYEADNDVIAMQKASAAIPDGCRATYEEVNKEEYEKAKETKKAEEAFV
Ga0206684_10290671 | 3300021068 | Seawater | MFYLVKIWDNNDCFLKKEILYQSDNDVTAMQKASAATPDGCRATYESIDKEKYEKE
Ga0222716_103074082 | 3300021959 | Estuarine Water | MKYVVRVWLNDTMKKEIYFEANNDIIAMQKASAAIPDGCRATYEEINEETYKQETQIKTKTINEEDINNF
Ga0212024_10853671 | 3300022065 | Aqueous | MYYRVRIWNGELFKKEIMYEAENEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA
Ga0232119_10321712 | 3300023702 | Seawater | MYYKVTIWNGESFKKEIMYSAETEVIAMQKASAATPDGCRANYESINKEEYDAQSRNQEV
(restricted) Ga0233438_100137668 | 3300024255 | Seawater | MYYRVRIWNGESFKKEIMYSADNEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA
(restricted) Ga0233434_13046482 | 3300024327 | Seawater | MYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYENGYKKTEAQEQAEA
Ga0207901_10433931 | 3300025045 | Marine | MFYLVKIWDNSDCFLKKEILYQSDNDVNAMQKASAATPDGCRATYESIDKEKYEKEKTKKTEEAETVIEQDIAS
Ga0207906_10049064 | 3300025052 | Marine | MFYLVKIWDNSDCFLKKEILYQSDNDVNAMQKASAATPDGCRATYESIDKEKYEKEKSQKTEETETVIAEDIAS
Ga0207896_10221253 | 3300025071 | Marine | MYYRVRIWNGESFKKEIMFSADNEVIAMQKASAATPDGCRANYESITKEEYDAQSRNKEV
Ga0208668_10016888 | 3300025078 | Marine | MDYLVKIWDNQEGFVLKKEILYQADNDVNAMQKASAATPDGCRSTYETIDKEKYEKEKRERAEEEVAA
Ga0208298_10512314 | 3300025084 | Marine | LSNKRGQMYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYEAINKEEYEKTYQTKTEEQTEA
Ga0208792_10304173 | 3300025085 | Marine | MYYLVKIWNGEEFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA
Ga0208157_10424462 | 3300025086 | Marine | MYYLVKIWNNDTLKKEILFKADNDVIAMQKASAATPDGCRANYESINKEEYEKTYQTKTETETEAQT
Ga0208010_11073683 | 3300025097 | Marine | MYYLVKIWDNEQGFILKKEILYQAENDVSAMQKASAATPDGCRSTYETIDKEKYEKEKRERAEEEVAA
Ga0208013_10941873 | 3300025103 | Marine | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAKV
Ga0208793_10026031 | 3300025108 | Marine | MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA
Ga0208158_10802082 | 3300025110 | Marine | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAEV
Ga0209645_12444242 | 3300025151 | Marine | MYYKVRIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYESIEKEEYDAQSRKQEV
Ga0208899_10139086 | 3300025759 | Aqueous | MYYRVRIWNGESFKKEIMYSAENEVIAMQKASAAIPDGCRANYESITKEEYEKSYESKTE
Ga0208899_10334104 | 3300025759 | Aqueous | MYYLVKIWGNDQLKKEILYEADNDVIAMQKASAATPDGCRATYEETNKEEYEKIYQTKTETEAQT
Ga0208763_10307312 | 3300026136 | Marine | MNYLVKIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVDIVGGNNA
Ga0208130_10364851 | 3300026258 | Marine | MYYKVNIWKNDELKKVIYYKADNDIIAMQKASAAVPDGCRATYEEINEKIYFEETETKTEAKAEAEA
Ga0208130_10529722 | 3300026258 | Marine | MNYLVKIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVDIVGENNA
Ga0208410_11088752 | 3300026266 | Marine | MYYKVLIWGTDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEIDEKIYIKETETKIKTETEAQAEV
Ga0208410_11388312 | 3300026266 | Marine | MNYLVKIWNRADSHFKKEILFNADNDVIAMQKASAATPDGCRATYEEINKEKYE
Ga0207993_10131414 | 3300026270 | Marine | MYYKVNVWKNDELKKEIYYKAENDIIAMQKASAAIPDGCRATYEEIDEKIYIKETETKIKTETEAQAEV
Ga0207993_11702432 | 3300026270 | Marine | KIWNHSDSHFKKEILFSADNDVIAMQKVSAATPDGCRATFEEINKDQYEQQKAKQTEVDIVGENNA
Ga0208277_10914142 | 3300026292 | Marine | MNYLVKIWNRADSHFKKEILFNADNDVIAMQKASAATPDGCRATYEEINKEKYEKEKEQAIKTEEN
Ga0208277_11542804 | 3300026292 | Marine | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEANQTKTETKAEAEA
Ga0247607_10747613 | 3300026447 | Seawater | MYYKVTIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYESINKEEYDAQSRN
Ga0247588_10443053 | 3300026465 | Seawater | MYYKVTIWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYESINKEEYDAQSRNQEV
Ga0247590_12020941 | 3300026513 | Seawater | IWNGESFKKEIMYSAENEVIAMQKASAATPDGCRANYESINKEEYDAQSRNEEV
Ga0209092_105526532 | 3300027833 | Marine | MYYRVRIWNGESFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYDAQSRNKEV
Ga0183755_10054059 | 3300029448 | Marine | MYYKVLIWGNDILKKEIYYKAENDIIAMQKASAAIPDGCRATYEEINEKTYEEAKQTKIQTETEAQAKV
Ga0183755_10292644 | 3300029448 | Marine | MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYENGYKKTEAEEQAEA
Ga0183755_10294853 | 3300029448 | Marine | MYYLVKIWNNDQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKIYQTKTETEAEA
Ga0310344_106860352 | 3300032006 | Seawater | MNYLVKIWNYTDSNFKKEILFSAANDVIAMQKASAATPDGCRSTYEEINKEDYEKEKAKNDIE
Ga0316208_11579511 | 3300032254 | Microbial Mat | MYYRVRIWNGESFKKEIMYSAENEVIAMQKASAAIPDGCRANYESINKEEYEKSYENKTEEQAEA
Ga0316202_100652191 | 3300032277 | Microbial Mat | MHYRVRIWNGELFKKEIMYEADNEVIAMQKASAATPDGCRANYESINKEEYEKSYEDKTKEQAEA
Ga0348335_029121_1196_1372 | 3300034374 | Aqueous | MKKEIYFEADNDIIAMQKASAAIPDGCRATYEEINEETYKQETKIKTETINEEDINNF
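
Rows of the table above can be exported for use with sequence tools. A minimal Python sketch that writes FASTA text from already separated fields. The header layout (`>ID taxon=... habitat=...`) and all names are hypothetical choices, and the sketch assumes the trailing `*` on some sequences is a stop-codon marker that should not be written as a residue:

```python
def to_fasta(records, width=60):
    """Build FASTA text from (protein_id, taxon_id, habitat, sequence) tuples."""
    lines = []
    for protein_id, taxon_id, habitat, sequence in records:
        # Assumption: a trailing "*" marks the stop codon, so it is dropped.
        residues = sequence.rstrip("*")
        # Hypothetical header layout; adapt it to the downstream tool.
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        # Wrap the sequence at a fixed line width.
        lines.extend(residues[i:i + width] for i in range(0, len(residues), width))
    return "\n".join(lines) + "\n"


# One row of the family table, split into its four fields
example_records = [
    (
        "Ga0098051_10135451",
        "3300006924",
        "Marine",
        "MYYLVKIWNGEQFKKEILFEADNDVIAMQKASAATPDGCRSNYESINKEEYEKTYQTKTEEQTEA*",
    ),
]

fasta_text = to_fasta(example_records)
print(fasta_text, end="")
```

Keeping the taxon ID and habitat in the header makes it possible to trace each sequence back to its sample after export.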




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice