NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F093713

Overview

Basic Information
Family ID F093713
Family Type Metagenome
Number of Sequences 106
Average Sequence Length 69 residues
Representative Sequence MNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Number of Associated Samples 48
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 85.85 %
% of genes near scaffold ends (potentially truncated) 23.58 %
% of genes from short scaffolds (< 2000 bps) 79.25 %
Associated GOLD sequencing projects 36
AlphaFold2 3D model prediction No

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Unclassified (52.830 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (78.302 % of family members)
Environment Ontology (ENVO) Unclassified (99.057 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (98.113 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 84.06%    β-sheet: 0.00%    Coil/Unstructured: 15.94%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 106 Family Scaffolds
PF01471    PG_binding_1    75.47
PF00589    Phage_integrase    8.49
PF02467    Whib    0.94
PF04586    Peptidase_S78    0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 106 Family Scaffolds
COG3740    Phage head maturation protease    Mobilome: prophages, transposons [X]    0.94



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
Unclassified    root    N/A    52.83 %
All Organisms    root    All Organisms    47.17 %


Associated Scaffolds


Scaffold    Taxonomy    Length
3300002484|JGI25129J35166_1064246    All Organisms → cellular organisms → Bacteria    682
3300002484|JGI25129J35166_1091352    Not Available    541
3300002514|JGI25133J35611_10077049    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1034
3300002514|JGI25133J35611_10092251    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    908
3300002514|JGI25133J35611_10144510    Not Available    659
3300002514|JGI25133J35611_10193969    Not Available    535
3300002518|JGI25134J35505_10052740    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1014
3300002518|JGI25134J35505_10113772    Not Available    578
3300002519|JGI25130J35507_1006048    All Organisms → cellular organisms → Bacteria    3265
3300002519|JGI25130J35507_1009211    All Organisms → cellular organisms → Bacteria    2524
3300002519|JGI25130J35507_1010720    All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes    2300
3300002519|JGI25130J35507_1011526    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2203
3300002519|JGI25130J35507_1043659    Not Available    913
3300002519|JGI25130J35507_1047132    Not Available    867
3300002519|JGI25130J35507_1054363    Not Available    787
3300002519|JGI25130J35507_1061230    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    727
3300002519|JGI25130J35507_1069089    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    670
3300003494|JGI26240J51127_1023387    Not Available    1202
3300003494|JGI26240J51127_1035569    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    873
3300003495|JGI26244J51143_1022227    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1307
3300003495|JGI26244J51143_1032197    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    996
3300003498|JGI26239J51126_1053513    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    752
3300003618|JGI26381J51731_1085543    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    655
3300005400|Ga0066867_10024673    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2458
3300005426|Ga0066847_10096396    Not Available    927
3300005431|Ga0066854_10115717    Not Available    896
3300005508|Ga0066868_10035764    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1601
3300006736|Ga0098033_1013340    Not Available    2618
3300006736|Ga0098033_1043969    Not Available    1323
3300006736|Ga0098033_1057186    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium HGW-Chloroflexi-10    1140
3300006736|Ga0098033_1083016    Not Available    919
3300006736|Ga0098033_1133541    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    699
3300006736|Ga0098033_1186690    Not Available    576
3300006736|Ga0098033_1191010    Not Available    568
3300006736|Ga0098033_1191541    Not Available    567
3300006736|Ga0098033_1206004    Not Available    544
3300006736|Ga0098033_1214130    Not Available    532
3300006738|Ga0098035_1055390    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1440
3300006738|Ga0098035_1058208    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1397
3300006738|Ga0098035_1233223    Not Available    609
3300006751|Ga0098040_1001334    All Organisms → cellular organisms → Bacteria    10726
3300006751|Ga0098040_1208669    Not Available    570
3300006753|Ga0098039_1025787    Not Available    2100
3300006753|Ga0098039_1124496    Not Available    884
3300006754|Ga0098044_1369564    Not Available    542
3300006926|Ga0098057_1031680    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1314
3300006927|Ga0098034_1112774    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    776
3300006927|Ga0098034_1240739    Not Available    501
3300006929|Ga0098036_1279604    Not Available    503
3300006988|Ga0098064_106609    Not Available    1976
3300007504|Ga0104999_1052869    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1831
3300007765|Ga0105010_1038235    All Organisms → Viruses → Predicted Viral    1908
3300010153|Ga0098059_1309946    Not Available    602
3300010155|Ga0098047_10013695    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    3286
3300010155|Ga0098047_10309760    Not Available    596
3300010155|Ga0098047_10316451    Not Available    588
3300010155|Ga0098047_10350497    Not Available    555
3300017705|Ga0181372_1026669    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    982
3300017775|Ga0181432_1083524    Not Available    935
3300020423|Ga0211525_10194683    Not Available    859
3300021443|Ga0206681_10370667    Not Available    553
3300022225|Ga0187833_10347034    Not Available    808
3300022225|Ga0187833_10352607    Not Available    799
3300022225|Ga0187833_10503524    Not Available    622
3300022227|Ga0187827_10145402    Not Available    1678
3300022227|Ga0187827_10704931    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    572
3300022227|Ga0187827_10836373    Not Available    505
(restricted) 3300022888|Ga0233428_1081508    Not Available    1226
(restricted) 3300022933|Ga0233427_10447485    Not Available    512
3300025038|Ga0208670_110728    Not Available    1060
3300025072|Ga0208920_1000230    All Organisms → cellular organisms → Bacteria    15015
3300025072|Ga0208920_1000264    All Organisms → cellular organisms → Bacteria    13960
3300025072|Ga0208920_1043011    Not Available    916
3300025078|Ga0208668_1001077    All Organisms → cellular organisms → Bacteria    7271
3300025082|Ga0208156_1061893    All Organisms → cellular organisms → Bacteria    727
3300025082|Ga0208156_1075523    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    636
3300025097|Ga0208010_1003925    All Organisms → cellular organisms → Bacteria    4415
3300025097|Ga0208010_1054210    Not Available    885
3300025109|Ga0208553_1011902    Not Available    2400
3300025109|Ga0208553_1132902    Not Available    556
3300025112|Ga0209349_1002964    All Organisms → cellular organisms → Bacteria    7866
3300025112|Ga0209349_1004689    All Organisms → cellular organisms → Bacteria    6001
3300025112|Ga0209349_1013476    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    3071
3300025112|Ga0209349_1071851    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1033
3300025112|Ga0209349_1084605    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    928
3300025114|Ga0208433_1095256    Not Available    743
3300025114|Ga0208433_1104996    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    697
3300025122|Ga0209434_1015726    All Organisms → cellular organisms → Bacteria    2668
3300025122|Ga0209434_1017299    All Organisms → cellular organisms → Bacteria    2513
3300025122|Ga0209434_1021257    Not Available    2208
3300025122|Ga0209434_1021653    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2184
3300025122|Ga0209434_1022020    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    2161
3300025122|Ga0209434_1034643    All Organisms → Viruses → Predicted Viral    1631
3300025122|Ga0209434_1089654    Not Available    892
3300025122|Ga0209434_1171618    Not Available    578
3300025131|Ga0209128_1119405    Not Available    825
3300025131|Ga0209128_1141610    Not Available    729
3300025141|Ga0209756_1061637    Not Available    1770
3300025141|Ga0209756_1213039    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    731
3300025141|Ga0209756_1273008    Not Available    609
3300025141|Ga0209756_1304104    Not Available    561
3300025547|Ga0209556_1023089    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1832
3300025547|Ga0209556_1094611    Not Available    658
3300025770|Ga0209362_1079337    Not Available    1280
3300026267|Ga0208278_1020568    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1779
3300028039|Ga0256380_1005472    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1921

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Marine    Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine    78.30%
Marine    Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine    8.49%
Seawater    Environmental → Aquatic → Marine → Oceanic → Unclassified → Seawater    5.66%
Seawater    Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater    1.89%
Marine    Environmental → Aquatic → Marine → Coastal → Unclassified → Marine    0.94%
Water Column    Environmental → Aquatic → Marine → Coastal → Unclassified → Water Column    0.94%
Seawater    Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater    0.94%
Marine    Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine    0.94%
Seawater    Environmental → Aquatic → Marine → Pelagic → Unclassified → Seawater    0.94%
Seawater    Environmental → Aquatic → Marine → Strait → Unclassified → Seawater    0.94%
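
Since the family has 106 members, each habitat percentage above should correspond to a whole number of members. The following is a minimal Python sketch of that arithmetic, assuming the listed percentages are rounded to two decimals and that each member is assigned to exactly one habitat (an assumption; the page does not state this). The names `HABITATS` and `implied_count` are illustrative only.

```python
# Family size as listed under Basic Information ("Number of Sequences 106").
FAMILY_SIZE = 106

# (ecosystem subtype, habitat label, listed percentage), copied from the table.
HABITATS = [
    ("Oceanic", "Marine", 78.30),
    ("Intertidal Zone", "Marine", 8.49),
    ("Oceanic", "Seawater", 5.66),
    ("Inlet", "Seawater", 1.89),
    ("Coastal", "Marine", 0.94),
    ("Coastal", "Water Column", 0.94),
    ("Intertidal Zone", "Seawater", 0.94),
    ("Unclassified", "Marine", 0.94),
    ("Pelagic", "Seawater", 0.94),
    ("Strait", "Seawater", 0.94),
]


def implied_count(percent, total=FAMILY_SIZE):
    """Return the whole number of members closest to percent% of total."""
    return round(percent * total / 100)


# Member count implied by each listed percentage.
counts = [implied_count(percent) for _, _, percent in HABITATS]

# Percentages recomputed from those counts, rounded as on the page.
recomputed = [round(100 * count / FAMILY_SIZE, 2) for count in counts]

for (subtype, habitat, percent), count in zip(HABITATS, counts):
    print(f"{subtype} / {habitat}: {percent:.2f}% -> {count} member(s)")
print("total members:", sum(counts))
```

If the assumptions hold, the implied counts should add up to the family size and reproduce the listed percentages after rounding.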




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID    Sample Name    Habitat Type
3300002484    Marine viral communities from the Pacific Ocean - ETNP_2_130    Environmental
3300002514    Marine viral communities from the Pacific Ocean - ETNP_6_85    Environmental
3300002518    Marine viral communities from the Pacific Ocean - ETNP_6_100    Environmental
3300002519    Marine viral communities from the Pacific Ocean - ETNP_2_300    Environmental
3300003494    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_150m_DNA    Environmental
3300003495    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S4LV_150m_DNA    Environmental
3300003498    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_130m_DNA    Environmental
3300003618    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI073_LV_165m_DNA    Environmental
3300005400    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F12-01SV261    Environmental
3300005426    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201310SV74    Environmental
3300005431    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP201406SV75    Environmental
3300005508    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F12-01SV259    Environmental
3300006736    Marine viral communities from the Subarctic Pacific Ocean - 1_ETSP_OMZ_AT15124 metaG    Environmental
3300006738    Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG    Environmental
3300006751    Marine viral communities from the Subarctic Pacific Ocean - 7_ETSP_OMZ_AT15161 metaG    Environmental
3300006753    Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG    Environmental
3300006754    Marine viral communities from the Subarctic Pacific Ocean - 10_ETSP_OMZ_AT15264 metaG    Environmental
3300006926    Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG    Environmental
3300006927    Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG    Environmental
3300006929    Marine viral communities from the Subarctic Pacific Ocean - 4_ETSP_OMZ_AT15127 metaG    Environmental
3300006988    Marine viral communities from Cariaco Basin, Caribbean Sea - 24B_WHOI_OMZ_CsCl    Environmental
3300007504    Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 267m, 2.7-0.2um, replicate a    Environmental
3300007765    Marine water column microbial communities of the permanently stratified Cariaco Basin, Venezuela, November cruise - 247m, 2.7-0.2um, replicate b    Environmental
3300010153    Marine viral communities from the Subarctic Pacific Ocean - 20_ETSP_OMZ_AT15318 metaG    Environmental
3300010155    Marine viral communities from the Subarctic Pacific Ocean - 12_ETSP_OMZ_AT15267 metaG    Environmental
3300017705    Marine viral communities from the Subarctic Pacific Ocean - Lowphox_08 viral metaG    Environmental
3300017775    Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 55 SPOT_SRF_2014-07-17    Environmental
3300020423    Marine microbial communities from Tara Oceans - TARA_B100000315 (ERX556027-ERR599062)    Environmental
3300021443    Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 500m 12015    Environmental
3300022225    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014_SV_400_PacBio MetaG (Illumina Assembly)    Environmental
3300022227    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014_SV_150_PacBio MetaG (Illumina Assembly)    Environmental
3300022888 (restricted)    Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_120_MG    Environmental
3300022933 (restricted)    Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_118_April2016_100_MG    Environmental
3300025038    Marine viral communities from Cariaco Basin, Caribbean Sea - 24B_WHOI_OMZ_CsCl (SPAdes)    Environmental
3300025072    Marine viral communities from the Subarctic Pacific Ocean - 19_ETSP_OMZ_AT15317 metaG (SPAdes)    Environmental
3300025078    Marine viral communities from the Subarctic Pacific Ocean - 18_ETSP_OMZAT15316 metaG (SPAdes)    Environmental
3300025082    Marine viral communities from the Subarctic Pacific Ocean - 1_ETSP_OMZ_AT15124 metaG (SPAdes)    Environmental
3300025097    Marine viral communities from the Subarctic Pacific Ocean - 2_ETSP_OMZ_AT15125 metaG (SPAdes)    Environmental
3300025109    Marine viral communities from the Subarctic Pacific Ocean - 6_ETSP_OMZ_AT15160 metaG (SPAdes)    Environmental
3300025112    Marine viral communities from the Pacific Ocean - ETNP_2_130 (SPAdes)    Environmental
3300025114    Marine viral communities from the Subarctic Pacific Ocean - 3_ETSP_OMZ_AT15126 metaG (SPAdes)    Environmental
3300025122    Marine viral communities from the Pacific Ocean - ETNP_2_300 (SPAdes)    Environmental
3300025131    Marine viral communities from the Pacific Ocean - ETNP_6_100 (SPAdes)    Environmental
3300025141    Marine viral communities from the Pacific Ocean - ETNP_6_85 (SPAdes)    Environmental
3300025547    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI037_S3LV_150m_DNA (SPAdes)    Environmental
3300025770    Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - SI072_LV_165m_DNA (SPAdes)    Environmental
3300026267    Marine microbial and viral communities from oxygen minimum zone, Eastern Pacific Ocean - ETNP2014F12-01SV259 (SPAdes)    Environmental
3300028039    Seawater viral communities from deep brine pools at the bottom of the Mediterranean Sea - LS1 2300m    Environmental





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25129J35166_106424623300002484MarineMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25129J35166_109135213300002484MarineMEDNMNDILVRAGKTWLQTFAGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25133J35611_1007704923300002514MarineMEDNMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
JGI25133J35611_1009225123300002514MarineMEDNMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAKPGA*
JGI25133J35611_1014451023300002514MarineMNDILVRAGKTWLQTFAGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25133J35611_1019396913300002514MarineMNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
JGI25134J35505_1005274023300002518MarineMNDILVRAGKTWLQTFAGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA*
JGI25134J35505_1011377233300002518MarineWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25130J35507_100604853300002519MarineVRDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVXXXXNSLKLAKPGA*
JGI25130J35507_100921153300002519MarineMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPNA*
JGI25130J35507_101072043300002519MarineVRDILTRAGKTWLQTFVGLLVASWASRTIXIETLDPLQELSTIAGXAFASIPAAVSXLQNSLKLAKPGA*
JGI25130J35507_101152613300002519MarineMRDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA*
JGI25130J35507_104365933300002519MarineVNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25130J35507_104713223300002519MarineMEDNVNDILTRAGKTWLQTFGGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLVKPGA*
JGI25130J35507_105436323300002519MarineMRDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAKPGA*
JGI25130J35507_106123023300002519MarineMNDILVRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
JGI25130J35507_106908923300002519MarineMRDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAK
JGI26240J51127_102338713300003494MarineTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA*
JGI26240J51127_103556913300003494MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLSQTDA*
JGI26244J51143_102222723300003495MarineVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA*
JGI26244J51143_103219723300003495MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA*
JGI26239J51126_105351323300003498MarineMEDNVNDIXTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA*
JGI26381J51731_108554323300003618MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLK
Ga0066867_1002467343300005400MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0066847_1009639633300005426MarineMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0066854_1011571723300005431MarineMEDNVNDILTRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA*
Ga0066868_1003576443300005508MarineMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_101334053300006736MarineMNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_104396913300006736MarineEQVVEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_105718623300006736MarineMNDILTRAGKTWLQTFGGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLVKPGA*
Ga0098033_108301623300006736MarineMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPLA*
Ga0098033_113354123300006736MarineVRDILVRAGKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_118669023300006736MarineMNDILTRAGKTWLQTFGGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA*
Ga0098033_119101023300006736MarineMEDNVNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_119154113300006736MarineKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0098033_120600423300006736MarineMNDILIRAGKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098033_121413023300006736MarineMEDNVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098035_105539033300006738MarineMRDILVRAGKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLTQNA*
Ga0098035_105820833300006738MarineVEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098035_123322313300006738MarineHRRHRRNQHLEPIVEDNMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098040_100133483300006751MarineVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLTQNA*
Ga0098040_120866923300006751MarineVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098039_102578743300006753MarineMEDNVNDILTRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVS
Ga0098039_112449613300006753MarineLEPTLEDNMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098044_136956423300006754MarineVNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098057_103168043300006926MarineMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQN
Ga0098034_111277423300006927MarineMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSL
Ga0098034_124073913300006927MarineMRDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQN
Ga0098036_127960423300006929MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPNA*
Ga0098064_10660923300006988MarineMNDILTRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSMKLATPDA*
Ga0104999_105286913300007504Water ColumnMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSMKLATPDA*
Ga0105010_103823553300007765MarineMEDNMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSMKLATPV
Ga0098059_130994623300010153MarineMEDNVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098047_1001369523300010155MarineMRDILVRASKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLTQNA*
Ga0098047_1030976023300010155MarineVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0098047_1031645133300010155MarineQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA*
Ga0098047_1035049733300010155MarineNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*
Ga0181372_102666913300017705MarineMRDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0181432_108352443300017775SeawaterNQHLESTVEDNMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0211525_1019468323300020423MarineMRDILIRAGKTWLQTFIGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLTQNA
Ga0206681_1037066713300021443SeawaterQHLEPLMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA
Ga0187833_1034703423300022225SeawaterMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0187833_1035260723300022225SeawaterMRDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0187833_1050352423300022225SeawaterMEDNVNDILTRAGKTWLQTFGGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLVKPGA
Ga0187827_1014540243300022227SeawaterMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0187827_1070493113300022227SeawaterMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSA
Ga0187827_1083637313300022227SeawaterMEDNVNDILTRAGKTWLQTFVGLLIASWASRTINIETLDPFQELSTIAGFAVASIPAAVSVI
(restricted) Ga0233428_108150833300022888SeawaterMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA
(restricted) Ga0233427_1044748523300022933SeawaterMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPGA
Ga0208670_11072843300025038MarineMNDILTRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSMKLATPDA
Ga0208920_100023073300025072MarineMRDILVRAGKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLTQNA
Ga0208920_100026483300025072MarineMEDNVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0208920_104301133300025072MarineVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0208668_100107763300025078MarineVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0208156_106189323300025082MarineMNDILTRAGKTWLQTFGGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLVKPGA
Ga0208156_107552313300025082MarineVNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0208010_100392573300025097MarineVEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0208010_105421033300025097MarineMRDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0208553_101190243300025109MarineMEDNVNDILTRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0208553_113290233300025109MarineHLEPTLEDNMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209349_100296493300025112MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0209349_100468983300025112MarineMEDNMNDILVRAGKTWLQTFAGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209349_101347643300025112MarineMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209349_107185123300025112MarineMEDNMNDILVRAGKTWLQTFAGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209349_108460513300025112MarineMNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0208433_109525633300025114MarineVNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0208433_110499613300025114MarineVRDILVRAGKTWLQTFVGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209434_101572643300025122MarineMNDILIRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209434_101729913300025122MarineVRDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209434_102125753300025122MarineMNDILVRAGKTWLQTFAGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209434_102165343300025122MarineVRDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSVLQNSLKLAKPGA
Ga0209434_102202043300025122MarineMNDILVRAGKTWLQTFIGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209434_103464313300025122MarineMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209434_108965423300025122MarineMRDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAKPGA
Ga0209434_117161823300025122MarineMNDILVRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLAKPGA
Ga0209128_111940543300025131MarineRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209128_114161033300025131MarineKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAKPGA
Ga0209756_106163753300025141MarineKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209756_121303923300025141MarineMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLA
Ga0209756_127300823300025141MarineMNDILVRAGKTWLQTFAGLLVASWASRNINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA
Ga0209756_130410423300025141MarineMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSVKLAKPGA
Ga0209556_102308923300025547MarineVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA
Ga0209556_109461123300025547MarineMEDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLSQTDA
Ga0209362_107933713300025770MarineDNVNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSVLQNSLKLANPNA
Ga0208278_102056843300026267MarineMEDNMNDILIRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGLAFASIPAAVSALQNSLKLAKPGA
Ga0256380_100547233300028039SeawaterMEQIVENNMNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTVAGFAFASIPAAVSVLQNSMKLAQPDA
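
In the rows above, the four columns named in the header (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following Python sketch shows one way to split a row back into fields. It rests on assumptions drawn only from this page: that every Sample Taxon ID is ten digits beginning with `3300` (as in the Associated Samples table), that the habitat label is one of Marine, Seawater, or Water Column, and that sequences consist of uppercase letters with an optional trailing `*`. The names `ROW_PATTERN` and `parse_row` are hypothetical.

```python
import re

# Assumed row layout: optional "(restricted) " prefix, protein ID,
# 10-digit taxon ID starting with "3300", habitat label, sequence,
# and an optional trailing "*".
ROW_PATTERN = re.compile(
    r"^(?P<restricted>\(restricted\) )?"
    r"(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>Marine|Seawater|Water Column)"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*?)$"
)


def parse_row(line):
    """Split one flattened row into its fields, or return None if it does not fit."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return {
        "protein_id": match.group("protein_id"),
        "taxon_id": match.group("taxon_id"),
        "habitat": match.group("habitat"),
        "sequence": match.group("sequence"),
        "has_trailing_star": match.group("stop") == "*",
        "restricted": match.group("restricted") is not None,
    }


# First row of the table, copied verbatim.
EXAMPLE_ROW = (
    "JGI25129J35166_106424623300002484Marine"
    "MNDILTRAGKTWLQTFVGLLVASWASRTINIETLDPLQELSTIAGFAFASIPAAVSALQNSLKLAKPGA*"
)

parsed = parse_row(EXAMPLE_ROW)
print(parsed)
```

The protein ID is matched non-greedily so that the ten digits immediately before the habitat label are taken as the taxon ID; rows that do not fit the assumed layout return `None` rather than a guess.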




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice