NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F100696

Overview

Basic Information
Family ID F100696
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 48 residues
Representative Sequence MAHFAKIENGVVREVIVVGNDDAPTEAAGQAFIASIGLDGEWVQTSYN
Number of Associated Samples 60
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 90.20 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 94.12 %
Associated GOLD sequencing projects 53
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.76
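The percentages on this page can be turned back into member counts. The sketch below is only an illustration: it assumes each percentage was computed over all 102 family members, which the page does not state explicitly, and the helper name `percent_to_count` is hypothetical.

```python
# Convert percentages reported for family F100696 back into member counts.
# Assumption: every percentage is taken over all 102 family members.
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a rounded percentage."""
    return round(percent / 100 * total)


reported = {
    "valid RBS motif": 90.20,
    "near scaffold end": 100.00,
    "short scaffold (< 2000 bp)": 94.12,
    "taxonomy: Unclassified": 74.51,
}
counts = {label: percent_to_count(p) for label, p in reported.items()}
for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, 90.20 % corresponds to 92 genes with a valid RBS motif and 94.12 % to 96 genes on short scaffolds.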

Hidden Markov Model: sequence logo rendered with Skylign.

Most Common Taxonomy
Group Unclassified (74.510 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake
(23.529 % of family members)
Environment Ontology (ENVO) Unclassified
(62.745 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(68.627 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 13.16%, β-sheet: 28.95%, Coil/Unstructured: 57.89%

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.76

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.16.1.5: L-aminoacid/polyamine oxidase | d2ivda2 | 2ivd | 0.62797
d.355.1.1: RplX-like | d2jxta1 | 2jxt | 0.62621
d.58.4.9: DGPF domain (Pfam 04946) | d1s7ia1 | 1s7i | 0.61375
d.375.1.1: NE1680-like | d2hfqa1 | 2hfq | 0.60337
d.139.1.0: automated matches | d2v9ya2 | 2v9y | 0.58637



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF06199 | Phage_tail_2 | 0.98
PF13884 | Peptidase_S74 | 0.98




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 74.51 %
All Organisms | root | All Organisms | 25.49 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300003393|JGI25909J50240_1117304 | Not Available | 525
3300005805|Ga0079957_1246203 | Not Available | 832
3300006802|Ga0070749_10442889 | Not Available | 712
3300007216|Ga0103961_1282126 | All Organisms → Viruses → Predicted Viral | 1215
3300007344|Ga0070745_1132261 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Podoviridae sp. ctDWo9 | 956
3300007344|Ga0070745_1281995 | Not Available | 595
3300007541|Ga0099848_1074851 | Not Available | 1328
3300007541|Ga0099848_1084307 | All Organisms → Viruses → Predicted Viral | 1235
3300007541|Ga0099848_1178808 | Not Available | 771
3300007541|Ga0099848_1315709 | Not Available | 534
3300007542|Ga0099846_1090677 | All Organisms → Viruses → Predicted Viral | 1132
3300007542|Ga0099846_1092424 | All Organisms → Viruses → Predicted Viral | 1119
3300007542|Ga0099846_1258293 | Not Available | 603
3300007960|Ga0099850_1031388 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2317
3300007960|Ga0099850_1135373 | Not Available | 999
3300007960|Ga0099850_1237465 | Not Available | 706
3300009160|Ga0114981_10435354 | Not Available | 705
3300010370|Ga0129336_10319612 | Not Available | 859
3300010370|Ga0129336_10668619 | Not Available | 551
(restricted) 3300013126|Ga0172367_10193546 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1288
(restricted) 3300013126|Ga0172367_10242253 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1102
(restricted) 3300013126|Ga0172367_10254014 | Not Available | 1065
(restricted) 3300013126|Ga0172367_10564903 | Not Available | 616
(restricted) 3300013127|Ga0172365_10735493 | Not Available | 558
(restricted) 3300013128|Ga0172366_10758203 | Not Available | 566
(restricted) 3300013130|Ga0172363_10968351 | Not Available | 525
(restricted) 3300013131|Ga0172373_10284855 | Not Available | 1073
(restricted) 3300013131|Ga0172373_10412202 | Not Available | 839
(restricted) 3300013131|Ga0172373_10473340 | Not Available | 767
(restricted) 3300013131|Ga0172373_10748614 | Not Available | 573
(restricted) 3300013131|Ga0172373_10923136 | Not Available | 505
(restricted) 3300013132|Ga0172372_10320739 | Not Available | 1093
(restricted) 3300013132|Ga0172372_10360875 | Not Available | 1008
(restricted) 3300013132|Ga0172372_10830180 | Not Available | 571
(restricted) 3300013132|Ga0172372_10923428 | Not Available | 532
(restricted) 3300013133|Ga0172362_10922058 | Not Available | 577
3300013372|Ga0177922_10482316 | Not Available | 740
3300013372|Ga0177922_11102739 | Not Available | 914
(restricted) 3300014720|Ga0172376_10790629 | Not Available | 508
3300017701|Ga0181364_1017702 | Not Available | 1181
3300017723|Ga0181362_1027221 | Not Available | 1220
3300017723|Ga0181362_1033141 | Not Available | 1098
3300017723|Ga0181362_1049792 | Not Available | 872
3300017736|Ga0181365_1121791 | Not Available | 625
3300017736|Ga0181365_1164983 | Not Available | 521
3300017774|Ga0181358_1246556 | Not Available | 564
3300017777|Ga0181357_1163448 | Not Available | 814
3300017778|Ga0181349_1018725 | All Organisms → Viruses → Predicted Viral | 2837
3300017778|Ga0181349_1097876 | Not Available | 1101
3300017778|Ga0181349_1111623 | Not Available | 1014
3300017784|Ga0181348_1149473 | Not Available | 876
3300017784|Ga0181348_1153181 | Not Available | 861
3300017785|Ga0181355_1159117 | Not Available | 908
3300017785|Ga0181355_1176866 | Not Available | 850
3300017788|Ga0169931_10366875 | All Organisms → Viruses → Predicted Viral | 1085
3300017788|Ga0169931_10505969 | Not Available | 850
3300017788|Ga0169931_10846225 | Not Available | 582
3300019784|Ga0181359_1054325 | All Organisms → Viruses → Predicted Viral | 1541
3300020054|Ga0181594_10311940 | Not Available | 711
3300020074|Ga0194113_10216682 | Not Available | 1516
3300020074|Ga0194113_10396738 | All Organisms → Viruses → Predicted Viral | 1015
3300020074|Ga0194113_10522002 | Not Available | 850
3300020074|Ga0194113_10766002 | Not Available | 664
3300020074|Ga0194113_10950529 | Not Available | 577
3300020084|Ga0194110_10049741 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3798
3300020084|Ga0194110_10357863 | Not Available | 1004
3300020109|Ga0194112_10666865 | Not Available | 699
3300020109|Ga0194112_11084263 | Not Available | 501
3300020179|Ga0194134_10349722 | Not Available | 564
3300020183|Ga0194115_10015645 | All Organisms → cellular organisms → Bacteria | 6366
3300020183|Ga0194115_10246059 | Not Available | 849
3300020183|Ga0194115_10394331 | Not Available | 597
3300020196|Ga0194124_10392715 | Not Available | 638
3300020198|Ga0194120_10219598 | All Organisms → Viruses → Predicted Viral | 1029
3300020200|Ga0194121_10225211 | Not Available | 996
3300020200|Ga0194121_10470541 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 615
3300020204|Ga0194116_10004475 | All Organisms → cellular organisms → Bacteria | 15986
3300020220|Ga0194119_10345836 | Not Available | 981
3300020222|Ga0194125_10729657 | Not Available | 574
3300021092|Ga0194122_10373219 | Not Available | 724
3300021376|Ga0194130_10233787 | Not Available | 1061
3300021424|Ga0194117_10347109 | Not Available | 690
3300021959|Ga0222716_10071475 | All Organisms → Viruses → Predicted Viral | 2407
3300022063|Ga0212029_1023573 | Not Available | 839
3300022176|Ga0212031_1012266 | All Organisms → Viruses → Predicted Viral | 1238
3300022176|Ga0212031_1015672 | All Organisms → Viruses → Predicted Viral | 1134
3300022176|Ga0212031_1022070 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 996
3300022176|Ga0212031_1080277 | Not Available | 556
3300022179|Ga0181353_1043227 | Not Available | 1189
3300022190|Ga0181354_1235896 | Not Available | 528
3300022198|Ga0196905_1127626 | Not Available | 664
3300024509|Ga0255175_1090267 | Not Available | 541
3300025646|Ga0208161_1044622 | All Organisms → Viruses → Predicted Viral | 1461
3300025647|Ga0208160_1053801 | All Organisms → Viruses → Predicted Viral | 1136
3300025889|Ga0208644_1145205 | All Organisms → Viruses → Predicted Viral | 1096
3300027659|Ga0208975_1049428 | Not Available | 1295
3300027808|Ga0209354_10380917 | Not Available | 551
3300029932|Ga0119933_1036799 | Not Available | 618
3300029932|Ga0119933_1043805 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Woesearchaeota → Candidatus Woesearchaeota archaeon | 551
3300031999|Ga0315274_11506746 | Not Available | 639
3300033981|Ga0334982_0093420 | Not Available | 1594
3300034072|Ga0310127_079725 | All Organisms → Viruses → Predicted Viral | 1482
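The "% of genes from short scaffolds (< 2000 bps)" figure under Quality Assessment can be cross-checked against the Length column of the table above. The following is a minimal sketch; the function name `short_scaffold_percent` is hypothetical, and the five lengths are an illustrative subset copied from the table, not the full data set.

```python
def short_scaffold_percent(lengths, cutoff=2000):
    """Percentage of scaffolds strictly shorter than `cutoff` base pairs."""
    if not lengths:
        raise ValueError("no scaffold lengths given")
    short = sum(1 for n in lengths if n < cutoff)
    return 100 * short / len(lengths)


# Illustrative subset only: five lengths copied from the scaffold table.
example_lengths = [525, 832, 1215, 2317, 15986]
example_percent = short_scaffold_percent(example_lengths)
print(f"{example_percent:.2f} %")
```

Applied to all 102 lengths in the table, six scaffolds are 2000 bp or longer (2317, 2407, 2837, 3798, 6366 and 15986), leaving 96 short scaffolds, or 94.12 %, in line with the Quality Assessment value.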

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 23.53%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 21.57%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 19.61%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 17.65%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 4.90%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.96%
Drinking Water Treatment Plant | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Drinking Water Treatment Plant | 1.96%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 1.96%
Freshwater Lentic | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic | 0.98%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.98%
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 0.98%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 0.98%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 0.98%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 0.98%
Fracking Water | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Fracking Water | 0.98%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300003393 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.DD | Environmental
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental
3300007216 | Combined Assembly of cyanobacterial bloom in Punggol water reservoir, Singapore (Diel cycle-Surface and Bottom layer) 16 sequencing projects | Environmental
3300007344 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_4 | Environmental
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300007960 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG | Environmental
3300009160 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_MF_MetaG | Environmental
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental
3300013127 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm | Environmental
3300013128 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 69cm | Environmental
3300013130 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment s2_kivu2a2 | Environmental
3300013131 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10m | Environmental
3300013132 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_9.5m | Environmental
3300013133 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment s1_kivu2a2 | Environmental
3300013372 | Freshwater microbial communities from Lake Erie, Ontario, Canada. Combined Assembly of 10 SPs | Environmental
3300014720 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_35m | Environmental
3300017701 | Freshwater viral communities from Lake Michigan, USA - Fa13.ND.MM110.S.N | Environmental
3300017723 | Freshwater viral communities from Lake Michigan, USA - Su13.ND.MM110.S.N | Environmental
3300017736 | Freshwater viral communities from Lake Michigan, USA - Fa13.ND.MM110.D.N | Environmental
3300017774 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.D | Environmental
3300017777 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.D.N | Environmental
3300017778 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.S.D | Environmental
3300017784 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.D.N | Environmental
3300017785 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.D.N | Environmental
3300017788 | Freshwater microbial communities from Lake Kivu, Western Province, Rwanda to study Microbial Dark Matter (Phase II) - Kivu_15m_20L | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020054 | Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 071413BT metaG (spades assembly) | Environmental
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental
3300020084 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015032 Kigoma Deep Cast 1200m | Environmental
3300020109 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015016 Mahale Deep Cast 400m | Environmental
3300020179 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015056 Kigoma Offshore 0m | Environmental
3300020183 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface | Environmental
3300020196 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015031 Kigoma Deep Cast 0m | Environmental
3300020198 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015019 Mahale Deep Cast 65m | Environmental
3300020200 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015020 Mahale Deep Cast 50m | Environmental
3300020204 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015008 Mahale S9 surface | Environmental
3300020220 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015018 Mahale Deep Cast 100m | Environmental
3300020222 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015034 Kigoma Deep Cast 250m | Environmental
3300021092 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015021 Mahale Deep Cast 10m | Environmental
3300021376 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015050 Kigoma 12 surface | Environmental
3300021424 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015009 Mahale N1 surface | Environmental
3300021959 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_13D | Environmental
3300022063 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v2) | Environmental
3300022176 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v2) | Environmental
3300022179 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.D.N | Environmental
3300022190 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.N | Environmental
3300022198 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v3) | Environmental
3300024509 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Atl_RepC_8d | Environmental
3300025646 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (SPAdes) | Environmental
3300025647 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (SPAdes) | Environmental
3300025889 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 (SPAdes) | Environmental
3300027659 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER78MSRF (SPAdes) | Environmental
3300027808 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.DD (SPAdes) | Environmental
3300029932 | Freshwater microbial communities from drinking water treatment plant - The University of Hong Kong - Raw_water_201207A | Environmental
3300031999 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_20 | Environmental
3300033981 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME24Aug2014-rr0011 | Environmental
3300034072 | Fracking water microbial communities from deep shales in Oklahoma, United States - MC-3-A | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25909J50240_11173042 | 3300003393 | Freshwater Lake | MAHFAKIENGIVREVIVVSNDNAPTEAAGQAFIASIGLAGVWIQTSYNNNNVEG
Ga0079957_12462032 | 3300005805 | Lake | MAHFAKVENGVVQQVIVVSNDDAPTEVAGQEFLASLGLN
Ga0070749_104428891 | 3300006802 | Aqueous | MAHFAKVENGVVREVIVVGNDYAPTEAAGKQFIASIGLAGEWVQTSYNNNPVEGQDRGKF
Ga0103961_12821263 | 3300007216 | Freshwater Lake | MAHFAKIENGIVREVIVIGNDDAPTEAAGKQFIANIGLTGEWVQTSY
Ga0070745_11322611 | 3300007344 | Aqueous | VAHFAEIKNGIVQQVIVVSNGDAPTEAAGKAFIASI
Ga0070745_12819952 | 3300007344 | Aqueous | MAHFAKIENGIVREVLVIGNDDAPTEAAGKAFIASIGLAGEWVQTSYNNNPVEG
Ga0099848_10748513 | 3300007541 | Aqueous | MAHFAKVQNGIVQQVIVVSNDDAPTEAAGQTFLASLGLAGEWVQTSYN
Ga0099848_10843071 | 3300007541 | Aqueous | MAHFAKVDNGIVSQVIVVSNDDAPTEAAGKAFIASIGLAGEWVQTSYNSNPVEGQDR
Ga0099848_11788081 | 3300007541 | Aqueous | MAHFAKIENGIVREVIVIGNDDAPTEAAGKQFIANIGLAGEWVQTSYN
Ga0099848_13157092 | 3300007541 | Aqueous | MAHFAKIENGIVREVLVIGNNDAPTEAAGKAFIASIGLAGEWVQTSYN
Ga0099846_10906771 | 3300007542 | Aqueous | MAHFAKIENGIVREVIVIGNDDAPTEAAGKAFIASIGLAGEWVQTSYNNNPVEGQDRGKYAGIG
Ga0099846_10924241 | 3300007542 | Aqueous | MAHFAKVVDGTVREVLVIGNDDAPTEAAGKAFIASIGLDGEWVQTSYNSN
Ga0099846_12582931 | 3300007542 | Aqueous | MAHFAQISNGIVSQVIVVSNDDAPTEAAGKAFIASIGLAGEWV
Ga0099850_10313881 | 3300007960 | Aqueous | MAHFAKVQNGIVQQVIVVSNDDASTEAAGQTFLASLGLAGEWVRTSYNNN
Ga0099850_11353731 | 3300007960 | Aqueous | MAHFAKVVDGTVREVLVIGNDDAPTEAAGKAFIASIGLDGEWVQTSYN
Ga0099850_12374651 | 3300007960 | Aqueous | MAHFAQISNGIVSQVIVIGNDDAPTEAAGKAFIASIGLAGE
Ga0114981_104353541 | 3300009160 | Freshwater Lake | MAHFAKVNDENIVEQVIVVSNDDAPNEKAGKEFIASLGI
Ga0129336_103196121 | 3300010370 | Freshwater To Marine Saline Gradient | MAHFARIQDGKVAQVIVVSNDDAPTEIAGKEFIASLGLTGEWVQTSYNNNPIEGASRGK
Ga0129336_106686192 | 3300010370 | Freshwater To Marine Saline Gradient | MAHFAKVENGIVREVIVVGNADAPTEAAGQAFIAACGIAGEWVQTSYNSN
(restricted) Ga0172367_101935461 | 3300013126 | Freshwater | MAHFAKIENGVVQTVIVVSNDDAPTEAAGQAFIASVGLDGEWVQT
(restricted) Ga0172367_102422532 | 3300013126 | Freshwater | VAHFAHIVDGIVQTVIVVSNDDAPTEAAGQAFIASIGLSGEWVQTSYNNNPVEG
(restricted) Ga0172367_102540142 | 3300013126 | Freshwater | MAHFAKIENGVVREVIVVGNDDAPTEAAGQAFIASIGLDGKWVQTSYHN
(restricted) Ga0172367_105649032 | 3300013126 | Freshwater | MAHFAKIENGVVREVIVVGNDDAPTEAAGQAFIASVGLDGEWVQTSYNNNPV
(restricted) Ga0172365_107354931 | 3300013127 | Sediment | MAHFARVENGSVTQVIVVSNDDAPTEAAGQAFIASIGLDGEWVQT
(restricted) Ga0172366_107582031 | 3300013128 | Sediment | MAHFAKIENGVVREIILVGNDDAPTEAAGQAFIASIGLDGEWVQTSYNNNPVE
(restricted) Ga0172363_109683511 | 3300013130 | Sediment | MAHFAKIENGIVREVIVIGNDDAPTEAAGQAFIVFIGLDGEWVQTSYHNNPV
(restricted) Ga0172373_102848552 | 3300013131 | Freshwater | MAHFAKIENGVVREVIVVGNDDAPTEAAGQAFIASIGLDGEWVQTSYN
(restricted) Ga0172373_104122021 | 3300013131 | Freshwater | MAHFAHIVDGIVQRVIVVSNDVAPTEAAGQAFIASIGLDGEWVQTSYNN
(restricted) Ga0172373_104733402 | 3300013131 | Freshwater | MAHFAKIENGIVREVIVVGNDDAPTEAAGQAFIASIGLDGEWVMTSYNN
(restricted) Ga0172373_107486142 | 3300013131 | Freshwater | MAHFAHIVDGIVQTVIVVSNDDAPDEATGQAFIESIGLNGEWVQTSY
(restricted) Ga0172373_109231362 | 3300013131 | Freshwater | MAHFAKIENGVVREVIVIGNDDAPTEAAGQAFIASIGLDGEWVQTSYNNNPIEGASRGKY
(restricted) Ga0172372_103207391 | 3300013132 | Freshwater | MAHFAKIENGVVREVIVVGNDNAPTEAAGQTFIASIGLDGEWVQTSYNNNPIEGASRGKYAGI
(restricted) Ga0172372_103608751 | 3300013132 | Freshwater | MAHFAKIDNGIVQQVIVVSNDDAPTEAAGQAFIASTGLT
(restricted) Ga0172372_108301802 | 3300013132 | Freshwater | MAHFARIDNGVVREVIVVGNDNAPTEAAGQAFIASIGLDGEWVQTSY
(restricted) Ga0172372_109234281 | 3300013132 | Freshwater | MAHFAKIENSIVQQVIVVSNDDAPDEASGQEFLASIGISGEWVQTSYNNNPIEG
(restricted) Ga0172362_109220582 | 3300013133 | Sediment | MAHFAKIENGVVREVIVIGNDNAPTEAAGQAFITSIGLDGEWVQTSYNSNPVGEP
Ga0177922_104823161 | 3300013372 | Freshwater | MAHFAKINNNIVVEVIVVSNDDAPTEEVGQAFIAS
Ga0177922_111027394 | 3300013372 | Freshwater | MAHFAKIENNVVREVIVVGNDDAPTEAAGQAFIASIGLLGEWV
(restricted) Ga0172376_107906291 | 3300014720 | Freshwater | MAHFARIDNGVVREVIVVGNDNAPTEAAGQAFIASIGLDGEWVQTSYN
Ga0181364_10177022 | 3300017701 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPTEAAGQAFIASIGLAGDWIQTSYNNNP
Ga0181362_10272212 | 3300017723 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPDEATGQAFIASIGLTGDWV
Ga0181362_10331411 | 3300017723 | Freshwater Lake | MAHFAKVDNGMVSQVIVVSNTDAPDEATGQAFIASLGLAGTWVQTSYNNNPIEGASRGKY
Ga0181362_10497921 | 3300017723 | Freshwater Lake | MAHFAKIENGIVREVIVVSNDNAPTEAAGQAFIASIGLAGVWIQTSYNNNNV
Ga0181365_11217911 | 3300017736 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPDEATGQAFIASIGLTGDWVQTSYNNNPVEGASRGKY
Ga0181365_11649832 | 3300017736 | Freshwater Lake | VAHFAKVDNGIVQTVIVVSNDDAPDEATGQAFIASIGLEGDWVQ
Ga0181358_12465561 | 3300017774 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPDEATGQQFIASIGLTGNWIQTSYNNNPIE
Ga0181357_11634481 | 3300017777 | Freshwater Lake | MAHFAKIENGLVQQVIVVSNDDAPTETAGQAFIESLGLVGEWVQTSYNNN
Ga0181349_10187251 | 3300017778 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDDAPTEAAGKAFIASIGLAGEWVQTSYNANFRSKYA
Ga0181349_10978762 | 3300017778 | Freshwater Lake | MAHFAKIENTKVVNVIVVANDYASNETEGQAFIASIGLDGVWVQTSYNN
Ga0181349_11116231 | 3300017778 | Freshwater Lake | VAHFAKVQNGIVQTVIVVSNDDAPDEATGQAFIASLGLAGEWVQTSYNNNPVE
Ga0181348_11494731 | 3300017784 | Freshwater Lake | MAHFAKIENGLVQQVIVVSNDDAPTETAGQAFIESLGLVGEWVQTSYNNNPIEG
Ga0181348_11531812 | 3300017784 | Freshwater Lake | MAHFAKIENGLVQQVIVVSNDDAPTETAGQAFIESLG
Ga0181355_11591172 | 3300017785 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPDEATGQAFIASIGLT
Ga0181355_11768662 | 3300017785 | Freshwater Lake | MAHFAKIENGLVRQVIVVSNDDAPTETDGQEFIALLGLTGEWVQTSYNN
Ga0169931_103668751 | 3300017788 | Freshwater | MAHFAKIENGIVREVIVIGNDDCAGGDFPESEAAGQAFIASIGLSGE
Ga0169931_105059691 | 3300017788 | Freshwater | MAHFAKIENGIVREVIVVSNDNAPTEAAGQAFIASLGLDGEWVQT
Ga0169931_108462251 | 3300017788 | Freshwater | MAHFAQIIDGIVAQVIVVHNNDAPTEADGKAFIASLGLAGEWVQTSYNNNPIEGASRGK
Ga0181359_10543251 | 3300019784 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPTEAAGQAFIASIGLAGDWIQTSYNNNPIEGASRG
Ga0181594_103119402 | 3300020054 | Salt Marsh | MAHFALIDDNNMVQEVIVVSDTDAPTEEAGQAFIASIGLTGTWMQT
Ga0194113_102166821 | 3300020074 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIG
Ga0194113_103967382 | 3300020074 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLSGEWV
Ga0194113_105220022 | 3300020074 | Freshwater Lake | VAHFARVENGIVQTVIVVGNDDAPTEAAGKAFIASLGLDGEWVQTSYNSNPV
Ga0194113_107660022 | 3300020074 | Freshwater Lake | MAHFARIENGIVREVNVVHNNDAPTEADGKAFLASLGLAGEWVQTSYNNNFR
Ga0194113_109505291 | 3300020074 | Freshwater Lake | VAHFARVENGIVREVIVVGNDNAPTEAAGKAFIASIGLSGE
Ga0194110_100497415 | 3300020084 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFLASIGLSGEWVQTSYNSNPIEGQDR
Ga0194110_103578632 | 3300020084 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLDG
Ga0194112_106668652 | 3300020109 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLDGEWVQTSYNS
Ga0194112_110842632 | 3300020109 | Freshwater Lake | MAHFAQIQNGIVQRVIVVSNDDAPNEAAGQAFLASIGLVGEWIQTSYNNNPIEGASRGKY
Ga0194134_103497222 | 3300020179 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLDGEWVQTSYNNNPVEGASRGKY
Ga0194115_100156451 | 3300020183 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGL
Ga0194115_102460591 | 3300020183 | Freshwater Lake | MAHFAKIENGVVREVIVIGNDDAPTEAAGKAFIASIG
Ga0194115_103943312 | 3300020183 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGQAFIASI
Ga0194124_103927151 | 3300020196 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLDGEWVQTSYK
Ga0194120_102195982 | 3300020198 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLSGEWVQTSYNS
Ga0194121_102252112 | 3300020200 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDDAPTEAAGKAFIASIGLDGEWVMTS
Ga0194121_104705412 | 3300020200 | Freshwater Lake | MAHFAKIENGVVREVIVVGNEWCAGGDFPESEAAGQAFIASIGLSGEWRQTS
Ga0194116_100044751 | 3300020204 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLSGEWVQTS
Ga0194119_103458361 | 3300020220 | Freshwater Lake | MAHFAKIENGVVREVIVVGNDNAPTEAAGKAFIASIGLDGEW
Ga0194125_107296572 | 3300020222 | Freshwater Lake | MAHFAKVENGIVQQVIVVSNDDAPNEATGQAFLASIGLVGEWVHTCY
Ga0194122_103732192 | 3300021092 | Freshwater Lake | MAHFAKIENGVVRDVIVVSNDDAPTEAAGKAFIASIGLDGEWVQTSYNSNP
Ga0194130_102337871 | 3300021376 | Freshwater Lake | MAHFAKIENGIVREVIVVGNDDAPTEAAGQTFIASIGLDGEWVQTSYNNNPVEGASRG
Ga0194117_103471092 | 3300021424 | Freshwater Lake | VAHFARVENGIVREVIVVGNDNAPTEAAGKAFIASIGLSGEWVQTSYNSNPIEGQDRGKYAGI
Ga0222716_100714756 | 3300021959 | Estuarine Water | MAHFAKVENGVVREVLVIGNDDAPTEAAGKQFIASIGLA
Ga0212029_10235732 | 3300022063 | Aqueous | MAHFAKIENGIVREVIVIGNDDAPTEAAGKAFIASIGLAGEWVQTSYNSNPIEG
Ga0212031_10122663 | 3300022176 | Aqueous | MAHFAKVDNGVVSQVIVVSNDDAPTEAAGKAFIASIGLDGEWVQTSF
Ga0212031_10156721 | 3300022176 | Aqueous | MAHFAKVENGVVREVIVVGNDYAPTEAAGKQFIASIGLAGEWVQTSYNNNPVE
Ga0212031_10220702 | 3300022176 | Aqueous | MAHFAKIENGIVQQVIVVSNDDAPTEAAGKQFIANIGLAGEW
Ga0212031_10802772 | 3300022176 | Aqueous | MAHFAKVENNIVQQVIVVSNDDAPTEVAGQTFLASLGLTGEWVQTF
Ga0181353_10432273 | 3300022179 | Freshwater Lake | MAHFAKVENGVVQQVIVVSNDDAPTEADGQAFIASLGLDGEWVQTSYNNNPI
Ga0181354_12358962 | 3300022190 | Freshwater Lake | MAHFAKVENGVVREVIVIGNDDAPDEATGQAFIASIGLTG
Ga0196905_11276262 | 3300022198 | Aqueous | MAHFAKIENGIVREVIVIGNNDAPTEAAGKAFIASIGLAGEWVQTSYNNNPVEGQDRGK
Ga0255175_10902671 | 3300024509 | Freshwater | MAHFAKVENGIVREVIVVGNADAPTEAAGQAFIAACGIAGEWVQ
Ga0208161_10446221 | 3300025646 | Aqueous | MAHFAKIENGIVREVIVIGNDDAPTEAAGKQFIANIGLAGEWVQT
Ga0208160_10538011 | 3300025647 | Aqueous | MAHFAKIENGIVREVIVISNDDALTEAAGKQFIANIGLAGEWVQTSYNNNPVEGQD
Ga0208644_11452052 | 3300025889 | Aqueous | MAHFAKIENGIVREVIVISNDDAPTEAAGKQFIANIGLAGEWVQTSYNSNPIEGAD
Ga0208975_10494282 | 3300027659 | Freshwater Lentic | VAHFAKVDNGIVSQVIVVSNDDAPDEATGQQFIASIGLAGDWIQTS
Ga0209354_103809171 | 3300027808 | Freshwater Lake | MAHFAKIDNGTVTQVIAVSNENAPTEEAGQAFIASLGIAGEWKQTSYNTYRKY
Ga0119933_10367991 | 3300029932 | Drinking Water Treatment Plant | MAHFAKVDNGIVSQVIVVSNDDAPTEAAGKAFIASIGLAGEWVQ
Ga0119933_10438052 | 3300029932 | Drinking Water Treatment Plant | MAHFAQVENGVVRQVIVVSNDDAPTEAAGKAFIASIGLSGEWVQT
Ga0315274_115067462 | 3300031999 | Sediment | MAHFAKVENGVVREIIVIGNDDAPTEEAGQAFIASIGLDGD
Ga0334982_0093420_1451_1594 | 3300033981 | Freshwater | MAHFAQVADGKVQQVIVVSNEDAPDEATGKAFIASIGLAGDWVQTSYN
Ga0310127_079725_1_168 | 3300034072 | Fracking Water | MAHFAKVENGIVREVIVVGNADAPTEAAGKAFIAACGIAGEWVQTSYNSNFRGKFA
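For downstream tools, the family members listed above can be written out in FASTA format. The sketch below is one possible way to do so; the function name `to_fasta` and the `sample=`/`habitat=` header layout are assumptions made for illustration and are not the format of the files offered for download on this page. Only two unrestricted records from the table are used as input.

```python
def to_fasta(records, width=60):
    """Format (protein_id, taxon_id, habitat, sequence) tuples as FASTA text."""
    lines = []
    for protein_id, taxon_id, habitat, sequence in records:
        # Header layout is an illustrative assumption, not an NMPFamsDB convention.
        lines.append(f">{protein_id} sample={taxon_id} habitat={habitat}")
        # Wrap the residues at `width` characters per line.
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Two unrestricted family members copied from the table above.
example_records = [
    ("Ga0079957_12462032", "3300005805", "Lake",
     "MAHFAKVENGVVQQVIVVSNDDAPTEVAGQEFLASLGLN"),
    ("Ga0099848_10748513", "3300007541", "Aqueous",
     "MAHFAKVQNGIVQQVIVVSNDDAPTEAAGQTFLASLGLAGEWVQTSYN"),
]
fasta_text = to_fasta(example_records)
print(fasta_text, end="")
```

Rows marked (restricted) fall under the JGI data usage policy noted above, so they are left out of the example.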




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice