NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103127


Overview

Basic Information
Family ID F103127
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 55 residues
Representative Sequence MNTYRFECVVWVRGETPEEALKELHDEVDYHFSQDNNLIALESDEGKLCEENQQ
Number of Associated Samples 67
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 93.07 %
% of genes near scaffold ends (potentially truncated) 10.89 %
% of genes from short scaffolds (< 2000 bps) 82.18 %
Associated GOLD sequencing projects 58
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49
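
The percentages above can be read back as approximate gene counts, since the family has 101 sequences. The following is a minimal sketch of that arithmetic. The helper name `percent_to_count` and the rounding to the nearest whole gene are assumptions made for illustration; the database reports only the percentages.

```python
# Convert the reported quality percentages into approximate gene counts.
# Assumption: each percentage is a fraction of the 101 family sequences,
# rounded to two decimals by the database.
FAMILY_SIZE = 101  # "Number of Sequences" from Basic Information

reported_percentages = {
    "valid RBS motifs": 93.07,
    "near scaffold ends": 10.89,
    "short scaffolds (< 2000 bps)": 82.18,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Round a percentage of `total` to the nearest whole gene."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}
for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE} genes")
```

Under these assumptions the three rows correspond to roughly 94, 11, and 83 genes respectively.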


Most Common Taxonomy
Group Unclassified (59.406 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous
(35.644 % of family members)
Environment Ontology (ENVO) Unclassified
(46.535 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(47.525 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 20.73%, β-sheet: 9.76%, Coil/Unstructured: 69.51%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
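
The low-confidence rule stated above is a simple threshold on the pTM-score. As a minimal sketch, it could be expressed as follows. The function name `is_low_confidence` is hypothetical and is not part of any NMPFamsDB interface; only the 0.7 cutoff and the 0.49 score come from this page.

```python
# Threshold check for the "Low Quality Model" note.
# Assumption: a model counts as low confidence strictly below the cutoff.
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the pTM-score falls below the threshold."""
    return ptm_score < threshold


family_ptm = 0.49  # pTM-score reported for family F103127
print(is_low_confidence(family_ptm))
```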



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF10765 | Phage_P22_NinX | 13.86
PF13443 | HTH_26 | 1.98
PF13362 | Toprim_3 | 0.99
PF13662 | Toprim_4 | 0.99
PF08484 | Methyltransf_14 | 0.99
PF13519 | VWA_2 | 0.99
PF04851 | ResIII | 0.99
PF03237 | Terminase_6N | 0.99
PF00271 | Helicase_C | 0.99
PF02767 | DNA_pol3_beta_2 | 0.99
PF01726 | LexA_DNA_bind | 0.99
PF01726LexA_DNA_bind 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0592 | DNA polymerase III sliding clamp (beta) subunit, PCNA homolog | Replication, recombination and repair [L] | 0.99



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 59.41 %
All Organisms | root | All Organisms | 40.59 %


Associated Scaffolds


Scaffold | Taxonomy | Length
2022920000|QLn_FPQQ8XI07LZGJL | Not Available | 501
3300001589|JGI24005J15628_10132748 | Not Available | 783
3300001850|RCM37_1198226 | Not Available | 645
3300004240|Ga0007787_10241316 | All Organisms → cellular organisms → Bacteria | 886
3300005581|Ga0049081_10227277 | Not Available | 661
3300005805|Ga0079957_1006309 | Not Available | 9511
3300005805|Ga0079957_1239471 | Not Available | 849
3300006030|Ga0075470_10003865 | Not Available | 4637
3300006641|Ga0075471_10122267 | All Organisms → Viruses → Predicted Viral | 1388
3300006734|Ga0098073_1044221 | Not Available | 603
3300006802|Ga0070749_10065872 | All Organisms → Viruses → Predicted Viral | 2180
3300006802|Ga0070749_10091858 | Not Available | 1802
3300006802|Ga0070749_10140534 | All Organisms → Viruses → Predicted Viral | 1409
3300006802|Ga0070749_10335659 | Not Available | 842
3300006802|Ga0070749_10439725 | Not Available | 716
3300007212|Ga0103958_1098540 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2165
3300007346|Ga0070753_1057957 | All Organisms → Viruses → Predicted Viral | 1574
3300007363|Ga0075458_10005704 | All Organisms → Viruses → Predicted Viral | 4023
3300007538|Ga0099851_1035402 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1990
3300007538|Ga0099851_1046857 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1702
3300007539|Ga0099849_1092543 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1211
3300007539|Ga0099849_1106676 | All Organisms → Viruses → Predicted Viral | 1112
3300007540|Ga0099847_1184589 | Not Available | 612
3300007541|Ga0099848_1178038 | Not Available | 773
3300007541|Ga0099848_1183132 | Not Available | 759
3300007541|Ga0099848_1289182 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 565
3300007541|Ga0099848_1297795 | Not Available | 554
3300007542|Ga0099846_1049358 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1597
3300007542|Ga0099846_1139854 | Not Available | 875
3300007609|Ga0102945_1001863 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Polynucleobacter → unclassified Polynucleobacter → Polynucleobacter sp. UK-Gri1-W3 | 5960
3300007640|Ga0070751_1283309 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 621
3300007960|Ga0099850_1096755 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1222
3300007960|Ga0099850_1197792 | Not Available | 792
3300007960|Ga0099850_1210495 | Not Available | 762
3300009151|Ga0114962_10209698 | All Organisms → Viruses → Predicted Viral | 1133
3300009165|Ga0105102_10104685 | All Organisms → Viruses → Predicted Viral | 1333
3300009529|Ga0114919_10342683 | All Organisms → Viruses → Predicted Viral | 1044
3300010297|Ga0129345_1106269 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1037
3300010299|Ga0129342_1162806 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 807
3300010299|Ga0129342_1247158 | Not Available | 622
3300010299|Ga0129342_1251266 | Not Available | 616
3300010354|Ga0129333_10213155 | All Organisms → Viruses → Predicted Viral | 1753
3300010354|Ga0129333_10269564 | Not Available | 1530
3300010370|Ga0129336_10642404 | Not Available | 564
(restricted) 3300013126|Ga0172367_10357215 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Polynucleobacter → unclassified Polynucleobacter → Polynucleobacter sp. JS-Safj-400b-B2 | 841
(restricted) 3300013126|Ga0172367_10480210 | Not Available | 686
(restricted) 3300013127|Ga0172365_10014851 | All Organisms → Viruses | 5629
(restricted) 3300013128|Ga0172366_10649044 | Not Available | 622
(restricted) 3300013130|Ga0172363_10022545 | All Organisms → Viruses | 4821
(restricted) 3300013131|Ga0172373_10084640 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2475
(restricted) 3300013132|Ga0172372_10131466 | Not Available | 2026
(restricted) 3300013132|Ga0172372_10693763 | Not Available | 644
(restricted) 3300013137|Ga0172375_10890027 | Not Available | 539
(restricted) 3300014720|Ga0172376_10736831 | Not Available | 530
3300015050|Ga0181338_1053373 | Not Available | 587
3300017747|Ga0181352_1079674 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 917
3300017747|Ga0181352_1131682 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → Porticoccaceae → unclassified Porticoccaceae → Porticoccaceae bacterium | 670
3300017747|Ga0181352_1156800 | Not Available | 600
3300017754|Ga0181344_1027959 | All Organisms → Viruses → Predicted Viral | 1729
3300017754|Ga0181344_1098284 | Not Available | 851
3300017754|Ga0181344_1107973 | Not Available | 806
3300017754|Ga0181344_1135714 | Not Available | 705
3300017991|Ga0180434_10018266 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 7007
3300018080|Ga0180433_10155506 | Not Available | 1901
3300019098|Ga0188859_1011944 | Not Available | 514
3300019784|Ga0181359_1000009 | Not Available | 42011
3300019784|Ga0181359_1003744 | All Organisms → Viruses → Predicted Viral | 4722
3300020159|Ga0211734_10774848 | Not Available | 12589
3300020159|Ga0211734_11126293 | Not Available | 763
3300021962|Ga0222713_10444463 | Not Available | 788
3300022179|Ga0181353_1084701 | Not Available | 796
3300022179|Ga0181353_1119335 | Not Available | 632
3300022179|Ga0181353_1123513 | Not Available | 617
3300022198|Ga0196905_1122245 | Not Available | 682
3300022198|Ga0196905_1168071 | Not Available | 559
3300022200|Ga0196901_1105108 | All Organisms → Viruses | 981
3300022200|Ga0196901_1144478 | Not Available | 797
3300022200|Ga0196901_1273828 | Not Available | 516
3300022407|Ga0181351_1075760 | All Organisms → Viruses → Predicted Viral | 1352
3300022407|Ga0181351_1087487 | All Organisms → Viruses → Predicted Viral | 1229
3300022407|Ga0181351_1179276 | Not Available | 731
3300023179|Ga0214923_10375776 | Not Available | 739
3300025138|Ga0209634_1172427 | Not Available | 858
3300025445|Ga0208424_1001811 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2389
3300025655|Ga0208795_1175027 | Not Available | 519
3300025674|Ga0208162_1177374 | Not Available | 560
3300025687|Ga0208019_1175413 | Not Available | 582
3300025687|Ga0208019_1185170 | Not Available | 557
3300025889|Ga0208644_1049597 | All Organisms → Viruses → Predicted Viral | 2339
3300025889|Ga0208644_1281191 | Not Available | 671
3300026097|Ga0209953_1007915 | Not Available | 2305
3300027302|Ga0255096_1025939 | All Organisms → Viruses → Predicted Viral | 1292
3300027693|Ga0209704_1122032 | Not Available | 748
3300027708|Ga0209188_1005050 | Not Available | 8579
3300031539|Ga0307380_11431005 | Not Available | 519
3300031578|Ga0307376_10213797 | All Organisms → Viruses → Predicted Viral | 1312
3300031951|Ga0315904_11095028 | Not Available | 622
3300033488|Ga0316621_11144732 | Not Available | 586
3300034101|Ga0335027_0201389 | All Organisms → Viruses → Predicted Viral | 1414
3300034104|Ga0335031_0640260 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 621
3300034118|Ga0335053_0547532 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 674

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 35.64%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 17.82%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 10.89%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 6.93%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 2.97%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 2.97%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.98%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 1.98%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.98%
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 1.98%
Pond Water | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Pond Water | 1.98%
Hypersaline Lake Sediment | Environmental → Aquatic → Non-Marine Saline And Alkaline → Hypersaline → Sediment → Hypersaline Lake Sediment | 1.98%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 1.98%
Freshwater Lentic | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic | 0.99%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.99%
Marine Plankton | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Marine Plankton | 0.99%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 0.99%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 0.99%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 0.99%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 0.99%
Saline Water | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Saline Water | 0.99%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.99%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2022920000 | Saline water microbial communities from Qinghai Lake, Tibetan Plateau - High mountain lake (unassembled) | Environmental
3300001589 | Marine viral communities from the Pacific Ocean - LP-40 | Environmental
3300001850 | Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM37, ROCA_DNA234_0.2um_Ob_C_2a | Environmental
3300004240 | Freshwater lake microbial communities from Lake Michigan, USA - Fa13.BD.MLB.SN | Environmental
3300005581 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER78MSRF | Environmental
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental
3300006030 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_<0.8_DNA | Environmental
3300006641 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_>0.8_DNA | Environmental
3300006734 | Marine viral communities from the Gulf of Mexico - 31_GoM_OMZ_CsCl metaG | Environmental
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental
3300007212 | Combined Assembly of cyanobacterial bloom in Punggol water reservoir, Singapore (Diel cycle-Bottom layer) 7 sequencing projects | Environmental
3300007346 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 | Environmental
3300007363 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA | Environmental
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental
3300007539 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG | Environmental
3300007540 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG | Environmental
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300007609 | Salt pond water microbial communities from South San Francisco under conditions of wetland restoration - Salt Pond MetaG R2_restored_H2O_MG | Environmental
3300007640 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_28 | Environmental
3300007960 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG | Environmental
3300009151 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_MF_MetaG | Environmental
3300009165 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015 | Environmental
3300009529 | Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG | Environmental
3300010297 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_20_0.8_DNA | Environmental
3300010299 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_15_0.2_DNA | Environmental
3300010354 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA | Environmental
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental
3300013127 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm | Environmental
3300013128 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 69cm | Environmental
3300013130 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment s2_kivu2a2 | Environmental
3300013131 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10m | Environmental
3300013132 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_9.5m | Environmental
3300013137 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_11.1m | Environmental
3300014720 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_35m | Environmental
3300015050 | Freshwater viral communities from Lake Michigan, USA - Sp13.VD.MM110.S.D | Environmental
3300017747 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.S.N | Environmental
3300017754 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.D.D | Environmental
3300017991 | Hypersaline lake sediment archaeal communities from the Salton Sea, California, USA - SS_1_D_2 metaG | Environmental
3300018080 | Hypersaline lake sediment archaeal communities from the Salton Sea, California, USA - SS_1_D_1 metaG | Environmental
3300019098 | Metatranscriptome of marine microbial communities from Baltic Sea - GS684_0p1 | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020159 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_108 megahit1 | Environmental
3300021962 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_649D | Environmental
3300022179 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.D.N | Environmental
3300022198 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v3) | Environmental
3300022200 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3) | Environmental
3300022407 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM15.S.D | Environmental
3300023179 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1510 | Environmental
3300025138 | Marine viral communities from the Pacific Ocean - LP-40 (SPAdes) | Environmental
3300025445 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_>0.8_DNA (SPAdes) | Environmental
3300025655 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG (SPAdes) | Environmental
3300025674 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG (SPAdes) | Environmental
3300025687 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG (SPAdes) | Environmental
3300025889 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 (SPAdes) | Environmental
3300026097 | Salt pond water microbial communities from South San Francisco under conditions of wetland restoration - Salt Pond MetaG R2_restored_H2O_MG (SPAdes) | Environmental
3300027302 | Freshwater microbial communities from Columbia River, Oregon, United States - Colum_Atlam_RepA_8d | Environmental
3300027693 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015 (SPAdes) | Environmental
3300027708 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130625_EF_MetaG (SPAdes) | Environmental
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental
3300031578 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-2 | Environmental
3300031951 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA120 | Environmental
3300033488 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_OW2_C1_D1_C | Environmental
3300034101 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME19Sep2005-rr0107 | Environmental
3300034104 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME02Aug2005-rr0120 | Environmental
3300034118 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME05Aug2017-rr0165 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
QL_na_7116260 | 2022920000 | Saline Water | MNTYKFNCVVWVQGETQADALKELHDEVLYHFSMDNNLIALESDDGELVEESKQ
JGI24005J15628_101327482 | 3300001589 | Marine | MNFYKFNCNVWVRGETAADALNELHEEVDYLFKQDNNLIALESDKGVKVEEDA*
RCM37_11982263 | 3300001850 | Marine Plankton | ECQVWVRGESAEAALKELHDEIEYHFSQDNNLIALESDGGTLVEEEQS*
Ga0007787_102413162 | 3300004240 | Freshwater Lake | MNTYRFECVVWVRGESAEQAFKELHDEVDYHFSQDNNLVALESDAGTLEETEVEE*
Ga0049081_102272773 | 3300005581 | Freshwater Lentic | MKVYKFQCTVWVRGESAEAACNELHDEVDYWFSQDNNLVALESDQGTEEVEE*
Ga0079957_10063095 | 3300005805 | Lake | MNLYRFECTVWVRGNSLEEAQKELHDEVQYHFGLDNNLVSLESNEGELAEDE*
Ga0079957_12394713 | 3300005805 | Lake | MNLYKFECTVWVQGTSAADAEKHLHDEVDYHFGQDNNLIALESQPPQLVDQVLSLRP*
Ga0075470_100038659 | 3300006030 | Aqueous | MNLYKFTCSVWVRGESLEAALQELHEEVDYHFGLDNNLIALESDEGVLAEEGESK*
Ga0075471_101222671 | 3300006641 | Aqueous | NLYKFTCSVWVRGESLEAALQELHEEVDYHFGLDNNLIALESDEGVLAEEGESK*
Ga0098073_10442212 | 3300006734 | Marine | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESDEGKLVEGEC*
Ga0070749_100658723 | 3300006802 | Aqueous | MKMKTYEFKCTVWVRGETREDALKELQDEVEYHFGLDNNLMALMSDEGKLVEEEAHVPSA
Ga0070749_100918584 | 3300006802 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVEYHFSLDNNLVALESDEGKLVEGEL*
Ga0070749_101405343 | 3300006802 | Aqueous | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESDEGKIVEGETA*
Ga0070749_103356592 | 3300006802 | Aqueous | MNVYRFKCTVWVAGDSAEEALKHLHEEVEYHFGQDNSLIALESDEGVIDETYGD*
Ga0070749_104397253 | 3300006802 | Aqueous | MNLYAFNCTVWVQGETKEDALKELHDEVEYNFGLDNNLVALESDEGKLVEEETA*
Ga0103958_10985408 | 3300007212 | Freshwater Lake | MNVYRFECVVWVRGETLEEAQKELHDEVKYYFSQDNNLIALESDAGKLDEEGVHDE*
Ga0070753_10579573 | 3300007346 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVEYNFGLDNNLVALESDEGKLVEEETA*
Ga0075458_100057046 | 3300007363 | Aqueous | MNLYKFECVVLVRGESAEDALKELHDEVDYHFSLDNNLIALESDAGALIEEGESERITV*
Ga0099851_10354024 | 3300007538 | Aqueous | MNTYQFECVVWVRGETLEGALKELHDEVDYLFSLDNNLIALESDKGQLCEEGST*
Ga0099851_10468576 | 3300007538 | Aqueous | MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGELCEENQQ*
Ga0099849_10925433 | 3300007539 | Aqueous | MGDLRMNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKLCEENQQ*
Ga0099849_11066765 | 3300007539 | Aqueous | MNLYKFKCVVWVRGETPEEALKHLHEEVEYHFSQDNNLIALESDEGELCEEEK*
Ga0099847_11845892 | 3300007540 | Aqueous | MNLYKFKCSVWVRGETPEEALKHLHEEVEYHFSQDNNLIALESDEGELCEENQQ*
Ga0099848_11780382 | 3300007541 | Aqueous | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLIALESDEGKLVEGEQV*
Ga0099848_11831322 | 3300007541 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVEYHFGLDNNLVALESDEGKLVEGEC*
Ga0099848_12891823 | 3300007541 | Aqueous | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESDGGKIVEGETA*
Ga0099848_12977952 | 3300007541 | Aqueous | MNVYQFKCVVWVRGETPEEALKHLHDEVDYHFSQDNNLLALESDEGELCEEEK*
Ga0099846_10493585 | 3300007542 | Aqueous | MGGVRKNTYRLGCGVWVRGEAPEEALKELHDEVKYHFSQDNNLIALESDEGELCEENQQ*
Ga0099846_11398543 | 3300007542 | Aqueous | MNLYKFKCSVWVRGETPEEALKHLHEEVEYHFSQDNNLIALESDEGELCEEEK*
Ga0102945_100186312 | 3300007609 | Pond Water | VQGESLTDALEQLHSEVQYHFGLDNNLVALESDQGELVEEAQDDTKD*
Ga0070751_12833091 | 3300007640 | Aqueous | MNVYAFNCTVWVRGETKEDALKDLHDEVEYHFGLDNNLVALESDEGKIVEGEQV*
Ga0099850_10967553 | 3300007960 | Aqueous | MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKLCEENQQ*
Ga0099850_11977923 | 3300007960 | Aqueous | MNVYQFKCVVWVRGETPEEALKELHDEVDYHFSQDNNLLALESDEGELCEEEK*
Ga0099850_12104951 | 3300007960 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVKYHFGLDNNLVALESDEGKLVEGEAA*
Ga0114962_102096984 | 3300009151 | Freshwater Lake | MKVYKFQCTVWVRGESAESASKELHKEVEYHFGLDNNLLALESDNGEFVEEDQS*
Ga0105102_101046853 | 3300009165 | Freshwater Sediment | MKKYKFNCAIWVRGEDAEQARKELHDEVDYWFALDNNLIALESDEGYEEDDDE*
Ga0114919_103426833 | 3300009529 | Deep Subsurface | MNLYKFKCSVWVRGETPEEALKHLHEEVEYHFGQDNNLIALESDEGELCEEEK*
Ga0129345_11062692 | 3300010297 | Freshwater To Marine Saline Gradient | MGDLRMNTYRFECVVWVRGETPEEALKELHDEVEYHFSQDNNLIALESDEGELCEENQQ*
Ga0129342_11628061 | 3300010299 | Freshwater To Marine Saline Gradient | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESD
Ga0129342_12471583 | 3300010299 | Freshwater To Marine Saline Gradient | MNVYAFNCTVWVRGETKEDALKELHDEVDYHFGLDNNLVALESDEGKIVEGETA*
Ga0129342_12512663 | 3300010299 | Freshwater To Marine Saline Gradient | MGDLRMNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGE
Ga0129333_102131555 | 3300010354 | Freshwater To Marine Saline Gradient | MNLYKFECVVWVRGDSAESALKELHNEVDYHFSLDNNLVALESDAGELVEEGESK*
Ga0129333_102695645 | 3300010354 | Freshwater To Marine Saline Gradient | TGDQTMNLYKFECTVWVQGTSAEEALKELHDEVDYHFGQDNNLIALESGKPELVDQTLTKRA*
Ga0129336_106424041 | 3300010370 | Freshwater To Marine Saline Gradient | MNLYKFECTVWVQGTSAEEALKELHDEVDYHFGQDNNLIALESGKPELVDQTLTKRA*
(restricted) Ga0172367_103572151 | 3300013126 | Freshwater | MNIYRFECVVWVRGETLEEALKELHDEVEYHFSQDNNLIALESGAGKLDEEGV*
(restricted) Ga0172367_104802102 | 3300013126 | Freshwater | MNVYRFECVVWVRGETPEEALKHLHDEVKYHFSQDNNLVALESDEGKLDEEESKND*
(restricted) Ga0172365_100148516 | 3300013127 | Sediment | MNVYRFECVVWVRGETLEEALKHLHDEVKYHFSQDNNLVALESDEGKLDEEESKND*
(restricted) Ga0172366_106490441 | 3300013128 | Sediment | MSNVYRFECVVWVRGETLEEALKHLHDEVKYHFSQDNNLVALESDEGKLDEEESKND*
(restricted) Ga0172363_100225454 | 3300013130 | Sediment | MNVYRFECVVWVRGETPEEALKHLHDEVKYHFNQDNNLIALESDAGKLDEEEV*
(restricted) Ga0172373_100846405 | 3300013131 | Freshwater | MNVYRFECVVWVRGETLEEALKHLHDEVKYHFNQDNNLIALESDAGKLDEEEV*
(restricted) Ga0172372_101314666 | 3300013132 | Freshwater | MSNVYRFECVVWVRGETLEEALKHLHDEVKYNFSQDNNLVALESDEGKLDEEESKND*
(restricted) Ga0172372_106937633 | 3300013132 | Freshwater | MNVYRFECVVWVRGETLEEAQKELHDEVKYHFSQDNNLIALESDAGKLDEEEV*
(restricted) Ga0172375_108900272 | 3300013137 | Freshwater | MNVYRFECVVWVRGETLEEAQKELHDEVKYHFSQDNNLIALESDAGKLDEEEVHDE*
(restricted) Ga0172376_107368312 | 3300014720 | Freshwater | MSNVYRFECVVWVRGETLEEALKHLHDEVKYHFSQDNNLVALESDEGKLDEEES
Ga0181338_10533732 | 3300015050 | Freshwater Lake | VWVRGESAESASKELHEEVKYHFGLDNNLIALESDDGEFVEGETE*
Ga0181352_10796742 | 3300017747 | Freshwater Lake | VNLYKFECVVWVQGESQEAALKELHDEADYHFAQDNNLIALQSDEGVFVEKMGEIE
Ga0181352_11316822 | 3300017747 | Freshwater Lake | MNTYRFECVVWVRGESAEQAFKELHDEVEYHFSQDNNLVALESDAGTLEETEEQS
Ga0181352_11568002 | 3300017747 | Freshwater Lake | VNLYRFNCNVWVRGESLEQALKELHEEVDYHFGLDNNLIALESDEGELVDGLYNTCNDE
Ga0181344_10279594 | 3300017754 | Freshwater Lake | MNVYKFDCVVWVHGESAEDAFQELHAEADYFFGLDNNLIALGSDEGTLVEITEQGTLQ
Ga0181344_10982843 | 3300017754 | Freshwater Lake | VNLYRFNCNVWVRGESLEQALKELHEEVDYHFGLDNNLIALESDEGELVDSENQEESK
Ga0181344_11079732 | 3300017754 | Freshwater Lake | MNLYRFNCNVWVRGESLEEALKELHNEVDYHFGLDNNLIALESDEGELVDSQETIDVEEL
Ga0181344_11357142 | 3300017754 | Freshwater Lake | MKVYKFNCNVLVRGESLEDALKELHDEVEYHFGHDNNLIALESDGGVLVENEDQGERA
Ga0180434_100182665 | 3300017991 | Hypersaline Lake Sediment | MKTFEFKCTVWVQGETREDALKELHDEVDYHFGLDNNLVALESDEGKLVEEDS
Ga0180433_101555061 | 3300018080 | Hypersaline Lake Sediment | MKTYEFKCTVWVRGETIEDALKELHDEVDYHFGLDNNLVALESDAGKLVEEDS
Ga0188859_10119441 | 3300019098 | Freshwater Lake | MNTYRFECVVWVRGETPEEALKELHDEVDYHFSQDNNLIALESDEGKLCEENQQ
Ga0181359_100000952 | 3300019784 | Freshwater Lake | MNTYRFECVVWVRGESAEQAFKELHDEVDYHFSQDNNLVALESDAGTLEETEVEE
Ga0181359_10037447 | 3300019784 | Freshwater Lake | MNLYKFRCIVWVQGASAADALKELYEEVDYHFSLDNNLIALESDSGELVHEIKGEDQ
Ga0211734_107748488 | 3300020159 | Freshwater | MNLYKFECTVWVQGTSAADAEKHLHDEVDYHFGQDNNLIALESGNAQLVDQTLTERT
Ga0211734_111262933 | 3300020159 | Freshwater | MNLYKFECTVWVQGTTAADAEKHLHDEVDYHFGQDNNLIALQSGNAQLVDQTLTERT
Ga0222713_104444632 | 3300021962 | Estuarine Water | MKVYKFQCIVWVHGESAEEALKELHEEADYFFGLDNNLIALSSDEGTLVNTEQGLLQ
Ga0181353_10847013 | 3300022179 | Freshwater Lake | MNLYRFECTVWVRGNTLEEACKELHDEVQYHFGLDNNLVSLESNEGELAEDE
Ga0181353_11193352 | 3300022179 | Freshwater Lake | MNTYRFECVVWVRGESAEQALKELHDEVEYHFSQDNNLVALESDAGTLEETEEQS
Ga0181353_11235132 | 3300022179 | Freshwater Lake | MNLYRFECTVWVRGNTLEEACKELHDEVQYHFGQDNNLVALESNEGELAEDE
Ga0196905_11222452 | 3300022198 | Aqueous | MNTYQFECVVWVRGETLEGALKELHDEVDYLFSLDNNLIALESDKGQLCEEGST
Ga0196905_11680712 | 3300022198 | Aqueous | METFEFKCTVWVRGETREDALKELHDEVDYHFGLDNNLMALESDEGKLVEGEQR
Ga0196901_11051083 | 3300022200 | Aqueous | MKTYEFKCTVWVRGETREDALKELHDEVEYHFGMDNNLVALESDEGKVKEENEDED
Ga0196901_11444781 | 3300022200 | Aqueous | MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESDEGKIVEGETA
Ga0196901_12738281 | 3300022200 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVEYHFGLDNNLVALESDEGKLVEGEC
Ga0181351_10757604 | 3300022407 | Freshwater Lake | MKTYKFQCTVWVQGESAEEARKELHNEVQYHFGLDNNLIALESDAGEFVETEGESK
Ga0181351_10874871 | 3300022407 | Freshwater Lake | RKQMNVYKFQCIVWVHGESAEEALKELHEEADYFFGLDNNLIALGSDEGTLVETDKEMLQ
Ga0181351_11792761 | 3300022407 | Freshwater Lake | MNTYRFQCTVWVRGENAESALKELHDEVDYHFAQDNNLVALESDNGTLDEERIEK
Ga0214923_103757762 | 3300023179 | Freshwater | MNLYRFECTVWVRGNTLDEAQKELHDEVLYHFGQDNNLVALESNEGELAEDDTDL
Ga0209634_11724273 | 3300025138 | Marine | MNFYKFNCNVWVRGETAADALNELHEEVDYLFKQDNNLIALESDKGVKVEEDA
Ga0208424_10018117 | 3300025445 | Aqueous | MNLYKFTCSVWVRGESLEAALQELHEEVDYHFGLDNNLIALESDEGVLAEEGESK
Ga0208795_11750271 | 3300025655 | Aqueous | MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKLCEE
Ga0208162_11773741 | 3300025674 | Aqueous | MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKL
Ga0208019_11754131 | 3300025687 | Aqueous | MNVYAFNCTVWVQGETKEDALKELHDEVKYHFGLDNNLVALESDEGKLVEGEAA
Ga0208019_11851703 | 3300025687 | Aqueous | MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKLCEENQQ
Ga0208644_10495977 | 3300025889 | Aqueous | MKTYEFKCTVWVRGETREDALKELQDEVEYHFGLDNNLMALMSDEGKLVEEEAHVPSA
Ga0208644_12811912 | 3300025889 | Aqueous | MNLYAFNCTVWVQGETKEDALKELHDEVEYNFGLDNNLVALESDEGKLVEEETA
Ga0209953_10079152 | 3300026097 | Pond Water | MKTYKFRCDVWVQGESLTDALEQLHSEVQYHFGLDNNLVALESDQGELVEEAQDDTKD
Ga0255096_10259394 | 3300027302 | Freshwater | MNLYRFECVVWVRGESAEDALKELHDEVDYHFSLDNNLIALESDAGELVEEGESK
Ga0209704_11220322 | 3300027693 | Freshwater Sediment | MKKYKFNCAIWVRGEDAEQARKELHDEVDYWFALDNNLIALESDEGYEEDDD
Ga0209188_100505021 | 3300027708 | Freshwater Lake | MKVYKFQCTVWVRGESAESASKELHKEVEYHFGLDNNLLALESDNGEFVEEDQS
Ga0307380_114310052 | 3300031539 | Soil | MNTYRFECVVWVRGETPEEALKELHDEVDYHFSQDNNLIALESDEGELCEENQQ
Ga0307376_102137974 | 3300031578 | Soil | MNVYQFKCVVWVRGETPEEALKHLHDEVEYHFSQDNNLIALESDEGELCEEEK
Ga0315904_110950282 | 3300031951 | Freshwater | MNLYKFQCTIWVRGTSPADAEKHLHDEVDYHFGQDNNLIALESGEPELVDQTLTERT
Ga0316621_111447322 | 3300033488 | Soil | MSVYKFQCTVWVSGETREAALKELHDEVEYHFSLDNNLIALESDDGELVEQGESK
Ga0335027_0201389_970_1152 | 3300034101 | Freshwater | MMNVYKFECTVWVRGESREAALKELHDEVDYHFGLDNNLIALESDEGALVEEETNQGESK
Ga0335031_0640260_349_495 | 3300034104 | Freshwater | VHGDSAEEAMQELHNEADYFFGLDNNLIALASDEGTLVETDTEQGTLQ
Ga0335053_0547532_3_170 | 3300034118 | Freshwater | MNLYKFQCTIWVRGTSPADAEKHLHDEVDYHFGQDNNLIALESGEPELVDQTLTER
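
Some sequences in the table above end with `*`, and some do not begin with `M`. The following is a minimal sketch for summarizing a few rows. It assumes that a trailing `*` marks a translated stop codon and that a missing leading `M` may indicate an N-terminally truncated gene; neither interpretation is stated on this page. The helper `summarize` and the type `SequenceSummary` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class SequenceSummary(NamedTuple):
    length: int            # residues, excluding any trailing "*"
    starts_with_met: bool  # True when the first residue is M
    ends_with_stop: bool   # True when the row ends with "*"


def summarize(sequence: str) -> SequenceSummary:
    """Summarize one protein sequence as listed in the family table."""
    residues = sequence.rstrip("*")  # assumption: "*" is a stop marker, not a residue
    return SequenceSummary(
        length=len(residues),
        starts_with_met=residues.startswith("M"),
        ends_with_stop=sequence.endswith("*"),
    )


# Three rows copied from the table above (protein ID -> sequence).
rows = {
    "Ga0099850_10967553": "MNTYRFECVVWVRGETPEEALKELHDEVKYHFSQDNNLIALESDEGKLCEENQQ*",
    "RCM37_11982263": "ECQVWVRGESAEAALKELHDEIEYHFSQDNNLIALESDGGTLVEEEQS*",
    "Ga0129342_11628061": "MNVYAFNCTVWVRGETKEDALKELHDEVEYHFGLDNNLVALESD",
}

summaries = {protein_id: summarize(seq) for protein_id, seq in rows.items()}
for protein_id, s in summaries.items():
    print(protein_id, s.length, s.starts_with_met, s.ends_with_stop)
```

Many later rows in the table carry no `*` at all, so the absence of the marker alone should not be read as evidence of truncation.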




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"