NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F096673



Overview

Basic Information
Family ID F096673
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 39 residues
Representative Sequence MSFTIMQHDGMKVVQWFSTVDELLKSMLANPKDRYWRNK
Number of Associated Samples 62
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 69.23 %
% of genes near scaffold ends (potentially truncated) 18.27 %
% of genes from short scaffolds (< 2000 bps) 45.19 %
Associated GOLD sequencing projects 49
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.60
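
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is taken over the 104 family sequences listed under Basic Information; the function name `implied_gene_count` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
def implied_gene_count(percent: float, total: int) -> int:
    """Return the whole number of genes that a reported percentage implies."""
    return round(percent / 100 * total)


# "Number of Sequences" reported for family F096673.
TOTAL_SEQUENCES = 104

# Percentages copied from the Quality Assessment block.
# Assumption: all three are computed over the 104 family members.
reported = {
    "valid RBS motif": 69.23,
    "near scaffold end": 18.27,
    "short scaffold (< 2000 bp)": 45.19,
}

counts = {
    label: implied_gene_count(pct, TOTAL_SEQUENCES)
    for label, pct in reported.items()
}

for label, n in counts.items():
    print(f"{label}: about {n} of {TOTAL_SEQUENCES} genes")
```

Under that assumption the three figures correspond to roughly 72, 19 and 47 genes respectively.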


Most Common Taxonomy
Group Unclassified (40.385 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake (58.654 % of family members)
Environment Ontology (ENVO) Unclassified (77.885 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline) (90.385 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 14.93%, β-sheet: 20.90%, Coil/Unstructured: 64.18%

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.60

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
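
The rule stated in the note can be written as a one-line check. This is only a sketch of the cutoff as quoted (pTM < 0.7); the names `LOW_CONFIDENCE_PTM` and `is_low_confidence` are hypothetical, and how NMPFamsDB applies the rule internally is not shown on this page.

```python
# Cutoff quoted in the "Low Quality Model" note: pTM < 0.7.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F096673.
print(is_low_confidence(0.60))
```

With the reported pTM-score of 0.60 the check is true, which matches the page flagging this family's model as low quality.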



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF03796 | DnaB_C | 6.73
PF13155 | Toprim_2 | 6.73
PF06945 | DUF1289 | 5.77
PF12705 | PDDEXK_1 | 3.85
PF01612 | DNA_pol_A_exo1 | 3.85
PF10124 | Mu-like_gpT | 3.85
PF11753 | DUF3310 | 3.85
PF00476 | DNA_pol_A | 3.85
PF00149 | Metallophos | 2.88
PF00145 | DNA_methylase | 2.88
PF03237 | Terminase_6N | 2.88
PF00436 | SSB | 1.92
PF01464 | SLT | 1.92
PF04545 | Sigma70_r4 | 1.92
PF16786 | RecA_dep_nuc | 1.92
PF02739 | 5_3_exonuc_N | 0.96
PF15943 | YdaS_antitoxin | 0.96
PF12236 | Head-tail_con | 0.96
PF04404 | ERF | 0.96
PF13884 | Peptidase_S74 | 0.96
PF13384 | HTH_23 | 0.96
PF06048 | DUF927 | 0.96
PF11351 | GTA_holin_3TM | 0.96
PF00589 | Phage_integrase | 0.96
PF02195 | ParBc | 0.96
PF00271 | Helicase_C | 0.96
PF13712 | Glyco_tranf_2_5 | 0.96
PF07120 | DUF1376 | 0.96
PF06147 | DUF968 | 0.96
PF13482 | RNase_H_2 | 0.96
PF01242 | PTPS | 0.96
PF13392 | HNH_3 | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG0305 | Replicative DNA helicase | Replication, recombination and repair [L] | 6.73
COG1066 | DNA repair protein RadA/Sms, contains AAA+ ATPase domain | Replication, recombination and repair [L] | 6.73
COG3313 | Predicted Fe-S protein YdhL, DUF1289 family | General function prediction only [R] | 5.77
COG0749 | DNA polymerase I, 3'-5' exonuclease and polymerase domains | Replication, recombination and repair [L] | 3.85
COG0270 | DNA-cytosine methylase | Replication, recombination and repair [L] | 2.88
COG0629 | Single-stranded DNA-binding protein | Replication, recombination and repair [L] | 1.92
COG2965 | Primosomal replication protein N | Replication, recombination and repair [L] | 1.92
COG0258 | 5'-3' exonuclease Xni/ExoIX (flap endonuclease) | Replication, recombination and repair [L] | 0.96
COG0720 | 6-pyruvoyl-tetrahydropterin synthase | Coenzyme transport and metabolism [H] | 0.96
COG3756 | Uncharacterized conserved protein YdaU, DUF1376 family | Function unknown [S] | 0.96
COG5519 | Predicted ATPase domain of Cch-like helicases, DUF927 family | General function prediction only [R] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 59.62 %
Unclassified | root | N/A | 40.38 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300002933|G310J44882_10070030 | Not Available | 756
3300003785|Ga0007851_100359 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 3550
3300004448|Ga0065861_1162019 | Not Available | 726
3300004460|Ga0066222_1011229 | All Organisms → Viruses → Predicted Viral | 3106
3300004460|Ga0066222_1067136 | All Organisms → cellular organisms → Bacteria | 671
3300004774|Ga0007794_10170540 | Not Available | 650
3300004805|Ga0007792_10194868 | Not Available | 623
3300006105|Ga0007819_1065883 | Not Available | 753
3300006109|Ga0007870_1066503 | Not Available | 698
3300007542|Ga0099846_1061117 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → Planctomyces → Planctomyces bekefii | 1418
3300009068|Ga0114973_10097792 | All Organisms → Viruses → Predicted Viral | 1671
3300009082|Ga0105099_10793676 | Not Available | 592
3300009152|Ga0114980_10030899 | All Organisms → Viruses → Predicted Viral | 3313
3300009152|Ga0114980_10093599 | All Organisms → Viruses → Predicted Viral | 1798
3300009152|Ga0114980_10116587 | All Organisms → Viruses → Predicted Viral | 1592
3300009154|Ga0114963_10185130 | Not Available | 1214
3300009155|Ga0114968_10004545 | Not Available | 10332
3300009155|Ga0114968_10392570 | Not Available | 759
3300009158|Ga0114977_10000978 | Not Available | 18881
3300009158|Ga0114977_10009204 | Not Available | 6275
3300009159|Ga0114978_10018572 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 5169
3300009160|Ga0114981_10006194 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 7344
3300009164|Ga0114975_10001503 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 16588
3300009164|Ga0114975_10042958 | All Organisms → Viruses → Predicted Viral | 2682
3300009164|Ga0114975_10061207 | All Organisms → Viruses → Predicted Viral | 2205
3300009164|Ga0114975_10082015 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1875
3300009164|Ga0114975_10130429 | Not Available | 1445
3300009164|Ga0114975_10257292 | Not Available | 975
3300009164|Ga0114975_10268934 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 950
3300009175|Ga0073936_10089520 | All Organisms → Viruses → Predicted Viral | 2569
3300009180|Ga0114979_10000275 | Not Available | 34089
3300009180|Ga0114979_10000708 | Not Available | 21991
3300009180|Ga0114979_10104459 | All Organisms → Viruses → Predicted Viral | 1750
3300009180|Ga0114979_10284933 | Not Available | 984
3300009180|Ga0114979_10589232 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 636
3300009182|Ga0114959_10003050 | Not Available | 13361
3300009184|Ga0114976_10395904 | Not Available | 723
3300009184|Ga0114976_10668697 | Not Available | 523
3300010158|Ga0114960_10000357 | Not Available | 38219
3300010160|Ga0114967_10033624 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3405
3300010334|Ga0136644_10174435 | Not Available | 1296
3300010885|Ga0133913_10143423 | Not Available | 6390
3300010885|Ga0133913_10799509 | All Organisms → Viruses → Predicted Viral | 2458
3300010885|Ga0133913_11050971 | Not Available | 2101
3300010885|Ga0133913_11866425 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1500
3300010885|Ga0133913_12627024 | All Organisms → Viruses → Predicted Viral | 1221
3300011921|Ga0120089_100146 | Not Available | 26484
3300011921|Ga0120089_103714 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1589
3300012665|Ga0157210_1000137 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 38227
3300013286|Ga0136641_1141439 | Not Available | 653
3300017747|Ga0181352_1004353 | Not Available | 4897
3300017754|Ga0181344_1001935 | Not Available | 7490
3300017754|Ga0181344_1007414 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3611
3300017754|Ga0181344_1105133 | Not Available | 818
3300017766|Ga0181343_1115845 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 755
3300020707|Ga0214238_1001814 | All Organisms → Viruses → Predicted Viral | 4038
3300020726|Ga0214220_1003047 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3519
3300021354|Ga0194047_10000344 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 31538
3300022200|Ga0196901_1041554 | All Organisms → Viruses → Predicted Viral | 1749
3300022555|Ga0212088_10007917 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 18030
3300022555|Ga0212088_10009845 | Not Available | 15481
3300022555|Ga0212088_10149349 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 1978
3300022594|Ga0236340_1015249 | All Organisms → Viruses → Predicted Viral | 2206
3300025316|Ga0209697_10236014 | All Organisms → Viruses → Predicted Viral | 1005
3300025428|Ga0208506_1013678 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1563
3300025723|Ga0208741_10002114 | All Organisms → Viruses | 7252
3300025782|Ga0208867_1024550 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 904
3300025838|Ga0208872_1028001 | All Organisms → Viruses → Predicted Viral | 2531
3300027708|Ga0209188_1000500 | Not Available | 37860
3300027733|Ga0209297_1000849 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 18668
3300027733|Ga0209297_1003250 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 8719
3300027733|Ga0209297_1072422 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1518
3300027733|Ga0209297_1187420 | Not Available | 828
3300027734|Ga0209087_1001472 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 13843
3300027734|Ga0209087_1003101 | Not Available | 9199
3300027734|Ga0209087_1028747 | All Organisms → Viruses → Predicted Viral | 2673
3300027734|Ga0209087_1049885 | All Organisms → Viruses → Predicted Viral | 1917
3300027734|Ga0209087_1145090 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 958
3300027741|Ga0209085_1012113 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4289
3300027741|Ga0209085_1027988 | All Organisms → Viruses → Predicted Viral | 2682
3300027747|Ga0209189_1047052 | All Organisms → Viruses → Predicted Viral | 2106
3300027759|Ga0209296_1006148 | Not Available | 7596
3300027759|Ga0209296_1022690 | All Organisms → Viruses → Predicted Viral | 3557
3300027763|Ga0209088_10000338 | Not Available | 34093
3300027763|Ga0209088_10009720 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 5232
3300027763|Ga0209088_10328851 | Not Available | 611
3300027782|Ga0209500_10010558 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 5709
3300027782|Ga0209500_10059256 | All Organisms → Viruses → Predicted Viral | 2012
3300027782|Ga0209500_10386969 | Not Available | 566
3300027896|Ga0209777_10161748 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 1835
3300027963|Ga0209400_1001179 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 21783
3300027969|Ga0209191_1023030 | All Organisms → Viruses → Predicted Viral | 3063
3300027969|Ga0209191_1082148 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1405
3300027969|Ga0209191_1182869 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 835
(restricted) 3300027970|Ga0247837_1024844 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4640
3300027974|Ga0209299_1045710 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1836
3300027974|Ga0209299_1154355 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 864
(restricted) 3300027977|Ga0247834_1007619 | Not Available | 11430
3300028025|Ga0247723_1012275 | All Organisms → Viruses → Predicted Viral | 3229
(restricted) 3300028114|Ga0247835_1005341 | Not Available | 10486
(restricted) 3300028569|Ga0247843_1030270 | All Organisms → Viruses → unclassified viruses → unclassified DNA viruses → unclassified dsDNA viruses → Prokaryotic dsDNA virus sp. | 3688
(restricted) 3300028569|Ga0247843_1215194 | Not Available | 720
3300031772|Ga0315288_10087874 | Not Available | 3586
3300032092|Ga0315905_10035692 | Not Available | 5051

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
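
Each scaffold entry in the table above joins a numeric prefix and a scaffold name with a `|`, and the prefix values match the Taxon OID column of the Associated Samples table. The sketch below splits one such entry and records the `(restricted)` flag. It assumes the entry has already been separated from the taxonomy and length columns; `ScaffoldRef` and `parse_scaffold` are hypothetical names, not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str     # numeric prefix before the "|"
    scaffold_id: str   # scaffold name after the "|"
    restricted: bool   # True when the entry carries the "(restricted)" marker


def parse_scaffold(field: str) -> ScaffoldRef:
    """Split a scaffold entry such as '3300003785|Ga0007851_100359'."""
    text = field.strip()
    marker = "(restricted)"
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold_id = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold entry: {field!r}")
    return ScaffoldRef(taxon_oid, scaffold_id, restricted)


print(parse_scaffold("3300003785|Ga0007851_100359"))
print(parse_scaffold("(restricted) 3300027970|Ga0247837_1024844"))
```

Keeping the restricted flag alongside the identifiers makes it straightforward to set aside entries that fall under the JGI data usage policy before any further processing.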




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 58.65%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater | 8.65%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 4.81%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 4.81%
Freshwater Lake Hypolimnion | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake Hypolimnion | 4.81%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Hypolimnion → Freshwater | 2.88%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 2.88%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.92%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 1.92%
Mine Pit Pond | Environmental → Terrestrial → Geologic → Mine → Unclassified → Mine Pit Pond | 1.92%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.96%
Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment | 0.96%
Anoxic Zone Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Zone Freshwater | 0.96%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.96%
Freshwater | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater | 0.96%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 0.96%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 0.96%
Deep Subsurface SedimentEnvironmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment0.96%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002933 | Combined Assembly of freshwater hypolimnion microbial communities from Trout Bog Lake, Wisconsin, USA | Environmental
3300003785 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBH06Jun08 | Environmental
3300004448 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300004460 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300004774 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - MA5M | Environmental
3300004805 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - MA6M | Environmental
3300006105 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBE07Jul09 | Environmental
3300006109 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBH04Jul08 | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300009068 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_140807_MF_MetaG | Environmental
3300009082 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 1-3cm May2015 | Environmental
3300009152 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG | Environmental
3300009154 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_131016_EF_MetaG | Environmental
3300009155 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_EF_MetaG | Environmental
3300009158 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_MF_MetaG | Environmental
3300009159 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140212_EF_MetaG | Environmental
3300009160 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_MF_MetaG | Environmental
3300009164 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130626_EF_MetaG | Environmental
3300009175 | Freshwater lake bacterial and archeal communities from Alinen Mustajarvi, Finland, to study Microbial Dark Matter (Phase II) - Alinen Mustajarvi 5m metaG | Environmental
3300009180 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140625_EF_MetaG | Environmental
3300009182 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130625_EF_MetaG | Environmental
3300009184 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_EF_MetaG | Environmental
3300010158 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130625_MF_MetaG | Environmental
3300010160 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130628_MF_MetaG | Environmental
3300010334 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_EF_MetaG (v2) | Environmental
3300010885 | northern Canada Lakes Co-assembly | Environmental
3300011921 | Mine pit pond microbial communities from Vermont, USA - 2M | Environmental
3300012665 | Freshwater microbial communities from Talbot River, Ontario, Canada - S11 | Environmental
3300013286 | Freshwater microbial communities from Elizabeth Lake, Yosemite National Park, California, USA - 13020-23Y | Environmental
3300017747 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.S.N | Environmental
3300017754 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.D.D | Environmental
3300017766 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.S.D | Environmental
3300020707 | Freshwater microbial communities from Trout Bog Lake, WI - 05AUG2008 hypolimnion | Environmental
3300020726 | Freshwater microbial communities from Trout Bog Lake, WI - 31JUL2007 hypolimnion | Environmental
3300021354 | Anoxic zone freshwater microbial communities from boreal shield lake in IISD Experimental Lakes Area, Ontario, Canada - Jun2016-L221-5m | Environmental
3300022200 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3) | Environmental
3300022555 | Alinen_combined assembly | Environmental
3300022594 | Freshwater microbial communities from thermokarst lake SAS2a, Kuujjuarapick, Canada - Sample Summer S1 | Environmental
3300025316 | Freshwater lake bacterial and archeal communities from Alinen Mustajarvi, Finland, to study Microbial Dark Matter (Phase II) - Alinen Mustajarvi 5m metaG (SPAdes) | Environmental
3300025428 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBH29Jul08 (SPAdes) | Environmental
3300025723 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBE03Jun09 (SPAdes) | Environmental
3300025782 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBE05Oct08 (SPAdes) | Environmental
3300025838 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - CBH12Aug08 (SPAdes) | Environmental
3300027708 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130625_EF_MetaG (SPAdes) | Environmental
3300027733 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_MF_MetaG (SPAdes) | Environmental
3300027734 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_EF_MetaG (SPAdes) | Environmental
3300027741 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_131016_EF_MetaG (SPAdes) | Environmental
3300027747 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_EF_MetaG (SPAdes) | Environmental
3300027759 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130206_EF_MetaG (SPAdes) | Environmental
3300027763 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140625_EF_MetaG (SPAdes) | Environmental
3300027782 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140212_EF_MetaG (SPAdes) | Environmental
3300027896 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies -HBP12 HB (SPAdes) | Environmental
3300027963 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_EF_MetaG (SPAdes) | Environmental
3300027969 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130626_EF_MetaG (SPAdes) | Environmental
3300027970 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_14.5m | Environmental
3300027974 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_MF_MetaG (SPAdes) | Environmental
3300027977 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_12m | Environmental
3300028025 | Subsurface sediment microbial communities from gas well in West Virginia, United States - MSEEL Well Study Marcellus 5H_FC | Environmental
3300028114 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_13.5m | Environmental
3300028569 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2017_8m | Environmental
3300031772 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_20 | Environmental
3300032092 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 4 MA121 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
G310J44882_100700302 | 3300002933 | Freshwater | MGGLMSFIIYQKDGLRCIQYFFNTEQLIKSMLNNPNDSYHRNS*
Ga0007851_1003592 | 3300003785 | Freshwater | MSFNIYQTNGLRCIQWFFNTDELIKSMLNNPNDAYHRIT*
Ga0065861_11620193 | 3300004448 | Marine | MSFTITTKDGMRVDQWFRSVDELLQSMLNNPEDRYWRNT*
Ga0066222_10112294 | 3300004460 | Marine | MSFDIMTENGMRVTQWFSSIDELLRSMLSNPKDRYWRNK*
Ga0066222_10671361 | 3300004460 | Marine | MSFDIMTEKGMRVTQWFSSIDELLKSMLANPKDRYWRNT*
Ga0007794_101705402 | 3300004774 | Freshwater | MSFTIMQHDGMKAIQWFNTVDDLLKSMLANPNDAYHRNKS*
Ga0007792_101948683 | 3300004805 | Freshwater | MSFTIMQHDGMKTIQWFSNVDELIKSMLNNPKDRYYRNDDNRR*
Ga0007819_10658833 | 3300006105 | Freshwater | HDGMKVIQWFSTMDELINSMLNNPKDRYHRNDDNRR*
Ga0007870_10665031 | 3300006109 | Freshwater | MSFNIYQTNGLRCIQWFFNTDELIKSMLNNPNDAYHR
Ga0099846_10611172 | 3300007542 | Aqueous | MSFEIRTHDGMKVVQWFKDIDELIASMTANHKDTYHRNLYV*
Ga0114973_100977924 | 3300009068 | Freshwater Lake | MSFTIMQHDGMKVVQWFRSIDELLKSMLANPKDRYWRNK*
Ga0105099_107936762 | 3300009082 | Freshwater Sediment | MSFDIITKEGMRVTQWFTSVDELLKSMLANPKDRYWRNT*
Ga0114980_100308996 | 3300009152 | Freshwater Lake | MSFTIMQHDGMKVVQWFSTVDELLKSMLANPKDRYWRNK*
Ga0114980_100935995 | 3300009152 | Freshwater Lake | MSFTIMQHDGMKVVQWFFNIDELIKSMLNNPKDRYWRNG*
Ga0114980_101165872 | 3300009152 | Freshwater Lake | MSFTITTHNGMRIEQWFFSIDELIKSMIKNPNDRYWRNK*
Ga0114963_101851303 | 3300009154 | Freshwater Lake | MSFDIITEKGMRVTQWFSSIDELLKSMLANPKDRYWRNI*
Ga0114968_1000454514 | 3300009155 | Freshwater Lake | MSFTIYTHDGMKVIQWFFNMDELIKSMQTNYKDHYHRNS*
Ga0114968_103925703 | 3300009155 | Freshwater Lake | MSFTIMQHDGMKVVQWFSNVDQLIKPMLANPKDRYWR
Ga0114977_1000097820 | 3300009158 | Freshwater Lake | MSFTITTYDGMRVDQWFSSVDELLKSMLANPKDRYWRNG*
Ga0114977_1000920412 | 3300009158 | Freshwater Lake | MSFTIMQHDGMKVIMWFADVDILLISIKNNPKDHYHRNT*
Ga0114978_1001857212 | 3300009159 | Freshwater Lake | MSFDIMTEKGMRVTQWFSSVDELLKSMLANPKDRYWRNK*
Ga0114981_100061943 | 3300009160 | Freshwater Lake | MSFTIYTHDGMKQIQWFFNMDELIKAMLNNPKNHYHRNSN*
Ga0114975_100015036 | 3300009164 | Freshwater Lake | MSFDIITKEGMRVTQWFTSVDELLKSMLANPKDRYWRNS*
Ga0114975_100429585 | 3300009164 | Freshwater Lake | MSFTITTKEGMRVDQWFRSVDELVKSMLDNPHDRYWRNS*
Ga0114975_100612073 | 3300009164 | Freshwater Lake | MSFDIITKEGMRVTQWFTSIDELLKSMLANPKDRYWRNK*
Ga0114975_100820155 | 3300009164 | Freshwater Lake | VSFTIMQHDGMKVVQWFSNVDQLIKSMLANPKDRYWRNK*
Ga0114975_101304294 | 3300009164 | Freshwater Lake | MSFDIITKDGMRVTQWFSSVDELLKSMLNNPKDRYWRNK*
Ga0114975_102572922 | 3300009164 | Freshwater Lake | MSFDIITEKGMRVTQWFSSVDELLKSMLANPKDRYWRNT*
Ga0114975_102689341 | 3300009164 | Freshwater Lake | MSFTITTKDGMRIDQWFRSVDELVQSMLDNPKDRYWRN
Ga0073936_100895204 | 3300009175 | Freshwater Lake Hypolimnion | MSFTIMRHDGMKVTQWFRTIDELIASMLANPKDSYWRNK*
Ga0114979_100002756 | 3300009180 | Freshwater Lake | MERVSFTIYTHDGMKVVQWFKNVDELLKSMLNNPKDRYHRNE*
Ga0114979_100007081 | 3300009180 | Freshwater Lake | MSFDIITYEGMRVTQWFPTVDALLKSMLANPKDRYWRVL*
Ga0114979_101044593 | 3300009180 | Freshwater Lake | MSFTITTHDGMKVVQWFSTMDELLKSMLNNPKDRYWRNK*
Ga0114979_102849331 | 3300009180 | Freshwater Lake | MSFDILTHEGMRVTQWFTSVDELLKSMLANPKDRYWRNK*
Ga0114979_105892323 | 3300009180 | Freshwater Lake | MSFTIYTHDGMKVIQWFFNIDELIKAMINNPLDRYH
Ga0114959_100030507 | 3300009182 | Freshwater Lake | MSFEIMRHDGMKEIHWFTIDQLIKSMLANPKDRYWRIR*
Ga0114976_103959041 | 3300009184 | Freshwater Lake | MSFTIMQHDGMKVTQWFRTIDELIISMINNPHDRYYRN
Ga0114976_106686972 | 3300009184 | Freshwater Lake | MSFDILTHEGMRVTQWFTSVDDLLKSMLANPKDRYWRNK*
Ga0114960_1000035738 | 3300010158 | Freshwater Lake | MSFTIMRHDGMKETHWFSTIDDLLKSMLANPKDRYWRNK*
Ga0114967_100336246 | 3300010160 | Freshwater Lake | MSFTIMQHDGMKVVQWFSNVDQLIKSMLANPKDRYWRNK*
Ga0136644_101744352 | 3300010334 | Freshwater Lake | MSFTIMQHDGMKVTQWFSTIDELIKSMINNPKDVYHRNT*
Ga0133913_1014342315 | 3300010885 | Freshwater Lake | MSFTITTKEGMRVDQWFRSVDELLQSMLNNPEDRYWRNT*
Ga0133913_107995097 | 3300010885 | Freshwater Lake | MSFTITTKEGMRVDQWFRSVDELLQSMLDNPKDRYWRNS*
Ga0133913_110509716 | 3300010885 | Freshwater Lake | FTIYTHDGYKHIQWFFNMDELIKAMQTNYKDHYHRNS*
Ga0133913_118664254 | 3300010885 | Freshwater Lake | MSFDILTEKGMRVTQWFRSIDELLQSMKANPKDRYWRNT*
Ga0133913_126270243 | 3300010885 | Freshwater Lake | MSFEIITHKGMRMEQWFTSVDELLRSMANNPNDRYWRIR*
Ga0120089_10014619 | 3300011921 | Mine Pit Pond | MSFEIMQHNGMKIIQYFFSMDELIKSMLNNPKDRYWRIK*
Ga0120089_1037141 | 3300011921 | Mine Pit Pond | TNAMSFEIIQADGMRCIQWFFNTDELIKSMLNNPNDRYWRIG*
Ga0157210_100013716 | 3300012665 | Freshwater | MSFDIITKEGMRVTQWFSSVDELLKSMLANPKDRYWRNT*
Ga0136641_11414392 | 3300013286 | Freshwater | VSFTIMQHDGMKVNQWFNTIDDLLKSMLNNPKDAYHRNKS*
Ga0181352_10043535 | 3300017747 | Freshwater Lake | MSFTIMRHNGMKEIQYFFDIQSLIKSMLNNPKDAYHRNM
Ga0181344_100193514 | 3300017754 | Freshwater Lake | MSFTITTHNGMRVDQWFTSVDELIKSMINNPKDRYWRNS
Ga0181344_10074149 | 3300017754 | Freshwater Lake | MSFTIMRHNGMKEIQYFFSMDELIKSMLKNPKDAYHRNL
Ga0181344_11051331 | 3300017754 | Freshwater Lake | IWRRKGMRVTQWFSSVDELLKSMLANPKDRYWRNT
Ga0181343_11158451 | 3300017766 | Freshwater Lake | MSFTIMQHNGMKVIQYFFSMDELIKSMLKNPKDVYHRN
Ga0214238_100181411 | 3300020707 | Freshwater | MSFTIMQHDGMKVIQYFFNIDDLIKSMLNNPKDVYWRNT
Ga0214220_10030477 | 3300020726 | Freshwater | MRHDGMKVIQWFSNVDELIKSMLNNPKDRYYRNDNNHR
Ga0194047_1000034427 | 3300021354 | Anoxic Zone Freshwater | MSFSVYTHDGMKETQWFFSIDELINSMINNPLNRYHRNV
Ga0196901_10415546 | 3300022200 | Aqueous | MSFEIRTHDGMKVVQWFKDIDELIASMTANHKDTYHRNLYV
Ga0212088_1000791715 | 3300022555 | Freshwater Lake Hypolimnion | MSFTIMRHDGMKVTQWFRTIDELIASMLANPKDSYWRNK
Ga0212088_1000984513 | 3300022555 | Freshwater Lake Hypolimnion | MSFTIMQHDGMKAIQWFNTVDDLLKSMLANPNDAYHRNKS
Ga0212088_101493493 | 3300022555 | Freshwater Lake Hypolimnion | MQHDGMKVIQWFSTMDELINSMLNNPKDRYHRNDNNHR
Ga0236340_10152492 | 3300022594 | Freshwater | MSFTIMQHDGMKTIQWFSNVDELIKSMLNNPKDRYYRNDDNRR
Ga0209697_102360141 | 3300025316 | Freshwater Lake Hypolimnion | MSFTIMQHDGMKAIQWFNTVDDLLKSMLANPNDAYHR
Ga0208506_10136785 | 3300025428 | Freshwater | MSFNIYQTNGLRCIQWFFNTDELIKSMLNNPNDAYHRIT
Ga0208741_100021143 | 3300025723 | Freshwater | MQHDGMKVIQWFSNVDELIKSMLNNPKDRYHRNDNNHR
Ga0208867_10245503 | 3300025782 | Freshwater | DGMKVIQWFSTMDELINSMLNNPKDRYHRNDDNRR
Ga0208872_10280011 | 3300025838 | Freshwater | YQVDGIKVIQWFNNIDLLLISMLNNPQAVYHRNMK
Ga0209188_100050028 | 3300027708 | Freshwater Lake | MSFEIMRHDGMKEIHWFTIDQLIKSMLANPKDRYWRIR
Ga0209297_100084912 | 3300027733 | Freshwater Lake | MSFTITTYDGMRVDQWFSSVDELLKSMLANPKDRYWRNG
Ga0209297_10032504 | 3300027733 | Freshwater Lake | MSFTIMQHDGMKVIMWFADVDILLISIKNNPKDHYHRNT
Ga0209297_10724224 | 3300027733 | Freshwater Lake | MSFTIMQHDGMKVVQWFSNVDQLIKSMLANPKDRYWRNK
Ga0209297_11874204 | 3300027733 | Freshwater Lake | VSFTILTKDGMRVDQWFRSIDELLKSMLANPKDRYWRNG
Ga0209087_100147219 | 3300027734 | Freshwater Lake | MSFTITTKDGMRIDQWFRSVDELVQSMLDNPKDRYWRNS
Ga0209087_100310123 | 3300027734 | Freshwater Lake | MSFDIITKEGMRVTQWFTSVDELLKSMLANPKDRYWRNS
Ga0209087_10287473 | 3300027734 | Freshwater Lake | MSFDIITEKGMRVTQWFSSVDELLKSMLANPKDRYWRNT
Ga0209087_10498852 | 3300027734 | Freshwater Lake | MSFEIMRHDGMKVVQWFSTTDELIKSMINNPNDRYWRIR
Ga0209087_11450903 | 3300027734 | Freshwater Lake | TIYTHDGMKVIQWFFNIDELIKAMLNNPKDRYHRN
Ga0209085_101211313 | 3300027741 | Freshwater Lake | MSFTIYTHDGMKVIQWFFNIDELIKAMINNPLDRY
Ga0209085_10279886 | 3300027741 | Freshwater Lake | MSFDIITEKGMRVTQWFSSIDELLKSMLANPKDRYWRNI
Ga0209189_10470524 | 3300027747 | Freshwater Lake | MSFTIMRHDGMKETHWFSTIDDLLKSMLANPKDRYWRNK
Ga0209296_10061483 | 3300027759 | Freshwater Lake | MSFDIITKDGMRVTQWFSSVDELLKSMLNNPKDRYWRNK
Ga0209296_10226906 | 3300027759 | Freshwater Lake | MSFDILTHEGMRVTQWFTSVDELLKSMLANPKDRYWRNK
Ga0209088_1000033833 | 3300027763 | Freshwater Lake | MERVSFTIYTHDGMKVVQWFKNVDELLKSMLNNPKDRYHRNE
Ga0209088_100097205 | 3300027763 | Freshwater Lake | MSFTIMQHDGMKVVQWFSTVDELLKSMLANPKDRYWRNK
Ga0209088_103288511 | 3300027763 | Freshwater Lake | MSFDIITYEGMRVTQWFPTVDALLKSMLANPKDRYWRVL
Ga0209500_100105589 | 3300027782 | Freshwater Lake | VSFTIMQHDGMKVTQWFRSIDELLKSMLANPKDRYWRNK
Ga0209500_100592565 | 3300027782 | Freshwater Lake | MSFDIMTEKGMRVTQWFSSVDELLKSMLANPKDRYWRNK
Ga0209500_103869692 | 3300027782 | Freshwater Lake | MSFTITTHNGMRIEQWFFSIDELIKSMIKNPNDRYWRNK
Ga0209777_101617482 | 3300027896 | Freshwater Lake Sediment | MQHDGMKVIQWFSTMDELINSMLNNPKDRYHRNDDNRR
Ga0209400_100117927 | 3300027963 | Freshwater Lake | MSFTIYTHDGMKVIQWFFNMDELIKSMQTNYKDHYHRNS
Ga0209191_10230304 | 3300027969 | Freshwater Lake | MSFDIITKEGMRVTQWFTSIDELLKSMLANPKDRYWRNK
Ga0209191_10821484 | 3300027969 | Freshwater Lake | MSFDIITEKGMRVTQWFSSIDELVKSMLANPKDRYWRNT
Ga0209191_11828693 | 3300027969 | Freshwater Lake | TIYTHKGMKVIQYFFSIDELIKSMLNNPKDAYHRNL
(restricted) Ga0247837_10248442 | 3300027970 | Freshwater | MSFTITTHAGMRVDQWFHSVDELLSSMVKNPNDRYWRNT
Ga0209299_10457102 | 3300027974 | Freshwater Lake | MSFTIYTHDGMKQIQWFFNMDELIKAMLNNPKNHYHRNSN
Ga0209299_11543553 | 3300027974 | Freshwater Lake | MSFTIMQHDGMKVVQWFFNIDELIKSMLNNPKDRYWRNG
(restricted) Ga0247834_10076194 | 3300027977 | Freshwater | MSFTILSQDGTMFIQWFFNIDELIKAMQNNPKDTYHRNV
Ga0247723_10122753 | 3300028025 | Deep Subsurface Sediment | MSFTIITKEGMRVDQWFRSIDELLQSMKANPNDRYWRNS
(restricted) Ga0247835_100534115 | 3300028114 | Freshwater | MSFTITTHDGMKVIQYFFNVDELIKSMLNNPRDRYWRNK
(restricted) Ga0247843_10302705 | 3300028569 | Freshwater | MQHDGMKVIQWFFNIDELIKSMLNNPQDRYYRNDDNNR
(restricted) Ga0247843_12151941 | 3300028569 | Freshwater | MSFTIMQHDGMKVIQYFFNIDELIKSMLNNPKDTYHRNV
Ga0315288_100878744 | 3300031772 | Sediment | MSFTIYTHDGMKVEQYFFSINELIISMIANSKDRYHRNT
Ga0315905_100356927 | 3300032092 | Freshwater | MSFTIYTDTGMRVIQYFFNIDELIKSMLNNPKDRYHRNS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice