NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F105051



Overview

Basic Information
Family ID F105051
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 36 residues
Representative Sequence MTISGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT
Number of Associated Samples 70
Number of Associated Scaffolds 100
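
The figures above can be cross-checked against the representative sequence itself. The snippet below is a minimal sketch, not part of NMPFamsDB: the function name `composition` and the crude net-charge estimate (Lys + Arg minus Asp + Glu, ignoring His and the termini) are assumptions made for illustration.

```python
from collections import Counter

# Representative sequence copied from the Basic Information block above.
REPRESENTATIVE = "MTISGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT"


def composition(seq: str) -> dict:
    """Return length, residue counts, lysine fraction and a crude net charge."""
    counts = Counter(seq)
    positive = counts["K"] + counts["R"]  # Lys + Arg
    negative = counts["D"] + counts["E"]  # Asp + Glu
    return {
        "length": len(seq),
        "counts": counts,
        "lys_fraction": counts["K"] / len(seq),
        # Assumption: His and the chain termini are ignored.
        "net_charge": positive - negative,
    }


summary = composition(REPRESENTATIVE)
print(summary["length"], summary["counts"]["K"], summary["net_charge"])
# prints: 38 11 11
```

The representative is 38 residues long, slightly above the family average of 36, and lysine accounts for 11 of its residues.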

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 67.02 %
% of genes near scaffold ends (potentially truncated) 9.00 %
% of genes from short scaffolds (< 2000 bps) 64.00 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.41


Most Common Taxonomy
Group Unclassified (83.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater
(24.000 % of family members)
Environment Ontology (ENVO) Unclassified
(56.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(55.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.27%, β-sheet: 3.03%, Coil/Unstructured: 69.70%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.41

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
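
The screening rule in the note above can be written as a one-line check. This is a hypothetical sketch: the function name `is_low_confidence` is invented here, and treating a pTM of exactly 0.7 as passing is an assumption, since the page only states the `pTM < 0.7` condition.

```python
# Cutoff quoted in the note above; not read from any NMPFamsDB API.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """True when a model falls below the pTM cutoff (strictly less than)."""
    return ptm < threshold


print(is_low_confidence(0.41))  # this family's pTM-score; prints True
```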



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF05050 | Methyltransf_21 | 13.00
PF08291 | Peptidase_M15_3 | 6.00
PF08241 | Methyltransf_11 | 3.00
PF00961 | LAGLIDADG_1 | 3.00
PF01370 | Epimerase | 3.00
PF13578 | Methyltransf_24 | 3.00
PF02675 | AdoMet_dc | 2.00
PF01832 | Glucosaminidase | 2.00
PF03592 | Terminase_2 | 2.00
PF00011 | HSP20 | 1.00
PF03237 | Terminase_6N | 1.00
PF11351 | GTA_holin_3TM | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG1586 | S-adenosylmethionine decarboxylase | Amino acid transport and metabolism [E] | 2.00
COG3728 | Phage terminase, small subunit | Mobilome: prophages, transposons [X] | 2.00
COG0071 | Small heat shock protein IbpA, HSP20 family | Posttranslational modification, protein turnover, chaperones [O] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 83.00 %
All Organisms | root | All Organisms | 17.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002206|metazooDRAFT_1313446 | Not Available | 637
3300002408|B570J29032_109078486 | Not Available | 587
3300002408|B570J29032_109454285 | Not Available | 776
3300005517|Ga0070374_10032662 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2688
3300005805|Ga0079957_1016822 | All Organisms → cellular organisms → Archaea | 5241
3300005805|Ga0079957_1037249 | All Organisms → cellular organisms → Bacteria | 3133
3300005805|Ga0079957_1048871 | Not Available | 2609
3300005805|Ga0079957_1086423 | Not Available | 1754
3300005805|Ga0079957_1124135 | Not Available | 1359
3300005805|Ga0079957_1193445 | Not Available | 988
3300006030|Ga0075470_10065173 | All Organisms → cellular organisms → Bacteria | 1109
3300006639|Ga0079301_1025638 | All Organisms → Viruses → Predicted Viral | 2021
3300006802|Ga0070749_10445654 | Not Available | 710
3300006802|Ga0070749_10532518 | Not Available | 638
3300006805|Ga0075464_10143932 | Not Available | 1396
3300006805|Ga0075464_10492802 | Not Available | 750
3300006805|Ga0075464_10566063 | All Organisms → cellular organisms → Bacteria | 698
3300006862|Ga0079299_1007074 | Not Available | 2637
3300006875|Ga0075473_10403141 | Not Available | 553
3300007162|Ga0079300_10020939 | All Organisms → Viruses → Predicted Viral | 2334
3300007162|Ga0079300_10069453 | Not Available | 1066
3300007363|Ga0075458_10095114 | Not Available | 929
3300007541|Ga0099848_1246852 | Not Available | 625
3300008072|Ga0110929_1014628 | Not Available | 6125
3300008114|Ga0114347_1000954 | Not Available | 21194
3300008448|Ga0114876_1122747 | Not Available | 993
3300009068|Ga0114973_10716377 | Not Available | 508
3300009085|Ga0105103_10906402 | Not Available | 517
3300009146|Ga0105091_10592663 | Not Available | 572
3300009152|Ga0114980_10000663 | Not Available | 24591
3300009152|Ga0114980_10207207 | Not Available | 1152
3300009158|Ga0114977_10046839 | Not Available | 2683
3300009159|Ga0114978_10097085 | Not Available | 1950
3300009163|Ga0114970_10768762 | Not Available | 508
3300009165|Ga0105102_10007295 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → unclassified Cellvibrionales → Cellvibrionales bacterium TMED49 | 4113
3300009165|Ga0105102_10065420 | Not Available | 1634
3300009168|Ga0105104_10032367 | Not Available | 2927
3300009168|Ga0105104_10096346 | Not Available | 1593
3300009169|Ga0105097_10149823 | All Organisms → cellular organisms → Bacteria | 1283
3300009170|Ga0105096_10786226 | Not Available | 508
3300010354|Ga0129333_10115509 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → Planctomyces → Planctomyces bekefii | 2475
3300010354|Ga0129333_10210696 | Not Available | 1764
3300010354|Ga0129333_10868692 | Not Available | 765
3300010354|Ga0129333_11726300 | Not Available | 509
3300011338|Ga0153699_1205 | Not Available | 25250
3300013004|Ga0164293_10450526 | Not Available | 857
3300013004|Ga0164293_10821215 | Not Available | 588
3300013005|Ga0164292_10332881 | Not Available | 1030
(restricted) 3300013126|Ga0172367_10334129 | Not Available | 880
(restricted) 3300014720|Ga0172376_10314174 | Not Available | 930
(restricted) 3300014720|Ga0172376_10346477 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 870
(restricted) 3300014720|Ga0172376_10425743 | Not Available | 757
3300017747|Ga0181352_1028653 | Not Available | 1689
3300017754|Ga0181344_1088959 | Not Available | 902
3300017785|Ga0181355_1277992 | Not Available | 635
3300017785|Ga0181355_1340803 | Not Available | 553
3300019784|Ga0181359_1100549 | All Organisms → cellular organisms → Bacteria | 1060
3300019784|Ga0181359_1132833 | Not Available | 875
3300020074|Ga0194113_10308291 | Not Available | 1199
3300020533|Ga0208364_1001964 | Not Available | 3997
3300021961|Ga0222714_10471118 | Not Available | 650
3300021961|Ga0222714_10616940 | Not Available | 539
3300021962|Ga0222713_10084595 | All Organisms → Viruses | 2307
3300021963|Ga0222712_10254029 | Not Available | 1123
3300021963|Ga0222712_10437567 | Not Available | 787
3300022179|Ga0181353_1065499 | Not Available | 935
3300022190|Ga0181354_1022189 | Not Available | 2024
3300023179|Ga0214923_10002325 | Not Available | 25440
3300024490|Ga0255185_1007392 | Not Available | 1591
3300027693|Ga0209704_1008749 | Not Available | 2357
3300027721|Ga0209492_1166283 | Not Available | 766
(restricted) 3300027728|Ga0247836_1004400 | Not Available | 17632
(restricted) 3300027728|Ga0247836_1030998 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 3541
(restricted) 3300027728|Ga0247836_1056365 | Not Available | 2201
(restricted) 3300027728|Ga0247836_1137258 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → unclassified Verrucomicrobiales → Verrucomicrobiales bacterium | 1085
(restricted) 3300027730|Ga0247833_1043573 | Not Available | 2545
(restricted) 3300027730|Ga0247833_1044681 | All Organisms → Viruses → Predicted Viral | 2491
(restricted) 3300027730|Ga0247833_1060579 | Not Available | 1937
3300027734|Ga0209087_1001255 | Not Available | 15008
3300027759|Ga0209296_1257150 | Not Available | 715
3300027900|Ga0209253_10151391 | Not Available | 1878
(restricted) 3300027977|Ga0247834_1081913 | Not Available | 1521
3300028025|Ga0247723_1043429 | Not Available | 1323
(restricted) 3300029268|Ga0247842_10153145 | All Organisms → Viruses → Predicted Viral | 1338
3300029930|Ga0119944_1022166 | Not Available | 857
3300031758|Ga0315907_10003495 | Not Available | 18501
3300031857|Ga0315909_10642969 | Not Available | 700
3300033996|Ga0334979_0051510 | Not Available | 2668
3300034019|Ga0334998_0056995 | Not Available | 2703
3300034019|Ga0334998_0210068 | Not Available | 1204
3300034072|Ga0310127_003447 | Not Available | 15199
3300034092|Ga0335010_0677066 | Not Available | 511
3300034101|Ga0335027_0745111 | Not Available | 575
3300034116|Ga0335068_0577811 | Not Available | 509

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 24.00%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 10.00%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 10.00%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 10.00%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 8.00%
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 6.00%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 5.00%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 4.00%
Deep Subsurface | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface | 4.00%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater | 2.00%
Freshwater Lentic | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic | 2.00%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 2.00%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 2.00%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater | 1.00%
Lake | Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Lake | 1.00%
Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment | 1.00%
Freshwater, Plankton | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton | 1.00%
Freshwater | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater | 1.00%
Aquatic | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Aquatic | 1.00%
Water Bodies | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Water Bodies | 1.00%
Aquatic | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Aquatic | 1.00%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 1.00%
Fracking Water | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Fracking Water | 1.00%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 1.00%
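
Because the same broad ecosystem appears under many habitat labels, it can help to roll the rows above up by the second and third levels of the GOLD path (category → type). The following is a sketch only: the dictionary `PERCENT_BY_TYPE` is a hand transcription of the Distribution column grouped by those two levels, not something the database exports in this form.

```python
# Percentages transcribed from the habitat table above, grouped by the
# second and third levels of each GOLD path (hand-built, an assumption).
PERCENT_BY_TYPE = {
    "Aquatic → Freshwater": [24, 10, 10, 8, 6, 2, 2, 2, 2,
                             1, 1, 1, 1, 1, 1, 1, 1, 1],
    "Aquatic → Marine": [10, 5, 4],
    "Terrestrial → Deep Subsurface": [4, 1, 1],
}

# Sum the per-habitat percentages within each ecosystem type.
totals = {eco_type: sum(values) for eco_type, values in PERCENT_BY_TYPE.items()}

for eco_type, percent in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{eco_type}: {percent}%")
# prints:
# Aquatic → Freshwater: 75%
# Aquatic → Marine: 19%
# Terrestrial → Deep Subsurface: 6%
```

Grouped this way, three quarters of the family members come from freshwater samples, with the remainder split between marine-classified estuarine or coastal samples and deep subsurface samples.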




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300002206 | Freshwater microbial communities from San Paulo Zoo lake, Brazil - OCT 2012 | Environmental
3300002408 | Freshwater microbial communities from Lake Mendota, WI, sample - 15JUL2010 deep hole epilimnion (Lake Mendota Combined assembly, ASSEMBLY_DATE=20140123) | Environmental
3300005517 | Freshwater lake microbial communities from Lake Michigan, USA - Su13.BD.MM15.SN (version 4) | Environmental
3300005581 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER78MSRF | Environmental
3300005585 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ON33MSRF | Environmental
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental
3300006030 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_<0.8_DNA | Environmental
3300006639 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series FC 2014_7_11 | Environmental
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental
3300006805 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA | Environmental
3300006862 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series LW 2014_7_11 | Environmental
3300006875 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA | Environmental
3300007162 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series HT 2014_7_11 | Environmental
3300007363 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA | Environmental
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental
3300008072 | Microbial Communities in Water bodies, Singapore - Site MA | Environmental
3300008114 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE2, Sample E2014-0106-C-NA | Environmental
3300008448 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - August 4, 2014 all contigs | Environmental
3300009068 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_140807_MF_MetaG | Environmental
3300009085 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 10-12cm September2015 | Environmental
3300009146 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 | Environmental
3300009152 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG | Environmental
3300009158 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_MF_MetaG | Environmental
3300009159 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140212_EF_MetaG | Environmental
3300009163 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_140205_XF_MetaG | Environmental
3300009165 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015 | Environmental
3300009168 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 | Environmental
3300009169 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm May2015 | Environmental
3300009170 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 1-3cm May2015 | Environmental
3300010354 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA | Environmental
3300011338 | Lotic viral community from Han River, Hwacheon, Gangwon-do, South Korea - Haengju | Environmental
3300013004 | Eutrophic lake water microbial communities from Lake Mendota, Wisconsin, USA - GEODES118 metaG | Environmental
3300013005 | Eutrophic lake water microbial communities from Lake Mendota, Wisconsin, USA - GEODES117 metaG | Environmental
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental
3300014720 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_35m | Environmental
3300014811 | Aquatic viral communities from ballast water - Michigan State University - AB_ballast water | Environmental
3300017747 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.S.N | Environmental
3300017754 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.D.D | Environmental
3300017777 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.D.N | Environmental
3300017785 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.D.N | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental
3300020533 | Freshwater microbial communities from Lake Mendota, WI - 08JUN2012 deep hole epilimnion (SPAdes) | Environmental
3300021961 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_3D | Environmental
3300021962 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_649D | Environmental
3300021963 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_657D | Environmental
3300022179 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.D.N | Environmental
3300022190 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.N | Environmental
3300023179 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1510 | Environmental
3300024490 | Freshwater microbial communities from Mississippi River, Louisiana, United States - Miss_Cont_RepB_0h | Environmental
3300025896 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA (SPAdes) | Environmental
3300027693 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 1-3cm September2015 (SPAdes) | Environmental
3300027721 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm May2015 (SPAdes) | Environmental
3300027728 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_14m | Environmental
3300027730 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_8m | Environmental
3300027734 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_EF_MetaG (SPAdes) | Environmental
3300027759 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130206_EF_MetaG (SPAdes) | Environmental
3300027900 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - BRP12 BR (SPAdes) | Environmental
3300027977 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_12m | Environmental
3300028025 | Subsurface sediment microbial communities from gas well in West Virginia, United States - MSEEL Well Study Marcellus 5H_FC | Environmental
3300029268 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_19m | Environmental
3300029930 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727 | Environmental
3300031758 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123 | Environmental
3300031857 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125 | Environmental
3300033996 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME20Jul2016-rr0004 | Environmental
3300034019 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME24Sep2014-rr0049 | Environmental
3300034072 | Fracking water microbial communities from deep shales in Oklahoma, United States - MC-3-A | Environmental
3300034092 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME03Aug2012-rr0069 | Environmental
3300034101 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME19Sep2005-rr0107 | Environmental
3300034116 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-CONTROL-GENDONOR | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
metazooDRAFT_13134463 | 3300002206 | Lake | MTFNNYKKTIAKLLKAYHKKYDAFGKKKNGSKKSRT*
B570J29032_1090784862 | 3300002408 | Freshwater | MTVSGYKRAIAKLLKAYHKKWDCFGNKKNGSKKSRT*
B570J29032_1094542852 | 3300002408 | Freshwater | MTISGYKKAISKLLKAYHKKWDCFGKEKKKNGSKKSRT*
Ga0070374_100326629 | 3300005517 | Freshwater Lake | MTLSGYKKSIAKLLKAYHKKFDCFGKRKNGSKKT*
Ga0049081_102604123 | 3300005581 | Freshwater Lentic | MTVSGYKRAIAKLLKAYRKKYDAFGKERPKKRKRIK*
Ga0049084_101657213 | 3300005585 | Freshwater Lentic | MTVSGYKKAIAKLLKAYRKKYDAFGKERPKKRKRIK*
Ga0079957_10168223 | 3300005805 | Lake | MKMISKYSRNIKKLLKEYHRKWDCFGNRRNKKKK*
Ga0079957_10372494 | 3300005805 | Lake | MTISGYKKAIAKLLKAYHKKWDCFGNERNKKKKKSK*
Ga0079957_10488716 | 3300005805 | Lake | MISKYKKAIAKLLKEYHKKWDCFGNERKKNGSKKSRT*
Ga0079957_10864237 | 3300005805 | Lake | MTISGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT*
Ga0079957_11241352 | 3300005805 | Lake | MTISGYKKAIARLLKAYKKKYDAFGNERKKKNGSKKS*
Ga0079957_11934454 | 3300005805 | Lake | CRTITMTISGYKKAIARLLKAYKKKYDAFGKERKNGSKKSRT*
Ga0075470_100651736 | 3300006030 | Aqueous | MTYTMYKKAIAKLLKAYHKKWNAFGEPRKKNGSKKSRT*
Ga0079301_10256382 | 3300006639 | Deep Subsurface | MTIAGYKKAIAKLLKAYHKKWDCFGNKKNGSKKSRT*
Ga0070749_104456542 | 3300006802 | Aqueous | MISKYKKAIAKLLKEYHKKWDCFGNERNKNESKKSRT*
Ga0070749_105325182 | 3300006802 | Aqueous | MTISGYKKAIAKLLKAYHKKWDCFGKKKNGSKKSRT*
Ga0075464_101439322 | 3300006805 | Aqueous | MTLTSYKKAIAKLLKAYHKKWDAFGKARKGKRRVKR*
Ga0075464_104928022 | 3300006805 | Aqueous | MTLTMYKKAMVKLLKAYHKKWDAFGKQRPKKRKRSK*
Ga0075464_105660633 | 3300006805 | Aqueous | MTISGYKKAISKLLKAYHKKWDCFGKKKNGSKKSRT*
Ga0079299_10070742 | 3300006862 | Deep Subsurface | MKNKYLKQIQKLLKSYHKKWDCFGNRKNEKKNKTR*
Ga0075473_104031412 | 3300006875 | Aqueous | MTYTNYKKAIAKLLKAYHKKWNAFGEPRKKNGSKKSRT*
Ga0079300_100209397 | 3300007162 | Deep Subsurface | MTVAGYKKAIAKLLKAYHKKWDCFGNKKNGSKKSRT*
Ga0079300_100694535 | 3300007162 | Deep Subsurface | MTISGYKKAIAKLLKAYHKKWDCFGNKKNGSKKSRT*
Ga0075458_100951144 | 3300007363 | Aqueous | MTFTNYKKAIAKLLKAYHKKYNAFGKERLKKRKR*
Ga0099848_12468521 | 3300007541 | Aqueous | MTISGYKKAIAKLLKAYHKKWDCFGRERKKNGSKKSRT*
Ga0110929_10146288 | 3300008072 | Water Bodies | MTMTQYKKAIAKLLKAYHKKWDCFGKRRKNGSKKS*
Ga0114347_100095418 | 3300008114 | Freshwater, Plankton | MTISGYKKAIARLLKAYKKKYDALGRERKNGSKKSRA*
Ga0114876_11227473 | 3300008448 | Freshwater Lake | MTMTQYKKTIAKLLKAYHKKWDCFGKEKKKNGSKKSRT*
Ga0114973_107163772 | 3300009068 | Freshwater Lake | MTISEYKKSIAKLLKAYHKKFDCFGKRKNGSKKK*
Ga0105103_109064021 | 3300009085 | Freshwater Sediment | MTFSNYKKAIARLLKAYHKKYDAFGKKKNGSKKSRA*
Ga0105091_105926633 | 3300009146 | Freshwater Sediment | MTITQYKKAIAKLLKAYHKKWNAFGEPVKKKNGSKKSRT*
Ga0114980_100006637 | 3300009152 | Freshwater Lake | MTMSGYRKAIIKLLKAYHKKYDAFGKQKPRKKKRSK*
Ga0114980_102072074 | 3300009152 | Freshwater Lake | MTLTNYKKAIAKLLKAYHKKWDAFGKAKKGKRRVKR*
Ga0114977_100468396 | 3300009158 | Freshwater Lake | MTLTSYKKAIAKLLKAYHKKWDAFGKAKKGKRRVKR*
Ga0114978_100970855 | 3300009159 | Freshwater Lake | MTQTSDKKAIDKLLKAYHKKWDAFGKQRPKKRKRSK*
Ga0114970_107687622 | 3300009163 | Freshwater Lake | MTISKYKKSIAKLLKAYHKKFDCFGKRKNVSKKT*
Ga0105102_100072956 | 3300009165 | Freshwater Sediment | MTFNNYKKTIAKLLKAYHKKYDCFGKKKNESKKSRA*
Ga0105102_100654202 | 3300009165 | Freshwater Sediment | MTFSNYKKTIARLLKAYHKKYDAFGKKKNGSKKSRA*
Ga0105104_100323672 | 3300009168 | Freshwater Sediment | MTFSNYKKAIARLLKAYHKKYDAFGKKKNGSKKSRT*
Ga0105104_100963466 | 3300009168 | Freshwater Sediment | MTFNNYKKTIAKLLKAYHKKYDCFGKKKNGSKKSRA*
Ga0105097_101498237 | 3300009169 | Freshwater Sediment | MTIAGYKKAIAKLLKAYHKKWNAFGEPVKKKNGSKKSR
Ga0105096_107862262 | 3300009170 | Freshwater Sediment | MTFSNYKKKIARLLKAYHKKYDAFGKKKNGSKKSRA*
Ga0129333_101155093 | 3300010354 | Freshwater To Marine Saline Gradient | MTISEYKKAIAKLLKAYHKKWDCFGRKRKKNGSKKSRT*
Ga0129333_102106964 | 3300010354 | Freshwater To Marine Saline Gradient | MTISGYKKAIARLLKAYKKKYDAFGKERKNGSKKSRT*
Ga0129333_108686922 | 3300010354 | Freshwater To Marine Saline Gradient | MTISQYKKAIAKLLKAYHKKWDCFGKEKKKNGSKKSRT*
Ga0129333_117263003 | 3300010354 | Freshwater To Marine Saline Gradient | MTISGYKKAISKLLKTYHKKWDCFGRERKKNGSKKSRT*
Ga0153699_120510 | 3300011338 | Freshwater | MTVSGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT*
Ga0164293_104505262 | 3300013004 | Freshwater | MTVSGYKRAIAKLLKAYHKKWDCFGNKKNGPKKSRT*
Ga0164293_108212152 | 3300013004 | Freshwater | MTISGYKKAIAKLLKAYHKKWDCFGKRKNGSKKSRT*
Ga0164292_103328813 | 3300013005 | Freshwater | MTITQYKKAIAKLLKAYHKKWDCFGKKKNGSKKSRT*
(restricted) Ga0172367_103341291 | 3300013126 | Freshwater | AMTVSGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKS*
(restricted) Ga0172376_103141742 | 3300014720 | Freshwater | MTVSGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKS*
(restricted) Ga0172376_103464772 | 3300014720 | Freshwater | MIISGYKKAIAKLLKTYHKKWDCFGNERNKKKKK*
(restricted) Ga0172376_104257432 | 3300014720 | Freshwater | MTISGYKKAIVKLLKAYHKKWDCFGNERNKKKKK*
Ga0119960_10980151 | 3300014811 | Aquatic | MTVSGYKRAIAKLLKAYRKKYDAFGKERPKKRKRTK*
Ga0181352_10286534 | 3300017747 | Freshwater Lake | MKANKYLKQIQKLLKSYHKKWDCFGNIKNEKKNKTR
Ga0181344_10889591 | 3300017754 | Freshwater Lake | NKDGTMTKSGYKKAIAKLLKAYHKKWDCFGKRKNGSKKSRT
Ga0181357_11325565 | 3300017777 | Freshwater Lake | MTVSGYKRAIAKLLKAYRKKYDAFGKERPKKRKRIKXYKIEDL
Ga0181355_12779924 | 3300017785 | Freshwater Lake | MTFTNYKKAIAKLLKAYHKKWDAFGNKKNGSKKSRT
Ga0181355_13408032 | 3300017785 | Freshwater Lake | MTISQYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT
Ga0181359_11005491 | 3300019784 | Freshwater Lake | LMTITQYKKAIAKLLKAYHKKWDCFGKKKNGSKKSRT
Ga0181359_11328333 | 3300019784 | Freshwater Lake | MTVSGYKRAIAKLLKAYRKKYDAFGKERPKKRKRIK
Ga0194113_103082914 | 3300020074 | Freshwater Lake | MTISGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKS
Ga0208364_10019645 | 3300020533 | Freshwater | MTVAGYKKAIAKLLKAYHKKWDCFGNKKNGSKKSRT
Ga0222714_104711182 | 3300021961 | Estuarine Water | MTIAGYKKAIAKLLKAYHKKWDCFGNKKNGSKKSRT
Ga0222714_106169402 | 3300021961 | Estuarine Water | MTVSGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT
Ga0222713_100845957 | 3300021962 | Estuarine Water | MTISGYKKAIAKLLKAYHKKWDCFGKERKKNGSKKSRT
Ga0222712_102540292 | 3300021963 | Estuarine Water | MTITQYKKAIAKLLKAYHKKWDCFGKKKNGSKKSRT
Ga0222712_104375673 | 3300021963 | Estuarine Water | MTISGYKKAIARLLKAYKKKYDALGRERKNGSKKSRT
Ga0181353_10654992 | 3300022179 | Freshwater Lake | MKNKYLKQIQKLLKSYHKKWDCFGNIKNEKKNKTR
Ga0181354_10221894 | 3300022190 | Freshwater Lake | MTLSSYKKAIAKLLKAYHKKWDCFGKKKNGSKKSRT
Ga0214923_1000232525 | 3300023179 | Freshwater | MTMTQYKKAIAKLLKAYHKKWDCFGKKRKNGSKKS
Ga0214923_104924392 | 3300023179 | Freshwater | MTLTNYKKAIAKLLKAYHKKWDAFGKARKGKRRVKR
Ga0255185_10073924 | 3300024490 | Freshwater | MTISGYKKAIAKLLKAYKKKYDAFGKERKNGSKKSRT
Ga0208916_102662263 | 3300025896 | Aqueous | MTLTSYKKAIAKLLKAYHKKWDAFGKARKGKRRVKR
Ga0209704_10087495 | 3300027693 | Freshwater Sediment | MTFNNYKKTIAKLLKAYHKKYDVFGKKKNGSKKSRT
Ga0209492_11662835 | 3300027721 | Freshwater Sediment | MTFTTYKKAIEKLLKAYHKKYDCFGKKKNGSKKSRT
(restricted) Ga0247836_10044008 | 3300027728 | Freshwater | MTVSGYKKAIAKLLKAYHKKWDCFGKERKKKNGRR
(restricted) Ga0247836_10309986 | 3300027728 | Freshwater | MIISGYKKAIAKLLKAYHKKWDCFGKERKKKNGNR
(restricted) Ga0247836_10563657 | 3300027728 | Freshwater | MTISGYKKAIAKLLKAYHKKWDCFGKERKKKNGRR
(restricted) Ga0247836_11372582 | 3300027728 | Freshwater | MTVSGYKKAIVKLLKAYHKKWDCFGKERKKKNGRR
(restricted) Ga0247833_10435735 | 3300027730 | Freshwater | MTISGYKKAIAKLLKAYHKKWDCFGKERKKKNGSR
(restricted) Ga0247833_10446815 | 3300027730 | Freshwater | MTISEYKKAIAKLLKAYHKKWDCFGKERKKKNGNR
(restricted) Ga0247833_10605791 | 3300027730 | Freshwater | NNGNAMTISGYKKAIAKLLKAYHKKWDAFGKKRKKKNGNR
Ga0209087_10012553 | 3300027734 | Freshwater Lake | MTLTSYKKAIAKLLKAYHKKWDAFGKAKKGKRRVKR
Ga0209296_12571504 | 3300027759 | Freshwater Lake | MTLTNYKKAIAKLLKAYHKKWDAFGKAKKGKRRVKR
Ga0209253_101513918 | 3300027900 | Freshwater Lake Sediment | MTISGYKKAIAKLLKSYHKKWDCFGKRKNGSKKSRT
(restricted) Ga0247834_10819133 | 3300027977 | Freshwater | MTISGYKKAIAKLLKAYHKKWDAFGKKRKKKNGNR
Ga0247723_10434294 | 3300028025 | Deep Subsurface Sediment | MTVSGYKKAIAKLLKAYHKKWDCFGNKKNGPKKSRT
(restricted) Ga0247842_101531456 | 3300029268 | Freshwater | MTVSGYKKAIAKLLKAYHKKWDCFGKERKKKNGNR
Ga0119944_10221661 | 3300029930 | Aquatic | MTISGYKKAIARLLKAYKKKYDAFGKERKNGSKKS
Ga0315907_1000349520 | 3300031758 | Freshwater | MTISGYKKAIARLLKAYKKKYDALGRERKNGSKKSRA
Ga0315909_106429692 | 3300031857 | Freshwater | MTMTQYKKAIAKLLKAYHKKWDCFGKEKKKNGSKKSRT
Ga0334979_0051510_768_878 | 3300033996 | Freshwater | MTVSGYKRAIAKLLKAYHKKWDCFGNKKNGPKKSRT
Ga0334998_0056995_984_1094 | 3300034019 | Freshwater | MTFTNYKKAIAKLLKAYHKKWDCFGKKKNGSKKSQT
Ga0334998_0210068_425_535 | 3300034019 | Freshwater | MTMSGYKRAIAKLLKAYHKKWDCFGNKKNGSKKSRT
Ga0310127_003447_3245_3361 | 3300034072 | Fracking Water | MTISGYKKAISKLLKAYHKKWDCFGKERKKNGSKKSRT
Ga0335010_0677066_403_510 | 3300034092 | Freshwater | TISGYKKSISKLLKAYHKKWDCFGKKKNGSKKTRT
Ga0335027_0745111_295_405 | 3300034101 | Freshwater | MTFNNYKKTIAKLLKAYHKKYDCFGKKKNGSKKSRT
Ga0335068_0577811_147_257 | 3300034116 | Freshwater | MTMTQYKKAIAKLLKAYHKKWDCFGKKKNGSKKPRT




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice