NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F104726

Overview

Basic Information
Family ID: F104726
Family Type: Metagenome
Number of Sequences: 100
Average Sequence Length: 49 residues
Representative Sequence: LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Number of Associated Samples: 61
Number of Associated Scaffolds: 100

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 80.81 %
% of genes near scaffold ends (potentially truncated): 24.00 %
% of genes from short scaffolds (< 2000 bps): 72.00 %
Associated GOLD sequencing projects: 51
AlphaFold2 3D model prediction: No

Most Common Taxonomy
Group: Unclassified (68.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake (33.000 % of family members)
Environment Ontology (ENVO): Unclassified (84.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline) (85.000 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 21.57%, β-sheet 7.84%, Coil/Unstructured 70.59%
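
The three percentages are consistent with whole residue counts over a 51-residue sequence, which is the length of the representative sequence listed under Basic Information. The page does not say which sequence the prediction was run on, so that link is an assumption. A minimal Python sketch of the arithmetic, with illustrative variable and function names:

```python
# Assumption: the reported percentages refer to the 51-residue representative sequence.
representative = "LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI"

# Secondary structure distribution as reported on the page (percent).
reported = {"alpha_helix": 21.57, "beta_sheet": 7.84, "coil": 70.59}


def residue_counts(percentages, length):
    """Convert percentage shares into the nearest whole number of residues."""
    return {name: round(pct * length / 100) for name, pct in percentages.items()}


counts = residue_counts(reported, len(representative))
print(counts)
```

If the assumption holds, the counts add up to the full sequence length, which is a quick sanity check on the reported distribution.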

Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF08401 | ArdcN | 37.00
PF02767 | DNA_pol3_beta_2 | 3.00
PF14279 | HNH_5 | 3.00
PF13481 | AAA_25 | 2.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG4227 | Antirestriction protein ArdC | Replication, recombination and repair [L] | 37.00
COG0592 | DNA polymerase III sliding clamp (beta) subunit, PCNA homolog | Replication, recombination and repair [L] | 3.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 68.00 %
All Organisms | root | All Organisms | 32.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002408|B570J29032_109346534 | Not Available | 704
3300003277|JGI25908J49247_10094305 | Not Available | 725
3300003277|JGI25908J49247_10169006 | Not Available | 506
3300003393|JGI25909J50240_1054067 | Not Available | 831
3300004461|Ga0066223_1279702 | Not Available | 892
3300004771|Ga0007797_1033762 | Not Available | 1397
3300004772|Ga0007791_10021212 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium Tous-C9LFEB | 2307
3300005581|Ga0049081_10051477 | All Organisms → cellular organisms → Bacteria | 1559
3300005583|Ga0049085_10276393 | Not Available | 548
3300005940|Ga0073913_10020575 | Not Available | 950
3300006805|Ga0075464_10031400 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 2836
3300006805|Ga0075464_10047481 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 2358
3300006805|Ga0075464_10064453 | Not Available | 2053
3300006805|Ga0075464_10539915 | Not Available | 715
3300006805|Ga0075464_11065875 | Not Available | 509
3300008450|Ga0114880_1023265 | All Organisms → cellular organisms → Bacteria | 2857
3300009068|Ga0114973_10026989 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium Tous-C9LFEB | 3505
3300009068|Ga0114973_10292702 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 869
3300009151|Ga0114962_10040629 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 3122
3300009151|Ga0114962_10723762 | Not Available | 507
3300009152|Ga0114980_10035997 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 3045
3300009154|Ga0114963_10015118 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 5198
3300009154|Ga0114963_10228142 | Not Available | 1065
3300009154|Ga0114963_10582058 | Not Available | 588
3300009155|Ga0114968_10070248 | All Organisms → cellular organisms → Bacteria | 2199
3300009155|Ga0114968_10519056 | Not Available | 638
3300009158|Ga0114977_10506485 | Not Available | 660
3300009159|Ga0114978_10272728 | Not Available | 1044
3300009180|Ga0114979_10321204 | Not Available | 917
3300009183|Ga0114974_10066763 | Not Available | 2369
3300009183|Ga0114974_10333430 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 884
3300010157|Ga0114964_10108853 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Thiotrichales → Francisellaceae → Francisella → unclassified Francisella → Francisella sp. SYW-9 | 1370
3300010160|Ga0114967_10165325 | All Organisms → cellular organisms → Bacteria | 1217
3300010160|Ga0114967_10222544 | All Organisms → Viruses → Predicted Viral | 1003
3300010370|Ga0129336_10029639 | All Organisms → cellular organisms → Bacteria | 3313
3300010370|Ga0129336_10218383 | Not Available | 1079
3300010885|Ga0133913_10631664 | All Organisms → cellular organisms → Bacteria | 2809
3300011114|Ga0151515_10769 | Not Available | 13397
3300017701|Ga0181364_1037614 | Not Available | 776
3300017701|Ga0181364_1056565 | Not Available | 610
3300017716|Ga0181350_1168488 | Not Available | 505
3300017723|Ga0181362_1022213 | All Organisms → cellular organisms → Bacteria | 1362
3300017723|Ga0181362_1039972 | Not Available | 989
3300017736|Ga0181365_1030791 | Not Available | 1353
3300017774|Ga0181358_1042879 | Not Available | 1733
3300017774|Ga0181358_1122805 | Not Available | 911
3300017774|Ga0181358_1218336 | Not Available | 615
3300017777|Ga0181357_1049305 | Not Available | 1645
3300017777|Ga0181357_1130838 | Not Available | 936
3300017780|Ga0181346_1068672 | Not Available | 1412
3300017784|Ga0181348_1082892 | Not Available | 1269
3300017784|Ga0181348_1084336 | Not Available | 1256
3300019784|Ga0181359_1005579 | Not Available | 4116
3300019784|Ga0181359_1034285 | All Organisms → cellular organisms → Bacteria | 1964
3300019784|Ga0181359_1047980 | Not Available | 1649
3300019784|Ga0181359_1053551 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Arcobacteraceae → Aliarcobacter → Aliarcobacter butzleri | 1554
3300019784|Ga0181359_1080921 | Not Available | 1217
3300019784|Ga0181359_1084017 | Not Available | 1189
3300019784|Ga0181359_1094314 | All Organisms → cellular organisms → Bacteria | 1106
3300019784|Ga0181359_1132039 | Not Available | 879
3300020498|Ga0208050_1008555 | Not Available | 1187
3300022190|Ga0181354_1022608 | All Organisms → cellular organisms → Bacteria | 2009
3300022190|Ga0181354_1150881 | Not Available | 727
3300022407|Ga0181351_1055630 | Not Available | 1640
3300022407|Ga0181351_1068503 | Not Available | 1442
3300022752|Ga0214917_10096515 | All Organisms → cellular organisms → Bacteria | 1739
3300022752|Ga0214917_10102080 | Not Available | 1665
3300023174|Ga0214921_10308238 | Not Available | 876
3300023179|Ga0214923_10131733 | Not Available | 1606
3300023179|Ga0214923_10148530 | All Organisms → cellular organisms → Bacteria | 1470
3300023179|Ga0214923_10377418 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Methanomicrobia → Methanosarcinales → Candidatus Methanoperedenaceae → Candidatus Methanoperedens | 736
3300023179|Ga0214923_10516248 | Not Available | 583
3300023184|Ga0214919_10077123 | Not Available | 2966
3300023184|Ga0214919_10154541 | Not Available | 1814
3300025896|Ga0208916_10245139 | Not Available | 778
3300027547|Ga0209864_1016012 | Not Available | 861
3300027608|Ga0208974_1122897 | Not Available | 677
3300027712|Ga0209499_1026644 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 2598
3300027733|Ga0209297_1384397 | Not Available | 500
3300027734|Ga0209087_1016082 | All Organisms → cellular organisms → Bacteria | 3730
3300027736|Ga0209190_1014612 | All Organisms → cellular organisms → Bacteria | 4545
3300027736|Ga0209190_1098600 | Not Available | 1351
3300027736|Ga0209190_1204132 | Not Available | 812
3300027741|Ga0209085_1021513 | All Organisms → cellular organisms → Bacteria | 3099
3300027749|Ga0209084_1052091 | Not Available | 1962
3300027759|Ga0209296_1036085 | Not Available | 2683
3300027759|Ga0209296_1048846 | Not Available | 2219
3300027785|Ga0209246_10392207 | Not Available | 523
3300027808|Ga0209354_10022691 | Not Available | 2501
3300027808|Ga0209354_10265090 | Not Available | 688
3300027963|Ga0209400_1027293 | All Organisms → cellular organisms → Bacteria | 3231
3300027973|Ga0209298_10009192 | All Organisms → cellular organisms → Bacteria | 5347
3300027973|Ga0209298_10337421 | Not Available | 579
(restricted) 3300027977|Ga0247834_1182397 | Not Available | 811
(restricted) 3300028553|Ga0247839_1261883 | Not Available | 668
3300031539|Ga0307380_10226396 | Not Available | 1786
3300031565|Ga0307379_10324061 | Not Available | 1509
3300031707|Ga0315291_10548705 | Not Available | 1060
3300034106|Ga0335036_0017390 | All Organisms → cellular organisms → Bacteria → PVC group | 5826
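
Each scaffold identifier in this table appears to join a taxon OID and a scaffold name with a "|" character; the numeric prefix matches the Taxon OID values listed under Associated Samples. That reading is inferred from this page rather than from a documented specification. Under that assumption, a short Python sketch that splits a few identifiers and counts scaffolds per sample (function and variable names are illustrative):

```python
from collections import Counter

# A few scaffold identifiers copied from the table above.
scaffold_ids = [
    "3300006805|Ga0075464_10031400",
    "3300006805|Ga0075464_10047481",
    "3300009068|Ga0114973_10026989",
    "3300019784|Ga0181359_1005579",
]


def split_scaffold_id(scaffold_id):
    """Split 'taxonOID|scaffoldName' into its two parts (format inferred from this page)."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


# Number of family scaffolds contributed by each sample (taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
print(per_sample.most_common())
```

Grouping by the prefix is one way to see which samples contribute several scaffolds to the family.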

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 33.00%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 32.00%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 12.00%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 6.00%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater | 3.00%
Freshwater Lentic | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic | 3.00%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 2.00%
Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Sand | 2.00%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 2.00%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater | 1.00%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 1.00%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.00%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 1.00%
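
Because the habitat paths are hierarchical, the per-row shares can be rolled up to a coarser level. The Python sketch below hard-codes the rows of the table above and sums them by the third node of each path (index 2); the helper name and the choice of level are illustrative, not something the database defines.

```python
from collections import defaultdict

# (GOLD ecosystem path, share of family members in %) copied from the table above.
HABITATS = [
    ("Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake", 33.0),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake", 32.0),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater", 12.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous", 6.0),
    ("Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater", 3.0),
    ("Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lentic", 3.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient", 2.0),
    ("Environmental → Terrestrial → Soil → Sand → Unclassified → Sand", 2.0),
    ("Environmental → Terrestrial → Soil → Clay → Unclassified → Soil", 2.0),
    ("Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater", 1.0),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake", 1.0),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater", 1.0),
    ("Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment", 1.0),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Marine", 1.0),
]


def totals_by_level(rows, level):
    """Sum the percentage shares of all rows that share the same path node at `level`."""
    totals = defaultdict(float)
    for path, share in rows:
        node = path.split(" → ")[level]
        totals[node] += share
    return dict(totals)


by_type = totals_by_level(HABITATS, 2)
print(by_type)
```

For this family the roll-up gives 87 % Freshwater, 9 % Marine and 4 % Soil, in line with the freshwater lake ecosystem reported as most common in the Overview.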




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002408 | Freshwater microbial communities from Lake Mendota, WI, sample - 15JUL2010 deep hole epilimnion (Lake Mendota Combined assembly, ASSEMBLY_DATE=20140123) | Environmental
3300003277 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.SD | Environmental
3300003393 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.DD | Environmental
3300004461 | Marine viral communities from Newfoundland, Canada BC-2 | Environmental
3300004771 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - MA2.5M | Environmental
3300004772 | Freshwater microbial communities from Crystal Bog, Wisconsin, USA - MA0.5M | Environmental
3300005581 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER78MSRF | Environmental
3300005583 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG SU08MSRF | Environmental
3300005940 | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_25-Nov-14 | Environmental
3300006805 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA | Environmental
3300008450 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - Oct 27, 2014 all contigs | Environmental
3300009068 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_140807_MF_MetaG | Environmental
3300009151 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_MF_MetaG | Environmental
3300009152 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG | Environmental
3300009154 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_131016_EF_MetaG | Environmental
3300009155 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_EF_MetaG | Environmental
3300009158 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_MF_MetaG | Environmental
3300009159 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140212_EF_MetaG | Environmental
3300009180 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140625_EF_MetaG | Environmental
3300009183 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130206_EF_MetaG | Environmental
3300010157 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_131016_MF_MetaG | Environmental
3300010160 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130628_MF_MetaG | Environmental
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental
3300010885 | northern Canada Lakes Co-assembly | Environmental
3300011114 | Freshwater viral communities from Lake Soyang, Gangwon-do, South Korea - SYL_2016Feb | Environmental
3300017701 | Freshwater viral communities from Lake Michigan, USA - Fa13.ND.MM110.S.N | Environmental
3300017716 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.DCM.D | Environmental
3300017723 | Freshwater viral communities from Lake Michigan, USA - Su13.ND.MM110.S.N | Environmental
3300017736 | Freshwater viral communities from Lake Michigan, USA - Fa13.ND.MM110.D.N | Environmental
3300017774 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.D | Environmental
3300017777 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.D.N | Environmental
3300017780 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM15.D.N | Environmental
3300017784 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.D.N | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020498 | Freshwater microbial communities from Lake Mendota, WI - 13JUN2010 deep hole epilimnion (SPAdes) | Environmental
3300022190 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.N | Environmental
3300022407 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM15.S.D | Environmental
3300022752 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL_1208_BB | Environmental
3300023174 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1505 | Environmental
3300023179 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1510 | Environmental
3300023184 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1503 | Environmental
3300025896 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Spr_0.19_<0.8_DNA (SPAdes) | Environmental
3300027547 | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_25-Nov-14 (SPAdes) | Environmental
3300027608 | Freshwater lentic microbial communities from great Laurentian Lakes, MI, USA - Great Lakes metaG ER15MSRF (SPAdes) | Environmental
3300027712 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130208_EF_MetaG (SPAdes) | Environmental
3300027733 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_MF_MetaG (SPAdes) | Environmental
3300027734 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130805_EF_MetaG (SPAdes) | Environmental
3300027736 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_140205_XF_MetaG (SPAdes) | Environmental
3300027741 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_131016_EF_MetaG (SPAdes) | Environmental
3300027749 | Freshwater microbial communities from Lake Croche, Canada to study carbon cycling - C_130820_MF_MetaG (SPAdes) | Environmental
3300027759 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_130206_EF_MetaG (SPAdes) | Environmental
3300027785 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.SN (SPAdes) | Environmental
3300027808 | Freshwater lake microbial communities from Lake Michigan, USA - Sp13.BD.MM15.DD (SPAdes) | Environmental
3300027963 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130807_EF_MetaG (SPAdes) | Environmental
3300027973 | Freshwater microbial communities from Lake Simoncouche, Canada to study carbon cycling - S_140806_EF_MetaG (SPAdes) | Environmental
3300027977 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_12m | Environmental
3300028553 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_16m | Environmental
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental
3300031565 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-2 | Environmental
3300031707 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_20 | Environmental
3300034106 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME23Aug2013-rr0131 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
B570J29032_1093465342 | 3300002408 | Freshwater | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI*
JGI25908J49247_100943051 | 3300003277 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESV*
JGI25908J49247_101690061 | 3300003277 | Freshwater Lake | LRTYQVEVFDLKDQYGGGMPCLQFEAQSEREAFEVALESLGVYMKIGEPS*
JGI25909J50240_10540672 | 3300003393 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGI
Ga0066223_12797021 | 3300004461 | Marine | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMESAS*
Ga0007797_10337622 | 3300004771 | Freshwater | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS*
Ga0007791_100212121 | 3300004772 | Freshwater | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYM
Ga0049081_100514773 | 3300005581 | Freshwater Lentic | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININIKEESI*
Ga0049085_102763933 | 3300005583 | Freshwater Lentic | LRTYQVEVYDLKDQYGGGMPCLQFEAQSEREAFEVTLESLGLCMKIGEPS*
Ga0073913_100205752 | 3300005940 | Sand | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI*
Ga0075464_100314003 | 3300006805 | Aqueous | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGISINMNEENI*
Ga0075464_100474816 | 3300006805 | Aqueous | LKTYLVEVFDNKDQYGSGMPCLQFEATCRQEAYETIFESLGININIREESI*
Ga0075464_100644534 | 3300006805 | Aqueous | LKTYLVEVFDSKDQYGGGMPCLQFEATCKQEAYETIFESLGININMREESV*
Ga0075464_105399152 | 3300006805 | Aqueous | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFETLGINIKMREESI*
Ga0075464_110658751 | 3300006805 | Aqueous | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININMKEESI*
Ga0114880_10232654 | 3300008450 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESI*
Ga0114973_100269894 | 3300009068 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPVLQFEATCEKEAHQIIMESFGVYMTMESAS*
Ga0114973_102927022 | 3300009068 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGINISMKEESI*
Ga0114962_100406291 | 3300009151 | Freshwater Lake | HGKGKGLKTYLVEVFDKKDQYCGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS*
Ga0114962_107237622 | 3300009151 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININIREESI*
Ga0114980_100359975 | 3300009152 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVVS*
Ga0114963_1001511811 | 3300009154 | Freshwater Lake | LKTYLVEVFDKKDSHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMESAS*
Ga0114963_102281423 | 3300009154 | Freshwater Lake | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIFMTMESAS*
Ga0114963_105820582 | 3300009154 | Freshwater Lake | FDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININIREESI*
Ga0114968_100702484 | 3300009155 | Freshwater Lake | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMGSAS*
Ga0114968_105190561 | 3300009155 | Freshwater Lake | LKTYLVEVFDSKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI*
Ga0114977_105064852 | 3300009158 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLGININIREESI*
Ga0114978_102727282 | 3300009159 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLGININMREESV*
Ga0114979_103212041 | 3300009180 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLGININMREE
Ga0114974_100667634 | 3300009183 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKDESI*
Ga0114974_103334302 | 3300009183 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS*
Ga0114964_101088533 | 3300010157 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININIREESI*
Ga0114967_101653252 | 3300010160 | Freshwater Lake | LKTYLVEVFDSKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESV*
Ga0114967_102225442 | 3300010160 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS*
Ga0129336_100296394 | 3300010370 | Freshwater To Marine Saline Gradient | LKTYLVEVFDKKDSHGLSMPVLQFEATSEKEAHEIVMESLGIYMTMESAS*
Ga0129336_102183833 | 3300010370 | Freshwater To Marine Saline Gradient | LKTYLVEVFDKKDSHGLSMPVLQFEATSEKEAHEIVMESLGVYMTMESAS*
Ga0133913_106316644 | 3300010885 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLCININIREESI*
Ga0151515_1076915 | 3300011114 | Freshwater | LKTYLVEVFDKKDSHGLSMPVLQFEATCEKEAHEIVMESLGIYMTIGEAS*
Ga0181364_10376141 | 3300017701 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGIN
Ga0181364_10565651 | 3300017701 | Freshwater Lake | DKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0181350_11684882 | 3300017716 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREEIV
Ga0181362_10222132 | 3300017723 | Freshwater Lake | LKTYLVEVFDKKDQYGDGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0181362_10399721 | 3300017723 | Freshwater Lake | DQYGGGMPSLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0181365_10307912 | 3300017736 | Freshwater Lake | LKTYLVEVLDNKDQYDGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0181358_10428795 | 3300017774 | Freshwater Lake | KGESLKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0181358_11228051 | 3300017774 | Freshwater Lake | VEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0181358_12183362 | 3300017774 | Freshwater Lake | VEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESV
Ga0181357_10493054 | 3300017777 | Freshwater Lake | LKTYLVEVFDSKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESI
Ga0181357_11308382 | 3300017777 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGINI
Ga0181346_10686724 | 3300017780 | Freshwater Lake | FDSKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESI
Ga0181348_10828923 | 3300017784 | Freshwater Lake | YLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESV
Ga0181348_10843361 | 3300017784 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESL
Ga0181359_10055795 | 3300019784 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0181359_10342854 | 3300019784 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0181359_10479802 | 3300019784 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREESV
Ga0181359_10535515 | 3300019784 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPSLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0181359_10809213 | 3300019784 | Freshwater Lake | LRTYQVEVFDLKDQYGGGMPCLQFEAQSEREAFEVALESLGVYMKIGEPS
Ga0181359_10840172 | 3300019784 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETILESLGININMKEESI
Ga0181359_10943143 | 3300019784 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0181359_11320392 | 3300019784 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0208050_10085552 | 3300020498 | Freshwater | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0181354_10226083 | 3300022190 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGINISMKEESI
Ga0181354_11508812 | 3300022190 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETI
Ga0181351_10556301 | 3300022407 | Freshwater Lake | SLKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0181351_10685032 | 3300022407 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININIKEGSI
Ga0214917_100965152 | 3300022752 | Freshwater | LKTYIVEVFDKKDSHGLSMPVLQFEATSEKEAHEIVMESLGVYMTMESAS
Ga0214917_101020802 | 3300022752 | Freshwater | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMREENI
Ga0214921_103082382 | 3300023174 | Freshwater | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININIREENI
Ga0214923_101317335 | 3300023179 | Freshwater | LKTYLVEVFDKKDSYGLSMPVLQFEATSEKEAHEIVMESLGVYMTMESAS
Ga0214923_101485301 | 3300023179 | Freshwater | LKTYLVEVFDKKDSHGLSMPMLQFEATSEKEAHEIVMESLGVYMTMESAS
Ga0214923_103774182 | 3300023179 | Freshwater | VEVFDNKDQYGGGMPCLQFEATCRLEAYETVFENLGININMKEESI
Ga0214923_105162482 | 3300023179 | Freshwater | DKKDSYGLSMPVLQFEATSEKEAHEIVMESLGVYMTMESAS
Ga0214919_100771234 | 3300023184 | Freshwater | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININMKEESI
Ga0214919_101545416 | 3300023184 | Freshwater | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMRMEIAS
Ga0208916_102451393 | 3300025896 | Aqueous | EVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0209864_10160121 | 3300027547 | Sand | VEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0208974_11228972 | 3300027608 | Freshwater Lentic | KDQYGGGMPCLQFEATCRQEAYETIFESLGININIKEESI
Ga0209499_10266442 | 3300027712 | Freshwater Lake | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIFMTMESAS
Ga0209297_13843972 | 3300027733 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLGININIREESI
Ga0209087_10160825 | 3300027734 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFENLGININMREESV
Ga0209190_10146125 | 3300027736 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPVLQFEATCEKEAHQIIMESFGVYMTMESAS
Ga0209190_10986005 | 3300027736 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININIKEESI
Ga0209190_12041322 | 3300027736 | Freshwater Lake | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMGSAS
Ga0209085_10215135 | 3300027741 | Freshwater Lake | LKTYLVEVFDKKDSHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMESAS
Ga0209084_10520911 | 3300027749 | Freshwater Lake | KGKGLKTYLVEVFDKKDQYCGGMPCLQFEATCEKEAHEIVMESLGIYMRMEVAS
Ga0209296_10360855 | 3300027759 | Freshwater Lake | LKTYLVEVFDSKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI
Ga0209296_10488463 | 3300027759 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKDESI
Ga0209246_103378081 | 3300027785 | Freshwater Lake | DQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0209246_103922071 | 3300027785 | Freshwater Lake | GEKLKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGINISMKEESI
Ga0209354_100226913 | 3300027808 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININIKEGSV
Ga0209354_102650902 | 3300027808 | Freshwater Lake | LKTYLVEVFDNKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVAS
Ga0209400_10272935 | 3300027963 | Freshwater Lake | VEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIYMTMGSAS
Ga0209298_100091925 | 3300027973 | Freshwater Lake | LKTYLVEVFDKKDQYGGGMPCLQFEATCEKEAHEIVMESLGIYMKMEVVS
Ga0209298_103374212 | 3300027973 | Freshwater Lake | LKTYLVEVFDKKDGHGLSMPVLQFEATCEKEAHEIVMESLGIYMTM
(restricted) Ga0247834_11823971 | 3300027977 | Freshwater | KGEVLKTYLVEVFDNRDQYGGGMPCLQFEATCRREAYETIFESMGININIKEESI
(restricted) Ga0247839_12618831 | 3300028553 | Freshwater | LKTYLVEVFDNRDQYGGGMPCLQFEATCRREAYET
Ga0307380_102263964 | 3300031539 | Soil | LRTYLVEVFDKKDSHGLSMPVLQFEATCEKEAHEIVMDSLGVYMRMETAS
Ga0307379_103240616 | 3300031565 | Soil | LKTYLVEVFDKKDSHGLSMPVLQFEATCEKEAHEIVMDSLGVYMRMETAS
Ga0315291_105487052 | 3300031707 | Sediment | LKTYLVEVFDNKDQYGGGMPCLQFEATCKQEAYETIFESLGININMKDESI
Ga0335036_0017390_2371_2526 | 3300034106 | Freshwater | LKTYLVEVFDKKDQYGCGMPCLQFEATCRQEAYETIFESLGINISMKEESI
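
Many family members differ from the representative sequence at only a few positions. As a rough illustration, the Python sketch below computes an ungapped percent identity between the representative sequence from Basic Information and the first sequence in the table above. The function is a simple position-by-position comparison, not the alignment method used by the database, and it treats a trailing "*" as a stop-codon marker (the usual convention, assumed here).

```python
def percent_identity(a, b):
    """Ungapped identity, in percent, between two sequences of equal length."""
    # Assumption: a trailing '*' marks a stop codon and is not a residue.
    a, b = a.rstrip("*"), b.rstrip("*")
    if len(a) != len(b):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Representative sequence of family F104726 (from Basic Information).
representative = "LKTYLVEVFDKKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI"
# First family member listed in the sequence table (sample 3300002408).
member = "LKTYLVEVFDNKDQYGGGMPCLQFEATCRQEAYETIFESLGININMKEESI*"

identity = percent_identity(representative, member)
print(f"{identity:.2f} % identity")
```

The two sequences differ at a single position (K versus N at residue 11), so 50 of 51 positions match. Truncated members of other lengths would need a real alignment before they can be compared this way.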




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice