NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F091890


Overview

Basic Information
Family ID F091890
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 58 residues
Representative Sequence MRLLYRATRFDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDYLDPQWQLVDTSDE
Number of Associated Samples 75
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 4.35 %
% of genes near scaffold ends (potentially truncated) 39.25 %
% of genes from short scaffolds (< 2000 bps) 37.38 %
Associated GOLD sequencing projects 62
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49


Most Common Taxonomy
Group Unclassified (85.981 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous
(44.860 % of family members)
Environment Ontology (ENVO) Unclassified
(60.748 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(63.551 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 12.79%, β-sheet: 24.42%, Coil/Unstructured: 62.79%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
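
The low-confidence rule stated above amounts to a threshold check. The following is a minimal sketch in Python. The function name `model_confidence` and the label strings are illustrative assumptions; only the 0.7 cutoff and the 0.49 score come from this page.

```python
# Cutoff quoted on this page: models with pTM < 0.7 are flagged as low confidence.
PTM_CUTOFF = 0.7


def model_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> str:
    """Return 'low' when the pTM-score is below the cutoff, else 'high'.

    The labels are hypothetical; the page only names the low-confidence case.
    """
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return "low" if ptm < cutoff else "high"


# pTM-score reported for family F091890.
print(model_confidence(0.49))
```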



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 107 Family Scaffolds
PF14284 | PcfJ | 10.28
PF13392 | HNH_3 | 3.74
PF00847 | AP2 | 1.87
PF13936 | HTH_38 | 0.93




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 85.98 %
All Organisms | root | All Organisms | 14.02 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001968|GOS2236_1031170 | Not Available | 1943
3300005805|Ga0079957_1098507 | Not Available | 1600
3300006734|Ga0098073_1052331 | Not Available | 544
3300006802|Ga0070749_10682942 | Not Available | 550
3300006875|Ga0075473_10388110 | Not Available | 564
3300007539|Ga0099849_1165992 | Not Available | 846
3300007541|Ga0099848_1006260 | Not Available | 5433
3300007541|Ga0099848_1178506 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 772
3300007541|Ga0099848_1301236 | Not Available | 550
3300008055|Ga0108970_10141113 | All Organisms → Viruses | 3169
3300009466|Ga0126448_1002948 | Not Available | 4716
3300010318|Ga0136656_1276156 | Not Available | 549
3300010354|Ga0129333_10367267 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 1278
3300010354|Ga0129333_10763319 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 827
3300010354|Ga0129333_11472305 | Not Available | 558
3300010368|Ga0129324_10398040 | Not Available | 532
3300010370|Ga0129336_10328581 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 845
3300012282|Ga0157136_1010014 | Not Available | 573
3300013005|Ga0164292_10770528 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 610
(restricted) 3300013137|Ga0172375_10823814 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 569
3300017754|Ga0181344_1071736 | Not Available | 1021
3300017785|Ga0181355_1056096 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 1669
3300019745|Ga0194002_1005709 | Not Available | 1393
3300019784|Ga0181359_1006076 | Not Available | 3997
3300020109|Ga0194112_10267220 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Synechococcales | 1321
3300021356|Ga0213858_10595768 | Not Available | 503
3300021963|Ga0222712_10340888 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 927
3300021963|Ga0222712_10777592 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 530
3300021964|Ga0222719_10736242 | Not Available | 551
3300022067|Ga0196895_1000059 | Not Available | 12577
3300022176|Ga0212031_1038161 | Not Available | 793
3300022198|Ga0196905_1168711 | Not Available | 557
3300025057|Ga0208018_129382 | Not Available | 588
3300025057|Ga0208018_130745 | Not Available | 566
3300025635|Ga0208147_1054592 | Not Available | 1018
3300025674|Ga0208162_1057331 | Not Available | 1281
3300025674|Ga0208162_1101529 | Not Available | 855
3300025732|Ga0208784_1142159 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobiaceae → unclassified Verrucomicrobiaceae → Verrucomicrobiaceae bacterium | 711
3300027499|Ga0208788_1140858 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 535
3300031758|Ga0315907_10040850 | All Organisms → Viruses → Predicted Viral | 4122
3300031758|Ga0315907_10988197 | Not Available | 609
3300031963|Ga0315901_10546048 | Not Available | 892
3300034200|Ga0335065_0166034 | Not Available | 1460
3300034283|Ga0335007_0430624 | Not Available | 815
3300034283|Ga0335007_0694616 | All Organisms → Viruses → unclassified bacterial viruses → Synechococcus phage S-EIVl | 570
3300034418|Ga0348337_115192 | Not Available | 838

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 44.86%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 8.41%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 8.41%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 6.54%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 6.54%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 4.67%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 3.74%
Estuarine Water | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water | 3.74%
Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Sediment | 0.93%
Wetland | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland | 0.93%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Epilimnion → Freshwater | 0.93%
Freshwater | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater | 0.93%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 0.93%
Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake | 0.93%
Aquatic | Environmental → Aquatic → Freshwater → Drinking Water → Unclassified → Aquatic | 0.93%
Freshwater | Environmental → Aquatic → Freshwater → River → Unclassified → Freshwater | 0.93%
Marine Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment | 0.93%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 0.93%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 0.93%
Meromictic Pond | Environmental → Aquatic → Unclassified → Unclassified → Unclassified → Meromictic Pond | 0.93%
Deep Subsurface | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface | 0.93%
Estuary | Host-Associated → Plants → Leaf → Unclassified → Unclassified → Estuary | 0.93%
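
The distribution percentages in the habitat table above can be converted back into approximate member counts. This is a minimal sketch that assumes every percentage is taken relative to the 107 family members listed in the Overview and was rounded for display. The helper name `members_from_percent` is hypothetical.

```python
FAMILY_SIZE = 107  # "Number of Sequences" from the Overview


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Estimate how many family members a displayed percentage corresponds to.

    Assumes the percentage was computed against `total` and then rounded,
    so the estimate is rounded to the nearest whole member.
    """
    return round(percent / 100.0 * total)


# A few rows taken from the habitat table.
for habitat, percent in [("Aqueous", 44.86), ("Freshwater To Marine Saline Gradient", 8.41), ("Sediment", 0.93)]:
    print(habitat, members_from_percent(percent))
```

The assumption about the denominator does not hold for every percentage on this page. For example, 4.35 % of 107 is not close to a whole number, so the RBS figure in the Quality Assessment is presumably computed over a different set of genes.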




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300001968 | Marine microbial communities from Lake Gatun, Panama - GS020 | Environmental
3300005805 | Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA | Environmental
3300006734 | Marine viral communities from the Gulf of Mexico - 31_GoM_OMZ_CsCl metaG | Environmental
3300006790 | Marine viral communities from the Gulf of Mexico - 32_GoM_OMZ_CsCl metaG | Environmental
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental
3300006810 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 | Environmental
3300006875 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA | Environmental
3300007346 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 | Environmental
3300007363 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA | Environmental
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental
3300007539 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG | Environmental
3300007541 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300007960 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG | Environmental
3300008055 | Metatranscriptomes of the Eelgrass leaves and roots. Combined Assembly of Gp0128390, Gp0128391, Gp0128392, and Gp0128393 | Host-Associated
3300008450 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - Oct 27, 2014 all contigs | Environmental
3300009131 | Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Open_0915_D1 | Environmental
3300009466 | Aquatic microbial communities from different depth of meromictic Siders Pond, Falmouth, Massachusetts; Cast 1, 2m depth; DNA IDBA-UD | Environmental
3300010299 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_15_0.2_DNA | Environmental
3300010300 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_27_0.2_DNA | Environmental
3300010318 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_15_0.8_DNA | Environmental
3300010354 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA | Environmental
3300010368 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_15_0.2_DNA | Environmental
3300010370 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA | Environmental
3300012282 | Freshwater microbial communities from Central Basin Methane Hotpot in Lake Erie, Ontario, Canada - Station 1365 - Bottom - Depth 20.5m | Environmental
3300013005 | Eutrophic lake water microbial communities from Lake Mendota, Wisconsin, USA - GEODES117 metaG | Environmental
3300013137 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_11.1m | Environmental
3300017754 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MLB.D.D | Environmental
3300017785 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.D.N | Environmental
3300019745 | Sediment microbial communities from the Broadkill River, Lewes, Delaware, United States - FLT_8-9_MG | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020109 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015016 Mahale Deep Cast 400m | Environmental
3300020183 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface | Environmental
3300020190 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015013 Mahale N5 surface | Environmental
3300020204 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015008 Mahale S9 surface | Environmental
3300020222 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015034 Kigoma Deep Cast 250m | Environmental
3300020566 | Freshwater microbial communities from Lake Mendota, WI - 13SEP2009 deep hole epilimnion (SPAdes) | Environmental
3300020570 | Freshwater microbial communities from Lake Mendota, WI - 31AUG2010 deep hole epilimnion (SPAdes) | Environmental
3300021356 | Coastal seawater microbial communities near Pivers Island, North Carolina, United States - PICO245 | Environmental
3300021376 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015050 Kigoma 12 surface | Environmental
3300021962 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_649D | Environmental
3300021963 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_657D | Environmental
3300021964 | Estuarine water microbial communities from San Francisco Bay, California, United States - C33_34D | Environmental
3300022063 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v2) | Environmental
3300022067 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_30 (v3) | Environmental
3300022159 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_28 (v3) | Environmental
3300022176 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v2) | Environmental
3300022198 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v3) | Environmental
3300022200 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3) | Environmental
3300024356 | Freshwater microbial communities from Altamaha River, Georgia, United States - Atl_Miss_RepC_8d | Environmental
3300025057 | Marine viral communities from the Gulf of Mexico - 31_GoM_OMZ_CsCl metaG (SPAdes) | Environmental
3300025585 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_<0.8_DNA (SPAdes) | Environmental
3300025610 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_D_<0.8_DNA (SPAdes) | Environmental
3300025635 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA (SPAdes) | Environmental
3300025647 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (SPAdes) | Environmental
3300025655 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG (SPAdes) | Environmental
3300025674 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG (SPAdes) | Environmental
3300025687 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG (SPAdes) | Environmental
3300025732 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA (SPAdes) | Environmental
3300025759 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_24 (SPAdes) | Environmental
3300025872 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_D_>0.8_DNA (SPAdes) | Environmental
3300027499 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series FC 2014_7_11 (SPAdes) | Environmental
3300027901 | Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-1-36_30 (SPAdes) | Environmental
3300029933 | Aquatic microbial communities from drinking water treatment plant in Pearl River Delta area, China - influent_20120727_2 | Environmental
3300031758 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123 | Environmental
3300031857 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125 | Environmental
3300031951 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA120 | Environmental
3300031963 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA116 | Environmental
3300033994 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME25Jul2006D11-rr0046 | Environmental
3300034012 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME18Aug2017-rr0027 | Environmental
3300034200 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME18Jul2013-rr0190 | Environmental
3300034280 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME10Aug2009-rr0048 | Environmental
3300034283 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME07Aug2003-rr0061 | Environmental
3300034374 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 (v4) | Environmental
3300034418 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_28 (v4) | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
GOS2236_10311705 | 3300001968 | Marine | MYRPEEWAPALCRASFQLADDEQIPTDEDGFCHFLYQLDLEWQLVDTSDWSLD*
Ga0079957_10985071 | 3300005805 | Lake | YRATRFEPEEWAPALCQATIELDPEEPIPLDENCFCSYLDQLDPQWQVLSPDDI*
Ga0098073_10523311 | 3300006734 | Marine | LLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCAYLDYLDPDWQVLDLTDEQ*
Ga0098074_10278581 | 3300006790 | Marine | RLDPEEWAPALCQTTIELDPGEPIPLDEDGFCAYLDNLDPQWQLLDTSVGDPD*
Ga0098074_11575903 | 3300006790 | Marine | VDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCSYLDYLDPDWQVLDLTDEQ*
Ga0098074_11673633 | 3300006790 | Marine | VDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCAYLDYLDPDWQVLDLTDEQ*
Ga0070749_100376077 | 3300006802 | Aqueous | VDEMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDMSDCND*
Ga0070749_104014663 | 3300006802 | Aqueous | YRATHFEPDEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDKDDIE*
Ga0070749_106829422 | 3300006802 | Aqueous | VDEMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLVDTSDS*
Ga0070754_101084491 | 3300006810 | Aqueous | EMRLLYRATHFEPEEWAPALCEVTIAIDPEDPIPLDEDSFCSYLDQLDPQWQLVDAY*
Ga0070754_104128642 | 3300006810 | Aqueous | EMRLLYRATHFEPEEWAPALCEVTIAIDPEDPIPLDEDSFCSYLDQLDPQWQLVDSY*
Ga0075473_103881101 | 3300006875 | Aqueous | EMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLVDTSDS*
Ga0070753_12521661 | 3300007346 | Aqueous | EPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0070753_12546063 | 3300007346 | Aqueous | LYRATYLDPEEWAPALCQTTIELDPEEPIPLDEDGFCSYLDDLDPQWQLLDTSIGDPD*
Ga0075458_100813181 | 3300007363 | Aqueous | EMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDMSDCND*
Ga0099851_10471443 | 3300007538 | Aqueous | VDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCSYLDHLDPDWQVLDLTDEQ*
Ga0099851_10766491 | 3300007538 | Aqueous | YRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQVLSPDDI*
Ga0099851_11892441 | 3300007538 | Aqueous | YRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDTSVGDPD*
Ga0099851_12090201 | 3300007538 | Aqueous | VDEMRLLYRATRLDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0099851_13331091 | 3300007538 | Aqueous | THFEPEEWAPALCTATIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDTSDE*
Ga0099849_11659922 | 3300007539 | Aqueous | LYRATHFEPDEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDKDDIE*
Ga0099849_13425751 | 3300007539 | Aqueous | ATHLDPEEWAPALCEATIELDPEEPIPLDEDGFCRYLDQLDPQWQLVDMSDQ*
Ga0099848_10062601 | 3300007541 | Aqueous | MRLLYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0099848_11412055 | 3300007541 | Aqueous | LLYRATRLDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSIGDPD*
Ga0099848_11785062 | 3300007541 | Aqueous | VVDEMRLLYHATHLDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPQWQLVERD
Ga0099848_13012363 | 3300007541 | Aqueous | RATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCAYLDYLDPDWQVLDLTDEQ*
Ga0099846_11510761 | 3300007542 | Aqueous | VTAVVDEMRLLYHATHLDPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0099846_12486361 | 3300007542 | Aqueous | THFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDTSVGDPD*
Ga0099850_10395851 | 3300007960 | Aqueous | EPEEWAPALCQATIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0099850_13555722 | 3300007960 | Aqueous | VTAVVDEMRLLYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSIGDPD*
Ga0108970_101411133 | 3300008055 | Estuary | MRLLYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLSDYE*
Ga0114880_10545433 | 3300008450 | Freshwater Lake | LYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLTDEQ*
Ga0115027_112825441 | 3300009131 | Wetland | RATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPQWQVVDMNDC*
Ga0126448_100294812 | 3300009466 | Meromictic Pond | EMRLLYRATHLDPEEWAPALCQATIELDPEEPIPLDEDSFCSYLDDLDPQWQLLDTSIGDPD*
Ga0129342_13471492 | 3300010299 | Freshwater To Marine Saline Gradient | YRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD*
Ga0129351_11288183 | 3300010300 | Freshwater To Marine Saline Gradient | VVDEMRLLYRATHFEPEEWAPALCQATIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDTSVGDPD*
Ga0136656_12761561 | 3300010318 | Freshwater To Marine Saline Gradient | THFDPEEWAPALCQTTIELDPEEPIPLDEDGFCRYLDDLDPQWQLLDTSIGDPD*
Ga0129333_101897261 | 3300010354 | Freshwater To Marine Saline Gradient | EMRLLYRATHFEPEEWAPALCTASFELDEEQQIPTDEDGFCCYLDQLDLHWQLVDTSDYDLD*
Ga0129333_103672673 | 3300010354 | Freshwater To Marine Saline Gradient | LYRATRFEPEEWAPALCQATIELDPEEPIPLDEDGFCSYLGQLDPQWQLIERD*
Ga0129333_107633192 | 3300010354 | Freshwater To Marine Saline Gradient | RFEPEEWAPALCQATIELDPEEPIPLDEDGFCSYLDQLDPQWQLVDCD*
Ga0129333_114723052 | 3300010354 | Freshwater To Marine Saline Gradient | DEMRLLYRATRFEPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPQWKLVERD*
Ga0129324_103980401 | 3300010368 | Freshwater To Marine Saline Gradient | LLYRATRFEPEEWAPALCQTTIELDPKEPIPFDEDGFCAYLDQLDPQWQLLDTSVGDPD*
Ga0129336_103285811 | 3300010370 | Freshwater To Marine Saline Gradient | TRFEPEEWAPALCTTTVELDPEEPIPLDEDGFCAYLNQLDPQWELLDMSDDE*
Ga0157136_10100143 | 3300012282 | Freshwater | RLLYRATRFEPEEWAPALCQATIELDPEESIPLDEDGFCAYLDHLDPQWQLLDLTDEQ*
Ga0164292_107705281 | 3300013005 | Freshwater | YVTVTAMVDNMRLLYRATRLDPEEWAPALCEATFQLGEDESIPTSEDGFCCYLDHRDVEWQLVDTSDYSY*
(restricted) Ga0172375_108238141 | 3300013137 | Freshwater | HKATYFEPDEWAPALCIADFELDEDEQIPVDEDGFCQYLADRDLQWQLVDTSDWYLAS*
Ga0181344_10717361 | 3300017754 | Freshwater Lake | LYRATRFEPEEWTAALCQATIELDPEEPIPLDEDGFCSYLDQLDPQWQVLSPDDI
Ga0181355_10560961 | 3300017785 | Freshwater Lake | VTAVVDDMRLLYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLSDYE
Ga0181355_10619361 | 3300017785 | Freshwater Lake | AVVDDMRLLYRATRFDPEEWAPALCTTTIELDPEESIPLDEDGFCSYLDDLDPHWQLLDLSDYE
Ga0194002_10057091 | 3300019745 | Sediment | LYRATHFEPDAWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDKDDIE
Ga0181359_10060767 | 3300019784 | Freshwater Lake | ETAVVDDMRLLYRATRFDPEEWAPALCQATIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDVTDDY
Ga0194112_102672203 | 3300020109 | Freshwater Lake | RATYLEPEEWAPALCTATFQLDEGEQIPLDEDGFCGYLDQLDLYWQLVDTSDSDLD
Ga0194115_103832913 | 3300020183 | Freshwater Lake | TRFEPEEWAPALCTATIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDMSDE
Ga0194118_102619972 | 3300020190 | Freshwater Lake | DEMRLLYRATYLEPEEWAPALCTATFQLDEGEQIPLDEDGFCGYLDQLDLYWQLVDTSDSDLD
Ga0194116_102668161 | 3300020204 | Freshwater Lake | DEMRLLYRATYLEPEEWAPALCTATFQLDEGEQIPLDEDGFCSYLDQLDLYWQLVDTSDSDLD
Ga0194125_106288361 | 3300020222 | Freshwater Lake | TAVVEDMRLLYRATRFEPEEWCPALCSASFQLDDNESIPLDEDGFCSYLNERDLNWQLVDTSDYNID
Ga0208222_10207021 | 3300020566 | Freshwater | VVDDMRLLYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLSDDE
Ga0208465_10240051 | 3300020570 | Freshwater | EPDEWAPALCSTSFELDPGESIPTDEDGFCSYLDQLDLHWQLIDTSDCDID
Ga0213858_105957681 | 3300021356 | Seawater | RLLYRATHFEPDEWAPALCQTTIELDPEEPIPLDEDGFCAYLNHLDPKWQLVDTSDE
Ga0194130_104960011 | 3300021376 | Freshwater Lake | RLLYRATRFEPEEWAPALCTATIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDMSDE
Ga0222713_104600082 | 3300021962 | Estuarine Water | RYEPEEWAPALCTASFELDEEQQIPTDEDGFCCYLDQLDLHWQLVDTSDYDLD
Ga0222712_103408882 | 3300021963 | Estuarine Water | YRATQFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDHLDPQWQLVERDDYE
Ga0222712_107775921 | 3300021963 | Estuarine Water | RDDPEEWAPGLCTASFEMADDESIPFDEDSFCDFLDGLDLQWQLVDTSDYYLDDAA
Ga0222719_107362421 | 3300021964 | Estuarine Water | VIVTAVVDEMRLLYRATYLDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDDLDPQWQLLDTSIGDPD
Ga0212029_10577401 | 3300022063 | Aqueous | VTVTAVVDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCSYLDHLDPDWQVLDLTDEQ
Ga0196895_10000591 | 3300022067 | Aqueous | PRATHFEPDEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDKDDIE
Ga0196893_10015871 | 3300022159 | Aqueous | SKVKAIVDEMRLLYRATHFEPDEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDKDDIE
Ga0212031_10381611 | 3300022176 | Aqueous | VTVTAVVDEMRLLYRATRFDPEEWAPALCQATIELDPEEPIPLDEDGFCRYLDDLDPQWQLLDTSIGDPD
Ga0196905_11687111 | 3300022198 | Aqueous | VVITAVVDEMRLLYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD
Ga0196901_10261471 | 3300022200 | Aqueous | LYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQVLSPDDI
Ga0196901_11613153 | 3300022200 | Aqueous | LYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDTSVGDPD
Ga0255169_10241503 | 3300024356 | Freshwater | LVDEMRLLYRATHLDPEEWAPALCTATVELDPDEPIPLDEDGFALYLDQLDPQWQLVEYD
Ga0208018_1226771 | 3300025057 | Marine | VTAVVDEMRLLYRATRFDPEEWAPALCQATIELDPEEPIPLDEDGFCRYLDDLDPQWQLLDTSIGDPD
Ga0208018_1293823 | 3300025057 | Marine | VVDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCSYLDYLDPDWQVLDLTDEQ
Ga0208018_1307453 | 3300025057 | Marine | VVDEMRLLYRATRFEPEEWAPALCEATIELDPTEQIPLDEDGFCAYLDYLDPDWQVLDLTDEQ
Ga0208546_11003393 | 3300025585 | Aqueous | TVTALVDEMRLLYRASRLDPEEWAPALCTATVELDPDEPIPLDEDGFAQYLDQLDPQWQLVDASDE
Ga0208149_11119601 | 3300025610 | Aqueous | MRLLYRATYLDPEDWAPALCQTTIELDPEEPIPLDEDGFCSYLDDLDPQWQLLDTSIGDP
Ga0208147_10545925 | 3300025635 | Aqueous | EMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDMSDCN
Ga0208160_10336511 | 3300025647 | Aqueous | ATHFEPEEWAPALCQATIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD
Ga0208795_10874481 | 3300025655 | Aqueous | VVDEMRLLYRATYFEPEEWAPALCTTTIELDPEEPIPLDEDGFCAYLDHLDPQWELLDTSDE
Ga0208795_11716231 | 3300025655 | Aqueous | THFEPEEWAPALCTATIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDTSDE
Ga0208162_10573313 | 3300025674 | Aqueous | LYRATHFEPEEWAPALCQATIELDPEEPIPLDEDGFCAYLDQLDPQWQLLDTSVGDPD
Ga0208162_11015291 | 3300025674 | Aqueous | VDEMRLLYRATHFEPDEWAPALCQTTIELDPEKPIPLDEDGFCAYLDQLDPQWQLLDKDDIE
Ga0208019_10135687 | 3300025687 | Aqueous | RATHLDPEEWAPALCEATIELDPEEPIPLDEDGFCRYLDQLDPQWQVLSPDDI
Ga0208019_10980031 | 3300025687 | Aqueous | TVTAVVDEMRLLYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQVLSPDDI
Ga0208019_11318491 | 3300025687 | Aqueous | RATHLDPEEWAPALCEATIELDPEEPIPLDEDGFCRYLDQLDPQWQLVDMSDQ
Ga0208019_11497401 | 3300025687 | Aqueous | RLDPEEWAPALCTATIELDPEEPIPLDEDGFCAYLDHLDPQWQLVDTSDE
Ga0208784_11421592 | 3300025732 | Aqueous | YRATRFEPEEWAPALCQATIELDPEEPIPLDEDGFCSYLDQLDPQWQLVDCD
Ga0208899_10349894 | 3300025759 | Aqueous | LYRATHFEPEEWAPALCEVTIAIDPEDPIPLDEDSFCSYLDQLDPQWQLVDSY
Ga0208783_100177221 | 3300025872 | Aqueous | VDEMRLLYRASRLDPEEWAPALCTATVELDPDEPIPLDEDGFAQYLDQLDPQWQLVDASD
Ga0208788_11408582 | 3300027499 | Deep Subsurface | VEDMRLLYKATRDDPEEWAPALCTTSFELDPEQPIPTDEDSFCNYLSDLSLNWELVDTSDYNLD
Ga0209427_108430481 | 3300027901 | Marine Sediment | TRLDPEEWAPALCQATIELDPGEPIPLDEDGFCAYLDNLDPQWQLLDTSVGYPD
Ga0119945_10215842 | 3300029933 | Aquatic | PAEYAPGLCSASFTLDPDEPIPLDEDGFCEYLAALDLDWQLVDTSDWYLD
Ga0315907_100408501 | 3300031758 | Freshwater | HATHLDPEEWAPALCKTTVELDPEEPIPLDEDGFCAYLNQLDPQWQLVDKDDFDLD
Ga0315907_109881971 | 3300031758 | Freshwater | LLYRATRLDPEEWAPALCTTTVELDPEEPIPLDEDGFCAYLDYLDPQWQLVDTSDE
Ga0315909_107214853 | 3300031857 | Freshwater | MRLLYRATRFDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDYLDPQWQLVDTSDE
Ga0315904_104897041 | 3300031951 | Freshwater | FEPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPHWQLVDNSDYDLDS
Ga0315901_105460481 | 3300031963 | Freshwater | DDMRLLYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDHLDPQWQLVDLSDE
Ga0334996_0415025_444_623 | 3300033994 | Freshwater | MRLLYRATHFEPEEWAPALCQTTIELDPEEPIPLDEDGFCSYLDDLDPQWQLVERDDLD
Ga0334986_0262666_3_191 | 3300034012 | Freshwater | MRLLYKATHLDPEEWAPALCTASFELDEGEQIPTDEDGFCSYLDSLDLNWELVDTSDWDIDS
Ga0335065_0166034_32_202 | 3300034200 | Freshwater | MRLLYRATRFDPEEWAPALCTTTIELDPGEPIPFDEDGFCSYLDQLDPQWQLVDCD
Ga0334997_0568765_532_702 | 3300034280 | Freshwater | LYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLSDDE
Ga0335007_0430624_23_211 | 3300034283 | Freshwater | MRLLYRATHLDPEEWAPALCTASFELDEGEQIPTDEDGFCSYLDSLNLEWQLIDTSDWHLDQ
Ga0335007_0639429_437_607 | 3300034283 | Freshwater | LYRATRFDPEEWAPALCTTTIELDPEEPIPLDEDGFCSYLDQLDPLWQLLDLSDYE
Ga0335007_0694616_3_188 | 3300034283 | Freshwater | RLLYRATRFDPEEWAPALCQATIELDPEEPIPLDEDGFCSYLDQLDPHWQLVDNSDYDLD
Ga0348335_037934_1_159 | 3300034374 | Aqueous | FEPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDHLDPQWQLLDTSVGDPD
Ga0348337_115192_677_838 | 3300034418 | Aqueous | YLDPEEWAPALCQTTIELDPEEPIPLDEDGFCSYLDDLDPQWQLLDTSIGDPD
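
Many of the sequences above end in `*`. The sketch below counts residues while ignoring one such trailing character. It assumes the asterisk is the conventional stop-codon marker of translated gene calls, which this page does not state. The helper name `residue_length` is hypothetical.

```python
def residue_length(sequence: str) -> int:
    """Count residues in a protein sequence, ignoring one trailing '*'.

    Treating '*' as a stop-codon marker is an assumption, not something
    this page documents.
    """
    sequence = sequence.strip()
    if sequence.endswith("*"):
        sequence = sequence[:-1]
    return len(sequence)


# Representative sequence of family F091890, copied from the Overview.
representative = "MRLLYRATRFDPEEWAPALCQTTIELDPEEPIPLDEDGFCAYLDYLDPQWQLVDTSDE"
print(residue_length(representative))
```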




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice