NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F083668



Overview

Basic Information
Family ID F083668
Family Type Metagenome / Metatranscriptome
Number of Sequences 112
Average Sequence Length 52 residues
Representative Sequence MKWTPDKITALVLIVGCLALLFTGIDGEVKSILTLAAGYFFGVSYAERKK
Number of Associated Samples 56
Number of Associated Scaffolds 112

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 80.36 %
% of genes near scaffold ends (potentially truncated) 16.07 %
% of genes from short scaffolds (< 2000 bps) 72.32 %
Associated GOLD sequencing projects 48
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63
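
The percentages in the Quality Assessment block can be turned back into approximate gene counts. The Python sketch below is illustrative only: the helper name `count_from_percent` is hypothetical (it is not part of NMPFamsDB), and it assumes each percentage is computed over all 112 family sequences.

```python
def count_from_percent(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` % of `total`."""
    return round(percent * total / 100)


FAMILY_SIZE = 112  # "Number of Sequences" reported in Basic Information

# Percentages copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motif": 80.36,
    "near scaffold end (potentially truncated)": 16.07,
    "short scaffold (< 2000 bp)": 72.32,
}

for label, percent in quality_percentages.items():
    count = count_from_percent(percent, FAMILY_SIZE)
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Under that assumption, the three figures correspond to roughly 90, 18 and 81 genes respectively.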


Most Common Taxonomy
Group Unclassified (87.500 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface (24.107 % of family members)
Environment Ontology (ENVO) Unclassified (30.357 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline) (49.107 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 53.85%, β-sheet: 0.00%, Coil/Unstructured: 46.15%

Predicted 3D Structure

pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
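
The rule stated in the note above can be written as a one-line check. This is a minimal sketch, not NMPFamsDB code: the names `PTM_CUTOFF` and `is_low_confidence` are hypothetical, and the only value taken from the page is the 0.7 threshold and this family's pTM-score of 0.63.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the "Low Quality Model" note


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F083668.
print(is_low_confidence(0.63))
```

The comparison is strict, matching the "pTM < 0.7" wording, so a score of exactly 0.7 would not be flagged.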



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 112 Family Scaffolds
PF02195	ParBc	4.46
PF00589	Phage_integrase	2.68
PF04266	ASCH	0.89
PF13385	Laminin_G_3	0.89
PF13229	Beta_helix	0.89

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 112 Family Scaffolds
COG2411	Predicted RNA-binding protein, contains PUA-like ASCH domain	General function prediction only [R]	0.89
COG3097	Uncharacterized conserved protein YqfB, UPF0267 family	Function unknown [S]	0.89
COG4405	Predicted RNA-binding protein YhfF, contains PUA-like ASCH domain	General function prediction only [R]	0.89



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	87.50 %
All Organisms	root	All Organisms	12.50 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300003141|Ga0052246_1006361	Not Available	1443
3300003143|Ga0052245_1000096	Not Available	3341
3300003143|Ga0052245_1002711	Not Available	1228
3300003143|Ga0052245_1023378	Not Available	808
3300003143|Ga0052245_1030499	Not Available	767
3300003143|Ga0052245_1037019	Not Available	670
3300003144|Ga0052244_1006863	Not Available	1281
3300005236|Ga0066636_10070172	Not Available	1854
3300005236|Ga0066636_10232177	Not Available	810
3300005935|Ga0075125_10082017	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium	1407
3300005935|Ga0075125_10367715	Not Available	559
3300008255|Ga0100403_1062773	Not Available	1550
3300008255|Ga0100403_1081453	Not Available	1298
3300008255|Ga0100403_1159060	Not Available	824
3300008255|Ga0100403_1326318	Not Available	511
3300008470|Ga0115371_11163969	Not Available	1311
3300008470|Ga0115371_11309028	Not Available	932
3300008517|Ga0111034_1013614	Not Available	1494
3300009149|Ga0114918_10013746	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium RBG_13_51_18	6329
3300009149|Ga0114918_10459727	Not Available	686
3300009499|Ga0114930_10045045	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium	2746
3300009499|Ga0114930_10160640	Not Available	1178
3300009499|Ga0114930_10560587	Not Available	507
3300009528|Ga0114920_10000898	Not Available	13373
3300009529|Ga0114919_10009419	Not Available	7665
3300009529|Ga0114919_10029102	Not Available	4160
3300009529|Ga0114919_10046583	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi	3220
3300009529|Ga0114919_10077466	Not Available	2429
3300009529|Ga0114919_10161984	Not Available	1606
3300009529|Ga0114919_10400695	Not Available	954
3300009529|Ga0114919_10519764	Not Available	820
3300009529|Ga0114919_10541792	Not Available	800
3300009529|Ga0114919_10807994	Not Available	635
3300009529|Ga0114919_10807997	Not Available	635
3300009529|Ga0114919_11097093	Not Available	534
3300009529|Ga0114919_11106112	Not Available	531
3300013086|Ga0163202_1121971	Not Available	561
(restricted) 3300013127|Ga0172365_10040086	All Organisms → cellular organisms → Archaea → TACK group → Candidatus Korarchaeota → Candidatus Korarchaeota archaeon	3176
(restricted) 3300013127|Ga0172365_10166467	Not Available	1366
(restricted) 3300013127|Ga0172365_10317142	Not Available	925
(restricted) 3300013127|Ga0172365_10618418	Not Available	618
(restricted) 3300013127|Ga0172365_10767028	Not Available	544
(restricted) 3300013128|Ga0172366_10086889	Not Available	2105
(restricted) 3300013128|Ga0172366_10276216	Not Available	1050
(restricted) 3300013128|Ga0172366_10339894	Not Available	924
(restricted) 3300013128|Ga0172366_10874885	Not Available	520
(restricted) 3300013129|Ga0172364_10165550	Not Available	1501
(restricted) 3300013129|Ga0172364_10182116	Not Available	1418
(restricted) 3300013129|Ga0172364_10208027	Not Available	1310
(restricted) 3300013129|Ga0172364_10310253	Not Available	1030
(restricted) 3300013129|Ga0172364_10312475	All Organisms → Viruses → Predicted Viral	1025
(restricted) 3300013129|Ga0172364_10429427	Not Available	845
(restricted) 3300013129|Ga0172364_10433788	Not Available	840
(restricted) 3300013129|Ga0172364_10817122	Not Available	574
(restricted) 3300013129|Ga0172364_10883624	Not Available	548
3300014148|Ga0180010_1012038	Not Available	795
3300014613|Ga0180008_1057134	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi	1554
3300014613|Ga0180008_1145629	Not Available	919
3300014656|Ga0180007_10138692	Not Available	1586
3300014656|Ga0180007_10158616	Not Available	1462
3300014656|Ga0180007_10288937	Not Available	1017
3300014656|Ga0180007_10341054	Not Available	920
3300014911|Ga0180301_10043465	Not Available	3034
3300015370|Ga0180009_10158877	Not Available	1072
3300015370|Ga0180009_10317619	Not Available	643
3300017992|Ga0180435_10709252	Not Available	848
3300022217|Ga0224514_10034977	Not Available	1660
3300022553|Ga0212124_10060665	All Organisms → Viruses → Predicted Viral	2200
3300024262|Ga0210003_1024435	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium RBG_13_51_18	3551
3300024263|Ga0209978_10428847	Not Available	641
3300024353|Ga0209979_1051097	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium	1972
3300024429|Ga0209991_10000815	Not Available	13931
3300024433|Ga0209986_10010995	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium	6780
3300024433|Ga0209986_10022748	Not Available	4189
3300024433|Ga0209986_10048235	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi	2545
3300024433|Ga0209986_10216351	Not Available	947
3300024433|Ga0209986_10270061	Not Available	816
3300025018|Ga0210043_1028437	Not Available	2032
3300025018|Ga0210043_1061237	Not Available	1199
3300025018|Ga0210043_1095296	Not Available	883
3300025022|Ga0210056_1096297	Not Available	1189
3300025285|Ga0208046_1098775	Not Available	561
3300025736|Ga0207997_1165676	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium	688
3300025868|Ga0210051_1117343	Not Available	1415
3300027740|Ga0214474_1056860	Not Available	1505
3300027742|Ga0209121_10058444	Not Available	1904
3300027893|Ga0209636_10021084	All Organisms → cellular organisms → Archaea → Euryarchaeota → Thermococci → unclassified Thermococci → Thermococci archaeon	6607
3300028620|Ga0257139_1053111	Not Available	678
3300028670|Ga0257143_1056594	Not Available	554
3300029689|Ga0257138_1032435	Not Available	818
3300029891|Ga0246100_133406	Not Available	769
3300031227|Ga0307928_10154193	Not Available	1273
3300031278|Ga0307431_1001641	Not Available	12638
3300031280|Ga0307428_1001255	Not Available	13166
3300031280|Ga0307428_1140669	Not Available	612
3300031331|Ga0307432_1002551	Not Available	9793
3300031351|Ga0307427_1001820	Not Available	10575
3300031365|Ga0307443_1002017	Not Available	11547
3300031365|Ga0307443_1004315	Not Available	7244
3300031379|Ga0307434_1016558	Not Available	2613
3300031539|Ga0307380_10009093	Not Available	12645
3300031539|Ga0307380_10481171	Not Available	1097
3300031539|Ga0307380_11002021	Not Available	667
3300031553|Ga0315547_1002445	Not Available	13091
3300031553|Ga0315547_1042813	Not Available	1769
(restricted) 3300031593|Ga0315307_1279636	Not Available	526
3300031643|Ga0315533_1002229	Not Available	13773
3300031643|Ga0315533_1026898	Not Available	2224
3300031673|Ga0307377_10499380	Not Available	886
3300031862|Ga0315280_10056920	Not Available	3240
3300032020|Ga0315296_10111487	Not Available	1731
3300032046|Ga0315289_10664918	Not Available	953

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Deep Subsurface	Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface	24.11%
Sediment	Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment	19.64%
Groundwater	Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater	12.50%
Salt Marsh	Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh	7.14%
Aquifer	Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Aquifer	6.25%
Marine Sediment	Environmental → Aquatic → Marine → Neritic Zone → Unclassified → Marine Sediment	6.25%
Salt Marsh Sediment	Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh Sediment	3.57%
Soil	Environmental → Terrestrial → Soil → Clay → Unclassified → Soil	3.57%
Freshwater	Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater	2.68%
Marine	Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine	2.68%
Saline Lake	Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Saline Lake	2.68%
Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment	1.79%
Marine Sediment	Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment	0.89%
Marine Sediment	Environmental → Aquatic → Marine → Coastal → Sediment → Marine Sediment	0.89%
Marine Sediment	Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment	0.89%
Sediment	Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment	0.89%
Marine	Environmental → Aquatic → Marine → Oil Seeps → Unclassified → Marine	0.89%
Saline Water	Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Saline Water	0.89%
Hypersaline Lake Sediment	Environmental → Aquatic → Non-Marine Saline And Alkaline → Hypersaline → Sediment → Hypersaline Lake Sediment	0.89%
Soil	Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil	0.89%




Associated Samples


Taxon OID	Sample Name	Habitat Type
3300003141	Marine sediment microbial communities from deep subseafloor of Shimokita Peninsula, Japan - 107 mbsf	Environmental
3300003143	Marine sediment microbial communities from deep subseafloor - Sample from 48.5 mbsf	Environmental
3300003144	Marine sediment microbial communities from deep subseafloor - Sample from 18.6 mbsf	Environmental
3300005236	Groundwater microbial communities from aquifer - Crystal Geyser CG07_land_8/20/14_0.80	Environmental
3300005935	Saline lake microbial communities from Ace Lake, Antarctica - Antarctic Ace Lake Metagenome 02UKN	Environmental
3300008255	Groundwater microbial communities from Crystal Geyser aquifers in Utah, USA - Crystal Geyser metaG 2015-01t	Environmental
3300008470	Sediment core microbial communities from Adelie Basin, Antarctica. Combined Assembly of Gp0136540, Gp0136562, Gp0136563	Environmental
3300008517	Marine sediment microbial communities from Aarhus Bay station M5, Denmark - 175 cmbsf. Combined Assembly of Gp0128389 and Gp0131431 MM4PM4	Environmental
3300009149	Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG	Environmental
3300009499	Deep subsurface microbial communities from Anholt, Denmark to uncover new lineages of life (NeLLi) - Anholt_01485 metaG	Environmental
3300009528	Deep subsurface microbial communities from South Pacific Ocean to uncover new lineages of life (NeLLi) - Chile_00310 metaG	Environmental
3300009529	Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG	Environmental
3300013086	Freshwater microbial communities from Powell Lake, British Columbia, Canada to study Microbial Dark Matter (Phase II) - PL_2010_300m	Environmental
3300013127 (restricted)	Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm	Environmental
3300013128 (restricted)	Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 69cm	Environmental
3300013129 (restricted)	Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 10cm	Environmental
3300014148	Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - OS_PW_MetaG	Environmental
3300014613	Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PW_MetaG	Environmental
3300014656	Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PC_MetaG	Environmental
3300014911	Subseafloor sediment microbial communities from Guaymas Basin, Gulf of California, Mexico - Guay16, Core 4569-2, 12-15 cm	Environmental
3300015370	Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - OS_PC_MetaG	Environmental
3300017992	Hypersaline lake sediment archaeal communities from the Salton Sea, California, USA - SS_3_S_1 metaG	Environmental
3300022217	Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_24	Environmental
3300022553	Powell_combined assembly	Environmental
3300024262	Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG (SPAdes)	Environmental
3300024263	Deep subsurface microbial communities from South Atlantic Ocean to uncover new lineages of life (NeLLi) - Benguela_00093 metaG (SPAdes)	Environmental
3300024353	Deep subsurface microbial communities from Anholt, Denmark to uncover new lineages of life (NeLLi) - Anholt_01485 metaG (SPAdes)	Environmental
3300024429	Deep subsurface microbial communities from South Pacific Ocean to uncover new lineages of life (NeLLi) - Chile_00310 metaG (SPAdes)	Environmental
3300024433	Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG (SPAdes)	Environmental
3300025018	Groundwater microbial communities from Crystal Geyser aquifers in Utah, USA - Crystal Geyser metaG 2015-01t (SPAdes)	Environmental
3300025022	Groundwater microbial communities from aquifer - Crystal Geyser CG07_land_8/20/14_0.80 (SPAdes)	Environmental
3300025285	Freshwater microbial communities from Powell Lake, British Columbia, Canada to study Microbial Dark Matter (Phase II) - PL_2010_300m (SPAdes)	Environmental
3300025736	Saline lake microbial communities from Ace Lake, Antarctica - Antarctic Ace Lake Metagenome 02UKN (SPAdes)	Environmental
3300025868	Groundwater microbial communities from aquifer - Crystal Geyser CG23_combo_of_CG06-09_8/20/14_all (SPAdes)	Environmental
3300027740	Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT95D214 HiSeq	Environmental
3300027742	Oil polluted marine microbial communities from Coal Oil Point, Santa Barbara, California, USA - Sample 3 (SPAdes)	Environmental
3300027893	Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-1-52-54 (SPAdes)	Environmental
3300028620	Metatranscriptome of saline water microbial communities from Sakinaw Lake, British Columbia, Canada - sak_2010_1_5_80m (Metagenome Metatranscriptome)	Environmental
3300028670	Metatranscriptome of saline water microbial communities from Sakinaw Lake, British Columbia, Canada - sak_2011_1_27_50m (Metagenome Metatranscriptome)	Environmental
3300029689	Metatranscriptome of saline water microbial communities from Sakinaw Lake, British Columbia, Canada - sak_2010_1_5_55m (Metagenome Metatranscriptome)	Environmental
3300029891	Groundwater microbial communities from Horonobe Underground Research Laboratory (URL), Japan - horonobe_ig2158	Environmental
3300031227	Saline water microbial communities from Ace Lake, Antarctica - #232	Environmental
3300031278	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1603-170	Environmental
3300031280	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1603-240	Environmental
3300031331	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1603-150	Environmental
3300031351	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1603-190	Environmental
3300031365	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - SW1601-220	Environmental
3300031379	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1603-220	Environmental
3300031539	Soil microbial communities from Risofladan, Vaasa, Finland - UN-3	Environmental
3300031553	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - Salt Marsh Sediment SW1601-240	Environmental
3300031593 (restricted)	Freshwater sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - TDP2	Environmental
3300031643	Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - Salt Marsh Sediment SW1601-30	Environmental
3300031673	Soil microbial communities from Risofladan, Vaasa, Finland - TR-3	Environmental
3300031862	Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_40	Environmental
3300032020	Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G14_18	Environmental
3300032046	Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_40	Environmental





Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID	Sample Taxon ID	Habitat	Sequence
Ga0052246_10063613	3300003141	Marine Sediment	MKWTPDKVTTLILIVGCLVLLFTGRDGEVKSILTVAAGWLFGTGYMDIKRGGK*
Ga0052245_10000964	3300003143	Marine Sediment	MSWTPDKICAVILIVGCLLLVAFRIDGEVKSILTISAGYLFGTGIMEYRTKKKGK*
Ga0052245_10027111	3300003143	Marine Sediment	MKWTPDKVTALILVVGCLVLIFTGRDSEVKSILTVAAGWLFGASYVDIKRGGKK*
Ga0052245_10233782	3300003143	Marine Sediment	MKKWPPDKVTALVLIVGCLALAFCGIDGEIKAILTISATYLFTVSYQERHKQA*
Ga0052245_10304991	3300003143	Marine Sediment	TPDKVCAVILIVGCLLLVAFRIDGEVKSILTISAGYLFGTGIMEYRTKKKGK*
Ga0052245_10370192	3300003143	Marine Sediment	MKWTPDKVTALILVVGCLVLIFTGRDSEVKSILTVAAGWLFGASYVDIKRGGKI
Ga0052244_10068633	3300003144	Marine Sediment	MKWSPDKITALILIIGCLGLLFTGIDSEVKSILTIAAGYLFGTAIAEKKK*
Ga0066636_100701722	3300005236	Groundwater	MKWTPDKITALILIVGCLALIFTGKDAEVKSILTVAAGWLFGSTYAERRKK*
Ga0066636_102321773	3300005236	Groundwater	MKWTPDKITALILIIGCFGLLFTGIDSEVKSILTVAAGWLFGSAYIEKRKK*
Ga0075125_100820173	3300005935	Saline Lake	MKWTPDKVTALILVVGCLVLIFTGLNSEVKSILTISAGWLFGTGYVEIKRGGKK*
Ga0075125_103677151	3300005935	Saline Lake	MKWTPDKITAMVLIAGCLALLFTGIDGEVKSILTISAGYLFGTSVFDRRSR*
Ga0100403_10627732	3300008255	Aquifer	MKWTPDKITALILIIGCFGLLFTGIDSEVKSILTVAAGWLFGSAYSEKRKK*
Ga0100403_10814533	3300008255	Aquifer	MKKWSPDKITAVILLVGCFGLLFTGIDGEVKSILTIAAGYLFGTSIMKRKK*
Ga0100403_11590601	3300008255	Aquifer	MKWSPDKITALILIVGSLLLIFTGKDSEVKSILTVAAGWLFGSAYQERKKR*
Ga0100403_13263183	3300008255	Aquifer	MKWSPDKITALILIVGCLTLIFTGKDSEVKSILTIAAGWLFGSTYTERKKR*
Ga0115371_111639693	3300008470	Sediment	MMKWTPDKITALVLVVGCMGLLFTGIDGEVKSILTISAGYLFGVGIAERKPS*
Ga0115371_113090282	3300008470	Sediment	MRWTPDKITAMVLVVGCLVLIFTGRDGEVKSIPTMAGGWLFGTGYAEIKKGGKK*
Ga0111034_10136143	3300008517	Marine Sediment	MKWTPDKITALILLVGCLGLIFTGIDGEVKSILTVSAGYLFGTGVADYRIKKPPKDN*
Ga0114918_1001374612	3300009149	Deep Subsurface	MKNWTPDKITAMVLVIGCLALIFSGIDGEVKSILTLSAGWLFGGAYISAKTK*
Ga0114918_104597271	3300009149	Deep Subsurface	MKLWTPDKITAMIIVVGCLLLIFSGIDGEVKAILATAAGWLFGKSTLDAKGKTQ*
Ga0114930_100450457	3300009499	Deep Subsurface	MKWTPDKITALVLIIGCLALLFTRIDGEVKSILTVAAGWLFGSSYAEIRKGGEKK*
Ga0114930_101606403	3300009499	Deep Subsurface	MKWTPDKITALVLIAGCLALRFTGIDGEVMAILAMAAAWLFGSTYTERKGKGG
Ga0114930_105605872	3300009499	Deep Subsurface	LKWTPDKITALVLIVGCFALLFTGIDGEVKSILTLSAGYFFGVSYTERKK*
Ga0114920_1000089819	3300009528	Deep Subsurface	MKWSPDKITALVLICGCLALLFTGIDGEVKSILTIAAGYFFGVSYADRKK*
Ga0114919_1000941910	3300009529	Deep Subsurface	MKWTPDKITALVLIVGCLALLFTGIDGEVKSVLTLAAGYFFGVSYAERKK*
Ga0114919_100291023	3300009529	Deep Subsurface	MKWTPDKITALVLIAGCLVLLFTGIDGEVKSILTLAAGYFFGVSYSERKK*
Ga0114919_100465833	3300009529	Deep Subsurface	LKWTPDKITALILIVGCLGLIFTGTDSEVKSILTMAAGWLFGATYSERSKKGGK*
Ga0114919_100774666	3300009529	Deep Subsurface	LKWAPDRITAMILVIGCLVLIFTGRNSEVKSILTVAAGWLFGTAYMERPKKGGK*
Ga0114919_101619841	3300009529	Deep Subsurface	LKWTPDKITALILVVGCLGLIFTGRNSEVKSILTMAAGWLFGTAYVERQKKGGK*
Ga0114919_104006953	3300009529	Deep Subsurface	LKWTPDKITALILVVGCLGLIFTGRNSEVKSILTMAAGWLFGTAYMERPKKGGK*
Ga0114919_105197643	3300009529	Deep Subsurface	MKWSPDKITALILIVGCLALVFTGIDGEVKSILTIAAGWLFGSSYAEIRKGGKKE*
Ga0114919_105417921	3300009529	Deep Subsurface	MRWTPDKVTALILVCGCLALLFTGINGEVKSILAIAAGWLFGGAFMERMKAKGDK*
Ga0114919_108079942	3300009529	Deep Subsurface	MKWSPDKITALVLLVGCLILLFTGIDGEVKSILTLSAGYFFGVSYSERRKR*
Ga0114919_108079972	3300009529	Deep Subsurface	MKWSPDKITALVLLVGCLALVFTGIDGEVKAILTIAAGWLFGSAYAERRQK*
Ga0114919_110970931	3300009529	Deep Subsurface	MKWTPDKITALILIVGCLALIFTGRDSEVKSILTVAAGWLFGTAYADRPKKGGK*
Ga0114919_111061123	3300009529	Deep Subsurface	PDKITALVLIVGCLTLIFTGRDSEVKSILTVAAGWLFGATYSERTTKKGGESK*
Ga0163202_11219711	3300013086	Freshwater	MSKWTPDKITALILICSCIALMACRIDGEVKSILTIASGYLFGVGIQEHGKHNGGVK*
(restricted) Ga0172365_100400864	3300013127	Sediment	MKPVWTPDKITAMVLVLGCLGLIYSGIDGEVKSILTLAAGWLFGGAYVERTRRP*
(restricted) Ga0172365_101664672	3300013127	Sediment	MKWTPDKITALILVVGCLGLRFTGIDSEVMSILTIAAGWLFGSTYIERRKKGGK*
(restricted) Ga0172365_103171421	3300013127	Sediment	MKLVWTPDKITAMVLVLGCMGLIYSGIDGEVKAILTVAAGWLFGGAYIERTRRP*
(restricted) Ga0172365_106184183	3300013127	Sediment	TALVLVVGCLGLRFTGIDSEVMSILTIAAGWLFGSAYTEIRAKGGK*
(restricted) Ga0172365_107670282	3300013127	Sediment	MKWTPDKITALVLIVGCLGLRFTGIDTEVMSILIMAAGWFFGSTYTERKGKGG*
(restricted) Ga0172366_100868896	3300013128	Sediment	TAVILLLGCLGLLYTGIDGEVKSILTIAAGYLFGVSIHEREKQT*
(restricted) Ga0172366_102762162	3300013128	Sediment	MKWTADKITAVILLCGCFALLFTGIDGEVKAILTLAAGWLFGGAYVERTRRP*
(restricted) Ga0172366_103398944	3300013128	Sediment	MKPVWTPDKITAMVLVLGCLGLIYCGIDGEVKSILTVAAGWLFGGAYVERTRRP*
(restricted) Ga0172366_108748851	3300013128	Sediment	TYTGGAIMKWTPDKITALILVVGCLGLRFTGIDSEVMSILTIAAGWLFGSTYIERRKKGGK*
(restricted) Ga0172364_101655505	3300013129	Sediment	MKWTADKITAVILLCGCFALLFTGIDGEVKAILTLAAGWLFGGAYVERTRR
(restricted) Ga0172364_101821162	3300013129	Sediment	MKWTPDKVTALILVVGCLGLRFTGIDSEVMSILTIAAGWLFGSTYIERRKKGGK*
(restricted) Ga0172364_102080273	3300013129	Sediment	TAVILLIGCFGLLYAGIDGEVKSILTIAAGYLFGVSIHEREKQT*
(restricted) Ga0172364_103102533	3300013129	Sediment	MKKWTPDKVLAAILLIGCLALIATGIDGEVKSILAMAGGWLFHASYATIHHKGE*
(restricted) Ga0172364_103124755	3300013129	Sediment	AMVLVLGCLGLIYSGIDGEVKAILTVAAGWLFGGAYVERTRRP*
(restricted) Ga0172364_104294272	3300013129	Sediment	MKKWTPDKITAVILLLGCLGLLYTGIDGEVKSILTIAAGYLFGVSIHERGKQT*
(restricted) Ga0172364_104337882	3300013129	Sediment	MKKWSPDQVAALILIIGCLTLLAAGIDTEVKSILTIAAGYLFGTRIKSKSSPK*
(restricted) Ga0172364_108171223	3300013129	Sediment	WTPDKITALILVVGCLGLRLTGIDSEVMSILTIAAGWLFGSAYIEKKSKGGQ*
(restricted) Ga0172364_108836243	3300013129	Sediment	MKWTPDKITALVLLVGCLVLLFTGIDGEVKSILTLSAGYFFGVSYAERTRKGGTK*
Ga0180010_10120381	3300014148	Groundwater	MKWSPDKITALVLIAGCFVLLFTGIDGEVKSILTLAAGYFFGTAIAERRKR*
Ga0180008_10571341	3300014613	Groundwater	MKWTPDKITALVLIAGCFVLLFAGIDGEVKSILTIAAGYFFGTS
Ga0180008_11456292	3300014613	Groundwater	MKKWTPDKVTAVILIVGCLGLLFTGIDGEVKSILTIAAGYLFGVGIAEKQK*
Ga0180007_101386925	3300014656	Groundwater	MKWTPDKLTALVLVVGCLVLVFTGIDGEVKTILTMSATYLFVTGVVDYRAKKQSKND*
Ga0180007_101586162	3300014656	Groundwater	MKKWTPDKVTAVILIAGCLGLLFTGIDGEVKSILTIAAGYLFATGIAERKK*
Ga0180007_102889372	3300014656	Groundwater	MKWTPDKITALVLLAGCFVLLFTGIDGEVKSILLIAAGYLFGTAIVDRRKK*
Ga0180007_103410543	3300014656	Groundwater	MKWTPDKITALILIVGCFGLLFTGIDGEVKYILLIAAGYLFGTAIVDRRQ
Ga0180301_100434652	3300014911	Marine Sediment	MHWTADKITALVLVVCCTILLFTGIDGEVKSILTMAAGYFFGASIRDKVLD*
Ga0180009_101588771	3300015370	Groundwater	MKWSPDKITALVLIVGCFVLLFTGIDGEVKSILLIAAGYLFGTAIVDRRKK*
Ga0180009_103176192	3300015370	Groundwater	MKWTPDKITALVLIAGCFVLLFTGIDGEVKSILTLAAGYFFGTSITERRK
Ga0180435_107092523	3300017992	Hypersaline Lake Sediment	MNWTPDKITALVLLVGCLVLVFTGIDGEVKAILTIAAGWLFGAAYAERTTKKGG
Ga0224514_100349773	3300022217	Sediment	MNWSPDKITALVLVAGCLALVFTGIDGEVKAILTISAGYLFGVGIAEKKQQGGK
Ga0212124_100606654	3300022553	Freshwater	MSKWTPDKITALILICSCIALMACRIDGEVKSILTIASGYLFGVGIQEHGKYNGGVK
Ga0210003_10244354	3300024262	Deep Subsurface	MKNWTPDKITAMVLVIGCLALIFSGIDGEVKSILTLSAGWLFGGAYISAKTK
Ga0209978_104288471	3300024263	Deep Subsurface	PDKITAMVLVIGCLVLIFTGRDSEVKSILTVAAGWLFGTAYAEKKKK
Ga0209979_10510974	3300024353	Deep Subsurface	MKWTPDKITALVLIIGCLALLFTRIDGEVKSILTVAAGWLFGSSYAEIRKGGEKK
Ga0209991_1000081518	3300024429	Deep Subsurface	MKWSPDKITALVLICGCLALLFTGIDGEVKSILTIAAGYFFGVSYADRKK
Ga0209986_100109951	3300024433	Deep Subsurface	MKWSPDKITALVLLVGCLALVFTGIDGEVKAILTIAAGWLFGSAYAERRQK
Ga0209986_100227486	3300024433	Deep Subsurface	MKWTPDKITALVLIAGCLVLLFTGIDGEVKSILTLAAGYFFGVSYSERKK
Ga0209986_100482353	3300024433	Deep Subsurface	LKWTPDKITALILIVGCLGLIFTGTDSEVKSILTMAAGWLFGATYSERSKKGGK
Ga0209986_102163513	3300024433	Deep Subsurface	LKWTPDKITALILVVGCLGLIFTGRNSEVKSILTMAAGWLFGTAYMERPKKGGK
Ga0209986_102700612	3300024433	Deep Subsurface	LKWAPDRITAMILVIGCLVLIFTGRNSEVKSILTVAAGWLFGTAYMERPKKGGK
Ga0210043_10284373	3300025018	Aquifer	MKWTPDKITALILIIGCFGLLFTGIDSEVKSILTVAAGWLFGSAYSEKRKK
Ga0210043_10612371	3300025018	Aquifer	MKWTPDKITALILIVGCLALIFTGKDAEVKSILTVAAGWLFGSTYAERRKK
Ga0210043_10952962	3300025018	Aquifer	MKWTPDKITALILIVGSLLLIFTGKDSEVKSILTVAAGWLFGSAYQERKKR
Ga0210056_10962971	3300025022	Groundwater	MKWTPDKITALILIIGCFGLLFTGIDSEVKSILTVAAGWLFGSAYIEKRKK
Ga0208046_10987753	3300025285	Freshwater	MSKWTPDKITALILICSCIALMACRIDGEVKSILTIASGYLFGVGIQEHGKHNGGVK
Ga0207997_11656761	3300025736	Saline Lake	MKWTPDKVTALILVVGCLVLIFTGLNSEVKSILTISAGWLFGTGYVEIKRGGKK
Ga0210051_11173432	3300025868	Groundwater	MKWTPDKITALILIIGCFGLLFTGIDSEVKSILTVAAGWLFGSTYSEKRKK
Ga0214474_10568603	3300027740	Soil	MDKWFLNWTPDKITALTLLIGCLVLVFTGIDGEVKSILTLAAGWLFGSAFADRAKAQKS
Ga0209121_100584443	3300027742	Marine	MKWSPDKITAMVLVIGCLILIFTGKDSEVKSILTIAAGWLFGSAYIEKKKR
Ga0209636_100210847	3300027893	Marine Sediment	MKWTPDKITALILVVGCLVLRFTGIDSEVMSILTMAAGWLFGSAYTERKAKGGT
Ga0257139_10531113	3300028620	Marine	WTPDKITAMVLVIGCLALIFTGIDGEVKSILTLSAGWLFGGAYIAAKTK
Ga0257143_10565941	3300028670	Marine	MKWTPDKITAMVLVIGCLALIFTGIDGEVKSILTLSAGWLFGGAYIAAKTK
Ga0257138_10324353	3300029689	Marine	CERSTTMKWTPDKITAMVLVIGCLALIFTGIDGEVKSILTLSAGWLFGGAYIAAKTK
Ga0246100_1334063	3300029891	Groundwater	MKKWSPDKITAVILIVGCFGLLFTGIDGEVKAILTIAAGYLFGTSIIERKK
Ga0307928_101541933	3300031227	Saline Water	MKWTPDKVTALILIVGCLVLIFTGRDSEVKSILTVAAGWLFGASYVDIKRGGKK
Ga0307431_100164114	3300031278	Salt Marsh	MSWTPDKVTALILVVGCLGLIAFNIDSEVKSILTVAAGWLFGTGYAEIRTKRRNK
Ga0307428_100125512	3300031280	Salt Marsh	LKWTPDKITALILIIGCLTLIFTGMDSEVKSILTLGAGWLFGSSYSERSTKGGK
Ga0307428_11406692	3300031280	Salt Marsh	MKWTPDKITALVLIAGCLALLFTGIDGEVKSILTLAAGYFFGVSYTERKK
Ga0307432_100255110	3300031331	Salt Marsh	MSWTPDKVTALILVVGCLGLIAFNIDSEVKSILTVAAGWLFGAGYAEIRTKRRNK
Ga0307427_100182013	3300031351	Salt Marsh	MSWTPDKVTALILVVGCLGLIAFNIDSEVKSILTVAAGWLFGMGYAEIRTKRRNK
Ga0307443_100201713	3300031365	Salt Marsh	VKWTPDKITALVLIAGCLALLFTGIDGEVKSILTLAAGYFFGVSYTERKK
Ga0307443_10043156	3300031365	Salt Marsh	MKWTADKITALVLLCGCLGLLFTGIDGEVKAILTIAAGWLFGGAYSERRKGKEG
Ga0307434_10165587	3300031379	Salt Marsh	MSWTPDKVIALILVIGCLGLVFTGIDSEVKSILAMAAGYLFGAGYTERKDKKKNK
Ga0307380_1000909312	3300031539	Soil	MKKWTPDKVTALILVVSCCYLLFTGIDGEVKSILTIAATYLFTTGIADRNK
Ga0307380_104811713	3300031539	Soil	MKWTPDKVTALILVVGCLVLIFTGRNSEVKSILTIAAGWLFGTSYIEIKKGGKK
Ga0307380_110020212	3300031539	Soil	MKKWSPDKITALVLILSCVYLLCSGIDGEVKSILTISAGYLFATGVLERQK
Ga0315547_10024458	3300031553	Salt Marsh Sediment	MQWTPDKVTALILVVGCLALIFTGRDSEVKSILTVAAGWLFGTGYAEIKKGGAK
Ga0315547_10428133	3300031553	Salt Marsh Sediment	MKWSPDKITAVILVAGCLALLFTGRNSEVKSILTVAAGWLFGTAYMERPKKGGK
(restricted) Ga0315307_12796362	3300031593	Sediment	MKWTPDKITALVLVVGCLGLRFSGIDSEVISILSIAAGWLFGAAYTERTRKGGT
Ga0315533_100222910	3300031643	Salt Marsh Sediment	MKWTPDKITALVLIVGCLALLFTGIDGEVKSILTLAAGYFFGVSYAERKK
Ga0315533_10268981	3300031643	Salt Marsh Sediment	MKWTPDKITALVLIAGCLVLLFTGIDGEVKSILTLAAGYFFGVSYTERKK
Ga0307377_104993801	3300031673	Soil	MKKWSPDKITALVLILSCVYLLCSGIDGEVKSILTISAGYLFATGVIERQ
Ga0315280_100569203	3300031862	Sediment	MKLEWDMIIALVLVVGCLGLLFTGIDSEVKSILTVAAGWAFGSVYSKRRKKGG
Ga0315296_101114873	3300032020	Sediment	MSWTPDKLCAIILIVGCLLLVAFHIDSEVKSILTISAGYLFGTGFSEIRAKKKTK
Ga0315289_106649181	3300032046	Sediment	MDKWLLTWTPDKITALTLMVGCLVLIFTGIDGEVKSILTLAAGWLFGTAFADRSKAQKS
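
Some sequences in the table end with `*` and some do not, so lengths are easiest to compare after removing that character. The Python sketch below is a hedged illustration, not NMPFamsDB code: the function name `residue_length` is hypothetical, and it assumes the trailing `*` is a terminator marker (a common convention for translated protein sequences) rather than a residue. The three example sequences are copied from the table above.

```python
from statistics import mean


def residue_length(sequence: str) -> int:
    """Count residues, ignoring any trailing '*' marker (assumed terminator)."""
    return len(sequence.rstrip("*"))


# Three family members copied from the Family Sequences table.
example_sequences = [
    "MKWTPDKITALVLIVGCLALLFTGIDGEVKSVLTLAAGYFFGVSYAERKK*",  # ends with '*'
    "MKWTPDKITALVLIVGCLALLFTGIDGEVKSILTLAAGYFFGVSYAERKK",   # no '*'
    "MKWTPDKITALVLIAGCFVLLFAGIDGEVKSILTIAAGYFFGTS",         # shorter entry
]

example_lengths = [residue_length(seq) for seq in example_sequences]
print("lengths:", example_lengths)
print("mean length of these examples:", mean(example_lengths))
```

Applied to all 112 rows, the same function would give a mean that could be compared with the "Average Sequence Length" reported in Basic Information; the three rows here are only a sample.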




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice