NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F088887



Overview

Basic Information
Family ID F088887
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 55 residues
Representative Sequence MPLWLRRTTFNLMKEHYDKQNEEAEKQQNMLKNKTGSKDIARPNIAPKPNYIAKAPKK
Number of Associated Samples 61
Number of Associated Scaffolds 109
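
The representative sequence can be checked against the reported average length with a few lines of Python. This is a minimal sketch, not part of the database: the `summarize` helper is a hypothetical name, and the sequence string is copied from the entry above.

```python
from collections import Counter

# Representative sequence copied from the Basic Information entry above.
REPRESENTATIVE = "MPLWLRRTTFNLMKEHYDKQNEEAEKQQNMLKNKTGSKDIARPNIAPKPNYIAKAPKK"


def summarize(seq: str) -> dict:
    """Return the length and the three most frequent residues of a sequence."""
    counts = Counter(seq)
    return {"length": len(seq), "top_residues": counts.most_common(3)}


summary = summarize(REPRESENTATIVE)
print(summary["length"])        # 58, slightly above the family average of 55
print(summary["top_residues"])  # lysine (K) is the most frequent residue
```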

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 64.49 %
% of genes near scaffold ends (potentially truncated) 26.61 %
% of genes from short scaffolds (< 2000 bps) 46.79 %
Associated GOLD sequencing projects 61
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.36


Most Common Taxonomy
Group Unclassified (81.651 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake (26.605 % of family members)
Environment Ontology (ENVO) Unclassified (66.972 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline) (86.239 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 34.88%, β-sheet: 0.00%, Coil/Unstructured: 65.12%

Predicted 3D Structure

pTM-score: 0.36

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
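
The low-confidence rule quoted above (pTM < 0.7) can be written as a one-line check. This is only an illustrative sketch: the constant and function names are hypothetical, and the page says nothing about how models at or above the threshold are labelled.

```python
# Threshold taken from the "Low Quality Model" note above (pTM < 0.7).
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model's pTM-score falls below the quoted threshold."""
    return ptm < LOW_CONFIDENCE_PTM


family_ptm = 0.36  # pTM-score reported for family F088887
print(is_low_confidence(family_ptm))  # True
```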



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF04984 | Phage_sheath_1 | 14.68
PF06841 | Phage_T4_gp19 | 5.50
PF02562 | PhoH | 0.92
PF01541 | GIY-YIG | 0.92
PF06745 | ATPase | 0.92
PF01510 | Amidase_2 | 0.92
PF00877 | NLPC_P60 | 0.92
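
The frequencies above can be turned back into approximate scaffold counts. The sketch below assumes that each percentage is simply the number of family scaffolds carrying the domain divided by 109; the page does not state this explicitly, and `approx_count` is a hypothetical helper.

```python
# Total scaffolds in the family, from "Number of Associated Scaffolds".
FAMILY_SCAFFOLDS = 109

# Neighboring Pfam domains and their reported % frequency.
pfam_freq = {
    "PF04984": 14.68,  # Phage_sheath_1
    "PF06841": 5.50,   # Phage_T4_gp19
    "PF02562": 0.92,   # PhoH
    "PF01541": 0.92,   # GIY-YIG
    "PF06745": 0.92,   # ATPase
    "PF01510": 0.92,   # Amidase_2
    "PF00877": 0.92,   # NLPC_P60
}


def approx_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a percentage of scaffolds into the nearest whole count."""
    return round(percent / 100 * total)


counts = {pfam_id: approx_count(pct) for pfam_id, pct in pfam_freq.items()}
for pfam_id, n in counts.items():
    print(pfam_id, n)
```

Under that assumption, the phage tail sheath domain appears next to the family gene on about 16 scaffolds, and each 0.92 % entry corresponds to a single scaffold.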

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG3497 | Phage tail sheath protein FI | Mobilome: prophages, transposons [X] | 14.68
COG0791 | Cell wall-associated hydrolase, NlpC_P60 family | Cell wall/membrane/envelope biogenesis [M] | 0.92
COG1702 | Phosphate starvation-inducible protein PhoH, predicted ATPase | Signal transduction mechanisms [T] | 0.92
COG1875 | Predicted ribonuclease YlaK, contains NYN-type RNase and PhoH-family ATPase domains | General function prediction only [R] | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 81.65 %
All Organisms | root | All Organisms | 18.35 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000882|FwDRAFT_10151576 | Not Available | 929
3300001847|RCM41_1000680 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 7370
3300001848|RCM47_1065827 | Not Available | 1355
3300001968|GOS2236_1088465 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2850
3300002842|contig_10017 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 3754
3300005664|Ga0073685_1000005 | Not Available | 45909
3300006484|Ga0070744_10065459 | Not Available | 1058
3300006484|Ga0070744_10176330 | Not Available | 611
3300006875|Ga0075473_10210891 | Not Available | 784
3300007538|Ga0099851_1080037 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 1258
3300007973|Ga0105746_1085812 | Not Available | 1023
3300008116|Ga0114350_1000340 | Not Available | 44353
3300008117|Ga0114351_1018788 | Not Available | 5406
3300008120|Ga0114355_1174762 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Phycisphaerae → unclassified Phycisphaerae → Phycisphaerae bacterium | 728
3300008266|Ga0114363_1001618 | Not Available | 17434
3300008266|Ga0114363_1028443 | Not Available | 3211
3300008267|Ga0114364_1000861 | Not Available | 18368
3300008267|Ga0114364_1001129 | Not Available | 16127
3300008267|Ga0114364_1001832 | Not Available | 18048
3300008267|Ga0114364_1003880 | Not Available | 11829
3300008267|Ga0114364_1004726 | Not Available | 14342
3300008267|Ga0114364_1009930 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 4424
3300008267|Ga0114364_1010741 | Not Available | 6048
3300008267|Ga0114364_1028397 | Not Available | 3230
3300008267|Ga0114364_1045797 | Not Available | 1601
3300008267|Ga0114364_1145881 | Not Available | 661
3300008267|Ga0114364_1187543 | Not Available | 522
3300008448|Ga0114876_1002823 | Not Available | 11996
3300008448|Ga0114876_1031465 | Not Available | 2589
3300008448|Ga0114876_1123639 | Not Available | 988
3300008448|Ga0114876_1233103 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 589
3300008450|Ga0114880_1012609 | Not Available | 4121
3300008450|Ga0114880_1142945 | Not Available | 871
3300008459|Ga0114865_1002028 | Not Available | 12438
3300010348|Ga0116255_10622177 | Not Available | 695
3300011115|Ga0151514_10004 | Not Available | 159567
3300011335|Ga0153698_1220 | Not Available | 26907
(restricted) 3300013126|Ga0172367_10113945 | Not Available | 1868
(restricted) 3300013127|Ga0172365_10090775 | Not Available | 1957
(restricted) 3300013129|Ga0172364_10344649 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 966
(restricted) 3300013131|Ga0172373_10015223 | Not Available | 8632
(restricted) 3300013133|Ga0172362_10035144 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 4015
3300017722|Ga0181347_1011528 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2861
3300017761|Ga0181356_1001339 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 10764
3300017761|Ga0181356_1015131 | Not Available | 2889
3300017761|Ga0181356_1041808 | Not Available | 1607
3300017761|Ga0181356_1079616 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1090
3300017774|Ga0181358_1004484 | Not Available | 6029
3300017774|Ga0181358_1026076 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2306
3300017774|Ga0181358_1027199 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2252
3300017774|Ga0181358_1097061 | Not Available | 1060
3300017774|Ga0181358_1111995 | Not Available | 968
3300017774|Ga0181358_1128012 | Not Available | 886
3300017777|Ga0181357_1087742 | Not Available | 1187
3300017777|Ga0181357_1121307 | Not Available | 980
3300017777|Ga0181357_1158461 | Not Available | 830
3300017778|Ga0181349_1122599 | Not Available | 956
3300017784|Ga0181348_1096060 | Not Available | 1161
3300017784|Ga0181348_1162330 | Not Available | 828
3300017784|Ga0181348_1169171 | Not Available | 805
3300017784|Ga0181348_1174603 | Not Available | 787
3300017784|Ga0181348_1286461 | Not Available | 556
3300019784|Ga0181359_1002055 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 5713
3300019784|Ga0181359_1004224 | Not Available | 4539
3300019784|Ga0181359_1007001 | Not Available | 3805
3300019784|Ga0181359_1026457 | Not Available | 2225
3300019784|Ga0181359_1026904 | Not Available | 2208
3300019784|Ga0181359_1114601 | Not Available | 972
3300020048|Ga0207193_1126728 | Not Available | 2222
3300020074|Ga0194113_10187540 | Not Available | 1672
3300020161|Ga0211726_10093128 | Not Available | 2890
3300020179|Ga0194134_10053746 | Not Available | 2202
3300020183|Ga0194115_10046158 | Not Available | 2814
3300020183|Ga0194115_10184703 | Not Available | 1046
3300020183|Ga0194115_10192321 | Not Available | 1015
3300020190|Ga0194118_10014458 | Not Available | 6483
3300020190|Ga0194118_10056073 | Not Available | 2569
3300020193|Ga0194131_10042288 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 3311
3300020193|Ga0194131_10200944 | Not Available | 940
3300020196|Ga0194124_10134682 | Not Available | 1349
3300020197|Ga0194128_10422328 | Not Available | 629
3300020198|Ga0194120_10016424 | Not Available | 8045
3300020200|Ga0194121_10072490 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2437
3300020200|Ga0194121_10290886 | Not Available | 836
3300020204|Ga0194116_10067507 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Nematoda → Chromadorea → Rhabditida → Spirurina → Spiruromorpha → Filarioidea → Onchocercidae → Wuchereria → Wuchereria bancrofti | 2423
3300020214|Ga0194132_10307301 | Not Available | 840
3300020221|Ga0194127_10477932 | Not Available | 810
3300020603|Ga0194126_10183663 | Not Available | 1524
3300021424|Ga0194117_10305607 | Not Available | 750
3300022179|Ga0181353_1014124 | Not Available | 2056
3300022407|Ga0181351_1053759 | Not Available | 1674
3300022407|Ga0181351_1216727 | Not Available | 625
3300023174|Ga0214921_10005723 | Not Available | 17436
3300023174|Ga0214921_10010371 | Not Available | 11534
3300023174|Ga0214921_10037742 | Not Available | 4544
3300023174|Ga0214921_10040602 | Not Available | 4303
3300023174|Ga0214921_10077197 | Not Available | 2664
3300023179|Ga0214923_10000035 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 168946
3300023184|Ga0214919_10000053 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 170255
3300023184|Ga0214919_10196109 | Not Available | 1523
3300024346|Ga0244775_10479756 | Not Available | 1017
3300025646|Ga0208161_1020955 | Not Available | 2462
3300031758|Ga0315907_10037612 | Not Available | 4325
3300031787|Ga0315900_10020700 | Not Available | 7641
3300032053|Ga0315284_11393395 | Not Available | 753
3300032092|Ga0315905_10322915 | Not Available | 1474
3300034060|Ga0334983_0346786 | Not Available | 873

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
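
Each scaffold identifier in the table pairs an IMG/M taxon OID with a scaffold name, joined by a `|` character. The sketch below shows one way to split the identifier and to flag short scaffolds using the 2000 bp cut-off quoted in the Quality Assessment. The function name and the three example rows are chosen here for illustration only.

```python
# Cut-off from the Quality Assessment line "short scaffolds (< 2000 bps)".
SHORT_SCAFFOLD_BP = 2000


def parse_scaffold(scaffold_id: str) -> tuple[str, str]:
    """Split 'taxonOID|scaffoldName' into its two parts."""
    taxon_oid, name = scaffold_id.split("|", 1)
    return taxon_oid, name


# Three (scaffold ID, length in bp) pairs copied from the table above.
examples = [
    ("3300000882|FwDRAFT_10151576", 929),
    ("3300001847|RCM41_1000680", 7370),
    ("3300011115|Ga0151514_10004", 159567),
]

short = [sid for sid, length in examples if length < SHORT_SCAFFOLD_BP]

for sid, length in examples:
    taxon_oid, name = parse_scaffold(sid)
    print(taxon_oid, name, length, "short" if sid in short else "long")
```

The taxon OID obtained this way is the same key used in the Associated Samples table, so it can serve to look up the sample a scaffold came from.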




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Freshwater Lake | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake | 26.61%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 25.69%
Freshwater, Plankton | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton | 14.68%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 10.09%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 3.67%
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 2.75%
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 2.75%
Estuarine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine | 2.75%
Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 1.83%
Marine Plankton | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Marine Plankton | 1.83%
Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Lake Sediment | 0.92%
Freshwater | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater | 0.92%
Freshwater And Marine | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater And Marine | 0.92%
Aquatic | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Aquatic | 0.92%
Stormwater Retention Pond | Environmental → Aquatic → Freshwater → Pond → Unclassified → Stormwater Retention Pond | 0.92%
Estuary Water | Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water | 0.92%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 0.92%
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 0.92%
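
The habitat rows can be rolled up to a coarser level of the GOLD ecosystem path by summing the distribution values that share a prefix. The following is a sketch under the assumption that the listed percentages are additive across rows; the `aggregate` helper is a hypothetical name.

```python
from collections import defaultdict

SEP = " → "

# (GOLD ecosystem path, % of family members), copied from the table above.
rows = [
    ("Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake", 26.61),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake", 25.69),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton", 14.68),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater", 10.09),
    ("Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment", 3.67),
    ("Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater", 2.75),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous", 2.75),
    ("Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine", 2.75),
    ("Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater", 1.83),
    ("Environmental → Aquatic → Freshwater → Lotic → Unclassified → Marine Plankton", 1.83),
    ("Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Lake Sediment", 0.92),
    ("Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater", 0.92),
    ("Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater And Marine", 0.92),
    ("Environmental → Aquatic → Freshwater → Lotic → Unclassified → Aquatic", 0.92),
    ("Environmental → Aquatic → Freshwater → Pond → Unclassified → Stormwater Retention Pond", 0.92),
    ("Environmental → Aquatic → Marine → Coastal → Unclassified → Estuary Water", 0.92),
    ("Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine", 0.92),
    ("Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge", 0.92),
]


def aggregate(rows, depth):
    """Sum percentages over the first `depth` levels of each ecosystem path."""
    totals = defaultdict(float)
    for path, pct in rows:
        key = SEP.join(path.split(SEP)[:depth])
        totals[key] += pct
    return dict(totals)


by_level3 = aggregate(rows, 3)
for key, pct in sorted(by_level3.items(), key=lambda kv: -kv[1]):
    print(f"{key}: {pct:.2f}%")
```

At the third level, freshwater paths account for roughly 92 % of family members and marine paths for about 7 %, with the small remainder coming from the single engineered sample. The rounded values sum to slightly more than 100 because each row is rounded independently.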




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000882 | Freshwater microbial communities from the Columbia River | Environmental
3300001847 | Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM41. ROCA_DNA251_0.2um_TAP-D_2a | Environmental
3300001848 | Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM47, ROCA_DNA265_0.2um_TAP-S_3a | Environmental
3300001968 | Marine microbial communities from Lake Gatun, Panama - GS020 | Environmental
3300002842 | Stormwater retention pond microbial communities from Williamsburg, VA - Sample from Junction Rte 99 and John Tyler Highway | Environmental
3300005664 | Freshwater viral communities from Emiquon reservoir, Havana, Illinois, USA | Environmental
3300006484 | Estuarine microbial communities from the Columbia River estuary, USA - metaG S.535 | Environmental
3300006875 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_0.19_N_>0.8_DNA | Environmental
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental
3300007973 | Coastal water column microbial communities from Columbia River Estuary, Oregon, USA - CMOP_DNA_1460A_0.2um | Environmental
3300008116 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE2, Sample E2014-0106-3-NA | Environmental
3300008117 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample E2014-0108-C-NA | Environmental
3300008120 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample E2014-0108-3-NA | Environmental
3300008266 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample HABS-E2014-0108-C-NA | Environmental
3300008267 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, sample HABS-E2014-0024-100-LTR | Environmental
3300008448 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - August 4, 2014 all contigs | Environmental
3300008450 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - Oct 27, 2014 all contigs | Environmental
3300008459 | Freshwater viral communities during cyanobacterial harmful algal blooms (CHABs) in Western Lake Erie, USA - July 8, 2014 all contigs | Environmental
3300010348 | AD_HKYLca | Engineered
3300011115 | Freshwater viral communities from Lake Soyang, Gangwon-do, South Korea - SYL_2016May | Environmental
3300011335 | Lotic viral community from Han River, Hwacheon, Gangwon-do, South Korea - Guman | Environmental
3300013126 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m | Environmental
3300013127 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm | Environmental
3300013129 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 10cm | Environmental
3300013131 (restricted) | Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10m | Environmental
3300013133 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment s1_kivu2a2 | Environmental
3300017722 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.S.N | Environmental
3300017761 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.N | Environmental
3300017774 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.S.D | Environmental
3300017777 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM110.D.N | Environmental
3300017778 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.S.D | Environmental
3300017784 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM110.D.N | Environmental
3300019784 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MM15.S.D | Environmental
3300020048 | Microbial communities from Manganika and McQuade lakes, Minnesota, USA Combined Assembly of Gp0225457, Gp0225456, Gp0225455, Gp0225454, Gp0225453, Gp0224915 | Environmental
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental
3300020161 | Freshwater lake microbial communities from Lake Erken, Sweden - P4710_101 megahit1 | Environmental
3300020179 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015056 Kigoma Offshore 0m | Environmental
3300020183 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface | Environmental
3300020190 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015013 Mahale N5 surface | Environmental
3300020193 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015053 Kigoma Offshore 120m | Environmental
3300020196 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015031 Kigoma Deep Cast 0m | Environmental
3300020197 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015037 Kigoma Deep Cast 65m | Environmental
3300020198 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015019 Mahale Deep Cast 65m | Environmental
3300020200 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015020 Mahale Deep Cast 50m | Environmental
3300020204 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015008 Mahale S9 surface | Environmental
3300020214 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015054 Kigoma Offshore 80m | Environmental
3300020221 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015036 Kigoma Deep Cast 100m | Environmental
3300020603 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015035 Kigoma Deep Cast 150m | Environmental
3300021424 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015009 Mahale N1 surface | Environmental
3300022179 | Freshwater viral communities from Lake Michigan, USA - Fa13.VD.MLB.D.N | Environmental
3300022407 | Freshwater viral communities from Lake Michigan, USA - Su13.VD.MM15.S.D | Environmental
3300023174 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1505 | Environmental
3300023179 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1510 | Environmental
3300023184 | Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1503 | Environmental
3300024346 | Whole water sample coassembly | Environmental
3300025646 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (SPAdes) | Environmental
3300031758 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA123 | Environmental
3300031787 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA114 | Environmental
3300032053 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_16 | Environmental
3300032092 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 4 MA121 | Environmental
3300034060 | Freshwater microbial communities from Lake Mendota, Madison, Wisconsin, United States - TYMEFLIES-ME16May2013-rr0016 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
FwDRAFT_101515762 | 3300000882 | Freshwater And Marine | MPLWLRRTTFFMIKEHYDKEAEEYEKQNRTLKNKGKGEISRPNIAPSPTYSTKAPKK*
RCM41_10006804 | 3300001847 | Marine Plankton | MPLWLRKTTFFMIKEHYDREAEEHDKQNKMLKNNGKSEIARPNIAPAPTYTAKAPKK*
RCM47_10658272 | 3300001848 | Marine Plankton | MPLWLRRTTFNLIKEHYDKQNEEMEKQQNMLKNKSNNKDIARPNIAPKPTYTTKAPRK*
GOS2236_10884653 | 3300001968 | Marine | MPLWLRRITFNLMKEHYDKENEEIEKQNNMLKNKTNSKDIARPNIAPTPNYTTKAPKK*
contig_100172 | 3300002842 | Stormwater Retention Pond | MPLWLRKTTFYMIKEYYDKQNEEAEKQQNMMKNKGKSEISRPNVTPKSTYTTKAD*
Ga0073685_100000533 | 3300005664 | Aquatic | MPLWLRRTTFNLMKEHYDKQNEENEKQQSMLKNKKDTSVSRPNISPPTYTTKAPKK*
Ga0070744_100654592 | 3300006484 | Estuarine | MPLWLRKTTFTLIKNYYEKQNEAAEKQQNMLKNKSGNKDISRPNIAPQPNYTTKAPRK*
Ga0070744_101763302 | 3300006484 | Estuarine | MPIWLRRTTFNLLKEHFDKENEANEKQSNMMKNNGKSKEIARPNITPSPNYTAKAPRK*
Ga0075473_102108912 | 3300006875 | Aqueous | MPLWLRRTTFNMIKEFYDKEAEEAEKQQKQLKNNGKSEIARPGIAPKSTPSYTTKAPKK*
Ga0099851_10800371 | 3300007538 | Aqueous | IWLRRFTFNKLKEHFDKQNEEAEKQQNMLSNKQNATKEISRPNIAPTYTTSKAPKK*
Ga0105746_10858121 | 3300007973 | Estuary Water | MPLWLRRTTFNLIKEYYDKQNEENEKQQNILKNTSKKDIARPNIAPTYTAK
Ga0114350_100034033 | 3300008116 | Freshwater, Plankton | MPLWLRKTTFNLIKEHYDKQNEEAEKQSNILKNKTGSKDIARPNIAPTYTAKVPKK*
Ga0114351_10187887 | 3300008117 | Freshwater, Plankton | MPLWLRKTTFNLIKEHYDKQNEEAEKQNNMLKNKTGNKDISRPNIAPTYTAKVPKK*
Ga0114355_11747622 | 3300008120 | Freshwater, Plankton | MPLWLRKTTFNLIKEHYDKQNEEAEKQNNMLKNKTGNKDISRPNIAPTPNYVAKAPKK*
Ga0114363_10016189 | 3300008266 | Freshwater, Plankton | MPIWLRKTTFNMIKEFYDKEAEEHEKQNKLLKNNGKNEISRPNIAPPVPPTYKVKAPKK*
Ga0114363_10284432 | 3300008266 | Freshwater, Plankton | MIKEHYDKEAEEYEKHNNMMKNKGKSEIARPNITPTPTYTAKAPKK*
Ga0114364_10008612 | 3300008267 | Freshwater, Plankton | MIKEHYDKEAEEYEKQNRTLKNKGKGEISRPNIAPSPTYSTKAPKK*
Ga0114364_10011299 | 3300008267 | Freshwater, Plankton | MIKEHYDKEAEEQDKQNNMLKNKGKSEIARPSIAPKPNYTSKAPKK*
Ga0114364_10018328 | 3300008267 | Freshwater, Plankton | MPVWLRRTTFNMIKEFYEKEAEEAEKQQSQLNNKSKIDVSRPAISQKSTPTYSTKAPKK*
Ga0114364_10038803 | 3300008267 | Freshwater, Plankton | MPLWLRRTTYNLIKEYYDKEAEEHEKQQNMLKNKSGNSKSVSRPNIAPNPTYTAKAPKK*
Ga0114364_10047269 | 3300008267 | Freshwater, Plankton | MIKEFYDKEAEEAEKHQKQLKNGGGKKGELARPNISRTPTYTTKAPKK*
Ga0114364_10099302 | 3300008267 | Freshwater, Plankton | MPLWLRRTTFNLIKEHYDKETEEYEKQQNMLKNKSGSSKSVSKPNIAPKPTYTAKAPKK*
Ga0114364_10107414 | 3300008267 | Freshwater, Plankton | MPLWLRRTTFNLIKEYYDKQNEENEKQQNILKNTSKKDIARPNIAPTYTAKVPKK*
Ga0114364_10283972 | 3300008267 | Freshwater, Plankton | MPLWLRRTTFNLMREYYEKQNEAQEKQNNILKNSGKNKEISRPNISPPTYVTKAPKK*
Ga0114364_10457971 | 3300008267 | Freshwater, Plankton | MPLWLRKTTFVLIKEYYDNQNEEMEKQNNALKNKSNISRPNVTPPNYTAKAPRK*
Ga0114364_11458812 | 3300008267 | Freshwater, Plankton | MPLWLRRTTFNLMKEHYDKEAEEQEKQQNSLKNKGKGNLSRPNIAPPPTYTAKAPRK*
Ga0114364_11875432 | 3300008267 | Freshwater, Plankton | MPLWLRRTTFNLIKEHYDKEAEEYEKQQNTLKNKSGSKSVSKPNIAPKPTYTAKAPRK*
Ga0114876_10028233 | 3300008448 | Freshwater Lake | MPLWLRRTTFNLLKEHYDKEAEEYDKQKNMMNNKGKNEIARPNITPDYIAKAPRK*
Ga0114876_10314653 | 3300008448 | Freshwater Lake | MPLWLRRTTFNLLKEHYDKEKEEADKQQNMLNNKGKNEIARPNITSTPDYTTKAPRK*
Ga0114876_11236392 | 3300008448 | Freshwater Lake | MPLWLRRTTFNLMKEHYDKQNEEAEKQQNMLSNKGKKEIARPNISPDYIAKAPKK*
Ga0114876_12331032 | 3300008448 | Freshwater Lake | MPLWLRRTTFNLLKEHYDKEKEEADKQQNMLNNKGKNEIARPNISSAPDYVVKAPRK*
Ga0114880_10126094 | 3300008450 | Freshwater Lake | MMKEHYDKEAEEQEKQNNMLKNNGKHEIARPNITSAPKPNYTAKVPKK*
Ga0114880_11429452 | 3300008450 | Freshwater Lake | MPLWLRRTTFNLMKEYYEKEAEEAEKQNKMLKNASGKSEINRPNIAPTPNYTTKVSKK*
Ga0114865_10020288 | 3300008459 | Freshwater Lake | MIKEFYEKEAEEAEKQQSQLNNKSKIDVSRPAISQKSTPTYSTKAPKK*
Ga0116255_106221772 | 3300010348 | Anaerobic Digestor Sludge | MPLWLRRTTFNLLKEYYDKEKEKIETQQNSIKNKKDIAKPNIAPNYITKAPKK*
Ga0151514_10004184 | 3300011115 | Freshwater | MPLWLRRTTFNLLNEHFTKQNEEAEKQQNMLKNKSSKEISRPNIAPSPTYTSKAPRK*
Ga0153698_122012 | 3300011335 | Freshwater | MIKEFYDKEAEEAEKQQKTLKNGNSKGEITRPNISPKSPTYTSKAPRK*
(restricted) Ga0172367_101139454 | 3300013126 | Freshwater | MPLWLRKTTFNLIKEHYDKQNEEAEKQSNILKNKTGNKDISRPNIAPTYTAKVPKK*
(restricted) Ga0172365_100907752 | 3300013127 | Sediment | LWLRKTTFNLIKEHYDKQNEEAEKQSNILKNKTGNKDISRPNIAPTYTAKVPKK*
(restricted) Ga0172364_103446492 | 3300013129 | Sediment | MKEYYDKEKEEVEKQNNMLKNTTGNKDIARPNIAPPPNYVAKAPKK*
(restricted) Ga0172373_100152234 | 3300013131 | Freshwater | MPIWLRKTTFNLMKEHYDKQNEEAEKQQNMLKNKTGASKNISRPNITPSYTAKVPKK*
(restricted) Ga0172362_100351441 | 3300013133 | Sediment | MEEKKLIKEFDKEKEEVEKQNNMLKNTTGNKDIARPNIAPPPNYVAKAPKK*
Ga0181347_10115281 | 3300017722 | Freshwater Lake | LRRFTFNKLKEHFDKQNEEAEKQQNMLKNKQTSSKEISRPNIAPTYTTSKAPKR
Ga0181356_10013391 | 3300017761 | Freshwater Lake | DKEAEEAEKQQKTLKNGNKGEVSRPNISPKTPNYTSKAPKK
Ga0181356_10151311 | 3300017761 | Freshwater Lake | FNKLKEHFDKQNEEAEKQQNMLKNKQTPSKEISRPNIAPNYVTSKTPKNG
Ga0181356_10418081 | 3300017761 | Freshwater Lake | MPLWLRRTTFNLLNEHFNKQNEETEKQQNMLKNNKSSKDISRPNIANPTYTAKVPRK
Ga0181356_10796162 | 3300017761 | Freshwater Lake | MPLWLRRATFNMIKEFYDKEAEEAEKQQKTLKNGSKGEVSRPNISPKTPNYTSKAPRK
Ga0181358_10044842 | 3300017774 | Freshwater Lake | MPLWLRRTTFNLLNEHFTKQNEEAEKQQNMLKNKSGKEVVRPNIAPSPTYTSKAPRKXGL
Ga0181358_10260763 | 3300017774 | Freshwater Lake | PIWLRRFTFNKLKEHFDKQNEEAEKQQNMLKNKQTSSKEISRPNIAPTYTTSKAPKK
Ga0181358_10271992 | 3300017774 | Freshwater Lake | MQIWLRRFTFNKLKEHFDKQNEEAEKQQNMLKNKQTSSKEISRPNIAPTYTTSKAPKK
Ga0181358_10970612 | 3300017774 | Freshwater Lake | WLRRTTFNLLNEHFNKQNEETEKQQNMLKNNKSSKDISRPNIANPTYTAKVPRK
Ga0181358_11119952 | 3300017774 | Freshwater Lake | TFNKIKEHFDKQNEENEKQQNMLNNKQSTRKEIARPNIAPTYTTKAPKK
Ga0181358_11280122 | 3300017774 | Freshwater Lake | MPLWLRRTTFNLLKEHFDKQKEETEKHQNTLKNKSDIAKPNIAPTYVTKAPRK
Ga0181357_10877422 | 3300017777 | Freshwater Lake | MPLWLRRTTFNLIKEHYDKEAEEYEKQQNMLKNKSGSSKSVAKPNIAPKPTYTTKAPKK
Ga0181357_11213072 | 3300017777 | Freshwater Lake | FNLLKEHFDKENEANEKQSNMMKNNGKSKEIARPNITPSPTYTTKAPKK
Ga0181357_11584611 | 3300017777 | Freshwater Lake | IWLRRFTFNKLKEHFDKQNEEAEKQQNMLKNKQTPSKEISRPNIAPNYVTSKTPKNG
Ga0181349_11225992 | 3300017778 | Freshwater Lake | PIWLRRFTFNKLKEHFDKQNEEAEKQQNMLKNKQTSSKEISRPNIAPTYTTSKASKR
Ga0181348_10960601 | 3300017784 | Freshwater Lake | TTFNLIKEHYDKEAEEYEKQQNILKNKSNSSKSVSKPNIAPKPTYTAKAPKK
Ga0181348_11623301 | 3300017784 | Freshwater Lake | MPLWLRRTTFNLIKEYYDKEAEEYEKQQNMLKNKSGNSKSVAKPNIAPKPTYTAKAPRK
Ga0181348_11691712 | 3300017784 | Freshwater Lake | MPIWLRRTTFNLLKEHFDKENEANEKQSNMMKNNGKSKEIARPNITPSP
Ga0181348_11746031 | 3300017784 | Freshwater Lake | TFNKLKEHFDKQNEEAEKQQNMLKNKQTSSKEISRPNIAPTYTTSKASKR
Ga0181348_12864611 | 3300017784 | Freshwater Lake | FNLLKEHFDKENEANEKQSNMMKNNGKSKEIARPNITPSPNYTAKAPRK
Ga0181359_10020554 | 3300019784 | Freshwater Lake | MPIWLRKTTFNIMNEYFEKQNEETEKQQNMLKNKSGNKEISRPNIANPTYTTKAAKK
Ga0181359_10042242 | 3300019784 | Freshwater Lake | MPLWLRRTTFNMMKEFYDKEAEEAEKQQKTLKNGNKGEVSRPNISPKTPNYTSKAPKK
Ga0181359_10070012 | 3300019784 | Freshwater Lake | MPLWLRRTTFNLLNEHFNKQNEETEKQQNMLKNNKSSKEISRPNIANPTYTAKVPRK
Ga0181359_10264572 | 3300019784 | Freshwater Lake | MPLWLRRTTFNLLNEHFTKQNEEAEKQQNMLKNKSGKEVVRPNIAPSPTYTSKAPRK
Ga0181359_10269042 | 3300019784 | Freshwater Lake | MPLWLRRTTFNLIKEYYDKQNEENEKQQNILKNTSKKDIARPNIAPTYTAKVPKK
Ga0181359_11146012 | 3300019784 | Freshwater Lake | MPLWLRRTTFNLMNEYYEKQNEAQEKQNNMLNNKNKNEIARPNIAPNPTYTTKAAKK
Ga0207193_11267285 | 3300020048 | Freshwater Lake Sediment | MPLWLRKTTFNLLKEHFDKQKEESDKQQNTLKNKSNITKPNITPTYVTKAPKK
Ga0194113_101875402 | 3300020074 | Freshwater Lake | MPLWLRKTTFNLMKEHYDKENETIEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0211726_100931282 | 3300020161 | Freshwater | MPLWLRRTTFNLLNEHFNKQNEETEKQQNMLKNNKSGKNIVRPNIANPTYTAKVPRK
Ga0194134_100537462 | 3300020179 | Freshwater Lake | MKEHYDKENETIEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0194115_100461582 | 3300020183 | Freshwater Lake | MPLWLRRTTFNLIKEYYDKQNEEAEKQQNMLKNKNSSSKEVSRPNIAPTYIAKMPKK
Ga0194115_101847031 | 3300020183 | Freshwater Lake | MPIWLRKTTFNLMKEHYDKQNEEAEKQQNMLKNKTGASKNISRPNITPSYTAKVPKK
Ga0194115_101923212 | 3300020183 | Freshwater Lake | MPLWLRKTTFNLMREHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVTKALKK
Ga0194118_100144585 | 3300020190 | Freshwater Lake | MREHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVTKALKK
Ga0194118_100560732 | 3300020190 | Freshwater Lake | IYHMPLWLRKTTFNLMKEHYDKENETIEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPK
Ga0194131_100422881 | 3300020193 | Freshwater Lake | SLWLRRTTFNLMKEHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0194131_100463172 | 3300020193 | Freshwater Lake | MKEHYDKENEAVEKQQNMLKNKSGKEISRLNIAPKQQPTYVAKAPKK
Ga0194131_102009441 | 3300020193 | Freshwater Lake | MKEHYDKENEAVEKQNNMLKNKTGNKDISRPNITPSPNYVAKAPKK
Ga0194124_101346822 | 3300020196 | Freshwater Lake | KTTFNLMREHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVTKALKK
Ga0194128_103255212 | 3300020197 | Freshwater Lake | LMKEHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPIYTAKVPKK
Ga0194128_104223281 | 3300020197 | Freshwater Lake | FNLMREHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVTKALKK
Ga0194120_100164246 | 3300020198 | Freshwater Lake | MPLWLRRTTFNLMKEHYDKENEAVEKQNNMLKNKTGNKDISRPNITPSPNYVAKAPKK
Ga0194121_100724902 | 3300020200 | Freshwater Lake | YNMPLWLRKTTFNLMKEHYDKENEAVEKQQNMLKNKSGKEISRLNIAPKQQPTYVAKAPK
Ga0194121_102908862 | 3300020200 | Freshwater Lake | LRKTTFNLMKEHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPIYTAKVPKK
Ga0194116_100675071 | 3300020204 | Freshwater Lake | LWLRKTTFNLMKEHYDKQNEEVEKQQNMLKNKTGVSKDIARPNIAPTYTAKVPKK
Ga0194132_103073011 | 3300020214 | Freshwater Lake | KTTFNLMKEHYDKENETIEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0194127_104779322 | 3300020221 | Freshwater Lake | MPLWLRKTTFNLMREHYDKENEAVEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0194126_101836631 | 3300020603 | Freshwater Lake | TFNLMKEHYDKENETIEKQNNMLKNKTGSKDISRPNIAPSPNYVAKAPKK
Ga0194117_103056072 | 3300021424 | Freshwater Lake | KTTFNLMKEYYDKENEAVEKQNNMLKNKTGSKDISRPNIAPLPNYVAKAPKK
Ga0181353_10141242 | 3300022179 | Freshwater Lake | MPLWLRRTTFNLMKEHYDKQNEDIDKQNKMLSNANNKNIARPNIAPSPDYTIKAPKK
Ga0181351_10537592 | 3300022407 | Freshwater Lake | MPLWLRRTTFNLMKEHYDKEDEEIEKQNNILKNQTGTSKNVSRPNIAPTPNYIAKAPKK
Ga0181351_12167272 | 3300022407 | Freshwater Lake | MPIWLRRTTFNLLKEHFDKENEANEKQSNMMKNNGKSKEIARPNITPSPNYTAKAPRK
Ga0214921_100057239 | 3300023174 | Freshwater | MPLWLRRTTFNLIKEYYDAQNEAHEKQNNTLKNKGKSKEISRPNIAPPTYVTKAPKK
Ga0214921_100103714 | 3300023174 | Freshwater | MPLWLRRTTFNLMNEYYEKQNEEAEKQQRSLNNNGKNNISRPNIAPAPTYTTKAAKK
Ga0214921_100377421 | 3300023174 | Freshwater | LWLRKTTFNLIKEFYDKEAEEAENQQKTLKNGNKSEIARPNVTPKPTYTTKAAKK
Ga0214921_100406022 | 3300023174 | Freshwater | MPLWLRRTTFNLIKEFYDKEAEEYEKQNKTLNNGNKNEIARPNVTPKPTYTTKAAKK
Ga0214921_100771974 | 3300023174 | Freshwater | MPLWLRQTTFNLMKEHYDKEAEANEKQSNMLNNKNKSEISRPNIAPPPTYTTKAPKK
Ga0214923_10000035119 | 3300023179 | Freshwater | MPLWLRRTTFNLMKEHYDKQNEEAEKQQNMLKNKTGSKDIARPNIAPKPNYIAKAPKK
Ga0214919_1000005392 | 3300023184 | Freshwater | MPLWLRRTTFNMMKEHYDKEAEEAEKQQNMLKNNSKSEIARPNITPTPNYTTKAPKK
Ga0214919_101961092 | 3300023184 | Freshwater | MPLWLRRTTFNMIKEFYEKEAEETEKQQKTLKNGSKGEVSRPGIAPKSNPTYTSKAPKK
Ga0244775_104797562 | 3300024346 | Estuarine | MPLWLRKTTFTLIKNYYEKQNEAAEKQQNMLKNKSGNKDISRPNIAPQPNYTTKAPRK
Ga0208161_10209552 | 3300025646 | Aqueous | MPLWLRRTTFNLIKEYYDKQNEETEKQQNMLKNKTGSSNDIARPNIAPTYTAKVPKK
Ga0315907_100376122 | 3300031758 | Freshwater | MPLWLRKTTFNLIKEHYDKQNEEAEKQSNILKNKTGSKDIARPNIAPTYTAKVPKK
Ga0315900_100207004 | 3300031787 | Freshwater | MPLWLRKTTFNLIKEHYDKQNEEAEKQNNMLKNKTGNKDISRPNIAPTPNYVAKAPKK
Ga0315284_113933952 | 3300032053 | Sediment | MPLWLRRTTFNIIKEFYDNEAEAAEKQQKTLKNGNNKGEVARPNISRPNPTYTTKAAKN
Ga0315905_103229152 | 3300032092 | Freshwater | MPLWLRRTTFHMMKEHYDKEAEEQEKQNNMLKNNGKHEIARPNITSAPKPNYTAKVPKK
Ga0334983_0346786_2_163 | 3300034060 | Freshwater | LRRTTFNLLNEHFTKQNEDAEKQQNMLKNNKSSKDIARPNIANPTYTAKGPRK
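
Some of the listed sequences end with `*` (a stop-codon marker) and some do not begin with methionine. The sketch below strips the marker before measuring length and flags entries without a leading `M`. Treating a missing `M` as a sign of possible N-terminal truncation is an assumption made here for illustration, not something the database states, and the helper names are hypothetical.

```python
# Three family members copied from the table above (protein ID -> sequence).
records = {
    "FwDRAFT_101515762": "MPLWLRRTTFFMIKEHYDKEAEEYEKQNRTLKNKGKGEISRPNIAPSPTYSTKAPKK*",
    "Ga0114363_10284432": "MIKEHYDKEAEEYEKHNNMMKNKGKSEIARPNITPTPTYTAKAPKK*",
    "Ga0181356_10013391": "DKEAEEAEKQQKTLKNGNKGEVSRPNISPKTPNYTSKAPKK",
}


def clean(seq: str) -> str:
    """Remove a trailing '*' stop marker, if present."""
    return seq.rstrip("*")


def lacks_start(seq: str) -> bool:
    """Heuristic (assumption): no leading 'M' may indicate a 5'-truncated gene."""
    return not seq.startswith("M")


lengths = {pid: len(clean(seq)) for pid, seq in records.items()}
possibly_truncated = [pid for pid, seq in records.items() if lacks_start(seq)]

for pid, n in lengths.items():
    print(pid, n)
print("possibly truncated:", possibly_truncated)
```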




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice