NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F100682

Metagenome Family F100682

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F100682
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 39 residues
Representative Sequence MPEATAEDWAKQNKLRQEWLAANPDAEYEGWMSI
Number of Associated Samples 54
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Viruses
% of genes with valid RBS motifs 47.52 %
% of genes near scaffold ends (potentially truncated) 17.65 %
% of genes from short scaffolds (< 2000 bps) 93.14 %
Associated GOLD sequencing projects 51
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Duplodnaviria (79.412 % of family members)
NCBI Taxonomy ID 2731341
Taxonomy All Organisms → Viruses → Duplodnaviria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton
(21.569 % of family members)
Environment Ontology (ENVO) Unclassified
(77.451 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(76.471 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60
1B570J29590_1101121
2B570J29032_1095162941
3Ga0068877_101217302
4Ga0068877_106704491
5Ga0068877_107372912
6Ga0068876_101131542
7Ga0068876_101657532
8Ga0068876_103572502
9Ga0068876_103734712
10Ga0068872_103070362
11Ga0068872_103273112
12Ga0068872_107517971
13Ga0049081_101572741
14Ga0079957_101971013
15Ga0079957_10887832
16Ga0079957_11169292
17Ga0079957_11848484
18Ga0079957_12945261
19Ga0075467_104080252
20Ga0075458_101171321
21Ga0102861_10901872
22Ga0104986_164714
23Ga0105746_11293371
24Ga0105747_11591221
25Ga0114339_10976293
26Ga0114340_11929811
27Ga0114340_12527341
28Ga0114340_12581151
29Ga0114347_10109232
30Ga0114355_11709401
31Ga0114841_10995231
32Ga0114336_11583754
33Ga0114353_12189021
34Ga0114363_10992811
35Ga0114363_11009061
36Ga0114363_11110333
37Ga0114363_11189502
38Ga0114363_11507232
39Ga0114363_11511543
40Ga0114363_11542242
41Ga0114363_11591921
42Ga0114363_11766173
43Ga0114363_12085711
44Ga0114363_12177931
45Ga0114363_12230342
46Ga0114363_12430211
47Ga0114878_11667292
48Ga0114878_12643852
49Ga0114876_10887683
50Ga0114876_11023642
51Ga0114876_11111652
52Ga0114876_11969381
53Ga0114876_12319922
54Ga0114876_12507302
55Ga0114876_12617212
56Ga0114880_10468206
57Ga0114880_10949844
58Ga0114880_10953053
59Ga0114880_11383951
60Ga0114880_11407492
61Ga0114880_11909561
62Ga0114880_12020391
63Ga0114880_12052921
64Ga0105103_107456081
65Ga0114978_103279283
66Ga0114981_104910242
67Ga0105097_102775483
68Ga0114974_101010384
69Ga0114976_102877273
70Ga0164293_103259522
71Ga0164293_108436852
72Ga0172367_100381573
73Ga0172367_103772002
74Ga0172367_104036891
75Ga0172367_104975152
76Ga0172367_105239621
77Ga0172373_103798092
78Ga0177922_110768051
79Ga0172376_102191792
80Ga0208364_10233011
81Ga0207939_10221563
82Ga0208147_11450012
83Ga0255082_10480722
84Ga0209246_101955241
85Ga0209972_104070361
86Ga0209990_101113143
87Ga0315900_103140931
88Ga0315900_109753741
89Ga0315909_105605252
90Ga0315906_107189002
91Ga0315903_101498865
92Ga0315903_106706041
93Ga0335005_0772402_283_405
94Ga0334987_0168727_347_493
95Ga0334995_0239440_937_1056
96Ga0335027_0381463_766_921
97Ga0335027_0503038_80_199
98Ga0335027_0562964_531_650
99Ga0335029_0053334_413_559
100Ga0335036_0330681_398_517
101Ga0335063_0199301_778_897
102Ga0335048_0616653_347_499
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 27.42%    β-sheet: 0.00%    Coil/Unstructured: 72.58%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530MPEATAEDWAKQNKLRQEWLAANPDAEYEGWMSISequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
90.2%9.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Freshwater
Freshwater
Freshwater Lake
Freshwater Lentic
Freshwater
Freshwater, Plankton
Freshwater Lake
Freshwater Lake
Freshwater
Lake
Freshwater
Freshwater
Aqueous
Estuary Water
Estuarine
12.7%18.6%21.6%16.7%3.9%4.9%5.9%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
B570J29590_11011213300002278FreshwaterWDTNVMPIATAADWAKQNELRQEWLANNPDATYIGWTSI*
B570J29032_10951629413300002408FreshwaterMPIATTEDWARQNKMRQEWLAANPDAEYEGWMSI*
Ga0068877_1012173023300005525Freshwater LakeMPIATAEDWAKQNKLRQECLANNPDATYIGWTSI*
Ga0068877_1067044913300005525Freshwater LakeMDGARMPEATAEDWAKQNKLRQEWLAANPDAEYEGWMSI*
Ga0068877_1073729123300005525Freshwater LakeIAHLGLSSRVVAGVRMPEATAEDWAKQNKLRQEWLANNPNAEYEGWMSI*
Ga0068876_1011315423300005527Freshwater LakeMPIATAEDWIKQNKLRQEWLDNNPDAQYLGWVSI*
Ga0068876_1016575323300005527Freshwater LakeVVAGARMPEATAEDWAKQNQLREEWLANNPDAEYIGWMSI*
Ga0068876_1035725023300005527Freshwater LakeMPIAKAEDWARQNKMRQEWLAVNPDAEYEGWMSI*
Ga0068876_1037347123300005527Freshwater LakeVVDGGGMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI*
Ga0068872_1030703623300005528Freshwater LakeMPIATAEDWARQNKMRQEWLDANPDAEYEGWMSI*
Ga0068872_1032731123300005528Freshwater LakeVAVGVRMPETTAEDWAKQNKLRQEWLANNPNAEYEGWMSI*
Ga0068872_1075179713300005528Freshwater LakeMPIATAEDWAKQNKLRQEWLANNPDATYIGWTSI*
Ga0049081_1015727413300005581Freshwater LenticYLRVADGARMPEATALDWAKQNALREQWLIDNPDAQYIGWMSI*
Ga0079957_1019710133300005805LakeMPIATAEDWARQNKMRQEWLAANPDAEYEGWMSI*
Ga0079957_108878323300005805LakeMPIATAEDWARQNKMRQEWLTAHPDAEYEGWMSI*
Ga0079957_111692923300005805LakeMPVATAEDWIKQNKLRQEWLDNNPDAQYLGWVSI*
Ga0079957_118484843300005805LakeMPIATAEDWAKQNKLRQEWLADNPDAEYEGWMSI*
Ga0079957_129452613300005805LakeVAVGPRMPEATAEDWAKQNQLRQEWLANNPDADYIGWTSI*
Ga0075467_1040802523300006803AqueousVADGVRMPEATAEDWAKQNALREQWLIDNPDAQYIGWMSI*
Ga0075458_1011713213300007363AqueousLGLSSKVVDGGRMPEATPEDWARQNKMRQEWLAAHPDAEYEGWMSI*
Ga0102861_109018723300007544EstuarineMMAENDIDWEWQNALREEWLANNPDAEYQGWLSI*
Ga0104986_1647143300007734FreshwaterMPVATAEDWIKQNKLRQEWLAANPDATYIGWTSI*
Ga0105746_112933713300007973Estuary WaterESIAHLGLYLRVADGVRMPEATAADWAKQNKLRQKWLAANPDADYIGWTSI*
Ga0105747_115912213300007974Estuary WaterVVAGVRMPEATAEDWAKQNKLRQEWLAANPDADYIGWTSI*
Ga0114339_109762933300008106Freshwater, PlanktonMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI*
Ga0114340_119298113300008107Freshwater, PlanktonMPIATAEDWAKQNKLRQEWLAANPDATYIGWTSI*
Ga0114340_125273413300008107Freshwater, PlanktonVAVGVRMPEATAEDWARQNKMRQEWLANNPDATYIGWTSI*
Ga0114340_125811513300008107Freshwater, PlanktonMPEATPEDWARQNKMRQEWLAANPDAEYEGWMSI*
Ga0114347_101092323300008114Freshwater, PlanktonMPIATAEDWAKQNKLRQEWLADNPDATYIGWTSI*
Ga0114355_117094013300008120Freshwater, PlanktonMPEATAEDWAKQNQLREEWLANNPDAEYIGWMSI*
Ga0114841_109952313300008259Freshwater, PlanktonMPIATAEDWAKQNELRQEWLANNPDATYIGWTSI*
Ga0114336_115837543300008261Freshwater, PlanktonVRLGLSSRVAVGVRMPEATAEDWAKQNKMRQEWLAANPDADYIGWTSI*
Ga0114353_121890213300008264Freshwater, PlanktonMPIATAEDWAKQNKLRQELLANNPDATYIGWTSI*
Ga0114363_109928113300008266Freshwater, PlanktonMPIATAEDWVKQNKMRQEWLDANPDAEYEGWMSI*
Ga0114363_110090613300008266Freshwater, PlanktonMPIATAEDWAEQNKLRQEWLANNPDATYIGWTSI*
Ga0114363_111103333300008266Freshwater, PlanktonMPIATAEDWAKQNKMRQEWLDANPDAEYEGWMSI*
Ga0114363_111895023300008266Freshwater, PlanktonVADGARMPEATAEDWAKQNQLRQEWLANNPDADYIGWTSI*
Ga0114363_115072323300008266Freshwater, PlanktonVVAGVRMPEATAEDWAKQNKLRQEWLANNPDAAYIGWTSI*
Ga0114363_115115433300008266Freshwater, PlanktonMPIATAEDWARQNKMRQEWLDAHPDAEYEGWMSI*
Ga0114363_115422423300008266Freshwater, PlanktonVADGARMPEATAEDWAKQNQLRQEWLANNPDAEYEGWMSI*
Ga0114363_115919213300008266Freshwater, PlanktonMPIATAEDWAKQNKLRQEWLANNADATYIGWTSI*
Ga0114363_117661733300008266Freshwater, PlanktonMPVATAEDWAKQNKLRQEWLANNPDATYIGWTSI*
Ga0114363_120857113300008266Freshwater, PlanktonMPIATAEDWAKQNKMRQEWLDANPDADYEGWMSI*
Ga0114363_121779313300008266Freshwater, PlanktonVHLGLSSKVVAGGRMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI*
Ga0114363_122303423300008266Freshwater, PlanktonMMPIATAEDWAQQNKLREEWLEANPNAEYEGWMSI*
Ga0114363_124302113300008266Freshwater, PlanktonVHLGLSLKVGDGGRMPEATPEDWARQNKMRQEWLAANPDAE
Ga0114878_116672923300008339Freshwater LakeMPEATAEDWAKQNKLRQEWLANNPDAAYIGWTSI*
Ga0114878_126438523300008339Freshwater LakeVADGVRMPEATAEDWAKQNKLRQEWLANNPDATYIGWTSI*
Ga0114876_108876833300008448Freshwater LakeVHLGSSSKVVDGGRMPEVTPEDWARQNKMRQEWLAANPDAEYEGWMSI*
Ga0114876_110236423300008448Freshwater LakeMPEATAEDWARQNKMRQEWLNANPNAEYEGWMSI*
Ga0114876_111116523300008448Freshwater LakeMPIATAEDWAKQNKLRQEWLINNPDATYIGWTSI*
Ga0114876_119693813300008448Freshwater LakeVAVGAKMPEATAEDWAKQNKLRQEWLANNPNADYIGWTSI*
Ga0114876_123199223300008448Freshwater LakeVHLGLSLKVVDGGRMPEATPEDWARQNKMRQEWLAANPDAEYEGWMSI
Ga0114876_125073023300008448Freshwater LakeMPEATAEDWAKQNKLRQEWLAANPDAEYEGWMSI*
Ga0114876_126172123300008448Freshwater LakeMPIATAEDWVKQNKLRQEWLANNPDAEYEGWMSI*
Ga0114880_104682063300008450Freshwater LakeMPEATAEDWVKQNKLRQEWLAANPDATYIGWTSI*
Ga0114880_109498443300008450Freshwater LakeMPIATAEDWAKQNKLRQEWLAANPDAEYEGWMSI*
Ga0114880_109530533300008450Freshwater LakeMPIATAEDWAKHNKLRQEWLDANPDAEYEGWMSI*
Ga0114880_113839513300008450Freshwater LakeMPIATPEDWAKQNKMRQEWLDANPDAEYEGWMSI*
Ga0114880_114074923300008450Freshwater LakeVHLGSFSKVVAGGRMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI*
Ga0114880_119095613300008450Freshwater LakeVHLDSSSKVVDGGRMPEATPEDWARQNKMRQEWLAAHPDADYIGWTSI*
Ga0114880_120203913300008450Freshwater LakeAGGRMPEATAEDWVKQNKMRQEWLAANPDAEYEGWMSI*
Ga0114880_120529213300008450Freshwater LakeVADGARMPEATAEDWAKQNKLRQEWLANNPDATYIGWTSI*
Ga0105103_1074560813300009085Freshwater SedimentVAGVRMPEATAEDWAKQNKLRQEWLANNPDADYIGWTSI*
Ga0114978_1032792833300009159Freshwater LakeALNVIYKCQGCTAHLVSYSKVADGVRMPEATAADWAYQNALRKQWLIDNPDAQYLGWVSI
Ga0114981_1049102423300009160Freshwater LakeSYSKVADGVRMPEATAADWARQNALREQWLIDNPDAQYIGWMSI*
Ga0105097_1027754833300009169Freshwater SedimentMPEATAEDWAKQNQLRQEWLANNPDATYIGWTSI*
Ga0114974_1010103843300009183Freshwater LakeVADGVRMPEATAEDWAKQNALHEQWLLDNPDAKYLGWVSI*
Ga0114976_1028772733300009184Freshwater LakeQECIAHLVSYSKVADGVRMPEATAADWAKQNALREQWLIDNPDAQYLGWVSI*
Ga0164293_1032595223300013004FreshwaterVVAGVKMPEATAEDWAKQNKLRQEWLANNPDAEYIGWMSI*
Ga0164293_1084368523300013004FreshwaterVGVGVKMPEATAEDWLKQNKLRQEWLADNPDAEYIGWMSI*
(restricted) Ga0172367_1003815733300013126FreshwaterMPVATPEDWIKQIQMRQEWLEANPNAEYIGWTSI*
(restricted) Ga0172367_1037720023300013126FreshwaterMPEMTAEDWQKQNELRKQWLLENPDAEYIGWTSI*
(restricted) Ga0172367_1040368913300013126FreshwaterMPEATPEDWIKQNQMRLEWLEANPDAEYEGWMSI*
(restricted) Ga0172367_1049751523300013126FreshwaterMPEATPEDWIKQNQMRQEWLEANPDAEYIGWTSI*
(restricted) Ga0172367_1052396213300013126FreshwaterMPEATPEDWIKQNELRKQWLLDNPDAEYIGWTSI*
(restricted) Ga0172373_1037980923300013131FreshwaterMPEATREDWIKQNQMRQEWLEANPDAEYEGWMSI*
Ga0177922_1107680513300013372FreshwaterMPIATAEDWAKQNKMRQEWLAANPDAEYEGWMSI*
(restricted) Ga0172376_1021917923300014720FreshwaterMGESTDIDWAYQEKLRQEWLAANPDAQYPGWTSI*
Ga0208364_102330113300020533FreshwaterVVAGVKMPEATAEDWAKQNKLRQEWLANNPDATYIGWTSI
Ga0207939_102215633300020536FreshwaterCTAHLVSYLKVADGVRMAEATAEDWAKQNALRAQWLIDNPDAQYIGWMSI
Ga0208147_114500123300025635AqueousAHLGLSSKVVDGGRMPEATPEDWARQNKMRQEWLAAHPDAEYEGWMSI
Ga0255082_104807223300027139FreshwaterSKVADGVRMPEATAEDWAKQNALREQWLIDNPDAHYLGWVSI
Ga0209246_1019552413300027785Freshwater LakeECTAHLVLYLKVADGVRMAEATAEDWAKQNELHKQWLIDNPDAQYIGWMSI
Ga0209972_1040703613300027793Freshwater LakeVVDGGGMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI
Ga0209990_1011131433300027816Freshwater LakeVVAGARMPEATAEDWAKQNQLREEWLANNPDAEYIGWMSI
Ga0315900_1031409313300031787FreshwaterVADGARMPEATAEDWAKQNQLRQEWLANNPDAEYEGWMSI
Ga0315900_1097537413300031787FreshwaterVVAGVRMPEATAEDWARQNKMRQEWLNANPNAEYEGWMSI
Ga0315909_1056052523300031857FreshwaterVHLGSFSKVVAGGRMPEATPEDWARQNKMRQEWLDAHPDAEYEGWMSI
Ga0315906_1071890023300032050FreshwaterVVAGVRMPEATAEDWAKQNKLRQEWLANNPDAAYIGWTSI
Ga0315903_1014988653300032116FreshwaterVADGVRMPEATAEDWAKQNKLRQEWLANNPDATYIGWTSI
Ga0315903_1067060413300032116FreshwaterVAVGARMPEATAEDWARQNKMRQEWLANNPDADYIGWTSI
Ga0335005_0772402_283_4053300034022FreshwaterVVAGVKMPEATAEDWAKQNKMRQEWLANNPNADYIGWTSI
Ga0334987_0168727_347_4933300034061FreshwaterMHLGLSSRVVAGVRMPEATAEDWAKQNKLRQEWLANNPDADYIGWTSI
Ga0334995_0239440_937_10563300034062FreshwaterVAGVKMPEATAEDWVIQNKLRQEWLADNPDADYIGWTSI
Ga0335027_0381463_766_9213300034101FreshwaterESIAHLGLSSRVVAGVRMPEATTEDWALQNKMRQEWLNANPNAEYEGWMSI
Ga0335027_0503038_80_1993300034101FreshwaterMAGARMPEATAEDWAKQNKLRQEWLANNPDADYIGWTSI
Ga0335027_0562964_531_6503300034101FreshwaterVAGVKMPEATAEDWAKQNKLRQEWLAANPNADYIGWTSI
Ga0335029_0053334_413_5593300034102FreshwaterMHLGLYLRVVAGVRMPEATAEDWAKQNKLRQEWLANNPDADYIGWTSI
Ga0335036_0330681_398_5173300034106FreshwaterVAGVKMPEATAEDWAKQNKLRQEWLANNPDATYIGWTSI
Ga0335063_0199301_778_8973300034111FreshwaterVAGVKMPEATAEDWAKQNKLRQEWLANNPDADYIGWTSI
Ga0335048_0616653_347_4993300034356FreshwaterYIAHLVLYSKVAVGVRMPEATTEDWALQNKLREEWLANNPDAQYMGWVSI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.