NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101047

Metagenome / Metatranscriptome Family F101047

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101047
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 47 residues
Representative Sequence MGRMKELYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK
Number of Associated Samples 88
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Viruses
% of genes with valid RBS motifs 72.55 %
% of genes near scaffold ends (potentially truncated) 28.43 %
% of genes from short scaffolds (< 2000 bps) 76.47 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Duplodnaviria (64.706 % of family members)
NCBI Taxonomy ID 2731341
Taxonomy All Organisms → Viruses → Duplodnaviria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine
(18.628 % of family members)
Environment Ontology (ENVO) Unclassified
(47.059 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(52.941 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56
1M3P_101713623
2JGI12547J11936_11005292
3JGI12421J11937_101691621
4B570J14230_100933462
5JGI24766J26685_100170612
6B570J40625_10001347713
7B570J40625_10001761426
8JGI25908J49247_100167287
9JGI25908J49247_101257432
10JGI25920J50251_100938583
11JGI25926J51410_10261356
12JGI25923J51411_10451343
13Ga0007760_111830731
14Ga0068877_107677922
15Ga0068876_102801923
16Ga0068876_102909473
17Ga0068876_102956813
18Ga0049080_103036871
19Ga0078894_105850964
20Ga0078894_112581921
21Ga0073913_100847593
22Ga0079301_10058434
23Ga0075464_103743483
24Ga0075473_100856252
25Ga0102861_11064422
26Ga0102877_10373794
27Ga0102879_11044762
28Ga0102879_11578272
29Ga0102881_11498852
30Ga0102919_11026592
31Ga0102897_10788472
32Ga0102902_11593811
33Ga0102908_10907241
34Ga0105745_10516124
35Ga0105745_11778553
36Ga0114346_12507613
37Ga0114351_13621491
38Ga0114336_12413194
39Ga0104241_10030582
40Ga0104242_10359912
41Ga0102831_10660183
42Ga0114977_104913524
43Ga0114975_100312502
44Ga0114969_107033871
45Ga0114982_10254084
46Ga0153801_10041064
47Ga0157210_10121184
48Ga0138284_12837543
49Ga0164295_111812343
50Ga0177922_109282381
51Ga0181364_10156112
52Ga0211732_10490563
53Ga0211735_111522951
54Ga0208326_1249092
55Ga0208050_100114111
56Ga0208590_10154834
57Ga0208858_10196384
58Ga0208723_10379551
59Ga0214168_10216911
60Ga0222713_100268749
61Ga0222713_100385141
62Ga0222713_101955694
63Ga0222713_103645215
64Ga0222713_103903482
65Ga0222712_105276503
66Ga0214921_1000712617
67Ga0214921_1001649612
68Ga0214921_1001914716
69Ga0214919_102531004
70Ga0244777_106901884
71Ga0244775_106132563
72Ga0244776_100888852
73Ga0244776_100948796
74Ga0208147_10007057
75Ga0208009_10352741
76Ga0255065_10500195
77Ga0208800_10027062
78Ga0208928_10299753
79Ga0208024_10384882
80Ga0208172_10177684
81Ga0208168_10299783
82Ga0208556_10183896
83Ga0208788_100029730
84Ga0255072_10828111
85Ga0209552_11782282
86Ga0208960_10511342
87Ga0209599_1000236410
88Ga0209617_1000178325
89Ga0209355_102461712
90Ga0209990_100651871
91Ga0209191_10508367
92Ga0247723_10151284
93Ga0315907_103203633
94Ga0315908_112274033
95Ga0315900_104865723
96Ga0315909_101454056
97Ga0334992_0057842_76_222
98Ga0334998_0029524_3645_3791
99Ga0335000_0441866_3_119
100Ga0335030_0834302_372_536
101Ga0335055_0152541_926_1039
102Ga0334997_0832758_418_552
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 16.67%    β-sheet: 10.42%    Coil/Unstructured: 72.92%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530354045MGRMKELYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENKSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains




 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
88.2%11.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater
Freshwater And Sediment
Freshwater Lake
Freshwater Lentic
Freshwater And Sediment
Freshwater And Sediment
Freshwater
Freshwater, Plankton
Freshwater Lake
Freshwater
Freshwater
Lotic
Freshwater
Freshwater
Aqueous
Estuary Water
Estuarine
Estuarine
Estuarine Water
Sand
Deep Subsurface
Deep Subsurface Sediment
8.8%15.7%10.8%2.9%4.9%5.9%3.9%2.9%18.6%5.9%4.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
M3P_1017136233300000268LoticMGRMKELYTQILDCETCNGKGWVFFGNAKEYDVESCECNPLSFFQENK*
JGI12547J11936_110052923300000736Freshwater And SedimentMSKMKELYTQIFECDTCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
JGI12421J11937_1016916213300000756Freshwater And SedimentMVRMKELYTQILECETCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK*
B570J14230_1009334623300001282FreshwaterMGRMKELYTQILECDTCYGNGWLYYGDENTYDVEACQCNPLSFFQENK*
JGI24766J26685_1001706123300002161Freshwater And SedimentMGRMKELYTQILECDTCYGKGWLYYGNEEMFDVEACQCNPLGFFQENK*
B570J40625_100013477133300002835FreshwaterMGRMKEIYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK*
B570J40625_100017614263300002835FreshwaterMGRMKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK*
JGI25908J49247_1001672873300003277Freshwater LakeMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK*
JGI25908J49247_1012574323300003277Freshwater LakeMGRMKELYTQILECETCNGKGWVFFGNAKDYDVESCECNPLGFFQENK*
JGI25920J50251_1009385833300003404Freshwater LakeMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGKEN*
JGI25926J51410_102613563300003490Freshwater LakeMGKMKELYTQILECETCYGTGWLYYGDENNYDVEACQCNPLSFFQENK*
JGI25923J51411_104513433300003493Freshwater LakeMSRMKELYTQILECETCNGKGWVFFGNAKEYDVESCECNPLSFFQENK*
Ga0007760_1118307313300004793Freshwater LakeMGRMKELYTQILECEACNGQGWQFFGNETDYDVEACDCNPLGFFQE
Ga0068877_1076779223300005525Freshwater LakeMGRMKELYTQILECDTCYGKGWLYYGNEEMYDVEACQCNPLGFFQENK*
Ga0068876_1028019233300005527Freshwater LakeMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0068876_1029094733300005527Freshwater LakeMGRMKELYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK*
Ga0068876_1029568133300005527Freshwater LakeMGRMKEIYTQILECEDCNGQGWLYFGNNENYDVECCNCNPLGFFQENK*
Ga0049080_1030368713300005582Freshwater LenticMVRMKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK*
Ga0078894_1058509643300005662Freshwater LakeHERKTKMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0078894_1125819213300005662Freshwater LakeMGRMKEIYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0073913_1008475933300005940SandMSKMKELYTQIFECDTCYGKGWLYYGDEDNFDVEACQCNPLSFFQENK*
Ga0079301_100584343300006639Deep SubsurfaceMGRMKELYTQILECDTCYGNGWLYYGDENNYDVEACQCNPLSFFQENK*
Ga0075464_1037434833300006805AqueousMGRMKELYTQIFECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0075473_1008562523300006875AqueousMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0102861_110644223300007544EstuarineMKTLEIANCETCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK*
Ga0102877_103737943300007548EstuarineMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0102879_110447623300007549EstuarineMGRMKELYTQILECETCNGQGWQFFGNETDYDVKACECNPLGFFQENK*
Ga0102879_115782723300007549EstuarineMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQE
Ga0102881_114988523300007551EstuarineMGKMKELYTQILECDTCYGTGWLYYGDENNYDVEACQCNPLSFFQENK*
Ga0102919_110265923300007597EstuarineMKELYTQIFECENCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0102897_107884723300007617EstuarineMSKMKELYTQIFECENCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0102902_115938113300007644EstuarineIMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0102908_109072413300007665EstuarineMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPIGF
Ga0105745_105161243300007972Estuary WaterMKELYTQIFECDICLGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0105745_117785533300007972Estuary WaterSKMKELYTQIFECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0114346_125076133300008113Freshwater, PlanktonMGRMKELYTQILECDTCYGKGWLYYGNEEMYDVEACDCNPLGFFQENK*
Ga0114351_136214913300008117Freshwater, PlanktonTKMGRMKELYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK*
Ga0114336_124131943300008261Freshwater, PlanktonYTQIFECDTCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0104241_100305823300008953FreshwaterMGKMKELYTQILECETCWGKGWLYYGDEETYDVCACDCNPLGFFQENK*
Ga0104242_103599123300008962FreshwaterMSKMKELYTQILECEACNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0102831_106601833300008996EstuarineMTKMKELYTQIFECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0114977_1049135243300009158Freshwater LakeMGRMKELYTQILECETCNGKGWVFFGNAKEYDVESCECNPLGFFQENK*
Ga0114975_1003125023300009164Freshwater LakeMKELYTQIFECDTCLGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0114969_1070338713300009181Freshwater LakeTQILECETCNGQGWQFFGNETDYDVEACDCNPHGFFQENK*
Ga0114982_102540843300009419Deep SubsurfaceMKELYTQIFDCETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0153801_100410643300012017FreshwaterMNKMKELYTQILECDTCNGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0157210_101211843300012665FreshwaterMGRMKELYTQILECETCNGKGWVFFGNAKEYDVESCECNPLSFFQENK*
Ga0138284_128375433300012779Freshwater LakeMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACDCNPLGFFQENK*
Ga0164295_1118123433300013014FreshwaterMGRMKELYTQIFECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK*
Ga0177922_1092823813300013372FreshwaterLYTQILECDTCYGNGWLYYGDENNYDVEACDCNPLGFFQENK*
Ga0181364_101561123300017701Freshwater LakeMNKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0211732_104905633300020141FreshwaterMGRMKEIYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK
Ga0211735_1115229513300020162FreshwaterMGRTKEIYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208326_12490923300020494FreshwaterMGRMKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK
Ga0208050_1001141113300020498FreshwaterMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208590_101548343300020501FreshwaterMGRMKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQ
Ga0208858_101963843300020524FreshwaterERKTKMGRMKELYTQILECDTCYGNGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0208723_103795513300020571FreshwaterHERKTKMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0214168_102169113300021140FreshwaterECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0222713_1002687493300021962Estuarine WaterMSKMKELYTQIFECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0222713_1003851413300021962Estuarine WaterMGRMKELYTQILECDTCYGNGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0222713_1019556943300021962Estuarine WaterMGKMKELYTQILECETCWGKGWLYYGDEETYDVCACDCNPLGFFQENK
Ga0222713_1036452153300021962Estuarine WaterTQILECDTCYGNGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0222713_1039034823300021962Estuarine WaterMGKMKELYTQILECETCYGTGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0222712_1052765033300021963Estuarine WaterRKTKMGRMKELYTQILECETCYGNGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0214921_10007126173300023174FreshwaterMKELYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0214921_10016496123300023174FreshwaterMKELYTQIFECDTCLGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0214921_10019147163300023174FreshwaterMTKMKELYTQIFECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0214919_1025310043300023184FreshwaterTQILECETCWGKGWLYYGDEETYDVCACDCNPLGFFQENK
Ga0244777_1069018843300024343EstuarineMGKMKELYTQILECDTCYGTGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0244775_1061325633300024346EstuarineMGRMKELYTQILECETCNGKGWVFFGNAKDYDVESCECNPLGFFQENK
Ga0244776_1008888523300024348EstuarineMKELYTQIFECENCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0244776_1009487963300024348EstuarineMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208147_100070573300025635AqueousMGRMKELYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0208009_103527413300027114Deep SubsurfaceMGRMKEIYTQILECEDCNGQGWLYFGNNENYDVECCNCNPLGFFQENK
Ga0255065_105001953300027142FreshwaterLYTQILECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0208800_100270623300027193EstuarineMGRMKELYTQILECETCNGKGWVFFGNAKEYDVESCECNPLGFFQENK
Ga0208928_102997533300027217EstuarineYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208024_103848823300027222EstuarineMKELYTQIFECENCNGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208172_101776843300027231EstuarineMSKMKELYTQIFECENCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0208168_102997833300027305EstuarineMSKMKELYTQIFECENCNGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0208556_101838963300027366EstuarineMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQ
Ga0208788_1000297303300027499Deep SubsurfaceMKELYTQILECDTCYGNGWLYYGDENNYDVEACQCNPLSFFQENK
Ga0255072_108281113300027508FreshwaterMGRMKELYTQILECETCNGQGWVFFGNETDYDVEACECNPLSFFQENK
Ga0209552_117822823300027563Freshwater LakeMSRMKELYTQILECETCNGKGWVFFGNAKEYDVESCECNPLSFFQENK
Ga0208960_105113423300027649Freshwater LenticMVRMKELYTQILECETCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK
Ga0209599_10002364103300027710Deep SubsurfaceMTKMKELYTQIFDCETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0209617_10001783253300027720Freshwater And SedimentMVRMKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK
Ga0209355_1024617123300027744Freshwater LakeMKELYTQIFECDTCLGQGWQFFGNETDYDVEACECNPLGFFQENK
Ga0209990_1006518713300027816Freshwater LakeSKHERKTKMGRMKELYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK
Ga0209191_105083673300027969Freshwater LakeMSKMKELYTQIFECDTCLGQGWQFFGNETDYDVEACDCNP
Ga0247723_101512843300028025Deep Subsurface SedimentMKELYTQIFDCETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0315907_1032036333300031758FreshwaterMGRMKELYTQILECDTCYGKGWLYYGNEEMYDVEACQCNPLGFFQENK
Ga0315908_1122740333300031786FreshwaterYTQILECDTCYGKGWLYYGNEEMYDVEACQCNPLGFFQENK
Ga0315900_1048657233300031787FreshwaterMGRMKELYTQILECETCNGQGWQFFGNQTDYDVEACECNPLGFFQENK
Ga0315909_1014540563300031857FreshwaterILECDTCYGKGWLYYGNEEMFDVEACQCNPLGFFQENK
Ga0334992_0057842_76_2223300033992FreshwaterMGRMKELYTQILECDTCYGNGWLYYGDENTYDVEACQCNPLSFFQENK
Ga0334998_0029524_3645_37913300034019FreshwaterMGRMKELYTQILECDTCYGNGWLYYGDENNYDVEACDCNPLGFFQENK
Ga0335000_0441866_3_1193300034063FreshwaterILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK
Ga0335030_0834302_372_5363300034103FreshwaterNERKTKMGRMKELYTQILECETCYGNGWLYYGDENTYDVEACQCNPLSFFQENK
Ga0335055_0152541_926_10393300034110FreshwaterLECETCNGQGWQFFGNETDYDVEACDCNPLGFFQENK
Ga0334997_0832758_418_5523300034280FreshwaterKELYTQILECDTCYGKGWLYYGDEDNYDVEACQCNPLSFFQENK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.