NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100693


Overview

Basic Information
Family ID: F100693
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 102
Average Sequence Length: 42 residues
Representative Sequence: YKRGTLHAGINPKGPKKAPLAKSRKQAIAIALSEAGKSKKK
Number of Associated Samples: 90
Number of Associated Scaffolds: 102

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 3.92 %
% of genes near scaffold ends (potentially truncated): 87.25 %
% of genes from short scaffolds (< 2000 bps): 84.31 %
Associated GOLD sequencing projects: 86
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.37

Hidden Markov Model: sequence logo, powered by Skylign

Most Common Taxonomy
Group: Unclassified (45.098 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous (13.725 % of family members)
Environment Ontology (ENVO): Unclassified (37.255 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline) (42.157 % of family members)
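
The percentages in this overview are stated as shares of family members, and the family has 102 sequences. The sketch below converts a reported percentage back into a member count. It assumes every percentage uses all 102 sequences as its denominator; `FAMILY_SIZE` and `members_from_percent` are illustrative names, not part of NMPFamsDB.

```python
# Number of sequences in family F100693, as reported under Basic Information.
FAMILY_SIZE = 102


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into the nearest whole member count.

    Assumption: the percentage was computed over all sequences in the family.
    """
    return round(percent / 100 * family_size)


if __name__ == "__main__":
    reported = {
        "Most common taxonomy group (Unclassified)": 45.098,
        "GOLD ecosystem (Coastal, Aqueous)": 13.725,
        "ENVO (Unclassified)": 37.255,
        "EMPO (Water, non-saline)": 42.157,
    }
    for label, pct in reported.items():
        print(f"{label}: {pct} % ~ {members_from_percent(pct)} of {FAMILY_SIZE}")
```

Under that assumption the four values correspond to 46, 14, 38 and 43 sequences respectively.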




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (interactive display powered by MSAViewer, alignment columns 1 to 60). Rows 1 to 102 are labeled with the protein IDs listed under Family Sequences, in the same order.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 24.64%, β-sheet 0.00%, Coil/Unstructured 75.36%

Feature Viewer (powered by Feature Viewer): tracks over the representative sequence YKRGTLHAGINPKGPKKAPLAKSRKQAIAIALSEAGKSKKK (axis ticks 5 to 40): Sequence, α-helices, β-strands, Coil, SS Conf. score, Disordered Regions.

Predicted 3D Structure

Structure Viewer (powered by PDBe Molstar)
Per-residue confidence (pLDDT) color bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
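
The two confidence measures on this page can be expressed as simple rules: the structure viewer legend groups per-residue pLDDT into four ranges, and the note above treats a model with pTM < 0.7 as low confidence. The sketch below encodes both. The function names are illustrative, and assigning non-integer pLDDT values to the next range up (for example 50.5 to "51-70") is an assumption, since the page only lists integer ranges.

```python
from bisect import bisect_left

# Upper bounds of the first three pLDDT ranges shown in the viewer legend.
_PLDDT_UPPER_BOUNDS = [50, 70, 90]
_PLDDT_LABELS = ["0-50", "51-70", "71-90", "91-100"]

# Threshold quoted in the "Low Quality Model" note.
PTM_LOW_CONFIDENCE_THRESHOLD = 0.7


def plddt_bin(score: float) -> str:
    """Return the legend range that a per-residue pLDDT score falls into."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    return _PLDDT_LABELS[bisect_left(_PLDDT_UPPER_BOUNDS, score)]


def is_low_confidence_model(ptm: float,
                            threshold: float = PTM_LOW_CONFIDENCE_THRESHOLD) -> bool:
    """True when the model's pTM-score is below the low-confidence threshold."""
    return ptm < threshold


if __name__ == "__main__":
    print(plddt_bin(42.0))                # falls in the lowest range
    print(is_low_confidence_model(0.37))  # pTM-score reported for F100693
```

With the reported pTM-score of 0.37, `is_low_confidence_model` returns `True`, which matches the low-quality label on this family.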



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

Visualization: chart of family members by NCBI taxonomy (powered by ApexCharts). Legend entries: All Organisms, Unclassified. Slice labels: 54.9%, 45.1%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization: chart of associated habitat types (powered by ApexCharts).
Distinct legend entries: Freshwater Sediment; Freshwater; Lake; Freshwater Lake; Freshwater Lentic; Freshwater And Sediment; Freshwater Lake Sediment; Freshwater, Plankton; Sediment; Freshwater And Marine; Surface Water; Pond Fresh Water; Aqueous; Freshwater To Marine Saline Gradient; Estuary Water; Estuarine; Wetlands Benthic; Estuarine Water; Hypersaline Lake Sediment; Soil; Deep Subsurface; Estuary.
Slice labels: 5.9%, 13.7%, 8.8%, 3.9%, 5.9%, 5.9%, 8.8%, 13.7%, 3.9%, 3.9%.



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
FwDRAFT_100119675 | 3300000882 | Freshwater And Marine | MGEYKAGTLHAGRNPKGPKKAPMAKSRKQAIAIAMSESGMKKRK*
ML8_100267762 | 3300001335 | Wetlands Benthic | MREYKAGKLKAGVNPKGPKKAPMAKSRRQAVAIALRSAGVPKKKK*
metazooDRAFT_12757042 | 3300002202 | Lake | MREFKSGKLHGGVNPKGPKKAPVVKSKRQALAIALSQAGVKKK*
B570J40625_1008411191 | 3300002835 | Freshwater | MGEYKKGTLHAGVNPKGPAKAPLAKSRKQAIAIAMSEAGKSKKK*
Ga0070374_100482371 | 3300005517 | Freshwater Lake | EYKRGTLHSGKNPKGPKKAPLVKSRKQAVAIALSEAGKSRGKKR*
Ga0075470_100805481 | 3300006030 | Aqueous | MGEFKRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK*
Ga0070749_105010461 | 3300006802 | Aqueous | KRGTLHAGINPKGPKKAPMAKSRKQAIAIALSEAGKSKKRS*
Ga0070749_105537252 | 3300006802 | Aqueous | HSGINPKGPKKAPLAKSRKQALAIAISVTRKKKK*
Ga0075464_110410752 | 3300006805 | Aqueous | HSGKDPKGPKKAPVVKNRKQAITIALSSAGMSKKKAKKK*
Ga0102978_10031171 | 3300007177 | Freshwater Lake | VMGEFKRGTLHAGKDPKGPKKAKIVKSKKQAIAIALSQAGKAKKK*
Ga0075460_101402211 | 3300007234 | Aqueous | LHAGVDPKGPKKAPVVKSRNQAIAIALSQAGKAKKK*
Ga0075458_101900322 | 3300007363 | Aqueous | MGEYKRGTLHAGINPKGPAKAPMAKSRKQAIAIALSEAGMSKKKKK*
Ga0099847_10977441 | 3300007540 | Aqueous | KAGTLHGGIDPKGKKKAPVVKSRKQAIAIALSQAGVAKKKKGH*
Ga0099846_11389583 | 3300007542 | Aqueous | VDPDGKGPKKAPVVKSQKQAIAIALRQAGAPPKGKKK*
Ga0102861_10559093 | 3300007544 | Estuarine | AGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK*
Ga0105746_12839863 | 3300007973 | Estuary Water | TLNAGKDPKGPKKAAVVKNRKQAIAIALSQAGKAKKRAK*
Ga0105748_101556535 | 3300007992 | Estuary Water | GTLNAGKDPKGPKKAPVVKNRKQAIAIALSQAGKAKKRAK*
Ga0108970_101232371 | 3300008055 | Estuary | YKRGKLHAGVNPKGPAKAPMAKSRKQAIAIALSEAGKSKKK*
Ga0114340_11903702 | 3300008107 | Freshwater, Plankton | MKEYKAGTLHAGKNPKGPKKAKIVRSKKQAIAIALSEAGKAKKK*
Ga0114351_10333496 | 3300008117 | Freshwater, Plankton | KVLGEYKRGTLHSGKDPKGPKKAPVVKNRRQAIAIALSSAGKAKKK*
Ga0114841_10887674 | 3300008259 | Freshwater, Plankton | KAGKLKAGINPKGPKKAPMAKSRKQAVAIALSQAGMSKKK*
Ga0114363_10906541 | 3300008266 | Freshwater, Plankton | HSGKDPKGPKKAPVVKSRKQAIAIALSEAGKAKKK*
Ga0114876_11492773 | 3300008448 | Freshwater Lake | YKSGKLHAGINPKGPKKAPMAKNRSQALAIALRSAGVAKKKK*
Ga0105102_108043121 | 3300009165 | Freshwater Sediment | GKLKAGIDPKGPKKAPLAKSRAQAVAIALRSAGVKKKK*
Ga0105104_101738965 | 3300009168 | Freshwater Sediment | AKVMGEYKRGTLHAGINPKGPKKAPMAKSRSQAVAIAMSESGMKRKGK*
Ga0105104_106222722 | 3300009168 | Freshwater Sediment | EYKRGTLKAGINPKGPKKAPMAKSRKQAVAIAMSQAGMKKKK*
Ga0105097_101875311 | 3300009169 | Freshwater Sediment | FKRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK*
Ga0114979_104917221 | 3300009180 | Freshwater Lake | HKVMGEYKDKSLHSGKDGKVVKNRKQAIAIALSESGAAKKKK*
Ga0136655_11017103 | 3300010316 | Freshwater To Marine Saline Gradient | AGVDPDGKGPKKAPVVKSQKQAIAIALRQAGAPPKGKKK*
Ga0136644_104893391 | 3300010334 | Freshwater Lake | LHAGVNPKGPKKAPIVKSRKQAVAIALSQAGISKRK*
Ga0129333_107518661 | 3300010354 | Freshwater To Marine Saline Gradient | YKAGTLHSGVDPKGPKKAPIVKSRKQAIAIALSEAGKAKKK*
Ga0129336_102681252 | 3300010370 | Freshwater To Marine Saline Gradient | GKLHAGVNPKGPKKAPLAKSRKQAIAIALSEAGMSKKK*
Ga0136551_10000431 | 3300010388 | Pond Fresh Water | MGEYKRGTLHAGVNPKGPAKAPMAKSRKQAIAIALSEAGKSKKK*
Ga0133913_128889683 | 3300010885 | Freshwater Lake | GKVMKEYGAGKLHGGINPKGPKKAPIVKNRKQAVAIAMSMAGMKKKKSK*
Ga0137675_10223551 | 3300010966 | Pond Fresh Water | RGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK*
Ga0138284_12912711 | 3300012779 | Freshwater Lake | SGTLHAGRNPKGPKKAPIVKSRKQAIAISLSMAGMQKKNRQK*
Ga0164292_104346151 | 3300013005 | Freshwater | EYKRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK*
Ga0129327_103707731 | 3300013010 | Freshwater To Marine Saline Gradient | GGIDPKGKKKAPVVKSRKQAIAIALSQAGVAKKKKGH*
Ga0170791_135326931 | 3300013295 | Freshwater | GRNPKGPKKAPIVKSRKQAIAISLSMAGMQKKNRQK*
Ga0177922_103450531 | 3300013372 | Freshwater | GEYKAGTLHAGVNPKGPKKAPLAKSRKQAVAIALSQAGMTKRK*
Ga0134315_10211521 | 3300014962 | Surface Water | AGVDPKGPKKAPMAKSRKQAIAIAMRSAGMPKKKK*
Ga0181363_10597401 | 3300017707 | Freshwater Lake | LHGGVDPKGPKKAPIVKSRKQAIAIAMSEAGMSKKKK
Ga0181347_11867303 | 3300017722 | Freshwater Lake | YKRGTLHSGKDPKGPKKAPLVKSRKQAVAIALSEAGKSRGKKR
Ga0181356_12414111 | 3300017761 | Freshwater Lake | QAKIAKVLGEFKDKKLHSGVDPKGPKKARVVKSRKQTIAIALSEAGKLKGKK
Ga0181343_11395174 | 3300017766 | Freshwater Lake | AGINPKGPKKAPLAKSRKQAVAIALSQAGMSKKKK
Ga0181357_10213098 | 3300017777 | Freshwater Lake | EYKRGTLHGGIDPKGPKKAPVVTSRKQAVAIALSQAGKAKKGKM
Ga0181357_12765461 | 3300017777 | Freshwater Lake | VLGEFKDKKLHSGIDPKGPKKARVVKSRKQAIAIALSEAGKLRGKK
Ga0181357_12826181 | 3300017777 | Freshwater Lake | EFKTGKLHGGIDPKGPKKAMLVKNPKQAIAIALVQAAAMKKKKGK
Ga0181355_13173133 | 3300017785 | Freshwater Lake | KRGTLHGGIDPKGPKKAPVVKSRKQAIAIALSEAGKSRKTK
Ga0181355_13235233 | 3300017785 | Freshwater Lake | KRGTLHSGKDPKGPKKAAVVKNRKQAVAIALSVAGKSKKKGK
Ga0180433_110759573 | 3300018080 | Hypersaline Lake Sediment | LRAGKDPKGPKKAPKAKSRKQAIAIALSEAGMSKPEKRAKGG
Ga0211726_106644064 | 3300020161 | Freshwater | HAGRDPKGPKKAPVVKSRKQAIAIALSEAGKSKKK
Ga0211726_109113483 | 3300020161 | Freshwater | YKRGTLHAGINPKGPKKAPLAKSRKQAIAIALSEAGKSKKK
Ga0211731_109254694 | 3300020205 | Freshwater | AKVMGEYKAGTLHAGVNPKGPRKAPLAKSRAQATAIAMSQAGMSKRK
Ga0210339_14087241 | 3300021332 | Estuarine | VMGEFKRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK
Ga0222712_106186383 | 3300021963 | Estuarine Water | MGEYKRGTLHAGVNPKGPKKAPLAKSRKQALAIAMSEAGMKKKK
Ga0196903_10300454 | 3300022169 | Aqueous | HGGIDPKGPKKAPLVKNPKQAIAIALSQANAKKKKK
Ga0181354_11850902 | 3300022190 | Freshwater Lake | GTLHAGVNPKGPKKAPLAKNRKQAVAIAMSEAGIKKRK
Ga0214923_100088212 | 3300023179 | Freshwater | MSEFKKGTLHAGKDPDGKGPKKAPIVKSRKQAIAIALSEQAKSKPKGGKK
Ga0244777_1000032543 | 3300024343 | Estuarine | MGEYKSGTLHAGRNPKGSKKAPLAQSRKQAIAIAMSAMGMKKK
Ga0244775_112501603 | 3300024346 | Estuarine | TLSAGMNPKGPKKAPMAKSRKQAVAIAMSEAGMAKKGKKK
Ga0255171_10312373 | 3300024354 | Freshwater | KLHAGVDPKGPKKAPMAKSRKQAIAIALSEAGKSKKK
Ga0208147_10409445 | 3300025635 | Aqueous | KAGTLHSGKDPKGPKKAPVVKSRKQAIAIALSSAGMAKKKKK
Ga0208147_11222973 | 3300025635 | Aqueous | EFKMGTLHGGRNPKGPKKAPIVKSRKQAIAIALSEAGKSRKRR
Ga0208784_10529011 | 3300025732 | Aqueous | KGTLHGGINPKGPKKAPVVKSQKQAIAIALSSAGISKKKGKK
Ga0208644_11213733 | 3300025889 | Aqueous | VMHEWKTGKLHAGVDPDGKGPKKAPVVKSQKQAIAIALRQAGAPPKGKKK
Ga0208644_12875603 | 3300025889 | Aqueous | TLHAGRDPKGPKKAPIVKSRKQAIAIALSEAGKARKK
Ga0208177_10684273 | 3300027254 | Estuarine | VSKVMGEYKSGTLHAGRNPKGPKKAPLARSRKQAIAIAMSEAGMKKRK
Ga0208974_11750512 | 3300027608 | Freshwater Lentic | KMSKVMGEYKMGTLKAGVNPKGPAKAPMAKSRKQAVAIAMSEAGKMKRK
Ga0209599_100074031 | 3300027710 | Deep Subsurface | FKAGELHAGKDPKGPKKAPIVKNRKQAIAIALSEAGASKGKKK
Ga0209492_10755751 | 3300027721 | Freshwater Sediment | EYKAGTLHAGVNPKGPKKAPLAKNRKQAVAIAMSEAGIKKRK
Ga0209296_12506174 | 3300027759 | Freshwater Lake | EYKRGTLKAGVNPKGPKKAPMAKSRKQAVAIAMSQAGMSKKK
Ga0209770_100841661 | 3300027769 | Freshwater Lake | LHAGVNPKGPKKAPMAKSRKQAIAIAMSEAGMKRKK
Ga0209107_103146653 | 3300027797 | Freshwater And Sediment | MGEYKRGTLHAGINPKGPKKAPLAKSRAQAVAIAMSEAGMKKKKK
Ga0209229_105034342 | 3300027805 | Freshwater And Sediment | SGVDPKGPKKARIVKSRKQAIAIALSEAGKSIKKVKGK
Ga0209985_101821191 | 3300027806 | Freshwater Lake | SGKNPKGPKKAPIVKNRKQAVAIALSQAGMSKKKKKK
Ga0209354_103607231 | 3300027808 | Freshwater Lake | GTLNAGVNPKGPAKAKKAGSREQAIAIALSEAGMSKKKK
Ga0209253_106161621 | 3300027900 | Freshwater Lake Sediment | MGEYKRGTLHAGINPKGPKKAPLAKSRKQAVAIALSVAGKSKKK
Ga0209820_11613314 | 3300027956 | Freshwater Sediment | LHAGINPKGPKKAPMAKSRAQAVAIAMSESGMKRKGK
Ga0209298_101157385 | 3300027973 | Freshwater Lake | EYKAGTLHGGVNPKGPKKAPIVKSRKQAIAIALSEAGKSIKKAKGK
Ga0255180_10027255 | 3300028073 | Freshwater | GEYKRGTLHAGVNPKGPKKAPMAGSRKQAIAIAMSEAGKMKKK
Ga0238435_1065981 | 3300029349 | Freshwater | MKEYKAGTLHSGVDPKGPKKAKVVTSRNQAIAIALSEANKSKKKAKGK
Ga0307375_107942811 | 3300031669 | Soil | AKVMGEYKRGTLHAGINPKGPKKAPLAKSRAQAVAIAMSEAGMKKKKK
Ga0307377_103686133 | 3300031673 | Soil | RGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGMSKKMKKK
Ga0315907_100542022 | 3300031758 | Freshwater | MKEYKAGTLHAGKNPKGPKKAKIVRSKKQAIAIALSEAGKAKKK
Ga0315907_101261005 | 3300031758 | Freshwater | GKLKAGINPKGPKKAPMAKSRKQAVAIALSQAGMSKKK
Ga0315909_103645455 | 3300031857 | Freshwater | TLHSGKDPKGPKKAPVVKNRKQAIAIALSSAGMSKKKAKKK
Ga0315904_109744663 | 3300031951 | Freshwater | LHGGVDPAGPKKAPVVKSRKQAIAIALSQAGKARKK
Ga0315901_102096081 | 3300031963 | Freshwater | KLKAGINPKGPKKAPMAKSRKQAVAIALSQAGMSKKK
Ga0315901_102673203 | 3300031963 | Freshwater | MGEYKAGTLHAGVNPKGPKKAPLAKSRKQAVAIALSQAGMTKRK
Ga0315901_107890621 | 3300031963 | Freshwater | GTLHGGVDPAGPKKAPVVKSRKQAIAIALSQAGKARKK
Ga0315901_108405121 | 3300031963 | Freshwater | LHAGRDPKGPKKAPVVKNRKQAVAIALSQAGMAKKRGKKK
Ga0315274_114415573 | 3300031999 | Sediment | TLHSGQDPKGPRKAPVVKSRKQAVAIAMSQAGMSKKRK
Ga0315906_104458844 | 3300032050 | Freshwater | KVMGEFKRGTLHAGIDPKGPKKAKIVKNKKQAIAIALSQAGKAKKK
Ga0315284_120393392 | 3300032053 | Sediment | GKIMSEFKAGVLHSGINPKGPKKAPLAKSRKQAIAIALSVTGKAKKTTKKK
Ga0334982_0501830_406_534 | 3300033981 | Freshwater | KEGKLHSGKDPKGPKKAPKVKSSKQAIAIALSEAGVAKKKGK
Ga0334986_0121731_14_151 | 3300034012 | Freshwater | MKEFKSGTLHSGKNPKGPKKAKVVTNRKQAIAIAMSEAGMKRKKK
Ga0334985_0779218_372_506 | 3300034018 | Freshwater | MGEFKRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK
Ga0334987_0672071_469_597 | 3300034061 | Freshwater | KSGTLHSGKDPKGPKKAPVVKSKKQAVAIAMSEAGMSKKGKK
Ga0334995_0581376_505_639 | 3300034062 | Freshwater | MGEYKRGTLHGGVDPAGPKKAPVVKSRKQAIAIALSQAGKAKKK
Ga0335010_0039631_3330_3470 | 3300034092 | Freshwater | MGEFKRGTLHSGKDPKGPKKAAVVKNRKQAVAIALSVAGKSKKKGK
Ga0335033_0160978_1118_1240 | 3300034117 | Freshwater | KRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK
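
The last seven protein IDs in this list carry two extra numeric fields, for example `Ga0334982_0501830_406_534`. The sketch below assumes these encode `<scaffold>_<start>_<end>` in nucleotide coordinates. The page does not state this; it is an inference from the fact that, for the rows used here, the span divided by three equals the listed residue count plus one (possibly a stop codon). `parse_span_id` and `codon_count` are hypothetical helper names.

```python
import re
from typing import NamedTuple, Optional


class GeneSpan(NamedTuple):
    scaffold: str
    start: int  # assumed 1-based nucleotide start on the scaffold
    end: int    # assumed inclusive nucleotide end


# Assumed ID layout: "Ga<digits>_<digits>_<start>_<end>".
_SPAN_ID = re.compile(r"^(Ga\d+_\d+)_(\d+)_(\d+)$")


def parse_span_id(protein_id: str) -> Optional[GeneSpan]:
    """Split a coordinate-bearing protein ID; return None for other ID styles."""
    match = _SPAN_ID.match(protein_id)
    if match is None:
        return None
    return GeneSpan(match.group(1), int(match.group(2)), int(match.group(3)))


def codon_count(span: GeneSpan) -> int:
    """Number of whole codons covered by the inclusive nucleotide span."""
    return (span.end - span.start + 1) // 3


if __name__ == "__main__":
    # Three rows copied from the Family Sequences list above.
    rows = {
        "Ga0334982_0501830_406_534": "KEGKLHSGKDPKGPKKAPKVKSSKQAIAIALSEAGVAKKKGK",
        "Ga0334986_0121731_14_151": "MKEFKSGTLHSGKNPKGPKKAKVVTNRKQAIAIAMSEAGMKRKKK",
        "Ga0335033_0160978_1118_1240": "KRGTLHAGVNPKGPAKAPLAKSRKQAIAIALSEAGKSKKK",
    }
    for protein_id, sequence in rows.items():
        span = parse_span_id(protein_id)
        print(protein_id, span, codon_count(span), len(sequence))
```

Older-style IDs such as `Ga0070374_100482371` do not match the assumed pattern, so `parse_span_id` returns `None` for them instead of guessing coordinates.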




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice