NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F084170




Overview

Basic Information
Family ID F084170
Family Type Metagenome
Number of Sequences 112
Average Sequence Length 43 residues
Representative Sequence MFAKWTDDMVREYFDTHWNATLHEICALSGRTKADVKRVLMGG
Number of Associated Samples 87
Number of Associated Scaffolds 112

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 89.38 %
% of genes near scaffold ends (potentially truncated) 22.32 %
% of genes from short scaffolds (< 2000 bps) 75.00 %
Associated GOLD sequencing projects 68
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70


Most Common Taxonomy
Group Unclassified (41.071 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous
(43.750 % of family members)
Environment Ontology (ENVO) Unclassified
(61.607 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline)
(78.571 % of family members)
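
The percentages above can be turned back into member counts with simple arithmetic. The sketch below assumes that each percentage is taken over the 112 sequences listed under Basic Information; the page does not state the denominator, so treat this as an assumption. The dictionary labels and the helper name `members_from_percent` are illustrative only.

```python
# Assumption: percentages are computed over the family's 112 sequences
# ("Number of Sequences" in Basic Information).
FAMILY_SIZE = 112

# Percentages as reported in the Most Common Taxonomy / Ecosystem blocks.
reported = {
    "Taxonomy: Unclassified": 41.071,
    "GOLD: Aqueous": 43.750,
    "ENVO: Unclassified": 61.607,
    "EMPO: Water (saline)": 78.571,
}

def members_from_percent(percent, total=FAMILY_SIZE):
    """Convert a reported percentage into a whole number of family members."""
    return round(percent / 100 * total)

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    # Recompute the percentage from the count as a consistency check.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.3f} %)")
```

Under that assumption, each recomputed percentage matches the reported value to three decimal places, which suggests the figures are consistent with whole counts out of 112.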




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.03%, β-sheet: 0.00%, Coil/Unstructured: 61.97%

Predicted 3D Structure

pTM-score: 0.70

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic group: share of family members
All Organisms: 58.9%
Unclassified: 41.1%

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Marine, Deep Ocean, Surface Seawater, Marine Sediment, Deep Subsurface, Seawater, Microbial Mat, Aqueous, Freshwater To Marine Saline Gradient, Estuarine, Salt Marsh, Estuarine Water, Pelagic Marine, Marine Water, Sediment, Saline Lake, Pond Water, Hypersaline Water, Pond Soil, Macroalgal Surface.
Labeled slice percentages: 4.5%, 43.7%, 3.6%, 5.4%, 6.2%, 3.6%, 4.5%. The 43.7% slice corresponds to Aqueous, the most common GOLD ecosystem reported in the Overview; the remaining labels cannot be matched to habitats from the chart text alone.



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
BBAY94_100395596 | 3300000949 | Macroalgal Surface | MKWTDDMIREYFDSHWSATLHEICALSGRTKKEVK
BBAY94_101156721 | 3300000949 | Macroalgal Surface | MKAWTDEMVKEYFDTHWDVTIHQLCALSMRSKDDIKRILMER*
JGI26082J51739_101087601 | 3300003617 | Marine | MFAKWTDDMVRDYFDTRWNATLHEICALSGRTKADVKRVLMGG*
JGI26085J52751_10241452 | 3300003908 | Marine | MTFAKWTDDMVRDYFDTHWNATLHEICALSGRTKADLKRVLMEG*
Ga0068515_1185002 | 3300004829 | Marine Water | MTFAKWSDDMIREYFDTHWNATIHEISALSGRTKADIKRVLMGES*
Ga0068515_1210932 | 3300004829 | Marine Water | MALEDETMKPWTDDMVREYFDTHWDVTIHQICALSMRSKADVKRILMEG*
Ga0070728_100387726 | 3300005588 | Marine Sediment | MKWSDDMIREYFDTHWNATLHEICALSGRTKAEVKRALMRESA*
Ga0070727_100963224 | 3300005590 | Marine Sediment | MKWSDDMIREYFDTHWNATLHEICALSGRTRADVKRALMRGNA*
Ga0076924_10660434 | 3300005747 | Marine | MLIWTDDMVREYFDTHLNATLHEICALSGRNKADVKRVLMEE*
Ga0070743_101004524 | 3300005941 | Estuarine | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRSKADVKQVLMEG*
Ga0075474_100103923 | 3300006025 | Aqueous | MFAKWTDEMIREYFDSHWDATIHEVCALSGRTKAEVKRALMQGSKT*
Ga0075474_100417124 | 3300006025 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRVLMEQE*
Ga0075474_100871545 | 3300006025 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRILMEQE*
Ga0075474_102284643 | 3300006025 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRIL
Ga0075462_100827382 | 3300006027 | Aqueous | MLIWTDDMVREYFDTHLNATLHEICALSGRNRADVKRVLMEE*
Ga0075466_11303632 | 3300006029 | Aqueous | MTMATWTDDMIIEYFDTHINATIHEICALSGRRRADVKRVLMGG*
Ga0075466_11671142 | 3300006029 | Aqueous | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKSDVKRALMEG*
Ga0075461_100835561 | 3300006637 | Aqueous | MFAKWTDEMIREYFDSHWDATIHEVCALSGRTKAEVK
Ga0070749_100178793 | 3300006802 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMEG*
Ga0070749_101784024 | 3300006802 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVQ
Ga0070749_104047472 | 3300006802 | Aqueous | MKAWTDEMVKEYFDTHWDVTIHQLCALSMRSKADVKRILMER*
Ga0070749_104900512 | 3300006802 | Aqueous | MAKWTDDMVREYFDTHWNATIHQICALSGRSKADVKRVLMEG*
Ga0070754_101458943 | 3300006810 | Aqueous | MFAKWTDDMIREYFDSHWDATIHEVCALSGRTKAEVKRALMQGSKT*
Ga0070754_101606982 | 3300006810 | Aqueous | MFAKWTDDMIREYFDTHLNATIHEICALSGRTKAEVKRALMQGSKT*
Ga0070754_102667692 | 3300006810 | Aqueous | MFAKWTDEMIREYFDSHWNATIHEICALSGRTKADVKRALMVQSK*
Ga0070754_103299701 | 3300006810 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMGG*
Ga0075481_101809222 | 3300006868 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRILMEQG*
Ga0080109_1052322 | 3300007023 | Hypersaline Water | MSHWTDDMIREYFDTHWNATIHEICALSGRSKAEVKRALMQK*
Ga0070745_11122033 | 3300007344 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMGD*
Ga0070752_10049451 | 3300007345 | Aqueous | NVMFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMGG*
Ga0102934_10568382 | 3300007537 | Pond Soil | MNRWTDDMIREYFDTHWNATIHEICALSGRTKAEVKRALMQGAG*
Ga0099851_10446405 | 3300007538 | Aqueous | MKKWTDDMVREYFDTHWNVTIHTLCALSGRSKADVKAILLEA*
Ga0099851_11070114 | 3300007538 | Aqueous | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKTDVKSVLMEGTDT*
Ga0099851_13580672 | 3300007538 | Aqueous | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKLDVKSVLMEGADT*
Ga0099848_10413484 | 3300007541 | Aqueous | MFAKWNDDMIREYFDTHWNATIHEICALSGRTKADVKRALMVQSK*
Ga0099848_13128661 | 3300007541 | Aqueous | MTFVKWTDDMVRDYFDTHWNATLHEICALSGRTKADVKRV
Ga0099848_13408502 | 3300007541 | Aqueous | MTFAKWTDDMVRDYFDTHWNATLHEICALSGRTKADVKRVLMEG*
Ga0099846_10159992 | 3300007542 | Aqueous | MNIWTDDMIREYFDSNWNVTIHQICALSGRTKADVKAVLMGGK*
Ga0102945_10116123 | 3300007609 | Pond Water | MTFAKWTDDMVREYFDTHWNATLHEICALSGRTKVDIKRVLMEG*
Ga0075480_105207733 | 3300008012 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRILM
Ga0102831_12224622 | 3300008996 | Estuarine | MKVWTDEMVREYFDTHWNATLHDICALSGRTKADVKRVLMGG*
Ga0102814_101489612 | 3300009079 | Estuarine | MFAKWTDDMIIEYFDTRPFCKWEATIHEICALSGRARTDVKRVLMKG*
Ga0102815_100354563 | 3300009080 | Estuarine | MPFAKWTDDMIIEYFDTRPFCKWEATIHEICALSGRARTDVKRVLMKG*
Ga0114918_105994661 | 3300009149 | Deep Subsurface | MAHWTDEMIREYFDTHWNATIHEICALSGRTRAEVKRV
Ga0114915_10294875 | 3300009428 | Deep Ocean | MFAKWTDDMVRDYFDTHWNATLHEISALSGRTKTDVKLVLMKG*
Ga0115562_10537114 | 3300009434 | Pelagic Marine | MLVWTDDMVREYFDTHLNATLHEICALSGRNKADVKRVLMEE*
Ga0115559_10988372 | 3300009438 | Pelagic Marine | MTFAKWTDDMVREYFDTHWNATLHEICALSGRTKADLKRVLMEG*
Ga0129324_101802783 | 3300010368 | Freshwater To Marine Saline Gradient | LIWTDDMVREYFDTHLNATLHEICALSGRNKADVKRVLMEE*
Ga0129324_104109621 | 3300010368 | Freshwater To Marine Saline Gradient | DMVREYFDTHWNATLHEICALSGRSKIDVKRALMEG*
Ga0136587_10562942 | 3300011188 | Saline Lake | MARWTDDMIKESFDTNWNATVHQVSALSGRNKADVKRILMGGN*
Ga0136587_11946561 | 3300011188 | Saline Lake | VREYFDTNWDATVHQISALSGRNKADIKRILMGGK*
Ga0136560_10694251 | 3300012269 | Saline Lake | RWSDDMVREYFDTNWDATVHQISALSGRNKADIKRILMGGK*
Ga0160423_104854593 | 3300012920 | Surface Seawater | MKYRLIWTDEMVKEYFDSHLNTTIHEICALSGRTKDEVKNILMEKV*
Ga0129327_101121851 | 3300013010 | Freshwater To Marine Saline Gradient | MLIWTDDMVREYFDTHLNATLHEICALSGRNKADVKRV
Ga0181565_103631761 | 3300017818 | Salt Marsh | EMVREYFDTHWNATIHEICALSGRTKADVKRVLMEQE
Ga0181577_101263254 | 3300017951 | Salt Marsh | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRVLMEQE
Ga0181560_101093773 | 3300018413 | Salt Marsh | MKAWTDEMVKEYFDTHWDVTIHQLCALSMRSKADVKRILMER
Ga0181559_101432663 | 3300018415 | Salt Marsh | MKAWTDEMVKEYFDTHWDVTIHQLCALSMRSKDDIKRILMER
Ga0181563_1001213212 | 3300018420 | Salt Marsh | MFAKWTDDMVREYFDTHLNATLHEICALSGRSKIDVKRALMEG
Ga0181563_100434794 | 3300018420 | Salt Marsh | MKKWTDQMICEYFDTHWYATIHTICALSGRTKSDVKRVLMGGK
Ga0181591_108696494 | 3300018424 | Salt Marsh | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRVLM
Ga0206677_102583774 | 3300021085 | Seawater | MEIWTDNMVREYFDTHWNVTIHELCTLSGLKKSDVKRALMKG
Ga0213861_100663163 | 3300021378 | Seawater | MLKQWTDDMVKEYFDTHWGATIHHICALSGRKRADVVAILLGGK
Ga0222718_100403843 | 3300021958 | Estuarine Water | MTLAKWTDDMVRDYFDTHWNATLHEICALSGRTKADVKRVLMEG
Ga0222718_100537212 | 3300021958 | Estuarine Water | MVKWTDDMVREYFDTHWNATLHEVCAYSGRTKADVKRVLMEG
Ga0222715_105704571 | 3300021960 | Estuarine Water | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVK
Ga0222719_106540672 | 3300021964 | Estuarine Water | MNKIEFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMGK
Ga0196883_10373172 | 3300022050 | Aqueous | MFAKWTDEMIREYFDSHWDATIHEVCALSGRTKAEVKRALMQGSKT
Ga0212023_10052304 | 3300022061 | Aqueous | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKSDVKRALMEG
Ga0196903_10245002 | 3300022169 | Aqueous | MLIWTDDMVREYFDTHLNATLHEICALSGRNKADVKRVLMEE
Ga0212031_10192243 | 3300022176 | Aqueous | MTFAKWTDDMVRDYFDTHWNATLHEICALSGRTKADVKRVLMEG
Ga0196891_100201013 | 3300022183 | Aqueous | MLIWTDDMVREYFDTHLNATLHEICALSGRNRADVKRVLMEE
Ga0196905_10233512 | 3300022198 | Aqueous | MFAKWTDDMVREYFDTHWNATLHEICALSGRTKADVKRVLMGG
Ga0196901_10209878 | 3300022200 | Aqueous | MNIWTDDMIREYFDSNWNVTIHQICALSGRTKADVKAVLMGGK
Ga0196901_10907103 | 3300022200 | Aqueous | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKTDVKSVLMEGTDT
Ga0196901_11369282 | 3300022200 | Aqueous | MKKWTDDMVREYFDTHWNVTIHTLCALSGRSKADVKAILLEA
Ga0224504_100563623 | 3300022308 | Sediment | MFAKWTDDMVRDYFDTHWNATLHEICALSGRTKADVKRVLMEG
Ga0228670_10267105 | 3300024319 | Seawater | MFAKWTDDMVRDYFDTHWNATLHEISALSGRTKADVKRVLMEG
Ga0244775_100461006 | 3300024346 | Estuarine | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRSKADVKQVLMEG
Ga0244775_102932683 | 3300024346 | Estuarine | MKVWTDEMVREYFDTHWNATLHDICALSGRTKADVKRVLMGG
(restricted) Ga0255047_1001458112 | 3300024520 | Seawater | MKWTDDMLKEYFDTHWNATIHEICALSGRTKADIKRILMGGK
Ga0208814_10153204 | 3300025276 | Deep Ocean | MFAKWTDDMVRDYFDTHWNATLHEICALSGRTKTDVKLVLMKG
Ga0208303_10015117 | 3300025543 | Aqueous | MMKWSDEIVKEYFDTHWNVTIHQLCALSGRTKVDVKRILMESK
Ga0208149_10190603 | 3300025610 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRILMEQE
Ga0208149_11174191 | 3300025610 | Aqueous | TRARHNKMKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRVLMEQE
Ga0209405_11132062 | 3300025620 | Pelagic Marine | MLVWTDDMVREYFDTHLNATLHEICALSGRNKADVKRVLMEE
Ga0208643_10715093 | 3300025645 | Aqueous | MTMATWTDDMIIEYFDTHINATIHEICALSGRRRADVKRVLMGG
Ga0208134_11665641 | 3300025652 | Aqueous | DDMVREYFDTHWNATLHEICALSGRSKVDVKRALMEG
Ga0208428_11091523 | 3300025653 | Aqueous | MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRILMEQG
Ga0208898_10084661 | 3300025671 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMGG
Ga0208019_11986552 | 3300025687 | Aqueous | MFAKWTDDMVREYFDTHWNATLHEICALSGRTKADVKRALMGG
Ga0208899_10231951 | 3300025759 | Aqueous | MFAKWTDEMIREYFDSHWDATIHEVCALSGRTKAEVKRALMQG
Ga0208899_12160803 | 3300025759 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLLEG
Ga0209137_10359885 | 3300025767 | Marine | MFAKWTDDMVRDYFDTRWNATLHEICALSGRTKADVKRVLMGG
Ga0208645_10852852 | 3300025853 | Aqueous | MFAKWTDEMIREYFDSHWNATIHEICALSGRTKADVKRALMVQSK
Ga0208644_10408091 | 3300025889 | Aqueous | MFAKWTDEMVREYFDTHWNATLHEICALSGRTKADVKRVLMEG
Ga0209953_10265922 | 3300026097 | Pond Water | MTFAKWTDDMVREYFDTHWNATLHEICALSGRTKVDIKRVLMEG
Ga0209925_12423931 | 3300026197 | Pond Soil | MNRWTDDMIREYFDTHWNATIHEICALSGRTKAEVKRALMQGAG
Ga0208166_10302271 | 3300027215 | Estuarine | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKSDVKSV
Ga0208167_10573773 | 3300027219 | Estuarine | MFAKWTDDMVRDYFDTHWNATLHEVCALSGRNKSD
Ga0209037_10086556 | 3300027612 | Marine | MTFAKWTDDMVRDYFDTHWNATLHEICALSGRTKADLKRVLMEG
Ga0208305_101196222 | 3300027753 | Estuarine | MFAKWTDDMIIEYFDTRPFCKWEATIHEICALSGRARTDVKRVLMKG
Ga0209379_100324113 | 3300027758 | Marine Sediment | MKWSDDMIREYFDTHWNATLHEICALSGRTRADVKRA
Ga0209692_100070586 | 3300027828 | Marine Sediment | MKWSDDMIREYFDTHWNATLHEICALSGRTKAEVKRALMRESA
Ga0209271_102963722 | 3300027845 | Marine Sediment | MKWSDDMIREYFDTHWNATLHEICALSGRTRADVKRAL
Ga0209536_1021393292 | 3300027917 | Marine Sediment | MFAKWTDDMIREYFDSHWDATIHEICALSGRTKAEVKRALMQGSKT
Ga0306872_1177032 | 3300028361 | Saline Lake | MARWTDDMIKESFDTNWNATVHQVSALSGRNKADVKRILMGGN
Ga0306871_10610681 | 3300028370 | Saline Lake | MVREYFDTNWDATVHQISALSGRNKADIKRILMGGK
Ga0307996_10016032 | 3300031589 | Marine | MFAKWKDDMVRDYFDTHWNATLHEICALSGRTKADVKRVLMKG
Ga0302130_100339312 | 3300031700 | Marine | MKKWTDDMVRDYFDAHLNTTLHELGAYSGRTRADLKRILMAPQKPRD
Ga0316202_101231064 | 3300032277 | Microbial Mat | MNMATWTDDMIIEYFDTHINATIHEICALSGRRRADVKRVLMGG
Ga0348335_117589_93_233 | 3300034374 | Aqueous | MFAKWTDDMIREYFDTHLNATIHEICALSGRTKAEVKRALMQGSKT
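
As a rough illustration of how the sequence rows can be summarized, the following sketch tallies habitats and measures sequence lengths for a handful of rows copied from this family. It is not part of NMPFamsDB; the tuple layout and the function names `residue_length` and `habitat_shares` are hypothetical, and the handling of the trailing `*` is an assumption (it is treated as a stop marker rather than a residue).

```python
from collections import Counter

# Sample rows from this family: (protein ID, sample taxon ID, habitat, sequence).
ROWS = [
    ("BBAY94_100395596", "3300000949", "Macroalgal Surface",
     "MKWTDDMIREYFDSHWSATLHEICALSGRTKKEVK"),
    ("JGI26082J51739_101087601", "3300003617", "Marine",
     "MFAKWTDDMVRDYFDTRWNATLHEICALSGRTKADVKRVLMGG*"),
    ("Ga0075474_100103923", "3300006025", "Aqueous",
     "MFAKWTDEMIREYFDSHWDATIHEVCALSGRTKAEVKRALMQGSKT*"),
    ("Ga0075474_100417124", "3300006025", "Aqueous",
     "MKRWTDEMVREYFDTHWNATIHEICALSGRTKADVKRVLMEQE*"),
]

def residue_length(sequence):
    """Length in residues. Assumption: a trailing '*' is a stop marker, not a residue."""
    return len(sequence.rstrip("*"))

def habitat_shares(rows):
    """Return the percentage of rows per habitat."""
    counts = Counter(habitat for _, _, habitat, _ in rows)
    total = sum(counts.values())
    return {habitat: 100.0 * n / total for habitat, n in counts.items()}

shares = habitat_shares(ROWS)
mean_length = sum(residue_length(seq) for *_, seq in ROWS) / len(ROWS)

for habitat, pct in sorted(shares.items()):
    print(f"{habitat}: {pct:.1f} %")
print(f"mean length: {mean_length:.1f} residues")
```

Running the same tally over all 112 rows would give a per-habitat breakdown comparable to the chart under Environmental Properties, provided the chart uses the same habitat labels.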




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice