NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100553


Overview

Basic Information
Family ID F100553
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 44 residues
Representative Sequence MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLR
Number of Associated Samples 92
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 64.71 %
% of genes near scaffold ends (potentially truncated) 99.02 %
% of genes from short scaffolds (< 2000 bps) 93.14 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63
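
The percentages above are consistent with whole-number counts out of the 102 family sequences. The following is a minimal Python sketch of that arithmetic, assuming each percentage is a fraction of the 102 members (the page does not state the denominators). The dictionary keys and function names are labels chosen here for illustration, not NMPFamsDB field names.

```python
FAMILY_SIZE = 102  # "Number of Sequences" reported in the overview

# Percentages as reported on this page; the keys are illustrative labels.
reported_percent = {
    "valid_rbs_motif": 64.71,
    "near_scaffold_end": 99.02,
    "short_scaffold": 93.14,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole-number count closest to `percent` % of `total`."""
    return round(percent / 100 * total)


def as_percent(count: int, total: int = FAMILY_SIZE) -> float:
    """Convert a count back to a percentage rounded to two decimals."""
    return round(100 * count / total, 2)


for label, pct in reported_percent.items():
    n = implied_count(pct)
    # Round-trip check: the implied count should reproduce the reported value.
    print(f"{label}: {n}/{FAMILY_SIZE} = {as_percent(n):.2f} % (reported {pct:.2f} %)")
```

Under that assumption, 99.02 % near scaffold ends corresponds to 101 of the 102 genes, which fits the many truncated sequences listed under Family Sequences.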

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[HMM logo, rendered with Skylign]

Most Common Taxonomy
Group Bacteria (74.510 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(15.686 % of family members)
Environment Ontology (ENVO) Unclassified
(27.451 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.078 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (102 sequences, in the same order as the entries under Family Sequences).
[Interactive alignment view, rendered with MSAViewer]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 39.19%, β-sheet: 0.00%, Coil/Unstructured: 60.81%
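
A secondary-structure distribution like the one above can be derived from per-residue labels. The sketch below assumes a predictor that emits one character per position (`H` for α-helix, `E` for β-strand, anything else counted as coil); the page does not say which tool or encoding NMPFamsDB uses, and the example label string is made up rather than the actual prediction for F100553.

```python
from collections import Counter


def ss_distribution(labels: str) -> dict:
    """Return percentages of helix, strand, and coil in a label string.

    Assumed encoding: 'H' = alpha-helix, 'E' = beta-strand,
    every other character = coil/unstructured.
    """
    if not labels:
        raise ValueError("label string must not be empty")
    counts = Counter(labels)
    total = len(labels)
    helix = counts["H"]
    strand = counts["E"]
    coil = total - helix - strand
    return {
        "alpha_helix": round(100 * helix / total, 2),
        "beta_sheet": round(100 * strand / total, 2),
        "coil": round(100 * coil / total, 2),
    }


# Hypothetical 20-position prediction: 8 helix positions, no strands.
example = "CCCCHHHHHHHHCCCCCCCC"
print(ss_distribution(example))
```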
[Feature Viewer: per-residue tracks for the representative sequence MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLR, showing α-helices, β-strands, coil, secondary-structure confidence score, and signal peptide]

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.63
[3D structure view, rendered with PDBe Molstar]

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
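
The two confidence measures on this page can be expressed as simple checks. The sketch below uses the pTM cutoff stated above (pTM < 0.7 is low confidence) and the four pLDDT bins from the legend. How fractional pLDDT values at bin edges are assigned is an assumption made here (each bin is closed at its upper bound), and the function names are illustrative.

```python
PTM_THRESHOLD = 0.7  # from the page: models with pTM < 0.7 are low confidence

# Upper bound and label of each pLDDT bin shown in the legend.
PLDDT_BINS = ((50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100"))


def is_low_confidence(ptm: float) -> bool:
    """True when the model falls below the pTM cutoff used on this page."""
    return ptm < PTM_THRESHOLD


def plddt_bin(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to its legend bin.

    Assumption: a score belongs to the first bin whose upper bound it
    does not exceed, so 50.4 falls in "51-70".
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    for upper, label in PLDDT_BINS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score was range-checked above")


print(is_low_confidence(0.63))  # pTM-score reported for F100553
print(plddt_bin(85.2))
```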



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart: NCBI taxonomy of family members. All Organisms: 74.5%; Unclassified: 25.5%]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart: associated habitat types. Habitat labels: Groundwater Sediment; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Forest Soil; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Agricultural Soil; Corn Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Agave; Tropical Rainforest Soil. Slice values shown: 7.8%, 10.8%, 13.7%, 3.9%, 15.7%, 9.8%, 5.9%, 6.9%, 2.9%]



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AF_2010_repII_A1DRAFT_101323261 | 3300000597 | Forest Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWAN
AF_2010_repII_A001DRAFT_100336781 | 3300000793 | Forest Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVW
JGI25613J43889_101155542 | 3300002907 | Grasslands Soil | MPSKTDLCVIASADLQRFASHLFVAAGVAPAMADEWAKSLVWANLRGV
Ga0066397_100110971 | 3300004281 | Tropical Forest Soil | MPSDTKLHVIPSADLQRFASALFEAAGVARSMADEWAKSLVWANLR
Ga0066823_100016882 | 3300005163 | Soil | MPPTSNDVHLISGSDLQRFASGLFQALGVAGPMADEWARSLVWANLRGVDS
Ga0070690_1002569981 | 3300005330 | Switchgrass Rhizosphere | MPPTSDVHLISGSDLQRFASALFQALGVAGPMADEWARSL
Ga0008090_100070981 | 3300005363 | Tropical Rainforest Soil | MPSKNESHVISSADLERFASALFAAAGVAPAMADEWAKSLVWANLRGVDS
Ga0070667_1011174812 | 3300005367 | Switchgrass Rhizosphere | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSL
Ga0070714_1004104052 | 3300005435 | Agricultural Soil | MPSDSKLHVIASADLQRFASALFAAAGVAPPMADEWAKSLVWANLRGV
Ga0070713_1015506981 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MSGKNELHVIRGDDLRRFVSGLFQALEVAPAMADEWATSLVWANLRGVD
Ga0070705_1001202511 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MPPTSDDVHLISGSDLQRFASALFQALGVAGPMADEWARSL
Ga0070663_1003014992 | 3300005455 | Corn Rhizosphere | MPPTSNDVHLIAGSDLQRFASALFQALGVAGPMADEWARSL
Ga0070707_1006176992 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MPSDSKLHVIASADLQRFASALFAAAGVAPPMADEWAKSLVWANLRG
Ga0070707_1022111941 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MSSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLRGVD
Ga0066700_103289412 | 3300005559 | Soil | MKPKTELHVISSADLHRFASALFATAGVVPAMAEEWAKSLVWANLRGVDSH
Ga0066699_109184331 | 3300005561 | Soil | MKSKTELHVISSADLHRFASALFAAARVAPAMAEEWAKSLV
Ga0058697_104603851 | 3300005562 | Agave | MLKASGMPPKLESQVISSADLERFASALFQAAGVAPSMADEWAK
Ga0070664_1000732363 | 3300005564 | Corn Rhizosphere | MPPTSNDVHLIAGSDLQRFASALFQALGVAGPMADEWARSLVW
Ga0066705_101710791 | 3300005569 | Soil | MKPKTELHVISSADLHRFASALFAAAGVVPAMAEEWAKSLVWANLRG
Ga0066654_103742641 | 3300005587 | Soil | MPSNTGSHVISSADLERFARALFVAVGVAPAMAEEWAKSLIWANL
Ga0066905_1005467032 | 3300005713 | Tropical Forest Soil | MPSDTKLHVIPSADLQHFARALFEAAGVAPPMADQWAKSLVWANL
Ga0066905_1013033842 | 3300005713 | Tropical Forest Soil | MPSELEVQVIPSADLQHFARALFEAAGVAPPMADQWAKSLVWANLRGV
Ga0066903_1012526332 | 3300005764 | Tropical Forest Soil | MASNSELHVIAAPDLERFASALFQALDVERSMADE
Ga0066903_1062283292 | 3300005764 | Tropical Forest Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLR
Ga0066903_1063964021 | 3300005764 | Tropical Forest Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLRGVDS
Ga0068858_1001546952 | 3300005842 | Switchgrass Rhizosphere | MPPTSDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANLRGV
Ga0070717_107773191 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MPSKTDLCVIASADLQRFASHLFVAAGVAPAMADEWA
Ga0075425_1028389532 | 3300006854 | Populus Rhizosphere | MPPTSDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANL
Ga0068865_1012917072 | 3300006881 | Miscanthus Rhizosphere | MPPTSDDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANLRGVDS
Ga0099829_101793663 | 3300009038 | Vadose Zone Soil | MPPKSDLHVIQSSDLQRFASALFQASGVARPMADEWAKSLVWANLRGVD
Ga0099827_102104262 | 3300009090 | Vadose Zone Soil | MPPKSELHVIQSHDLERFASALFQPLGVAPVMEKTWLML*
Ga0126374_104582302 | 3300009792 | Tropical Forest Soil | MSSRSELHVIQAADLERFASALFQAAGVAEAMADEWARSLVWANLRG
Ga0126380_115596941 | 3300010043 | Tropical Forest Soil | MPAKSELHLVRGSDLERFASALFQATGVARAMADDWAKSLVWANLRGT
Ga0126384_103682152 | 3300010046 | Tropical Forest Soil | MPAKAEMHVIAAAELERFASALLQALKVARPMADEWAKSL
Ga0126384_105860081 | 3300010046 | Tropical Forest Soil | MPSDTKLHVIPSTDLQRFARALFQAAGVAPPMADEWAKSLVW
Ga0126384_109737291 | 3300010046 | Tropical Forest Soil | LKTESHLILSADLQRFAGALFEAAGVAPAMADEWAKSLVWANLR
Ga0126373_114069651 | 3300010048 | Tropical Forest Soil | MPSKTEVHVIPSADLQRFASALFHAAGVAPPMADEWAKSLVW
Ga0134071_105255882 | 3300010336 | Grasslands Soil | MKPKTELHVISSADLHRFASALFATAGVVPAMAEE
Ga0126376_112134911 | 3300010359 | Tropical Forest Soil | MPSDSKLQVIPSADLQRFASALFHAAGVAPPMADEWAKSLVWANLRGV
Ga0126372_102888741 | 3300010360 | Tropical Forest Soil | MSSRSELHVVQASDLERFASALFQAAGVAEAMADEWARSLVWANLRG
Ga0126379_108536481 | 3300010366 | Tropical Forest Soil | MPAKAEMHVIAAAELERFASALLQALKVARPMADEWAKS
Ga0126381_1012092861 | 3300010376 | Tropical Forest Soil | MPSKLEVHVISSADLQRFAGALFQAAGVAPAMADQWAKSLVWANLRG
Ga0126381_1023296082 | 3300010376 | Tropical Forest Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADE
Ga0126383_106227492 | 3300010398 | Tropical Forest Soil | MSSRSALHVIQAADLERFASALFQAAGVTEAMADEWAR
Ga0134122_125675442 | 3300010400 | Terrestrial Soil | MPPTSELHLIAGSDLQRFASALFQALDVAGPMADE
Ga0137383_108082001 | 3300012199 | Vadose Zone Soil | MKPKTELHVISSADLHHFASALFAAAGVVPAMAEEWAKSLVWANL
Ga0137399_112550122 | 3300012203 | Vadose Zone Soil | MSSKSDLHVIQSSDLQRFASALFQASRVAQPMADEWAESLVWANLR
Ga0137381_117795422 | 3300012207 | Vadose Zone Soil | MPSKSESHVISSADLERFASALFQAAGVAPSMADEWAKSLVWAN
Ga0137377_106919802 | 3300012211 | Vadose Zone Soil | MPSKSESHVISSADLERFASALFQAAGVAPSMADEW
Ga0137360_115137732 | 3300012361 | Vadose Zone Soil | MPSDSKLHVIASADLQRFASALFAAAGVAPPMADEWAKSL
Ga0137361_100857713 | 3300012362 | Vadose Zone Soil | MPSDTKLHVIPSADLQHFARALFQAAGVAPAMADQWAKSLVWANL
Ga0137390_103683191 | 3300012363 | Vadose Zone Soil | MSAKSELHVIQSADLERFASALFEGAGVARPLAEEWAKSLIWANLR
Ga0157291_103313202 | 3300012902 | Soil | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSLVWAN
Ga0157295_103851822 | 3300012906 | Soil | MPSTSDLQLISGSDLQRFASALFQALGVAGPMADEWAR
Ga0126375_113202212 | 3300012948 | Tropical Forest Soil | MSSKLELHVIPTADLQHFARALFEAAGVAPPMADQWAKSLVWANLR
Ga0164300_106474301 | 3300012951 | Soil | MPSNTQSQVISSADRGRFAPPLFVAAGVAPAMAGAWAKSPVWADLRGGVDSHGVL
Ga0164298_103691681 | 3300012955 | Soil | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSLVWANLRGV
Ga0126369_133160141 | 3300012971 | Tropical Forest Soil | LQVTEMPSDTKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLR
Ga0163162_120635032 | 3300013306 | Switchgrass Rhizosphere | MSPTSDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANLRGVDS
Ga0157372_132027722 | 3300013307 | Corn Rhizosphere | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSLVWANLRGVD
Ga0157379_116421552 | 3300014968 | Switchgrass Rhizosphere | MSAKNELHVISGDDLRRFSSALFQARGVAPAMADEWATSLVWANLRGV
Ga0182033_112150662 | 3300016319 | Soil | LKTESHVILSADLQCFAGALFVAAGVAPAMADEWAKSLVW
Ga0182035_106828161 | 3300016341 | Soil | LKTESHVILSADLQCFAGALFVAAGVAPAMADEWAKSLVWANLRG
Ga0182035_119409571 | 3300016341 | Soil | MPSKRELHVIPSVDLQHFARALFEAAGVAPPMADQWAKSLVWANLR
Ga0182032_106757291 | 3300016357 | Soil | MPLKTESHVILSADLQCFAGALFVAAGVAPAMADEWAKSLVWANL
Ga0182034_103730971 | 3300016371 | Soil | MPPKLELHVIPSADLQHFARALFEAAGVAPPMADQWAKSL
Ga0182037_106204482 | 3300016404 | Soil | MPLKTESHVILSTGLQRFAGALFVAAGVAPAMADEWAKSLVWANLR
Ga0182037_113304062 | 3300016404 | Soil | MSSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVW
Ga0182039_114897231 | 3300016422 | Soil | MPLKTDLHIIQSDALERFASALFRALGVAPDMGDEWARSLVWANL
Ga0163161_112549701 | 3300017792 | Switchgrass Rhizosphere | MPPTSDDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANLRGV
Ga0184605_100630442 | 3300018027 | Groundwater Sediment | MPPKSELHVIPSSDLQGFASALFQALGVAGPMADEWA
Ga0137408_11042921 | 3300019789 | Vadose Zone Soil | MPSKSESHVVISSADLERFASALFQAAGVAPSMADEWAKSL
Ga0193701_10039013 | 3300019875 | Soil | MPPKSELHVIPSSDLQGFASALFQALGVAGPMADEWAKSLVWA
Ga0193727_11016082 | 3300019886 | Soil | MPPKSELHVIPSSDLQGFASALFQASGVAGPMADEWAKSLVWANLRGVDS
Ga0210400_115236911 | 3300021170 | Soil | MPSDSKLHVIASADLQRFASSLFAAAGVAPPMADEWAKSLVW
Ga0193698_10217911 | 3300021968 | Soil | MPPKSELHVIPSSDLQGFASALFQASGVAGPMADEWAKSLVWANL
Ga0207656_103552752 | 3300025321 | Corn Rhizosphere | MPPTSDDVHLISGSDLQRFASALFQALGVAGPMADEWARSLVWANLRG
Ga0207692_107727282 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MPSDSKLHVIASADLQRFASSLFAAAGVAPPMADEWAKSLVWA
Ga0207693_111639641 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSLVWANLRGVDS
Ga0207686_106625362 | 3300025934 | Miscanthus Rhizosphere | MPPTSNDVHLISGSDLQRFASGLFQALGVAGPMADEWARSLVWA
Ga0207651_109410842 | 3300025960 | Switchgrass Rhizosphere | MPPTSDDVHLISGSDLQRFASALFQALGVAGPMADE
Ga0207658_107905791 | 3300025986 | Switchgrass Rhizosphere | MPPTSELHLIAGSDLQRFASALFQALGVAGPMADEWAKSLVWANLRG
Ga0209283_108446632 | 3300027875 | Vadose Zone Soil | MSAKSELHVIQSADLERFASALFEGAGVARPLAEEWAKSLIWANLRG
Ga0307305_104014281 | 3300028807 | Soil | MPPKSELHVIPSSDLQGFASALFQALGVAGPMADEWAKSLVWANLRGVDS
Ga0318492_103198751 | 3300031748 | Soil | MPPKLELHVIPSADLQHFARALFEAAGVAPPMADQWAKSLVWANLR
Ga0318494_108431962 | 3300031751 | Soil | MPSDTKLHVIPSTDLQRFARALFQAAGVAPPMADEWAKSLVWAN
Ga0318537_100448783 | 3300031763 | Soil | MPSDSKLHVIPSADLQRFASALLAAAGVAPPMADEWAKSLVWANL
Ga0318546_105320262 | 3300031771 | Soil | MPSKLEVHVISSADLQRFAGALFQAAGVAPAMADQWAK
Ga0318497_103545463 | 3300031805 | Soil | MPLKTESHVILSTGLQRFAGALFVAAGVAPAMADEWAKSLVWANLRG
Ga0318495_104464341 | 3300031860 | Soil | MPPKLELHVIPSADLQHFARALFEAAGVAPPMADQWAKSLVWANLRGVDS
Ga0306925_119333291 | 3300031890 | Soil | MPSDTKLHVIPSTDLQRFASALFHAAGVAPPMADEWAKSLVWANLRGVDS
Ga0318522_100012956 | 3300031894 | Soil | MPSDSKLHVIPSADLQRFASALLAAAGVAPPMADEWAKSLVWANLRGVDS
Ga0306921_105702203 | 3300031912 | Soil | LKTKSHVIPSADLQRFAGALFVAAGVAPAMADEWAKSLVWAN
Ga0310916_106394981 | 3300031942 | Soil | MPSDSKLHVIASADLQRFASALLAAAGVAPPMADEWAKSLVWAN
Ga0318507_102896781 | 3300032025 | Soil | LKTESHVILSADLQCFAGALFVAAGVAPAMADEWAKSLVWANLRGVD
Ga0318507_103590332 | 3300032025 | Soil | MPSKNESHVIASADLERFASALFAAAGVAPAMADEWAKSLVWANLRGV
Ga0318575_100440124 | 3300032055 | Soil | MPLKTESHVILSTGLQCFAGALFVAAGVAPAMADEWAKSLVWANLRGVD
Ga0318505_101388302 | 3300032060 | Soil | MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWANLRGVD
Ga0318504_100367751 | 3300032063 | Soil | MPSDTKLHVIPSTDLQRFARALFQAAGVAPPMADEWAN
Ga0318513_105283041 | 3300032065 | Soil | MPSDSKLHVIPNADLQRFASALFAAAGVAPPMADEWA
Ga0318524_103394472 | 3300032067 | Soil | MPPKLELHVIPSADLQHFARALFEAAGVAPPMADQWAKSLVWANL
Ga0307471_1006971141 | 3300032180 | Hardwood Forest Soil | MTSKTELHVISGADLQRFASAFFAAAGVAQAMAEEWARSLVW
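
Once each entry above is split into its four fields (protein ID, sample taxon ID, habitat, sequence), summary figures such as the average sequence length and the habitat breakdown are straightforward to compute. The sketch below runs on four entries copied from the list, so its numbers are not the family-wide values. Treating a trailing `*` as a stop marker that does not count as a residue is an assumption, and the function names are illustrative.

```python
from collections import Counter
from statistics import mean

# Four entries copied from the list above: (protein ID, taxon ID, habitat, sequence).
records = [
    ("AF_2010_repII_A1DRAFT_101323261", "3300000597", "Forest Soil",
     "MPSDSKLHVIPSADLQRFASALFAAAGVAPPMADEWAKSLVWAN"),
    ("Ga0066823_100016882", "3300005163", "Soil",
     "MPPTSNDVHLISGSDLQRFASGLFQALGVAGPMADEWARSLVWANLRGVDS"),
    ("Ga0066699_109184331", "3300005561", "Soil",
     "MKSKTELHVISSADLHRFASALFAAARVAPAMAEEWAKSLV"),
    ("Ga0099827_102104262", "3300009090", "Vadose Zone Soil",
     "MPPKSELHVIQSHDLERFASALFQPLGVAPVMEKTWLML*"),
]


def residue_length(sequence: str) -> int:
    """Length in residues, ignoring a trailing '*' (assumed stop marker)."""
    return len(sequence.rstrip("*"))


def summarize(rows):
    """Return (mean sequence length, Counter of habitats) for the rows."""
    lengths = [residue_length(seq) for _, _, _, seq in rows]
    habitats = Counter(habitat for _, _, habitat, _ in rows)
    return mean(lengths), habitats


average_length, habitat_counts = summarize(records)
print(f"mean length: {average_length:.1f} residues")
for habitat, count in habitat_counts.most_common():
    print(f"{habitat}: {count}")
```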




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice