NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F096952

Metagenome Family F096952

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F096952
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 44 residues
Representative Sequence MIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLL
Number of Associated Samples 98
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 37.50 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 93.27 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(25.000 % of family members)
Environment Ontology (ENVO) Unclassified
(22.115 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(58.654 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64
1GPKNP_09226120
2ICCgaii200_10600761
3C688J13580_10356643
4JGI25405J52794_100191651
5Ga0062593_1024656581
6Ga0063455_1002034053
7Ga0063455_1002665361
8Ga0062595_1011034821
9Ga0062594_1030736021
10Ga0066673_108248642
11Ga0070666_107741341
12Ga0070669_1007748443
13Ga0070685_108926721
14Ga0070704_1003166763
15Ga0066698_101623351
16Ga0068855_1001010041
17Ga0068855_1003688611
18Ga0066708_105584581
19Ga0068856_1009078241
20Ga0066903_1018212453
21Ga0070717_121383211
22Ga0066659_113243471
23Ga0075436_1001158571
24Ga0079219_103400603
25Ga0075435_1010070791
26Ga0114129_119673132
27Ga0075423_118446361
28Ga0134082_103438611
29Ga0134084_102616121
30Ga0134065_103453303
31Ga0134065_103505141
32Ga0134063_101839863
33Ga0134125_102414881
34Ga0134127_125375451
35Ga0134122_111632963
36Ga0120161_10632073
37Ga0137383_106058931
38Ga0137374_100961351
39Ga0137386_107468721
40Ga0137366_106362333
41Ga0157326_10152601
42Ga0157303_100969841
43Ga0157301_100872653
44Ga0162651_1000147893
45Ga0164300_104829813
46Ga0164300_109435561
47Ga0164298_101704833
48Ga0164303_111202342
49Ga0164302_108824063
50Ga0164308_116670411
51Ga0164304_106967513
52Ga0164306_106571563
53Ga0164305_111025591
54Ga0157378_124630202
55Ga0120149_11955802
56Ga0182005_11687901
57Ga0134072_103304993
58Ga0132258_114228281
59Ga0132256_1003937541
60Ga0132257_1008466201
61Ga0132255_1011207111
62Ga0132255_1046798841
63Ga0187786_100947591
64Ga0184604_102386141
65Ga0187788_100002091
66Ga0184621_101817203
67Ga0193730_11758092
68Ga0247785_10161021
69Ga0247788_11324441
70Ga0247669_10058241
71Ga0247681_10158403
72Ga0207680_108919271
73Ga0207704_108973113
74Ga0207679_107016513
75Ga0209153_12617102
76Ga0209267_13127312
77Ga0209056_107292772
78Ga0209577_101882671
79Ga0207981_11041661
80Ga0247822_117790242
81Ga0307293_100149081
82Ga0307313_100239713
83Ga0307315_101823573
84Ga0307306_102582791
85Ga0307287_102655713
86Ga0307292_100391983
87Ga0307296_100959181
88Ga0307312_105271853
89Ga0307289_102783251
90Ga0307286_103117202
91Ga0307278_100238064
92Ga0307277_101525081
93Ga0307308_106424752
94Ga0268241_101571212
95Ga0307496_101215081
96Ga0307469_113908541
97Ga0310901_105044612
98Ga0310895_106853131
99Ga0307470_116251142
100Ga0307471_1007061533
101Ga0247829_111833813
102Ga0247829_114294451
103Ga0373948_0161577_1_108
104Ga0373950_0161034_2_133
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 50.72%    β-sheet: 0.00%    Coil/Unstructured: 49.28%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540MIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLLSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Soil
Soil
Agricultural Soil
Permafrost
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Soil
Arabidopsis Rhizosphere
Corn Rhizosphere
Tabebuia Heterophylla Rhizosphere
Miscanthus Rhizosphere
Arabidopsis Rhizosphere
Populus Rhizosphere
Rhizosphere
Miscanthus Rhizosphere
Rhizosphere Soil
Switchgrass Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
25.0%3.8%2.9%5.8%2.9%8.7%2.9%4.8%3.8%3.8%2.9%3.8%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPKNP_092261202070309009SoilMNLRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLVVPL
ICCgaii200_106007612228664021SoilMNLRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLVVPLAAGG
C688J13580_103566433300001205SoilMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLL
JGI25405J52794_1001916513300003911Tabebuia Heterophylla RhizosphereMNVRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLGIPLA
Ga0062593_10246565813300004114SoilMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLLIVPLA
Ga0063455_10020340533300004153SoilMFRDLYSNLTAREQRALRWSAAATAAVLWLTTAWADPWLLLGVPVVA
Ga0063455_10026653613300004153SoilMIHDLYSSLTAREQRALRWSGAATAAALWLSTAWADPWVLVSVPVIAGAFW
Ga0062595_10110348213300004479SoilMFSDLSDLYTNLTAREQRALRWSAAATAAALWLSTAWTDPWLLLA
Ga0062594_10307360213300005093SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAIPVLAGGLWVYHR
Ga0066673_1082486423300005175SoilMFRDLYTNLSAREQRAFKWSAAATAAALWLSTAWTDPWLLLAVPL
Ga0070666_1077413413300005335Switchgrass RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLLIVPLTGAGFWFYR
Ga0070669_10077484433300005353Switchgrass RhizosphereMISDLYTSLTAREQRAFRWSAAAAAVALWLSMAWMDPMILLTVPL
Ga0070685_1089267213300005466Switchgrass RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLLIVPL
Ga0070704_10031667633300005549Corn, Switchgrass And Miscanthus RhizosphereMIRDLYTSMTAREQRAIRWSAAATAAALWLSTAWADPRFLL
Ga0066698_1016233513300005558SoilMFRDLYSNMTAREQRALRWSAAATAVALWLSTAWADPWLLLAVPLSG
Ga0068855_10010100413300005563Corn RhizosphereMISDLYTSLTAREQRAFRWSAAAAAVALWLSMAWMDPMILLTVPLIGGAF
Ga0068855_10036886113300005563Corn RhizosphereMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADP
Ga0066708_1055845813300005576SoilMFRDLYTNLNTREQRALRWSAAATAAALWLSTAWADPWLLL
Ga0068856_10090782413300005614Corn RhizosphereMFSELSDLYSNLTAREQRALRWSAAATAAVLWLATAWTDPWLLLAVP
Ga0066903_10182124533300005764Tropical Forest SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLGIPLAAGGFWFYRRR
Ga0070717_1213832113300006028Corn, Switchgrass And Miscanthus RhizosphereMIHDLYTSLTAREQRAVRWSAAATAAALWLSTAWA
Ga0066659_1132434713300006797SoilMFRDLYTNLNAREQRALRWSAAATAAALWLSTAWADPW
Ga0075436_10011585713300006914Populus RhizosphereMNFRDLYESLTAREQRALRWSAAATAVALWLSTAW
Ga0079219_1034006033300006954Agricultural SoilMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWA
Ga0075435_10100707913300007076Populus RhizosphereMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLGIPLAAGGFWFYRR
Ga0114129_1196731323300009147Populus RhizosphereMISDLYTSLTAREQRASRWSAAATAVALWLSMAWADPKFLLTVPLIGGGFYLYRI
Ga0075423_1184463613300009162Populus RhizosphereMISDLYTSLTAREQRAFRWSAAATAVALWLSMAWADPKFLL
Ga0134082_1034386113300010303Grasslands SoilMFRDSYTNLTAREQRALRWSAAATAAALWLSTAWADPWL
Ga0134084_1026161213300010322Grasslands SoilMFRDLYSNMTAREQRAFRWSAAATAVALWLSTAWADPWLLLAVPLSAGA
Ga0134065_1034533033300010326Grasslands SoilMFRDLYSNMTAREQRALRWSAAATAVALWLSTAWA
Ga0134065_1035051413300010326Grasslands SoilMFSDLYSNLTAREQRALRWSAAATAAALWLSTAWADPWL
Ga0134063_1018398633300010335Grasslands SoilMFRDLYTNLSAREQRAFKWSAAATAAALWLSTAWT
Ga0134125_1024148813300010371Terrestrial SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAIPV
Ga0134127_1253754513300010399Terrestrial SoilMFRDLYTNLSAREQRAFKWSAAATAAALWLSTAWTDPW
Ga0134122_1116329633300010400Terrestrial SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLSMAWMDPMILLTVPLIGG
Ga0120161_106320733300012005PermafrostMISDFYSSLTSREQRALRWSAAATAVALWLATAWADPWIL
Ga0137383_1060589313300012199Vadose Zone SoilMINDLYSSLTSREQRALRWSAAATAVALWLATAWADPWVLVTVP
Ga0137374_1009613513300012204Vadose Zone SoilMMFRDLYSNLTAREQRAFKWSAGATAAALWLSTAWTDPWL
Ga0137386_1074687213300012351Vadose Zone SoilMMREFFDDLTSREQRALGWSAVAVAAALWLATAWTDPWLLLGV
Ga0137366_1063623333300012354Vadose Zone SoilMMFRDLYSNLTAREQRAFKWSAGATAAALWLSTAWTDPWLLLVVPASAAAFFVYR
Ga0157326_101526013300012513Arabidopsis RhizosphereMISDLYTSLTAREQRALRWSAAATAVALWLSMAWADPKFLLTVP
Ga0157303_1009698413300012896SoilMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLIVVPLAAGGFW
Ga0157301_1008726533300012911SoilMNFRDLYESLTAREQRALRWSAAAAAVALWLSTAWADPRFLIV
Ga0162651_10001478933300012938SoilMMRDLYSSLTPREQRAIRWSAAATAAALWLSTAWADPWFLL
Ga0164300_1048298133300012951SoilMIRDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLLIVPLAGAGFWFYR
Ga0164300_1094355613300012951SoilMFSELNDLYANLTAREQRALRWSAAATAAALWLSTAWTDP
Ga0164298_1017048333300012955SoilMIRDLYTSLTAREQRAIRWSAAATAAALWLSTAWA
Ga0164303_1112023423300012957SoilMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLLIVPLAGAGFWFYRR
Ga0164302_1088240633300012961SoilMIRDLYTSLTAREQRAIRWSAAAPAAALWLSTAWADP
Ga0164308_1166704113300012985SoilMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWAD
Ga0164304_1069675133300012986SoilMIQDLYSSLTAREQRALRWSAAAAAAALWLSTAWA
Ga0164306_1065715633300012988SoilMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLLIVPLAGAAFWFYR
Ga0164305_1110255913300012989SoilMFRDLYTNLTAREQRALRWSAAASAAVLWLATAWTDPWLLLAVPLVAGAFFAYRA
Ga0157378_1246302023300013297Miscanthus RhizosphereMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAIPVLAGGLWVYHRR
Ga0120149_119558023300014058PermafrostMINDLYSSLTPREQRAFRWSAAATAVALWLATAWAD
Ga0182005_116879013300015265RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSAAWADPRFLLIVPLAGAGFWF
Ga0134072_1033049933300015357Grasslands SoilMFRDLYSNLTAREQRALRWSAAATAAALWLATAWADPWL
Ga0132258_1142282813300015371Arabidopsis RhizosphereMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLIVVPLAA
Ga0132256_10039375413300015372Arabidopsis RhizosphereMMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLVIPLAAGGFWFYRR
Ga0132257_10084662013300015373Arabidopsis RhizosphereMNFRDLYESLTAREQRALRWRAAATAVALWLSTAWADPRFLLGIPL
Ga0132255_10112071113300015374Arabidopsis RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFLL
Ga0132255_10467988413300015374Arabidopsis RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWADPRFL
Ga0187786_1009475913300017944Tropical PeatlandMLRDLYSNLTAREQRALRWSAAATAVALWLSVAWADLWLL
Ga0184604_1023861413300018000Groundwater SedimentRRPSVMMRDLYSSLTAREQRAIRWSAAATAAALWLSTA
Ga0187788_1000020913300018032Tropical PeatlandMFSDLYSNLTAREQRALRWSAAATAAALWLSTAWADPWLLLAVPLA
Ga0184621_1018172033300018054Groundwater SedimentMIQDLYSSLTAREQRALRWSAAATAVALWLSTAWADPW
Ga0193730_117580923300020002SoilMIHDLYSSLTAREQRAVRWSAAATAAALWLSTAWADPWFLVTVPMIAGGF
Ga0247785_101610213300022889SoilMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLGIPLA
Ga0247788_113244413300022901SoilMNFRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLGI
Ga0247669_100582413300024182SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAIPVLA
Ga0247681_101584033300024310SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAIP
Ga0207680_1089192713300025903Switchgrass RhizosphereMIRDLYTSLTAREQRAVRWSAAATAAALWLSTAWA
Ga0207704_1089731133300025938Miscanthus RhizosphereMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLLIVPLAGA
Ga0207679_1070165133300025945Corn RhizosphereMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLA
Ga0209153_126171023300026312SoilMFRDLYTNLNAREQRALRWSAAATAAALWLSTAWADPWLLLAVPLAGA
Ga0209267_131273123300026331SoilMFRDLYTNLSAREQRAFKWSAAATAAALWLSTAWTDPWLLLAVPLAAVGFFV
Ga0209056_1072927723300026538SoilMFRDLYTNLNAREQRALRWSAAATAAALWLSTAWADPWLLLAVPLAGAGFFVY
Ga0209577_1018826713300026552SoilMFRDLYTNLSAREQRAFKWSAAATAAALWLSTAWTAP
Ga0207981_110416613300027560SoilMLRDLYSNLTAREQRALRWSAAATAVALWLSVAWADLWLLLAVPL
Ga0247822_1177902423300028592SoilMIHDLYTSLTAGEQRAIRWSAAATAAALWLSTAWADPR
Ga0307293_1001490813300028711SoilMINDLYSSLTAREQRALRWSAAATAVALWLSTAWADPW
Ga0307313_1002397133300028715SoilMIQDLYSSLTAREQRALRWSAAATAVALWLSTAWADPWILI
Ga0307315_1018235733300028721SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLAMAWADPMILLTVPLIGG
Ga0307306_1025827913300028782SoilMIHDLYSSLTAREQRAVRWSAAATAAALWLSTAWADP
Ga0307287_1026557133300028796SoilMIQDLYSSLTAREQRALRWSAAATAVALWLSTAWADPWILITVP
Ga0307292_1003919833300028811SoilMFRDLYSNLTAREQRALRWSAAATAAALWLATAWADPWLLLA
Ga0307296_1009591813300028819SoilMIHDLYTSLTAREQRAIRWSAAATAAALRLSTAWA
Ga0307312_1052718533300028828SoilMFRDLYSNLTAREQRALRWSAAATAAPLWLAPAWTHTRLLH
Ga0307289_1027832513300028875SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLAMAWADPMILLT
Ga0307286_1031172023300028876SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLAMAWADPM
Ga0307278_1002380643300028878SoilMFRDLYSNLTVREQRALRWSAAATAAALWLATAWTDPRL
Ga0307277_1015250813300028881SoilMIGDMYSSLTAREQRALRWSAGATAAALWLTTAWADPWILLTVP
Ga0307308_1064247523300028884SoilMINDLYSSLTAREQRALRWSAAATAVALWLSTAWADPWILITVPLIGAA
Ga0268241_1015712123300030511SoilMNFRDLYESLTAREQRALRWSAAATAAALWLSTAWTDPRFLVVVPLAVGGFWLYRR
Ga0307496_1012150813300031200SoilMINDLYSSLTAREQRALRWSAAATAVALWLSTAWADP
Ga0307469_1139085413300031720Hardwood Forest SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLSMAWADPKFLL
Ga0310901_1050446123300031940SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWADPRFLLAI
Ga0310895_1068531313300032122SoilMNFRDLYDSLTAREQRALRWSAAATAVALWLATAWADPRFLLAIPVLAGGLW
Ga0307470_1162511423300032174Hardwood Forest SoilMIHDLYTSLTAREQRAIRWSAAATAAALWLSTAWADPRFLLIVPLAGAAFW
Ga0307471_10070615333300032180Hardwood Forest SoilMFSDLYTNLTAREQRALRWSAAATAAALWLATAWADPWLLLAVPLAAA
Ga0247829_1118338133300033550SoilMIRDLYTSMTAREQRAIRWSAAATAAALWLSTAWA
Ga0247829_1142944513300033550SoilMISDLYTSLTAREQRAFRWSAAAAAVALWLSMAWMDPMILLTVPLIGGAFY
Ga0373948_0161577_1_1083300034817Rhizosphere SoilMNFRDLYESLTAREQRALRWSAAATAVALWLATAWA
Ga0373950_0161034_2_1333300034818Rhizosphere SoilMNVRDLYESLTAREQRALRWSAAATAVALWLSTAWADPRFLLGI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.