NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F078432


Overview

Basic Information
Family ID F078432
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 40 residues
Representative Sequence MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGR
Number of Associated Samples 98
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 74.14 %
% of genes near scaffold ends (potentially truncated) 99.14 %
% of genes from short scaffolds (< 2000 bps) 89.66 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (sequence logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (98.276 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(28.448 % of family members)
Environment Ontology (ENVO) Unclassified
(28.448 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(44.828 % of family members)
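
The shares reported above can be turned back into approximate member counts. The following is a minimal Python sketch, assuming each percentage is taken over the family's 116 sequences (the page does not state the denominator explicitly); the function name `members_from_percent` is hypothetical and used only for illustration.

```python
# Assumption: every percentage is a share of the 116 family members
# listed under "Number of Sequences".
FAMILY_SIZE = 116


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported percentage."""
    return round(percent * family_size / 100)


# Percentages copied from the Overview section.
shares = {
    "Bacteria": 98.276,
    "Vadose Zone Soil (GOLD)": 28.448,
    "Soil, non-saline (EMPO)": 44.828,
}

counts = {label: members_from_percent(p) for label, p in shares.items()}

for label, count in counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer is a design choice here: the page reports percentages to at most three decimals, so the product is only approximately a whole number.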




Multiple Sequence Alignments

Full Alignment: alignment of all 116 sequences in the family, displayed with MSAViewer. Rows are labeled 1 to 116 by protein ID; the column ruler runs from 2 to 58.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Mixed
Signal Peptide: No
Secondary Structure distribution:
α-helix: 27.94%
β-sheet: 5.88%
Coil/Unstructured: 66.18%
Feature Viewer tracks for the representative sequence MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGR (positions 1 to 40): sequence, α-helices, β-strands, coil, and secondary-structure confidence score.

Predicted 3D Structure

Structure Viewer (PDBe Molstar), colored by per-residue confidence (pLDDT) in four bands: 0-50, 51-70, 71-90, 91-100.
pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
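
The low-confidence rule stated above (pTM < 0.7) can be written as a small check. This is a minimal Python sketch; the names `LOW_CONFIDENCE_PTM` and `is_low_confidence` are hypothetical, and only the 0.7 threshold and the 0.47 score come from this page.

```python
# Threshold quoted in the note above; scores below it count as low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the threshold."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return ptm < threshold


# pTM-score reported for family F078432.
family_ptm = 0.47
low_confidence = is_low_confidence(family_ptm)
print(f"pTM {family_ptm}: low confidence = {low_confidence}")
```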



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy chart, rendered with ApexCharts. Legend entries: All Organisms, Unclassified. Labeled share: 98.3%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat chart, rendered with ApexCharts. Legend entries, in chart order: Bog, Freshwater Sediment, Iron-Sulfur Acid Spring, Vadose Zone Soil, Grasslands Soil, Peatlands Soil, Soil, Grasslands Soil, Soil, Soil, Hardwood Forest Soil, Soil, Tropical Peatland, Soil, Tropical Forest Soil, Forest Soil, Corn, Switchgrass And Miscanthus Rhizosphere, Rhizosphere.
Labeled slice shares, in chart order: 3.4%, 28.4%, 3.4%, 7.8%, 23.3%, 7.8%, 5.2%, 4.3%, 3.4%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1011934611 | 3300002245 | Forest Soil | MSRKMTPEEKQATELVRRRMKESQSSEFLGFDVES
JGIcombinedJ26739_1017869461 | 3300002245 | Forest Soil | MAPELTELEKQMTELVRQRMQASNSSEMLGFEVESVHDGRAVFVLR
Ga0066679_109821671 | 3300005176 | Soil | MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGRAIFRLDV
Ga0066684_102118533 | 3300005179 | Soil | MAKKMSKEEKRVTELVRRRMRESRSTELLGFDVESVHAGRAIFRLDV
Ga0066675_103282982 | 3300005187 | Soil | MTRKMSKAEKQVTELVRRRMKESRSTELLGFDVESVHAGR
Ga0066388_1035828832 | 3300005332 | Tropical Forest Soil | MVPKLTDEEKRVTELVRQRIKESKAIELLGFDVESVHEGRAI
Ga0070711_1020561202 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSRKMSKEEERVTELVRQRMKESMATELLGFDVESVHDGRAIFRLDV
Ga0066701_102223222 | 3300005552 | Soil | MTKKMSKGEKRFTELVRQRMKESRSTELLGFDVESVHAGRAIFRLDV
Ga0066691_105328371 | 3300005586 | Soil | MTKKMSKGEKRVTQLVRRRMKESRSTELLGFDVESVHAG
Ga0066652_1014435802 | 3300006046 | Soil | MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESV
Ga0066660_112587531 | 3300006800 | Soil | MTRKMSKAEKQVTELVRRRMKESRSTELLGFDVESVHAG
Ga0099794_103224901 | 3300007265 | Vadose Zone Soil | MTKKISKEEKRVTELVRQRMKASHSMEMLGFDVESV
Ga0099794_107142091 | 3300007265 | Vadose Zone Soil | MTKGMSKEEKRVTELVRRRMKESRSTELLGFEVESVHKGRAIFRL
Ga0099830_104108892 | 3300009088 | Vadose Zone Soil | MTKKMTKEEKRVTELVRRRMKESRSTELLGFDVESVHAGRAIF
Ga0099828_103829361 | 3300009089 | Vadose Zone Soil | MTKKMSNEEKRVTEMVRQRMKESRSTELLGFDVESVHA
Ga0116224_104733061 | 3300009683 | Peatlands Soil | MARKFSNAEKRATELVRQRMKESKATELLGFDVESVQE
Ga0134064_100646311 | 3300010325 | Grasslands Soil | MTKKMSKEEKRVTELVRQRMKESRSTEFLGFDVESVHAGR
Ga0137392_111450241 | 3300011269 | Vadose Zone Soil | MAGKRRNTMTKRMSKEEKRVTELVRRRMKESRSTELLGF
Ga0137391_103332231 | 3300011270 | Vadose Zone Soil | MSKEEKRITEMVRQRIKESRSTELLGFDVESVHEGRAIFR
Ga0137393_116280371 | 3300011271 | Vadose Zone Soil | MTKKMSKEEKRVTELVRQRMKESRSMELLGFDVESVHEGRA
Ga0137389_106360691 | 3300012096 | Vadose Zone Soil | MTKKMTKEEKRVTELVRRRMKESRSTELLGFDVESV
Ga0137388_108447992 | 3300012189 | Vadose Zone Soil | MTKKMSNEEKRVTELVRRRMKESRSTELLGFDVESVQAGRA
Ga0137364_102445241 | 3300012198 | Vadose Zone Soil | MSKKMSKEERRVTELVRLRMKESRSTELLGFDVERVHAGRAI
Ga0137363_106656862 | 3300012202 | Vadose Zone Soil | MTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGRAIFRL
Ga0137399_108754381 | 3300012203 | Vadose Zone Soil | MTKKMSKEEKRITELVRQRMKESRSTELLGFDVESV
Ga0137362_101769891 | 3300012205 | Vadose Zone Soil | MERLDKMTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGRA
Ga0137362_104348611 | 3300012205 | Vadose Zone Soil | MSKEEKRVTELVRRRMKESRSTDLLGFEVESVHKGRAIFRL
Ga0137381_105218401 | 3300012207 | Vadose Zone Soil | MTKKMSKEEKRVTELVRRRMKESRSTELLGFDVESVHAGRAI
Ga0137379_106270002 | 3300012209 | Vadose Zone Soil | MSKDEKRITELVRQRMKESRSTELLGFEVESVHQG
Ga0137360_100190221 | 3300012361 | Vadose Zone Soil | MTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGRAI
Ga0137360_106180521 | 3300012361 | Vadose Zone Soil | MERLDKMTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREG
Ga0137361_103418231 | 3300012362 | Vadose Zone Soil | MSKEEKRITELVRRRMKESRSTELLGFDVESVHAGRAIFRLDVR
Ga0137390_103615003 | 3300012363 | Vadose Zone Soil | MTKKMSKEEKRVTELVRRRMKESRSTELLGFDVESVHTGRA
Ga0137398_100120861 | 3300012683 | Vadose Zone Soil | MTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGRA
Ga0137398_104146312 | 3300012683 | Vadose Zone Soil | MERLDKMTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGRAIFR
Ga0137410_102157831 | 3300012944 | Vadose Zone Soil | MTKGMSKEEKRVTELVRRRMKESRSTELLGFEVESVHKGRAI
Ga0134081_101952881 | 3300014150 | Grasslands Soil | MTKKMSKGEKRFTELVRRRMKESRSTELLGFDVESV
Ga0181533_11119932 | 3300014152 | Bog | MSRKLSREEKKVTGLVRRRMRESKATELLGFDVES
Ga0134079_101133241 | 3300014166 | Grasslands Soil | MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGR
Ga0134079_103341391 | 3300014166 | Grasslands Soil | MAKKMSKEEKRVTELVRRRMRESRSTELLGFDVESV
Ga0137411_10482612 | 3300015052 | Vadose Zone Soil | MSKEEKRVTELVRRRMKESRSTDLLGFEVESVHKG
Ga0137411_10663702 | 3300015052 | Vadose Zone Soil | MSKEEKRVTELVRRRMKESRSTELLGFEVESVHKGRAIFRL
Ga0182034_100930963 | 3300016371 | Soil | MPRKMTVEEKRATELVRARMRESKSTELLGFDVESVH
Ga0182039_106729882 | 3300016422 | Soil | MAKARKLSSEEKLVTELVRQRMRESKATELLGFDVES
Ga0182039_108828203 | 3300016422 | Soil | MARKMSKEEKRVTELVRQRMKECMATELLGFDVESVHD
Ga0187802_104012771 | 3300017822 | Freshwater Sediment | VRKSPKMSGEEKRVTELVRQRMKESKATELLGFDVESVQ
Ga0187801_100403753 | 3300017933 | Freshwater Sediment | MRRKLTKEEKRVTELVRLRMKESQSSELLGFDVESVHDGRAIFGM
Ga0187779_107176112 | 3300017959 | Tropical Peatland | MTRKLTPEEQKATELVRQRMKESKSTELLGFDVESVQDGRAVFRL
Ga0187779_110095061 | 3300017959 | Tropical Peatland | MPRKMTEEEQKATELVRQRMKESKSTELLGFDVESVHDG
Ga0187777_103837451 | 3300017974 | Tropical Peatland | MPRKMTEEEQKATELVRQRMKESKSTELLGFDVESVHDGRAI
Ga0187816_1000167010 | 3300017995 | Freshwater Sediment | MVRKSPKMSDQEKRVTELVRQRMKESKAIALLGFDVE
Ga0187804_102639772 | 3300018006 | Freshwater Sediment | MGRKMSREEKRATELVRLRMKESKSSELLGFDVESVHD
Ga0187770_103882021 | 3300018090 | Tropical Peatland | MSRKLSREEKRITGLVRRRMRESKATELLGFDVESVHDGRAIF
Ga0187770_113980831 | 3300018090 | Tropical Peatland | MSRKLTKEEKRVTELVRQRMKESKSSELLGFDVESVHDGRAIFR
Ga0066669_117564301 | 3300018482 | Grasslands Soil | MTKKMSKAEKQVTELVRRRMKESRSTELLGFDVESVHA
Ga0179594_103419342 | 3300020170 | Vadose Zone Soil | MTKKMSKEEKRVTELVRRRMKESRSTELLGFDVESVH
Ga0179592_103743911 | 3300020199 | Vadose Zone Soil | MSRKMTAQEKQVTEMVRRRIKESDSTELLGFDVESVHDGRAIFR
Ga0210407_100397394 | 3300020579 | Soil | MSRKMSKEEERVTELVRQRMKESMATELLGFDVESVHDGRAIFR
Ga0210407_112604261 | 3300020579 | Soil | MSRKMSKEEKRVTELVRQRMKESMATELLGFDVESVHDGR
Ga0210403_114875582 | 3300020580 | Soil | MTKNKKMSKEEKRVTELVRRRIKESRSTELLGFDVESVHAGRAIF
Ga0179596_103717982 | 3300021086 | Vadose Zone Soil | MTKKMSKEEKRVTELVRRRMKESRSTELLGFDVESVHA
Ga0210406_106745402 | 3300021168 | Soil | MSRKLTEEEKRVTELVRQRMKESKSTELLGFEVESVHEG
Ga0210400_104574601 | 3300021170 | Soil | MARKMTPEEKQATELVRRRMKESMSSDLLGFDVESVHDGRAIFRL
Ga0210385_114990252 | 3300021402 | Soil | MTKKMSKAEKQATEMVRQRLRESRSTELLGFDVESVQTG
Ga0210410_101711451 | 3300021479 | Soil | MSRKMTPEEKQATELVRRRMKESKSSELLGFDVESVHDGRAIFRLD
Ga0210410_105788002 | 3300021479 | Soil | MARKMTPEEKQATELVRRRMKESMSSELLGFDVES
Ga0210409_107617952 | 3300021559 | Soil | MTKKMSKDEKRVTAMVRRRIKESRSTELLGFDVESVHTGRA
Ga0210409_115224111 | 3300021559 | Soil | MSRKLTEEEKRITEFVRQRMKESKSTELLGFEVESVHEGRAIFRLD
Ga0212123_107405101 | 3300022557 | Iron-Sulfur Acid Spring | MAPELTELEKQMTELVRQRMQESNSSEMLGFEVES
Ga0228598_11022902 | 3300024227 | Rhizosphere | MTKKMSKEEKRVTEMVRRRIKESRSTELLGFDVESVQAGRAVF
Ga0209155_12008772 | 3300026316 | Soil | MTRKMSKAEKQVTELVRRRMKESRSTELLGFDVES
Ga0209155_12582312 | 3300026316 | Soil | MTKKMSKAEKQVTELVRRRMKESRSTELLGFDVES
Ga0209803_13464851 | 3300026332 | Soil | MTKRMSKEEKRVTELVRRRMKESRSTDLLGFEVES
Ga0209158_12117371 | 3300026333 | Soil | MTKKMSREEKRVTELVRQRMKESHSMEMLGFDVESVREGR
Ga0179593_10748791 | 3300026555 | Vadose Zone Soil | MTKKMSKEEKRITELVRQRMKEKPLDGASGFDVESVHAGRAFSGWT
Ga0179587_105940202 | 3300026557 | Vadose Zone Soil | MSRKMTPEEKHVTELVRRRMKESKSSELLGFDVES
Ga0207802_10191451 | 3300026847 | Tropical Forest Soil | MARKMTAEEKRATELVRARMRESKSTELLGFDVESV
Ga0207727_1093612 | 3300026854 | Tropical Forest Soil | MGRKLTADEQKITELVRQRMKESKSTELLGFDVESVHDGRAVFR
Ga0207815_10058751 | 3300027014 | Tropical Forest Soil | MPRKMTAEEKRATELVRARMRESNSTELLGFDVESVHNGRAV
Ga0209106_10653582 | 3300027616 | Forest Soil | MNKKMSKEEKRITELVRRRMKESRSTELLGFDVES
Ga0209517_102355452 | 3300027854 | Peatlands Soil | MSRKLSSEEKRVTRLVRRRMRESKATELLGFDVESVHDGRAIFRLDV
Ga0209701_106400871 | 3300027862 | Vadose Zone Soil | MTKKMSNEEKRITELVRRRMKESRSTELLGFDVESVHAGRAI
Ga0209283_104978351 | 3300027875 | Vadose Zone Soil | LTWRQNEMTKKMSKEEKRVTELVRRRMKESRSTELLGFDVE
Ga0209488_100332631 | 3300027903 | Vadose Zone Soil | MTKRMSKEEKRVTELVRRRMKESRSTELLGFEVESVH
Ga0222749_105681162 | 3300029636 | Soil | MTKKMSKAEKQATEMVRQRLRESRSTELLGFDVESV
Ga0073994_100670403 | 3300030991 | Soil | MTKKMSKEEKRITELVRQRMKESRSTELLGFDVESVHA
Ga0318541_102791392 | 3300031545 | Soil | MVRKLTRKEQRITELVRQRMRESKSTELLGFAVESVHDGRA
Ga0318541_107825112 | 3300031545 | Soil | MPRKMSVEEKRATELVRARMRESKSTELLGFDVESVHNGRAVFF
Ga0318528_105056541 | 3300031561 | Soil | MVRKLTRKEQRITELVRQRMRESKSTELLGFAVESVHDGRAVF
Ga0318542_104808172 | 3300031668 | Soil | MAKARKLSSEEKLVTELVRQRMRESKATELLGFDVESVQD
Ga0318561_100089021 | 3300031679 | Soil | MPRKMNADEKRATELVRARMRESKSTELLGFDVESVH
Ga0318560_108195031 | 3300031682 | Soil | MPRKMNADEKRATELVRARMRESKSTELLGFDVESVHNG
Ga0307476_105057271 | 3300031715 | Hardwood Forest Soil | MVRKLSKEEKRATELVRRRMKGNESSELLGFEVESV
Ga0307474_100829703 | 3300031718 | Hardwood Forest Soil | MTKRMSKVEKRVTELVRRRIRESRSTELLGFDVESVHEGRAIFR
Ga0307469_101839941 | 3300031720 | Hardwood Forest Soil | MTKRMTKEEKRVTEMVRRRIKESRSTELLGFDVESVQ
Ga0306918_101258803 | 3300031744 | Soil | MAKARKLSSEEKLVTELVRQRMRESKATELLGFDV
Ga0318546_112780311 | 3300031771 | Soil | MPRKMNADEKRATELVRARMRESKSTELLGFDVESVHNGRAVFFLDV
Ga0318566_103082692 | 3300031779 | Soil | MAKARKLSSEEKLVTELVRQRMRESKATELLGFDVESVQDGRAI
Ga0318552_103362202 | 3300031782 | Soil | MPRKMTVEEKRATELVRARMRESKSTELLGFDVESVHNGRAVF
Ga0310917_102308101 | 3300031833 | Soil | MPRKMTAEEKRATELVRARMRESKSTELLGFDVESVHDGRAVFFLD
Ga0306925_102677061 | 3300031890 | Soil | MAGPLSDEEKRLTELVRLRMKESKSSALLGFDVESVHDGRAVFRLKV
Ga0306925_106492082 | 3300031890 | Soil | MPRKMSVEEKRATELVRARMRESKSTELLGFDVES
Ga0306923_102508393 | 3300031910 | Soil | MPRKMTAEEKRATELVRARMRESKSTELLGFDVESVHDGRA
Ga0310912_100129407 | 3300031941 | Soil | MPRRMTAEEKRATELVRARMRESKSTELLGFDVES
Ga0310912_105419671 | 3300031941 | Soil | MAGPLSDEEKRLTELVRLRMKESKSSALLGFDVESVHDGRAVFRLKVG
Ga0310910_108888052 | 3300031946 | Soil | MPRKMTAEEKRATELVRARMRESKSTELLGFDVESVHDGRAVFF
Ga0306926_125692331 | 3300031954 | Soil | MSRKMTAQEKQATELVRRRMKESKSSELLGFDVESVHDGRAVFRL
Ga0306922_109442472 | 3300032001 | Soil | MPRKMSVEEKRATELVRARMRESKSTELLGFDVESVHNGRAV
Ga0318575_101330533 | 3300032055 | Soil | MPRKMNADEKRATELVRARMRESKSTELLGFDVESVHSGRAVFF
Ga0311301_104408291 | 3300032160 | Peatlands Soil | MSRKLSREEKNATELVRRRMRESKATELFGFDVESVHDGRAI
Ga0307471_1001945843 | 3300032180 | Hardwood Forest Soil | MSKEEKRVTELVRRRMKESRSTELLGFEVESVHKGRAIFRLDVR
Ga0307471_1021555491 | 3300032180 | Hardwood Forest Soil | MSRKMTAQEKQVTEMVRRRIKESDSTELLGFDVESVHDGRAVFRLDV
Ga0307472_1005045241 | 3300032205 | Hardwood Forest Soil | MSRKMTPEEKQATELVRRRMKESKSSELLGFDVESVHDGRAIF
Ga0335069_108348341 | 3300032893 | Soil | MARKLTKEEKRVTELVRQRMKESKSSELLGFDVESVHDGRAIFRLDV
Ga0335076_109376012 | 3300032955 | Soil | MVRKLTEEEQKATELVRQRMKESKSSELLGFDVESVHD
Ga0310914_105421801 | 3300033289 | Soil | MPRKMTAEVKRATELVRARMRESKSTELLGFDVESVH
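
Records like those listed above can be summarized with a few lines of code. The following is a minimal Python sketch that tallies habitats and computes the mean sequence length for four records copied from this page; the `FamilyMember` type and its field names are hypothetical, chosen only to mirror the four columns.

```python
from collections import Counter
from statistics import mean
from typing import NamedTuple


class FamilyMember(NamedTuple):
    """One family member; field names are illustrative, not a database schema."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Four records copied from the list above (a small subset, not the full family).
records = [
    FamilyMember("Ga0066679_109821671", "3300005176", "Soil",
                 "MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGRAIFRLDV"),
    FamilyMember("Ga0066652_1014435802", "3300006046", "Soil",
                 "MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESV"),
    FamilyMember("Ga0181533_11119932", "3300014152", "Bog",
                 "MSRKLSREEKKVTGLVRRRMRESKATELLGFDVES"),
    FamilyMember("Ga0134079_101133241", "3300014166", "Grasslands Soil",
                 "MTKKMSKEEKRVTELVRQRMKESRSTELLGFDVESVHAGR"),
]

# Count how many of the sampled members come from each habitat.
habitat_counts = Counter(member.habitat for member in records)

# Mean length in residues over the sampled members only.
mean_length = mean(len(member.sequence) for member in records)

print(habitat_counts.most_common())
print(f"mean length: {mean_length:.1f} residues")
```

Applied to all 116 members instead of this subset, the same tally would give the per-habitat counts behind the habitat shares reported for the family.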




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice