NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F091148


Overview

Basic Information
Family ID F091148
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 48 residues
Representative Sequence MTTKLDVRISRRCFLASASAATTVALLAPRKLFAQDDGLVQTARRT
Number of Associated Samples 78
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 97.17 %
% of genes near scaffold ends (potentially truncated) 98.13 %
% of genes from short scaffolds (< 2000 bps) 88.79 %
Associated GOLD sequencing projects 72
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (sequence logo, rendered with Skylign)

Most Common Taxonomy
Group Bacteria (52.336 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (39.252 % of family members)
Environment Ontology (ENVO) Unclassified (48.598 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (71.028 % of family members)
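
The percentages above can be converted back into member counts. The sketch below assumes that each percentage is taken over all 107 family members, which the page does not state explicitly. The function name and the dictionary labels are illustrative and are not part of NMPFamsDB.

```python
# Number of sequences reported for family F091148.
FAMILY_SIZE = 107

# Percentages as reported on the page; the keys are illustrative labels only.
reported_percent = {
    "NCBI: Bacteria": 52.336,
    "GOLD: Forest Soil -> Soil": 39.252,
    "ENVO: Unclassified": 48.598,
    "EMPO: Soil (non-saline)": 71.028,
}


def members_from_percent(percent, total=FAMILY_SIZE):
    """Convert a reported percentage into a whole number of family members.

    Assumes the percentage was computed over `total` members.
    """
    return round(percent / 100 * total)


counts = {label: members_from_percent(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} members")
```

Under that assumption, each reported percentage lands on a whole number of members, which is a quick consistency check on the figures.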




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive MSAViewer display; the 107 member IDs are listed under Family Sequences).


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 43.24%, β-sheet: 0.00%, Coil/Unstructured: 56.76%

Predicted 3D Structure

pTM-score: 0.40

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
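
The rule in the note above can be written as a one-line check. This is a minimal sketch: the 0.7 cutoff comes from the note itself, while the function and constant names are hypothetical and do not reflect how NMPFamsDB implements the check.

```python
# Cutoff quoted in the "Low Quality Model" note (pTM < 0.7).
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm, threshold=LOW_CONFIDENCE_PTM):
    """Return True when a pTM-score falls below the low-confidence cutoff."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return ptm < threshold


# Family F091148 reports a pTM-score of 0.40.
print(is_low_confidence(0.40))
```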



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic distribution chart (legend, in displayed order: All Organisms, Unclassified; values as displayed: 58.9%, 41.1%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat distribution chart (legend, in displayed order: Vadose Zone Soil, Tropical Forest Soil, Bulk Soil, Soil, Soil, Soil, Hardwood Forest Soil, Tropical Peatland, Tropical Forest Soil, Forest Soil, Soil, Avena Fatua Rhizosphere; values as displayed: 4.7%, 9.3%, 39.3%, 29.0%, 2.8%, 2.8%, 4.7%, 3.7%)



Associated Samples


Geographical Distribution




Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
Ga0066690_108480462	3300005177	Soil	MNTKFNPSISRHRFLVSTSMATSIAFLAPRHLLAQDEGLVQTARKTAAAAIVTVQKLR
Ga0126384_118861121	3300010046	Tropical Forest Soil	MTTKLDVRISRRCFLASAGATTTVALVMPRKLFGQDEGL
Ga0126373_113304212	3300010048	Tropical Forest Soil	MTTKLDVRISRRSFLASATSAFAVALMAPRQLFAKNDGLVQTARRTAAASNLTVQKLR
Ga0126373_120237462	3300010048	Tropical Forest Soil	MSTKLDVGISRRSFLASASAAAAVAFVAPRRLFAQDDGLVQTA
Ga0126370_124610612	3300010358	Tropical Forest Soil	MTSILDAPNSRRSFLASTIATTTVALMAPRQLFSQNDGLVQTARRTAAAA
Ga0126376_130973501	3300010359	Tropical Forest Soil	MTTILDVRISRRSFLASTIATTTVALMAPRQLLAQNDGLVQTARKTAAAATVTVQK
Ga0126372_111585052	3300010360	Tropical Forest Soil	MDTKLDEISRRCFLVSATATTTVALLAPRKLFAQGDGLVQTARKSA
Ga0126378_131731081	3300010361	Tropical Forest Soil	MSAELDVGISRRSFLASASSAAAVAFVAPRRFFAQDDGLVQTARRTASTDTVT
Ga0126379_117559222	3300010366	Tropical Forest Soil	MSRELAARVSRRNFVASASAATTMAWLAPRKVFAQEEGLV
Ga0126381_1006270723	3300010376	Tropical Forest Soil	MTTKLDLRISRRCFLASTSAATTVALLAPRNKLFAQDDGLVQTARKTAAGATVTVQK
Ga0126381_1048387231	3300010376	Tropical Forest Soil	MTTELGVRISRRCFLASASAATTVALLAQRKLFALDDGLVQTA*
Ga0150983_121259421	3300011120	Forest Soil	MSARLDSGVSRRRFLICTSMAATVGVLAPCDLFAQDDGLVQTARKTAAAATIT
Ga0150983_144202411	3300011120	Forest Soil	MGAKFESGISRRGFLISTSMVAAVGVLYPRDLLAQDDGLVQTARKTAAAATITVQKLRGNVS
Ga0150983_149777302	3300011120	Forest Soil	METKLDPSISRRRFLVSTSMATTVALLAPRHMFAQDEGLVQTARKTAAA
Ga0137392_100086351	3300011269	Vadose Zone Soil	MDTKLDSGISRRRFLASTSLVATVALLAPRDLFAQDDGLV
Ga0137393_100851191	3300011271	Vadose Zone Soil	MSANLDCGISRRRFLLSTSMAATVGLLAPRDLFAQDDGLVQTARKTAAAAT
Ga0150985_1085783281	3300012212	Avena Fatua Rhizosphere	MSKTLDFGIPRRRFLISTGMAASVGLLAPRDLFAQDNGLVQKARKTAATATVTVQ
Ga0137394_104508992	3300012922	Vadose Zone Soil	MNTQLEPTISRRRLLRSTSLAATVGLLAPRHLLAQDDGLV
Ga0182036_106253091	3300016270	Soil	MTTKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGL
Ga0182036_116877871	3300016270	Soil	MFTKPHTGISRRRFLVSTSMVTTVALVAPRRVFAQDQGLVQAARK
Ga0182036_118562011	3300016270	Soil	MSTELDVRISRRNFLASASAATTMAWLAPSKLFAQEDGLVQTARTTA
Ga0182036_118717391	3300016270	Soil	MNTKTDPGISRRRFLVSTSMATTGVLLAPSYVFAQGEGLVQTARKTAGAATITAQKLRGK
Ga0182041_104173391	3300016294	Soil	MSTKLDVGISRRSFLATASAAAAVAFVAPRRLFAQDDGLVQTARRTASADKVT
Ga0182033_100705481	3300016319	Soil	MNSKPDRGISRRRFLASASMATSIALLAPRHLVTQNEGL
Ga0182033_116251302	3300016319	Soil	MSTELAVRVSRRNFLASASAATTIAWLAPRKLFAQEEGLVQTARKTAATDKITIQKLRGN
Ga0182035_106107861	3300016341	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQTA
Ga0182032_114496302	3300016357	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQT
Ga0182032_120627711	3300016357	Soil	MNSKPDRGISRRRFLASASMATSIALLAPRHLVTQNEGLVQTARKTA
Ga0182034_110092891	3300016371	Soil	MSTKLDVGISRRSFLATASAAAAVAFVAPRRLFAQDDGLV
Ga0182040_104798743	3300016387	Soil	MSTKLDVGISRRSFLATASAAAAVAFVAPRRLFAQDDGLVQTARRTASADKVTV
Ga0182040_119070832	3300016387	Soil	MTTKLDVRISRRCFLASASAATTVALLAPRKLFAQDDGLVQTARRT
Ga0182037_119654961	3300016404	Soil	MTTKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQTA
Ga0182039_116380921	3300016422	Soil	MSTELAVRVSRRNFLASASAATTIAWLAPRKLFAQEEGLVQTARKT
Ga0182039_120623252	3300016422	Soil	MNTTLDVRISRRSLLASTIATTTVALVAPRQLFAQNDGLVQTARKTASAATITAQTLRVNISVLMGAGG
Ga0182039_121701202	3300016422	Soil	MSAKLDVRISRRFFLASASAAAAVGFVAPRRLFAQDDGLVQTARRTASTDTVTVQ
Ga0182038_119599142	3300016445	Soil	MSTELAVRISRRNFLASASAATTITWLAPSKLFAQEDGLVQTARRTAATDKITIRKLRG
Ga0187778_108241231	3300017961	Tropical Peatland	MSTKPHTGISRRRFLASTGMVTTVALVAPRQVFAQDQGLVQTARKTAASDKV
Ga0187777_108323632	3300017974	Tropical Peatland	MSTKLHTGISRRRFLLSTSMVTAAALVAPRHVLAQDQGLVQTARKTAA
Ga0187766_104600541	3300018058	Tropical Peatland	MTTSLDTRVSRRCFVASASAATAMAWLAPRRLFAQGDGLVQTARRTAAADKVT
Ga0179590_10534673	3300020140	Vadose Zone Soil	MSTRLDSGSSRRRFLISTSMAATVGLLAPRDLFAQDDG
Ga0179592_104674581	3300020199	Vadose Zone Soil	MSAKLDSGISRRRFLISTSMAATVGLLAPRDLFAQGDGLVQTARK
Ga0210407_106471471	3300020579	Soil	MSTTLDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTARKTAAAGTITVQKLR
Ga0210403_109919972	3300020580	Soil	MSTTLDSGISRRRFLISTSMAATVALLAPRDLFAQDDGLVQTARKTA
Ga0210399_100960274	3300020581	Soil	MYTKPGSNISRRRFLTSTSVATTVALVAPCHVFAQDE
Ga0210395_112926622	3300020582	Soil	MNPKLNADISRRRFLVSTSMATTVALLAPRHMFAQDEGLVQTARK
Ga0210401_101651751	3300020583	Soil	VSVKLDSDISRRRFLISTSMAATVGLLAPRALFAQDEGLVQTARKTA
Ga0210404_101687782	3300021088	Soil	MNPKLNADISRRRFLVSTSMATTVALLAPRHMFAQDEGLVQTARKTA
Ga0210404_103448702	3300021088	Soil	MSAKLDSGISRRRFLISSNMAATVGVLAPRDLFAQDDGL
Ga0210400_101317281	3300021170	Soil	MSANLDSGISRRRFLLSTSMAATVGVLAPRDLFAQEDGLVQTARRTAAA
Ga0210400_112552982	3300021170	Soil	MSAKLDSGISRRRFLISTSMAATAGLLAPRDLFAQ
Ga0210408_110537652	3300021178	Soil	MSAKLDSGISRRGFLISTSMAATVGLLAPRDLFAQDDGLVQTARKTAAAA
Ga0210396_115901041	3300021180	Soil	MSTTLDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTA
Ga0213877_102125611	3300021372	Bulk Soil	MSQKLDVGISRRSFLASASAAAAVAFVAPPRLFAQDDGLVQTARRTAATDKIT
Ga0210394_100405505	3300021420	Soil	MCPRLDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTARKTAAAA
Ga0210394_103704271	3300021420	Soil	VGAKVDSGVSRRRFLVSTSVAAAVGLLARRDLFAQDDGLVQTARK
Ga0210394_106964332	3300021420	Soil	MSTKSDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTARKTA
Ga0210394_110221232	3300021420	Soil	MSTKLDSGISRRRFLISTSMAATVGVLAPHDLFAQDDGLVQTARKTAAA
Ga0210390_105131051	3300021474	Soil	MTTELDNRISRRRFLASTSLATAVGLLVPRALFAG
Ga0210402_106337712	3300021478	Soil	MNTKDSAVSRRRFLVSASIATTAVILAPRRIFAQGDGLVQTARKTAANSTITVQKLRGSV
Ga0210409_116948342	3300021559	Soil	MRTKSDSGISRRRFLISTSKAATVGLLAPRDLFAQDDGLVQTARKTAA
Ga0222728_11262921	3300022508	Soil	MSAGLDSGISRRRFLISTSIAATVGVLAPPDLYAQDDGLVQTARRTAAAATIS
Ga0242665_102524222	3300022724	Soil	MSAKLDSGISRRGFLISTSMAATVGLLAPRDLFAQDDGLVQTAR
Ga0207745_10186851	3300026889	Tropical Forest Soil	MTTKLDVRISRRWFLTSASAATTVALLAPRKLFAQDEGLVQTARRTAAASTITVQ
Ga0207852_10304681	3300026959	Tropical Forest Soil	MTTKLDVRISRRWFLTSASAATTVALLAPRKLFAQDEGLVQTARRTAAASTI
Ga0207817_10043961	3300026979	Tropical Forest Soil	MTTKLDVRISRRWFLTSASAATTVALLAPRKLFAQDEGLVQTARRTAAASTITVQK
Ga0208369_10334491	3300026998	Forest Soil	MSTKLDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTA
Ga0209684_10572522	3300027527	Tropical Forest Soil	MNASLDRGVSRRRFLAATSAAAGVAFLAPRQLFAQDTGLVATARRTAAAGNVTV
Ga0209799_10397652	3300027654	Tropical Forest Soil	MSTELAVRISRRNFLASASAASTMAWLAPSDLFSQGDGLVQTARRTAATDKI
Ga0308309_110360561	3300028906	Soil	MSTKSDSGISRRRFLISTSMAATVGLLAPRDLFAQDDGLVQTARKTAAAATI
Ga0222749_102381022	3300029636	Soil	MSTKSDSGISRRQFLISTSMAATVGLLAPRDLFAQDDGLVQ
Ga0318573_107449191	3300031564	Soil	MTTKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQT
Ga0310915_109314881	3300031573	Soil	MSTELAARVSRRNFLASASAATTMAWLAPRKLFAQEEGLVQTARKTAATD
Ga0318542_103402041	3300031668	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQ
Ga0318561_106639531	3300031679	Soil	MSAKLDVGISRRSFLASASAAAAVAFVAPRRLFAQDDGLVQTARRTASTDTVTVQK
Ga0306917_102023291	3300031719	Soil	MNSKPDRGISRRRFLASASMATSIALLAPRHLVTQNEGLVQTARKTAGAATIT
Ga0306917_104394291	3300031719	Soil	MSTKLDVGISRRSFLASASAAAAVAFVAPRRLFAQDDGLV
Ga0306917_108118431	3300031719	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAE
Ga0306917_110770963	3300031719	Soil	MNTKTDPGISRRRFLVSTSMATTGVLLAPSYVFAQGEGLVQTARKTAGAATIT
Ga0306917_110934272	3300031719	Soil	MSTELAVRVSRRNFLASASAATTIAWLAPRKLFAQEEGLV
Ga0306918_100220811	3300031744	Soil	MTTTLDVRISRRSFLASTIATTSVALMAPRQLYAQNDGL
Ga0306918_113843061	3300031744	Soil	MTTKLDDRISRRCFLASASAATTVALLAPRKLFAQNDGLVQTARR
Ga0318502_102218161	3300031747	Soil	MTTTLDVRISRRSFLASTIATTSVALMAPRQLYAQNDGLVQTARKTAAAA
Ga0307475_101581581	3300031754	Hardwood Forest Soil	MSAELDFGIPRRRFLMSTSMAATVGLLAPRDLFAQDDGLVQTAR
Ga0307478_115636851	3300031823	Hardwood Forest Soil	MSAKLDSGISRRRFLISTSMAATVGVLAPRDLFAQDDGLVQTARKTAASATITVQKLR
Ga0310917_100053357	3300031833	Soil	MTTKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQTARR
Ga0310917_108918832	3300031833	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQTARR
Ga0318495_103831801	3300031860	Soil	MTTILDVRISRRSFLASTIATTTVALMAPRQLFAQNDGLVQTARKAAAAATVT
Ga0306919_104574191	3300031879	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAEDD
Ga0306919_112698602	3300031879	Soil	MSTELAARVSRRNFLASASAATTMAWLAPRKLFAQEEGLVQTARKTAATDKI
Ga0306925_103034841	3300031890	Soil	MSTKLDVGISRRSFLASASAAAAVAFVAPRRLFAQDDGLVQTARRTASTDTVT
Ga0306925_118891202	3300031890	Soil	MSTELDVRISRRNFLASASAATTMAWLAPSKLFAQED
Ga0310912_107099583	3300031941	Soil	MNTKTDPGISRRRFLVSTSMATTGVLLAPSYVFAQGEGLVQTARKTAGAATITAQKL
Ga0310913_109207222	3300031945	Soil	MTMKLDVRVSRRCFLASASGATALAFVAPLKLFAED
Ga0310910_100694251	3300031946	Soil	MTTKLDDRISRRCFLASASAATTVALLAPRKLFAQNDGLVQ
Ga0310910_101416101	3300031946	Soil	MNTTLDVRISRRSLLASTIATTTVALVAPRQLFAQNDGLVQTARKTAAAAT
Ga0310910_101480123	3300031946	Soil	MNSKPDRGISRRRFLASASMATSIALLAPRHLVTQNEGLVQTARKTAGAATITVQKL
Ga0310909_103709572	3300031947	Soil	MSTKLDVGISRRSFLASASAAAAVAFVAPRRLFAQDDGLVQKARRTASTDKVTVQK
Ga0310909_104523911	3300031947	Soil	MSTELAVRVSRRNFLASASAATTIAWLAPRKLFAQEEGLVQTARKTAATD
Ga0310909_106142553	3300031947	Soil	MNARLEVEICRRSFLASASAAAAVALVAPRRLFAQNDGLVQTARRTASTDTVTVQKL
Ga0306926_103416732	3300031954	Soil	MSAKLDVGISRRSFLASASAAAAVAFVAPRRLFAQ
Ga0307479_105839552	3300031962	Hardwood Forest Soil	MSAGLDSGVSRRRFLICTSMAATVGVLAPRDLFAQDDGLVQTARK
Ga0318531_103106092	3300031981	Soil	MTTKLDVRVSRRCFLASASGATALAFVAPLKLFAEDDGLVQTARRTAVASTVTVQK
Ga0318575_105082061	3300032055	Soil	MTTILDVRISRRSFLASTIATTTVALTAPRQLFAQNDGLVQTARKTAAAATITAQKLRGN
Ga0318533_112157652	3300032059	Soil	MSAELAARTSRRNFLASASAATTIAWLAPSKLFAQEVGLV
Ga0318540_102872351	3300032094	Soil	MTTTLDVRISRRSFLASTIATTSVALMAPRQLYAQNDGLVQTARKTAAAATVTAQKLRDNIS
Ga0306920_1001666651	3300032261	Soil	MNARLEVEICRRSFLASASAAAAVALVAPRRLFAQNDGLVQTARRTAS
Ga0310914_105656552	3300033289	Soil	MNSKPDRGISRRRFLASASMATSIALLAPRHLVTQNEGLVQTARKTAGAATI
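
Once the four fields of each record (protein ID, sample taxon ID, habitat, sequence) are split out, per-habitat counts and sequence lengths are easy to tabulate. The sketch below uses five records copied from the listing above. The function names are illustrative, and treating a trailing "*" as a stop marker that does not count toward length is an assumption, not something the page states.

```python
from collections import Counter
from statistics import mean

# (protein_id, sample_taxon_id, habitat, sequence): five records from the listing.
records = [
    ("Ga0066690_108480462", "3300005177", "Soil",
     "MNTKFNPSISRHRFLVSTSMATSIAFLAPRHLLAQDEGLVQTARKTAAAAIVTVQKLR"),
    ("Ga0126384_118861121", "3300010046", "Tropical Forest Soil",
     "MTTKLDVRISRRCFLASAGATTTVALVMPRKLFGQDEGL"),
    ("Ga0150983_121259421", "3300011120", "Forest Soil",
     "MSARLDSGVSRRRFLICTSMAATVGVLAPCDLFAQDDGLVQTARKTAAAATIT"),
    ("Ga0137392_100086351", "3300011269", "Vadose Zone Soil",
     "MDTKLDSGISRRRFLASTSLVATVALLAPRDLFAQDDGLV"),
    ("Ga0182040_119070832", "3300016387", "Soil",
     "MTTKLDVRISRRCFLASASAATTVALLAPRKLFAQDDGLVQTARRT"),
]


def residue_count(sequence):
    """Length in residues; a trailing '*' is assumed to be a stop marker."""
    return len(sequence.rstrip("*"))


def habitat_counts(rows):
    """Count how many records come from each habitat."""
    return Counter(habitat for _, _, habitat, _ in rows)


def mean_length(rows):
    """Average sequence length in residues across the given records."""
    return mean(residue_count(seq) for _, _, _, seq in rows)


for habitat, n in habitat_counts(records).most_common():
    print(f"{habitat}: {n}")
print(f"mean length: {mean_length(records):.1f} residues")
```

Run over all 107 records rather than this five-row sample, the same functions would give the family-wide habitat breakdown and average length.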




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice