NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F097848


Overview

Basic Information
Family ID F097848
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 47 residues
Representative Sequence MTPCPLGLAGQCYCANQVSSFAAAEPAKTAVLLLAHGSPENPDQIP
Number of Associated Samples 97
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 95.19 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 91.35 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.32

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (logo rendered with Skylign; not reproduced)

Most Common Taxonomy
Group Unclassified (84.615 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland
(14.423 % of family members)
Environment Ontology (ENVO) Unclassified
(44.231 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.308 % of family members)
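
The percentages above are fractions of the 104 family members reported under Basic Information. As a minimal sketch (the function name and dictionary labels are illustrative, not part of NMPFamsDB), they can be converted back to whole-member counts:

```python
FAMILY_SIZE = 104  # "Number of Sequences" from Basic Information

def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members to a whole-number count."""
    return round(percent / 100 * family_size)

# Percentages as reported in this section; the labels are shorthand.
reported = {
    "Taxonomy: Unclassified": 84.615,
    "GOLD: Peatland": 14.423,
    "ENVO: Unclassified": 44.231,
    "EMPO: Soil (non-saline)": 42.308,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}
for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under this reading, the most common GOLD ecosystem covers 15 members, while the ENVO and EMPO categories cover 46 and 44 members respectively.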




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
Alignment viewer (interactive; not reproduced): 104 rows labelled by protein ID, E41_02082150 through Ga0370483_0045672_1227_1367. The same IDs are listed under Family Sequences.
Powered by MSAViewer



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 4.05%, β-sheet: 10.81%, Coil/Unstructured: 85.14%
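
A distribution like the one above is typically the share of positions assigned to each state. The sketch below shows that calculation on a made-up per-position string. The reported values are arithmetically consistent with 3, 8 and 63 of 74 positions, but the actual per-position assignment is not given on this page, so the example string and the H/E/C letter coding are assumptions.

```python
from collections import Counter

def ss_distribution(ss: str) -> dict[str, float]:
    """Return the percentage of positions in each secondary-structure state."""
    counts = Counter(ss)
    total = len(ss)
    return {state: round(100 * counts[state] / total, 2) for state in "HEC"}

# Hypothetical assignment: 3 helix (H), 8 strand (E) and 63 coil (C) positions.
example = "C" * 30 + "E" * 8 + "C" * 20 + "H" * 3 + "C" * 13

print(ss_distribution(example))
```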
Feature Viewer (interactive; not reproduced): tracks for Sequence, α-helices, β-strands, Coil and SS Conf. score along the representative sequence MTPCPLGLAGQCYCANQVSSFAAAEPAKTAVLLLAHGSPENPDQIP.
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.32
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
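
The two confidence measures in this section can be applied mechanically. The following sketch encodes the pTM cutoff of 0.7 quoted in the note and the four pLDDT legend bands. The function names are hypothetical, and treating each band's upper bound as inclusive for non-integer pLDDT values is an assumption.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the "Low Quality Model" note

# Upper bounds of the four pLDDT legend bands (assumed inclusive).
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]

def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff

def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value to its legend band."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie in [0, 100]")
    for upper, label in PLDDT_BANDS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable")

print(is_low_confidence(0.32))  # pTM-score reported for this family
print(plddt_band(64.2))         # arbitrary example value
```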



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy breakdown (pie chart values): All Organisms 15.4%, Unclassified 84.6%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Peatland; Bog Forest Soil; Bog; Freshwater Sediment; Watersheds; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Peatlands Soil; Arctic Peat Soil; Soil; Grass Soil; Untreated Peat Soil; Tropical Peatland; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Fen; Palsa; Rhizosphere; Host-Associated
Chart values in extraction order (pairing with labels not recoverable): 14.4%, 2.9%, 3.8%, 10.6%, 3.8%, 4.8%, 10.6%, 6.7%, 4.8%, 3.8%, 2.9%, 3.8%, 6.7%, 2.9%



Associated Samples


Geographical Distribution
Map (interactive; not reproduced). Powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E41_02082150 | 2170459005 | Grass Soil | MSACPLGLGGRCYCADRVSSFTAAPSRTAVLLLAHGTPESPEHVPEFLSYVTG
JGI12269J14319_101914751 | 3300001356 | Peatlands Soil | MTPCPLGPTSQCYCANHVSSFSAAAKSSGTSAKTTVLLLAHGSPENPDQIPEFLSYVTGG
JGI12269J14319_102491122 | 3300001356 | Peatlands Soil | MTPCPLGLAGRCYCANQVSSFAVAEPAKTAVLLLAHGSPENPDQIPEFLRYVTG
JGI12053J15887_106337281 | 3300001661 | Forest Soil | MTACPLGLAGQCYCANRVSSFVSSEGSAKTAVLLLAHGSPENPSQVSE
Ga0062389_1016099141 | 3300004092 | Bog Forest Soil | MKPCPLGQSGQCYCANQVSSFAASQPAKTAVLLLAHGSPESPDQI
Ga0066680_102427832 | 3300005174 | Soil | MTACPLGLAAQCYCANQVSSFTSSEMPAKTAVFLLAHGSPENPAQVPEFLG
Ga0066790_103290011 | 3300005995 | Soil | MTACPLGLAGQCYCANRVNSFAAAERAKTAVLLLAHGSPENPDQIPE
Ga0075029_1004122311 | 3300006052 | Watersheds | MVPCPLGQAAQCYCANRVSSFACAKSPAKTAVLLLAHGSPENPDQIPE
Ga0075014_1003806951 | 3300006174 | Watersheds | MTMCPLGRAEACYCANQVSAFVRAPAQSLSARIAVLLLAHGSPENPD
Ga0075021_109752862 | 3300006354 | Watersheds | MTPCPLGPARHCYCASRVTSFGGLPATAKQAVLLLAHGSPENSDQVPEFLNY
Ga0066660_112981441 | 3300006800 | Soil | MTACPLGLAGQCYCANQVSSFAASEAPAKTAVLLLAHGSPENPGQ
Ga0116225_11983043 | 3300009524 | Peatlands Soil | MSPCPLGLASQCYCANRVSAFAASERPAKTTVLLLAHGSPENPDQIP
Ga0116137_10846512 | 3300009549 | Peatland | MTICPLGLVGQCYCANRVSSFAASEQSSKLSANTEVLLLAH
Ga0116106_10703962 | 3300009645 | Peatland | MTPCPLGPASQCYCANQVSSFPASGPPANTAVLLLAHGSPENPDQIPEFL
Ga0116224_104022251 | 3300009683 | Peatlands Soil | MPHRVKDGVEKMTACPLGLAGQCYCANRVSAFASSAAQPAKTAVLLLAHGSPENPSQVPEFLS
Ga0116227_110789172 | 3300009709 | Host-Associated | MNRMTPCPLNLAGECYCANRVSSFGAGKLPAKTAVLLL
Ga0116131_11551752 | 3300009760 | Peatland | MTPCPLGLAGQCYCANRVRSFASSEPAKTAVLLLAHGSPENPGQ
Ga0116130_12168362 | 3300009762 | Peatland | MEPKMTACPLGLSRQCYCANHVSSFAAEANSSASPAKTAVLLLAHG
Ga0116134_11259192 | 3300009764 | Peatland | MTACPLGLASQCYCANQVSSFAASEQSTQLSAKTAVLLLAHGSPENTSQVPEFLSSV
Ga0116134_12765101 | 3300009764 | Peatland | MTPCPLGLASQCYCANRVSSFATPEQSSEQFAKTA
Ga0116223_102300273 | 3300009839 | Peatlands Soil | MTPCPLGLAGQCYCANRVSSFASAELPARAAVLLLAHGSPE
Ga0126373_110200622 | 3300010048 | Tropical Forest Soil | MTTCPLNLDDRCYCANHINTFEAAPSKTAVLLLAHGTPESPAQIPE
Ga0134062_102334992 | 3300010337 | Grasslands Soil | MSACPLGFGNRCYCANQDSSFAAVSSKTAVLLLAHGTP
Ga0074044_104808501 | 3300010343 | Bog Forest Soil | MTPCPLGQSGQCYCANRINSFAASAPAKTAVLLLAHGSPENPSQVPEFLGYVTGG
Ga0126379_135338422 | 3300010366 | Tropical Forest Soil | MSGCPLDLGGHCYCANQVSWFAGEESKTAVLLLAHGSPEN
Ga0137365_110790182 | 3300012201 | Vadose Zone Soil | MSACPLELGGRCYCASHVISFAAAPSQTAVLLLAHGTPENPDQIP
Ga0137399_106834362 | 3300012203 | Vadose Zone Soil | MTACPLGLDAQCYCANQVSSFTSSEAPARTAVLLLAHGSPENPGQVPEFLGYV
Ga0137399_112366901 | 3300012203 | Vadose Zone Soil | MTACPLAFAGQCYCANQVSWFAASEAPAKTAALLLAHGSP
Ga0137394_103515442 | 3300012922 | Vadose Zone Soil | MSTEGCPLRLGAECYCANRVSAFATAHSSAAVLLLAHGTPENRDQIPEY
Ga0137359_110722541 | 3300012923 | Vadose Zone Soil | MSACPLGLGGRCYCANHVVSFAGAPSKTAVLLLAHGTPEDPEQVPEYLPY
Ga0137413_107444962 | 3300012924 | Vadose Zone Soil | MTPCPLGLAGQCYCANQKNSFASKQPAKTAVLLLAHGSPENPGQISEFLSYVTGG
Ga0137404_102769811 | 3300012929 | Vadose Zone Soil | MSACPLGLGGRCYCANHVVSFAGAPSKTAVLLLAHGTPENPEQ
Ga0137404_113492242 | 3300012929 | Vadose Zone Soil | MSACPLGFGNRCYCANQDSSFAAASSKTAVLLLAHGTPD
Ga0137410_118121091 | 3300012944 | Vadose Zone Soil | MSACPLGFGNRCYCANQDSSFAAASSKTAVLLLAHGTPDT
Ga0181521_106173362 | 3300014158 | Bog | MTICPLGLVGQCYCANRVSSFAASEQSSKLSAKTAVLLLAHGSPE
Ga0181530_103793852 | 3300014159 | Bog | MTPCPLGPASQCYCANRASSFAAAEPAKAAVLLLAHGSPENPDQIPEFL
Ga0181538_100607714 | 3300014162 | Bog | MTPCPLGPASQCYCANQVSSFPASGPPANTAVLLLAHGSPENPDQIPEFLRYVT
Ga0187849_10662053 | 3300017929 | Peatland | MTACPLGLAGQCYCANQVSSFAAEANSSASPAKAAVLLLAHGSPENPDQVPEFLSYVTG
Ga0187877_11726602 | 3300017931 | Peatland | MTPCPLGLAGQCYCANRVSSFAPSALPAKTAALLLAHGSPENPGQVSEFLSYVTG
Ga0187801_101180362 | 3300017933 | Freshwater Sediment | MTPCPLGLAGRCYCANRVSSFAVTPAKTAVLLLAHG
Ga0187809_102709321 | 3300017937 | Freshwater Sediment | MTACPLGLGSACYCRNHVSSFLPTEPSARDAKTAVLLLAHGSPENPDQVPEFLRYVTG
Ga0187853_100308343 | 3300017940 | Peatland | MTACPLGLAGQCYCANQVSAFAASELPAKTAVLLLAHGSPENP
Ga0187850_102743852 | 3300017941 | Peatland | MSECPLGLGSACYCANQVSSFWVAPASLAGPKTAVLLLAHGSPENPDQ
Ga0187780_110537952 | 3300017973 | Tropical Peatland | MTCPLGLGGQCYCANKISFFAAAPTKTAVLLLAHGSPENPDQIPEFLRHVTG
Ga0187777_111391851 | 3300017974 | Tropical Peatland | MTPCPLGLGGACYCANHLSSFPVAAAPSAGAKTAVLLLAHGSPENPDQIPEFL
Ga0181520_102177611 | 3300017988 | Bog | MTPCPLGLVGQCYCSSHASSFACSQPKTKTAVLLLAHGSPENPDQTPEFLRY
Ga0187816_100844552 | 3300017995 | Freshwater Sediment | MTDCPLGQPCQCYCAHGVTSFAAAGPSQKTAVLLLAHGSPENPSQVPEFLSYVT
Ga0187870_12001661 | 3300017998 | Peatland | MTACPLGFASQCYCANRVSSFAASPAKAAVLLLAHGSPENPDQVPE
Ga0187815_105228802 | 3300018001 | Freshwater Sediment | MTPCPLGLGGQCYCANQVSSFAVSPSKTAVLLLAHGSPENPGQVPEFLGYVT
Ga0187888_11160423 | 3300018008 | Peatland | MTACPLSLASQCYCANRVSAFAASARPAKTDVLLLAHGSPEN
Ga0187864_103791281 | 3300018022 | Peatland | MSPCPLGLASQCYCANRVSAFAASEHSAKTAVLLLAHGSPENPDQVPEFLSYVTG
Ga0187881_101626592 | 3300018024 | Peatland | MTPCPLGQAGQCYCANRVSSFAASERSAKTAVLLLAHGSPENPDQIPEFLKYVT
Ga0187857_101195022 | 3300018026 | Peatland | MTPCPLGQAGQCYCANRVSSFAASERSAKTAVLLLAHGSPENPDQIPEFLK
Ga0187867_107730871 | 3300018033 | Peatland | MTPCPLGLAGQCYCANQASSFCAAPAKTAVLLLAHGSPENPDQIPEF
Ga0187862_104188642 | 3300018040 | Peatland | MTPCPLGLAGQCYCANQASSFCAAPAKTAVLLLAHGSPEKPDQIP
Ga0187871_107219702 | 3300018042 | Peatland | MTPCPLGLAGQCYCANRVNSFAAAPAKTAVLLLAHGSPETPD
Ga0187890_102568972 | 3300018044 | Peatland | MTPCPLGRDAKCYCANQVSSFASAKPAKIAVLLLAHG
Ga0187858_103185132 | 3300018057 | Peatland | MTPCPLGLAGQCYCANRVSSFSGSPAKTAVLLLAHGSPENPDQTPE
Ga0187858_108986541 | 3300018057 | Peatland | MTPCPLGLAGRCYCANQVSAFAASELPAKTAVLLLA
Ga0187784_113284012 | 3300018062 | Tropical Peatland | MTPCPLGLAGECYCANQINSFASAPAKTAVLLLAHGSPENPDQI
Ga0187769_114296302 | 3300018086 | Tropical Peatland | MTPCPLGLTSECYCANRVSSFAAPDQSSERAARTAVLLLAHGSPETPGQIPEFLSYV
Ga0182031_13164301 | 3300019787 | Bog | PCPLGLAGQCYCANRVNSFAAAPAKTAVLLLAHGSPETPDQTPEFLRYVTEAAHSHRK
Ga0210395_101136161 | 3300020582 | Soil | MTACPLGLASQCYCANKISSFADRNRAKTAVLLLAHGSPETPDHIPEFLKYVTG
Ga0210404_102558711 | 3300021088 | Soil | MTACPLGLAGQCYCANQVSSFAASEEPARTAVLLLAHGSPENPSQVP
Ga0210405_102899022 | 3300021171 | Soil | MTACPLGLAGQCYCANQVSSFAASEEPARTAVLLLAHGSPENPSQVPEFLGY
Ga0210383_109399341 | 3300021407 | Soil | MTACPLGLAGECYCAKQISSFGVSNPPAKTAVLLVAHGSPENPDQIPEFLGYVT
Ga0224550_10245061 | 3300022873 | Soil | MTNCPLGLAGQCYCANHVSSFPAQPASATSAKTAVLLLAHGCPENPGQIPEFL
Ga0208038_10005381 | 3300025446 | Peatland | MTICPLGLVGQCYCANRVSSFAASEQSSKLSAKTAVLLLAH
Ga0208688_10537711 | 3300025480 | Peatland | MTPCPLGLAGQCYCANRVSSFAPSALPAKTAVLLLAHGSPENPGQVSEFLSYVTGGRPL
Ga0208688_10618212 | 3300025480 | Peatland | MTPCPLGLAGQCYCANQVSSFAAAEPAKTAVLLLAHGSPENPDQIP
Ga0208188_10580892 | 3300025507 | Peatland | MTACPLGLAGQCYCANQVSAFAASELPAKTAVLLLAHGSPENPS
Ga0208188_11208461 | 3300025507 | Peatland | MTPCPLGQAGQCYCANRVSSFAASERSAKTAVLLLAHGSPENPDQIPEFLKY
Ga0208714_11060492 | 3300025527 | Arctic Peat Soil | MTPCPLGQAGQCYCAKQVSSFAVPSAKTAVLLLAHGSPENPDQVPEFLSYVTG
Ga0207664_105561732 | 3300025929 | Agricultural Soil | MTPCPLGLAGQCYCANQKNSFASKQPAKTAVLLLAHGSPENPGQIPEFLSYVT
Ga0207665_112296321 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MSACPLGLGGRCYCANHLSSFAPAPSKTAVLLLAH
Ga0209839_102080131 | 3300026294 | Soil | MTACPLGLAGQCYCANRVNSFAAAERAKTAVLLLAHGSPENPDQIPEF
Ga0257150_10164992 | 3300026356 | Soil | MTACPLGLAAQCYCANQVSSFAASEAPAKTAVLLL
Ga0209117_10820761 | 3300027645 | Forest Soil | MTACPLALSGRCYCANHVSSFAASEQSSDLSAKTAVLMLAHGSPENPSQVPEFLNNV
Ga0209530_11271261 | 3300027692 | Forest Soil | MTVTPCPLGLAGQCYCANRVNSFTAPKLPAKDAVLLLAHGS
Ga0209248_100461431 | 3300027729 | Bog Forest Soil | MMACPLALANRCYCANQVSSFSTLEQASGTSAKTAVLLLAHGSPENTDQIPE
Ga0209448_100964691 | 3300027783 | Bog Forest Soil | MTPCPLRLDSQCYCGNRVSSFAAAEPRKTAVLLLAHGSPENPDQI
Ga0209180_105349162 | 3300027846 | Vadose Zone Soil | MSTCPLGLGGRCYCANHVSSFVAAPSKTAVLLLAHGSPEN
Ga0209517_103379923 | 3300027854 | Peatlands Soil | MSPCPLGLASQCYCANRVSAFAASERPAKTTVLLLAHGSPENPDQIPEFLSYVTG
Ga0209068_102390271 | 3300027894 | Watersheds | MTNCPLGLAEKCYCANSVSAFSASGPAKTAILLLAHGSPENPDQVPEFLRYVTG
Ga0209067_104949411 | 3300027898 | Watersheds | MNPCPLGLGGQCYCANRVSSFAASPSKTAVLLLAHGSPENPGQVPEFLGYV
Ga0209488_108220531 | 3300027903 | Vadose Zone Soil | MTACPLGLAGQCYCANRVSSFAASEEPARTAVLLL
Ga0302171_100063511 | 3300028651 | Fen | MRNCPLGPIAQCYCANQVSSFSIAASKTAVLLLAHGTPESPSQIPEYLG
Ga0302160_101240942 | 3300028665 | Fen | MTSCPLGLAGTCYCANSVNSFVRSSAVPEAKKTAVLLLAHGTPENPDQIPEYLRYVTG
Ga0302202_101355961 | 3300028762 | Bog | MTDCPLNLAGECYCANRVSSFGAGKLPGKTAVLLLAHGSPDNPEQVPEFLN
Ga0302222_102744992 | 3300028798 | Palsa | MTACPLGLASQCYCENKVSSFGSAKSPAKTAVLLLAHGSPENP
Ga0302257_10941571 | 3300028855 | Fen | MRNCPLGPIAQCYCANQVSSFSIAASKTAVLLLAHGTPE
Ga0302291_102462411 | 3300028865 | Fen | MRNCPLGPIAQCYCANQVSSFSIAASKTAVLLLAHGTPESPSQIPEYLGYVTGG
Ga0311327_109222331 | 3300029883 | Bog | MTPCPLGLAGQCYCANQASSFSAEPAKANAAVLLLAHGSPENP
Ga0311338_101264291 | 3300030007 | Palsa | MTACPLGLAARCYCANRINSFASEKLPAQTAVLLLAHGSPENPGQIPEFLSYVT
Ga0302306_101237962 | 3300030043 | Palsa | MTNCPLGLAGQCYCANHVSSFPAQPASATSAKTAVLLLAHGSPENPGQIPEFLSYVTG
Ga0302191_102353442 | 3300030049 | Bog | MTDCPLNLAGECYCANRVSSFGAGKLPGKTAVLLL
Ga0311353_101647773 | 3300030399 | Palsa | MTACPLGLASQCYCENKVSSFGSAKSPAKTAVLLLAHGSPENPD
Ga0311372_122007612 | 3300030520 | Palsa | MTACPLGLASQCYCENKVSSFGSAKSPAKTAVLLLAN
Ga0311357_105152272 | 3300030524 | Palsa | MRLQALRGTEMTACPLGLASQCYCENKVSSFGSAKSPAKTAVLLLAHGSPENPDQIPEFL
Ga0311354_103510351 | 3300030618 | Palsa | MTACPLGLASQCYCENKVSSFGSAKSPAKAAVLLLAHGSPENPD
Ga0310039_103789212 | 3300030706 | Peatlands Soil | MTACPLGLAGQCYCANRVSAFASSAAQPAKTAVLLLAHGSPEN
Ga0265316_100401391 | 3300031344 | Rhizosphere | MTACPLGLASQCYCANQVSSFAAAGQSSQLSAKTAVLLLAHGSP
Ga0334804_076119_781_906 | 3300033818 | Soil | MTPCPLGLAGQCYCANQASSFSAEPAKANAAVLLLAHGSPEN
Ga0370483_0045672_1227_1367 | 3300034124 | Untreated Peat Soil | MTPCPLSLAGECYCANRVSSFGAGKLPAKTAVLLLAHGSPDNPEQVP
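
Rows from this table can be summarized by habitat and sequence length. The sketch below uses three rows copied from the table. The variable names are illustrative, and the split of each row into protein ID, sample taxon ID, habitat and sequence is an assumed reading of the listing.

```python
from collections import Counter
from statistics import mean

# (protein ID, sample taxon ID, habitat, sequence), copied from three rows.
rows = [
    ("Ga0208688_10618212", "3300025480", "Peatland",
     "MTPCPLGLAGQCYCANQVSSFAAAEPAKTAVLLLAHGSPENPDQIP"),
    ("Ga0116134_12765101", "3300009764", "Peatland",
     "MTPCPLGLASQCYCANRVSSFATPEQSSEQFAKTA"),
    ("Ga0137365_110790182", "3300012201", "Vadose Zone Soil",
     "MSACPLELGGRCYCASHVISFAAAPSQTAVLLLAHGTPENPDQIP"),
]

# Number of sequences per habitat.
habitat_counts = Counter(habitat for _, _, habitat, _ in rows)

# Length in residues of each sequence, keyed by protein ID.
lengths = {pid: len(seq) for pid, _, _, seq in rows}
mean_length = mean(lengths.values())

print(habitat_counts.most_common())
print(lengths, mean_length)
```

Running the same tally over all 104 rows would give the per-habitat counts for the whole family and allow a comparison with the average sequence length reported under Basic Information.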




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice