NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099398


Overview

Basic Information
Family ID F099398
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 44 residues
Representative Sequence MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRS
Number of Associated Samples 98
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.05 %
% of genes near scaffold ends (potentially truncated) 98.06 %
% of genes from short scaffolds (< 2000 bps) 95.15 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.37
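The percentages above can be read back as gene counts. The following is a minimal sketch, assuming each percentage is taken over the 103 family sequences listed under Basic Information (the page does not state the denominator explicitly); the function name `implied_count` is hypothetical.

```python
N_SEQUENCES = 103  # "Number of Sequences" from Basic Information


def implied_count(percent: float, total: int = N_SEQUENCES) -> int:
    """Return the nearest whole-number count that `percent` of `total` represents.

    Assumption: the percentage was computed over `total` items.
    """
    return round(percent / 100 * total)


# Percentages as reported in the Quality Assessment block.
reported = [
    ("genes with valid RBS motifs", 65.05),
    ("genes near scaffold ends", 98.06),
    ("genes from short scaffolds", 95.15),
]

for label, pct in reported:
    print(f"{label}: {pct}% of {N_SEQUENCES} is about {implied_count(pct)}")
```

Under that assumption the three values correspond to 67, 101, and 98 of the 103 genes, respectively.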

Hidden Markov Model

Most Common Taxonomy
Group Bacteria (95.146 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (14.563 % of family members)
Environment Ontology (ENVO) Unclassified (20.388 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (42.718 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all 103 sequences in the family (interactive MSAViewer display, alignment columns 1 to 54). Rows are labeled with the Protein IDs listed under Family Sequences, in the same order.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.44%, β-sheet: 0.00%, Coil/Unstructured: 55.56%
Feature Viewer: per-residue tracks (α-helices, β-strands, Coil, SS Conf. score) plotted over the representative sequence MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRS.

Predicted 3D Structure

Structure Viewer (PDBe Molstar): 3D model colored by per-residue confidence (pLDDT) in four bands: 0-50, 51-70, 71-90, 91-100.
pTM-score: 0.37

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
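The cutoff in the note above can be written as a one-line rule. This is a minimal sketch, not the database's own code: the 0.7 threshold comes from the note, while the function name `model_confidence` and the "high" label for models at or above the cutoff are assumptions made for illustration.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def model_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> str:
    """Label a predicted model as 'low' or 'high' confidence by its pTM-score.

    Assumption: anything at or above the threshold is called 'high';
    the page only defines the 'low' side (pTM < 0.7).
    """
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score must lie in [0, 1]")
    return "low" if ptm < threshold else "high"


# pTM-score reported for family F099398.
print(model_confidence(0.37))
```

For this family's pTM-score of 0.37 the rule returns "low", matching the note.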



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization: All Organisms 95.1%, Unclassified 4.9%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization: chart of associated habitat types. Habitat labels shown: Peatland, Bog Forest Soil, Bog, Freshwater Sediment, Groundwater, Iron-Sulfur Acid Spring, Watersheds, Soil, Vadose Zone Soil, Terrestrial Soil, Tropical Forest Soil, Grasslands Soil, Surface Soil, Agricultural Soil, Forest Soil, Hardwood Forest Soil, Tropical Peatland, Palsa, Permafrost, Peat Soil, Miscanthus Rhizosphere.
Slice percentages as displayed (labels for individual values are not preserved): 6.8%, 2.9%, 5.8%, 4.9%, 5.8%, 14.6%, 3.9%, 2.9%, 8.7%, 4.9%, 2.9%, 5.8%, 2.9%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1358J11329_101642891 | 3300000571 | Groundwater | MIQRILSSKNFVACLLAAATGMALYFKLPFPAENVFLQLMAL
JGI1027J12803_1080040742 | 3300000955 | Soil | MIQRILSSKNLVAFVLAAATGMTLYFRVPFPEGNIFL
JGI12635J15846_101377161 | 3300001593 | Forest Soil | MQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFFFVKYS
Ga0062385_111656083 | 3300004080 | Bog Forest Soil | MIQRILNSKNFVACLLAAATVLVLYIRMPFPEGNLFLELLFLRAQ
Ga0066673_107685442 | 3300005175 | Soil | MMQRIINSKNLVAFVLAAATGMTLYFRVPFPETNTFLLVMA
Ga0070733_101535153 | 3300005541 | Surface Soil | MMQKILNSKNFVACLLAAATGMILYFRVPFPDENIFLRVMALRSPSIFYFVK
Ga0070766_104082321 | 3300005921 | Soil | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMAQRSPSIF
Ga0075023_1000535573 | 3300006041 | Watersheds | MQRILNSKNLVAFVLAAATGMTLYFRMPFPEGNIFL
Ga0075017_1012871711 | 3300006059 | Watersheds | MIQKILNSKNFVACLLAAATGMALYFRVPFPDENIFLQVMALRSPSIFYFVKYSY
Ga0075019_103073121 | 3300006086 | Watersheds | MQRILNSKNLVAFVLAAATGMTLYFRMPFPEGNIFLRVMALRSPSAFE
Ga0075021_100959571 | 3300006354 | Watersheds | MIQRILNSKNLVAFVLAAATGMTLYFRVPFPEANIFLRVMALRSPSAF
Ga0066665_106160451 | 3300006796 | Soil | MMQRIINSKNLVAFVLAAATGMTLHFRVPFPETNTFLL
Ga0079220_102407933 | 3300006806 | Agricultural Soil | MMQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNIFLRVMALRS
Ga0073928_103261831 | 3300006893 | Iron-Sulfur Acid Spring | MMQKILSSKNFVACLLAAATGMTFYFRVPFPDENIFLQVMALRSPS
Ga0079219_105469561 | 3300006954 | Agricultural Soil | MMQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNIFLRVMALRSPSA
Ga0099829_115475991 | 3300009038 | Vadose Zone Soil | MMQRVLNSKNFVAFVLAAATGMTLYFGVPFPEGNIFLRVMALRS
Ga0099828_112814562 | 3300009089 | Vadose Zone Soil | MMQRILNSKNLVAFVLAAATGMTLYFRMPFPEGNIFLRVM
Ga0105245_129319251 | 3300009098 | Miscanthus Rhizosphere | MIQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFL
Ga0099792_111584381 | 3300009143 | Vadose Zone Soil | MMQKILNSKNFVACLLAAASGMTLYFRVPFPEENVFLQVM
Ga0116103_10161331 | 3300009615 | Peatland | MIQRILNSKNLVAFVLAAATGMALYFRMPFREDNIFLQVMALRSPSV
Ga0116105_11468611 | 3300009624 | Peatland | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMAQR
Ga0116122_11886691 | 3300009639 | Peatland | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIF
Ga0116134_13201232 | 3300009764 | Peatland | MMQKILNSKNFVACLLAAATGMNLYFRVPFPDENIFLRVMAQR
Ga0134082_103766362 | 3300010303 | Grasslands Soil | MIQRVLNSENFIAFLLASATGMTLYFLLPFPEGNLFLRERQQEVE
Ga0134063_103549511 | 3300010335 | Grasslands Soil | MMQRIINSKNLVAFVLAAATGMTLYFRVPFPETNTFLLVM
Ga0074045_110137001 | 3300010341 | Bog Forest Soil | MIQRILNSKNLVAFLLAAATGMTLYFRMPFREDNIFLQVM
Ga0126370_104724501 | 3300010358 | Tropical Forest Soil | MMQRILNSKNLVAFVLAAATGMTLYFRIPFPESNI
Ga0126378_108756393 | 3300010361 | Tropical Forest Soil | MMQRILNSKNFVAFILAAATGMTLYFRVPLPERNIFLRVMALRSPSAFEAL
Ga0134126_100638928 | 3300010396 | Terrestrial Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMAL
Ga0126383_133218062 | 3300010398 | Tropical Forest Soil | MMQRILNSKNLVAFVLAAATGMTLYFRAPFPEGNIFLRVMALRSPSAFEVLKYS
Ga0137392_108761091 | 3300011269 | Vadose Zone Soil | MIQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNIFLRVMALQSPSAFEV
Ga0137393_110394762 | 3300011271 | Vadose Zone Soil | MIQRILNSRNFVASLLAAATGMALYLRLPFPTGNVFLQAR
Ga0137378_103782201 | 3300012210 | Vadose Zone Soil | MMQRILNSKNLVAFVLAAATGLTLYFRVPFPEGNIFLR
Ga0137370_107900473 | 3300012285 | Vadose Zone Soil | MIQRILNSKNFVAYLLAAATGMALYFRIPFPDENIFLLVMVLRSPSI
Ga0137394_116321262 | 3300012922 | Vadose Zone Soil | MIQRILNSKNLVAFVLAAATGMTLYFRAPFPEANIFLGVMALRSPSAFQVLKYS
Ga0137359_100663307 | 3300012923 | Vadose Zone Soil | MIQKILNSKNFVACLLAAATGMALYFRVPFPDENIFLQVMALRSPSI
Ga0137359_112420882 | 3300012923 | Vadose Zone Soil | MIQRILSSKNFVACLLAAATGLALFFKLPFPAENVFLQ
Ga0137416_103115081 | 3300012927 | Vadose Zone Soil | MIQKILNSKNFVACLLAAATGMTLYFRVPFPDENIFLRVMALRSPSIFYFVKY
Ga0137416_115021791 | 3300012927 | Vadose Zone Soil | MIQRIINSKNLVAFVLAAATGMTLYFRVPFPETNTFLLVMALRSPSA
Ga0137404_109848203 | 3300012929 | Vadose Zone Soil | MIQRILNSKNLVAFVLATATGMTLYFRVPFPDGNIFLRVMALRSPSAFEVLK
Ga0164303_103680031 | 3300012957 | Soil | MMQRILNSKNLVALVLAAATGMTLYFRIPFPEGNLF
Ga0181529_105946572 | 3300014161 | Bog | MMQKILNSKNFVACLLAAATGMTLYFRVPFPDENIFLQVMAQRSPSIFYFVKYSY
Ga0181538_105193511 | 3300014162 | Bog | MIQRILNSKNLVAFVLAAATGMTLYFRIPFREDNIFLQVMALRS
Ga0182018_106599251 | 3300014489 | Palsa | MIQKILNSKNFVACLLAAATGMTLYFRVPFPDENIFLQVMAQRSPSIFCF
Ga0182024_104433761 | 3300014501 | Permafrost | MIQRILSSKNFVACLLAAATGMTLYFRVPFPDENIFLQVMAVRSPS
Ga0182024_119947201 | 3300014501 | Permafrost | MIQRILSSKNFVACLLAAATGMTLYFRVPFPDENIFLQVMAVRSPSI
Ga0181522_106560811 | 3300014657 | Bog | MMQKLLNSKNFVACLLAAATGMTLYFWVPFPDDNI
Ga0137420_11839044 | 3300015054 | Vadose Zone Soil | MIQRILRSKNFVACLLAAVMGMALYFKLPFPAENAFL
Ga0182032_118238752 | 3300016357 | Soil | MIQRILASKDFVASLLAAGTGMTLYFRVPFPENNVFLQVIALRSPWAFGVLKYS
Ga0182040_119658932 | 3300016387 | Soil | MIQRILASKDFVASLLAAGTGMTLYFRVPFPEGNVFLQVIALRSPSAFGVL
Ga0187854_104817212 | 3300017938 | Peatland | MMQKILNSKNFVACLLAAATGMILYFRVPFPEENIFLQVMALRSPSIFYFV
Ga0187816_103872072 | 3300017995 | Freshwater Sediment | MMQRILNSKNLVAFVLAAATGMTLYFWVPFPEANIFVRVMALRSPSVFEVLKY
Ga0187873_13855452 | 3300018013 | Peatland | MIQRILNSKNFIAFVLASATGMTLYFLLPFPEGNLFL
Ga0187860_13973782 | 3300018014 | Peatland | MMQKILNSKNFVACLLAAATGMTLYFRVRFPEDNVFLQVMAQRSPSIFYF
Ga0187885_103260242 | 3300018025 | Peatland | MIQRILNSKNLVAFVLAAATGMALYFRMPFREDNIFLQVMALRS
Ga0187863_107942961 | 3300018034 | Peatland | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQV
Ga0187875_102289961 | 3300018035 | Peatland | MIQRIINSKNFIAFLLASATGMTLYFLYPFPASNLFLRVIA
Ga0187855_104165471 | 3300018038 | Peatland | MIQRIINSKNFIAFLLASATGMTLYFLYPFPASNLFLRVI
Ga0187765_105111881 | 3300018060 | Tropical Peatland | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFYFVK
Ga0137408_14169411 | 3300019789 | Vadose Zone Soil | MIQRILSSKNFVACLLAAVMGMALYFKLPFPAENAFL
Ga0193751_11709331 | 3300019888 | Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPDENVFLQVMVLRSPSIFY
Ga0210407_106269763 | 3300020579 | Soil | MMQRILNSKNLVAFVLAAATGMTLYFRMPFPEGNIFLRVMALR
Ga0210403_113491572 | 3300020580 | Soil | MQRILNSKNLVAFVLAAATGMTLYFRMPFPESNIFLRVMALRS
Ga0210399_110119831 | 3300020581 | Soil | MIQRILNSRNFVACLLAAVTGMALYFELPFPTGNVFVKLIP
Ga0210385_108018061 | 3300021402 | Soil | MMQKILNSKNFVACLLAAATGMTLYFQVPFPEENVFLQVMALRS
Ga0210386_108224661 | 3300021406 | Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFYFVKYS
Ga0210384_118499762 | 3300021432 | Soil | MIGRILNSKNFVACLLAAATGMVLYIKVPFPEGNLFFELMFLWAR
Ga0210390_101036351 | 3300021474 | Soil | MQRILNSKNFVAFVLAAATGMTLYFRMPFPEGNIFLRVMALRSPSA
Ga0126371_117471461 | 3300021560 | Tropical Forest Soil | MIQRLLNSKNFVACLLTAATGMILYIRMPFPEDNLF
Ga0242669_11230422 | 3300022528 | Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRS
Ga0212123_102831253 | 3300022557 | Iron-Sulfur Acid Spring | MIQRILNSKNFVACLLAAATGMVLYIKMPFPEGNLFFELMFLWARP
Ga0224563_10196332 | 3300022731 | Soil | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFL
Ga0224550_10557921 | 3300022873 | Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFL
Ga0208456_10573011 | 3300025441 | Peatland | MIQRILNSKNLVAFVLAAATGMALYFRMPFREDNIFLQVMALRSPSVF
Ga0208034_10696361 | 3300025442 | Peatland | VIQRILNSKNFIAFVLASATGMTLYFLFPFPEGNLFLRLIAVNA
Ga0209648_101610804 | 3300026551 | Grasslands Soil | MMQKIINSKNLVAFVLAAATGMTLYFRVPFPETNTFLLVMALRSPSAFEA
Ga0209730_10215442 | 3300027034 | Forest Soil | MMQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNIFLGVMALRSPSA
Ga0209731_10309852 | 3300027326 | Forest Soil | MMQKIINSKNLVAFVLAAATGMMLYFRVPFPETNTFLL
Ga0209626_10952603 | 3300027684 | Forest Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPDENIFLQVMAQRSSSIF
Ga0209073_104141761 | 3300027765 | Agricultural Soil | MMQRILNSKNLVAFVLAAATGMTLYFRMPFPEGNIF
Ga0209655_102869682 | 3300027767 | Bog Forest Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFYFVKY
Ga0209068_100078188 | 3300027894 | Watersheds | MIQRILTSRNFVACLLAAVTGMALYFELPFPTENV
Ga0209526_108546141 | 3300028047 | Forest Soil | MMHRILNSRNFVARLLAAATGMVLYSKQPFYRENVSLQVMALRVPFVH
Ga0307504_102862321 | 3300028792 | Soil | VIQRILNSRNFVACLLAAATGIAFYLKLPFPTENVFL
Ga0311352_114802121 | 3300029944 | Palsa | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFFFVKYS
Ga0311338_120732532 | 3300030007 | Palsa | VIQRILNSKNFIAFVLASATGMTLHFLLPFPEGNLFL
Ga0302182_104666601 | 3300030054 | Palsa | MIQRVLNSKNFIAFVLASATGMTLYFLLPFPEGDIFLRLIS
Ga0302176_101997401 | 3300030057 | Palsa | VTQRILNSKNFIAFVLASATGMTLYFLLPFPEGNLFLRLI
Ga0311353_115406891 | 3300030399 | Palsa | MIQRILNSRNFVACLLAAATGMALYFELPFPTGNVFLHLMALRA
Ga0265754_10063553 | 3300031040 | Soil | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFYFV
Ga0265760_102758592 | 3300031090 | Soil | VIQRILNSKNFIAFVLASATGMTFYFLLPFPEGNLFLRLIAA
Ga0170824_1062407461 | 3300031231 | Forest Soil | MMQRIINSKNLVAFVLAAATGMALYFRVPFPEGNTF
Ga0302324_1016573491 | 3300031236 | Palsa | MIQRILNSRNFVACLLAAATGMALYFELPFPTANVFLHLMA
Ga0307372_104201412 | 3300031671 | Soil | MIQRILNSKNFIAFVLASATGMTLYFLLPFPEGNLFLRLI
Ga0307372_104588432 | 3300031671 | Soil | MMQKILSSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRSPSIFFFV
Ga0307373_103109413 | 3300031672 | Soil | MMQKILNSKNFVACLLAAATGMTLYFRVRFPEDNVFLQVMAQRSP
Ga0307475_114743272 | 3300031754 | Hardwood Forest Soil | MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMA
Ga0318533_114001061 | 3300032059 | Soil | MIQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNVFVRVIALRSPSAFKVLK
Ga0307471_1035436861 | 3300032180 | Hardwood Forest Soil | MMQRILNSKNLVAFVLAAATGMTLYFRVPFPEGNTFL
Ga0335085_118654082 | 3300032770 | Soil | MIQRILNSKNFIAFVLASATGMTLYFLLPFPEGNLFLRLIGV
Ga0326728_108468321 | 3300033402 | Peat Soil | MIQRILNSKNLVAFVLAAATGMILYFRMPFREDNIFLQVMALRSP
Ga0326728_110290321 | 3300033402 | Peat Soil | MIQRILNSKNLVAFVLAAATGMALYFRMPFREDNIFLQVMALRSPSVFQVLKYSY
Ga0326724_0458167_1_138 | 3300034091 | Peat Soil | MIQRILNSKNLVAFVLAAATGMALYFRMPFREDNIFLQVMALRSPS
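To get a rough sense of how close a family member is to the representative sequence, the two can be compared residue by residue. This is a minimal sketch, not the method used by the database: it assumes both sequences start at the same position and contain no gaps, so it is a simple ungapped prefix comparison rather than a real alignment. The function name `prefix_identity` is hypothetical; the two sequences are taken from this page.

```python
def prefix_identity(a: str, b: str) -> float:
    """Fraction of identical residues over the shared N-terminal prefix.

    Assumption: sequences are compared position by position from residue 1
    with no gaps; the longer sequence's extra tail is ignored.
    """
    n = min(len(a), len(b))
    if n == 0:
        raise ValueError("both sequences must be non-empty")
    # zip() stops at the shorter sequence, so exactly n positions are compared.
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / n


# Representative sequence of family F099398 (44 residues).
representative = "MMQKILNSKNFVACLLAAATGMTLYFRVPFPEENVFLQVMALRS"
# Member Ga0099792_111584381 (Vadose Zone Soil), 40 residues.
member = "MMQKILNSKNFVACLLAAASGMTLYFRVPFPEENVFLQVM"

print(f"{prefix_identity(representative, member):.3f}")
```

Here the member differs from the representative at one of 40 compared positions (T versus S at position 20), giving 0.975. Members that begin with MQ rather than MMQ or MIQ are shifted by one residue, so this comparison would understate their similarity; a proper alignment is needed for those.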




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice