NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F102158


Overview

Basic Information
Family ID F102158
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 46 residues
Representative Sequence MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAE
Number of Associated Samples 98
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.02 %
% of genes from short scaffolds (< 2000 bps) 94.12 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.42
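
The percentages in the quality assessment can be read back as gene counts. The following is a minimal sketch, assuming each percentage is taken over the 102 sequences of the family (the page does not state the denominator); the function name `percent_to_count` is illustrative only.

```python
# Convert the reported quality percentages into approximate gene counts.
# Assumption: every percentage is computed over the family's 102 sequences.
N_SEQUENCES = 102  # "Number of Sequences" from Basic Information

reported_percentages = {
    "valid RBS motifs": 0.00,
    "near scaffold ends (potentially truncated)": 99.02,
    "short scaffolds (< 2000 bps)": 94.12,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the nearest whole number of genes for a percentage of `total`."""
    return round(percent / 100 * total)


counts = {
    label: percent_to_count(percent, N_SEQUENCES)
    for label, percent in reported_percentages.items()
}

for label, n in counts.items():
    print(f"{label}: {n} of {N_SEQUENCES}")
```

Under that assumption, 99.02 % corresponds to 101 of 102 genes and 94.12 % to 96 of 102.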


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (26.471 % of family members)
Environment Ontology (ENVO) Unclassified (27.451 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (47.059 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Alignment view: 102 family sequences, alignment columns 1 to 60. Powered by MSAViewer.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 51.39%, β-sheet 0.00%, Coil/Unstructured 48.61%
[Feature Viewer tracks for the representative sequence MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAE: sequence, α-helices, β-strands, coil, secondary-structure confidence score. Powered by Feature Viewer.]

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.42
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
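
The confidence rule stated above can be expressed as a small check. This is a minimal sketch, not the database's own code: the 0.7 cutoff and the four pLDDT ranges are taken from this page, while the function names and the handling of fractional pLDDT values (each upper bound treated as inclusive) are assumptions.

```python
PTM_CUTOFF = 0.7  # models with pTM below this value are labelled low confidence

# pLDDT ranges from the viewer legend.
# Assumption: upper bounds are inclusive, so 50.5 falls into "51-70".
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score is below the cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Return the legend range that a per-residue pLDDT value falls into."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {plddt}")
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: the last bin ends at 100")


# The pTM-score reported for family F102158.
print(is_low_confidence(0.42))
```

With the reported pTM-score of 0.42, `is_low_confidence` returns `True`, which matches the "Low Quality Model" label.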



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

[Chart: NCBI taxonomy distribution of family members. All Organisms: Unclassified, 100.0%. Powered by ApexCharts.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart:
Groundwater Sediment
Polar Desert Sand
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Agricultural Soil
Permafrost
Permafrost Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Avena Fatua Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Populus Rhizosphere
Chart segment labels (habitat assignment not recoverable): 26.5%, 8.8%, 11.8%, 2.9%, 7.8%, 4.9%, 2.9%, 4.9%
Powered by ApexCharts



Associated Samples


Geographical Distribution

Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
A3035W6_11082381	3300001359	Permafrost	MSPQSSQAAIAALNGVHPVLAHFDKPRSAEDLAADIID
A35518A_10016251	3300001565	Permafrost	MSPANTQAALAALNGARPVLAHFNTPRSAEDLAADIIDLWTGV
JGI12053J15887_103729931	3300001661	Forest Soil	VSPPNTQAALTALNGVRPVLTHFETSRAAEDLAADIIDLWSGAEASLQALV
JGI25387J43893_10767822	3300002915	Grasslands Soil	VSPQNSQGALTALNGVRPVLTHFETSRSAEDLAADIIDLWTGVET
Ga0007429J51699_10949992	3300003579	Avena Fatua Rhizosphere	MSPQNNPGALAALNGARPVLANFDKARSAEDLAADIIDLWTAAEQALQAMVGNASLTG
Ga0062589_1025069751	3300004156	Soil	MSPANTQAALAALNGVRPVLSHFDSPRSAEDLAADIIDLWVGAEAS
Ga0062595_1016944091	3300004479	Soil	MSPQNNPGAVAALNGARPVLAHFDQPRSAEDLAADIIDLWSGAESALQSVVGNTSLSGQQ
Ga0070690_1003702351	3300005330	Switchgrass Rhizosphere	MSPQNTPGAVSALNGARPVLAHFEKPRSAEDLAADIIDLWTGAESALQSLVGNASLTG
Ga0070680_1003492022	3300005336	Corn Rhizosphere	MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAE
Ga0070680_1007976992	3300005336	Corn Rhizosphere	MSPRNNPGALAALSGARPVLANFGKARSAEDLAADIIDLWTAAEQALQALVGNASLTG
Ga0070682_1003099141	3300005337	Corn Rhizosphere	MSPQSTQAALASLNGARPVLAHFDQPRSAEDLAADIIDLW
Ga0070692_107610661	3300005345	Corn, Switchgrass And Miscanthus Rhizosphere	MSPQNNPGALAALKGARPVLAHFDKSRSAEDLAADIIDLW
Ga0070693_1010895941	3300005547	Corn, Switchgrass And Miscanthus Rhizosphere	MSPQNTQAALASLNGARPVLAHFDQPRSAEDLAADIIDLWTGVEGALQTLVGNSSLTG
Ga0070704_1007653351	3300005549	Corn, Switchgrass And Miscanthus Rhizosphere	MSPQSTQAALASLNGARPVLAHFDQPRSAEDLAADIIDLWSGVESALQ
Ga0066670_100490413	3300005560	Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIIDLWTGAEGALQALVGN
Ga0066703_101209501	3300005568	Soil	MSPQNTPGALASLNGVRPVLAHFETPRSAEDLAADIIDLWTGAEGALQ
Ga0075425_1003046893	3300006854	Populus Rhizosphere	MSPQSTQAALASLNGARPVLAHFDQPRSAEDLAADI
Ga0075435_1016894322	3300007076	Populus Rhizosphere	MSPQNTPGALASLNGVRPVLAHFERPRSAEDLAADIIDLWT
Ga0104326_1120212	3300007740	Soil	VSPQNSQAALTAMNGVRPVLTHFETPRAAEDLAADIIDLWTGAEASLQALVG
Ga0066710_1024598242	3300009012	Grasslands Soil	MSPQNTPGALASLNGVRPVLAHFETPRSAEDLAADIIDLWTGAE
Ga0066793_101692122	3300009029	Permafrost Soil	VSPQNSKAALTALNGVRPVLTHFGTPRAAEDLAADIIDLWTGAEASV
Ga0099792_106017601	3300009143	Vadose Zone Soil	MSPANTQAALAALSGARPILAHFNTPRSAEDLAADI
Ga0105249_123781101	3300009553	Switchgrass Rhizosphere	MSPQNTPGAVSALNGARPVLAHFEKPRSAEDLAADI
Ga0127471_10858492	3300010090	Grasslands Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIIDLWTGAEGALQSLVGNS
Ga0127500_10136372	3300010103	Grasslands Soil	VSPQNTQAALAALNGARPVLAHFDSPRSAEDLAADIIDLW
Ga0127495_10500841	3300010115	Grasslands Soil	VSPQNTQAALTVLNGARPVLAHFDSPRSAEDLAADIIDLWTGAEGALQALVGNS
Ga0127466_10115061	3300010116	Grasslands Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIIDL
Ga0134070_101433702	3300010301	Grasslands Soil	MSPQNSQAALASLNGVRPVLAHFDTPRSAEDLAADIIDLWTG
Ga0134109_101465742	3300010320	Grasslands Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIIDLWTGAEGALQSLVGN
Ga0134066_100347622	3300010364	Grasslands Soil	MSPQNNQAALASLNGARPVLAHFDQPRSAEDLAADIIDLWTGVEAALQTLVGNS
Ga0134128_112563741	3300010373	Terrestrial Soil	MSPQNTQAALASLNGARPVLAHFDQPRSAEDLAADIIDLWTGVEGALQTLVGNSS
Ga0137463_12081621	3300011444	Soil	MSPQSSQAAIAALNGVHPVLAHFDKPRSAEDLAADIIDLWAGAESALQALVGNSSLTGQ
Ga0136625_10698651	3300012091	Polar Desert Sand	VTPASSGALTALNGVRPVLSHFDNPRSAEDLAADIIDLWTGVESSLRALVGGSALS
Ga0137382_107876742	3300012200	Vadose Zone Soil	MSPRNNPGALAALSGARPVLAHFDKARSAEDLAADIIDLWTAA
Ga0137376_112752352	3300012208	Vadose Zone Soil	MSPQNTPGALAAVNGARPILAHFSTPRSAEDLAADIIDLWAAAETALQALVGNGSLTGQ
Ga0137377_101430533	3300012211	Vadose Zone Soil	MSPQSSQAALAALNGVRPVLAHFDKPRSAEDLAAD
Ga0134051_12298472	3300012398	Grasslands Soil	MSPQNTPGALAALNGARPVLAHFESPRSAEDLAADIIDLWSAAEGALQALVGNSSLTG
Ga0134055_11983032	3300012401	Grasslands Soil	MSPQNNPGALAALNGARPVLANFGKARSAEDLAADIIDLWTAAEQALQALVGNASLT
Ga0150984_1225148552	3300012469	Avena Fatua Rhizosphere	MSPQNNPGAVAALNGARPVLSHFDQPRSAEDLAADIID
Ga0137397_109138432	3300012685	Vadose Zone Soil	MSPQSSQAALSALNGVRPVLAHFETPRNAEDLAADII
Ga0137413_100664691	3300012924	Vadose Zone Soil	MSPANTQAALAALSGARPILAHFNTPRSAEDLAADIIDLWTGVERSLQALI
Ga0164298_111486031	3300012955	Soil	MSPQGSQAAVSALNGVRPVLAHFETPRNAEDLAADIIDLWAGVESSLQ
Ga0134087_101731491	3300012977	Grasslands Soil	MSPQNNPGALAALNGARPVLANFDKARSAEDLAADIID
Ga0164308_116232272	3300012985	Soil	MSPQNNPGAVAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGA
Ga0120179_10019056	3300013763	Permafrost	VSPQNSQAALTALNGVRPVLTHFETSRAAEDLAADIIDLWTGAE
Ga0157377_103205602	3300014745	Miscanthus Rhizosphere	MSPQNTPGAVSALNGARPVLAHFEKPRSAEDLAADIIDLWTGAESALQSLVGNASLTGQ
Ga0137403_108336371	3300015264	Vadose Zone Soil	MSPQNTPGAIAALNGARPVLANFDKPRSAEDLAADIIDLWAAAEVALQALVGNS
Ga0134112_104483441	3300017656	Grasslands Soil	MSPANTQAAQNALNAVRPTLAHFDTPRNAEDLAADV
Ga0134083_106081741	3300017659	Grasslands Soil	MSPQKNPGALAALNGARPVLANFGKARSAEDLAADIIDLWTAAEQALQALVGN
Ga0184618_104841821	3300018071	Groundwater Sediment	MSPQNSQAALAALNSARPVLAHFDTPRSTEDLAADIIDLWTGAEASLQALV
Ga0190272_103512191	3300018429	Soil	VTPASSAALAALNGVRPVLTHFETPRGGEDLAADIIDLWT
Ga0066667_111037532	3300018433	Grasslands Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIIDLWTGAEGALQSLVGNSSLTAQ
Ga0184641_11509361	3300019254	Groundwater Sediment	MSPQNSQAALAALAGARPVLAHFETPRSAEDLAADIIDLWTGVESALQALVDRE
Ga0193720_10548371	3300019868	Soil	MSPQSSQAALSALNGVRAVLAHFERPRNAEDLAADIIDLWAGVESSLQ
Ga0193722_11165222	3300019877	Soil	VSPQNSQTALTALNGVRPVLAHFDSPRNAEDLAADVIDLW
Ga0193723_10061991	3300019879	Soil	MSPQGSQAAVSALNGVRPVLAHFETPRNAEDLAADIIDLWAGVESSLQA
Ga0193747_10870831	3300019885	Soil	MSPQGSQAALSALNGVRPVLAHFETPRNAEDLAADIIDLWAGVESALQA
Ga0193755_12342802	3300020004	Soil	MSPQGSQAAVSALNGVRPVLAHFETPRNAEDLAADIIDLWAGVEGSLQALVGNSSLTGQ
Ga0193749_10256561	3300020010	Soil	MSPQSSQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWAGAESALQT
Ga0193726_13159602	3300020021	Soil	MSPQSSQALSALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAESALQALVGNPSLT
Ga0193695_10357182	3300021418	Soil	MSPQNSQAALAALNGARPVLAHFETSRSTEDLAADI
Ga0182009_103965171	3300021445	Soil	VSPQNNQAALTALNGARPVLAHFDTPRSAEDLAADIIDLWGG
Ga0222621_10164551	3300021510	Groundwater Sediment	MSPQSSQAALSALNGVRAVLAHFERPRNAEDLAADIIDLWAGVENS
Ga0222622_103210242	3300022756	Groundwater Sediment	MSPQSSQAALAALNGARPVLAHFETSRSAEDLAADIIDL
Ga0247745_10431542	3300022898	Soil	MSPQNTPGAVSALNGARPVLAHFEKPRSAEDLAADII
Ga0193714_10118851	3300023058	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESALQSL
Ga0193714_10233731	3300023058	Soil	MSPQGSQAALSALNGVRPVLAHFETPRNAEDLAADIIDLWA
Ga0247670_10094833	3300024283	Soil	MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGA
Ga0137417_11189512	3300024330	Vadose Zone Soil	MSPQNTPSALAALNGARATLAHFETPRNAEDLAADIIDLW
Ga0207707_112175852	3300025912	Corn Rhizosphere	MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAERSL
Ga0207652_104122111	3300025921	Corn Rhizosphere	MSPRNNPGALAALSGARPVLANFGKARSAEDLAADI
Ga0207689_115370481	3300025942	Miscanthus Rhizosphere	MSPQNTPGAVSALNGARPVLAHFEKPRSAEDLAADIIDLWT
Ga0207651_120059501	3300025960	Switchgrass Rhizosphere	MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAER
Ga0207677_103885232	3300026023	Miscanthus Rhizosphere	MSPQNTQAALASLNGARPVLAHFDQPRSAEDLAADIIDLWTGVEGA
Ga0207702_116308241	3300026078	Corn Rhizosphere	MSPQNNPSALAALNGARPVLAHFDQPRSAEDLAADI
Ga0209238_11303321	3300026301	Grasslands Soil	MSPQSKQAALASLNGARSVLAHFDQPRSAEDLAADIIDLWS
Ga0209239_11980341	3300026310	Grasslands Soil	MSPQNNPGAVTSLNGVRPVLAHFDTPRSAEDLAADIIDLWAGAEGS
Ga0209470_12307182	3300026324	Soil	MSPQNTPGALAALNGARPVLAHFESPRSAEDLAADI
Ga0209152_100638041	3300026325	Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAADIID
Ga0209152_102348951	3300026325	Soil	MSPQNTPGALASLNGVRPVLAHFETPRSAEDLAADIIDLWTGAEGALQALV
Ga0209806_10243264	3300026529	Soil	MSPQNTPGALASLNGVRPVLAHFETPRSAEDLAADIIDLWTGAEGAL
Ga0209807_11348782	3300026530	Soil	MSPQNNPGAVTSLNGVRPVLAHFDTPRSAEDLAADIIDLWAGAEGSLQSLVGNSSLTGQQ
Ga0209156_102820911	3300026547	Soil	MSPQNTPGALASLNGVRPVLAHFDTPRSAEDLAAD
Ga0209969_10200101	3300027360	Arabidopsis Thaliana Rhizosphere	MSPQVNQAAIAALNGVRPVLSHFDSPRSAEDLAADIIDLWAGAESSL
Ga0209331_10650251	3300027603	Forest Soil	VSPPNSQAALTALNGVRPVLTHFETSRAAEDLAADIIDLWSGAETSLQALVGNSSL
Ga0209818_10321421	3300027637	Agricultural Soil	VTPASSGALAALNGVRPVLTHFETPRGAEDLAADIIDLWTGVESSLR
Ga0209387_10553761	3300027639	Agricultural Soil	VTPASSGALAALNGVRPVLTHFETPRGAEDLAADIIDLWTGVESSLRALVGGSSL
Ga0209488_104723562	3300027903	Vadose Zone Soil	MSPQGSQAALSALNGVRPVLAHFETPRNAEDLAADIIERRVMNAE
Ga0307298_100965732	3300028717	Soil	MSPQSSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESSLQSLVGNP
Ga0307280_101947301	3300028768	Soil	MSPANTQAAQAALSGVRPVLAHFNTPRSAEDLAADIIDLWTGVAR
Ga0307280_103828621	3300028768	Soil	VSPQNSQAALTALNGVRPVLTHFETPRNAEDLAAD
Ga0307312_105787681	3300028828	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESSLQ
Ga0307308_105446932	3300028884	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWA
Ga0307304_102963482	3300028885	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESAL
Ga0308196_10138852	3300030989	Soil	MSPQSSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWA
Ga0308185_10263462	3300031081	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESALQSLVGNSSL
Ga0308193_10254061	3300031096	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDL
Ga0308191_10126992	3300031098	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVE
Ga0308187_101254662	3300031114	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWAGVESSLQSLVGNSSL
Ga0310813_104571911	3300031716	Soil	MSPQNNPSALAALNGARPVLAHFDQPRSAEDLAADIIDLWTGAESALQA
Ga0310895_100548083	3300032122	Soil	MSPRNNPGALAALSGARPVLANFGKARSAEDLAADIIDLWTAAEQALQA
Ga0370546_078603_376_549	3300034681	Soil	MSPQNSQAALAALNGARPVLAHFETSRSAEDLAADIIDLWTGVESALQSLVGNSSLTG
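
The sequence records above lend themselves to simple summaries such as mean length and habitat tallies. The following is a minimal sketch over four rows copied from the table, assuming each row splits into the four header columns and that the sample taxon ID is the 10-digit number beginning with 3300; the `summarize` helper is hypothetical and not part of NMPFamsDB.

```python
from collections import Counter
from statistics import mean

# (protein ID, sample taxon ID, habitat, sequence): four rows from the table.
# Assumption: the taxon ID is the 10-digit number starting with "3300".
RECORDS = [
    ("Ga0070680_1003492022", "3300005336", "Corn Rhizosphere",
     "MSPQNNQAALAALNGVRPVLAHFDKPRSAEDLAADIIDLWTGAE"),
    ("Ga0066710_1024598242", "3300009012", "Grasslands Soil",
     "MSPQNTPGALASLNGVRPVLAHFETPRSAEDLAADIIDLWTGAE"),
    ("Ga0120179_10019056", "3300013763", "Permafrost",
     "VSPQNSQAALTALNGVRPVLTHFETSRAAEDLAADIIDLWTGAE"),
    ("Ga0137377_101430533", "3300012211", "Vadose Zone Soil",
     "MSPQSSQAALAALNGVRPVLAHFDKPRSAEDLAAD"),
]


def summarize(records):
    """Return record count, mean sequence length, and per-habitat counts."""
    lengths = [len(sequence) for _, _, _, sequence in records]
    habitats = Counter(habitat for _, _, habitat, _ in records)
    return {
        "n": len(records),
        "mean_length": mean(lengths),
        "habitats": habitats,
    }


summary = summarize(RECORDS)
print(summary["n"], summary["mean_length"], dict(summary["habitats"]))
```

Run over all 102 rows, the same function would give a mean length to compare against the "Average Sequence Length" reported in the overview.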




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice