NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F102533


Overview

Basic Information
Family ID F102533
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 36 residues
Representative Sequence MKSLVKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD
Number of Associated Samples 65
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.99 %
% of genes near scaffold ends (potentially truncated) 28.71 %
% of genes from short scaffolds (< 2000 bps) 56.44 %
Associated GOLD sequencing projects 59
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.22

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (92.079 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil
(10.891 % of family members)
Environment Ontology (ENVO) Unclassified
(20.792 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Water (non-saline)
(35.644 % of family members)
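
The percentages above are stated as shares of family members, so they can be converted back into approximate member counts. The following is a minimal sketch, assuming every percentage is a share of the 101 sequences listed under Basic Information; the function name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
# Convert the "% of family members" figures reported above into member counts.
# Assumption: every percentage is a share of the 101 sequences in the family.
FAMILY_SIZE = 101


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported percentage."""
    return round(percent / 100 * total)


# Values copied from the Most Common Taxonomy and Most Common Ecosystem blocks.
reported = {
    "Bacteria (NCBI taxonomy)": 92.079,
    "Arctic Peat Soil (GOLD)": 10.891,
    "Unclassified (ENVO)": 20.792,
    "Water, non-saline (EMPO)": 35.644,
}

counts = {label: members_from_percent(pct) for label, pct in reported.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption the Bacteria share corresponds to 93 of the 101 members, and the most common GOLD ecosystem to 11.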




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (101 rows; row labels are the Protein IDs listed under Family Sequences).
Powered by MSAViewer



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 25.40%, β-sheet: 0.00%, Coil/Unstructured: 74.60%
Feature Viewer tracks for the representative sequence MKSLVKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD (positions 1-35): Sequence, α-helices, β-strands, Coil, SS Conf. score, Signal Peptide
Powered by Feature Viewer

Predicted 3D Structure


Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.22
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
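
The confidence rules stated in this section can be expressed as two small checks. This is a minimal sketch, assuming only what the page states: a pTM cutoff of 0.7 and pLDDT legend bins of 0-50, 51-70, 71-90 and 91-100. The function names and the handling of fractional pLDDT values are assumptions for illustration, not NMPFamsDB code.

```python
# Thresholds as stated on the page: a model with pTM < 0.7 is "low confidence",
# and the pLDDT legend uses the bins 0-50, 51-70, 71-90, 91-100.
PTM_CUTOFF = 0.7
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float) -> bool:
    """True when the model falls below the page's pTM cutoff."""
    return ptm < PTM_CUTOFF


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to its legend bin."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    # Assumption: a fractional value belongs to the first bin whose upper
    # edge it does not exceed (for example, 50.5 is placed in "51-70").
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: 100 is the last upper edge")


print(is_low_confidence(0.22))  # pTM-score reported for this family
print(plddt_bin(85.3))          # hypothetical per-residue value
```

With the reported pTM-score of 0.22, the first check returns `True`, which matches the "Low Quality Model" label.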



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy distribution (pie chart): All Organisms 92.1%, Unclassified 7.9%
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat distribution (pie chart). Habitat types shown: Wetland, Freshwater, Anoxic Zone Freshwater, Freshwater Lake, Freshwater Lake Hypolimnion, Lake Chemocline, Sediment, Freshwater Lake Sediment, Sinkhole Freshwater, Groundwater, Saline Water, Sediment (Intertidal), Arctic Peat Soil, Soil, Untreated Peat Soil, Natural And Restored Wetlands, Fen, Permafrost Soil, Deep Subsurface, Rhizosphere, Anaerobic Digestor Sludge, Sediment Slurry.
Slice percentages shown: 5.9%, 6.9%, 3.0%, 6.9%, 4.0%, 4.0%, 4.0%, 4.0%, 4.0%, 4.0%, 6.9%, 10.9%, 5.9%, 3.0%, 5.9%
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
TB_PC08_66DRAFT_1000101713 | 3300000228 | Groundwater | MLSNSKTKKIEIMKSLVKKKIEVQKVNTKKVAQCCAKTSKTVVGCHD*
TB_PC08_66DRAFT_100021572 | 3300000228 | Groundwater | MKSLVKKKIEVQKVNTKKVAQCCAKTSKTVVGCHD*
JGIcombinedJ13530_1056936321 | 3300001213 | Wetland | NVVKMKSLVKKKIMKVEAKPAQCCAKVSKTVVGCHD*
JGIcombinedJ13530_1057453212 | 3300001213 | Wetland | MKSLVKKKERKANETAKPAQCCAKISKIIVGCHD*
JGI20220J20339_10761912 | 3300001809 | Wetland | MKSLVKKKVKVEKAETKKEAQCCAKTSKTVVGCHD*
JGI24146J20443_10352173 | 3300001868 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKAAQCCAKTSKIIVGCHD*
MIS_100601285 | 3300002027 | Sinkhole Freshwater | MKSLVKKIETVKVNQEVTKKVAQCCAKTSKTIVGCHD*
JGIcombinedJ21911_100003735 | 3300002066 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKPAQCCAKTSKIIVGCHD*
JGIcombinedJ21913_100056225 | 3300002068 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKPAQCCATSKIIVGCHD*
JGI24146J26653_10340682 | 3300002101 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKXAQCCAKTSKIIVGXHD*
JGI24145J26757_101447972 | 3300002183 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKPAQCCAXTSKIIVGCHD*
JGIcombinedJ26865_10276112 | 3300002347 | Arctic Peat Soil | MTIMKSLIKKKEGKVSAKPAQCCAKTSKIIVGCHD*
Ga0066603_105051352 | 3300004154 | Freshwater | NKKINFMKSLVKKKVEVQKATSKKVAQCCAKTSKTVVGCHD*
Ga0066599_1003496172 | 3300004282 | Freshwater | MKSLVKKNTGKAGVKVKASENAKPAQCCAKTSKIIVGCHD*
Ga0066599_1008182332 | 3300004282 | Freshwater | MKSLVKKNNVKASAKANGNGNAKPAQCCAKTSKIIVGCHD*
Ga0066599_1012711562 | 3300004282 | Freshwater | FTYKIIDMKSVIRKKEKRNNTKTAQCCAKTSKTVVGCHD*
Ga0074471_102366622 | 3300005831 | Sediment (Intertidal) | MKTLVKKKVEAQKVNTKKVAQCCAKTSKAVVGCHD*
Ga0074471_102579742 | 3300005831 | Sediment (Intertidal) | MKSLVKKKIEVQKVNAKKVAQCCAKTSKTVVGCHD*
Ga0074471_103513661 | 3300005831 | Sediment (Intertidal) | SNSINQTYMTSLIKKKKVEVEAKVAQCCAKTSKNVVGCHD*
Ga0074471_104608282 | 3300005831 | Sediment (Intertidal) | MSYEKSDQKKIEKENVKVAQCCAKTSKTVAGCHD*
Ga0074471_104812572 | 3300005831 | Sediment (Intertidal) | MKTLVKKKIEAQKINTKKVAQCCAKTSKTVVGCHD*
Ga0074471_105640111 | 3300005831 | Sediment (Intertidal) | TTMKSLIKKKDGKVNTKVAQCCAKTSKTVVGCHD*
Ga0074471_106325902 | 3300005831 | Sediment (Intertidal) | THMKSLIKKKVEVQKTNTKKVAQCCAKTSKTVVGCHD*
Ga0079306_100080317 | 3300007959 | Deep Subsurface | MKSLVKKKIEVQKVNTKKVAQCCAKTSKTIVGCHD*
Ga0079306_10012005 | 3300007959 | Deep Subsurface | MKSLVKKKMEVQKVNTKKTAQCCAKTSKTVVGCHD*
Ga0079306_11691762 | 3300007959 | Deep Subsurface | MKSLVKKKIEVQKVNTKKVAQCCAKTSTTVVGCHD*
Ga0066793_100053189 | 3300009029 | Permafrost Soil | MTIMKTLIKKKEGKVNAKPAQCCAKTSKIIVGCHD*
Ga0105047_100025482 | 3300009083 | Freshwater | MKSLIKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD*
Ga0105047_101310883 | 3300009083 | Freshwater | MKTLVKKKVETQKVNTKKVAQCCAKTSKTVVGCHD*
Ga0115026_113714052 | 3300009111 | Wetland | MKSLVKKKEGKVKANENAKPAQCCAKTSKIIVGCHD*
Ga0115027_100702105 | 3300009131 | Wetland | MKSLVKKKIEIQKVNTKKVAQCCAKTSKTVVGCHD*
Ga0115027_104051211 | 3300009131 | Wetland | IETMKNLVKKKVKVEKANEKKEAQCCAKTSKTVVGCHD*
Ga0114977_100574182 | 3300009158 | Freshwater Lake | MKTLVKKKIENQKTNTKKVAQCCAKTSKAVVGCHD*
Ga0073936_100310631 | 3300009175 | Freshwater Lake Hypolimnion | YNTMKCLVKKQNEKEPIKKVAQCCAKTSKTIVGCHD*
Ga0073936_100561392 | 3300009175 | Freshwater Lake Hypolimnion | MKSLIKKKVKVQKAETKKVAQCCAKTSKTIVGCHD*
Ga0114951_100022046 | 3300009502 | Freshwater | MIMKTLIKKKVQKAVEKKPAQCCAKTSKVVAGCHD*
Ga0114951_1000355318 | 3300009502 | Freshwater | MKSLVKKKVKAPKAETKKVAQCCAKTSKTIVGCHD*
Ga0114951_101329144 | 3300009502 | Freshwater | QQRLINYFFMKSLIKKKEPKKVEVAQCCAKTSRTVVGCHD*
Ga0115592_11049662 | 3300009755 | Wetland | KSLVKKKEGKEKANENAKPAQCCAKTSKIIVGCHD*
Ga0115595_10631541 | 3300010138 | Wetland | LKIETMKSLIKKKIKVEKETAKKEAQCCAKTSKTVVGCHD*
Ga0116245_100984632 | 3300010338 | Anaerobic Digestor Sludge | MKSLIKIKQPTKKQSPKKAQCCAKLSKTTVGCHD*
Ga0138308_10003062 | 3300010965 | Lake Chemocline | MKTLVKKKIETQKVNTKKVAQCCAKTSKTVVGCHD*
Ga0138308_10132916 | 3300010965 | Lake Chemocline | MKSLVKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD*
Ga0151652_100686432 | 3300011340 | Wetland | MKSLFKKKERKANETAKPAQCCAKISKIIVGCHD*
Ga0151652_127417782 | 3300011340 | Wetland | MKSLIKKKIKVEKENAKKEAQCCAKTSKTVVGCHD*
Ga0151652_137531822 | 3300011340 | Wetland | KIQTMKSLVKKKVKAPKAETKKVAQCCAKTSKTIVGCHD*
Ga0173609_105783571 | 3300013315 | Sediment | IPVIMKSLIKKKESKVNAKPAQCCAKTSKIIVGCHD*
Ga0075315_10125172 | 3300014258 | Natural And Restored Wetlands | VLLIFKIDFMKSLVKRKEKKLKTKEAQCCAKTSKNVAGCHD*
Ga0182027_103091691 | 3300014839 | Fen | NKFNITHSKKLSMKSLVKKKEISSNTKTAQCCAKTSKTIVGCHD*
Ga0207193_100233527 | 3300020048 | Freshwater Lake Sediment | MKTLVKKKIENQKTNTKKVAQCCAKTSKAVVGCHD
Ga0207193_10203925 | 3300020048 | Freshwater Lake Sediment | MKSLIKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD
Ga0194039_10022569 | 3300020163 | Anoxic Zone Freshwater | MKSLVKKKIKIEKANSKKVAQCCAKTSKTIVGCHD
Ga0214167_10555762 | 3300021136 | Freshwater | VITITQKLIFMKSLIKKKEEPKAVKIAQCCAKTSRTVVGCHD
Ga0194053_100157564 | 3300021520 | Anoxic Zone Freshwater | MKSLVKKKIKVEKASTKKVAQCCAKTSKTIVGCHD
Ga0194053_100654002 | 3300021520 | Anoxic Zone Freshwater | TKISTMKSLVKKKVKVEKVNTKKVAQCCAKTSKTVMGCHD
Ga0212088_1000277119 | 3300022555 | Freshwater Lake Hypolimnion | MKSLVKKKTNVEKANTKKTAQCCAKTSKTIVGCHD
Ga0212088_100254706 | 3300022555 | Freshwater Lake Hypolimnion | MIMKTLIKKKVQKAVEKKPAQCCAKTSKVVAGCHD
Ga0212088_100600324 | 3300022555 | Freshwater Lake Hypolimnion | MKSLVKKKVKAPKAETKKVAQCCAKTSKTIVGCHD
Ga0212088_100724645 | 3300022555 | Freshwater Lake Hypolimnion | NQILTIKTETMKSLIKKKVKVQKAETKKVAQCCAKTSKTIVGCHD
Ga0212088_102420901 | 3300022555 | Freshwater Lake Hypolimnion | FPKYKRGKAIQQRLINYFFMKSLIKKKEPKKVEVAQCCAKTSRTVVGCHD
Ga0209083_10240742 | 3300025162 | Freshwater | MKSLVKKKTEIETANSKKVAQCCAKTSATIVGCHD
Ga0208584_11518031 | 3300025533 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKPAQCCAKTSKIIVGCHD
Ga0209744_12384722 | 3300025692 | Arctic Peat Soil | MTIMKSLIKKKEGKVNAKAAQCCAKTSKIIVGCHD
Ga0209123_10237143 | 3300025718 | Arctic Peat Soil | MKTLVKKKIEIQKVNTQKVAQCCAKTSKTVVGCHD
Ga0209748_11974662 | 3300025836 | Arctic Peat Soil | MSIMKSLIKKKEVKVSVKPAQCCAKTSKTIVGCHD
Ga0209226_104067312 | 3300025865 | Arctic Peat Soil | MTNMKSLIKKKEGKVNAKAAQCCTKTSKIIVGCHD
Ga0208789_100005865 | 3300027265 | Deep Subsurface | MKSLVKKKMEVQKVNTKKTAQCCAKTSKTVVGCHD
Ga0208789_100141314 | 3300027265 | Deep Subsurface | MKSLVKKKIEVQKVNTKKVAQCCAKTSKTIVGCHD
Ga0208789_10043032 | 3300027265 | Deep Subsurface | MKSLVKKKIEVQKVNTKKVAQCCAKTSTTVVGCHD
Ga0209397_100842212 | 3300027871 | Wetland | LKIETMKSLVKKKVKVEKANTKKEAQCCAKTSKAVVGCHD
Ga0208980_100936122 | 3300027887 | Wetland | MKSLIKKKIKVEKENAKKEAQCCAKTSKTTVGCHD
Ga0208980_102390762 | 3300027887 | Wetland | MKSLVKKKIKVEKENAKKEAQCCAKSSKTVVGCHD
Ga0265296_12001422 | 3300028032 | Groundwater | LLIFKFEFMKSLVKKKTQKTSTKKVAQCCAKTSKTIVGCHD
(restricted) Ga0247838_11911932 | 3300028044 | Freshwater | TMKSLIKKKVKVEKVNSKKVAQCCAKTSKTIVGCHD
Ga0268284_100154314 | 3300028176 | Saline Water | MKSLVKKKVKVQKADTKKVAQCCAKTSKTIVGCHD
Ga0265593_100207211 | 3300028178 | Saline Water | MKSLVKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD
(restricted) Ga0247844_10072629 | 3300028571 | Freshwater | MKSLIKKKVKVEKVNSKKVAQCCAKTSKTIVGCHD
(restricted) Ga0247840_100540186 | 3300028581 | Freshwater | MKSLVKKKVKVEKVSTKKVAQCCAKTSKTIVGCHD
(restricted) Ga0247840_101423922 | 3300028581 | Freshwater | MKSLIKKKIKVEKANSKKVAQCCAKTSKTIVGCHD
(restricted) Ga0247842_100414504 | 3300029268 | Freshwater | MKSLVKKKVKVEKVNTKKVAQCCAKTSKTIVGCHD
(restricted) Ga0247841_102221002 | 3300029286 | Freshwater | MKTLVKKTVKVEKISTKSSAQCCAKTSKNVPGCHD
(restricted) Ga0247841_102574392 | 3300029286 | Freshwater | MKSLVKKKIKVEKENAKKEAQCCAKTSKTVVGCHD
Ga0265327_1000136118 | 3300031251 | Rhizosphere | MKSLVKKKTKEKTETKTKTALCCAKTSKAIPGCHD
Ga0311364_119883482 | 3300031521 | Fen | MKSLVKKKLEAQKVNTKKVAQCCAKTSKTVVGCHD
Ga0302322_1001667501 | 3300031902 | Fen | LTQSKFRIMKSLIKKKETRTIVKVAQCCAKTSKTVVGCHD
Ga0315268_1000555313 | 3300032173 | Sediment | MKSLIKKKVETQKVNTKKVAQCCAKTSKTVVGCHD
Ga0315270_100066259 | 3300032275 | Sediment | MKSLVKKKIETQKVNTPKVAQCCAKTSKTVVGCHD
Ga0315270_100359052 | 3300032275 | Sediment | MKTLVKKKIEAQKINTKKVAQCCAKTSKTVVGCHD
Ga0315270_100619314 | 3300032275 | Sediment | MKSLVKKKIEVQKVDTKKVAQCCAKTSKTVVGCHD
Ga0335397_102185891 | 3300032420 | Freshwater | IFNMKTLVKKKVETQKVNTKKVAQCCAKTSKTVVGCHD
Ga0335397_102713694 | 3300032420 | Freshwater | MKTLVKKKVETQKVNTKKVAQCCAKTSKTVVGCHD
Ga0316619_113035241 | 3300033414 | Soil | YLNKIMKSVIKKKETRKQIKTAQCCAKTSKTVVGCHD
Ga0316625_1009081662 | 3300033418 | Soil | MKSLVKKKVKVEKANEKKEAQCCAKSSKTVVGCHD
Ga0316624_117540941 | 3300033486 | Soil | YFKSQITIMKSLIKNKANKVKAKVAQCCAKISKTIAGCHD
Ga0316616_1012454202 | 3300033521 | Soil | MKSLVKKKIEVQKVNTQKVAQCCAKTSKTVVGCHD
Ga0316616_1017988132 | 3300033521 | Soil | MKSLVKKKVKVEKANEKKEAQCCAKTSKTVVGCHD
Ga0316616_1020924452 | 3300033521 | Soil | MKSLVKKKIEVKKESAKKTAQCCAKTSKTVVGCHD
Ga0373912_0092525_198_305 | 3300034088 | Sediment Slurry | MKSLVKKKTEVQKVNTKKVAQCCAKTSKTVVGCHD
Ga0370479_0092500_676_792 | 3300034123 | Untreated Peat Soil | IKLLKIMKSLIKKKAERVTAKPAQCCAKTSKTIVGCHD
Ga0370490_0048671_371_478 | 3300034128 | Untreated Peat Soil | MKSLIKKKIKVEKENAKKEAQCCAKTSKNVVGCHD
Ga0370503_0296675_2_124 | 3300034196 | Untreated Peat Soil | TKIEIIKSLLKKKVEVQKVNTKKVAQCCAKTSKTVVGCHD
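
Many of the sequences above share a similar C-terminal region. As a minimal sketch of how that could be checked, the snippet below computes the longest common suffix of four sequences copied from the table. The helper names are illustrative, and treating the trailing `*` as a stop-codon marker to be stripped is an assumption, since the page does not define it.

```python
from typing import Iterable


def strip_stop(seq: str) -> str:
    """Remove a trailing '*' (assumed here to mark a translated stop codon)."""
    return seq.rstrip("*")


def longest_common_suffix(seqs: Iterable[str]) -> str:
    """Return the longest string that every sequence ends with."""
    reversed_seqs = [s[::-1] for s in seqs]
    shared = []
    # Walk from the C-terminus inward until the residues stop agreeing.
    for column in zip(*reversed_seqs):
        if len(set(column)) != 1:
            break
        shared.append(column[0])
    return "".join(reversed(shared))


# Four sequences copied from the table above (Protein ID in the comment).
rows = [
    "MKSLVKKKIEVQKVNTKKVAQCCAKTSKTVVGCHD*",  # TB_PC08_66DRAFT_100021572
    "MKSLVKKKERKANETAKPAQCCAKISKIIVGCHD*",   # JGIcombinedJ13530_1057453212
    "MKSLIKIKQPTKKQSPKKAQCCAKLSKTTVGCHD*",   # Ga0116245_100984632
    "MKSLVKKKTKEKTETKTKTALCCAKTSKAIPGCHD",   # Ga0265327_1000136118
]

cleaned = [strip_stop(s) for s in rows]
shared_tail = longest_common_suffix(cleaned)
print(shared_tail)
```

For these four entries the shared tail is `GCHD`. Entries containing `X` (for example `VGXHD` in JGI24146J26653_10340682) would shorten it, so this is a spot check, not a statement about the whole family.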




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice