NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F096680

Overview

Basic Information
Family ID: F096680
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 104
Average Sequence Length: 38 residues
Representative Sequence: METILPSVIFVSMIVVFGVDATRKRRDFLKKHKRA
Number of Associated Samples: 79
Number of Associated Scaffolds: 104

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 64.42 %
% of genes near scaffold ends (potentially truncated): 13.46 %
% of genes from short scaffolds (< 2000 bps): 74.04 %
Associated GOLD sequencing projects: 72
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.49


Most Common Taxonomy
Group: Bacteria (82.692 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen (8.654 % of family members)
Environment Ontology (ENVO): Unclassified (19.231 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline) (36.538 % of family members)
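
The percentages above appear to be fractions of the 104 sequences in the family. As a minimal sketch under that assumption, they can be converted back to approximate member counts. The function name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
# Family size taken from "Number of Sequences" in the Overview.
FAMILY_SIZE = 104


def members_from_percent(percent, total=FAMILY_SIZE):
    """Convert a '% of family members' value to a whole number of members.

    Assumes the percentage was computed as count / total * 100 and then
    rounded, so rounding the product recovers the count.
    """
    return round(percent / 100 * total)


# Percentages as quoted on the page; the keys are illustrative labels.
shares = {
    "Bacteria (NCBI taxonomy)": 82.692,
    "Fen (GOLD ecosystem)": 8.654,
    "Unclassified (ENVO)": 19.231,
    "Water, non-saline (EMPO)": 36.538,
}

counts = {label: members_from_percent(p) for label, p in shares.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

The same conversion applies to the quality-assessment percentages, provided they are also fractions of the 104 genes.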




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (interactive MSAViewer view of the 104 member sequences).

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 47.62%, β-sheet: 0.00%, Coil/Unstructured: 52.38%

Feature Viewer (representative sequence METILPSVIFVSMIVVFGVDATRKRRDFLKKHKRA; tracks: α-helices, β-strands, coil, secondary-structure confidence score, signal peptide, TM segments, topological domains [extracellular, cytoplasmic])

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
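
The note above applies a simple cutoff: a model with pTM below 0.7 is treated as low confidence. A minimal sketch of that rule follows; the function name `is_low_confidence` is hypothetical, and only the 0.7 threshold and the 0.49 score come from this page.

```python
# Cutoff quoted in the note above (pTM < 0.7 means low confidence).
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm, threshold=PTM_THRESHOLD):
    """Return True when a pTM-score falls below the low-confidence cutoff."""
    return ptm < threshold


# pTM-score reported for family F096680.
print(is_low_confidence(0.49))
```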



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 82.7%
Unclassified: 17.3%

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat types represented: Freshwater Sediment, Wetland, Groundwater Sediment, Freshwater Lake Sediment, Freshwater, Freshwater Microbial Mat, Anoxic Lake Water, Wetland Sediment, Freshwater Wetlands, Sinkhole Freshwater, Groundwater, Sinkhole, Estuarine, Saline Water, Soil, Sediment (Intertidal), Untreated Peat Soil, Natural And Restored Wetlands, Rice Paddy Soil, Deep Subsurface Sediment, Fen, Peat Soil, Activated Sludge, Active Sludge, Wastewater Treatment Plant, Wastewater Effluent



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
LWSO_01169240 | 2088090005 | Freshwater Sediment | METILPSVIFVSMVVIFGVDATRKRRDFLKKHKRA
TB_PC08_66DRAFT_100297381 | 3300000228 | Groundwater | MERIMQTILPSVIFLSIVIVFGVDANRKRRDFLKKRNRA*
TB_PC08_66DRAFT_100393083 | 3300000228 | Groundwater | METIFPSVLFLSMIIVFSVDATRKRREFLKKHNRA*
TB_GS10_10DRAFT_100043016 | 3300000230 | Groundwater | METILPSVIFLSMVVVFGVDATRKRREFLKKRNRI*
TB_GS10_10DRAFT_100974121 | 3300000230 | Groundwater | METILPSVIFVSMIVVFGVDATRKRRDFLKKHKRA*
TB_FS06_10DRAFT_10173623 | 3300000233 | Groundwater | MQTILPSVIFLSIVIVFGVDANRKRRDFLKKRNRA*
JGIcombinedJ13530_1046777422 | 3300001213 | Wetland | METIIPSVIFVSLIVVFGVDARQKRRDFLRKQKQA*
JGIcombinedJ13530_1096392841 | 3300001213 | Wetland | METILPSVIFVSMVVIFGVDATRKRRAFLKKHKRA*
MIS_100012714 | 3300001765 | Sinkhole Freshwater | MQTILPSVIFLSMVIIFGVDATRKRRAFLKKHNHN*
MIS_11412242 | 3300002024 | Sinkhole Freshwater | MELIMETILQSVVFVSMIVVFGVDATRKRREFLKKRNRA*
MIS_1000214017 | 3300002026 | Sinkhole Freshwater | METILPSVIFLSMVVVFGVDATRKRREFLKKHDRA*
MIS_100266013 | 3300002026 | Sinkhole Freshwater | METILPSVIFLSMVVVFGVDATRKRRDFLKKRNRI*
MIS_100411645 | 3300002027 | Sinkhole Freshwater | METILPSVIFLSMVVVFGVDASRKRREFLKKRNRI*
MIS_100574243 | 3300002027 | Sinkhole Freshwater | MGTILPSVIFLSMIVVFSVDAAQKRRAYLKKRKLT*
MIS_101098673 | 3300002027 | Sinkhole Freshwater | METIFPAFIFVSIVVVFSVDANRKRREFLKKYRSE*
JGI20214J51088_105279652 | 3300003432 | Wetland | MESIMETILPSVIFLSMIVVFGVDATRKRREFLKKQKRV*
JGI20214J51650_104083652 | 3300003541 | Wetland | VYHFSIQMESIMETILPSVIFLSMIVVFGVDATRKRREFLKKQKRV*
Ga0066599_1006146911 | 3300004282 | Freshwater | MESIMETILPSVIFLSMVVVFGVDATRKRREFLKKHNRA*
Ga0062378_101207552 | 3300004780 | Wetland Sediment | MYHIEHRNGEHMETILPSVIFVSMIVIFGVDATRKRRDFLKKQKRA*
Ga0071116_10103634 | 3300005077 | Sinkhole | MELSMETILQSVVFVSMIIVFGVDATRKRREFLRKHKRA*
Ga0073900_101981532 | 3300005659 | Activated Sludge | METILPSVIFLSMVVVFSVDATRKRREFLKKHKSA*
Ga0074479_103194743 | 3300005829 | Sediment (Intertidal) | METILPSVIFVSMVVVFGLDATRKRRDFLKKNKRA*
Ga0074479_107405091 | 3300005829 | Sediment (Intertidal) | RNGERMETILPSVIFVSMIVIFGVDATRKRRDFLKKQKNA*
Ga0074471_108515583 | 3300005831 | Sediment (Intertidal) | MGALVPSVIFLSMVIIFGVDAKRKRRDFLKKQKQA*
Ga0074471_110529522 | 3300005831 | Sediment (Intertidal) | MQTILPSVIFLSMVVIFGVDASRKRRDFLKKRNRT*
Ga0075269_101004861 | 3300005905 | Rice Paddy Soil | ETILPSVIFVSMVIVFGVDATRKRRDFLKKHKRA*
Ga0075156_100453933 | 3300005982 | Wastewater Effluent | METILPSVIFVSMVVIFGVDATRKRRDFLKKIKRA*
Ga0082021_10166455 | 3300006092 | Wastewater Treatment Plant | METILSSVIFVSMVLVFGVDATRKRRDYLKKRSRN*
Ga0105044_101936902 | 3300007521 | Freshwater | METILTSVIFVSMVVIFGVDATRKRRDFLKKHKRA*
Ga0105105_100807101 | 3300009009 | Freshwater Sediment | MESIMQTILPSVIFLSMVIVFGVDATLKRREFLKKNNRV*
Ga0105105_104625031 | 3300009009 | Freshwater Sediment | MENFMETILPSVIFLSMIVVFGVDATRKRREFLKKNNRA*
Ga0105105_105578932 | 3300009009 | Freshwater Sediment | MELVMETIVQSVIFISMIVIFGVDASRKRREFLKKQNRS*
Ga0105105_106416512 | 3300009009 | Freshwater Sediment | MESFMETILPSVIFLSMVVVFGVDATRKRREFLKKRNRT*
Ga0105105_108494101 | 3300009009 | Freshwater Sediment | MEIIMETILQSVVFVSMIVVFGVDATRKRREFLRKQKRA*
Ga0105048_1000556410 | 3300009032 | Freshwater | MYHLIIENGAYMETFLPSVIFLSMVVVFSVDATRKRRDFLKKRNRS*
Ga0105098_105425742 | 3300009081 | Freshwater Sediment | MGTYQMEIIMQTILPSVIFLSMIVVFGVDASRKRRDFLRKQKRA*
Ga0105047_1000076632 | 3300009083 | Freshwater | METILPSVIFVSIVVVFSIDATRKRRDFLKKHKRA*
Ga0102851_117934651 | 3300009091 | Freshwater Wetlands | METILPSVIFVSMMIVFGVDATRKRRDFLKKRNRA*
Ga0115027_106079562 | 3300009131 | Wetland | METILPSVIFVSMIVIFGVDASRKRRDFLKKRKRA*
Ga0115027_114569901 | 3300009131 | Wetland | METILPSVIFVSMVVIFGVDAAIKRRDFLKKHKRA*
Ga0115027_114882112 | 3300009131 | Wetland | MEINMETIFQSVVFVSMIIVFGVDATRKRRDFLRKQKRA*
Ga0115027_117673811 | 3300009131 | Wetland | MERFMETLLPSVIFLSMVIVFGVDASRKRREFLKKQNQT*
Ga0105091_101321852 | 3300009146 | Freshwater Sediment | MENIMETILPSVIFLSMVVIFGVDATRKRREFLKKRNRA*
Ga0115028_103683812 | 3300009179 | Wetland | MESIMQTILPSVIFLSMVIVFGVDASRKRRDFLKKNNRV*
Ga0116204_12702261 | 3300010293 | Anoxic Lake Water | MEIIMETILQSVIFISMIVVFGVDATHKRREFLRKQKRA*
Ga0137456_11519751 | 3300011428 | Soil | CRNGEYMETILPSVIFVSMIVIFGVDAARKRRDFLKKHKRA*
Ga0119867_11530302 | 3300012018 | Activated Sludge | METIFPSFLILSMVVVFSVDATRKRREFLNKRKTA*
Ga0137322_10298182 | 3300012146 | Soil | METILPSVIFVSMVVIFGVDATRKRRDFLKKHKRA*
Ga0137350_11202701 | 3300012166 | Soil | LKSKINTLMYHGYIEMESVMETILPSVIFVSIIVVFGVDATRKRRDFLKKHKRA*
Ga0154020_101846352 | 3300012956 | Active Sludge | METILPSVIFLSMVVVISVDATRKRREFLKKHKSA*
Ga0154020_104522233 | 3300012956 | Active Sludge | YMETILPSVIFLSMVVVFSVDATRKRREFLKKHKSA*
(restricted) Ga0172368_101051002 | 3300013123 | Freshwater | METILPSVIFLSMIIVFSVDASRKRREFLKKHKRA*
(restricted) Ga0172368_102648042 | 3300013123 | Freshwater | METILPSVIFLSMIIVFGVDGTRKRREYLKKQKRS*
(restricted) Ga0172369_103162582 | 3300013125 | Freshwater | ETILPSVIFLSMIIVFSVDASRKRREFLKKHKRA*
(restricted) Ga0172367_1000030266 | 3300013126 | Freshwater | MGIQMEIIMETILQSVVFVSMIVVFGVDATRKRREFLRKQKRA*
(restricted) Ga0172373_101172774 | 3300013131 | Freshwater | METLLSPFLFLSMIVVFSVDAQRKRREFLKKHTTA*
Ga0075358_10205601 | 3300014303 | Natural And Restored Wetlands | MAMEINMETIFQSVIFLSMIVVFGVDATRKRREFLRKQKRA*
Ga0075358_10608831 | 3300014303 | Natural And Restored Wetlands | RNGERMETILPSVIFVSMIVIFGVDATRKRRDFLKKHKRA*
Ga0180070_10403412 | 3300015251 | Soil | METILPSVIFVSMVVVFGVDAARKRRDFLKKRDRA*
Ga0184629_102430792 | 3300018084 | Groundwater Sediment | METIFTSIIFVSMVVVFGVDATRKRRDFLKKNKSA
Ga0207193_100208416 | 3300020048 | Freshwater Lake Sediment | METILPSVIFLSMVVVFGVDATRKRREFLKKHDRA
Ga0207193_10282426 | 3300020048 | Freshwater Lake Sediment | METILPSVIFLSMVVVFGVDASRKRREFLKKRNRI
Ga0163152_100750053 | 3300020213 | Freshwater Microbial Mat | METILTSVIFVSMVVIFGVDATRKRRDFLKKHKRA
Ga0206227_11304091 | 3300021063 | Deep Subsurface Sediment | METILPSVIFVSMVVVFGVDAARKRRDFLKKRNRA
Ga0210377_104873952 | 3300021090 | Groundwater Sediment | METILPSVIFLSMIVVFGVDATRKRRDFLKKHKRA
Ga0210324_12723252 | 3300021333 | Estuarine | METILPSVIFVSMVVVFGLDATRKRRDFLKKNKRA
(restricted) Ga0233424_100262513 | 3300023208 | Freshwater | MESIFPSVIFLSMIIIFSVDAKRKRRDFLKKNKRA
(restricted) Ga0233425_1000733716 | 3300024054 | Freshwater | MRIYPLEINMETILQSVVFVSMVIVFGVDATRKRREFLRKQKRA
Ga0209835_11638902 | 3300025115 | Anoxic Lake Water | MEIIMETILQSVIFISMIVVFGVDATHKRREFLRKQKRA
Ga0209575_101257362 | 3300027739 | Freshwater | MELIMETILQSVVFVSMIVVFGVDATRKRREFLKKRNRA
Ga0209288_101075032 | 3300027762 | Freshwater Sediment | METILPSVIFLSMIVVFGVDATRKRREFLKKNNRA
Ga0209288_102287412 | 3300027762 | Freshwater Sediment | MESFMETILPSVIFLSMVVVFGVDASRKRREFLKKRNRI
Ga0209277_101279452 | 3300027776 | Wastewater Effluent | METILPSVIFVSMVVIFGVDATRKRRDFLKKIKRA
Ga0209181_1000037425 | 3300027878 | Freshwater | MYHLIIENGAYMETFLPSVIFLSMVVVFSVDATRKRRDFLKKRNRS
Ga0208980_100862133 | 3300027887 | Wetland | METILPSVIFVSMIVIFGVDATRKRRDFLKKQKRA
Ga0209496_101872873 | 3300027890 | Wetland | MYHIEHRNGEHMETILPSVIFVSMIVIFGVDATRKRRDFLKKQKRA
Ga0209668_106650732 | 3300027899 | Freshwater Lake Sediment | MYHIEHRNGERMETILPSVIFVSIIVIFGVDATRKRRDFLKKQKRA
Ga0268283_10903542 | 3300028283 | Saline Water | METIIPSVIFLSMVVVFGVDANRKRRDFLKKHKRA
Ga0302159_100683761 | 3300028646 | Fen | METILPSIIFASIIVVFGVDATRKRRDFLKKHKRF
Ga0302162_101258512 | 3300028649 | Fen | METILPSVIFVSMVIVFGFDATRKRRDYLKKDKRA
Ga0302167_100462862 | 3300028676 | Fen | METILPSVIFVSMIVIFGVDATRKRRDFLKKHKRA
Ga0268298_1000094019 | 3300028804 | Activated Sludge | METIFPSFLFLSMVVVFSVDATRKRREFLKKRKTA
Ga0268298_100199873 | 3300028804 | Activated Sludge | METLLSPFLFLSMIVVFSVDAKRKRREFLKKHTPA
Ga0268298_103134781 | 3300028804 | Activated Sludge | METILPSVIFLSMVFVIGVDAARKRRDFLKKNKHAS
Ga0302293_100465082 | 3300029981 | Fen | METILPSVIFVSMIVVFGVDATRKRRDFLKKNKSA
Ga0302173_103244721 | 3300030055 | Fen | VSWHCRNGEHMETILPSVIFVSIVVVFGLDATRKRRDFLKKHKRA
Ga0302212_10850092 | 3300030492 | Fen | METILPSVIFVSMIVVFGVDATRKRRDFLKKNKRA
Ga0311366_100709871 | 3300030943 | Fen | METILPSVIFVSMVIVFGFDATRKRREFLKKHKRA
Ga0311364_124097892 | 3300031521 | Fen | EHMETILPSVIFVSIVVVFGLDATRKRRDFLKKHKRA
Ga0302322_1019577492 | 3300031902 | Fen | CMETILPSIIFASIIVVFGVDATRKRRDFLKKHKRF
Ga0335397_1000044241 | 3300032420 | Freshwater | METILPSVIFVSIVVVFSIDATRKRRDFLKKHKRA
Ga0316625_1007466912 | 3300033418 | Soil | MESFMETILPSVIFLSMVVVFGVDATRKRREFLKKRNRV
Ga0326726_110935062 | 3300033433 | Peat Soil | SEVSSHYRNGERMETILPSVIFVSMVVIFGVDAARKRRDFLKKNKRA
Ga0316627_1013510692 | 3300033482 | Soil | VRIQMEIIMETILQSVVFVSMIIVFGVDATRKRRDFLRKQKRA
Ga0316627_1023289572 | 3300033482 | Soil | KHRNGERMETILTSVIFVSIVVVFGVDATRKRRDFLKKHKRA
Ga0316616_1002195883 | 3300033521 | Soil | MRIYQMEIIMETFFQSVVFVSMIIVFGVDATRKRRDFLRKQKRS
Ga0316616_1020614562 | 3300033521 | Soil | METILPSMIFVSMIVIFGVDASRKRRDFLKKHKRA
Ga0316617_1000140305 | 3300033557 | Soil | METILPSVIFVSMVVIFGVDAAIKRRDFLKKHKRA
Ga0316617_1009496392 | 3300033557 | Soil | METIFQSVVFVSMIIVFGVDATRKRRDFLRKQKRA
Ga0335027_0554337_169_276 | 3300034101 | Freshwater | METILPSVIFLSMIIVFSLDATRKRREFLKNQKPT
Ga0370498_042978_872_991 | 3300034155 | Untreated Peat Soil | MESFMETILPSVIFLSMVVVFGVDATRKRRAFLKKRSRI
Ga0370502_0028104_1424_1531 | 3300034156 | Untreated Peat Soil | METILPSVIFVSMVVIFGLDATRKRRDFLKKHKRA
Ga0370507_0108154_51_170 | 3300034158 | Untreated Peat Soil | MESFMETILPSVIFLSMVVIFGVDATRKRRAFLKKRSRI
Ga0370503_0256197_2_127 | 3300034196 | Untreated Peat Soil | YRNGEYMETILPSVIFVSMVVIFGLDATRKRRDFLKKHKRA
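
The rows above lend themselves to simple summaries such as a per-habitat tally or a mean sequence length. The sketch below is a minimal illustration using four rows copied from the table as tuples. The helper names (`clean`, `habitat_counts`, `mean_length`) are hypothetical, and treating the trailing `*` as a stop-codon marker to be stripped is an assumption, not something the page states.

```python
from collections import Counter

# Four rows copied from the table above as
# (protein_id, sample_taxon_id, habitat, sequence) tuples.
ROWS = [
    ("LWSO_01169240", "2088090005", "Freshwater Sediment",
     "METILPSVIFVSMVVIFGVDATRKRRDFLKKHKRA"),
    ("TB_GS10_10DRAFT_100974121", "3300000230", "Groundwater",
     "METILPSVIFVSMIVVFGVDATRKRRDFLKKHKRA*"),
    ("Ga0302159_100683761", "3300028646", "Fen",
     "METILPSIIFASIIVVFGVDATRKRRDFLKKHKRF"),
    ("Ga0302167_100462862", "3300028676", "Fen",
     "METILPSVIFVSMIVIFGVDATRKRRDFLKKHKRA"),
]


def clean(seq):
    """Strip a trailing '*' (assumed here to be a stop-codon marker)."""
    return seq.rstrip("*")


def habitat_counts(rows):
    """Count how many rows fall under each habitat label."""
    return Counter(habitat for _, _, habitat, _ in rows)


def mean_length(rows):
    """Mean residue count per sequence, ignoring any trailing '*'."""
    lengths = [len(clean(seq)) for *_, seq in rows]
    return sum(lengths) / len(lengths)


print(habitat_counts(ROWS).most_common())
print(mean_length(ROWS))
```

Running the same functions over all 104 rows would give the habitat distribution and average length for the whole family.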




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice