NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101831

Metagenome / Metatranscriptome Family F101831

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101831
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 47 residues
Representative Sequence MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG
Number of Associated Samples 45
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 91.18 %
% of genes near scaffold ends (potentially truncated) 15.69 %
% of genes from short scaffolds (< 2000 bps) 79.41 %
Associated GOLD sequencing projects 44
AlphaFold2 3D model prediction Yes
3D model pTM-score0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (85.294 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(32.353 % of family members)
Environment Ontology (ENVO) Unclassified
(35.294 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.020 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58
1JGI24973J35851_10425392
2JGI25406J46586_100130872
3JGI25407J50210_101119402
4Ga0081538_100180292
5Ga0081538_103742182
6Ga0082029_13301282
7Ga0082029_14847992
8Ga0079217_108259541
9Ga0079215_106887451
10Ga0079216_115611042
11Ga0105679_101041292
12Ga0105679_108371702
13Ga0105679_109198132
14Ga0075418_108411932
15Ga0126307_100269787
16Ga0126307_100705262
17Ga0126307_100738305
18Ga0126313_101084202
19Ga0126313_103227242
20Ga0126313_103882652
21Ga0126313_104224102
22Ga0126313_104565222
23Ga0126305_101810321
24Ga0126304_108489062
25Ga0126309_100099555
26Ga0126309_100145925
27Ga0126309_100455504
28Ga0126309_100632922
29Ga0126309_100888762
30Ga0126309_102532722
31Ga0126309_103752052
32Ga0126308_100743233
33Ga0126308_101873972
34Ga0126308_102972402
35Ga0126308_104512292
36Ga0126308_108077422
37Ga0126308_109031532
38Ga0126312_102013672
39Ga0126312_102843962
40Ga0126312_108592821
41Ga0126314_101469284
42Ga0126310_100126704
43Ga0126310_100348921
44Ga0126310_110804732
45Ga0126321_10818352
46Ga0126321_11442852
47Ga0126306_105635431
48Ga0127502_106563142
49Ga0127502_109459481
50Ga0136625_10801981
51Ga0136632_100256222
52Ga0136615_102536412
53Ga0182001_105145751
54Ga0182001_105467172
55Ga0183260_100032954
56Ga0183260_100666614
57Ga0136617_100475885
58Ga0136617_101832013
59Ga0190265_135369331
60Ga0190275_100539032
61Ga0190275_100889572
62Ga0190275_101804663
63Ga0190275_102179772
64Ga0190275_102502792
65Ga0190275_102618582
66Ga0190275_103276932
67Ga0190275_108395402
68Ga0190275_116509932
69Ga0190275_118614091
70Ga0190275_118852132
71Ga0190275_131121231
72Ga0190269_103579422
73Ga0190269_105160442
74Ga0190269_109395991
75Ga0190269_112811682
76Ga0190268_101192702
77Ga0190268_105568562
78Ga0190268_105725602
79Ga0190268_110423391
80Ga0190268_117955582
81Ga0190268_123070682
82Ga0190270_100635472
83Ga0190270_103367592
84Ga0190270_119782542
85Ga0190271_100911564
86Ga0190271_123090801
87Ga0190264_101141522
88Ga0190264_120672372
89Ga0190267_106985062
90Ga0190267_108185351
91Ga0190267_110449382
92Ga0196959_101078222
93Ga0299913_117697281
94Ga0272433_100385632
95Ga0307408_1022191491
96Ga0307413_104589182
97Ga0307413_106985372
98Ga0307406_112861152
99Ga0307407_105060492
100Ga0307407_115968062
101Ga0307409_1020890542
102Ga0307415_1018052952
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 51.32%    β-sheet: 0.00%    Coil/Unstructured: 48.68%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530354045MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTGCytopl.Extracel.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
85.3%14.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Polar Desert
Polar Desert Sand
Soil
Soil
Serpentine Soil
Termite Nest
Agricultural Soil
Soil
Soil
Rock
Tabebuia Heterophylla Rhizosphere
Tabebuia Heterophylla Rhizosphere
Populus Rhizosphere
Rhizosphere
3.9%6.9%32.4%30.4%2.9%3.9%2.9%7.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI24973J35851_104253923300002547Polar DesertMSERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVVALFAG
JGI25406J46586_1001308723300003203Tabebuia Heterophylla RhizosphereMSESGSQGSWVQSSSMALVVILWALVLLPLLWGVYQTLKGVVALFS*
JGI25407J50210_1011194023300003373Tabebuia Heterophylla RhizosphereMSEEARQESGGQSSSSSAVLVVVLWAMVLIPLAWGVYQTLLGVVALFTGG*
Ga0081538_1001802923300005981Tabebuia Heterophylla RhizosphereMSEESRQESGGQTSSSTVLVVVLWALVLIPLAWGVYQTLIGVVALFAGG*
Ga0081538_1037421823300005981Tabebuia Heterophylla RhizosphereMSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG*
Ga0082029_133012823300006169Termite NestMSEETRQESGGQTSSSTVLAVVLWGLIPLAWGVYQTLLGVLALFAGG*
Ga0082029_148479923300006169Termite NestMSEEARQESGGQTSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG*
Ga0079217_1082595413300006876Agricultural SoilMSEEARQQSGGPSSSSSTVLVVVLWALVLIPLAWGVYQTLIGVVALFAGG*
Ga0079215_1068874513300006894Agricultural SoilMSEEARQESGGQTSSSSIVLVIVLWALVLIPLLWGVYQTLTGVVALFS*
Ga0079216_1156110423300006918Agricultural SoilMSEEARQQSGGPSSSSSTVLVVVLWALVLIPLAWGVYQTLTGVVALFSGG*
Ga0105679_1010412923300007790SoilMSEQARQESGGQTSSSSVVLVIVLWALVLIPLLWGIYQTLLGVVALFTGG*
Ga0105679_1083717023300007790SoilMSEEAHQESGGQSSSSSAVLVVVLWALVLIPLAWGVYQTLTAVVALFTGG*
Ga0105679_1091981323300007790SoilMSERAGQEGGGQSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFAG*
Ga0075418_1084119323300009100Populus RhizosphereMSEEARQESGRQSSSSSMFLVVVLWAMVLIPLLWGVYQTLTGVVALFTG*
Ga0126307_1002697873300009789Serpentine SoilMSESELRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGAVALFS*
Ga0126307_1007052623300009789Serpentine SoilMSEEARQESGGQTSSSSKVLVVVLWALVLIPLAWGVYQTLTGVVALFAGG*
Ga0126307_1007383053300009789Serpentine SoilMSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTGVVALFTGG*
Ga0126313_1010842023300009840Serpentine SoilMSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126313_1032272423300009840Serpentine SoilMSERARQEGREQTSSSMVLVIVLWAFVLIPLLWGVYQTLTGVVALFAG*
Ga0126313_1038826523300009840Serpentine SoilMSEEARQESGGQTSSSIILVIVLWALVLILLAWGVYQTLTGVVALFTGG*
Ga0126313_1042241023300009840Serpentine SoilMSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLIGVVALFTGG*
Ga0126313_1045652223300009840Serpentine SoilMSEEARQESGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126305_1018103213300010036Serpentine SoilMSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTG
Ga0126304_1084890623300010037Serpentine SoilMSESESRGGQSSMVLAVVLRALVLIPLLWGIYQTLRGVVALFS*
Ga0126309_1000995553300010039Serpentine SoilMSESESRRSGSQASSSTALVVVLWALVLIPLLWGVYQTLTGVVALFAA*
Ga0126309_1001459253300010039Serpentine SoilMSESESRGSGGQSSTVLVVVLWALVLIPLLWGVYQTLTAVVALFS*
Ga0126309_1004555043300010039Serpentine SoilMSEEARQDSGGQSSSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFTGG*
Ga0126309_1006329223300010039Serpentine SoilMSERAGQEDGGRSSSSMALVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126309_1008887623300010039Serpentine SoilMSERAQQEGGGQTSSSMVPVVVLWALVLVPLLWGVYQTLTGVVALFAGG*
Ga0126309_1025327223300010039Serpentine SoilMSESESRGGGGQSSMVAAVVLWAIVLIPLLWGVYQTLT
Ga0126309_1037520523300010039Serpentine SoilMSERARQEGGEQSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126308_1007432333300010040Serpentine SoilMSEEARQESGGQTSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFAGG*
Ga0126308_1018739723300010040Serpentine SoilMSEEARQESGGQTSSSSMVLVIVLWAMVLIPLAWGVYQTLTGVVALFAGG*
Ga0126308_1029724023300010040Serpentine SoilMSEEARQESGGQTSSSSIVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126308_1045122923300010040Serpentine SoilMSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126308_1080774223300010040Serpentine SoilMSEEARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFAGG*
Ga0126308_1090315323300010040Serpentine SoilMSEEARQEGGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126312_1020136723300010041Serpentine SoilMSEEARQESGGQTSSRIILVIVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126312_1028439623300010041Serpentine SoilMSESEPRGGQSGVVLAVVLWALVLLPLLWGIYQTLTGVVALFS*
Ga0126312_1085928213300010041Serpentine SoilSSSMVLVVVLWAMVLIPLAWGVYQTLIGVVALFTGG*
Ga0126314_1014692843300010042Serpentine SoilMSEEARQESGEQTSSGMVLVVVLWAMVLIPLVWGVYQTLTGVVALFTGG*
Ga0126310_1001267043300010044Serpentine SoilMSERARQEGREQTSSSMVLVIVLWAFVLTPLLWGVYQTLTGVVALFAG*
Ga0126310_1003489213300010044Serpentine SoilQEDGGQSSSSMAVVVVLWALVLIPLLWGVYQTLLGVVALFAG*
Ga0126310_1108047323300010044Serpentine SoilMSEEARQESGGQSSSSSTVLVVVLWALVLIPLAWGIYQTLTGVVALFSGG*
Ga0126321_108183523300010145SoilMNERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFTG*
Ga0126321_114428523300010145SoilMSERAGQEGGGQSSSMVLVVVLWALVLIPLLWGVYQTLTGVAALFAGG*
Ga0126306_1056354313300010166Serpentine SoilMSESELRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGVVALFS*
Ga0127502_1065631423300011333SoilMSEQARQESGGQTSSSSVVLVIVLWALVLLPLLWGIYQTLLGVVALFTGG*
Ga0127502_1094594813300011333SoilNLARKGISMSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS*
Ga0136625_108019813300012091Polar Desert SandGSTNWVPVVVLWTVVLIPLLWGVYQTLTSVVALFAG*
Ga0136632_1002562223300012093Polar Desert SandMSESESRGSGGQTSSSTVLAVVLWVLVLIPLLWGVYQTLTGVVALFS*
Ga0136615_1025364123300012678Polar Desert SandMTENESGNSSWVLAVVLWALVLIPLAWGVYQTLTSVPALFTGG*
Ga0182001_1051457513300014488SoilMSERAGQEGGGQSSSMVLVVVLWALVLIPLLWGVYQTLL
Ga0182001_1054671723300014488SoilMSEEARQENGGQTSSSSTIFVVVLWAMVLIPLAWGVYQTLLGVVALFTG*
Ga0183260_1000329543300017787Polar Desert SandMSESESRGSGGQTSSSTVLAVVLWVLVLIPLLWGVYQTLTGVVALFS
Ga0183260_1006666143300017787Polar Desert SandMNERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVGALFAG
Ga0136617_1004758853300017789Polar Desert SandMTENESGNSSWVLAVVLWALVLIPLAWGVYQTLTSVPALFTGG
Ga0136617_1018320133300017789Polar Desert SandMNERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVVALFAG
Ga0190265_1353693313300018422SoilMSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLTAVVALFTGG
Ga0190275_1005390323300018432SoilMSESEPRGGQSSVLLAIVLWAIVLIPLLWGIYQTLIGVVALFS
Ga0190275_1008895723300018432SoilMSESESRGGQSSMVLAVVLWAIVLIPLLWGIYQTLTGVVALFS
Ga0190275_1018046633300018432SoilMSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALF
Ga0190275_1021797723300018432SoilVSESESQGGQSGVVVAVVLWALVLLPLLWGIYQTL
Ga0190275_1025027923300018432SoilMSEQARQESGGQTSSSSVVLVIVLWALVLLPLLWGIYQTLLGVVALFTGG
Ga0190275_1026185823300018432SoilVSEETRQESGGQTSSSIVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_1032769323300018432SoilVSEETRQESGGQTSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_1083954023300018432SoilMSEETRQESGGQTSSSLVLVAVLWAIVLIPLLWGVYQTLLGVVALFS
Ga0190275_1165099323300018432SoilVSEETRQESGGQTSSSVVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_1186140913300018432SoilMSESESRGGGSSVILVVVLWALVLIPLLWGVYQTLIGV
Ga0190275_1188521323300018432SoilMSEQAQQEGGGQTSSSMVLVIVRWALVLMPLLWGVYQTLLGVVALFAGV
Ga0190275_1311212313300018432SoilSQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190269_1035794223300018465SoilMSERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFTG
Ga0190269_1051604423300018465SoilMSESESRGSGSSMVLAVVLWALVLIPLLWGVYQTLTG
Ga0190269_1093959913300018465SoilEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG
Ga0190269_1128116823300018465SoilMSEEARQDSGGQSSSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFTGG
Ga0190268_1011927023300018466SoilMSEAQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190268_1055685623300018466SoilMSEEARQESGGQSSSSSTVLVVVLWALVLVPLAWGVYQTLTGVVALFTG
Ga0190268_1057256023300018466SoilMSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190268_1104233913300018466SoilRGGQSNMVLAVVLWALVLIPLLWGVYQTLIGVVALFS
Ga0190268_1179555823300018466SoilMTEEARQEGGGQTSSSMVLVFVLWAAVLIPLLWGVYQTLLGVVALFS
Ga0190268_1230706823300018466SoilMSESESRGGQSNMVLAVVLWALVLIPLLWGVYQTLIGVVALFS
Ga0190270_1006354723300018469SoilMSEEARQESGGQTSSMVLVIVLWAMVLIPLAWGVYETLKGVVALFTGG
Ga0190270_1033675923300018469SoilMSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190270_1197825423300018469SoilMSERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFTG
Ga0190271_1009115643300018481SoilMSESQSRGSGRSGMVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190271_1230908013300018481SoilMSESESRGGQSSMVLVAVLWAIVLIPLLWGVYQTLLGVVALFS
Ga0190264_1011415223300019377SoilMSERAQQEGGGQPSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFAGG
Ga0190264_1206723723300019377SoilMSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFTGG
Ga0190267_1069850623300019767SoilMSESESRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGVVALFS
Ga0190267_1081853513300019767SoilMSEEARQESGGQTSSSSIVLVIVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190267_1104493823300019767SoilMSESQSRGSGQSGVVLVVVLWALVLIPLLWGVYQTLLGVIALFS
Ga0196959_1010782223300021184SoilMSEEARQENGGQTSSSSTILVVVLWAMVLIPLAWGIYQTLLGVVALFTG
Ga0299913_1176972813300031229SoilMSEEARQESGGQTSSSMVLVIVLWAMVLIPLAWGVYQTLIGVVALFTGG
Ga0272433_1003856323300031450RockMSESGSRRSGGQNSSSMVLVVGLWTIVLIPLLWGVYQALTSVAALFGG
Ga0307408_10221914913300031548RhizosphereMSEEARQESGGQTSSSIILVIVLWVLVLIPLAWGVYQTLTGVVALFTGG
Ga0307413_1045891823300031824RhizosphereMSEEARQESGGQTSSSIILVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307413_1069853723300031824RhizosphereMSEEARQESGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307406_1128611523300031901RhizosphereMSEEARQESGGQTSSSSMVLVIVLWAMVLIPLAWGVYQTLTGVVALFAGG
Ga0307407_1050604923300031903RhizosphereMSEEARQESGGQTSSSSMVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307407_1159680623300031903RhizosphereMSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307409_10208905423300031995RhizosphereMSEEARQEGGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307415_10180529523300032126RhizosphereMSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTGVVALFTGG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.