NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101831


Overview

Basic Information
Family ID F101831
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 47 residues
Representative Sequence MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG
Number of Associated Samples 45
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 91.18 %
% of genes near scaffold ends (potentially truncated) 15.69 %
% of genes from short scaffolds (< 2000 bps) 79.41 %
Associated GOLD sequencing projects 44
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.51

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (85.294 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (32.353 % of family members)
Environment Ontology (ENVO) Unclassified (35.294 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (49.020 % of family members)
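
The percentages reported in the Overview can be converted back to approximate member counts. The sketch below assumes that every percentage uses the 102 family sequences as its denominator; that assumption is not stated on the page, and the labels in the dictionary are shorthand chosen for this example.

```python
# Sketch: convert the reported percentages into approximate member counts.
# Assumption (not stated on the page): each percentage is taken over the
# 102 sequences of the family.
FAMILY_SIZE = 102  # "Number of Sequences" in Basic Information

reported_percentages = {
    "valid RBS motif": 91.18,
    "near scaffold end": 15.69,
    "short scaffold (< 2000 bp)": 79.41,
    "Bacteria": 85.294,
    "GOLD ecosystem: Soil": 32.353,
    "ENVO: Unclassified": 35.294,
    "EMPO: Soil (non-saline)": 49.020,
}


def percent_to_count(percent, total=FAMILY_SIZE):
    """Return the nearest whole number of members for a percentage."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE}")
```

Under this assumption each reported percentage maps to a whole number of sequences, which is consistent with a denominator of 102.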




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 51.32%, β-sheet: 0.00%, Coil/Unstructured: 48.68%
Feature Viewer (interactive figure): representative sequence MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG with tracks for α-helices, β-strands, coil, secondary structure confidence score, TM segments, and topological domains (cytoplasmic / extracellular).
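
The transmembrane classification can be illustrated with a simple hydropathy profile of the representative sequence. This is a minimal sketch using the Kyte-Doolittle scale and a 19-residue sliding window; it is not the predictor used by NMPFamsDB, and the names `hydropathy_profile` and `TM_THRESHOLD` are introduced here for illustration only.

```python
# Sketch: Kyte-Doolittle hydropathy profile of the representative sequence.
# This is a generic textbook heuristic, not the topology predictor behind
# the database page.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

REPRESENTATIVE = "MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG"

# Commonly quoted cutoff for a 19-residue window (assumption for this sketch).
TM_THRESHOLD = 1.6


def hydropathy_profile(seq, window=19):
    """Return the mean hydropathy of every full window along the sequence."""
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


profile = hydropathy_profile(REPRESENTATIVE)
best_start = max(range(len(profile)), key=profile.__getitem__)

print(f"most hydrophobic window starts at residue {best_start + 1}")
print(f"mean hydropathy of that window: {profile[best_start]:.2f}")
print("candidate TM segment:", profile[best_start] > TM_THRESHOLD)
```

A window average above the cutoff only suggests a membrane-spanning helix; the page's own classification comes from its prediction pipeline.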

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.51
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart): legend entries All Organisms, Unclassified; segment values 85.3%, 14.7%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart): habitat labels Soil, Polar Desert, Polar Desert Sand, Serpentine Soil, Termite Nest, Agricultural Soil, Rock, Tabebuia Heterophylla Rhizosphere, Populus Rhizosphere, Rhizosphere; segment values as extracted: 3.9%, 6.9%, 32.4%, 30.4%, 2.9%, 3.9%, 2.9%, 7.8%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI24973J35851_10425392 | 3300002547 | Polar Desert | MSERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVVALFAG
JGI25406J46586_100130872 | 3300003203 | Tabebuia Heterophylla Rhizosphere | MSESGSQGSWVQSSSMALVVILWALVLLPLLWGVYQTLKGVVALFS*
JGI25407J50210_101119402 | 3300003373 | Tabebuia Heterophylla Rhizosphere | MSEEARQESGGQSSSSSAVLVVVLWAMVLIPLAWGVYQTLLGVVALFTGG*
Ga0081538_100180292 | 3300005981 | Tabebuia Heterophylla Rhizosphere | MSEESRQESGGQTSSSTVLVVVLWALVLIPLAWGVYQTLIGVVALFAGG*
Ga0081538_103742182 | 3300005981 | Tabebuia Heterophylla Rhizosphere | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG*
Ga0082029_13301282 | 3300006169 | Termite Nest | MSEETRQESGGQTSSSTVLAVVLWGLIPLAWGVYQTLLGVLALFAGG*
Ga0082029_14847992 | 3300006169 | Termite Nest | MSEEARQESGGQTSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG*
Ga0079217_108259541 | 3300006876 | Agricultural Soil | MSEEARQQSGGPSSSSSTVLVVVLWALVLIPLAWGVYQTLIGVVALFAGG*
Ga0079215_106887451 | 3300006894 | Agricultural Soil | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLLWGVYQTLTGVVALFS*
Ga0079216_115611042 | 3300006918 | Agricultural Soil | MSEEARQQSGGPSSSSSTVLVVVLWALVLIPLAWGVYQTLTGVVALFSGG*
Ga0105679_101041292 | 3300007790 | Soil | MSEQARQESGGQTSSSSVVLVIVLWALVLIPLLWGIYQTLLGVVALFTGG*
Ga0105679_108371702 | 3300007790 | Soil | MSEEAHQESGGQSSSSSAVLVVVLWALVLIPLAWGVYQTLTAVVALFTGG*
Ga0105679_109198132 | 3300007790 | Soil | MSERAGQEGGGQSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFAG*
Ga0075418_108411932 | 3300009100 | Populus Rhizosphere | MSEEARQESGRQSSSSSMFLVVVLWAMVLIPLLWGVYQTLTGVVALFTG*
Ga0126307_100269787 | 3300009789 | Serpentine Soil | MSESELRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGAVALFS*
Ga0126307_100705262 | 3300009789 | Serpentine Soil | MSEEARQESGGQTSSSSKVLVVVLWALVLIPLAWGVYQTLTGVVALFAGG*
Ga0126307_100738305 | 3300009789 | Serpentine Soil | MSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTGVVALFTGG*
Ga0126313_101084202 | 3300009840 | Serpentine Soil | MSEEARQESGGQTSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126313_103227242 | 3300009840 | Serpentine Soil | MSERARQEGREQTSSSMVLVIVLWAFVLIPLLWGVYQTLTGVVALFAG*
Ga0126313_103882652 | 3300009840 | Serpentine Soil | MSEEARQESGGQTSSSIILVIVLWALVLILLAWGVYQTLTGVVALFTGG*
Ga0126313_104224102 | 3300009840 | Serpentine Soil | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLIGVVALFTGG*
Ga0126313_104565222 | 3300009840 | Serpentine Soil | MSEEARQESGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126305_101810321 | 3300010036 | Serpentine Soil | MSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTG
Ga0126304_108489062 | 3300010037 | Serpentine Soil | MSESESRGGQSSMVLAVVLRALVLIPLLWGIYQTLRGVVALFS*
Ga0126309_100099555 | 3300010039 | Serpentine Soil | MSESESRRSGSQASSSTALVVVLWALVLIPLLWGVYQTLTGVVALFAA*
Ga0126309_100145925 | 3300010039 | Serpentine Soil | MSESESRGSGGQSSTVLVVVLWALVLIPLLWGVYQTLTAVVALFS*
Ga0126309_100455504 | 3300010039 | Serpentine Soil | MSEEARQDSGGQSSSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFTGG*
Ga0126309_100632922 | 3300010039 | Serpentine Soil | MSERAGQEDGGRSSSSMALVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126309_100888762 | 3300010039 | Serpentine Soil | MSERAQQEGGGQTSSSMVPVVVLWALVLVPLLWGVYQTLTGVVALFAGG*
Ga0126309_102532722 | 3300010039 | Serpentine Soil | MSESESRGGGGQSSMVAAVVLWAIVLIPLLWGVYQTLT
Ga0126309_103752052 | 3300010039 | Serpentine Soil | MSERARQEGGEQSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFTG*
Ga0126308_100743233 | 3300010040 | Serpentine Soil | MSEEARQESGGQTSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFAGG*
Ga0126308_101873972 | 3300010040 | Serpentine Soil | MSEEARQESGGQTSSSSMVLVIVLWAMVLIPLAWGVYQTLTGVVALFAGG*
Ga0126308_102972402 | 3300010040 | Serpentine Soil | MSEEARQESGGQTSSSSIVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126308_104512292 | 3300010040 | Serpentine Soil | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126308_108077422 | 3300010040 | Serpentine Soil | MSEEARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLTGVVALFAGG*
Ga0126308_109031532 | 3300010040 | Serpentine Soil | MSEEARQEGGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126312_102013672 | 3300010041 | Serpentine Soil | MSEEARQESGGQTSSRIILVIVLWALVLIPLAWGVYQTLTGVVALFTGG*
Ga0126312_102843962 | 3300010041 | Serpentine Soil | MSESEPRGGQSGVVLAVVLWALVLLPLLWGIYQTLTGVVALFS*
Ga0126312_108592821 | 3300010041 | Serpentine Soil | SSSMVLVVVLWAMVLIPLAWGVYQTLIGVVALFTGG*
Ga0126314_101469284 | 3300010042 | Serpentine Soil | MSEEARQESGEQTSSGMVLVVVLWAMVLIPLVWGVYQTLTGVVALFTGG*
Ga0126310_100126704 | 3300010044 | Serpentine Soil | MSERARQEGREQTSSSMVLVIVLWAFVLTPLLWGVYQTLTGVVALFAG*
Ga0126310_100348921 | 3300010044 | Serpentine Soil | QEDGGQSSSSMAVVVVLWALVLIPLLWGVYQTLLGVVALFAG*
Ga0126310_110804732 | 3300010044 | Serpentine Soil | MSEEARQESGGQSSSSSTVLVVVLWALVLIPLAWGIYQTLTGVVALFSGG*
Ga0126321_10818352 | 3300010145 | Soil | MNERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFTG*
Ga0126321_11442852 | 3300010145 | Soil | MSERAGQEGGGQSSSMVLVVVLWALVLIPLLWGVYQTLTGVAALFAGG*
Ga0126306_105635431 | 3300010166 | Serpentine Soil | MSESELRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGVVALFS*
Ga0127502_106563142 | 3300011333 | Soil | MSEQARQESGGQTSSSSVVLVIVLWALVLLPLLWGIYQTLLGVVALFTGG*
Ga0127502_109459481 | 3300011333 | Soil | NLARKGISMSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS*
Ga0136625_10801981 | 3300012091 | Polar Desert Sand | GSTNWVPVVVLWTVVLIPLLWGVYQTLTSVVALFAG*
Ga0136632_100256222 | 3300012093 | Polar Desert Sand | MSESESRGSGGQTSSSTVLAVVLWVLVLIPLLWGVYQTLTGVVALFS*
Ga0136615_102536412 | 3300012678 | Polar Desert Sand | MTENESGNSSWVLAVVLWALVLIPLAWGVYQTLTSVPALFTGG*
Ga0182001_105145751 | 3300014488 | Soil | MSERAGQEGGGQSSSMVLVVVLWALVLIPLLWGVYQTLL
Ga0182001_105467172 | 3300014488 | Soil | MSEEARQENGGQTSSSSTIFVVVLWAMVLIPLAWGVYQTLLGVVALFTG*
Ga0183260_100032954 | 3300017787 | Polar Desert Sand | MSESESRGSGGQTSSSTVLAVVLWVLVLIPLLWGVYQTLTGVVALFS
Ga0183260_100666614 | 3300017787 | Polar Desert Sand | MNERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVGALFAG
Ga0136617_100475885 | 3300017789 | Polar Desert Sand | MTENESGNSSWVLAVVLWALVLIPLAWGVYQTLTSVPALFTGG
Ga0136617_101832013 | 3300017789 | Polar Desert Sand | MNERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVVALFAG
Ga0190265_135369331 | 3300018422 | Soil | MSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLTAVVALFTGG
Ga0190275_100539032 | 3300018432 | Soil | MSESEPRGGQSSVLLAIVLWAIVLIPLLWGIYQTLIGVVALFS
Ga0190275_100889572 | 3300018432 | Soil | MSESESRGGQSSMVLAVVLWAIVLIPLLWGIYQTLTGVVALFS
Ga0190275_101804663 | 3300018432 | Soil | MSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALF
Ga0190275_102179772 | 3300018432 | Soil | VSESESQGGQSGVVVAVVLWALVLLPLLWGIYQTL
Ga0190275_102502792 | 3300018432 | Soil | MSEQARQESGGQTSSSSVVLVIVLWALVLLPLLWGIYQTLLGVVALFTGG
Ga0190275_102618582 | 3300018432 | Soil | VSEETRQESGGQTSSSIVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_103276932 | 3300018432 | Soil | VSEETRQESGGQTSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_108395402 | 3300018432 | Soil | MSEETRQESGGQTSSSLVLVAVLWAIVLIPLLWGVYQTLLGVVALFS
Ga0190275_116509932 | 3300018432 | Soil | VSEETRQESGGQTSSSVVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190275_118614091 | 3300018432 | Soil | MSESESRGGGSSVILVVVLWALVLIPLLWGVYQTLIGV
Ga0190275_118852132 | 3300018432 | Soil | MSEQAQQEGGGQTSSSMVLVIVRWALVLMPLLWGVYQTLLGVVALFAGV
Ga0190275_131121231 | 3300018432 | Soil | SQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190269_103579422 | 3300018465 | Soil | MSERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLTGVVALFTG
Ga0190269_105160442 | 3300018465 | Soil | MSESESRGSGSSMVLAVVLWALVLIPLLWGVYQTLTG
Ga0190269_109395991 | 3300018465 | Soil | EARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLLGVVALFTGG
Ga0190269_112811682 | 3300018465 | Soil | MSEEARQDSGGQSSSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFTGG
Ga0190268_101192702 | 3300018466 | Soil | MSEAQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190268_105568562 | 3300018466 | Soil | MSEEARQESGGQSSSSSTVLVVVLWALVLVPLAWGVYQTLTGVVALFTG
Ga0190268_105725602 | 3300018466 | Soil | MSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFS
Ga0190268_110423391 | 3300018466 | Soil | RGGQSNMVLAVVLWALVLIPLLWGVYQTLIGVVALFS
Ga0190268_117955582 | 3300018466 | Soil | MTEEARQEGGGQTSSSMVLVFVLWAAVLIPLLWGVYQTLLGVVALFS
Ga0190268_123070682 | 3300018466 | Soil | MSESESRGGQSNMVLAVVLWALVLIPLLWGVYQTLIGVVALFS
Ga0190270_100635472 | 3300018469 | Soil | MSEEARQESGGQTSSMVLVIVLWAMVLIPLAWGVYETLKGVVALFTGG
Ga0190270_103367592 | 3300018469 | Soil | MSESQSRGSGRSGIVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190270_119782542 | 3300018469 | Soil | MSERAGQEGGGQSSSSMVLVVVLWALVLIPLLWGVYQTLLGVVALFTG
Ga0190271_100911564 | 3300018481 | Soil | MSESQSRGSGRSGMVLVVVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190271_123090801 | 3300018481 | Soil | MSESESRGGQSSMVLVAVLWAIVLIPLLWGVYQTLLGVVALFS
Ga0190264_101141522 | 3300019377 | Soil | MSERAQQEGGGQPSSSMVPVVVLWALVLIPLLWGVYQTLTGVVALFAGG
Ga0190264_120672372 | 3300019377 | Soil | MSEQARQESGGQSSSSSTVLVVVLWALVLIPLLWGVYQTLLGVVALFTGG
Ga0190267_106985062 | 3300019767 | Soil | MSESESRGGQSSMVLAVVLWALVLIPLLWGIYQTLTGVVALFS
Ga0190267_108185351 | 3300019767 | Soil | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLLWGVYQTLTGVVALFS
Ga0190267_110449382 | 3300019767 | Soil | MSESQSRGSGQSGVVLVVVLWALVLIPLLWGVYQTLLGVIALFS
Ga0196959_101078222 | 3300021184 | Soil | MSEEARQENGGQTSSSSTILVVVLWAMVLIPLAWGIYQTLLGVVALFTG
Ga0299913_117697281 | 3300031229 | Soil | MSEEARQESGGQTSSSMVLVIVLWAMVLIPLAWGVYQTLIGVVALFTGG
Ga0272433_100385632 | 3300031450 | Rock | MSESGSRRSGGQNSSSMVLVVGLWTIVLIPLLWGVYQALTSVAALFGG
Ga0307408_1022191491 | 3300031548 | Rhizosphere | MSEEARQESGGQTSSSIILVIVLWVLVLIPLAWGVYQTLTGVVALFTGG
Ga0307413_104589182 | 3300031824 | Rhizosphere | MSEEARQESGGQTSSSIILVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307413_106985372 | 3300031824 | Rhizosphere | MSEEARQESGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307406_112861152 | 3300031901 | Rhizosphere | MSEEARQESGGQTSSSSMVLVIVLWAMVLIPLAWGVYQTLTGVVALFAGG
Ga0307407_105060492 | 3300031903 | Rhizosphere | MSEEARQESGGQTSSSSMVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307407_115968062 | 3300031903 | Rhizosphere | MSEEARQESGGQTSSSSIVLVIVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307409_1020890542 | 3300031995 | Rhizosphere | MSEEARQEGGGQTSSSSMVLVVVLWALVLIPLAWGVYQTLTGVVALFTGG
Ga0307415_1018052952 | 3300032126 | Rhizosphere | MSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTGVVALFTGG
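
Several family sequences end with `*` and some lack the leading `M`, so their lengths differ. The sketch below summarizes a handful of the sequences listed above. It assumes that a trailing `*` marks a translated stop codon and that a missing start or stop hints at truncation; both are working assumptions for this example, and the helper names are hypothetical.

```python
from statistics import mean

# Sketch: summarize a few sequences copied from the Family Sequences list.
# Assumption: a trailing "*" marks a translated stop codon and is not a residue.
sample_sequences = [
    "MSERARQEGGEQTSSSMVLVVVLWAFVLIPLLWGVYQTLTGVVALFAG",
    "MSESGSQGSWVQSSSMALVVILWALVLLPLLWGVYQTLKGVVALFS*",
    "MSEETRQESGGQTSSSIVLVIVLWAMVLIPLAWGVYQTLTG",
    "SSSMVLVVVLWAMVLIPLAWGVYQTLIGVVALFTGG*",
]


def residues(seq):
    """Return the sequence without a trailing stop marker."""
    return seq.rstrip("*")


lengths = [len(residues(s)) for s in sample_sequences]
has_start = [s.startswith("M") for s in sample_sequences]
has_stop = [s.endswith("*") for s in sample_sequences]

for seq, n, start, stop in zip(sample_sequences, lengths, has_start, has_stop):
    print(f"{n:3d} residues  start={start}  stop={stop}  {residues(seq)}")

print(f"mean length of this sample: {mean(lengths):.1f} residues")
```

Running the same summary over all 102 sequences would give a figure comparable to the reported average sequence length, provided the page computes it the same way.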




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice