NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F068196


Overview

Basic Information
Family ID F068196
Family Type Metagenome / Metatranscriptome
Number of Sequences 125
Average Sequence Length 49 residues
Representative Sequence MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKTIDSQLAAQ
Number of Associated Samples 106
Number of Associated Scaffolds 125

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 89.60 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 93.60 %
Associated GOLD sequencing projects 99
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.57

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
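
The percentages in the Quality Assessment block can be turned back into approximate gene counts. The sketch below assumes that each percentage is a fraction of the family's 125 sequences; the page does not state the denominator, so that is an assumption. The names `FAMILY_SIZE`, `quality_percentages` and `percent_to_count` are illustrative and are not part of NMPFamsDB.

```python
# Assumption: every percentage is taken over the 125 sequences of family F068196.
FAMILY_SIZE = 125

# Values copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motif": 89.60,
    "near scaffold end": 100.00,
    "short scaffold (< 2000 bp)": 93.60,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole gene count."""
    return round(percent * total / 100)


counts = {label: percent_to_count(p) for label, p in quality_percentages.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, the three values correspond to 112, 125 and 117 genes respectively.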
Hidden Markov Model: HMM logo, rendered with Skylign

Most Common Taxonomy
Group Unclassified (65.600 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(43.200 % of family members)
Environment Ontology (ENVO) Unclassified
(55.200 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(48.800 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (interactive view, rendered with MSAViewer).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 57.53%, β-sheet: 0.00%, Coil/Unstructured: 42.47%
Feature Viewer (rendered with Feature Viewer)
Sequence: MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKTIDSQLAAQ
Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, Signal Peptide

Predicted 3D Structure

Structure Viewer (rendered with PDBe Molstar)
Per-residue confidence (pLDDT) color bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.57

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
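
The low-confidence rule quoted above amounts to a single threshold check. The sketch below uses the 0.7 cutoff from this page; the function name `is_low_confidence` and the constant `PTM_THRESHOLD` are illustrative and do not come from NMPFamsDB.

```python
# Cutoff quoted on this page: models with pTM < 0.7 are flagged as low confidence.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a pTM-score falls strictly below the cutoff."""
    return ptm < threshold


# pTM-score reported for family F068196.
print(is_low_confidence(0.57))
```

The comparison is strict, matching the "pTM < 0.7" wording, so a model scoring exactly 0.7 would not be flagged.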



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic breakdown (all organisms): Unclassified 65.6%, remaining groups 34.4%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart legend: Bog Forest Soil; Bog; Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Surface Soil; Peatlands Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Peatland; Tropical Peatland; Tropical Forest Soil; Corn Rhizosphere; Palsa
Slice percentages shown: 4.8%, 5.6%, 43.2%, 12.0%, 4.8%, 4.8%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI26340J50214_100337581 | 3300003368 | Bog Forest Soil | MPIRLRLAVAFAVIAAAVFALGSWLFVAGLASAQLSAIDSQLSAQLTQ
Ga0062386_1009188422 | 3300004152 | Bog Forest Soil | MPIRLRLALACAVAAAIAFALGSWLFISALSSAQLGTIDSQLAVELGQ
Ga0066869_101124122 | 3300005165 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLTTLDSQLTAQLAQAGR
Ga0070711_1009995232 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MPIRLRLAAAFALAAAALFALGGWLFAAGLSSAQLKTLDSQLTAQL
Ga0070735_107162662 | 3300005534 | Surface Soil | MSIRLRLAAAFATVAAALFALGGWAFASGLSSAQLNMIDSQLTVQLAQAARYLPPGSAAR
Ga0066706_104673871 | 3300005598 | Soil | MPIRLRLAVAFGLAAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQAGRYLPGSGTGTAIS
Ga0075019_104411871 | 3300006086 | Watersheds | VPIRLRLALAFAAAAALLFAIGGWLFSSALSAAQLSAIDSQL
Ga0075015_1002133001 | 3300006102 | Watersheds | MPIRLRLAVAFAVIAAAVFALGSWLFVTGLASAQLSAIDSQLSAQLTQA
Ga0070715_100399106 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MPIRLRLATAFALAAAALFALGGWLFASGLSSAQLTTLDSQ
Ga0075014_1003195212 | 3300006174 | Watersheds | MPIRLRLAVAFAVAAAAIFALGGWLFISSLSSAQLGAIDSQLAV
Ga0066659_111825431 | 3300006797 | Soil | MPIRLRLAVAFGLAAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQ
Ga0079220_116343322 | 3300006806 | Agricultural Soil | MSIRLRLALAFAAAGALLFSVGAWLFAATLSAAQLRVIDSQLTAQLAAAPRYLS
Ga0105241_121360171 | 3300009174 | Corn Rhizosphere | MPIRLRLAAAFALAAAALFALGGWLFAAGLSSAQLKTLD
Ga0116221_13739812 | 3300009523 | Peatlands Soil | MPIRLRLAVAFAVAAAAIFALGGWLFISGLSSAQLGTIDSQLA
Ga0105237_124987391 | 3300009545 | Corn Rhizosphere | MPIRLRLALGFAAAAALFFAVGGWIFAAALSSAQLGVIDSQLTAQLTQAARYVPGPARTPASVP
Ga0116215_14121791 | 3300009672 | Peatlands Soil | MPIRLRLAVAFAVIAAAVFALGSWLFVTGLASAQLSAIDSQL
Ga0116216_107421721 | 3300009698 | Peatlands Soil | MPIRLRLALAFALAAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQAGR
Ga0134128_113374282 | 3300010373 | Terrestrial Soil | MSIRLRLALAFAAAAALLFAVGAWLFAATLSAAQLRVIDSQLTA
Ga0136449_1028430012 | 3300010379 | Peatlands Soil | MSIRLRLAVTFAAAAALLFAAGGWLFAAALSSAQLRVIDAQLTAQ
Ga0138573_11113152 | 3300011089 | Peatlands Soil | MPIRLRLAVAFAVAAAAIFALGGWLFISSLSSAQLGAIDSQLAVQLSQAGQYLAAGGPS
Ga0151490_17998561 | 3300011107 | Soil | MPIRLRLAVAFALAAAAVFALGGWLFASGLSSAQLKTIDSQLAAQLAQAGGSL
Ga0137392_111334411 | 3300011269 | Vadose Zone Soil | MPIRFRLAIAFAVAAAAVLALGSWLFISGLSSAQLAAIDSQLAFQLSQAGRYLSAGGQSGSSDST
Ga0137380_107627703 | 3300012206 | Vadose Zone Soil | MPIRLRLAVAFAAIAAAVFALGGWLFASGVASAQLS
Ga0137377_105262363 | 3300012211 | Vadose Zone Soil | MPIRLRLAVAFGLAAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQAGRYLPG
Ga0164309_116051471 | 3300012984 | Soil | MPIRLRLAIAFALAAAALFALGGWLFAAGLSSAQLKTLDSQLTAQLAQAGRYLPA
Ga0157373_110158881 | 3300013100 | Corn Rhizosphere | MPIRLRLAAAFALAAAALFALGGWLFAAGLSSAQLKTLDSQLTAQLAQAGRY
Ga0181530_106497731 | 3300014159 | Bog | MPIRLRLALACAVAAAIAFALGSWLFISALSSAQLGA
Ga0182036_118577852 | 3300016270 | Soil | MPIRLRLAVAFAVAAAALFALGSWAFASGLSSAQLSMIDSQLTVQL
Ga0182041_115255852 | 3300016294 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSMIDSQLTVQLTQAGRYYPPGSATGAATAPNA
Ga0182041_121406682 | 3300016294 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQLAVQLSQAGRYVTAGGQ
Ga0182033_114459622 | 3300016319 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAVRYLPPGRTAG
Ga0182035_117687352 | 3300016341 | Soil | MPIRLRLAVAFAIAAAAVFALGSWLFIASLSAAQLGTIDSQLAIELTQAGRYIAAG
Ga0182032_114379532 | 3300016357 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSMIDSQLTVQLTQAARYLPPGS
Ga0182037_108851592 | 3300016404 | Soil | MPIRLRLAIAFGAAAAAVFALGGWLFISSLSSTQLAGIDS
Ga0182037_111232002 | 3300016404 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSMIDSQLTVQLNQAGRYYPPGSATGAATA
Ga0182039_109475301 | 3300016422 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAAQLTQADRYL
Ga0187807_10167314 | 3300017926 | Freshwater Sediment | MPIRLRLAVAFAVVAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPR
Ga0187807_11030122 | 3300017926 | Freshwater Sediment | MPIRLRLALAFAAAATLLFAIGGWLFSSALSAAQLGAIDSQLTAQLVQAARYLPG
Ga0187806_10400171 | 3300017928 | Freshwater Sediment | MPIRLRLAVAFAVVAAALFALGSWASASGLSSAQLSLIDSQLTVQ
Ga0187814_101746721 | 3300017932 | Freshwater Sediment | MPIRLRLALACAVAAAIAFALGSWLFISALSAAQLGTIDSQLAV
Ga0187809_102044762 | 3300017937 | Freshwater Sediment | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLTTLDSQLTAQLAQAGRYLPAGGTGSAS
Ga0187780_100010841 | 3300017973 | Tropical Peatland | MPIRLRLAIAFGLAAAAIFALGGWLFITSMSSAQLAGIDSQLAAQLAQADRYLAA
Ga0187767_100915242 | 3300017999 | Tropical Peatland | MPIRLRLAVAFAVVAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQ
Ga0187805_101476191 | 3300018007 | Freshwater Sediment | MPIRLRLALAFATAAALLFAVGGWLFAAALSSAQLRVIDSQLTAQLA
Ga0187766_109934982 | 3300018058 | Tropical Peatland | VPIRLRLAVAFAVIAGALFALGGWLFASGLSSAQLSAIDSQLTLQLTQAGRYLPPG
Ga0187771_112602172 | 3300018088 | Tropical Peatland | MPIRLRLAIAFAFAAAAVFALAGWLFINSLSSAQLAGIDSQLAIQLTQAGRYLAEG
Ga0187770_100212601 | 3300018090 | Tropical Peatland | MPIRLRLAVAFAAAAAAVFALGGWLFISSLSSAQLAGIDSQLAVQLSQAGRYLA
Ga0197907_112715912 | 3300020069 | Corn, Switchgrass And Miscanthus Rhizosphere | MSIRLRLALAFAAAAALLFAVGAWLFAATLSAAQLRVIDSQ
Ga0187768_10301091 | 3300020150 | Tropical Peatland | MPIRLRLAIAFGLAAAAIFALGGWLFITSMSSAQLAGIDSQLAAQLAQADRYLAAGS
Ga0207707_103560163 | 3300025912 | Corn Rhizosphere | MPIRLRLALGFAAAAALFFAVGGWIFAAALSSAQLGVIDSQLTAQLTQAARY
Ga0209890_101116191 | 3300026291 | Soil | VRNKGSDVPIRLRLALAFAAATAIAFALGSWLFIGALSSAQLGTIDSQLAVQL
Ga0207777_10852372 | 3300027330 | Tropical Forest Soil | MPIRIRLAIAFALAAAAVFALGGWLFITSLSSAQLGAIDSQLATQLTQ
Ga0209073_104252112 | 3300027765 | Agricultural Soil | MPIRLRLAAAFALAAAALFALGGWLFAAGLSSAQLKTLDS
Ga0209656_100464691 | 3300027812 | Bog Forest Soil | MPIRLRLAIAFAVVAAALYALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSAA
Ga0209580_101852353 | 3300027842 | Surface Soil | MPIRLRLAVAFAVIAAAAFTLGGWLFAGGLSSAQLNTIDSQLTA
Ga0302235_104229641 | 3300028877 | Palsa | MPIRLRLALACAVAAAIAFALGGWLFISALSSAQLGTIDSQLAVELGQAGRYLPADRASG
Ga0318516_101286651 | 3300031543 | Soil | MPIRLRLALAFALAAAALFALGGWLFASGLSSAQLKTI
Ga0318534_105206822 | 3300031544 | Soil | MPIRLRLAVAFAIAAAAVFALGSWLFIASLSAAQLGTIDSQLAIELTQ
Ga0318573_101297143 | 3300031564 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAAQLTQ
Ga0318515_103776072 | 3300031572 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQ
Ga0318555_102871053 | 3300031640 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSAAG
Ga0318542_107764532 | 3300031668 | Soil | MPIRLRLAVAFAVVAAALYALGGWAFASGLSSAQLS
Ga0318561_101207652 | 3300031679 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSTA
Ga0318572_102418121 | 3300031681 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKTIDSQ
Ga0318560_102929612 | 3300031682 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQLAVQLSQAGRYV
Ga0318496_104197971 | 3300031713 | Soil | MPIRLRLAIAFALAAAALFALGGWLLVSSISSAQLSTIDSQL
Ga0318496_104285512 | 3300031713 | Soil | MPIRLRLAVAFAMVAAALFALGSWAFATGLSSAQLST
Ga0306917_106939361 | 3300031719 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQLAVQLSQAGRYVTAGGQPAAAGPP
Ga0306918_103445703 | 3300031744 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAA
Ga0318492_104246462 | 3300031748 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSLIDSQLTVQLTQAAR
Ga0318494_102247672 | 3300031751 | Soil | MPIRLRLAIAFALAAAALFALGGWLLVSSISSAQLST
Ga0318494_102587401 | 3300031751 | Soil | MPIRLRLAIAFGAAAAAVFALGGWLFISSLSSTQLAGIDSQ
Ga0318554_100776351 | 3300031765 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSTAG
Ga0318554_102481691 | 3300031765 | Soil | MSIRLRLALTFAAAAALLFAIGGWLFAAALSAAQLRVIDSQLTAQLAQ
Ga0318554_107629342 | 3300031765 | Soil | MPIRLRLAVAFALTAAALFALGGWLFASGLSSAQLKT
Ga0318526_100926911 | 3300031769 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAAQ
Ga0318526_103562382 | 3300031769 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQLAVQLSQAGQYLTA
Ga0318546_112662571 | 3300031771 | Soil | MPIRLRLAVAFAVVAGALFALGGWAFASGLSSAQLSMIDSQLTIQLTQAGRYYPPGSATGAAAALNA
Ga0318498_101431573 | 3300031778 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSLIDSQLTVQLT
Ga0318566_103799351 | 3300031779 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAAQLTQADRRQRGAGQPV
Ga0318566_105176611 | 3300031779 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSLIDSQ
Ga0318508_11924052 | 3300031780 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVLLTQAARYLPPGSTAGVGLSPPGDYVI
Ga0318552_104103262 | 3300031782 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTID
Ga0318529_105817412 | 3300031792 | Soil | MPIRLRLAVAFAVVAAALFALGSWAFASGLSSAQLSMIDSQLTVQLT
Ga0318557_101239411 | 3300031795 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYL
Ga0318557_101784381 | 3300031795 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLA
Ga0318523_104749072 | 3300031798 | Soil | MPIRLRLAVAFALTAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQA
Ga0318523_105943571 | 3300031798 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAA
Ga0318565_106336012 | 3300031799 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLP
Ga0318497_106700261 | 3300031805 | Soil | MPIRLRLAVAFAAVAAALFALGSWAFASRLSSAQLSLIDSQLTVQLTQAARYLPPGGAADVALGDY
Ga0318568_103714692 | 3300031819 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASSLSSAQLK
Ga0318567_103588442 | 3300031821 | Soil | MPIRLRLAVAFAMVAAALFALGSWAFATGLSSAQLSTID
Ga0318567_108933602 | 3300031821 | Soil | MSIRLRLALTFAAAAALLFAIGGWLFAAALSAAQLRVI
Ga0318499_103154121 | 3300031832 | Soil | MPIRLRLAVAFAVAAAALFALGSWAFASGLSSAQLSMIDSQLTVQLNQ
Ga0318517_102133891 | 3300031835 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGID
Ga0318511_103872261 | 3300031845 | Soil | MPIRLRLAAAFAAVAAAVFALGGWAFASRLSSAQLS
Ga0318511_104776921 | 3300031845 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQ
Ga0318512_102685041 | 3300031846 | Soil | MPIRLRLALAFALAAAALFALGGWLFASGLSSAQLKTIDSQLTAQ
Ga0318495_102998192 | 3300031860 | Soil | MPIRLRLALAFALAAAALFALGGWLFASGLSSAQLKT
Ga0318520_102267323 | 3300031897 | Soil | MPIRLRLAIAFAVAAAALFALGGWLLVSSISSAQLSTIDSQLAV
Ga0318520_102835731 | 3300031897 | Soil | MPIRLRLAVAFALAAAAVFALGSWLFASGLSSAQLTAIDSQLTAQLAQAGRYLPAGS
Ga0306921_104378461 | 3300031912 | Soil | MPIRLRLALAFALTAAALFALGGWLFASGLSSAQLKTIDSQLTAQLAQAGR
Ga0306921_125260482 | 3300031912 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKTI
Ga0310910_109306422 | 3300031946 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDS
Ga0318559_103891282 | 3300032039 | Soil | MPIRLRLAIAFAVVAAALFALGSWAFASGLSSAQLSTIDSQL
Ga0318556_102631471 | 3300032043 | Soil | MPIRLRLAVAFAAIAAALFALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSTAGVSLSPP
Ga0318570_104088741 | 3300032054 | Soil | MSIRLRLALTFAAAAALLFAIGGWLFAAALSAAQLRVIDSQ
Ga0318570_105493771 | 3300032054 | Soil | MPIRLRLAVAFAVVAGALFALGGWAFASGLSSAQLSMIDSQLTVQLTQAGRYY
Ga0318575_101024671 | 3300032055 | Soil | MPIRLRLAVAFALTAAALFALGGWLFASGLSSAQLGVDGLQ
Ga0318505_100900971 | 3300032060 | Soil | MPIRLRLAIAFALAAAAVFALGGWLFITSMSSAQLAGIDSQLAAQLSQADRYLA
Ga0318514_105410922 | 3300032066 | Soil | MPIRLRLALAFALAAAALFALGGWLFASGLSSAQLK
Ga0318524_107520272 | 3300032067 | Soil | MPIRLRLAAAVAVIAAATFALGGWLFASGLSSAQLS
Ga0318553_102716841 | 3300032068 | Soil | MPIRLRLAVAFAVVAAALFALGGWAFASGLSSAQLSMIDSQLTVQLTQAGRY
Ga0306924_108547012 | 3300032076 | Soil | MPIRLRLAVAFAIAAAAVFALGSWLFIASLSAAQLGTIDSQLAIELT
Ga0306924_111876261 | 3300032076 | Soil | MSIRLRLALAFALAAAALFALGGWLFASGLSSAQLKTIDSQLTAQL
Ga0311301_102367471 | 3300032160 | Peatlands Soil | MPIRLRLAVAFAVIAAAAFALGGWLFGSGLSSAQLSAIDSQLTAQLTQAARYLPSGSTARSPTATSPS
Ga0311301_103513994 | 3300032160 | Peatlands Soil | MPIRLRLAIAFAVVAAALYALGSWAFASGLSSAQLSLIDSQLTVQLTQAARYLPPGSAAG
Ga0335085_100502271 | 3300032770 | Soil | MPIRLRLAVAFALAAAAVFALGGWLFASGLSSAQLKTIDSQLAAQLAQAGRSQPAGGYL
Ga0335085_112654891 | 3300032770 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKTIDSQLAAQ
Ga0335078_118326492 | 3300032805 | Soil | MPIRLRLTVAFALAAAVLFALGGWLFASGLSSAQLKTLDSQLTAQLAQA
Ga0335070_110625832 | 3300032829 | Soil | MPIRLRLALAFAVAAAAVFALGGWLFISGLSSAQL
Ga0335071_101434411 | 3300032897 | Soil | MPIRWRLTLAFAATAALLFAIGGWLFAAALSSAQLGVIDAQL
Ga0335073_106359651 | 3300033134 | Soil | MPIRLRLAVAFALAAAAVFALGGWLFASGLSSAQLKTIDSQLTVQLAQAGRSQPAGGYLT
Ga0318519_110027401 | 3300033290 | Soil | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQL
Ga0314862_0087222_600_710 | 3300033803 | Peatland | MPIRLRLAVAFALAAAALFALGGWLFASGLSSAQLKT




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice