NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F072642

Overview

Basic Information
Family ID F072642
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 47 residues
Representative Sequence MTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS
Number of Associated Samples 102
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.51

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.174 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(23.967 % of family members)
Environment Ontology (ENVO) Unclassified
(29.752 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.116 % of family members)
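
The percentages in the Most Common Taxonomy and Most Common Ecosystem blocks can be read back as member counts, since the family has 121 sequences. The following is a minimal Python sketch, assuming each percentage is simply count / 121 rounded to three decimals (the page does not state this). The function name `members_from_percent` is hypothetical.

```python
# Family size taken from "Number of Sequences" in Basic Information.
FAMILY_SIZE = 121


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into a whole member count.

    Assumption: the page computes percent = 100 * count / family_size.
    """
    return round(percent / 100 * family_size)


# Percentages as reported on the page.
reported = {
    "Taxonomy: Unclassified": 99.174,
    "GOLD: Forest Soil -> Soil": 23.967,
    "ENVO: Unclassified": 29.752,
    "EMPO: Soil (non-saline)": 66.116,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, count in counts.items():
    print(f"{label}: {count} of {FAMILY_SIZE}")
```

Under that assumption, 99.174 % corresponds to 120 of the 121 members, which fits the single non-"Unclassified" member implied by the taxonomy block.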




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix 57.33%, β-sheet 0.00%, Coil/Unstructured 42.67%
Feature Viewer (representative sequence MTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS, axis ticks 5 to 45): tracks for Sequence, α-helices, β-strands, Coil, SS Conf. score, TM segments, and Topol. domains (Extracel. / Cytopl.)

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.51

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
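
The note above applies a simple cutoff to the pTM-score. As a minimal sketch of that rule in Python, assuming only the threshold quoted on the page (pTM < 0.7); the function name `is_low_confidence` is hypothetical and is not part of NMPFamsDB:

```python
# Threshold quoted in the "Low Quality Model" note on this page.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


# pTM-score reported for family F072642.
family_ptm = 0.51
print(is_low_confidence(family_ptm))
```

With the reported pTM-score of 0.51, the family falls on the low-confidence side of the cutoff, consistent with the note.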



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (All Organisms): Unclassified 99.2%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart of associated habitat types)
Chart labels: Peatland; Bog Forest Soil; Freshwater Sediment; Watersheds; Vadose Zone Soil; Tropical Forest Soil; Surface Soil; Peatlands Soil; Grass Soil; Soil; Forest Soil; Hardwood Forest Soil; Tropical Peatland; Fen; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Peat Soil; Boreal Forest Soil
Chart percentages: 5.0%, 5.0%, 3.3%, 5.0%, 3.3%, 3.3%, 24.0%, 16.5%, 3.3%, 5.0%, 8.3%, 3.3%



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E41_11019940 | 2170459005 | Grass Soil | LVSGGFGFILMFAMIARVVGEPETLVASAFGILPLAVGLGYFVDAALIHRDLKAS
JGI12270J11330_101888172 | 3300000567 | Peatlands Soil | RAGILLVTGGLGLMIMFTVIARVTSEPETLVAAAFGILPLAVGIGYFLDAALIRRDLKAS
JGIcombinedJ26739_1000736171 | 3300002245 | Forest Soil | PQTLVASAFGILPLAVGLGYFLDAALIRRDLKAS*
Ga0062384_1012324041 | 3300004082 | Bog Forest Soil | ILLVTGGLGFMIMFAVIARVVDEPRTMVAGAFGILPLAVGIGYFLDAALIHRDLKTT*
Ga0062384_1012477311 | 3300004082 | Bog Forest Soil | QIEREPDTWVAAVFGIIPLAIGIGYFLDATLIRRDLKAVS*
Ga0062387_1008034402 | 3300004091 | Bog Forest Soil | RFSGEPETMVAGSFGVLPLAIGLGYFVDAALIRRDLKAA*
Ga0062389_1012131232 | 3300004092 | Bog Forest Soil | IARFSGEPETMVAGSFGVLPLAIGLGYFVDAALIRRDLKAA*
Ga0062386_1001780773 | 3300004152 | Bog Forest Soil | GVGFILMFAMISRAVNEPETMAAAAFGILPLAVGIGYFLDAALIRRDLRAS*
Ga0062386_1002272301 | 3300004152 | Bog Forest Soil | VGFMTMFAVIARVVGEPQTLVAAAFGILPLAVGVGYFLDAAMIRRDLKAS*
Ga0066388_1003195521 | 3300005332 | Tropical Forest Soil | MFAAIARIVNEPETLVAAAFGILPLAVGIGYFLDAALIHRDLKIS*
Ga0066388_1008582463 | 3300005332 | Tropical Forest Soil | IAKVVNEPETLAAAAFGILPLAVGIGYFVDAALLHRDWKAS*
Ga0070714_1006906522 | 3300005435 | Agricultural Soil | GLGFFLMFALIARFSGEPETMVAASFGVLPLAIGLGYFVDAAMIHRDIAKAS*
Ga0070697_1006699512 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | SAGLGFFLMFALIARFSGEPETMVAASFGVLPLAIGLGYFVDAAMIHRDIAKAS*
Ga0066903_1051022521 | 3300005764 | Tropical Forest Soil | RAGILLVSGGLGFMLMFGMIAKAVNEPETLAAAAFGILPLAVGIGYFVDAALLHRDLKAS
Ga0070766_102553873 | 3300005921 | Soil | RVVGEPQTLVASAFGILPLAIGLGYFVDAALIRRDLKAS*
Ga0075029_1000800554 | 3300006052 | Watersheds | LMFGIIARVVGEPETLVVSAFGILPLAVGIGYFVDAAMVRRDLKTS*
Ga0075017_1011195162 | 3300006059 | Watersheds | FALIARIVNEPETMVAAAFGILPLAVGIGYFLDAALIHRDLKAT*
Ga0075014_1001616931 | 3300006174 | Watersheds | GFMVMFAAIARIVQEPETLAAAAVGILPLAVGIGYFLDAALIRRDLKTT*
Ga0070712_1000731131 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | IARFSGEPETMVAASFGVLPLAIGLGYFVDAAMIHRDIAKAS*
Ga0075021_105159151 | 3300006354 | Watersheds | PETLVASAFGILPLAVGLGYFVDAALIRRDLKAS*
Ga0099828_111769591 | 3300009089 | Vadose Zone Soil | PETWVAASLGIIPLAVGLGYFLDWTLIRRDLHPSS*
Ga0099828_118419222 | 3300009089 | Vadose Zone Soil | IVMFAMIARVLGEPETMVGSAIGVLPLAVGLGYFADAALIRRDLKAS*
Ga0116218_15083982 | 3300009522 | Peatlands Soil | GFMTMFAVIARVVGEHETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS*
Ga0116217_103343782 | 3300009700 | Peatlands Soil | RAGILLVTGGLGFMTMFAVIARVVGEPQTLAAAAVGILPLAVGIGYFLDAAMIHRDLKAS
Ga0126373_115260381 | 3300010048 | Tropical Forest Soil | GFFAMFAMIARAVGEPETMAAAAFGILPLAVGLGYFVDAALIRRDLRAS*
Ga0126376_102595114 | 3300010359 | Tropical Forest Soil | LVSAGIGFILMFAMIARVVGEPETLVASAFGILPLAVGLGYFVDAALIRRDFKAS*
Ga0126378_119227521 | 3300010361 | Tropical Forest Soil | MIARVVGEPQTLVASAFSILPLAVGLGYFVDAALIRRDLKAS*
Ga0136449_1017330692 | 3300010379 | Peatlands Soil | TGGLGFMTMFAVIARVVGEPQTLAAAAVGILPLAVGIGYFVDAAMIHRDLKAS*
Ga0126358_10374722 | 3300010856 | Boreal Forest Soil | VGFIVMFAMIARVVGEPETLVASAFGILPLAIGLGYFVDAALIRRDLKAS*
Ga0137404_102514581 | 3300012929 | Vadose Zone Soil | MFAMIARVVGEPQTLVGSAFGILPLAVGLGYFVDAALIHRDLKAS*
Ga0126369_100046611 | 3300012971 | Tropical Forest Soil | RSRRAGILLVSGGLGFFAMFAMIARAVGEPETMAAAAFGILPLAVGLGYFVDAALIRRDLRAS*
Ga0137420_13226413 | 3300015054 | Vadose Zone Soil | MIARVVGEPATLVASAFGILPLAIGLGYFVDAALIRRDLKAS*
Ga0182036_106615432 | 3300016270 | Soil | VIARVVEEPHTLVAAAFGILPLAVGIGYFLDAALIHKDLRVS
Ga0182041_114550593 | 3300016294 | Soil | EPQTMAAAAFGILPLAVGLGYFVDAALIRRDLRAS
Ga0182035_107698941 | 3300016341 | Soil | RVVGERETLVAAAFGILPLAIGIGYFLDAALIHRELKVS
Ga0182032_102173051 | 3300016357 | Soil | IVQEPHTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0182040_106642531 | 3300016387 | Soil | FMAMFAVIAKVVGEPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0182040_117697121 | 3300016387 | Soil | AGILLVSGGLGFMAMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0182037_105624861 | 3300016404 | Soil | LMFSLIARIVQEPHTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0182037_106236081 | 3300016404 | Soil | EPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0182037_110150651 | 3300016404 | Soil | RSRRAGILLVSGGLGFFVMFAVIARAVGEPQTMAAAAFGILPLAVGLGYFVDAALIRRDLRAS
Ga0182038_104179941 | 3300016445 | Soil | VVGEPETLVAAAFGILPLAIGIGYFLDAALIHRELKAS
Ga0181511_11391941 | 3300016702 | Peatland | LGFMIMFAAIARIVDEPQTLVAASLGILPLAVGIGYFLDAALIHRDMKAT
Ga0187802_100259514 | 3300017822 | Freshwater Sediment | SGGLGFMIMFAVIARVVNEPDTLVASAVGILPLAVGIGYFLDAALIHRDIKAS
Ga0187801_101111801 | 3300017933 | Freshwater Sediment | QFARSRRAGILLVSGGVGFIVMFALIARIVGEPETLAAAAVGILPLAVGIGYFLDAALIHRDIKAS
Ga0187848_104415912 | 3300017935 | Peatland | LVTGGLGFMTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLNAAMIRRDLKAS
Ga0187809_104343433 | 3300017937 | Freshwater Sediment | GFMIMFAVIARVVNEPDTLVASAVGILPLAVGIGYFLDAALIHRDIKAS
Ga0187819_100734941 | 3300017943 | Freshwater Sediment | EPQTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0187819_101955472 | 3300017943 | Freshwater Sediment | SRRAGILLVSGGLGFMIMFAVIARVVNEPDTLVASAVGILPLAVGIGYFLDAALIHRDIKAS
Ga0187779_103129302 | 3300017959 | Tropical Peatland | ARVVEEPHTLVAAAFGILPLALAIGYFLDAAMIHRDIKAS
Ga0187781_107158271 | 3300017972 | Tropical Peatland | MFAAIAKIVNEPQTLVAAAFGILPLAVGIGYFVDAALIHRDLKAS
Ga0187777_102545602 | 3300017974 | Tropical Peatland | GLGFMIMFAVIARVVNEPDTLVASAVGILPLAVGIGYFLDAALIHRDIKAS
Ga0187804_102120712 | 3300018006 | Freshwater Sediment | VSGGVGFIVMFALIARIVGEPETLAAAAVGILPLAVGIGYFLDAALIHRDMKTT
Ga0187772_109488562 | 3300018085 | Tropical Peatland | MLMFAVIARVVEEPHTLVAAAFGILPIAVGIGHFVDAALIHRDMKTA
Ga0187771_100422775 | 3300018088 | Tropical Peatland | FMIMFAVIARVVEEPHTMVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0187771_104648822 | 3300018088 | Tropical Peatland | GFMTMFAVIARVVGEPETLAAAAFGILPLAVGIGYFVDATLIHRDLTAS
Ga0182028_12436533 | 3300019788 | Fen | MTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS
Ga0182028_15429904 | 3300019788 | Fen | MTMFAVIARVVGEPGDSGCGAFGILPLAVGNRILLDAAMIHRDLKAS
Ga0210399_104583081 | 3300020581 | Soil | AGILLVSGGVGLIVMFAMISRLLGEPETMVGSAIGVLPLAVGLGYFVDAALIHRDLKAS
Ga0210406_103485511 | 3300021168 | Soil | RSRRAGILLVSGGVGFILMFAMIARVVGEPQTLVGSAFGILPLAVGLGYFVDAALIHRDLKAS
Ga0210406_104446413 | 3300021168 | Soil | FILMFAMIARVVGEPQTLVASAFGILPLAVGLGYFVDAALIRRDLKAS
Ga0210406_104653511 | 3300021168 | Soil | LIARFSGEPQTMVAASFGVLPLAIGLGYFVDAAMIHRDIAKAS
Ga0210400_106998912 | 3300021170 | Soil | MLVMFAMIARLLGEPETMVGSAIGVLPLAVGLGYFVDAALIHRDLKAS
Ga0210393_110379741 | 3300021401 | Soil | RIVQEPETMVAAALGILPLAVGIGYFVDAALIHRDLKAT
Ga0210387_112340861 | 3300021405 | Soil | EPETMVAGSFGVLPLAIGLGYFVDAALIRRDLKTA
Ga0210394_101271704 | 3300021420 | Soil | LLVSGGVGMIVMFAMIARLLGEPETMVGSAIGVLPLAVGLGYFVDAALIHRDLKAS
Ga0210392_100225331 | 3300021475 | Soil | ALIARFSGEPETMVAASFGVLPLAIGLGYFVDAAMIHRDIAKAS
Ga0210392_114472371 | 3300021475 | Soil | SRLLGEPETMVGSAIGVLPLAVGLGYFVDAVLIRRDLKAS
Ga0210398_101112581 | 3300021477 | Soil | LVSGGLGFMLMFAVIARVVAEPETMAAAAFGILPLAVGIGYFVDATLIHRDIKAT
Ga0210398_101852281 | 3300021477 | Soil | RAGVLLVSGGVGFILMFAMIARVLGEPETLVASAFGILPLAVGMGYFLDAALIRRDLKAS
Ga0210410_106207943 | 3300021479 | Soil | GLGFIVMFAMIARVLGEPETLVASAFGVLPLAIGLGYFVDAALIRRDLKAS
Ga0210409_117079393 | 3300021559 | Soil | RVVGEPETLVASAFGILPLAVGLGYFVDAALIHRDLKAS
Ga0242672_11261761 | 3300022720 | Soil | SGGVGMIVMFAMIARLLGEPETMVGSAIGVLPLAVGLGYFVDAALIHRDLKAS
Ga0242657_12037891 | 3300022722 | Soil | LGEPETLVASAFGVLPLAIGLGYFVDAALIRRDLKAS
Ga0137417_14623556 | 3300024330 | Vadose Zone Soil | MIARVVGEPQTLVGSAFGILPLAVGLGYFVDAALIHRDLKAS
Ga0247668_10192251 | 3300024331 | Soil | GGGFIVMFAMIARVLGEPETLVASAFGILPLAIGLGYFVDAALIHRDLKAS
Ga0207818_10203081 | 3300026868 | Tropical Forest Soil | GFMTMFAVIARVVEEPHTLVAAAFGILPVAVGIGYFLDAALIHKDLKAS
Ga0207805_10314681 | 3300026887 | Tropical Forest Soil | VIARVVEEPHTLVAAAFGILPVAVGIGYFLDAALIHKDLKAS
Ga0207824_10367341 | 3300026990 | Tropical Forest Soil | VVEEPQTLVAAAFGILPLAIGIGYFLDAALIHKDLKAS
Ga0208859_10478421 | 3300027069 | Forest Soil | FILMFAMIARVLGEPETLVASAFGILPLAVGMGYFLDAALIRRDLKAS
Ga0207780_10313902 | 3300027313 | Tropical Forest Soil | VVGERETLVAAAFGILPLAIGIGYFLDAALIHRELKVS
Ga0207761_10436541 | 3300027516 | Tropical Forest Soil | HFAQEPQTMVAAAFGILPLVVGLGYFVDATLIQRDMKAS
Ga0209524_11381211 | 3300027521 | Forest Soil | SGGLGFILMFALIARFSGEPETMVAGSFGVLPLAIGLGYFVDAALTRRDLKTA
Ga0209736_11226673 | 3300027660 | Forest Soil | EPQTLVASAFGILPLAVGLGYFVDAALIRRDLKAS
Ga0207826_10347742 | 3300027680 | Tropical Forest Soil | IARVVEEPHTLVAAAFGILPVAVGIGYFLDAALIHKDLKAS
Ga0207862_12383742 | 3300027703 | Tropical Forest Soil | MFAVIARVVEEPHTLVAAAFGILPVAVGIGYFLDAALIHKDLKAS
Ga0209580_103299861 | 3300027842 | Surface Soil | VGFILMFAMIARVVGEPQTLVASAFGILPLAVGLGYFVDAALIRRDLKAS
Ga0209701_104293991 | 3300027862 | Vadose Zone Soil | RSRRAGILLVSGGAGFILMFAMIARVVGEPETLVASAFGILPLAVGLGYFVDAALIHRDLKAS
Ga0247682_11179252 | 3300028146 | Soil | FIVMFAMIARVLGEPETLVASAFGILPLAIGLGYFVDAALIHRDLKAS
Ga0302159_100967712 | 3300028646 | Fen | VEPDAWVATYFGLIPLAIGLGYFVDFALIRRDLHTS
Ga0308309_105521912 | 3300028906 | Soil | AVNEPETMAAAALGVFPLAVGIGYFLDAMLIRRDLKAS
Ga0170834_1059393823 | 3300031057 | Forest Soil | RVVGEPETLVASAFGILPLAIGLGYFVDAALIRRDLKAS
Ga0170824_1035186023 | 3300031231 | Forest Soil | VVGEPETLVGSAIGVLPLAVGWGYFVDAALIRRDLKAT
Ga0318541_103001262 | 3300031545 | Soil | VQEPHTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0318538_105416863 | 3300031546 | Soil | VGEPQTMAAAAFGILPLAVGLGYFVDAALIRRDLRAS
Ga0306917_103307521 | 3300031719 | Soil | GILLVSGGLGFFVMFAVIARAVGEPQTMAAAAFGILPLAVGLGYFVDAALIRRDLRAS
Ga0307469_109038983 | 3300031720 | Hardwood Forest Soil | FARSRRAGILLVSGGVGFILMFAMIARVVGEPQTLVASAFGILPLAIGMGYFVDAALIRRDLKAS
Ga0318493_108131591 | 3300031723 | Soil | ILLVSGGLGFFVMFAVIARAVGEPQTMAAAAFGILPLAVGLGYFVDAALIRRDLRAS
Ga0318492_105222441 | 3300031748 | Soil | VVGEPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0318526_101763902 | 3300031769 | Soil | ILLVCGGLGFMAMFAVIAKVVGEPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0318546_104549641 | 3300031771 | Soil | TGILLVCGGLGFMAMFAVIAKVVGEPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0310917_101553261 | 3300031833 | Soil | FMLMFSLIARIVQEPHTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0318512_106462342 | 3300031846 | Soil | FMAMFAVIAKVVGERETLVAAAFGILPLAIGIGYFLDAALIHRELKAS
Ga0306919_107404411 | 3300031879 | Soil | GFMAMFAVIAKVVGEPQALAAAAFGILPLAVGIGYFLDAALIHRELKAS
Ga0306923_104446083 | 3300031910 | Soil | FAVIAKVVGEPETLVAAAFGILPLAIGIGYFLDAALIHRELKAS
Ga0306923_112357782 | 3300031910 | Soil | VNEPETLAAAAFGILPLAVGIGYFVDAALLHRDLKAS
Ga0306923_115480141 | 3300031910 | Soil | LLVCGGFGFMTMFAVIARVVEEPHTLVAAAFGILPLAVGIGYFLDAALIHKDLRVS
Ga0306921_127500611 | 3300031912 | Soil | KVVGEPETLVAAAFGILPVAIGIGYFLDAALIHRELKAS
Ga0310910_107057512 | 3300031946 | Soil | GGLGFMAMFAVIAKVVGEPETLVAAAFGILPVAIGIGYFLDAALIHRELKAS
Ga0310910_112298822 | 3300031946 | Soil | GILLVSGGLGFMLMFGMIAKVVNEPETLAAAAFGILPLAVGIGYFVDAALLHRDLKAS
Ga0318533_106969822 | 3300032059 | Soil | MFSLIARIVQEPHTLVAAAFGILPLAVGIGYFLDAALIHRDLKAS
Ga0306924_114629921 | 3300032076 | Soil | RRAGILLVSGGLGFMVMFAVIARVVNEPETLVASAVGILPLAVGIGYFLDAALIHRDLKT
Ga0306924_120427052 | 3300032076 | Soil | FMAMFAVIARVVGERETLVAAAFGILPLAIGIGYFLDAALIHRELKVS
Ga0307470_101623541 | 3300032174 | Hardwood Forest Soil | LVSGGAGFILMFAMIARVVGEPETLVGSAFGILPLAVGLGYFVDAALIRRDLKAS
Ga0306920_1001683931 | 3300032261 | Soil | VMEEPQTLVAAAFGILPLAIGIGYFLDAALIHKDLKAS
Ga0306920_1031689931 | 3300032261 | Soil | LGFMAMFAVIAKVVGEPETLVAAAFGILPLAIGIGYFLDAALIHRELKAS
Ga0335079_122265601 | 3300032783 | Soil | AAIARIVSEPETLVAAAFGILPLAVGIGYFLDAALIHRDLKVS
Ga0335069_107913413 | 3300032893 | Soil | FAMIARAVCEPETMAAAAFGILPLAVGLGYFVDAALIRRDLKAS
Ga0335076_117473222 | 3300032955 | Soil | TFALIARFSGEPQTLVAASFGILPLAIGLGYFVDAAMIHRDIAKAS
Ga0335077_102427701 | 3300033158 | Soil | VSGGLGFFLTFALIARFSGEPQTLVAASFGILPLAIGLGYFVDAAMIHRDIAKAS
Ga0371490_10122571 | 3300033561 | Peat Soil | ITMFAVIARVVGEPETLAAAAFGILPLAVGIGYFLDAAMIRRDLKAS
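
Many of the listed sequences end in `*` and share their C-terminal residues with the representative sequence. The Python sketch below shows one naive way to compare a member against the representative. It assumes the trailing `*` is a stop marker that can be dropped and that right-aligning the two strings is a reasonable stand-in for a real alignment. Neither assumption is stated on the page, and the helper names are hypothetical.

```python
# Representative sequence from Basic Information.
REPRESENTATIVE = "MTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS"


def strip_stop(seq: str) -> str:
    """Drop a trailing '*' (assumed here to mark a stop codon)."""
    return seq.rstrip("*")


def suffix_mismatches(seq: str, reference: str = REPRESENTATIVE) -> int:
    """Count differing residues after right-aligning seq with reference.

    Only the overlapping C-terminal stretch is compared; this is a crude
    ungapped comparison, not a substitute for the family alignment.
    """
    seq = strip_stop(seq)
    n = min(len(seq), len(reference))
    if n == 0:
        return 0
    return sum(a != b for a, b in zip(seq[-n:], reference[-n:]))


# Two rows copied from the Family Sequences list (Fen and Peatlands Soil).
fen_member = "MTMFAVIARVVGEPETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS"
peat_member = "GFMTMFAVIARVVGEHETLVAAAFGILPLAVGIGYFLDAAMIHRDLKAS*"

print(len(REPRESENTATIVE))             # matches the reported average of 47 residues
print(suffix_mismatches(fen_member))   # identical to the representative
print(suffix_mismatches(peat_member))  # one substitution in the overlap
```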




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice