NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F069133

Metagenome / Metatranscriptome Family F069133

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069133
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 44 residues
Representative Sequence METTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYG
Number of Associated Samples 104
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 94.35 %
% of genes near scaffold ends (potentially truncated) 99.19 %
% of genes from short scaffolds (< 2000 bps) 90.32 %
Associated GOLD sequencing projects 100
AlphaFold2 3D model prediction Yes
3D model pTM-score0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (72.581 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(36.290 % of family members)
Environment Ontology (ENVO) Unclassified
(42.742 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(38.710 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 30.99%    β-sheet: 0.00%    Coil/Unstructured: 69.01%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540METTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGExtracel.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.37
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains




 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
27.4%72.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Freshwater Sediment
Soil
Iron-Sulfur Acid Spring
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Peatlands Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Palsa
Corn Rhizosphere
Miscanthus Rhizosphere
Plant Roots
Populus Rhizosphere
Corn Rhizosphere
Corn Rhizosphere
Boreal Forest Soil
Tropical Rainforest Soil
5.6%3.2%6.5%6.5%3.2%36.3%7.3%4.0%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10055691913300002245Forest SoilMETTTIPPTLPTVPARPLAAVRRYAGQWGLIAALAALPIYFGVHDLVFGYQVGIVA
Ga0062385_1035929513300004080Bog Forest SoilMETTTITPTQPRVPVRSLAAVRRYAGQWGLIAALAALPVYYGVHDL
Ga0062386_10071426923300004152Bog Forest SoilMETTTVSPPGLPAAPARQLAAVRRYAGQWGLIAALAALPVYYGI
Ga0062386_10097250013300004152Bog Forest SoilMETTTISPTLPTVPVRPLAAVRQHVGQWGLIAALAALPVYYGIH
Ga0066395_1032013123300004633Tropical Forest SoilVATRTISPTRPPAAPARPLAAVRRYAGRWGLIVALAALPV
Ga0066388_10583370623300005332Tropical Forest SoilMETTTISPTPLPTVPVRPLAAVRRYVGQWGLIAALAALPVYYGVHDLV
Ga0008090_1005428923300005363Tropical Rainforest SoilMETTTVSPPSPPVAPVRSLAAVRRYAGQWGLIAALAALPVYYGV
Ga0070710_1120826823300005437Corn, Switchgrass And Miscanthus RhizosphereVNTTTITPGPPATAARPVTAALAAVRRYAGRWGLIVALLALPVYYGV
Ga0070706_10093898223300005467Corn, Switchgrass And Miscanthus RhizosphereMETTTISPTPLPTVPVRPLAAVRRYAGQWGLIAALAALPVYY
Ga0070762_1004552113300005602SoilMETTTITPALPSVPMRRLAAVRRYAGQWGLIAALAAL
Ga0070763_1025879723300005610SoilMETTTVTPTLPTVPVRPLAAVRRYAGQWGLIAALAALP
Ga0070763_1042381013300005610SoilMETTTIPPTLPTVPVRSLAAVRRYAGQWGLIAALAALP
Ga0070717_1133883113300006028Corn, Switchgrass And Miscanthus RhizosphereMETTTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYQV
Ga0075028_10029910513300006050WatershedsMEPVETTTISPTPSPPAVGARPLAAIRRFAGRWGLIAALAALPVYY
Ga0075017_10108667323300006059WatershedsMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYG
Ga0075030_10105775623300006162WatershedsMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYG
Ga0070712_10121994313300006175Corn, Switchgrass And Miscanthus RhizosphereVETTTLSRSPLAATARPLAAVRRVAGRWGLIAALAALPVYYGIS
Ga0075021_1067873023300006354WatershedsMETTTIPPSAPPAARARPLATVRRYAGRWGLIAAL
Ga0075425_10016950743300006854Populus RhizosphereVETTTLSRSPLAATARPLAAVRRVAGRWGLIAALAALP
Ga0075426_1006370013300006903Populus RhizosphereVATRTVSPTRPPTAPARPLAAVRRFAGRWGLIVALAALPVYYG
Ga0116218_115418723300009522Peatlands SoilMETTTVSPPSLPAAPAHPLAAVRRYAGQWGLIAALVALPVYYGVHDLVYGYQVGIV
Ga0105237_1044118913300009545Corn RhizosphereMETTTIPPSAPPAARARPLAAVRRFAGRWGLIAALAA
Ga0116224_1009118413300009683Peatlands SoilMETTTISPTLPAAPVRPLAAVRRHAGQWGLIAALAALPVYYGVH
Ga0116219_1062521723300009824Peatlands SoilMETTTISPTLPTVPVRPLAAVRRYAGQWGLIAALAALPVYYG
Ga0127503_1034644613300010154SoilMEPVETTTISPTPSPPAVTARSLAAVRRYAGRWGLIAALAALP
Ga0126372_1225002123300010360Tropical Forest SoilMETTTISPTPLPTVPVRPLAAVRRYAGQWGLIAALAAL
Ga0126378_1108090013300010361Tropical Forest SoilMETTTIPPSAPPAARARPLAAVRRYAGRWGLIAALAALPVYY
Ga0126379_1353310413300010366Tropical Forest SoilMETTTVSPPGQTAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHD
Ga0126381_10434764723300010376Tropical Forest SoilMETTTVSPPSLPTAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHD
Ga0126381_10495319823300010376Tropical Forest SoilMETTTISPTPLPTVPVRPLAAVRRYVGQWGLIAALAALPVYYGVHDLVYGY
Ga0126361_1122810713300010876Boreal Forest SoilMETTTITPTLPSVPMRRLTAVRRYAGQWGLIAALAALP
Ga0137381_1111567623300012207Vadose Zone SoilVETTTISPSSPPAVAARPLAAVRRFFGRWGLIAALAALPVYYGI
Ga0137379_1151297123300012209Vadose Zone SoilVETTTISPSSPPAVAARPLAAVRRFFGRWGLIAALAALPV
Ga0137377_1145740913300012211Vadose Zone SoilVETRTISPSLPAAPVRPLAVARRYVGRWGLIVALAALPI
Ga0137372_1004845713300012350Vadose Zone SoilMETTTISPSVPPEARVRPLAAVRRYVGRWGLIAALAALPVYYGI
Ga0137394_1139157223300012922Vadose Zone SoilMELVETTTISPTPSPPAVGARPLAAVRRFAGRWGLIAALAALPVYY
Ga0137413_1086786523300012924Vadose Zone SoilMEPVETTTTSPSPPAVAARRLAAVRRYVGRWALIAALA
Ga0126369_1155988213300012971Tropical Forest SoilMETTTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALAALPV
Ga0126369_1301743923300012971Tropical Forest SoilMETTTISPAPLPTVPVRPLAAVRRYAGQWGLIAALAALPVYY
Ga0126369_1347476613300012971Tropical Forest SoilVATRTVSPTSPPAAPARPLVAVRRYVGRWGLIVALA
Ga0157371_1138921423300013102Corn RhizosphereVNTTTITPSPPATAARPAAAALAALRRYVGRWGLIAALLALPVYYA
Ga0157370_1045434623300013104Corn RhizosphereVATRTISPTSPPAAPARPLAAVRRFVGRWGLIAALAALPVYYGIH
Ga0137409_1066831613300015245Vadose Zone SoilMEPVETTTISPTPSPPAVGARPLAAVRRFAGRWGLIAALAALPI
Ga0182036_1049385623300016270SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGY
Ga0182037_1134618323300016404SoilMETTTVSPPSLPAAPARPLAAVRRYAGQWGLIAALAALPVYYGVH
Ga0182037_1153391413300016404SoilVATRTVSPSPPAVRTHPLVAIRHYAGRWGLIVALAALPVYYGIQDLV
Ga0182038_1077757623300016445SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALA
Ga0187802_1000272013300017822Freshwater SedimentMETTTVPPPSLPAAPARSLGAVRRYVGQWGLIAALAALPVYYGVHDLVYGYQ
Ga0187807_111991223300017926Freshwater SedimentMETTTITPTLPAVPARPLAAVRRYAGQWGLIAALAALPVYYGI
Ga0187801_1016657713300017933Freshwater SedimentMETTTITPTLPTVPVRPLAAVRRYAGQWGLIAALAALPVYYG
Ga0187808_1028424123300017942Freshwater SedimentVETTTISPSPPPVVAARPLAAVRRFFGRWGLIAALAA
Ga0187808_1052469123300017942Freshwater SedimentMETTTVSPPSLPAAPARQLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYQV
Ga0187822_1018030323300017994Freshwater SedimentMETTTVSPPSLPAAPVRRLAAVRRHAGQWGLIAALAALPVYYGIHD
Ga0187810_1000059213300018012Freshwater SedimentMETTIVSPPSLPAAPARSLAAVRRHAGQWGLIAALAALPVYYG
Ga0187851_1002829813300018046PeatlandVETTTISPIPPAVAARPLAAVRRYFGRWGLIAALA
Ga0187765_1061151113300018060Tropical PeatlandVATRTISPASPPKAPARPLVAVRKYAGRWGLIVAL
Ga0213875_1055716413300021388Plant RootsMETTTVSPPSLPAAPARPLAAVRRYAGQWGLIAAL
Ga0212123_1031771823300022557Iron-Sulfur Acid SpringMETTTVTPTLPTVPVRPLAAVRRYAGQWGLIAALAALPVYYG
Ga0179591_124194323300024347Vadose Zone SoilMEPVETTTISPTPSPPAVGARHSRRFAVLAGRWGLIAALAALADLLRHQVI
Ga0207416_108856313300025134Iron-Sulfur Acid SpringMETTTITPTLPSVPMRRLAAVRRYAGQWGLIAALAALPVYYGVHDLV
Ga0207656_1004220533300025321Corn RhizosphereMETTTIPPSAPPAARARPLAAVRRFAGRWGLIAALAALPVYY
Ga0207684_1155675713300025910Corn, Switchgrass And Miscanthus RhizosphereMETTTISPTPLPTVPVRPLAAVRRYAGQWGLIAALAALPVY
Ga0207677_1110364513300026023Miscanthus RhizosphereMEPVETTTISPTPSPPSVAARPLAAVRHYAGRWGL
Ga0209240_123352213300026304Grasslands SoilMETTTISPTPLPTVPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVY
Ga0208042_105969613300027568Peatlands SoilVETTTISPSPPPALAARPLAAVRRFFGRWGLIAALAALPVY
Ga0209166_1065306323300027857Surface SoilMETTTISPTLPAVPTRPLAAVRRYAGQWGLIAALAALPVYYGVHDLV
Ga0209167_1068789013300027867Surface SoilMETTTVSPPGLPAAPARQLAAVRRYAGQWGLIAALAALPVYYGVHDLV
Ga0209380_1002761043300027889SoilMETTTITPTLPTVPLRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYG
Ga0209380_1051784923300027889SoilMETTTITPTLPKVPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYG
Ga0302228_1032428323300028808PalsaMETTTITPTQPTVPVRSLAAVRRYAGQWGLIAALAALPVYYG
Ga0302179_1014609223300030058PalsaVETTTISPIPPPVAAHPLAAVRRYFGRWGLIAALAALP
Ga0210254_1128100013300030602SoilMETTTITPTLPSVPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYQVTV
Ga0310915_1038835913300031573SoilMETTTIPPSAPPAARARPLAAVRRYAGRWGLIAALAALPVYYGI
Ga0318555_1023048823300031640SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVY
Ga0318561_1055511213300031679SoilMETTTVSPPSLPAAPVRQLAAVRRYAGQWGLIAALAALPVY
Ga0318572_1043603613300031681SoilMETTTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALA
Ga0310686_11070302423300031708SoilVNTTITPSPPALAARPAATVLAALRRYGGRWGLIVALAALPVYY
Ga0318496_1005323213300031713SoilMETTTLSPSAPPAARARPLAAVRRYAGRWGLIAALAALPVYY
Ga0318496_1029507323300031713SoilMETTTVSPPSLPAAPVSPLAAVRRHAGQWGLIAALAALPVYY
Ga0318496_1032124323300031713SoilMEATTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYG
Ga0307476_1012645523300031715Hardwood Forest SoilMETTTVSPPSLPAAPARQLAAVRRYAGQWGLIAALA
Ga0318493_1010665023300031723SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYT
Ga0318494_1012140333300031751SoilMEATTVSPPSPPVAPVRQLAAVRRHAGQWGLIAALAALPVYYGVHDLVYGYT
Ga0318535_1020928923300031764SoilMETTTVSPPSPPVAPVHPLAAVRRYAGQWGLIAALAALPVYYG
Ga0318509_1022530123300031768SoilMETTTVSPPSLPAAPARPLAAVRRYAGEWGLIAALAALPVYYGVHDLVYGYTV
Ga0318509_1066849713300031768SoilMETTTLSPSAPPAARARPLAAVRRYAGRWGLIAALAAI
Ga0318521_1030559723300031770SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGY
Ga0318521_1038031013300031770SoilMETTTVSPPSLPATPVRPLAAVRRYAGQWGLIAALAALPVYYGVHD
Ga0318543_1010864223300031777SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSVAG
Ga0318543_1050667323300031777SoilMETTTVSPPSPPVAPVHPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSVAG
Ga0318566_1015795423300031779SoilMETTTVSPPSLPAAPARPLAAVRRYAGQWGLIAALAALPVYYGVHDLV
Ga0318547_1051659823300031781SoilMETTTISPTLPTVPVRPLAAVRRYAGQWGLIAALA
Ga0318547_1094822523300031781SoilMEATTVSPPSPPVAPVRQLAAVRRHAGQWGLIAALAALPVYYG
Ga0318548_1067861913300031793SoilMETTTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHD
Ga0318503_1007849813300031794SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYT
Ga0318523_1039772423300031798SoilMETTTVSPPSQPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSVA
Ga0318567_1008682133300031821SoilMETTTLSPSAPPAARARPLAAVRRYAGRWGLIAALAALPV
Ga0318567_1059135223300031821SoilMETTTVSPPSPPVAPVHPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYG
Ga0318499_1029386023300031832SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAAQPV
Ga0310917_1083355923300031833SoilMETTTIPPSAPPAARARPLAAVRRYAGRWGLIAALAA
Ga0318517_1005222813300031835SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDL
Ga0318517_1044701013300031835SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHD
Ga0318511_1049629123300031845SoilMETTTVSPPSLPTAPVRPLAAVRRYAGQWGLIGALAALPVYYGVHDLVYGYQV
Ga0318495_1026152923300031860SoilMETTTISPTLPTVPVRPLAAVRRYAGQWGLIAALAALPVY
Ga0318495_1045479523300031860SoilMETTTVSPPSLPAAPARPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSV
Ga0306919_1099892423300031879SoilMETTTVSPPSLPAAPARSLATVRRHAGQWGLIAALAALPVYY
Ga0306923_1062492023300031910SoilMETTTVSPPSQPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYT
Ga0310909_1008832443300031947SoilMETTTVSPPSPPVAPVHPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSV
Ga0318531_1025914823300031981SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYY
Ga0306922_1086996723300032001SoilMETTTVSPPSPPVAPVHPLAAVRRYAGQWGLIAALAALPV
Ga0318569_1014507123300032010SoilMPYDTPHGAMETTTISPSAPPAARARPLAAVRRYAGRWGLIAALAALPVY
Ga0318506_1051978923300032052SoilMETTTVSPPSLPAAPARPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTV
Ga0318570_1001450543300032054SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVSVA
Ga0318570_1036326713300032054SoilMEATTVSPPSPPVAPVRQLAAVRRHAGQWGLIAALAALPVYYGVHDLVY
Ga0318575_1069188513300032055SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYY
Ga0318510_1002649113300032064SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALP
Ga0318524_1024736813300032067SoilMPYDTPHGAMETTTISPSAPPAARARPLAAVRRYAGRWGLIAALAALPVYY
Ga0318553_1059253823300032068SoilMETTTLSPSAPPAARARPLAAVRRYAGRWGLIAALAA
Ga0318553_1060927423300032068SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVFGY
Ga0306920_10149811223300032261SoilMETTTVSPPSPPVAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVY
Ga0306920_10370953623300032261SoilMETTTLSPSAPPAARARPLAAVRRYAGRWGLIAALAALPVYYG
Ga0335080_1124770823300032828SoilMETTTASPPSLPAAPVRQLAAVRRHAGQWGLIAALAALPVYYGV
Ga0310914_1053020813300033289SoilMETTTVSPPSLPAAPVRRLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTVS
Ga0318519_1051850913300033290SoilMETTTVSPPSLPAAPVRPLAAVRRYAGQWGLIAALAALPVYYGVHDLVYGYTV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.