NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F078544

Metagenome Family F078544

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F078544
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 38 residues
Representative Sequence MGRHTAERGPANDLFMATVVGALLLLCVIILALAAN
Number of Associated Samples 89
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 63.79 %
% of genes near scaffold ends (potentially truncated) 25.00 %
% of genes from short scaffolds (< 2000 bps) 81.03 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction Yes
3D model pTM-score0.52

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (64.655 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(31.034 % of family members)
Environment Ontology (ENVO) Unclassified
(31.034 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(48.276 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.
1Ga0066675_104540602
2Ga0066681_100797572
3Ga0070734_100090437
4Ga0070697_1001928201
5Ga0070730_101366672
6Ga0066697_101717582
7Ga0070733_102758512
8Ga0066903_1001608792
9Ga0066903_1003517732
10Ga0066903_1016779872
11Ga0066903_1020448762
12Ga0066903_1047165072
13Ga0066903_1060331322
14Ga0070717_104116482
15Ga0066696_108783402
16Ga0079222_118210142
17Ga0079221_109918912
18Ga0079220_101626852
19Ga0075425_1000382705
20Ga0073928_102729772
21Ga0075435_1000073025
22Ga0075435_1006385051
23Ga0099828_102004593
24Ga0099827_112395952
25Ga0126374_106438612
26Ga0126380_100007617
27Ga0126384_104201972
28Ga0126384_106230632
29Ga0126382_116107212
30Ga0126373_114081021
31Ga0126373_117706152
32Ga0126376_107227092
33Ga0126376_108499662
34Ga0126378_103365152
35Ga0126379_114151432
36Ga0126379_121720652
37Ga0126381_1004992322
38Ga0126381_1014835562
39Ga0126383_110093061
40Ga0137776_11880134
41Ga0137776_12076443
42Ga0137391_111900812
43Ga0137383_100895502
44Ga0137365_101088822
45Ga0137378_103667282
46Ga0137378_108791481
47Ga0137377_101455482
48Ga0137371_105467721
49Ga0137384_101298613
50Ga0126369_110862152
51Ga0126369_120799941
52Ga0134087_104517222
53Ga0134078_103589382
54Ga0132255_1053769532
55Ga0066669_102660832
56Ga0210395_109032881
57Ga0210405_105707001
58Ga0210393_108352291
59Ga0210383_103113382
60Ga0210384_105712921
61Ga0210409_101375591
62Ga0126371_103506692
63Ga0126371_104284472
64Ga0126371_113578442
65Ga0126371_113667922
66Ga0126371_120836522
67Ga0126371_122494411
68Ga0126371_123509442
69Ga0207684_1000298415
70Ga0207684_102365411
71Ga0207646_100588505
72Ga0207664_109669652
73Ga0179587_100219662
74Ga0209684_10311582
75Ga0209178_10368371
76Ga0209060_103724211
77Ga0209701_105384162
78Ga0209590_108616491
79Ga0318516_107756612
80Ga0318516_108748991
81Ga0318534_103005532
82Ga0318534_105959962
83Ga0318534_106101162
84Ga0318515_100579462
85Ga0318515_103416822
86Ga0318555_100334421
87Ga0318555_101183511
88Ga0318542_100560052
89Ga0318542_104548992
90Ga0318560_107438821
91Ga0310686_1064158282
92Ga0318496_105521821
93Ga0307469_104909452
94Ga0318500_107067542
95Ga0307468_1017920382
96Ga0318492_106422082
97Ga0318494_107042782
98Ga0318554_105849862
99Ga0318508_11890891
100Ga0318497_100280403
101Ga0318564_101354151
102Ga0310917_101458312
103Ga0306925_106308582
104Ga0318536_105647492
105Ga0306921_110255401
106Ga0310916_100094867
107Ga0310913_102096582
108Ga0318562_100502501
109Ga0310911_107466842
110Ga0318524_100488441
111Ga0318553_107478202
112Ga0318525_106875452
113Ga0318577_105209172
114Ga0307471_1024293951
115Ga0306920_1020251412
116Ga0335078_100043405
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 40.62%    β-sheet: 0.00%    Coil/Unstructured: 59.38%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MGRHTAERGPANDLFMATVVGALLLLCVIILALAANExtracel.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.52
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Planomonospora
Unclassified
64.7%34.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Sediment
Iron-Sulfur Acid Spring
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Arabidopsis Rhizosphere
Populus Rhizosphere
11.2%20.7%3.4%3.4%3.4%31.0%6.0%4.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066675_1045406023300005187SoilMRGRLVYSFHMGRHTAERGPANDVFMATVVTALLLLCVIILALAAN*
Ga0066681_1007975723300005451SoilMRGRLVYSFHMGRHTAERGPANDVFMATVVTALLLLCVIILALAAT*
Ga0070734_1000904373300005533Surface SoilVYSFFMGRHTAERGPANDLFMATVVGALLLLCVIILALAMN*
Ga0070697_10019282013300005536Corn, Switchgrass And Miscanthus RhizosphereSRVYSFLMGRHTAERGPANDLFMATILAALLLLCVATLALAMN*
Ga0070730_1013666723300005537Surface SoilMGRHASERGPANDLFMAAVLGALLLLCTIVLALAAS*
Ga0066697_1017175823300005540SoilMRGRLVYSFHMGRHTAERGPANDVFMATVVTALLLLCVIILALA
Ga0070733_1027585123300005541Surface SoilMGRHTAERGPANDLFMAAVVVALLLLFMIIMALAANWA*
Ga0066903_10016087923300005764Tropical Forest SoilMGRHAAERGPSNDVFMATVLSALLLLCMIILALASWP*
Ga0066903_10035177323300005764Tropical Forest SoilMGRHAAERGPANDMFMATVVSALLLLCMIILALAGNWA*
Ga0066903_10167798723300005764Tropical Forest SoilMGRHAAERGPSNDMFMATVVSALVLLGMIMLTLAGNWA*
Ga0066903_10204487623300005764Tropical Forest SoilMGRHAAESGPSNDLFMATVLSALLLLFMIILTLASWH*
Ga0066903_10471650723300005764Tropical Forest SoilMGRHAAERGPSNDMFMATVLSALLLLCMIILALASWP*
Ga0066903_10603313223300005764Tropical Forest SoilGARVYSFFMGRHTAERGPANDLFMATVLGALLLLCVIILTLAAN*
Ga0070717_1041164823300006028Corn, Switchgrass And Miscanthus RhizosphereMGRHTAERGPANDLFMATVLGALLLLFVIVLALAAN*
Ga0066696_1087834023300006032SoilMGRHTAERGPANDVFMATVVTALLLLCVIILALAAN*
Ga0079222_1182101423300006755Agricultural SoilMGRHAAERGPSNDVFMATVLSALLLLSMIIFTLASWP*
Ga0079221_1099189123300006804Agricultural SoilMGRHAAERGPSNDLFMATVLSALLLLCLIILALASWP*
Ga0079220_1016268523300006806Agricultural SoilMGRHTAERGPANDLFMATIVGALLLLCVIILTLAAN*
Ga0075425_10003827053300006854Populus RhizosphereMGRHAAERGPSNDLFMATVLSALLLLCMIILALASWP*
Ga0073928_1027297723300006893Iron-Sulfur Acid SpringMGRHTAERGPANDLFMAAVVVALLLLFMIIMALAANWP*
Ga0075435_10000730253300007076Populus RhizosphereMGRHAAERGPSNDLFMATVLSALLLLCIIILALASWP*
Ga0075435_10063850513300007076Populus RhizosphereMGRHTAERGPANDLFMATVLGALFLLCVIILTLAAN*
Ga0099828_1020045933300009089Vadose Zone SoilMGRHTAERGPANDMFMVTVVGALLLLWMIILALAANWP*
Ga0099827_1123959523300009090Vadose Zone SoilMGRHAAERGPANDWFMAVVLSALLLLFTIILALSMN*
Ga0126374_1064386123300009792Tropical Forest SoilMGRHAAERGPSNDMFMVTVVSTLLLLCTIVLTLAGNWA*
Ga0126380_1000076173300010043Tropical Forest SoilMGRHAAERGPANDMFMVTVVSALILFCVIILTLAGNWA*
Ga0126384_1042019723300010046Tropical Forest SoilMGRHAAERGPANDMFMVTVVSALILLCAIILTLAGNWA*
Ga0126384_1062306323300010046Tropical Forest SoilMGRHAAERGPSNDMFMATVVSALFLLCMIVLTLAGNWA*
Ga0126382_1161072123300010047Tropical Forest SoilVYSFFMGRHTAERGPANDLFMATILGALLLLCVIILTLAAS*
Ga0126373_1140810213300010048Tropical Forest SoilMGRHAAERGPSNDVFMATVLSALLLLFMIILALASWP*
Ga0126373_1177061523300010048Tropical Forest SoilMGRHTAERGPANDLFMATVLGALLLLCAIILALAAG*
Ga0126376_1072270923300010359Tropical Forest SoilMGRHTAERGPANDLFMATILGALLLLCVIILTLAAS*
Ga0126376_1084996623300010359Tropical Forest SoilMGRHTAERGPANDLFMATIVGALLLLCVIILTLAMN*
Ga0126378_1033651523300010361Tropical Forest SoilVYSFFMGRHTAERGPANDLFMATVLGALLLLCVIILTLAAN*
Ga0126379_1141514323300010366Tropical Forest SoilRVYSFFMGRHTAERGPANDLFMATVLAALLLLCVIILTLAAN*
Ga0126379_1217206523300010366Tropical Forest SoilVYSFFMGRHTAERGPANDLFMATVVAALLLLCVIILALAAN*
Ga0126381_10049923223300010376Tropical Forest SoilMGRHAAERGPSNDMFMATVLSALLLLCLIILALASWP*
Ga0126381_10148355623300010376Tropical Forest SoilMGRHAAERGPSNDMFMATVLCALLLLCMIILALASWP*
Ga0126383_1100930613300010398Tropical Forest SoilARVYSFFMGRHTAERGPANDLFMATVVGALLLLCVIILTLAAN*
Ga0137776_118801343300010937SedimentMGRHTAERGPANDVFMTTVLAALLVLCVTILALAMN*
Ga0137776_120764433300010937SedimentVYSFFMGRHTAERGPANDLFMTTIVGALLLLCVIILALAMN*
Ga0137391_1119008123300011270Vadose Zone SoilVYSFHMGRHTAERGPANDLFMVAVLSALLVLCAIILALAAN*
Ga0137383_1008955023300012199Vadose Zone SoilMGRHAAERGPANDLFMAVVLSALLLLFAIILALSMN*
Ga0137365_1010888223300012201Vadose Zone SoilMGRHAAERGPANDLFMSVVLSALLLLFTIILALSMN*
Ga0137378_1036672823300012210Vadose Zone SoilMGRHAAERGPANDLFMAVVLSALLLLFTIILALSMN*
Ga0137378_1087914813300012210Vadose Zone SoilMGRHTAERGPANDLFMVAVLSALLVLCAIILTLAAN*
Ga0137377_1014554823300012211Vadose Zone SoilMGRHAAERGPSNDLFMATVLSALLLLCVIILALASWP*
Ga0137371_1054677213300012356Vadose Zone SoilMGRHAAERGPANDLFMAVVLSALLLLFTIVLALSMN*
Ga0137384_1012986133300012357Vadose Zone SoilRHAAERGPANDLFMAVVLSALLLLFTIVLALSMN*
Ga0126369_1108621523300012971Tropical Forest SoilMGRHTAERGPANDLFMATVVGALLLLCVIILALAAN*
Ga0126369_1207999413300012971Tropical Forest SoilMGRHEAGRGPANDMFMVTVVSALLVLLAIIVALAAG*
Ga0134087_1045172223300012977Grasslands SoilMRGRLVYSFHMGRHTADRGPANDVFMATVVTALLLLCVIILALAVN*
Ga0134078_1035893823300014157Grasslands SoilPMRGRLVYSFHMGRHTAERGPANDVFMATVVTALLLLCVIILALAAN*
Ga0132255_10537695323300015374Arabidopsis RhizosphereGRHAAERGPSNDVFMATVLSALLLLSMIIFTLASWP*
Ga0066669_1026608323300018482Grasslands SoilMRGRLVYSFHMGRHTAERGPANDVFMATVVTALLLLCVIILALAAN
Ga0210395_1090328813300020582SoilMGRHTAERGPANDLFMAAVVVALLLLFMIIMALAANWP
Ga0210405_1057070013300021171SoilMGRHTAERGPANDLFMATVVVALLLLFMIVMALAANWA
Ga0210393_1083522913300021401SoilVYSFHMGRHTAERGPANDLFMVAVLSALLVLCAVILALAAN
Ga0210383_1031133823300021407SoilMGRHTAERGPANDSFMAAVVVALLLLFLIIMALAAS
Ga0210384_1057129213300021432SoilMGRHTAERGPANDLFMVAVVSALLVLCAVILALAAN
Ga0210409_1013755913300021559SoilMGRHTAERGPANDLFMATVVVALLLLFMIIMALAANWP
Ga0126371_1035066923300021560Tropical Forest SoilMGRHTAERGPANDLFMATVVGVLLLLCVIILALAMN
Ga0126371_1042844723300021560Tropical Forest SoilMGRHEAGRGPANDMFMVTVVSALLVLLAIIVALAAG
Ga0126371_1135784423300021560Tropical Forest SoilFFMGRHTAERGPANDLFMATVLGALLLLCAIILALAAG
Ga0126371_1136679223300021560Tropical Forest SoilMGRHTAERGPANDLFMATVVGALLLLCVIILALAAN
Ga0126371_1208365223300021560Tropical Forest SoilMGRHAAERGPSNDMFMATVLSALLLLCMIILALASWP
Ga0126371_1224944113300021560Tropical Forest SoilMGRHGAERGPANDLFMATVLSALLLLCTIILALAM
Ga0126371_1235094423300021560Tropical Forest SoilMGRHVAERGPSNDLFMATVLSALLLLCLVIFALASWP
Ga0207684_10002984153300025910Corn, Switchgrass And Miscanthus RhizosphereMGRHAAERGPATDLFMATVLSALLLLFMIIMALAA
Ga0207684_1023654113300025910Corn, Switchgrass And Miscanthus RhizosphereMGRHTAERGPANDLFMAAVLAALLLLCVATLALAMN
Ga0207646_1005885053300025922Corn, Switchgrass And Miscanthus RhizosphereMGRHTAERGPANDLFMATILAALLLLCVATLALAMN
Ga0207664_1096696523300025929Agricultural SoilRHAAERGPSNDLFMATVLSALLLLCVIILALASWP
Ga0179587_1002196623300026557Vadose Zone SoilMGRHTAERGPANDLFMVAVLSALLVLCAVILALAAN
Ga0209684_103115823300027527Tropical Forest SoilMGRHAAERGPANDMFMATVVSALLLLCMIILALAGNWA
Ga0209178_103683713300027725Agricultural SoilVYSFFMGRHTAERGPANDLFMATVVTALLLLFVIILALAMN
Ga0209060_1037242113300027826Surface SoilVYSFFMGRHTAERGPANDLFMATVVGALLLLCVIILALAMN
Ga0209701_1053841623300027862Vadose Zone SoilMGRHTAERGPANDMFMVTVVGALLLLWMIILALAANWP
Ga0209590_1086164913300027882Vadose Zone SoilMGRHAAERGPANDWFMAVVLSALLLLFTIILALSMN
Ga0318516_1077566123300031543SoilMGRHAAERGPSTDLFMASVVSALLLLCTIILALAS
Ga0318516_1087489913300031543SoilMGRHTAERGPANDLFMATVLGALLLLCVIILTLAAN
Ga0318534_1030055323300031544SoilMGRHAAERGPSNDLFMATVLSALLLLCMIILALASWP
Ga0318534_1059599623300031544SoilVYSFFMGRHTAERGPANDLFMATVVAALLLLCVIILTLAAN
Ga0318534_1061011623300031544SoilMGRHTAERGPANDLFMATVLAALLLLCVIILALAAS
Ga0318515_1005794623300031572SoilVYSFFMGRHTAERGPANDLFMATVLAALLLLCVIILALAAS
Ga0318515_1034168223300031572SoilTQVYSFFMGRHTAERGPANDLFMATVVAALLLLCVIILTLAAN
Ga0318555_1003344213300031640SoilMGRPAAERGPSTDLFMATVVSALLLLCTIILALAS
Ga0318555_1011835113300031640SoilMGRHAAERGPSNDVFMATVLSALLLLFMIILALASWP
Ga0318542_1005600523300031668SoilMGRHTAERGPANDLFMTTVVGTLLLLCVIILTLAAN
Ga0318542_1045489923300031668SoilMGRHAAERGPANDMYMATVVSALLLLCMITLTLAGNWA
Ga0318560_1074388213300031682SoilFFMGRHTAERGPANDLFMATVVAALLLLCVIILTLAAN
Ga0310686_10641582823300031708SoilMGRHTAERGPANDLFMATVVVALLLLFMIIMALAAS
Ga0318496_1055218213300031713SoilMGRHTAERGPANDLFMATVLSALLLLCTVILALAS
Ga0307469_1049094523300031720Hardwood Forest SoilMGRHAAERGPSNDLFMATVLSALLLLCVIILALASWP
Ga0318500_1070675423300031724SoilMGRHTAERGPANDLFMATVVAALLLLCVIILTLAAN
Ga0307468_10179203823300031740Hardwood Forest SoilMGRHTAERGPANDLFIATVLGALLLLFVIVLALDAN
Ga0318492_1064220823300031748SoilCRASYMGRHTAERGPANDLFMATVLSALLLLCTVILALAS
Ga0318494_1070427823300031751SoilMGRHTAERGPANDLFMATVVAALLLLCVIILALAAS
Ga0318554_1058498623300031765SoilMGRHAAERGPSNDLFMATVLSALLLLCMIVLALASW
Ga0318508_118908913300031780SoilCMGRHAAERGPSNDLFMATVLSALLLLCMIILALASWP
Ga0318497_1002804033300031805SoilMGRHAAERGPANDLFMATVVSALLLLCTIILALAS
Ga0318564_1013541513300031831SoilASCMGRHAAERGPSNDLFMATVLSALLLLCMIILALASWP
Ga0310917_1014583123300031833SoilMGRHAAERGPSNDLFMATVLSALLLLCMIILALAIWP
Ga0306925_1063085823300031890SoilMGRHAAERGPANDMLMVTVVSALLLLCMIILALASWP
Ga0318536_1056474923300031893SoilASYMGRHTAERGPANDLFMATVLSALLLLCTVILALAS
Ga0306921_1102554013300031912SoilGRHAAERGPSNDLFMATVLSALLLLCMIVLALASWP
Ga0310916_1000948673300031942SoilMGRHTAERGPANDLFMATVVAVLLLLCVIILTLAAN
Ga0310913_1020965823300031945SoilSCMGRHAAERGPSNDLFMATVLSALLLLCMIILALASWP
Ga0318562_1005025013300032008SoilCRASYMGRHAAERGPSTDLFMASVVSALLLLCTIILALAS
Ga0310911_1074668423300032035SoilMGRHAAERGPANDMYMATVVSALLLLCMITLTLAGN
Ga0318524_1004884413300032067SoilMGRHAAERGPSTDLFMATVVSALLLLCTIILALAS
Ga0318553_1074782023300032068SoilCMGRHAAERGPANDLFMATVLSALLLLCMIILALASWP
Ga0318525_1068754523300032089SoilVYSFFMGRHTAERGPANDLFMATVLAALLLLCAIILTLAVN
Ga0318577_1052091723300032091SoilYMGRHAAERGPSNDVFMATVLSALLLLFMIILALASWP
Ga0307471_10242939513300032180Hardwood Forest SoilARGCRGSYMGRHAAERGPSNDVFMATVLSALLLLCVIILALASWP
Ga0306920_10202514123300032261SoilMGRHAAERGPANDMFMVTVVSALLLLCMIILALASWP
Ga0335078_1000434053300032805SoilMGRHGAEREPANDLFMVTVVCALLLLFVIILALAMN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.