NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F092506


Overview

Basic Information
Family ID F092506
Family Type Metagenome / Metatranscriptome
Number of Sequences 107
Average Sequence Length 41 residues
Representative Sequence MSLLAPIRRHRADQTASSGLLRSVRLFRLFLAEQSDPETF
Number of Associated Samples 93
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 96.26 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (69.159 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (43.925 % of family members)
Environment Ontology (ENVO) Unclassified (44.860 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified (43.925 % of family members)
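The percentages above are fractions of the 107 family members, so they can be converted back to approximate member counts. The sketch below is illustrative only: the function name `members_from_percent` and the dictionary labels are hypothetical, and it assumes each percentage was computed against the "Number of Sequences" value (107) reported under Basic Information.

```python
# Assumption: percentages are relative to the family size listed on this page.
FAMILY_SIZE = 107  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Round a reported percentage of family members to the nearest whole count."""
    return round(percent / 100 * family_size)


# Percentages copied from the Most Common Taxonomy / Ecosystem blocks.
reported = {
    "Taxonomy: Unclassified": 69.159,
    "GOLD: Forest Soil / Soil": 43.925,
    "ENVO: Unclassified": 44.860,
    "EMPO: Unclassified": 43.925,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} members")
```

Under that assumption, each reported percentage corresponds to a whole number of sequences, which is a quick sanity check when comparing this page against a downloaded copy of the family data.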




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment view, powered by MSAViewer: 107 labelled sequences; column ruler runs from 1 to 70. The sequence labels are the Protein IDs listed under Family Sequences.]


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 39.71%, β-sheet: 0.00%, Coil/Unstructured: 60.29%
[Feature Viewer tracks over positions 1 to 40: Sequence (MSLLAPIRRHRADQTASSGLLRSVRLFRLFLAEQSDPETF), α-helices, β-strands, Coil, SS Conf. score. Powered by Feature Viewer.]

Predicted 3D Structure

pTM-score: 0.41
[Structure Viewer, powered by PDBe Molstar. Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100.]

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

[Pie chart, powered by ApexCharts: All Organisms 30.8%, Unclassified 69.2%]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Pie chart of associated habitat types, powered by ApexCharts. Legend entries, in page order: Bog; Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Tropical Forest Soil; Grass Soil; Peatlands Soil; Agricultural Soil; Soil; Grasslands Soil; Soil; Soil; Hardwood Forest Soil; Soil; Tropical Peatland; Forest Soil; Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Palsa; Arabidopsis Rhizosphere; Rhizosphere Soil; Switchgrass Rhizosphere. Slice labels, not paired with legend entries in the source: 6.5%, 2.8%, 2.8%, 4.7%, 43.9%, 7.5%, 2.8%, 3.7%, 4.7%, 5.6%, 3.7%.]



Associated Samples


Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
FG2_06248630 | 2189573004 | Grass Soil | MSLMAPTRRHSADQAPPSGLLRSVRLFRLFLAEQSDPEKFYSSLA
JGI12635J15846_105465041 | 3300001593 | Forest Soil | MSLLAPIRRRRGNQAARSGVLRSIRLFRLFLGEQNDQETFYVSLAEDAVQQVAE
JGI12053J15887_106347691 | 3300001661 | Forest Soil | MSLMAPTSRQCADQATPVGLRRSVRLFRLFLAEQSDPEK
Ga0066654_104161001 | 3300005587 | Soil | MSLMAPTRRHSADQAPPSGLLRSIRLFRLFLAEQSDPE
Ga0070761_110080952 | 3300005591 | Soil | MSLLAPIRRHRAEQAARSGLLRSVRLFRLFLAEQAEPE
Ga0070763_106101721 | 3300005610 | Soil | MSLLGPIRRRRGNQAARSGVLRSIRLFRLFLGEQND
Ga0070717_111894451 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLLAPIRRHRVDQHSVDRTTRSGLLRSVRLFRLF
Ga0075017_1010758801 | 3300006059 | Watersheds | MGLLAPIRRHRADQANLSPDQATHSGLRRSVRLFRLFMAEQTDPEKFYASL
Ga0070712_1000965203 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | LSLLAPNRRYRADQATPSGLRRSVRLFRLFLAEQSDPE
Ga0070765_1009812551 | 3300006176 | Soil | MSLLAPTRRHRADQAPPSGLRRSVRLFRLFLAEQSDP
Ga0070765_1014912622 | 3300006176 | Soil | MSLLGPIRRRRGNQAARSGVLRSIRLFRLFLGEQNDQETFYLSLAEDAVQQVAEHT
Ga0070765_1021263211 | 3300006176 | Soil | MSLLAPTRRHCADQPPPAGLRRSVRLFRLFRAEQSDPEKFYAS
Ga0079221_113919812 | 3300006804 | Agricultural Soil | MSLLAPIRRRSADQATRSADQETSSGLRRSVRLFRLFLAEQTDPEKFYAGLAEAAV
Ga0116224_103397111 | 3300009683 | Peatlands Soil | MSLLAPIRRHPADQANSSGLLRSVRLFLLFLAEQAEPEKFYA
Ga0116216_101601981 | 3300009698 | Peatlands Soil | MSLLAPIRRRPAAQAKSSGLLRSVRLFRLFLAEQAEPEK
Ga0116216_103130172 | 3300009698 | Peatlands Soil | MSLQAPIRRDRADQTARSVVLRSVRLFRLFLAEQSDPETFYTG
Ga0136449_1026377922 | 3300010379 | Peatlands Soil | MSLLAPIRRHRADQTAHSGVLRSVRLFRLFLAEQSDPE
Ga0137383_110977921 | 3300012199 | Vadose Zone Soil | MSLLSPIRRRRADQATRSGLLRSIRLFRMFLAEQADPEKFYAYLAEDAVQQVAEHC
Ga0137380_108475371 | 3300012206 | Vadose Zone Soil | MSLLAPNRRYRADQATPSGLRRSVRLFRLFLAEQSDPEK
Ga0137369_103122501 | 3300012355 | Vadose Zone Soil | MSLMAPTRRHSADQAPPSGLLRSVRLFRLFLADQRDPEQFYAS
Ga0164309_115289701 | 3300012984 | Soil | MSLMAPTRRHSADQAPNSGLLRSIRLFRLFLADQGDPGRFNGAWP
Ga0181522_100464791 | 3300014657 | Bog | MSLLGPIRRRRGNQAARSGVRRSIRLFRLFLGEQNDQETFYV
Ga0132257_1012673781 | 3300015373 | Arabidopsis Rhizosphere | MSLMAPTRRHSADQAPPSGLLRSVRLFRLFLAEQSDPEKFYASLAEDA
Ga0182033_107335602 | 3300016319 | Soil | MSLLAPRRPRADHTASSGLLRSVRLFRLFLAEQSDPE
Ga0182033_120207891 | 3300016319 | Soil | MSLLAPNRLDRADQAVPSGLKRSFRLFRLFLAEQSDPEQFYASLATDAVQQ
Ga0182035_104532892 | 3300016341 | Soil | MSLQAPIRRHGANQASSTGLLRSVRLFRLFLSEQSDPETFY
Ga0182032_106784541 | 3300016357 | Soil | MSLLAPIRRHRADQTASSGLLRSVRLFRLFLAEQSDPETF
Ga0182040_106615332 | 3300016387 | Soil | MSLLAPNRRHVDQAAGSGLLRSVRLFRLFLAEQTDPEKFY
Ga0187802_101759141 | 3300017822 | Freshwater Sediment | MSLPAPIRRHRDDQTARSGVLRSVRLFRLFLAEQSDPEAFYTGLAED
Ga0187820_12955612 | 3300017924 | Freshwater Sediment | MSLLAPRRPRADQTASSGLLRSIRLFRLFLAEQSDPETFYGNLAEDAVQQ
Ga0187806_11050012 | 3300017928 | Freshwater Sediment | MSLLAPIRRHRSDQAAPSGLLRSVRLFRLFLAEQTDPETFYAS
Ga0187814_103169372 | 3300017932 | Freshwater Sediment | MSLQAPIRRDRADQTARSGVLRSVRLFRLFLAEQS
Ga0187808_102306571 | 3300017942 | Freshwater Sediment | MSLLAPIRRHRAGQAADSGLLRSVRLFRLFLAEQSDPEK
Ga0187819_100807543 | 3300017943 | Freshwater Sediment | MSLPAPIRRHRDDQTARSGVLRSVRLFRLFLAEQSDPEAFY
Ga0187819_104887122 | 3300017943 | Freshwater Sediment | MSLLAPIRRHRADQAAGSGLLRSVRLFRLFLAEQSDPETFYASLAED
Ga0187779_106009301 | 3300017959 | Tropical Peatland | MSLLAPDRHHPADQARSSGLLRSLRLFRLFLAEQTDPEKFYAS
Ga0187777_101357121 | 3300017974 | Tropical Peatland | MSLLAPNRRHLDQAAGTGLLRSVRLFRLFLAEQADPEKFYTSLAED
Ga0187777_114851021 | 3300017974 | Tropical Peatland | MSLLAPIRHRADQTARSGLLRSVRLFRLFLAEQSDPETFYTSLAE
Ga0066667_112721341 | 3300018433 | Grasslands Soil | MSLMAPTSRPCSDQAPPVRLRRSVRLFRSFPAQHGVPE
Ga0187768_10888562 | 3300020150 | Tropical Peatland | MSLLAPNRRHLDQAAGTGLLRSVRLFRLFLAEQADPEKFYT
Ga0210395_106050892 | 3300020582 | Soil | MSLLAPFRRRAAPSGVRRSVRLFRLFLAEQSDPEAFYVSLAEDA
Ga0210395_114289981 | 3300020582 | Soil | MSLMAPVSRQCADQATPAGLRRSVRLFRLFLAEQSDPEKFYGSLA
Ga0210400_110557801 | 3300021170 | Soil | LSLLAPNRRYRADQATPSGLRRSVRLFRLFLAEQS
Ga0210393_115102021 | 3300021401 | Soil | MSLLAPFRRRAAPSGVRRSVRLFRLFLAEQSDPEAF
Ga0210389_107912611 | 3300021404 | Soil | MSLLAPTRRHCADQPPPAGLRRSVRLFRLFRAEQSDPEKFYVSLAEDA
Ga0210387_115048961 | 3300021405 | Soil | LSLLAPNRRYRADQATPSGLRRSVRLFRLFRAEQG
Ga0126371_124221252 | 3300021560 | Tropical Forest Soil | MSLQAPIRRHGADQAPPTGLLRSVRLFRLFLAEQSDPEKFY
Ga0207692_110824571 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLLAPRRDRANEDGATPATRSGLLRSFRLFRLFLAEQSDP
Ga0207644_117103412 | 3300025931 | Switchgrass Rhizosphere | MSLLAPNRRNRVDQHRVDRLTRSGLLRSVRLFRLFLA
Ga0207665_105688031 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLTAPTRRHSADQAPPSGLLRSVRLFRLFLAEQSDPEKFYASLAED
Ga0209326_10137361 | 3300026899 | Forest Soil | MSLMAPTRRHSADQAPSSGLLRSIRLFRLFLAEQSDPEMFYASLAED
Ga0209116_10635242 | 3300027590 | Forest Soil | MSLLAPIRRRRGNRAARSGVLRSIRLFRLFLGEQTNRETFYVS
Ga0209333_11297481 | 3300027676 | Forest Soil | MSLLAPIRRHRAEQAAHSGLRRSVGLFRLFLAEQAEPEKFYAGLAEDAV
Ga0209380_106394681 | 3300027889 | Soil | MSLLAPIRRHRAEQAARSGLLRSVRLFRLFLAEQAEP
Ga0209415_102527381 | 3300027905 | Peatlands Soil | MSLLAPFRRRRAASFGLLRSVRLFRLFLAEQSDPEA
Ga0311368_102545851 | 3300029882 | Palsa | MSLLAPIRRRRGNQAARSGVLRSIRLFRLFLGEQNDQETFYVSLA
Ga0210261_12247252 | 3300030582 | Soil | MSLLAPIRRHRADQTARSGVLRSVRLFRLFLAEQSDPE
Ga0318534_101906281 | 3300031544 | Soil | MSLQAPIRRDCADQAPPTGLLRSVRLFRLFLAEQSDPE
Ga0318534_102157562 | 3300031544 | Soil | MSLLAPIRRHPADQPGRSGLLRSVRLFRLFLAEQTDP
Ga0318538_101190251 | 3300031546 | Soil | MSLLAPNRLDRADQAAPSGLKRSFRLFRLFLAEQSDPEKFYASLATD
Ga0318528_102182052 | 3300031561 | Soil | MSLLAPNRLDRADQAAPSGLKRSFRLFRLFLAEQSDPEKF
Ga0318515_102541842 | 3300031572 | Soil | MSLLAPNRLDRADQAAPSGLKRSFRLFRLFLAEQSDPEQ
Ga0318572_108721861 | 3300031681 | Soil | MGLMAPIRRRRVDPAARSGLLRSVRLFRLFLAEQADPDTFYTSLAED
Ga0318560_103887732 | 3300031682 | Soil | MSLLAPIRRHRVDQAARSGLLRSVRLFRLFLAEQTDPEKFY
Ga0318560_107484892 | 3300031682 | Soil | MSLLAPNRRHRADQAASSGLGRSVRLFRLFLAEQADPE
Ga0310686_1089019952 | 3300031708 | Soil | MSLLAPIRRHRADPTARSGVLRSVRLFRLFLAEQS
Ga0306917_105513751 | 3300031719 | Soil | MSLQAPIRRHGANQASSTGLLRSVRLFRLFLSEQSDPETFYASLAED
Ga0318493_100417613 | 3300031723 | Soil | MSLQAPIRRHGANQASSTGLLRSVRLFRLFLSEQSDPETFYARLAE
Ga0318502_106427712 | 3300031747 | Soil | MSLLAPRRLHADHPASSGLLRSVRLFRLFLAEQSDP
Ga0318494_101511152 | 3300031751 | Soil | MSLQAPIRRDCADQAPPTGLLRSVRLFRLFLAEQSD
Ga0318554_105104052 | 3300031765 | Soil | MSLLAPNRRLPGDQAAPSGLLRSVRLFRLFLAEQTD
Ga0318554_108485401 | 3300031765 | Soil | MSLLAPNRHHRADQAGRSGLLRSVRLFRLFLAEQTDP
Ga0318521_101489562 | 3300031770 | Soil | MSLQAPIRRHGANQASSTGLLRSVRLFRLFLSEQSDPETFYA
Ga0318546_101695041 | 3300031771 | Soil | MSLQAPIRRHSADQAPPAGLLRSVRLFRLFLAEQSDPEKFYASLAE
Ga0318546_110905262 | 3300031771 | Soil | MSLLAPRRPRADHPASSGLLRSVRLFRLFLAEQSD
Ga0318547_103605241 | 3300031781 | Soil | MSLLAPRRPRADHTASSGLLRSVRLFRLFLAEQSDP
Ga0318557_103322611 | 3300031795 | Soil | MSLLAPNRRLPGDQAAPSGLLRSVRLFRLFLAEQTDPETFYSGLA
Ga0318576_103522432 | 3300031796 | Soil | MSLLAPRRPRADHPASSGLLRSVRLFRLFLAEQSDP
Ga0318565_102074232 | 3300031799 | Soil | MSLLAPRRPRADHTASSGLLRSVRLFRLFLAEQSDPETFYTSLAED
Ga0318568_104942102 | 3300031819 | Soil | MSLLAPTRRHCADQPPPAGLRRSVRLFRLFRAEQSDPEKFY
Ga0318567_107677641 | 3300031821 | Soil | MSLQAPIRRHSADQAPPTGLLRSVRLFRLFLAEQSDPEKFYASLAEDAV
Ga0318564_101981221 | 3300031831 | Soil | MSLLAPRRPRADHTASSGLLRSVRLFRLFLAEQSDPETFYTSLA
Ga0318499_101263971 | 3300031832 | Soil | MSLLAPRRLHADHPASSGLLRSVRLFRLFLAEQSDPETF
Ga0310917_104598081 | 3300031833 | Soil | MSLLAPRRPRADHPASSGLLRSVRLFRLFLAEQSDPETFYTSLA
Ga0318544_104390472 | 3300031880 | Soil | MSLLAPNRRHVDQAAGSGLLRSVRLFRLFLAEQTDPEKF
Ga0318551_106663991 | 3300031896 | Soil | MGLMAPIRRRRVDPAARSGLLRSVRLFRLFLAEQAD
Ga0318520_100384993 | 3300031897 | Soil | MSLLAPIRRHRVDQAARSGLLRSVRLFRLFLAEQTDPEKFYTSLA
Ga0318520_110202531 | 3300031897 | Soil | MSLLAPRRPRADHTASSGMLRSVRLFRLFLAEQSDPETFYTSLAEY
Ga0310910_109961901 | 3300031946 | Soil | MSLQAPIRRHSADQAPPAGLLRSVRLFRLFLAEQS
Ga0310909_103326072 | 3300031947 | Soil | MSLLAPNRLDRADQAAPSGLKRSFRLFRLFLAEQSDPEKFYASLATDAVQ
Ga0306926_129263631 | 3300031954 | Soil | MSLQAPIRRHSADHDQAPPAGLLRSVRLFRLFLAEQSDPEKFY
Ga0307479_104056312 | 3300031962 | Hardwood Forest Soil | MSLLAPIRRYRADQATGSGLLRSVRLFRLFLAEQTEPDKFYA
Ga0318562_102236712 | 3300032008 | Soil | MSLQAPIRRHSADQAPPTGLLRSVRLFRLFLAEQSDP
Ga0318562_108904241 | 3300032008 | Soil | MSLQAPIRRDCADQAPPTGLLRSVRLFRLFLAEQSDPEKFYASLATDAVQQ
Ga0318563_104082772 | 3300032009 | Soil | MSLQAPIRRHSADHDQAPPAGLLRSVRLFRLFLAEQSDPEKF
Ga0318533_114179722 | 3300032059 | Soil | MSLLAPIRRHRVDQATRSGLLRSVRLFRLFMAEQTDPERFYA
Ga0318505_104136641 | 3300032060 | Soil | MSLLAPNRLDRADQAAPSGLKRSFRLFRLFLAEQSDPEQFYASLATDAVQQ
Ga0318513_106080882 | 3300032065 | Soil | MSLLAPIRRHHADQTASSGLLRSIRLFRLFLAEQSDPETFY
Ga0318514_103749862 | 3300032066 | Soil | MSLLAPNRRHRADQAASSSGLGRSVRLFRLFLAEQTDAEAFYARL
Ga0318514_106266001 | 3300032066 | Soil | MSLLAPIRRHRADQTASSGLLRSVRLFRLFLAEQSD
Ga0306924_119744931 | 3300032076 | Soil | MSLLAPIRRHRADQVAGSGLLRSVRLFRLFLAEQS
Ga0318525_103138211 | 3300032089 | Soil | MSLQAPIRRHSADHDQAPPAGLLRSVRLFRLFLAE
Ga0335074_103304783 | 3300032895 | Soil | MSLLAPIRRRRAGQAARSGLLRSIRLFRLFLAEQADPER
Ga0335075_109765172 | 3300032896 | Soil | MSLLAPIRRHGTDPAARSGLLRSVRLFRLFLAEQA
Ga0335073_112809921 | 3300033134 | Soil | MGLLAPTRRHHADQANPPPDQAAGTGLRRSVRLFRLFMAEQS
Ga0310811_106893892 | 3300033475 | Soil | MSLMAPTSRQCADQATPVGLRRSVRLFRLFLAEQSDPEKFYSS
Ga0373948_0185745_2_136 | 3300034817 | Rhizosphere Soil | MSLMAPTRRHSAVQAPPSGLLRSVRLFRLFLAEQSDPEKFYASLA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice