NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F088397


Overview

Basic Information
Family ID F088397
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 41 residues
Representative Sequence MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAV
Number of Associated Samples 92
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 70.64 %
% of genes near scaffold ends (potentially truncated) 96.33 %
% of genes from short scaffolds (< 2000 bps) 91.74 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.41
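
The percentages above can be turned back into gene counts. The following is a minimal sketch, assuming each percentage is a fraction of the 109 sequences reported under Basic Information; the dictionary keys and the helper name `to_count` are illustrative only and are not part of NMPFamsDB.

```python
# Assumption: every percentage is computed over the family's 109 sequences.
FAMILY_SIZE = 109

# Values copied from the Quality Assessment block; the labels are shorthand.
percentages = {
    "valid RBS motif": 70.64,
    "near scaffold end": 96.33,
    "short scaffold (< 2000 bp)": 91.74,
}


def to_count(percent, total=FAMILY_SIZE):
    """Convert a percentage of the family into a whole number of genes."""
    return round(percent / 100 * total)


counts = {label: to_count(p) for label, p in percentages.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, the three figures correspond to 77, 105, and 100 of the 109 genes.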

Hidden Markov Model: sequence logo (rendered with Skylign)

Most Common Taxonomy
Group Unclassified (77.982 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (52.294 % of family members)
Environment Ontology (ENVO) Unclassified (55.046 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified (48.624 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family (interactive MSAViewer display; the 109 sequence labels are the Protein IDs listed under Family Sequences).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 40.58%; β-sheet: 0.00%; Coil/Unstructured: 59.42%
Feature Viewer tracks: Sequence (MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAV), α-helices, β-strands, Coil, SS Conf. score

Predicted 3D Structure

pTM-score: 0.41

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
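
The rule in the note above can be written as a one-line check. This is only a sketch of the stated cutoff (pTM < 0.7); the function name `is_low_confidence` is hypothetical and does not come from NMPFamsDB.

```python
# Cutoff quoted in the note above: models with pTM < 0.7 are low confidence.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm, threshold=PTM_THRESHOLD):
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


# pTM-score reported for family F088397.
family_ptm = 0.41
print(is_low_confidence(family_ptm))
```

With the reported pTM-score of 0.41 the check returns True, which matches the low-quality label shown for this family.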



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart: All Organisms. Unclassified: 78.0%; remainder: 22.0%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Chart legend (habitat types): Freshwater Sediment; Soil; Tropical Forest Soil; Peatlands Soil; Grasslands Soil; Grass Soil; Hardwood Forest Soil; Tropical Peatland; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Arabidopsis Rhizosphere; Corn Rhizosphere; Miscanthus Rhizosphere.
Slice labels shown: 3.7%, 5.5%, 52.3%, 9.2%, 5.5%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
FI_00348330 | 2166559006 | Grass Soil | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAEAARGRLGRGR
E41_01178960 | 2170459005 | Grass Soil | MAVVPDAGKPLSPQAADLTRRVARVVLGEPADLMAEVHAAVLAG
Ga0066388_1038525141 | 3300005332 | Tropical Forest Soil | VPDAGELLSPQAADLAHRIARVFLDEPAELMDQVHAAVSAAA
Ga0070666_101805824 | 3300005335 | Switchgrass Rhizosphere | MAVVPEAGEPLSPQAADLVRRIARTVLDEPADLMDQVHAAVS
Ga0070695_1015213012 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVVPDAGESLSPEAADLVRRVARVFLDEPADLMAEVHAAVSAAADEP
Ga0066903_1054706641 | 3300005764 | Tropical Forest Soil | VPDTGEPLSPQAADLARRIARMFLDEPAELMDQVHAAVS
Ga0068865_1010739471 | 3300006881 | Miscanthus Rhizosphere | MAVVPEAGEPLSPQAADLVRRIARTVLDEPADLMDQ
Ga0066709_1038746311 | 3300009137 | Grasslands Soil | MAVVPDAGEPLSPQAADLIRRIARVVLDEPADLMAEVQA
Ga0116224_101952962 | 3300009683 | Peatlands Soil | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAVSA
Ga0116216_101432761 | 3300009698 | Peatlands Soil | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAVLAAADEPL
Ga0116216_101769081 | 3300009698 | Peatlands Soil | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHA
Ga0126379_101115542 | 3300010366 | Tropical Forest Soil | PARCRAPLSPQAADLVRRIARMVLNEPADLMAEVHAAVFAAAD*
Ga0126379_133964011 | 3300010366 | Tropical Forest Soil | MRGPLSPQAADLVRRIARVFPDEPGGLMAGIHAAVSAAA
Ga0136449_1009985184 | 3300010379 | Peatlands Soil | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAV
Ga0157344_10282691 | 3300012476 | Arabidopsis Rhizosphere | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAELHA
Ga0164298_109979212 | 3300012955 | Soil | MAVVADAGEPLSAQAADLIRQIARVFLDEPADLMAEVHAAVSAAADE
Ga0126369_128342712 | 3300012971 | Tropical Forest Soil | MAVVHDAGEPLSPQAADLARRIARVFLDEPADLMDQVYTPSRPSSG
Ga0157371_114639891 | 3300013102 | Corn Rhizosphere | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAE
Ga0132258_114018201 | 3300015371 | Arabidopsis Rhizosphere | MAVMPDAGEPDAGKPLSPQAADLVRRIARTILDEPADLLDQVHAAVA
Ga0132255_1003138821 | 3300015374 | Arabidopsis Rhizosphere | MAVVPDAGESLSPEAADLVRRVARVFLDEPADLMAEV
Ga0182036_111187982 | 3300016270 | Soil | MRVVPEAGELLSPQAADLVRRIARGILDEPADLMDQVHAAV
Ga0182033_106738022 | 3300016319 | Soil | VADAGELLSPQAAELVRRIARMILDEPADLMDQVQA
Ga0182033_109458442 | 3300016319 | Soil | MAVVPDAGEPDATEPLSPQAADLVRRIARVILDEPADLMDQVHADVSAAADEPLRS
Ga0182035_103848721 | 3300016341 | Soil | MAVVPDAGEPLSPQAADLVRRIARAVLDEPADLMA
Ga0182040_104107291 | 3300016387 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQVQAAEEPLN
Ga0187806_10337243 | 3300017928 | Freshwater Sediment | MAVVADAGEPLSPQAADLTRRVARVFLDEPADLMAELHA
Ga0187825_104427732 | 3300017930 | Freshwater Sediment | MAVVADAGEPLSPQAADLIRRVARVVLDEPADLMAE
Ga0187776_111149983 | 3300017966 | Tropical Peatland | VPDAGEPDAGGILSPQAADLVRRIAGVVLGEPADLMAAVHAAVSAAAD
Ga0187804_101016081 | 3300018006 | Freshwater Sediment | MAVVADAGEPLSPQAADLIRRVARVFLDEPADLMA
Ga0210400_109479662 | 3300021170 | Soil | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAE
Ga0210405_102886881 | 3300021171 | Soil | MTVVPDVGEPLSPQAADLVRRIARMVLDEPADLMTEVQAAVSAA
Ga0210396_103215293 | 3300021180 | Soil | MAVVPDAGEPLSPQAANLVRRIARMVLDEPTDLMAEVQAA
Ga0210389_103546013 | 3300021404 | Soil | MAVVPGAGELLSPQAADLVRRLARGILDEPADLMAEVYAAV
Ga0210386_116054811 | 3300021406 | Soil | MAVVADAGEPLSPQAADLIRRVARVFLDEPADLMAELH
Ga0210409_101087275 | 3300021559 | Soil | VADAGEPLSPQAADLVRRVARVFLDEPADLMAELHAAVLAAVD
Ga0126371_120563891 | 3300021560 | Tropical Forest Soil | MATVPDAGEPLSPQAADLVRRISRAVLDEPAELMAEVHAAV
Ga0207656_104026922 | 3300025321 | Corn Rhizosphere | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEAHA
Ga0207699_104354911 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAELHAA
Ga0207654_110690432 | 3300025911 | Corn Rhizosphere | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEAH
Ga0207693_101036114 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MMTVVPDAGESLSPQAADLVRRVAQRILDEPAELMD
Ga0207663_102653371 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVVPDAGESLSPEAADLVRRVARVFLDEPADLMAE
Ga0207662_103723141 | 3300025918 | Switchgrass Rhizosphere | MAVVPDAGEPLSPEAADLVRRVARVFLDEPADLMA
Ga0207662_111278532 | 3300025918 | Switchgrass Rhizosphere | MAVVPEAGEPLSPQAADLVRRLARTILDEPADLMDQVHAAVSAAA
Ga0207700_100390741 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDAAGPLSPQTADLVRRIARVVLDEPADLMDQVQAAVSAAADE
Ga0207700_103932181 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVVPDAGEPLSPQAADLIRRIARVFLDEPADLMAELHAAVSAAADE
Ga0207686_116959921 | 3300025934 | Miscanthus Rhizosphere | MAVVPEAGEPLSPQAADLVRRIARTVLDEPADLMDQV
Ga0207709_107672542 | 3300025935 | Miscanthus Rhizosphere | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEAHAAVSAAADEPLR
Ga0207689_100693887 | 3300025942 | Miscanthus Rhizosphere | MAVVPEAGEPLSPQAADLVRRIARTVLDEPADLMDQVHAAVSAA
Ga0257164_10672792 | 3300026497 | Soil | MTVVAGAGEPLSPQAADLVRRVARVFLDEPADLMAE
Ga0310038_100982241 | 3300030707 | Peatlands Soil | MAVVADAGEPLSPQAADLVRRVARVFLDEPADLMAEV
Ga0318516_102394712 | 3300031543 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQVHAAVSAAADEP
Ga0318541_106697132 | 3300031545 | Soil | VPDSGEQLSPQAADLARRIARVFLDEPADLMAEVHAAVSAAA
Ga0318573_106395981 | 3300031564 | Soil | MTVVPDAGEPLSPQAADLVRRIARVFLDEPADLMDQVQAAVS
Ga0318542_102147873 | 3300031668 | Soil | VADAGELDAGEPLSPQAADMVRRIARVILDEPADLMD
Ga0318572_100399071 | 3300031681 | Soil | VADAGEPDAGEPLSPQAADLVRRIARAILDEPADLMDQVHEAVSAAAC
Ga0318560_103598592 | 3300031682 | Soil | MAVVPDAGEPLSPQAADLVRRIARAVLDEPADLMAEVYAAE
Ga0318496_100712261 | 3300031713 | Soil | MAVVPDAGEPLSPQAADLVRRIARAVLDEPADLMAEVYAAVS
Ga0318496_105871982 | 3300031713 | Soil | VADAGELDAGEPLSPQAADMVRRIARVILDEPADLMDRV
Ga0318493_105021001 | 3300031723 | Soil | MAVVPDHGQPLSPQAADLVRRIARTVLDEPADLMTEVYAAVSAAA
Ga0318501_100065301 | 3300031736 | Soil | VPDAGEPLSPQAADLARRIARVILDEPADLMAEVHA
Ga0318501_100993311 | 3300031736 | Soil | MAIVPDAGEPLSPQAADLVRRIARVFLDEPADLMDQVQAAVSA
Ga0318502_105485472 | 3300031747 | Soil | MAVMPDAGEPDAGKLLSPQAADLVRRIARTILDDPDDLMDQVHAA
Ga0318494_103983931 | 3300031751 | Soil | LSPQAADLIRRIARVFLDEPGELMAEVQAAVSAAAD
Ga0318494_106669921 | 3300031751 | Soil | VPDSGEQLSPQAADLARRIARVFLDEPADLMAEVHAAVSAAADEPLR
Ga0318535_101546481 | 3300031764 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQV
Ga0318535_104386702 | 3300031764 | Soil | MAIVPDAGEPLSPQAADLVRRIARVFLDEPADLMDQVQ
Ga0318554_105652583 | 3300031765 | Soil | VSDAGEPDAAEPLSPQAAGLVRRIARAVLDEPADLMAEVYA
Ga0318554_108383032 | 3300031765 | Soil | LSPQAADLVRRVAQVFLDEPADLMDQVQAAVSAAADEPL
Ga0318526_100721041 | 3300031769 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQ
Ga0318521_110448232 | 3300031770 | Soil | MAVVPDVGELLSPQAADLVRRIARMVLDEPADLMDQVQAAVSAAADEPL
Ga0318546_109035181 | 3300031771 | Soil | MAVVPDAGEALSPQAAELVRRIARRILDEPADLMAEIYAAVSAA
Ga0318566_100703184 | 3300031779 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQVHAAVSAAADEPL
Ga0318552_103604802 | 3300031782 | Soil | MAVVPDAGEPLSPQAADLVRRIARAVLDEPADLMAEV
Ga0318548_101043033 | 3300031793 | Soil | VADAGEPDAGEPLSPQAADLVRRIARAILDEPADLMDQVHEAVSAAACGD
Ga0318576_100564411 | 3300031796 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQVHAAVSA
Ga0318576_101791231 | 3300031796 | Soil | VPDSGKQLSPQAADLARRIARVFLDEPADLMAEVHAAVSAAADEP
Ga0318576_104718412 | 3300031796 | Soil | MAAVPDAGKPLSPQAADLVRRIARAVLDEPADLMAEVYAAVSA
Ga0318565_103419142 | 3300031799 | Soil | VPDSGKQLSPQAADLARRIARVFLDEPADLMAEVHAAV
Ga0318497_106320312 | 3300031805 | Soil | MAVVPDAGEPLSPQAADLVRRIARGILDEPADLMD
Ga0318497_106445492 | 3300031805 | Soil | MAVVPDNGQPLSPQAADLVRRIARVFLDEPADLMAEMYAAVS
Ga0318511_100784683 | 3300031845 | Soil | VADAGEPDAGEPLSPQAADLVRRIARAILDEPADLMDQV
Ga0318512_100942423 | 3300031846 | Soil | VADAGEPDAGEPLSPQAADLVRRIARAILDEPADLMDQVH
Ga0318495_100109121 | 3300031860 | Soil | VPDAGEPLSPQAAELVRRIARVFLDEPADLMDQVHSAV
Ga0306925_117133571 | 3300031890 | Soil | MAVVPDAGEALSPQAAELVRRIARRILDEPADLMAEIHAAVS
Ga0318551_104242281 | 3300031896 | Soil | MAIVPDAGEPLSPQAADLVRRIARVFLDEPADLMDQVQAAVSAAADEP
Ga0318551_109530702 | 3300031896 | Soil | MAVVTHAGEPLSPQAADLVRRIARVILDEPADLMAEVQAAVSAAAMSRCAPSRCWPRR
Ga0318520_110928031 | 3300031897 | Soil | MAVVPDAAEPLSPQAADLVRRIARVILDEPAGLMDQVQAAVT
Ga0306923_108337753 | 3300031910 | Soil | VADAGEPDAGEPLSPQAADLVRRIARAILDEPADLMDQVHEAVSAAAD
Ga0306921_109054391 | 3300031912 | Soil | VADAGEPDAGEPLSPQAADLVRRIARGILDQPAGLMSQV
Ga0306921_115619662 | 3300031912 | Soil | MAVVPDNGQPLSPQAADLVRRIARVFLDEPADLMAEMYAAVSAAADEP
Ga0308175_1006140011 | 3300031938 | Soil | MTVVPEAGEPLSPQAADLVRRIARTILDEPDDLLDQVHAAVSAAA
Ga0306922_109181212 | 3300032001 | Soil | MAVVPDHGQPLSPQAADLVRRIARTVLDEPADLMTEVYAAVSAAADEP
Ga0318563_101684492 | 3300032009 | Soil | MAVVPDAGEPLSPQAADLVRRIARGILDEPADLMDQVHAAVSAACT
Ga0318569_101395971 | 3300032010 | Soil | MAVVPDHGQPLSPQAADLVRRIARTVLDEPADLMTE
Ga0318569_102060143 | 3300032010 | Soil | VADAGEPDAGEPLSPQAADLVRRIARGILDEPADLMDRV
Ga0318569_103586031 | 3300032010 | Soil | VPDAGEPLSPQAAELVRRIARVFLDEPADLMAEMHA
Ga0318559_101262782 | 3300032039 | Soil | MRVVPEAGEPLSPQAADLVRRIARGILDEPADLMDQVH
Ga0318558_100888401 | 3300032044 | Soil | VPDAGEPLSPQAAELVRRIARVFLDEPADLMAEMHAAVSAAADEPL
Ga0318506_101515421 | 3300032052 | Soil | VPDAGESLSPQAAELARRIARVFLDEPADLMAEVHAA
Ga0318570_103060952 | 3300032054 | Soil | MTVVPDAGEPLSPQAADLVRRIARVFLDEPADLMDQVQA
Ga0318575_101773321 | 3300032055 | Soil | MAVMPDAGEPLSPQAADLVRQVAQAVLDEPADLMAEVYAAVSAAADEPLR
Ga0318504_101621213 | 3300032063 | Soil | MAVVPDAGEPLSPQAADLVRRIARGILDEPADLMDQVHAAVSAAC
Ga0318513_103678302 | 3300032065 | Soil | MAVVPDNGQPLSPQAADLVRRIARVFLDEPADLMAEMYA
Ga0318514_103619321 | 3300032066 | Soil | MTVMPDAGEPLSPQAADLVRQVAQVVLDEPADLMAEVYAAVS
Ga0318525_102684471 | 3300032089 | Soil | MTVMPDAGEPLSPQAADLIRQIAQMVLDEPADLMA
Ga0318518_103481762 | 3300032090 | Soil | MAVVPDAGEPLSPQAADLVRRIARAVLDEPADLMAEVYAAVSA
Ga0311301_107229061 | 3300032160 | Peatlands Soil | MAVVPDAGEPLSPQAADLVRRVARVFLDEPADLMAEVHAAVSAAAD
Ga0307472_1019800401 | 3300032205 | Hardwood Forest Soil | VPDAGEPLSPQAADLVRRIARVFLDEPADLMAELHAAVLA
Ga0335070_104482943 | 3300032829 | Soil | VPDAGEPDAREPLSPQAADLVRRIARVILDEPADFM




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice