NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F095993

Overview

Basic Information
Family ID F095993
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 43 residues
Representative Sequence AGHMADDGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL
Number of Associated Samples 94
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 6.67 %
% of genes near scaffold ends (potentially truncated) 86.67 %
% of genes from short scaffolds (< 2000 bps) 84.76 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.40

Hidden Markov Model (sequence logo, powered by Skylign)

Most Common Taxonomy
Group Bacteria (96.190 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (19.048 % of family members)
Environment Ontology (ENVO) Unclassified (24.762 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (47.619 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all 105 sequences in the family. Member protein IDs are listed under Family Sequences. Powered by MSAViewer.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 47.83%, β-sheet: 0.00%, Coil/Unstructured: 52.17%
Sequence: AGHMADDGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL
Feature tracks: α-helices, β-strands, Coil, SS Conf. score (powered by Feature Viewer)

Predicted 3D Structure

pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic distribution of family members (chart, powered by ApexCharts):
All Organisms: 96.2%
Unclassified: 3.8%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat type distribution of family members (chart, powered by ApexCharts).
Habitat categories shown: Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Peatlands Soil; Unplanted Soil; Agricultural Soil; Hardwood Forest Soil; Tropical Peatland; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Switchgrass Rhizosphere; Corn Rhizosphere; Avena Fatua Rhizosphere; Attine Ant Fungus Gardens.
Labeled slice values: 3.8%, 5.7%, 6.7%, 8.6%, 3.8%, 4.8%, 19.0%, 8.6%, 4.8%, 3.8%.



Associated Samples


Geographical Distribution
Map powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12270J11330_1000276410 | 3300000567 | Peatlands Soil | MTSYGDHGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL*
Ga0063454_1012186581 | 3300004081 | Soil | ACINVGVRARYLAGHMGDDGFAAALATPAGSRAPGLAGERLVLAEAARDALREGTI*
Ga0066683_107429002 | 3300005172 | Soil | LAGHMADDGFAAQLLTSAENPEPGSAGARIVLAEAARDALRDGSL*
Ga0070668_1017713322 | 3300005347 | Switchgrass Rhizosphere | DDGFAVRLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0070694_1015482782 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | ACIGVGVRARYLAGYMADDGFAARLLTSAENPEPGSAGARIVLAEAARDALRAGRL*
Ga0073909_102819872 | 3300005526 | Surface Soil | RARYLAGHMGEDGFAARLATSAESGDLGAAGGRVVLAEAARDALREGSL*
Ga0070763_101295171 | 3300005610 | Soil | RYLAGHMADDGFAAGLLTSAEPGAAGGRVVLAEAARDALRAAGL*
Ga0066903_1073176491 | 3300005764 | Tropical Forest Soil | VRARYLAGHMADDGFAARLLTPAEPGEAGGRVVLAEAARDALREGGL*
Ga0066696_101387853 | 3300006032 | Soil | MGDDGFAARFGTSAGSGEPGSAGSRVVLAEAARDALRGVGL*
Ga0066696_105751071 | 3300006032 | Soil | LAGHMGDDGFAARLATPGESREPGTAGGRVVLAEAARDALREGSL*
Ga0075029_1008791581 | 3300006052 | Watersheds | GHMADDGFAARLLTSAEPGAAGSRVILAEAARDALRAAGL*
Ga0075030_1005535141 | 3300006162 | Watersheds | RARYLAGHMADDGYAATLLTAAEAGEAGGRVVLAEAARDALLEGGL*
Ga0075018_103745862 | 3300006172 | Watersheds | RYLAGHMADDGFAARLLTSAEPGAAGDRVILAEAARDALRAAGL*
Ga0068871_1015211512 | 3300006358 | Miscanthus Rhizosphere | VGVRARYLAGHMGDDGFAARLATSAGSREPGTAGGRVVLAEAARDALREGGL*
Ga0066660_108012892 | 3300006800 | Soil | MADDGFAARLLTSAENPDPGAAGGRVILAEAARDALREGSL*
Ga0075424_1023441441 | 3300006904 | Populus Rhizosphere | CIGVGVRARYLAGHMGDDGFAARLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0105247_102623731 | 3300009101 | Switchgrass Rhizosphere | ARLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0116222_10071115 | 3300009521 | Peatlands Soil | MTSHGDHGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL*
Ga0126374_105606442 | 3300009792 | Tropical Forest Soil | GHMADDGYARLLVSTEPGAAGEQTVLAEAARDALRGGGATER*
Ga0134065_102398431 | 3300010326 | Grasslands Soil | DGFTARLLTSAESGEPGAAGARVVLAEAARDALRAGSL*
Ga0134080_102129812 | 3300010333 | Grasslands Soil | VGVRARYLAGHMADDGFAARLLTSAESREPGAAGERVVLAEAARDALRAGSL*
Ga0126376_109515291 | 3300010359 | Tropical Forest Soil | ADDGFAARLLTSAENPDPGSAASRVILAEAARDALREGGL*
Ga0126372_120527602 | 3300010360 | Tropical Forest Soil | GVGVRARYLAGHMADDGFAARLLTSAESGEPGAAWARIVLAEAARDALREGSL*
Ga0126378_101876992 | 3300010361 | Tropical Forest Soil | VGVRARYLAGHMADDGFAARLLTPAEPGEAGGRVVLAEAARDALREGGL*
Ga0105239_102680931 | 3300010375 | Corn Rhizosphere | RARYLAGHMGDDGFAARLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0126381_1000917691 | 3300010376 | Tropical Forest Soil | RYLAGHMADDGYAATLLPGAVVNRTVLAEAARDALRDGGV*
Ga0126381_1048986791 | 3300010376 | Tropical Forest Soil | MADDGYARLLASAEPGAARDLNVLAEAARDALRDGGV*
Ga0126383_122891991 | 3300010398 | Tropical Forest Soil | GDDGFAARLGTPADSGEAGSFGSRFVLAEAARDALREGSL*
Ga0137392_112945402 | 3300011269 | Vadose Zone Soil | VRARYLAGHMADDGFAARLLTSAENPEPGSAGARIVLAEAARDALRAAGR*
Ga0153924_11399401 | 3300012089 | Attine Ant Fungus Gardens | VRARYLAGHMADDGYAATLLSSAQAGEAGGRVVLAEAARDALCEGGL*
Ga0137382_100556602 | 3300012200 | Vadose Zone Soil | MADDGFAARLLTSADSREPGAAGGRVVLAEAARDALREDSL*
Ga0137382_102915962 | 3300012200 | Vadose Zone Soil | MADDGFARLLTSAEPGAAGGRVILAEAARDALRAAGL*
Ga0137377_104843482 | 3300012211 | Vadose Zone Soil | YLAGHMADDGFAAQLLASAENPEPGSAGARIVLAEAARDTLREGSL*
Ga0137384_111364162 | 3300012357 | Vadose Zone Soil | RARYLAGHMGDDGFAATFQPSAENPDPGSPGSRAVLAEAARDALRAGSL*
Ga0137375_101763503 | 3300012360 | Vadose Zone Soil | MADDGFAARVLTSSREPGAAGGRVVLAEAARDALREGSL*
Ga0150984_1144012011 | 3300012469 | Avena Fatua Rhizosphere | YLAGHMGDDGFAAALATPAGSRAPGLAGERLVLAEAARDALREGTI*
Ga0157329_10156941 | 3300012491 | Arabidopsis Rhizosphere | DGFAVRLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0157350_10566741 | 3300012499 | Unplanted Soil | MGDDGFAVRLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0137410_105343211 | 3300012944 | Vadose Zone Soil | RYLAGHMADDGFAARLLTSAENPEPGSAGARIVLAEAARDALREGSL*
Ga0164302_107240631 | 3300012961 | Soil | MGEDGFAARLATSAESGDLGAAGGRVVLAEAARDALREGSL*
Ga0126369_109919492 | 3300012971 | Tropical Forest Soil | ARYLAGHMADDGYARLLASAEPGAARDLNVLAEAARDALRDGGV*
Ga0126369_113988483 | 3300012971 | Tropical Forest Soil | MADDGFAARLLTSAEPGSAASRAILAEAARDALRKGGL*
Ga0164309_108367601 | 3300012984 | Soil | DDGFAARLLTSAENPEPGLAGARIVLAEAARDALRDGSL*
Ga0164304_108134342 | 3300012986 | Soil | AAQLLTSAENPEPGLAGARIVLAEAARDALRDGSL*
Ga0157369_122927611 | 3300013105 | Corn Rhizosphere | GVRARYLAGHMGDDGFAARLATSAGSREPGTAGGRLVLAEAARDALREGSL*
Ga0157378_131328902 | 3300013297 | Miscanthus Rhizosphere | RARYLAGHMADDGFAARLLTSAENPEPGSAGARIVLAEAARDALREGGL*
Ga0132255_1060453522 | 3300015374 | Arabidopsis Rhizosphere | CIGVGVRARYLAGHMADDGFAARLLTSGENPDPGSAGSRAILAEAARDALCDCGL*
Ga0182036_111084752 | 3300016270 | Soil | GHMADDGFAARLLTSAEPGSAASRAILAEAARDALRESGL
Ga0182034_114649912 | 3300016371 | Soil | ADDGFAARLLTSAGPGEAGGRVVLAEAARDSLREGGL
Ga0182040_104357141 | 3300016387 | Soil | HMADDGYAATLLPGAVANRTVLAEAARDALRDGGV
Ga0182039_107889171 | 3300016422 | Soil | RARYLAGHMADDGFAARLLTSAGPGEAGGRVVLAEAARDSLREGGL
Ga0187801_102146031 | 3300017933 | Freshwater Sediment | MADDGFAARLLTSAEPGAAGDRVVLAEAARDALRAAGL
Ga0187779_110424582 | 3300017959 | Tropical Peatland | VRARYLAGHMADDGFAARLLTSAEPGAAGSRVVLAEAARDALRAAGL
Ga0187783_100435321 | 3300017970 | Tropical Peatland | GHMADDGHAATVLTSARPGEAGGRVVLAEAARQALREGSR
Ga0187783_112505562 | 3300017970 | Tropical Peatland | ADDGFAAGLLTSAEPGAGGSRVLLAEAARDALRAGL
Ga0187765_102720152 | 3300018060 | Tropical Peatland | ARYLAGHMADDGFAARLLVAAEPGEAGGRVVLAEAARDALREGGL
Ga0066667_113023991 | 3300018433 | Grasslands Soil | MGDDGLAASFGTSAQSGEPGSAGSRAVLAEAARDALRADSL
Ga0193747_11127871 | 3300019885 | Soil | RYLAGHMGDDGFAARLATPAGSGDPGAAGGRVVLAEAARDALSAGSL
Ga0193693_10337872 | 3300019996 | Soil | RYLAGHMADDGFTASLLTSAESREPGAAGGRVVLAEAARDALREGSL
Ga0210402_118390931 | 3300021478 | Soil | MADDGFTGRLLTSAEPGSAGSRAILAEASRDALRQGVGFLR
Ga0210409_115117952 | 3300021559 | Soil | ARYLAGHMADDGFAATLLTSAEPGAAGSRVILAEAARDALRAAGR
Ga0224564_10045613 | 3300024271 | Soil | AGHMADDGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL
Ga0207700_100795194 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | GVRARYLAGHMGDDGFAASFLTSAGNPEPGSAGSRAVLAEAARDALRDAGL
Ga0207691_101293083 | 3300025940 | Miscanthus Rhizosphere | VRARYLAGHMADDGFTARLLTSAESGEPGAAGGRVVLAEAARDALREGSL
Ga0207683_117670932 | 3300026121 | Miscanthus Rhizosphere | ADDGFAARLLTSAENPEPGSAGARIVLAEAARDALRAGSL
Ga0209473_11130242 | 3300026330 | Soil | AGNMADDGFAARLLTSAENPDPGAAGGRVILAEAARDALRAAGL
Ga0208236_10001313 | 3300027066 | Forest Soil | MADDGFAARLLTSAEPGAAGSRVVLAEAARDALRAAGL
Ga0208199_10052274 | 3300027497 | Peatlands Soil | MTSYGDHGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL
Ga0208324_10141181 | 3300027604 | Peatlands Soil | MTSHGDHGFAARLLTSAEPGAAGGRVILAEAARDALRAAGL
Ga0209074_100409442 | 3300027787 | Agricultural Soil | YLAGHMGDDGFAARLATPAGSGEPGTAGARVALAEAARDALREGSL
Ga0209074_102963002 | 3300027787 | Agricultural Soil | GVRARYLAGHMGDDGFAARLGTPADSGEPGSFGSRFVLAEAARDALRAGSL
Ga0209811_102365871 | 3300027821 | Surface Soil | VRARYLAGHMGEDGFAARLATSAESGDLGAAGGRVVLAEAARDALHEGSL
Ga0209465_104963852 | 3300027874 | Tropical Forest Soil | LAGHMADDGYATLLTSAEPGAAGDPTVLAEAARDALRDGGV
Ga0209698_101242073 | 3300027911 | Watersheds | YLAGHMADDGFAARVLTSAEPGAAGGRVVLAEAARDALRAAGL
Ga0209526_107620741 | 3300028047 | Forest Soil | MADDGFAARLLTSPENPEPGSGFSAGARIVLAEAARDALREGSL
Ga0318516_107641242 | 3300031543 | Soil | VGVRSRYLAGHMADDGYARLMTSAAPGTAWDRTVLAEAARDALRDGGV
Ga0318571_101682251 | 3300031549 | Soil | DDGFAARLLTSAGPGEAGGRVVLAEAARDALREGGV
Ga0318515_102050402 | 3300031572 | Soil | ARYLAGHMGDDGFAARFLTSAEPGSAGSRVVLAEAARDALRGVGL
Ga0318542_102027872 | 3300031668 | Soil | ADDGFAARLLTSAEPGSAASRAILAEAARDALREGGL
Ga0318542_106999921 | 3300031668 | Soil | VGVRSRYLAGHMADDGYARLMTSAEPGTAWDRTVLAEAARDALRDGGV
Ga0318496_100103434 | 3300031713 | Soil | HMADDGFAARLLTSAEPGSAASRAILAEAARDALREGGL
Ga0306918_112139352 | 3300031744 | Soil | VRSRYLAGHMADDGFARLLTSAEPGPAGDRTVLVEAARDALRDGGV
Ga0318492_107659732 | 3300031748 | Soil | AMCPARLMTSAAPGTAWDRTVLAEAARDALRDGGV
Ga0307475_109792482 | 3300031754 | Hardwood Forest Soil | CIGVGVRARYLAGYMADDGFAAGLLTSAEPGAAGSRVVLAEAARDALRAAGL
Ga0318521_107574101 | 3300031770 | Soil | VRARYLAGHMADDGFAARLLTSAGPGEAGGRVVLAEAARDALREGDL
Ga0318557_101197641 | 3300031795 | Soil | YLAGHMADDGYARLMTSAEPGTAWDRTVLAEAARDALRDGGV
Ga0318557_102445261 | 3300031795 | Soil | VRSRYLAGHMADDGYAATLLPGAVANRTVLAEAARDALRDGGV
Ga0318576_105087472 | 3300031796 | Soil | ADDGYARLMTSAEPGTAWDRTVLAEAARDALRDGGV
Ga0318520_109242211 | 3300031897 | Soil | AGHMADDGYARLLASAEPGAAGDRTVLAEAARDALRDGGV
Ga0306921_100386336 | 3300031912 | Soil | RYLAGHMADDGYAATLLPGAVVNRTVLAEAARDALRDGGV
Ga0306921_103227481 | 3300031912 | Soil | ARYLAGHMADDGFTARLLTSAEPGAAGSRVVLAEAARDALRAAGL
Ga0306926_118133672 | 3300031954 | Soil | YLAGHMADDGFAARLLTSAGPGEAGGRVVLAEAARDALREGGV
Ga0318530_100580363 | 3300031959 | Soil | GVGVRARYLAGHMADDGYAATLLPGAVANRTVLAEAARDALRDGGV
Ga0306922_112269402 | 3300032001 | Soil | DGFAARLLISAEPGEPGGRVVLAEAARDALRESGL
Ga0318506_104277271 | 3300032052 | Soil | ARYLAGHMADDGFAARLLTSAEPGSAASRAILAEAARDALREGGL
Ga0318575_103431642 | 3300032055 | Soil | GHMADDGYARLLASAEPGAAGDRTVLAEAARDALRDGGG
Ga0318518_106194842 | 3300032090 | Soil | GFAATLLTSAENPDPGSAGSRAILAEAARDALRQGGP
Ga0318518_107261172 | 3300032090 | Soil | VRARYLAGHMADDGFAARLLTSAGPGEAGGRVVLAEAARDSLREGGL
Ga0307471_1032326501 | 3300032180 | Hardwood Forest Soil | RYLAGHMADDGFTARLLTSAESGEPGAAGGRVVLAEAARDALGAGSL
Ga0335069_111886021 | 3300032893 | Soil | MGDDGYAARFLTSDAPGSAGSRAVLAEAARDALRGGGL
Ga0335075_104688661 | 3300032896 | Soil | AARLAATAGPGEPGSFAGRLVLAEAARDALRDGSL
Ga0335073_108549041 | 3300033134 | Soil | MAEDGYAARLLTPGEPGEASGRVVLAEAARDALLDPSQ
Ga0335077_106350104 | 3300033158 | Soil | MSVADDGFAARFLTSAEPGSAGSRVVLAEAARDALRDSGL
Ga0335077_107035171 | 3300033158 | Soil | DDGFAARFLASAGSGEPGSAGSRVVLAEAARDALRDGGL
Ga0372943_1202793_364_489 | 3300034268 | Soil | MGDDGFAATFQPSAENPGPGTPWSRAVLAEAARDALRAGSL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice