NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F101286

Metagenome Family F101286

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101286
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 45 residues
Representative Sequence CDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER
Number of Associated Samples 92
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.98 %
% of genes near scaffold ends (potentially truncated) 99.02 %
% of genes from short scaffolds (< 2000 bps) 91.18 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.020 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(22.549 % of family members)
Environment Ontology (ENVO) Unclassified
(31.373 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.922 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.
1JGI12053J15887_1002381310
2JGI25383J37093_100777175
3JGI25384J37096_101703683
4JGI25382J37095_101672371
5JGI25389J43894_10865642
6Ga0066680_105615543
7Ga0066680_108657023
8Ga0066684_103819151
9Ga0066685_101953141
10Ga0066675_102820266
11Ga0066388_1012731423
12Ga0066388_1037742411
13Ga0070703_100760601
14Ga0070705_1009728893
15Ga0066687_100133718
16Ga0066687_104446012
17Ga0070699_1008041431
18Ga0066697_103072235
19Ga0070696_1016742613
20Ga0066670_101870931
21Ga0066703_100110891
22Ga0066705_100414067
23Ga0066798_102003223
24Ga0066651_107071101
25Ga0066652_1020828493
26Ga0075024_1004872361
27Ga0075028_1008563571
28Ga0075021_102659065
29Ga0079221_113140191
30Ga0075426_105923961
31Ga0075436_1005181454
32Ga0099793_103904551
33Ga0099828_110168943
34Ga0099792_110695293
35Ga0075423_104949615
36Ga0105856_13278781
37Ga0134082_105459662
38Ga0134084_102739321
39Ga0126378_123520652
40Ga0137393_100749329
41Ga0137393_105366771
42Ga0137389_105694171
43Ga0137364_101410976
44Ga0137399_116992721
45Ga0137380_108973831
46Ga0137380_113074881
47Ga0137380_113961271
48Ga0137376_102009411
49Ga0137376_111646181
50Ga0137377_105743904
51Ga0137370_106095993
52Ga0137385_102040571
53Ga0137361_108884023
54Ga0137397_109082493
55Ga0137395_102959021
56Ga0137394_103949895
57Ga0137394_105092001
58Ga0164304_111630793
59Ga0120155_10951571
60Ga0120173_10017681
61Ga0120125_11493982
62Ga0134078_100727364
63Ga0132257_1019946112
64Ga0184618_104514012
65Ga0066662_113878462
66Ga0193725_10449281
67Ga0193755_10575292
68Ga0193757_10254611
69Ga0210382_100139508
70Ga0210410_110009233
71Ga0247667_10258071
72Ga0207929_10768271
73Ga0207927_10993633
74Ga0207653_101236381
75Ga0207684_103912541
76Ga0209027_13063441
77Ga0209238_11088364
78Ga0209761_10700801
79Ga0209131_13138801
80Ga0209803_11788761
81Ga0209803_13102941
82Ga0209804_13296672
83Ga0257165_10571893
84Ga0209059_12108543
85Ga0209806_12195723
86Ga0209157_11622051
87Ga0209376_13815001
88Ga0209219_11319733
89Ga0209076_10791305
90Ga0208981_10675711
91Ga0208981_11652692
92Ga0208991_11874982
93Ga0209073_101062705
94Ga0209068_101262821
95Ga0209069_105781573
96Ga0137415_112586991
97Ga0307305_101892981
98Ga0307296_100257431
99Ga0307475_107847141
100Ga0318504_104186701
101Ga0307471_1004562811
102Ga0334722_100316101
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 14.71%    β-sheet: 25.00%    Coil/Unstructured: 60.29%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540CDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLERSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Sediment
Groundwater Sediment
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Agricultural Soil
Arctic Peat Soil
Permafrost
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Permafrost Soil
Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Arabidopsis Rhizosphere
Populus Rhizosphere
4.9%5.9%22.5%2.9%2.9%19.6%8.8%3.9%4.9%5.9%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_10023813103300001661Forest SoilASCDTVVVVSWNVDLALRDLAAQEFPRATALQANYPAVVLEK*
JGI25383J37093_1007771753300002560Grasslands SoilVVTRPDQLADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLVR*
JGI25384J37096_1017036833300002561Grasslands SoilDFTVISTPDQLNECDMVVVVSWNVDLPLRDLAAQKFPRGTALPASYPAVVLER*
JGI25382J37095_1016723713300002562Grasslands SoilTDFTVISTPDQLNECDMVVVVSWNVDLPLRDLAAQKFPRGTALPASYPAVVLER*
JGI25389J43894_108656423300002916Grasslands SoilVTSPDQLAACDEILVVSWGVDLXXRXRAAQEFPRLSTLEAYYPAVVLRR*
Ga0066680_1056155433300005174SoilTVISTPDQLNECDMVVVVSWNVDLPLRDLAAQKFPRGTALPASYPAVVLER*
Ga0066680_1086570233300005174SoilVTRPEQLHDCDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER*
Ga0066684_1038191513300005179SoilCDAVVVVSWGVDLALRDQAAQEFPRLTTLEAYYPAVVLER*
Ga0066685_1019531413300005180SoilDCDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER*
Ga0066675_1028202663300005187SoilVVTRPEQLHDCDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER*
Ga0066388_10127314233300005332Tropical Forest SoilTVITRPEQLAACDAVVVVSWGVDLGLRDEAAREFPRLTTLPAYYPAVVLER*
Ga0066388_10377424113300005332Tropical Forest SoilSGFAVVTDPSQLGGCDAVVVVSWGVDLPLRDEAARQLPKLTTLPAYYPAVVLER*
Ga0070703_1007606013300005406Corn, Switchgrass And Miscanthus RhizosphereRADQLPGCDAVVVVSWGVDLGLRDEAARQFPRLRTLEAYYPAVVLER*
Ga0070705_10097288933300005440Corn, Switchgrass And Miscanthus RhizosphereDQLADCDAVVVVSWNVDLPLRDLAAAQFPSATALQAYYPGVVLER*
Ga0066687_1001337183300005454SoilVVTRADQLSDCDAVVVVSWGVDLALRDQAARQFPRLRTLAAYYPAVVLER*
Ga0066687_1044460123300005454SoilVVTSPDQLASCDEIVLVSWGVDPGLRDRAAQEFPRRSRLEAAYPAVVLRR*
Ga0070699_10080414313300005518Corn, Switchgrass And Miscanthus RhizosphereGCDVVVVVSWGVDLALRDQAAEEFPRWTTLQAYYPAVVLER*
Ga0066697_1030722353300005540SoilGGCDAVVVVSWGVDLALRDEAAREFPSLTTLPAYYPAVVLRR*
Ga0070696_10167426133300005546Corn, Switchgrass And Miscanthus RhizosphereVVTSSDQLAQCDAVVVVSWGVDLALRDEAARQFPKLRTLDAYYPAVVLER*
Ga0066670_1018709313300005560SoilFDVVTNADQLEHCDVVVVVSWGVDLGLRDEAARRFPGLTTLPAYYPAVVLRR*
Ga0066703_1001108913300005568SoilQLADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLAR*
Ga0066705_1004140673300005569SoilTEFTVVTRPEQLAGCDAVVVVSWGVDLGLRDEAARQFPKLTILDAYYPAVVLVR*
Ga0066798_1020032233300005980SoilAVVVVSWAVDLQLRDLAAQEFPRATLLQASYPAVVLER*
Ga0066651_1070711013300006031SoilCDAVVVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER*
Ga0066652_10208284933300006046SoilCDAVVVVSWGVDLGLRDEAAREFPKVTTLDAYYPAVVLVR*
Ga0075024_10048723613300006047WatershedsCTVVVVVSWNVDLTLRDLAAQEFPRRTALPAYYPAVVLER*
Ga0075028_10085635713300006050WatershedsLAACDAVVVVSWNVDITLRNEAAREFPTATVLPAYYPAVVLEK*
Ga0075021_1026590653300006354WatershedsQLAECDAVVIVSWNVDLAVRGEAAGEFPRATALQAYYPAVVLER*
Ga0079221_1131401913300006804Agricultural SoilLAGCDAVVVVSWGVDLALRDEAARQFPSLTTLPAYYPAVVLRR*
Ga0075426_1059239613300006903Populus RhizosphereDAVVVVSWGVDLPLRDEAARRFPSLTTLPAYYPAVVLRRQG*
Ga0075436_10051814543300006914Populus RhizosphereDQLDRCDAVVVVSWGVDLPLRDEAARRFPSLTTLPAYYPAVVLRREG*
Ga0099793_1039045513300007258Vadose Zone SoilLSGCDAVVVVSWNVDLEIRDLAAQQFPRRTILEAYYPAVVLER*
Ga0099828_1101689433300009089Vadose Zone SoilVMSADQLTGCDAVVVVSWGVDLALRDEAARQFPSLTTLPAYYPAVVLRR*
Ga0099792_1106952933300009143Vadose Zone SoilCDAVVVVSWNVDLALRDAAAQQFPRRTILPAYYPGVVLER*
Ga0075423_1049496153300009162Populus RhizosphereAGCDAVVVVSWGIDLGLRDQASQEFPRLTTLPAYYPAVVLER*
Ga0105856_132787813300009662Permafrost SoilAVVVVSWNVDLALRDAAALEFPRRTILQAYYPAVVLEH*
Ga0134082_1054596623300010303Grasslands SoilGFTVVTSPDQLAACDEIVVVSWGVDLGLRDRAALEFPRLSTLEAYYPAVVLRR*
Ga0134084_1027393213300010322Grasslands SoilRPDQLAQCDAVVVVSWGVDLGLRDEAARQLPKLRTLDAYYPAVVLER*
Ga0126378_1235206523300010361Tropical Forest SoilKFVVVTRPDQLAACDAVVVVSWGVDLGLRAEAAREFPRVTTLPAYYPAVVLER*
Ga0137393_1007493293300011271Vadose Zone SoilVVVVSWNVDLTLRDLAAREFLRRTALPAYYPAVVLER*
Ga0137393_1053667713300011271Vadose Zone SoilDFKVVTSADQLAGCDAVVVVSWGVDLALRDEAARQFPGLTTLPAYYPAVVLRR*
Ga0137389_1056941713300012096Vadose Zone SoilFAIVSSPDQLNGCDAVVVVSWNVDLALRNLAAQEFPRATKLAAGYPAVVLER*
Ga0137364_1014109763300012198Vadose Zone SoilADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLVR*
Ga0137399_1169927213300012203Vadose Zone SoilFKVVSTANELSGCDAVVVVSWNVDLGIRDLAAQQFPRRTILQAYYPAVVLER*
Ga0137380_1089738313300012206Vadose Zone SoilRPEQLKDCDAVVVVSWGVDLALRDQAAQEFPRLTTLEAYYPAVVLER*
Ga0137380_1130748813300012206Vadose Zone SoilVVVVSWGVDLALRDEAARQLPKLRTLDAYYPAVVLER*
Ga0137380_1139612713300012206Vadose Zone SoilFAVVTRPDQLAGCDAVVVVSWGVDLGLRDQAAQAFPRLTTLQAYYPAVVLER*
Ga0137376_1020094113300012208Vadose Zone SoilRPEQLADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLVR*
Ga0137376_1116461813300012208Vadose Zone SoilVVSWNVDLALRDQAAAEFPRATALQAYYPGVVLER*
Ga0137377_1057439043300012211Vadose Zone SoilCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLVR*
Ga0137370_1060959933300012285Vadose Zone SoilGCDAVLVVRWGLDFAVRDGVARQFSILTTLPAYYPAVVLRR*
Ga0137385_1020405713300012359Vadose Zone SoilDQLDQCDLVVVVSWNVDLALRDQAAQEFPRRTLLPAYYPAVVLER*
Ga0137361_1088840233300012362Vadose Zone SoilVVVVSWNVDLPLRDLAAQQFPRRTLLPASYPAVVLGR*
Ga0137397_1090824933300012685Vadose Zone SoilGCDAIVVVSWGVDLELRDQAAQEFPRLTTLQAYYPAVVLER*
Ga0137395_1029590213300012917Vadose Zone SoilGDLAGCDEVVVVSWNVDLPLRDLAAQQFPRRTLLPASYPAVVLGR*
Ga0137394_1039498953300012922Vadose Zone SoilTNADELSRCDALVVVSWNVDLPLRDLAAQQFPRGTLLPAYYPAVVLER*
Ga0137394_1050920013300012922Vadose Zone SoilFRVVSTAKELSGCDAVVVVRWNVDLGIRDLAAQQFPRRTILQAYYPAVVLER*
Ga0164304_1116307933300012986SoilVVSWNVDLPLRDLAAEQFPKGKLLPAYYPAVVLER*
Ga0120155_109515713300013768PermafrostQVVVVSWNVDLALRDLAAAEFPRRTILPAYYNGVVLER*
Ga0120173_100176813300014031PermafrostDAVVVVSWNVDLALQDQAAQQFPRRTVLPAYYPAVVLER*
Ga0120125_114939823300014056PermafrostAGDLNGCDAVVVVSWNVDLALRDLAAAEFPRRTILPAYYNGVVLER*
Ga0134078_1007273643300014157Grasslands SoilVTRPEQLADCDAVVVVSWGVDLGLRDQAAQEFPRLTTLPAYYPTVVLKR*
Ga0132257_10199461123300015373Arabidopsis RhizosphereDCDAVVVVSWAVDLALRDEAARQFPRLRTLDAYYPAVVLER*
Ga0184618_1045140123300018071Groundwater SedimentKVVSSAGDLDGCDAVVVVSWNVDLELRDLAAQQFPRRTVLPAFYPAVVLEP
Ga0066662_1138784623300018468Grasslands SoilDAVVVVSWGVDLALRDQAAQEFPRLTTLEAYYPAVVLER
Ga0193725_104492813300019883SoilKCDAVVVVSWNVDLPLRDLAAQQFSRGTLLPAYYPAVVLER
Ga0193755_105752923300020004SoilVVSTPEQLADCDGVVVVSWNVDLALRDLAAAQFPRATALQAYYPGVVLER
Ga0193757_102546113300020008SoilGCSAVVVVSWNVDLALRDQAAAEFPRATALQAYYPGVVLER
Ga0210382_1001395083300021080Groundwater SedimentTTDFSVVSTPDQLAGCDAVVVVSWNVDLPLRDLAAEQFPRATALQAYYPGVVLER
Ga0210410_1100092333300021479SoilPADLAACDAVVVVSWNVDLALRDLAAQEFPHATVLGAYYPAVVLER
Ga0247667_102580713300024290SoilAVVVVSWNVDLALRDQAAREFPRATALGASYPAVVLER
Ga0207929_107682713300025505Arctic Peat SoilSPEDLNGCDAVVVVSWNVDLALRDLAAEQFPRRTILPAYYNGVLLER
Ga0207927_109936333300025579Arctic Peat SoilAVVVVSWNVDLALRAAAAEQFPRTTVLHAYYPAVVLER
Ga0207653_1012363813300025885Corn, Switchgrass And Miscanthus RhizosphereVVVVSWGVDLALRDEAARQFPSLTTLPAYYPAVVLER
Ga0207684_1039125413300025910Corn, Switchgrass And Miscanthus RhizosphereVVTAADQLAGCDAVVVVSWNVDLELRDLAAQEFPRRTLLPASYPAVVLGR
Ga0209027_130634413300026300Grasslands SoilTRPEQLAGCDAVVVVSWGVDLGLRDEAARQFPKLTILDAYYPAVVLVR
Ga0209238_110883643300026301Grasslands SoilDCDAVVVVSWGVDLALRDQAARQFPRLRTLAAYYPAVVLER
Ga0209761_107008013300026313Grasslands SoilDQLNECDMVVVVSWNVDLPLRDLAAQKFPRGTALPASYPAVVLER
Ga0209131_131388013300026320Grasslands SoilKDFSVVATPEQLAACSAVVVVSWNVDLALRDQAAAEFPRATALQAYYPGVVLER
Ga0209803_117887613300026332SoilVVSWGVDLGLRDEAAREFPRLTTLQAYYPAVVLER
Ga0209803_131029413300026332SoilVVTRPEQLKDCDAVVVVSWGVDLALRDQAAQEFPRLTTLEAYYPAVVLER
Ga0209804_132966723300026335SoilVVSWGVDLALRDQAAQEFPRLTTLEAYYPAVVLER
Ga0257165_105718933300026507SoilGCDAVVVVSWNVDLALRNLAAQEFPRATTLGAGYPAVVLER
Ga0209059_121085433300026527SoilTVVTRPEQLRDCDAVVVVSWGVDLALRDEAAREFPRLTRLEAYYPAVVLER
Ga0209806_121957233300026529SoilPDQLADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLAR
Ga0209157_116220513300026537SoilLADCDAVVVVSWGVDLGLRDEAAREFPKLTTLDAYYPAVVLVR
Ga0209376_138150013300026540SoilGCDAVVVVSWGVDLALRDEAAREFPSLTTLPAYYPAVVLRR
Ga0209219_113197333300027565Forest SoilVVVVSWNVDLTLRDLAAQAFPRRTLLPAYYPAVVLGR
Ga0209076_107913053300027643Vadose Zone SoilQLAGCSAVVVVSWNVDLALREQAAAEFQRATALQAYYPGVLLER
Ga0208981_106757113300027669Forest SoilDQLASCDTVVVVSWNVDLALRDLAAQEFPRATALQANYPAVVLEK
Ga0208981_116526923300027669Forest SoilKVVERADQLDACSGVVVVSWNVDLTLRDLAAQAFPRRTLLPAYYPAVVLGR
Ga0208991_118749823300027681Forest SoilRSSPDQLASCDTVVVVSWNVDLALRDLAAQEFPRATALQANYPAVVLEK
Ga0209073_1010627053300027765Agricultural SoilSDFAVVTTPDQLAGCDVVVVVSWGVDLALRDQAAEEFPRWTTLQAYYPAVVLER
Ga0209068_1012628213300027894WatershedsTSADQLAECDAVVIVSWNVDLAVRGEAAGEFPRATALQAYYPAVVLER
Ga0209069_1057815733300027915WatershedsCTVVVVVSWNVDLTLRDLAAQEFPRRTALPAYYPAVVLER
Ga0137415_1125869913300028536Vadose Zone SoilVVTTAGELAGCDEVVVVSWNVDLALRDLAAQEFPRGTLLPAGYPAVVLGR
Ga0307305_1018929813300028807SoilAGDLDGCDAVVVVSWNVDLGLRDLAAQRFPRRTALRASYPAVVLEP
Ga0307296_1002574313300028819SoilVSSAGDLDGCDAVVVVSWNVDLGLRDLAAQRFPRRTALRASYPAVVLEP
Ga0307475_1078471413300031754Hardwood Forest SoilYSIVFNPDQLAACDAVVVVSWNVDLALRDLAAQEFTRATKLGASYPTIVLER
Ga0318504_1041867013300032063SoilVVSSPDQLAACDAVVVVSWNVDLALREEAARQFPRLRTLEAYYPAVVLER
Ga0307471_10045628113300032180Hardwood Forest SoilPDQLSACDMVVVVSWNVDLPLRDLAAQEFPHGTLLPASYPAVVLER
Ga0334722_1003161013300033233SedimentVISSPEQLASCDAVVVVSWNVDLALRDEAARELPQATSLPAYYPAVVLKRQPPS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.