NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103762


Overview

Basic Information
Family ID F103762
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 47 residues
Representative Sequence MILLITPSATGPQCAESLYAATGQETHWAQSLQEAATRLREQTFSAAVI
Number of Associated Samples 93
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 70.30 %
% of genes near scaffold ends (potentially truncated) 99.01 %
% of genes from short scaffolds (< 2000 bps) 92.08 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53


Most Common Taxonomy
Group Bacteria (79.208 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (7.921 % of family members)
Environment Ontology (ENVO) Unclassified (19.802 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (47.525 % of family members)
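
The percentages reported above can be converted back to whole numbers of family members as a quick consistency check. The sketch below is only illustrative: it assumes every percentage uses the 101 family sequences as its denominator, which the page does not state explicitly, and the dictionary labels and the helper name `percent_to_count` are made up for this example.

```python
# Assumption: all percentages on this page are computed over the
# 101 sequences of family F103762.
FAMILY_SIZE = 101

# Values copied from the Quality Assessment, Most Common Taxonomy and
# Most Common Ecosystem blocks; the labels are shortened for illustration.
reported_percentages = {
    "valid RBS motif": 70.30,
    "near scaffold end": 99.01,
    "short scaffold (< 2000 bps)": 92.08,
    "Bacteria": 79.208,
    "GOLD: Forest Soil": 7.921,
    "ENVO: Unclassified": 19.802,
    "EMPO: Soil (non-saline)": 47.525,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole number of members."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption each percentage lands very close to an integer count (for example, 79.208 % corresponds to 80 of 101 members), which suggests the figures are mutually consistent.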




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family. The member IDs are listed with their sequences under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 28.57%, β-sheet: 0.00%, Coil/Unstructured: 71.43%

Predicted 3D Structure

pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
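
The low-confidence rule quoted above (pTM < 0.7) can be written as a one-line check. This is a minimal sketch: the constant and function names are hypothetical, and only the 0.7 cutoff and the 0.53 score come from this page.

```python
# Threshold quoted on this page: models with pTM < 0.7 are low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


# pTM-score reported for family F103762.
print(is_low_confidence(0.53))
```

With the reported pTM-score of 0.53, the check flags this family's model as low confidence, matching the note above.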



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 79.2%
Unclassified: 20.8%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart legend, in displayed order: Peatland; Bog; Peatland; Freshwater Sediment; Watersheds; Vadose Zone Soil; Tropical Forest Soil; Surface Soil; Peatlands Soil; Soil; Grasslands Soil; Soil; Forest Soil; Soil; Hardwood Forest Soil; Soil; Rice Paddy Soil; Tropical Peatland; Soil; Tropical Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Palsa; Biofilm; Corn Rhizosphere; Populus Rhizosphere; Miscanthus Rhizosphere; Rhizosphere
Labeled slice shares (slice-to-habitat mapping not preserved): 5.9%, 7.9%, 4.0%, 5.0%, 7.9%, 7.9%, 5.0%, 3.0%, 7.9%, 4.0%, 3.0%, 6.9%, 4.0%, 6.9%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPgaii200_09824872 | 2228664022 | Soil | MILLITPQSKGPEFAAALLAATSQEVHWAQTLREA
JGI12270J11330_100930032 | 3300000567 | Peatlands Soil | MILLITPSASGPQCADSLRAATSRETHWAKTLQEGATRLREQTYSAA
JGI12679J13547_10025611 | 3300001174 | Forest Soil | MILLITSSAKGPQCIDCLHAATGLETQWAQSLQEAATRLREQTYSAVVID
C688J18823_104146051 | 3300001686 | Soil | MEEVMILLITPQSRGPEFASSLEAATSQETHWAQNLQEAATRLREQTY
C688J35102_1196312052 | 3300002568 | Soil | MEEVMILLITPQSRGPEFASSLEAATSQETHWAQNLQEAATRLREQTYSAAV
Ga0058896_13626632 | 3300004101 | Forest Soil | MILLITPSARGPQCADSLKAATGEETHWAQNAHAASTHLRQQS
Ga0066679_105249611 | 3300005176 | Soil | MILLITPQSKGPEFAAALLAATSQETHWAQNLQEAATRLREQTYSAAIIDQFL
Ga0066690_100213691 | 3300005177 | Soil | MILLITPQSKGPEFAAALLAATSQETHWAQNLQEAATRLR
Ga0066684_100579341 | 3300005179 | Soil | MILLITPSSRGPECAACLTAETSQETHWAQSLQAAATHLREE
Ga0070734_104086661 | 3300005533 | Surface Soil | MILLITPQARGPECAEALFAATSQEVHWAPSLKEAATRLRE
Ga0070732_104300931 | 3300005542 | Surface Soil | MILLITSSARGQQCVDALKASTGKDTHWAQTLQEAASRLREQTY
Ga0068855_1003142881 | 3300005563 | Corn Rhizosphere | MILLITPSSRGPECAACLTAETSQETHWAQSLQAAATRLREETYAVAVIDQLLLETE
Ga0068856_1019890992 | 3300005614 | Corn Rhizosphere | MILLITPSARGPECTQCLFAETSQETHWAQSLQEGVTHLREQA
Ga0075288_10120382 | 3300005874 | Rice Paddy Soil | MILLITPSARGPECTQCLFAETSQETHWAQSVQEGVTHLREQAYVVAVID
Ga0075019_103347733 | 3300006086 | Watersheds | MILLITPQSKGPEFVAALFAATSQETHWAQNLQEAAT
Ga0075030_1011759242 | 3300006162 | Watersheds | MILLITASARGQQCADSLQAATGEETHWAQNLQEAATRLREQTYTAAVID
Ga0070716_1008280231 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MILLITPQSKGPEFAAALLAATSQETHWAQNLQEAATHLREQTYSA
Ga0070712_1017626141 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MVKIMILLITPQSRGPEFADALLAATSQETHWAQNLQEASTRLREQTYSAA
Ga0075435_1015964381 | 3300007076 | Populus Rhizosphere | MILLITPQSRGPELARLINEATSQETHWAQSVQQASTQLRENAYSAAVIDQFLL
Ga0066710_1015811352 | 3300009012 | Grasslands Soil | MILLITPQSKGPEFAAALLAATSQETHWAQNLQEAATRLREQTYSAA
Ga0099792_110956902 | 3300009143 | Vadose Zone Soil | MILLITPSARGQQCVEALKAATGRETHWAQNLPEAATRMRQQTYSAAVIDQFLVETEPEE
Ga0116218_11582311 | 3300009522 | Peatlands Soil | MGKGMILLITSSASGPQCADSLYVATGQETHWAPNL
Ga0116111_10856682 | 3300009616 | Peatland | MILLITSSASGPQCADALGAATGLETHWAHTLQEAATRLREQ
Ga0116133_10963512 | 3300009623 | Peatland | MILLITPSAGGPQCAEALHAATSQDVHWAQTLQEGA
Ga0116117_10458471 | 3300009635 | Peatland | MILLNTPSASGPQCAESLRAATGQETHWAQTLQEAASRLREQTYSAAVIDQ
Ga0116216_108307161 | 3300009698 | Peatlands Soil | MILLITPSASGPQCADSLRAATSRETHWAKTLQEGATR
Ga0116134_12635592 | 3300009764 | Peatland | MILLITPSANGPQCADSLRAATGRETHWAKSLQEAATRLR
Ga0126372_126066491 | 3300010360 | Tropical Forest Soil | MILLITPQSKGPEFSDALFAATTQETHWARNLQQAATRLREQPYAVAVI
Ga0136449_1026042902 | 3300010379 | Peatlands Soil | MGKGMILLITSSASGPKCAESLYVATGQETHWAPTLQAA
Ga0150983_124991832 | 3300011120 | Forest Soil | MILLITSSASGPQCADSLLAATSQETHWAQTVQQAATRLREQTYTAAV
Ga0137380_101361981 | 3300012206 | Vadose Zone Soil | MILLITSSSSGPQCADSLYAATGQETQWAHTLQEAATRLREQTYSAAVIDQFLLEND
Ga0137384_112399632 | 3300012357 | Vadose Zone Soil | MILLITSSASGSQCADSLYSATGQKTDWAQSLQEGATHLREQAYSAAVI
Ga0137358_108082482 | 3300012582 | Vadose Zone Soil | MILLITSSARGQQCADTLKVATGNDTHWAQNLQEATTRLREQTYSAAVIVQFILATEPEE
Ga0181530_105344551 | 3300014159 | Bog | MILLITPSARGQQCAESLHSATGKETRWAQNLQQAVTLLREQAYSAAVIDQFLLETE
Ga0157376_131396841 | 3300014969 | Miscanthus Rhizosphere | MILLITPSARGPECTQCLFAETSQETHWAQSLQEGVTHLREQAYAVA
Ga0137403_111271401 | 3300015264 | Vadose Zone Soil | MILLITPSARGQQCVDALKVATGNDTHWAQTLQEAAG
Ga0182038_107109491 | 3300016445 | Soil | MILLITPQSRGPELARLINEATSQETDWAQTVQQAATQLRENAYSAVVIDQFVLETEPDEGD
Ga0181505_101369962 | 3300016750 | Peatland | MILLVTPSANGPQCVETLRAATGQETHWAKNLQEAATRLREQTYSAAVIDQFLLET
Ga0187818_100600472 | 3300017823 | Freshwater Sediment | MILLITPSANGQQCAESLYAATGQDTHWAQTLQEGATRLREQ
Ga0187806_11151901 | 3300017928 | Freshwater Sediment | MILLITPQSKGPEFAAALFAATSQETHWAQNLQEAARRLREQTY
Ga0187801_101718372 | 3300017933 | Freshwater Sediment | MILLITPQSRGPECATALFAATSQETTWVSTLQEAATRLREQTY
Ga0187801_102307721 | 3300017933 | Freshwater Sediment | MILLVTPSARGQQFADSLHAATGEETHWAETPQQAATRLREQTYSAAVIDQF
Ga0187801_103244612 | 3300017933 | Freshwater Sediment | MILLITPSATGPQCAESLYAATGQETHWAQSLQEAATRLREQTFSAAVI
Ga0187803_101974012 | 3300017934 | Freshwater Sediment | MILLITPSANGQQCAESLYAATGQDTHWAQTLQEGA
Ga0187819_108531051 | 3300017943 | Freshwater Sediment | MILLITPSATGPQCAESLYAATGQETHWAQSLQEAATCLREQTFSAAVIDQF
Ga0187776_105259141 | 3300017966 | Tropical Peatland | MILLITPSARGQQCADSLQAATGEETRWAQNLQEAATRLREGSYSA
Ga0187783_109427072 | 3300017970 | Tropical Peatland | MGKGMILLITPLARGQECAQTLQAATGEETCWAKTLQEATTSLREQ
Ga0187782_100021971 | 3300017975 | Tropical Peatland | MSMILLITPSARGQQCSESLHAATGEETHWAQKLQDAATRLRQDT
Ga0187816_105265581 | 3300017995 | Freshwater Sediment | MILLITPSANGQQCAESLYAATGQDTHWAQTLQEG
Ga0187880_12725202 | 3300018016 | Peatland | MILLITPSARGQQCAESLHSATGKETRWAQNLQQAVTLLREQAYSAAVIDQFLL
Ga0066662_113227212 | 3300018468 | Grasslands Soil | MILVITSSASGRQCAEAIGAATGRVTQWASSLQQAVTSLREQTYSAAV
Ga0066662_128654792 | 3300018468 | Grasslands Soil | MILLITPSASGAECAEALQAATSRETHWATTLQEAATRL
Ga0210396_106099421 | 3300021180 | Soil | MILLITPSANGPQCAETLRTATGRETHWAKNLQEAAARLREQTYSAAVIDQFLL
Ga0210396_108945202 | 3300021180 | Soil | MILLITSSASGPQCAESLGVATGQETHWAPTLQAAATRLREQTYSAAVI
Ga0210393_110341002 | 3300021401 | Soil | MILLITPSANGPQCVEILRAATGRETHWAKNLHEAATRLREQTYSAAVIYQFLV
Ga0210393_116599052 | 3300021401 | Soil | MILLITSSASGPQCAESLGVATGQETHLAPTLQAAA
Ga0210383_106758741 | 3300021407 | Soil | MILLITSSATGTQCAESLHAATSQDVHWAQTPQDGAARLREQVYSVAVIDQFLLET
Ga0210391_105261571 | 3300021433 | Soil | MILLITPSARGQQCADSLNVATGLSTHWAHNLQEAATRLREQTY
Ga0187846_102575991 | 3300021476 | Biofilm | MILLITPSARGPECADSLQAATGRETHWAQTLQAAANR
Ga0208194_10726632 | 3300025412 | Peatland | MRKGMILVITPSASGPQCCDSLHAATGQETYWAKTLQEASSRLREQTYSTAVIDQFLLET
Ga0208690_10257111 | 3300025434 | Peatland | MILLNTPSASGPQCAESLRAATGQETHWAQTLQEAASRLREQTYSAAV
Ga0207646_118650482 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MILLITSSANGPQCADSLHAATGQETQWAQTLQEAATRLREQTYTAAV
Ga0207700_101778642 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MILLITPSSRGPECAACLTAETSQETHWALSLQAAATRLR
Ga0207664_113617832 | 3300025929 | Agricultural Soil | MILLITPQSRGPELASLVQANTSQETQWAQTVQEAATRLRERPYSAAIIDQFLLET
Ga0208415_10329502 | 3300025993 | Rice Paddy Soil | MILLITPSARGPECTQCLFAETSQETHWAQSVQEGVTHL
Ga0209804_11121411 | 3300026335 | Soil | MILLITPQSKGPEFAAALLAATSQETHWAQNLQEAA
Ga0207726_10332512 | 3300027045 | Tropical Forest Soil | MILLITPSASGLQCASSLLAATGRETHWAQTLHDAATRLREQLYSAAVIDQF
Ga0209525_10597822 | 3300027575 | Forest Soil | MILLITPSARGQQCADSLNVATGLSTHWAHNLQEAATRLREQTYVAVVVD
Ga0209106_11467991 | 3300027616 | Forest Soil | MILLITPSARGQQCVDTLQAATGRETHWAQNLAEAVTRLRQQTYSAAVIDQFLIETE
Ga0208044_10909852 | 3300027625 | Peatlands Soil | MILLITPSASGPQCADSLRAATSRETHWAKTLQEGATRLREQTYSA
Ga0209422_10250543 | 3300027629 | Forest Soil | MILLITSSAKGPQCIDCLHAATGLETHWAQSLQEAATRLREQTYSAVVIDQF
Ga0209811_104428482 | 3300027821 | Surface Soil | MRAMEMVMILLITPQSKGPEFAAALLAATSQETHWAQSLQEAATRLREQTYSAAVIDQF
Ga0209060_104171301 | 3300027826 | Surface Soil | MILLITSSASGPQCIESLHAATSQEIHWAHTLQEAAAHLRE
Ga0209580_100877532 | 3300027842 | Surface Soil | MILLITSSARGPECVDSLRAATGLETHWAQSLQEAATRLREQTYSA
Ga0209580_105796581 | 3300027842 | Surface Soil | MILLITSSAKGPQCVDSLRAATGLETHWAQSLQEAATRLREQTYSAVVIDQFLL
Ga0209166_101385271 | 3300027857 | Surface Soil | MILLITSSARGQQCADSLHAATGQETHWAQSLQEGATRLREQTYSAAVIDQF
Ga0209167_104338711 | 3300027867 | Surface Soil | MILLITPSVRGHECADALQLATGDETHWAHILQEAATRLREQNYTAVIIDQF
Ga0209624_100551311 | 3300027895 | Forest Soil | MGMILLITSSANGPQCAESLRSATGLETHWAQTLQEAATRLREQTYSA
Ga0209067_103829801 | 3300027898 | Watersheds | MILLITPQSKGPEFAGALFAATSQETHWAQNLQEAATRLREQTYSAAVIDQF
Ga0209415_100771941 | 3300027905 | Peatlands Soil | MILLITSSASGPKCVESLYVATGQETHWAPTLQAAATRLR
Ga0209415_108337182 | 3300027905 | Peatlands Soil | MGKGMILLITSSASGPKCAESLYVATGQETHWAPTLQAAATRLR
Ga0209698_111533521 | 3300027911 | Watersheds | MILLITPSARGQQCADALNAATGLETQWAQNLQQAATRLREQTFIAVVIDQFL
Ga0302224_102360801 | 3300028759 | Palsa | MILLNTPSSSGLQCAESLHAATGQETHWAKTLQEAATRLREHTYSAAVIDQFLLET
Ga0302303_102062231 | 3300028776 | Palsa | MILLITPSARGQECADALQLATGDQTHWAHILQEAATRLREQNYSAVIIDQ
Ga0311370_101111855 | 3300030503 | Palsa | MILLITPSARGQQCADSLNVATSLQTHWAQNLQEGVAR
Ga0311372_110043711 | 3300030520 | Palsa | MILLITSSARGEDCSSAVHDATGQETQWVQSIGEAATRLR
Ga0311355_115980582 | 3300030580 | Palsa | MILLITSSARGEDCSSAVHDATGQETQWVQSIGEAATRLREQTYSAAVIDQFL
Ga0302310_101982131 | 3300030737 | Palsa | MILLNTPSSSGLQCAESLHAATGQETHWAKTLQEAATRLREHTYSAAVIDQF
Ga0302310_105588293 | 3300030737 | Palsa | MGMILLITPSARGQQCADSLNVATSLQTHWAQNLQEGVARLREQTYTAAVIDQFLL
Ga0075394_111290401 | 3300030969 | Soil | MILLITSSANGPQCADALHAATGQETQWVQSLQEASTRLREQTYTAAVIDQFLL
Ga0073994_124160872 | 3300030991 | Soil | MILLITPSARGQQCVDTLQAATGRETHWAQNLAEAVTRLRQQTYSTAVIDQFLIETEPEK
Ga0170834_1052129352 | 3300031057 | Forest Soil | MILLVTSSASGPQCVEALHAATGQETQWAQTLQEAATRLREQ
Ga0265325_104174931 | 3300031241 | Rhizosphere | MILLITPSASGPKCAESLKAATGQETHWAQSLQEASTRLREQ
Ga0265340_100434533 | 3300031247 | Rhizosphere | MILLITPSANGQQCADSLYAATGQETHWAQTLQAGSTRLREQTY
Ga0307477_101039182 | 3300031753 | Hardwood Forest Soil | MSFTASACGQRCADSLQAATGKDTHWAQNLQEAATRLREQTYTARVID
Ga0307478_103340972 | 3300031823 | Hardwood Forest Soil | MILLITSSANGPQCAESLHAATGQETHWAQTLQAAATRLREQTY
Ga0311301_118616852 | 3300032160 | Peatlands Soil | MILLITASARGQQCADSLLAATGEDTHWAQNLQEAATRLREQTYTAAVID
Ga0335085_120231881 | 3300032770 | Soil | MILLITPQSRGPELARLIHEATSQETHWVQSVQQAASQLRENAYSAAVIDQFLLETDPDESD
Ga0335078_108900951 | 3300032805 | Soil | MILLITSSSVGFQCAETLYGETGQETHWAKSLQEAVTHLRERTY
Ga0335081_104486261 | 3300032892 | Soil | MILLITSSATGAQCAEAIHAATSQETHWAKTLQEAATR
Ga0335071_113244681 | 3300032897 | Soil | MILLITPSARGQQCADSLNAATGRETHWAQNLQEAATRLREQTYSAAVIDQFLLETEPEE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice