NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101106

Metagenome / Metatranscriptome Family F101106

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101106
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 37 residues
Representative Sequence VLHWKPRLVAIAAVLALVLVALGGLGIEVDYNLYW
Number of Associated Samples 91
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.76 %
% of genes near scaffold ends (potentially truncated) 21.57 %
% of genes from short scaffolds (< 2000 bps) 83.33 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(20.588 % of family members)
Environment Ontology (ENVO) Unclassified
(23.529 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(48.039 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 46.03%    β-sheet: 0.00%    Coil/Unstructured: 53.97%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035VLHWKPRLVAIAAVLALVLVALGGLGIEVDYNLYWSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Natural And Restored Wetlands
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Serpentine Soil
Surface Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Sub-Biocrust Soil
Soil
Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Arabidopsis Rhizosphere
Corn Rhizosphere
Tabebuia Heterophylla Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Arabidopsis Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
3.9%20.6%3.9%6.9%5.9%2.9%2.9%3.9%4.9%2.9%2.9%3.9%3.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FACENCA_47143202035918004SoilGEVDVLHWKPRLVALTALLALVLIALAGLGTEIAYNLYW
ICCgaii200_091431532228664021SoilVLHWKPRLVAIAVVLALVLVALGGLGIEVSENYNLYW
JGI10216J12902_10547701913300000956SoilLHWSPRLVALAAALALVLLVLGGLGFEEPYNLYW*
C688J18823_1027097123300001686SoilLTLLHWKPRLVALAAVLALVLVALAGLGIELTYNLYW*
Ga0070700_10094612313300005441Corn, Switchgrass And Miscanthus RhizosphereQGRLKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0073909_1000593753300005526Surface SoilVLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW*
Ga0073909_1003793833300005526Surface SoilMLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0070672_10098156213300005543Miscanthus RhizosphereLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0068856_10007138723300005614Corn RhizosphereVLHWKPRLVALTAVLALVLIALAGLGTEIAYNLYW*
Ga0068864_10020789723300005618Switchgrass RhizosphereTLGRLKLLHWSPRLVALAAVLALVLLALGGLGIEAPYNLYW*
Ga0066905_10079019523300005713Tropical Forest SoilLHWKPRLVAIAVILALVLVALGGLGIEVVESYNLYW*
Ga0068866_1054683223300005718Miscanthus RhizosphereLKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0068861_10029111323300005719Switchgrass RhizosphereVLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW*
Ga0066903_10013929323300005764Tropical Forest SoilVLHWKPRLVALTAVLALVLIALAGLGVEFAYNLYW*
Ga0066903_10062915823300005764Tropical Forest SoilVLHWKPRLVAIAVVLALVLVALSGLGIEVVESYNLYW*
Ga0066903_10433039713300005764Tropical Forest SoilSWGRFQVLHWKPRLVAIAVVLALVLVALGGLGIEVGEYYNLYW*
Ga0068863_10090603823300005841Switchgrass RhizosphereLKLLHWSPRLVALAAVLALVLLALGGLGIEAPYNLYW*
Ga0081455_1000540963300005937Tabebuia Heterophylla RhizosphereVLHWKPRLVAIAVVLALVLVALSGLGTEVVESYNLYW*
Ga0081540_100777223300005983Tabebuia Heterophylla RhizosphereVLHWKPRLVAIAVVLALVLVAFGGLGIEVGESYNLYW*
Ga0081540_103183113300005983Tabebuia Heterophylla RhizosphereVLHWKPRLVAIAVVLALVLVALGGLGIEVGEYYNLYW*
Ga0074054_1173922923300006579SoilLTLLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW*
Ga0066658_1094375923300006794SoilLLHWKPRLVALAAVLALVLVALAGLGIELTYNLYW*
Ga0079220_1170440623300006806Agricultural SoilVLHWKPRIVALVAVLALVAIVLAGLGVEISYILYW*
Ga0075433_1108886023300006852Populus RhizosphereRLKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0075434_10041063533300006871Populus RhizosphereVGRFTLLHWKPRLVAIAAVLALMLVALGGLAGELIEYNLYW*
Ga0105240_1069791723300009093Corn RhizosphereLIVLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW*
Ga0111539_1019414143300009094Populus RhizosphereLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW*
Ga0105247_1121825623300009101Switchgrass RhizosphereLHWSPRLVALAAVLALVLLALGGLGIEAPYNLYW*
Ga0126313_1001214653300009840Serpentine SoilLLHWKPRLVAIAAVLALVLVALAGLGYELAYNLYW*
Ga0126313_1046389523300009840Serpentine SoilVLHWSPRLVALAAALALILIAFGGLGIDEPYNLYW*
Ga0126313_1117282623300009840Serpentine SoilVLHWKPRLVVIAAVLALVLVALGGLGIEVDYNLYW*
Ga0126305_1054675913300010036Serpentine SoilMLHWKPRLVAIAAVLALVLVALGGLGIEVDYNLYW*
Ga0126309_1006300823300010039Serpentine SoilMLHWNPRLVALTAAVVLILVALAGLGIETAYNLYW*
Ga0126312_1013037023300010041Serpentine SoilVLHWKPRLVAIAAVLALVLVALGGLGIEVDYNLYW*
Ga0126382_1034790623300010047Tropical Forest SoilLHWKPRLVAIAVVLVLVLIALGGLGIEVDYNLYW*
Ga0126382_1096579323300010047Tropical Forest SoilCNPSWGRFQVLHWKPRLVAIAIVLALVLVALSGLGIEVVESYNLYW*
Ga0126382_1218738923300010047Tropical Forest SoilLHWKPRLVAIAVVLALVLVALSGLGIEVVESYNLYW*
Ga0126376_1082394113300010359Tropical Forest SoilVLHWKPRLVALTAVLALVLIALGGLGTEIAYNLYW*
Ga0126372_1041587823300010360Tropical Forest SoilVLHWKPRLVALTALLALVLIALAGLGAEIAYNLYW*
Ga0134125_1126535923300010371Terrestrial SoilLGRLKLLHWRARLIALAAVLALILVALGGLGIEVPYNLYW*
Ga0138513_10003365513300011000SoilLLHWKPRLVAIAVVLALVLIALGGLGIEVDYNLYW*
Ga0137376_1055674533300012208Vadose Zone SoilLKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLY
Ga0157321_102384223300012487Arabidopsis RhizosphereVGRFTLLHWKPRLVAIAAVLALMLVALGGLAVELIEYNLYW*
Ga0157339_103070023300012505Arabidopsis RhizosphereLLHWKPRLVAIAAVLALMLVALGGLAVELIEYNLYW*
Ga0157304_104044923300012882SoilLHWKPRLVAIAVVLALVLVALGGLGIEVDYNLYW*
Ga0157299_1000987123300012899SoilLHWKPRLVASAVVLALVLVALGGLGIEVGENYNLYW*
Ga0157292_1041137823300012900SoilLLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW*
Ga0157296_1000831623300012905SoilLHWKPRLVAIAVVQALVLVALGGLGIEVGENYNLYW*
Ga0157302_1037874623300012915SoilVLHWKPRLVALTALLALVLIALAGLGTEIAYNLYW*
Ga0126375_1013582323300012948Tropical Forest SoilLLHWKPRLVAIAVVLVLVLIALGGLGIEVDYNLYW*
Ga0126375_1211234613300012948Tropical Forest SoilLHWKPRLVAIAVVLALVLVALGGLGIEVGEYYNLYW*
Ga0182008_1058032213300014497RhizosphereLLHWKPRLVAIAAVLALVLLALGGLGIEVDYNLYW*
Ga0173483_1047900023300015077SoilLLHWKPRLVAIAVVLALVLVALGGLGIEVDYNLYW*
Ga0182007_1012820413300015262RhizosphereKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW*
Ga0132258_1362217223300015371Arabidopsis RhizosphereWGRFTLLHWKPRLVAIAVVLVLVLVALGGLGIEVDYNLYW*
Ga0132256_10042533513300015372Arabidopsis RhizosphereLLHWKPRLVAIAVVLVLVLVALGGLGIEADYNLYW*
Ga0132257_10003069963300015373Arabidopsis RhizosphereLLHWKPRLVAIAVVLVLVLVALGGLGIEVDYNLYW*
Ga0132257_10088492613300015373Arabidopsis RhizosphereVGRFTLLHWKPRLVAIAAVLALMLVALGGLAVELVEYNLYW*
Ga0163161_1160208423300017792Switchgrass RhizosphereLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW
Ga0184605_1000761443300018027Groundwater SedimentLTLLHWSPRLVALAAALALIVIAFGGLGIDEPYNLYW
Ga0184608_1014094523300018028Groundwater SedimentMLHWNPRLVALAAVIALILIALGGLGIETDYNLYW
Ga0184635_1021042123300018072Groundwater SedimentMLHWNPRLVALVAVIALILIALGGLGIETDYNLYW
Ga0184624_1004493123300018073Groundwater SedimentLLHWKPRLVAIAVVLALVLIALGGLGIGVDYNLYW
Ga0066667_1105285623300018433Grasslands SoilLTLLHWKPRLVALAAVLALVLVALGGLGVELPYNLYW
Ga0190269_1104499113300018465SoilLTLLHWSPRLVALAAALILIAFGGLGIDEPYNLYW
Ga0184642_160331423300019279Groundwater SedimentLTLLHWSPRLVALAAALALIAIAFGGLGIDEPYNLYW
Ga0173481_1013898623300019356SoilVLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW
Ga0193720_106086823300019868SoilMLHWSPRLVALAAALALIVLALGGLGIETFYNLYW
Ga0193701_103159523300019875SoilLKLLHWSPRLVALAAVLVLVLIALGGLGIELTYNLYW
Ga0193755_109121913300020004SoilGRLTLLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW
Ga0193745_102765223300020059SoilMLHWKPRLVVLAAILALVLVALGGLGFEGSLSGYNLYW
Ga0182009_1046449223300021445SoilLLHWKPRLVAIAAVLALVLLALGGLGIEVDYNLYW
Ga0247752_100578523300023071SoilASCNPSWGRFQVLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW
Ga0247794_1001085423300024055SoilLLHWKPRLVAIAVVLALVLVALGGLGIEVDYNLYW
Ga0247681_102889323300024310SoilLKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW
Ga0210142_103264223300025552Natural And Restored WetlandsLTLLHWSPRLVALVAVLALVLIALGGLAESVTYNLYW
Ga0207642_1008706523300025899Miscanthus RhizosphereLIVLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW
Ga0207688_1050618623300025901Corn, Switchgrass And Miscanthus RhizosphereRSKLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW
Ga0207657_1015762123300025919Corn RhizosphereVLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW
Ga0207691_1042640723300025940Miscanthus RhizosphereLLHWSPRLVALVAVLALVLIALGGLAEELTYSLYW
Ga0207639_1005978523300026041Corn RhizosphereVLHWRPRVVALVAVLALVLIALGGLAEELTYNLYW
Ga0207676_1014117623300026095Switchgrass RhizosphereLKLLYWSSRLVALAAVLALVLLALGGLGIEAPYNLYW
Ga0209074_1047652413300027787Agricultural SoilLLHWKPRLVAIAVVLVLVLVALGGLGIEVDYNLYW
Ga0209811_1008784123300027821Surface SoilMLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW
Ga0209465_1062300923300027874Tropical Forest SoilVLHWKPRLVAIAIVLALVLVALSGLGIEVVESYNLYW
Ga0209382_1021293513300027909Populus RhizosphereQVLHWKPRLVAIAVVLALVLVALGGLGIEVGENYNLYW
Ga0307293_1024355413300028711SoilLTLLHWSPRLVALAAALALIAVVLGGLGIEEPFNLYW
Ga0307285_1002352013300028712SoilSAVGRFTMLHWNPRLVALAAVIALILIALCGLGIETDYNLYW
Ga0307318_1002253823300028744SoilLTLLHWSPRLVALAAALALILIAFGGLGIDEPYNLYW
Ga0307280_1037387513300028768SoilLLHWSPRLVALVAVLALVLIALGGLAEELTYNLYW
Ga0307290_1003115023300028791SoilLTLLHWSPRLVALAAVLVLVLIALGGLGIELTYNLYW
Ga0307284_1004497533300028799SoilLTLLHWSPRLVALAAALALILIALGGLGIDEPYNLYW
Ga0307314_1007422323300028872SoilTMLHWNPRLVALAAVIALILIALGGLGIETDYNLYW
Ga0307278_1018605513300028878SoilSLLHWSPRLVALAAALALILIAFGGLGIDEPYNLYW
Ga0247826_1159724413300030336SoilLTLLHWSPRVIALVAVLALVLIALGGLAEELTYNLYW
Ga0268241_1001363023300030511SoilLLHWRPRLVALAAVIALVLLALGGLGIEVDYNLYW
Ga0308197_1007900423300031093SoilVSTDRARLVALAAVIALILIALGGLGIETDYNLYW
Ga0308175_10008165623300031938SoilLKLLHWSPRVVALVAVLALVLIALGGLAEELTYNLYW
Ga0308175_10032536223300031938SoilVGRFTLLHWKPRLVAIAAVLALVLLALGGLGIEVDYNLYW
Ga0307409_10011955423300031995RhizosphereLLHWKPRLVATTAVLALVLVALGGLGIEVDYNLYW
Ga0308176_1147316823300031996SoilSAKAPRWRFHMLHWKPRLVAIAVVLALVLVALGGLGIEVDYNLYW
Ga0334913_018536_852_9593300034172Sub-Biocrust SoilMLHWNPRLIALAAAFALVLIALCGLGFEAVYNLYW


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.