NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F069230

Metagenome Family F069230

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F069230
Family Type Metagenome
Number of Sequences 124
Average Sequence Length 42 residues
Representative Sequence KVRELRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS
Number of Associated Samples 103
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.81 %
% of genes near scaffold ends (potentially truncated) 96.77 %
% of genes from short scaffolds (< 2000 bps) 87.10 %
Associated GOLD sequencing projects 100
AlphaFold2 3D model prediction Yes
3D model pTM-score0.57

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (60.484 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(44.355 % of family members)
Environment Ontology (ENVO) Unclassified
(49.194 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(41.129 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 40.00%    β-sheet: 0.00%    Coil/Unstructured: 60.00%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540KVRELRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVSSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.57
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
60.5%39.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Bog
Freshwater Sediment
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Peatlands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Tropical Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Agricultural Soil
Miscanthus Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
4.0%8.9%44.4%9.7%4.0%3.2%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_10072430713300000956SoilLGARRADGLTYRELAAEFGISDTSAHAAVNRKTWVHVS*
Ga0062384_10085487413300004082Bog Forest SoilVGNPQAKLSDDKVRKLRSRRADGLTYRQLATEFGISDVSAYAAVHRKTWAHVT*
Ga0066675_1045477613300005187SoilGKVQQLRARRADGLTYQQLADEFGISDVSARSAVNGRTWAHVA*
Ga0066388_10290643913300005332Tropical Forest SoilRAKLTSGKVRELRARRADGLTYRQLAAEFGISDASACAAVNRKTWAHVS*
Ga0066388_10731233223300005332Tropical Forest SoilQLRARRADGLTYRQLAAEFGISDATACAAVNRKTWAHVS*
Ga0066388_10837386423300005332Tropical Forest SoilRAKLTSGKVRELRARRADGLTYRQLAAEFGISDASACAAANRKTWAHVS*
Ga0070682_10002166613300005337Corn RhizosphereKVRKLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA*
Ga0070709_1067733013300005434Corn, Switchgrass And Miscanthus RhizosphereNVRRLRARRADGLTYQQLADEFGISDVSAHAAVNRRTWAHVA*
Ga0070714_10011809843300005435Agricultural SoilAKLTSGKVRELRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS*
Ga0070713_10011525713300005436Corn, Switchgrass And Miscanthus RhizosphereRRLRARRADGLTYRQLAGEFGISDVSAHAAVNGRTWAHVA*
Ga0066903_10636387223300005764Tropical Forest SoilEGNRRAKLTSVRVRQLRARRADGLTYRQLAAEFGISDVAACAAVNRKTWAHVS*
Ga0068870_1011185613300005840Miscanthus RhizosphereKFEGYERGADGLTYRQLAGEFGISDVSAHAAVNGRTWAHVA*
Ga0070717_1014113223300006028Corn, Switchgrass And Miscanthus RhizosphereVGNPQAKLSDDSVRKLRSHRADGLTYRQLATEFGISDVSAYAAVHRKTWAHVT*
Ga0070716_10127707613300006173Corn, Switchgrass And Miscanthus RhizosphereLRARRADGLTYRQLAGEFGISDVSAHAAVNGRTWAHVA*
Ga0070765_10215104823300006176SoilKLTAGKVAELRARRADGLTYRQLADEFCISDVSAHAAVNRRTWAHVA*
Ga0079222_1050461823300006755Agricultural SoilRELRARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA*
Ga0066660_1043155413300006800SoilKLTDDKVRQLRALRADGMTYRQLAAEFGISDVSACAAVNRKTWAHVT*
Ga0105241_1004386043300009174Corn RhizosphereTAEKVRKLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA*
Ga0126374_1141991213300009792Tropical Forest SoilRRADGLTYRQLAAEFGISDVTASAAVNRKTWAHVN*
Ga0116219_1012576223300009824Peatlands SoilAKLTTRKVRQLRARRAQGLTYRQLAAEFGISDVSAYAAANRRTWAHVS*
Ga0126380_1000055573300010043Tropical Forest SoilLTSGKVRELRARRADGLTYRQLAAEFGISDVAACAAVNRKTWAHVS*
Ga0126376_1304153713300010359Tropical Forest SoilLRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS*
Ga0126372_1002686163300010360Tropical Forest SoilELRARRADGLTYRQLAAEFGISDVAACAAVNRKTWAHVS*
Ga0126372_1088716613300010360Tropical Forest SoilGKVRELRARRADGLTYRQLAAEFGISDTSAHAAVNRKTWAHVS*
Ga0126378_1047006813300010361Tropical Forest SoilAKLTSGKVRELRARRADGLTYRQLAAEFGISDVTASAAVNRKHWRM*
Ga0136449_10027409213300010379Peatlands SoilLSDDKVTKLRARRTDGLTYRQLAAEFGISDVSACAAVNRRTWAHVT*
Ga0126383_1059920113300010398Tropical Forest SoilVRQLRARRADGLTYRQLAAEFDISDVTACAAVNRKTWAHVN*
Ga0137383_1028025543300012199Vadose Zone SoilARRADGLTYRQLAVEFGISDVSACAAVNRKTWAHVS*
Ga0137365_1036921613300012201Vadose Zone SoilRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS*
Ga0137380_1043101813300012206Vadose Zone SoilRLRARRADGLTYRQLAGEFGISDVSAYAAVNRTTWAHVA*
Ga0137379_1014961613300012209Vadose Zone SoilKVRRLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA*
Ga0137371_1116934013300012356Vadose Zone SoilSRKVRQLRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS*
Ga0126369_1238034813300012971Tropical Forest SoilLRARRADGLTYRQLAAEFGISDVTACAAVNRKTWAHVN*
Ga0181523_1002340813300014165BogVRKLRALRAGGLTYRQLAAEFGISDVSACAAVNRKTWA
Ga0157380_1135498613300014326Switchgrass RhizosphereLTAEKVRRLRARRADGLTYRQLAGEFGISDVSAHAAVNGRTWAHVA*
Ga0182036_1177676213300016270SoilDKVRRLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0182041_1115350523300016294SoilRAKLTNGKVRQLRARRADGLTYRQLAAEFGISDVAACAAVNRKTWAHVT
Ga0182035_1194656123300016341SoilRARRAQGLTYRQLAADFGISDVSARAAVNGTTWRHIA
Ga0182034_1139573423300016371SoilRARRADGLTYRQLAAEFGISDVTARAAVNRKTWTHVT
Ga0187812_114723223300017821Freshwater SedimentKLTDEKVRGLRARRADGLTYQQLAAEFGISDVSACAAVNRKT
Ga0187806_131344023300017928Freshwater SedimentAKLTNAKVTELRARRADGLTYRQLAAEFGISDVSAWAAVNRRTWTHVN
Ga0187879_1056659113300017946PeatlandVRKLRALRAGGLTYRQLAAEFGISDVSACAAVNRKTWAHVP
Ga0187817_1029699913300017955Freshwater SedimentRRAQGLTYRQLAARFGISDVTAWAAVNGKTWRHIA
Ga0187780_1148969323300017973Tropical PeatlandKVRELRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS
Ga0187782_1159023313300017975Tropical PeatlandVRQLRARRADGLTYRQLAAEFGVSDASACAAVNRKTWAHVS
Ga0187773_1060105223300018064Tropical PeatlandAKLTAGKVRELRARRADGLTFRQLAREFGISDVSACAAVNRRTWAHVS
Ga0187772_1103079513300018085Tropical PeatlandRRADGLTYRQLAAEFGISDVSACAAVNRRTWVHVA
Ga0187770_1169334413300018090Tropical PeatlandLRARRADGLTYRQLAAEFGISDVSACAAVNRRTWVHVA
Ga0210395_1074394423300020582SoilRKVRQLRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS
Ga0210408_1038905313300021178SoilSPHAKLTAGNVRRLRARRADGLTYQQLADEFGISDVSAHAAVNRRTWAHVA
Ga0210408_1042871813300021178SoilVRRLRARRADGLTYQQLADEFGISDVSAHAAVNRRTWAHVA
Ga0210384_1041710333300021432SoilEKVRQLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0210390_1016507743300021474SoilDKVRTLRALRAGGLTYRQLAAEFGISDVSACAAVNRKTWAHVT
Ga0210402_1080515723300021478SoilAKLTAEKVRKLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0126371_1059339133300021560Tropical Forest SoilVGEGNRRAKLTSGKVRELRARRADGLTYRQLAAEFGISDVTASAAVNRKTWAHVN
Ga0126371_1117403913300021560Tropical Forest SoilNRRAKLTTAKVRQLRARRAGGLTYRQLAAEFGISDSSACAAVNRKTWAHVN
Ga0126371_1254910213300021560Tropical Forest SoilKLTDNKVRQLRARRADGVTYRQLAAEFGISDVSAWAAVNGRTWAHVS
Ga0207642_1116279223300025899Miscanthus RhizosphereKLRARRADGLTYRQLAGEFGISDVSAHAAVNGRTWAHVA
Ga0207663_1054578623300025916Corn, Switchgrass And Miscanthus RhizosphereEKVRRLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0207652_1183002213300025921Corn RhizosphereVRKLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0207709_1073625913300025935Miscanthus RhizosphereRRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0207678_1009258613300026067Corn RhizosphereKVRKLRARRADGLTYRQLAGELGISDVSAHAAVNRTTWAHVA
Ga0209648_1020532033300026551Grasslands SoilARRVEGLTYRQLAGEFGISDASACAAVNRRTWAHVA
Ga0209040_1008720713300027824Bog Forest SoilAKLTNDKVRQLRARRADGLTYRQLAAESGISDVSAWAAVNGKTWAHVS
Ga0209166_1051478813300027857Surface SoilRRAQGLTFRQLAAEFGISDVSAWAAVNGKTWRHVT
Ga0318571_1015840623300031549SoilKLTAEKVRQLRARRADGLTYQQLAGEFGISDVSAHAAVNRRTWAHIA
Ga0318573_1046397823300031564SoilAKVKQLRARRADGLTYRQLAAEFGISDVTAYAAVNRKTWAHVR
Ga0310915_1126991723300031573SoilARRAEGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318542_1033869123300031668SoilARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0318561_1081613713300031679SoilGKVRELRARRAAGLTYRQLAAEFGISDASACAAVNRKTWAHVS
Ga0318574_1034625423300031680SoilVRQLRARRAEGLTYRQLATKFGISDVSACAAVNRKTWAHVT
Ga0318560_1056451713300031682SoilRKLRARRAAGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318496_1003230243300031713SoilAKLTSRKVRKLRARRAAGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318493_1090123013300031723SoilELRTRRAEGVTYRKLAAEFSISDVSACSAVNRKTWAHVV
Ga0318500_1069100523300031724SoilVRQLRARRADGLTYRQLAAEFGISDVSAWAAVNGKTWTHVS
Ga0306918_1024022023300031744SoilKLTNAKVRQLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0318502_1028587713300031747SoilRKVRRLRARRADGLTYRQLAAEFGISDVSACAAVNRKTWAHVS
Ga0318502_1047635813300031747SoilTAEKVRQLRARRSDGLTYQQLAGEFGISDVSACAVVNRRTWAHVA
Ga0318502_1069333423300031747SoilRKVRKLRARRAAGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318494_1007912833300031751SoilRRAAGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318526_1038843313300031769SoilLTSRKVRKLRARRAAGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318546_1002810453300031771SoilKVRQLRARRADGLTYRQLAAEFGISDVSAWAAVNGKTWTHVS
Ga0318546_1007407413300031771SoilQVRQLRARRADGLTYRQLAAEFGISDMTARAAANRKTWAHVN
Ga0318546_1023241533300031771SoilRRAEGLTYRQLAAEFGISDVTARAAVQRTTWAHVS
Ga0318543_1047993713300031777SoilVRELRARRAAGLTYRQLAAEFGISDASACAAVNRKTWAHVS
Ga0318566_1016187013300031779SoilLSDGKVRKLRARRADGLTYKQLAAEFGISDVSARAAVNRKTWTHVT
Ga0318547_1009880013300031781SoilRAKLTPGKVRKLRARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA
Ga0318547_1010874233300031781SoilVRQLRARRADGATYRQLAAEFGISDMTARAAANRKTWAHVT
Ga0318547_1077559313300031781SoilVRQLRARRAAGLTYRQLAAEFGISDLSACAVVNRKTWAHVI
Ga0318552_1015204323300031782SoilKLTSGQVRQLRARRADGLTYRQLAAEFGISDMTARAAANRKTWAHVT
Ga0318550_1052251823300031797SoilAKLTNGKVRQLRARRADGLTYRQLAAEFGISDVTARAAVNRKTWTHVT
Ga0318523_1001016243300031798SoilAKLTNAKVRQLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0318565_1030750613300031799SoilTAGKVRELRARRADGLTYQQLATEFGISDVSAWAAVNGKTWAHVN
Ga0318565_1034523723300031799SoilLTPGKVRKLRARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA
Ga0318499_1009609123300031832SoilEKVRQLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0318499_1014381023300031832SoilTSGRVRELRARRAAGLTYRQLAAEFGISDASARAAANRKTWAHVS
Ga0318517_1034853723300031835SoilRRAGGLTYRQPAAEFGISDVSACAAVNRKTWAQVP
Ga0318511_1038750613300031845SoilKVRQLRARRADGLTYRQLAAEFGISDVTARAAVNRKTWTHVT
Ga0318527_1004911913300031859SoilRKVRELRARRAEGLTYRQLAAEFGISDASACAAVNRKTWAHVP
Ga0318495_1019827333300031860SoilKVRELRARRAEGLTYRQLAAEFGISDASACAAVNRKTWAHVP
Ga0318522_1005759433300031894SoilPGKVRKLRARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA
Ga0318522_1029758123300031894SoilKLTGRKVRELRARRAEGLTYRQLAAEFGISDASACAAVNRKTWAHVP
Ga0318551_1021233523300031896SoilIGQGNPRAILTDAKVRLLRARRAQGLTYRQLAAEFGISDVSAWAAANGRTWRHVA
Ga0306923_1237422523300031910SoilKLTADKVRQLRARRADGLTYRQLAGEFGISDVSACAVVNRRTWAHVA
Ga0306921_1274283013300031912SoilKVRQLRARRADGLTYRQLAAEFGVSDVSACAAVNRKTWAHVN
Ga0310912_1019077813300031941SoilRARRADGLTYRQLAAEFGVSDVSACAAVNRKTWAHVN
Ga0310912_1031292913300031941SoilRARRADGLTYRQLAAEFGISDVAACAAVNRKTWAHVT
Ga0310912_1088650513300031941SoilTNAKVRQLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0310910_1009074513300031946SoilVRQLRARRADGLTYRQLAAEFGISDMTARAAANRKTWAHVT
Ga0306926_1040742423300031954SoilRELRARRAAGLTYRQLAAEFGISDASARAAANRKTWAHVS
Ga0306926_1142085823300031954SoilRAKLTSGKVRVLRARRTAGLTYRQLAAEFGISDASARAAVNRKTWAHVS
Ga0306922_1090845513300032001SoilARRAAGLTYRQLAAEFGISDASACAAVNRKTWAHVS
Ga0318562_1052587923300032008SoilQLRARRADGATYRQLAAEFGISDMTARAAANRKTWAHVT
Ga0318507_1027694713300032025SoilTDDKVRQLRARRADGLTYRQLAAEFGISDVSAWAAVNGKTWTHVS
Ga0318556_1023408013300032043SoilQLRARRADGLTYRQLAAEFGISDMSACAAVNRKTWAHVS
Ga0318558_1007634313300032044SoilVRELRARRAEGLTYRQLAAEFGISDASACAAVNRKTWAHVP
Ga0318558_1070234513300032044SoilARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA
Ga0306924_1096178033300032076SoilVRQLRARRADGLTYRQLAAEFGISDTSACAAVNRKTWAHVR
Ga0318525_1010155513300032089SoilTPGKVRKLRARRADGLTYKQLADEFGISDVSAHAAVNRRTWAHVA
Ga0318525_1038976123300032089SoilAEKVRQLRARRADGLTYQQLASEFGISDVSAHAAVNRRTWAHIA
Ga0318525_1053929013300032089SoilEKVRQLRARRADGLTYRQLAGEFGISDVSAHAAVNRRTWAHVG
Ga0307471_10322548413300032180Hardwood Forest SoilVRRLRARRADGLTYRQLAGEFGISDVSAHAAVNRTTWAHVA
Ga0306920_10318733123300032261SoilARRADGLTYRQLASEFGISDVSACAAVNRRTWAHIT
Ga0335080_1162024713300032828SoilTSGKVRQLRARRADGLTYRQLAAEFGISDTSACAAVNGKTWAHVS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.