NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F077625



Overview

Basic Information
Family ID: F077625
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 117
Average Sequence Length: 42 residues
Representative Sequence: GATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Number of Associated Samples: 104
Number of Associated Scaffolds: 117

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 0.87 %
% of genes near scaffold ends (potentially truncated): 95.73 %
% of genes from short scaffolds (< 2000 bps): 93.16 %
Associated GOLD sequencing projects: 101
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.46


Most Common Taxonomy
Group: Bacteria (69.231 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (25.641 % of family members)
Environment Ontology (ENVO): Unclassified (28.205 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (58.120 % of family members)
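The percentages in this overview can be turned back into approximate member counts using the family size of 117 sequences. The following is a minimal sketch, not part of NMPFamsDB: the helper name `percent_to_count` is hypothetical, and it assumes each percentage is taken over all 117 family members (the page says "of family members" only for the taxonomy and ecosystem figures; for the gene-level quality metrics this is an assumption).

```python
def percent_to_count(percent: float, total: int) -> int:
    """Convert a percentage of `total` into the nearest whole count."""
    return round(percent / 100 * total)


FAMILY_SIZE = 117  # "Number of Sequences" reported in the overview

# Percentages copied from the overview; the short keys are illustrative labels.
reported = {
    "valid_rbs": 0.87,
    "near_scaffold_end": 95.73,
    "short_scaffold": 93.16,
    "bacteria": 69.231,
    "gold_forest_soil": 25.641,
    "envo_unclassified": 28.205,
    "empo_soil_non_saline": 58.120,
}

counts = {label: percent_to_count(p, FAMILY_SIZE) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE}")
```

Rounding to the nearest integer is a deliberate choice here, since the page reports percentages to at most three decimals and the underlying counts must be whole numbers.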


Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 0.00%, β-sheet 0.00%, Coil/Unstructured 100.00%

Predicted 3D Structure

pTM-score: 0.46

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
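The note above applies a simple cutoff: a model with pTM below 0.7 is treated as low confidence. A minimal sketch of that rule follows; the function and constant names are hypothetical, and only the 0.7 cutoff and the 0.46 score come from this page.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < cutoff


family_ptm = 0.46  # pTM-score reported for family F077625
print(is_low_confidence(family_ptm))
```

The comparison is strict, matching the "pTM < 0.7" wording, so a score of exactly 0.7 would not be flagged.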



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart: NCBI taxonomy breakdown of family members. Labels: All Organisms, Unclassified. Values: 69.2%, 30.8%.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart: distribution of family members across habitat types. Labels: Bog Forest Soil, Iron-Sulfur Acid Spring, Watersheds, Soil, Vadose Zone Soil, Terrestrial Soil, Tropical Forest Soil, Grasslands Soil, Terrestrial, Agricultural Soil, Forest Soil, Hardwood Forest Soil, Corn, Switchgrass And Miscanthus Rhizosphere, Avena Fatua Rhizosphere, Switchgrass Rhizosphere, Populus Rhizosphere, Attine Ant Fungus Gardens. Values shown without labels: 8.5%, 3.4%, 6.0%, 25.6%, 3.4%, 14.5%, 3.4%, 9.4%.]



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10216J12902_1010541651 | 3300000956 | Soil | EYTFSPRGATYQREEQRLNLHRVSHLRLTIVPNKNGSGTATLTALRLFA*
JGI12627J18819_100078323 | 3300001867 | Forest Soil | LVQEYNFNPRGATFQREELRFNRLQASRLRFTIVPNKNGSGTATITALRLFA*
JGIcombinedJ26739_1005770571 | 3300002245 | Forest Soil | SPRGATYQREEQRLDLDRVTHLRLAIVPNKNGSGTATLTALHLFA*
JGIcombinedJ51221_101807521 | 3300003505 | Forest Soil | TYQREKQRFDLHGVTHLRLTIVPNKSGSGTATLTALRLFA*
Ga0058897_110151722 | 3300004139 | Forest Soil | QDYTFSPRGATYQREEQSFTRVLATHLRLTIVPNKNGHGPATLTALRLFA*
Ga0066675_103446093 | 3300005187 | Soil | TYQREDQRVNLRQVTHLRLTIVPNKSGSGTATLTTLRLFA*
Ga0070714_1024959771 | 3300005435 | Agricultural Soil | GGATYQREEQRFNLRQVTHLRLTIVSNKRGSGTATLTALRLFG*
Ga0066707_100013051 | 3300005556 | Soil | QHEELRLELPAITHLSLTIVPNKSGSGIASLTALRLFA*
Ga0070762_106484941 | 3300005602 | Soil | REEQRFNLRQVTHLRLTIVPNKSGSGTASLTALRLFA*
Ga0066903_1045611021 | 3300005764 | Tropical Forest Soil | RQEQRLNLSQVSRLRLTIVPNKNGSGTATLTTFRLYA*
Ga0068860_1024687192 | 3300005843 | Switchgrass Rhizosphere | GSTFQREDLSFNLPKVTHVRLTIVPNKGGTGTASLTSLRLFS*
Ga0066696_100772701 | 3300006032 | Soil | AIFQHEELRLELPAITHLSLTIVPNKSGSGIASLTALRLFA*
Ga0075014_1006196912 | 3300006174 | Watersheds | TYQREEQRFNLRQVTHLRLTFVPNKSGSGTAPLTALRLFA*
Ga0070712_1006263604 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | YQREEQRFDLHRVTHLRLTIVPNKNNSGTATLTALRLFA*
Ga0079222_109727291 | 3300006755 | Agricultural Soil | SPRGATYQREEQRLNLRHVSRLRLTIVPNKNGSGTATLTALRLFA*
Ga0066665_102139431 | 3300006796 | Soil | SPGGATYQREEQRLNLLQASHLRLTIVPNKNGSGTATLTALGLFA*
Ga0066659_107762521 | 3300006797 | Soil | QRFNLHGVTHLRLTIVPNKNGPGTVTLTALRLFV*
Ga0079220_112319961 | 3300006806 | Agricultural Soil | ATFQREQQRFNLRRATHLLLTIVPNKSGSGVATLTSLHLFA*
Ga0079215_100705911 | 3300006894 | Agricultural Soil | GATYQREEQRLNLHQVSHLRLTIVPNKSGSGTATLTSLRLFG*
Ga0099792_102042223 | 3300009143 | Vadose Zone Soil | YQREEQRFNLRQVTHLRLTIVPNKNGAGTATITAIRLFA*
Ga0099792_109130671 | 3300009143 | Vadose Zone Soil | PGGATYQHEEQRFNLLQVTHLRFTIVPNKSGSGTATLTALRLFA*
Ga0114129_131858911 | 3300009147 | Populus Rhizosphere | TFSPRGATYQREEQRLNLHQVSHLRLTIVPNKHGSGTATLTALRLFA*
Ga0126384_101372501 | 3300010046 | Tropical Forest Soil | ATYQREEQRFNVRQVTHLHLTIVPNKSGSGTATLTALRLFRLGA*
Ga0134062_106845051 | 3300010337 | Grasslands Soil | QQYTFSPQGAIFQHEELRLELPAITHLSLTIVPNKSGSGVATLTALRLFA*
Ga0126378_101249181 | 3300010361 | Tropical Forest Soil | EQRFNLRQVTHLRLTIVPNKSGSGTATLISLRLFA*
Ga0126381_1031562751 | 3300010376 | Tropical Forest Soil | NFSPGGATYQREEQRFNVRQVTHLHLTIVPNKSGSGTATLTALRLFRLGA*
Ga0134123_110964191 | 3300010403 | Terrestrial Soil | QEQRFDLRRVTHLRLVIVPNKSGSGTATLTALRLFA*
Ga0124844_13138052 | 3300010868 | Tropical Forest Soil | DQRFKLYQVSHLRLTIVPNKNGSGAASLTALRLFA*
Ga0150983_132109621 | 3300011120 | Forest Soil | REEQRFNLRQVNHLRLTIVPNKSGSGTATLTALRLFA*
Ga0120192_101090842 | 3300012021 | Terrestrial | ATYQREEQRLNLHQVSHLRLTIVPNKSGSGTATLTSLRLFA*
Ga0120191_101713011 | 3300012022 | Terrestrial | TSPPRGAPYQREEQRLNVHQVSHLRLTIVPNKNGSGTATLTALRLFA*
Ga0153990_10198261 | 3300012169 | Attine Ant Fungus Gardens | GATYQREEQRFNLHQVSHLRFTIVPNKNGSGTATLTALRLFA*
Ga0137399_107980302 | 3300012203 | Vadose Zone Soil | ATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALLLFA*
Ga0137381_113546141 | 3300012207 | Vadose Zone Soil | FSPGGATYQHEEQRFNLLQVTHLRFTIVPNKNGSGTATLTALRLFA*
Ga0137376_102107843 | 3300012208 | Vadose Zone Soil | FSPRGATFQREEQRFNLHGVTHLRFTIVPNKSGSGAASLTALRVFA*
Ga0150985_1163690701 | 3300012212 | Avena Fatua Rhizosphere | REDLRLALQGVTHLRLVIIPHLRGSGTATLTCLELFA*
Ga0137396_107124362 | 3300012918 | Vadose Zone Soil | AQEYNLTPGGATYQRGSQRCKLIQFTHPLLTIVPNKSGSGTATLTALRLFA*
Ga0137419_106081143 | 3300012925 | Vadose Zone Soil | LSPGGATYQHEEQRFNLLQVTHLRFTIVPNKNGSGTATLTALRLFA*
Ga0164307_110886771 | 3300012987 | Soil | ATYQREEQRFDLHRVTHLRLTIVPNKNNSGTATLTALRLFA*
Ga0182036_112231811 | 3300016270 | Soil | RGATYQREELRFNLLQVSRLRLTVVPNKNGSGTATLTTLRLFA
Ga0182041_105611731 | 3300016294 | Soil | EQRLNLHQVSHLRLTIVPNKNGSGTATLTSLRLFA
Ga0182041_111101282 | 3300016294 | Soil | REEQRFNLRQVTHLRLTIVPNKSGSGTATLTALSLFA
Ga0182032_110484202 | 3300016357 | Soil | QREEQRFNLSRVSQLRLTIVPNKNGSGTATLTTLRLYA
Ga0182032_119300502 | 3300016357 | Soil | TYQREEQRLDLDRVTHLRLTIVPNKNGSGTATLTALRLFA
Ga0182034_103296712 | 3300016371 | Soil | EYNFSPRGATFQREEQRFNLHGVTHLRLTIVPNKNGSGTASLVPIHKE
Ga0182040_100457351 | 3300016387 | Soil | FSPGGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALSLFA
Ga0182040_107423592 | 3300016387 | Soil | REDQRFNLPRASRLRLTIVPNKNGSGTATLTLLRLFA
Ga0182039_104741442 | 3300016422 | Soil | GATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALSLFA
Ga0066667_111551832 | 3300018433 | Grasslands Soil | FSPGGATYQREDQRVNLRQVTHLRLTIVPNKSGSGTATLTTLRLFA
Ga0066662_128189411 | 3300018468 | Grasslands Soil | EEQRFDLHGVTHLRLTIVPNKNGSGTAILTALRLFA
Ga0193713_12021542 | 3300019882 | Soil | AIFQHEELRLELPAVTHLSLIIVPNKSGSGVATLTALRLFA
Ga0210403_103003582 | 3300020580 | Soil | MEDVVVLIDRPRGATYQREEQRFNLPQVTHLRLTIVPNKSGSGTVTLTRLRLFA
Ga0179596_102539511 | 3300021086 | Vadose Zone Soil | GGATYQREEQRFNLRQVTHLRLTLVPNKSGSGTASLTALRLFA
Ga0210406_100877638 | 3300021168 | Soil | GGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFG
Ga0210406_105162391 | 3300021168 | Soil | MKDLYQREELRFNLQGVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0210408_103037021 | 3300021178 | Soil | NFSPGGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0210408_105270233 | 3300021178 | Soil | PGGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFG
Ga0210408_111266091 | 3300021178 | Soil | NFSPGGATYQREEQRFNLRQVNHLRLTIVPNKSGSGTATLTALRLFA
Ga0210385_103254861 | 3300021402 | Soil | FQHEELRLELPAITHLSLTIVPNKSGSGVATLTALRLFN
Ga0210397_108095651 | 3300021403 | Soil | ATYQREEQSFTRVLATHLRLTIVPNKNGHGPATLTALRLFP
Ga0210387_109789632 | 3300021405 | Soil | YNFRPGGATYQREELRFNLRGVTHLRLTIVPHKSGSGTATLTALRLFA
Ga0210394_110581771 | 3300021420 | Soil | QREKQRLDLHRVNHLRLTIVPNKNGSGTATLTALRLLA
Ga0210409_104263593 | 3300021559 | Soil | ATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFG
Ga0210409_107527482 | 3300021559 | Soil | RGATYQREEQRLDLDWVTHLRLIIVPNKNGSSTATLTALRLFA
Ga0126371_111522012 | 3300021560 | Tropical Forest Soil | GATYQREEQRFNLRQVTHLRLTIVPNRGGSGTATLTALRLFA
Ga0212123_108511701 | 3300022557 | Iron-Sulfur Acid Spring | QREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0207692_102222001 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | EQRLNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0207700_108314703 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | ATYQREEQRFDLHRVTHLRLTIVPNKNNSGTATLTALRLFA
Ga0257152_10269741 | 3300026369 | Soil | PGGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALLLFA
Ga0209806_11215871 | 3300026529 | Soil | EELRLELPAITHLSSAIVPNKSGSGIASLTALRLFA
Ga0179593_11068834 | 3300026555 | Vadose Zone Soil | FSPGGATYQREEQRVNLRQVTHLRLTIVPNKSGSGTATLTTLRLFA
Ga0208732_10165382 | 3300026984 | Forest Soil | LVQEYTFSPRGATYQREEQRFNLHQVSHLRFTIVPNKNGSGTATLTALRLFA
Ga0207944_10090932 | 3300027105 | Forest Soil | TYQREEQRLDLDRVTHLRLTIVPNKNGSGTATLTALRLSA
Ga0208097_10227131 | 3300027173 | Forest Soil | EQRFNLHQVSHLRFTIVPNKNGSGTATLTALRLFA
Ga0208097_10297201 | 3300027173 | Forest Soil | QREEQRFNLRQVSHLRLSIVPNKNGSGTSTLTALRLFA
Ga0209731_10484332 | 3300027326 | Forest Soil | GGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0209217_12162742 | 3300027651 | Forest Soil | SPRGATYQREEQRLDLDRVTHLRLAIVPNKNGSGTATLTALHLFA
Ga0209448_100424403 | 3300027783 | Bog Forest Soil | LVQEYTFSPGGGYQREEQRFNLLQASRLRLTIVPNKDGSGMATLTALRLFA
Ga0209693_105022242 | 3300027855 | Soil | QREEQRFNLRQVNHLRLTIVPNKSGSGTATLTALRLFA
Ga0209590_108368151 | 3300027882 | Vadose Zone Soil | FNPGGATYQREEQRFNLRQVTHLRLTIVPNKSGSGTAMLTALRLFA
Ga0209380_104173251 | 3300027889 | Soil | YQREEQRFNLRQVNHLRLTIVPNKSGSGTATLTALRLFA
Ga0207428_110653401 | 3300027907 | Populus Rhizosphere | TYQREEQRLNLHQVSHLRLTIVPNKNGSGTATLTALRLFA
Ga0075386_121271031 | 3300030916 | Soil | GATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0170822_114263603 | 3300031122 | Forest Soil | EEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRFFA
Ga0170823_154564151 | 3300031128 | Forest Soil | YNFSPGGATYQREEQRFNLRQVTRLRLTIVPNRSGSGTATLTALCLFA
Ga0170824_1238826631 | 3300031231 | Forest Soil | QREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFG
Ga0170820_176208541 | 3300031446 | Forest Soil | ATYQGEEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFG
Ga0318515_104427291 | 3300031572 | Soil | ELRFSRLQASCLRLTIVPNKNGSGTATLTTLRLFA
Ga0310915_111973633 | 3300031573 | Soil | EQRFHLRQVSHLRLTIVPNKNGSGTATLTALRLFA
Ga0307474_108732622 | 3300031718 | Hardwood Forest Soil | GGATYQREDQRFNLHQISHLRFTISPNKSGSGTATLTALRHFA
Ga0318493_100746803 | 3300031723 | Soil | YQREDQRFNLHQISHLHFTISPNKGGSGTATLTALRLFA
Ga0306918_109648631 | 3300031744 | Soil | QREDQRFDLRQVTHLRLTIVPNKSGSGPATLTALRLFA
Ga0307477_107894351 | 3300031753 | Hardwood Forest Soil | YQREEQRFDLHGVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0318547_100072467 | 3300031781 | Soil | QREDLRFNLLQVNRLRLTIVPNKNGSGNATLTTLRLFA
Ga0318497_100648224 | 3300031805 | Soil | HQLEDLRFNLLQVHRLRLTIVPNKNGSGNATLTTLRLFA
Ga0318568_109879161 | 3300031819 | Soil | EYTFSPRGATYQREEQRFNLSQVSQLRLTIVPNKNGSGTATLTTLRLYA
Ga0310917_101892552 | 3300031833 | Soil | TYQREEQRFNLSRVSHLRLTIVPNKNGSGTATLTTLRLYA
Ga0310900_102227043 | 3300031908 | Soil | WRQVWYPRGATYQREEQGLNLHQVSHLRLTIVLNKNDSGTATLTALRLFA
Ga0306923_111743541 | 3300031910 | Soil | PAGATYQREEQRLNFHQASHLRLTIVPNKNGSGTATLTSLRLFA
Ga0306921_122586992 | 3300031912 | Soil | EDQRFNLPRASRLRLTIVPNKNGSGTATLTLLRLFA
Ga0310912_103480931 | 3300031941 | Soil | GATYQREEQRFNLNRVSQLRLTIVPNKNGSGTATLTTLRLYA
Ga0310916_103259081 | 3300031942 | Soil | ATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALLLFA
Ga0306926_107014991 | 3300031954 | Soil | PGGATYQREEHRFNLRQVTHLRLTIVPNKSGSGTATLTALSLFA
Ga0318531_102612661 | 3300031981 | Soil | FSPRGATYQREELRFDRLQTSRLRLTIVPNKSGSGTATLTTLRLFA
Ga0306922_105371931 | 3300032001 | Soil | RGATYQREEQRVNLRNVTHLRLTIVPNKNGSGTATLTAFRLYA
Ga0318556_100208265 | 3300032043 | Soil | QILVQGYNFSPAGATHQREDLRFNLLQVNRLRLTIVPNKNGSGNATLTTLRLFA
Ga0318556_103756211 | 3300032043 | Soil | GGATYQREEQRFNLRQVSHLRLTIVPNKNGSGTATLTALRLCA
Ga0318570_105483072 | 3300032054 | Soil | ATYHREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA
Ga0318577_102834162 | 3300032091 | Soil | FSPGGATYQREDQRFNLPRASRLRLTIVPNKNGSGTAPLTALRLFA
Ga0318577_103735631 | 3300032091 | Soil | FSPGGATYQREEQRLNLRQVTHLRLTIVPNKSGSGTATLTALLLFA
Ga0318540_103115322 | 3300032094 | Soil | REEQRFNLSRVSQLRLTIVPNKNGSGTATLTTLRLYA
Ga0307471_1021295992 | 3300032180 | Hardwood Forest Soil | VQEYTFSPQGAMFQHEELRLELPAVTHLSLIIVPNKSGSGVATLTALRLFA
Ga0307472_1012342291 | 3300032205 | Hardwood Forest Soil | EQRLDLDRVTHLRLTIVPNKNGSDTATLTALRLFA
Ga0306920_1008222171 | 3300032261 | Soil | FNPGGATYQREEQHFNLRQVSHLCLTIAPNKNGSGTATLTTFRLYA
Ga0306920_1016620041 | 3300032261 | Soil | RGATYQHEEQRFNLRRVSHLRLTIVPNKNGSGTATLTALRLYA
Ga0306920_1021505772 | 3300032261 | Soil | HGATFQREEQRVNLHRVTHLRLTIVPNKNGSGTASLTALRLFA
Ga0335080_117422531 | 3300032828 | Soil | QGATFQHEDLRLDLPPITHLRLTIVPNKDGSGEATLTSLRLFA
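The overview reports an average sequence length of 42 residues, and some sequences in the table end with `*`, which conventionally marks a translation stop in protein sequence output. The sketch below measures lengths for a few sequences copied from the table after dropping that marker. The helper names are hypothetical and the three-sequence sample is for illustration only; it is not the database's own calculation.

```python
from statistics import mean


def clean(seq: str) -> str:
    """Strip surrounding whitespace and any trailing '*' marker."""
    return seq.strip().rstrip("*")


def lengths(seqs):
    """Residue count of each sequence after cleaning."""
    return [len(clean(s)) for s in seqs]


def mean_length(seqs):
    """Average residue count over the given sequences."""
    return mean(lengths(seqs))


# Three sequences copied from the table above (illustrative subset).
sample = [
    "GATYQREEQRFNLRQVTHLRLTIVPNKSGSGTATLTALRLFA",  # same as the representative sequence
    "REEQRFNLRQVTHLRLTIVPNKSGSGTASLTALRLFA*",      # a row carrying the trailing '*'
    "EQRLNLRQVTHLRLTIVPNKSGSGTATLTALRLFA",         # a shorter fragment
]

print(lengths(sample))
print(mean_length(sample))
```

Running the same functions over all 117 sequences would be the way to compare against the reported family average; the small sample here only shows the mechanics.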




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice