NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F079687



Overview

Basic Information
Family ID F079687
Family Type Metagenome / Metatranscriptome
Number of Sequences 115
Average Sequence Length 42 residues
Representative Sequence ALARLVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG
Number of Associated Samples 100
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.13 %
% of genes from short scaffolds (< 2000 bps) 92.17 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.69

Hidden Markov Model

Most Common Taxonomy
Group Bacteria (91.304 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (33.043 % of family members)
Environment Ontology (ENVO) Unclassified (44.348 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (46.957 % of family members)
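
The percentages in this overview appear to be fractions of the family's 115 sequences. The page does not state the denominator, so that is an assumption; under it, the underlying counts can be recovered with a short sketch. The dictionary labels below are shorthand chosen for this example, not field names used by NMPFamsDB.

```python
FAMILY_SIZE = 115  # "Number of Sequences" from Basic Information

# Percentages as reported on the page (labels are illustrative shorthand).
REPORTED = {
    "genes near scaffold ends": 99.13,
    "genes from short scaffolds": 92.17,
    "taxonomy: Bacteria": 91.304,
    "GOLD: Forest Soil / Soil": 33.043,
    "ENVO: Unclassified": 44.348,
    "EMPO: Soil (non-saline)": 46.957,
}


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of sequences that a reported percentage implies."""
    return round(percent / 100 * total)


for label, pct in REPORTED.items():
    n = implied_count(pct)
    # Recompute the percentage from the integer count as a consistency check.
    print(f"{label}: {pct} % ~ {n}/{FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.3f} %)")
```

If the recomputed percentage matches the reported one to the printed precision, the assumed denominator is at least consistent with the page.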




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment viewer (MSAViewer) not reproduced. It shows 115 rows, numbered 1 to 115 and labelled by protein ID in the same order as the Family Sequences table, with a column ruler running from 2 to 66.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 23.19%, β-sheet: 13.04%, Coil/Unstructured: 63.77%

[Feature Viewer not reproduced. Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, drawn along the representative sequence ALARLVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG with position ticks every 5 residues from 5 to 40.]

Predicted 3D Structure

[Structure viewer (PDBe Molstar) not reproduced.]
Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.69

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
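
As a minimal sketch of the rule in the note above, the check below flags a model as low confidence when its pTM-score is under 0.7, and maps a per-residue pLDDT value to the legend bins listed for the structure viewer. The function names are hypothetical, and treating each bin's upper bound as inclusive for non-integer scores is an assumption; the page does not say how NMPFamsDB implements either step.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the "Low Quality Model" note

# Legend bins from the structure viewer; inclusive upper bounds are assumed.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """True when a model's pTM-score falls strictly below the cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to its legend bin."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: the last bin ends at 100")


print(is_low_confidence(0.69))  # pTM-score reported for family F079687
print(plddt_bin(83.5))          # hypothetical residue score, for illustration
```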



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Pie chart of NCBI taxonomy not reproduced. Legend entries: All Organisms, Unclassified. Slice labels: 91.3%, 8.7%.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Pie chart of associated habitat types not reproduced. Legend entries, with repeats collapsed: Freshwater Sediment; Iron-Sulfur Acid Spring; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Peatlands Soil; Agricultural Soil; Forest Soil; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Palsa; Switchgrass Rhizosphere; Populus Rhizosphere; Miscanthus Rhizosphere; Boreal Forest Soil. Visible slice labels: 6.1%, 4.3%, 33.0%, 15.7%, 4.3%.]



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0063454_1014295752 | 3300004081 | Soil | AGELREALACAIAARAREVALATICGKSAVEVAIVDRQGGFLARVGA*
Ga0066675_108320652 | 3300005187 | Soil | LARAVAGRAREVALATLCGKTTVELAVVDRQGGFVARVGG*
Ga0070667_1015977951 | 3300005367 | Switchgrass Rhizosphere | AVARQAREVALATLSGDTAVEVAIVNRDGRLLALVGGAA*
Ga0070708_1007265772 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LAAAVARQAREVALATLSGKTAVEVAIVGRAGDFLARVGAEA*
Ga0066692_107745832 | 3300005555 | Soil | ARQAREVALATLSGKTAVEVAIVGRAGDFLARVGVEA*
Ga0066654_109197152 | 3300005587 | Soil | RAREVALATLSGATAVEVAIVDRSREFLARVGAQG*
Ga0068864_1010276961 | 3300005618 | Switchgrass Rhizosphere | AEAVARQAREVALATLSGDTAVEVAIVNRDGRLLALVGGAA*
Ga0066903_1013332451 | 3300005764 | Tropical Forest Soil | EVALATLSGAIAVEVAIVGRSGEFLARVGAEAPP*
Ga0066903_1024029451 | 3300005764 | Tropical Forest Soil | TVAAQAREVALATLSGETAIEVAIIDRQGEIMARVGG*
Ga0073928_106898851 | 3300006893 | Iron-Sulfur Acid Spring | GELRKALACAVARRAREVALATLSGNTNVEVAVVDRRGGLLARVGS*
Ga0075436_1013190421 | 3300006914 | Populus Rhizosphere | AAQAREVALATLCGETAIEVAVVDRRGGIVARVGEWPRGVC*
Ga0079219_120289831 | 3300006954 | Agricultural Soil | AGRAREVALATICGKTAVEVAIVDRQGSFVARVGA*
Ga0079218_115505152 | 3300007004 | Agricultural Soil | DVSGEHRPALAEAVARQAREVALATLPGDTEVEVAIVDRDGCFLARVGERVA*
Ga0075435_1008563802 | 3300007076 | Populus Rhizosphere | LAAAVARQAREVALATLSGATAVEVAIVGRDGEFLARVGAAS*
Ga0099793_106654761 | 3300007258 | Vadose Zone Soil | GSKRGAFAGGVAVRAREVALATLCGKTAIEVAIVDRQGDFLARVGG*
Ga0126373_101560891 | 3300010048 | Tropical Forest Soil | AAGPLASVLAEGVAYRAREVALATLSGATAVEVAIVDRSGEFLARVGA*
Ga0126373_105681651 | 3300010048 | Tropical Forest Soil | ALARVVAARAREVALATISGEIAVEVAIVDRQGEIMARVGE*
Ga0126373_107588983 | 3300010048 | Tropical Forest Soil | ALAGARKEGLARLVAVRAREVALATLSGKTAVEVAIVDREGVFLARVGG*
Ga0134062_103312982 | 3300010337 | Grasslands Soil | AALAEAVARQAREVALATLSGATAVELAIVDRTGDFLARVGTES*
Ga0126381_1024709111 | 3300010376 | Tropical Forest Soil | LADGVARRAREVALATLSGAIAVEVAIVGRSGEFLARVGAEAPP*
Ga0126381_1050948101 | 3300010376 | Tropical Forest Soil | MAPILAEAVAYGAREVALATLSGATAVEVAIVDRSGEFLARVGA*
Ga0134124_122109762 | 3300010397 | Terrestrial Soil | EADSAGAMLDLAGERRDALAEAVARQARVVALATLSGATAVEVAIVGRDGEFLARVGATS
Ga0126383_127201711 | 3300010398 | Tropical Forest Soil | ALAGSRKEGLARLVAVRAREVALATLSGKTAVEVAIVDREGVFLARVGG*
Ga0126350_105303473 | 3300010880 | Boreal Forest Soil | REVALATLSGATAVEVAIVDRAGEFLARVGDEAPQ*
Ga0137376_110013012 | 3300012208 | Vadose Zone Soil | AIARRAREVALATLSGAIAVEVAIVDRSGDFLARVGAEA*
Ga0164302_115443301 | 3300012961 | Soil | AREVALATLSGATAVEVAIVGRDGEFLARVGAAS*
Ga0157374_127052092 | 3300013296 | Miscanthus Rhizosphere | RPALAAAVARQAREVALATLSGDIAVEVAIVDRGGAFLARVGGPPV*
Ga0182036_104388583 | 3300016270 | Soil | AAQAREVALATLCGKTAIEVAIVDRQGSFLARVGG
Ga0182036_119228762 | 3300016270 | Soil | AGSREGALARLVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG
Ga0182041_100972861 | 3300016294 | Soil | LAAAHRADLARLVAARAREVALATLSGKTAVEVAIVDRDAEFLARVGG
Ga0182035_111254081 | 3300016341 | Soil | VLALAEGREGALARLVAARAREVALATLSGKTAVEVAVVDRAGEFLARVGW
Ga0182034_102589383 | 3300016371 | Soil | AQRAALAGIVAARAREVALATLSGKTAVEVAIVDREAEFLARVGG
Ga0182037_108943662 | 3300016404 | Soil | AARAREVALATLSGKTAVELAIVDREGVFLALVCG
Ga0182037_109520922 | 3300016404 | Soil | SREGALARLVAARAREVALATLSGKTTLEVAIVDREGEFLARVGG
Ga0182039_109726111 | 3300016422 | Soil | EIRALASSRERALARLVAARAREVALAILSGKNTVEVAIVDREGEFLARVGG
Ga0182038_101937923 | 3300016445 | Soil | AEILALAGGREGALARLVAAGAREVALATLSDKTAVEVAIVDREGGFLARVGG
Ga0187817_106004052 | 3300017955 | Freshwater Sediment | DVAGRWRATLADNVARRARSVALATLSGATAVEVAIVDRGGNFLARVGA
Ga0187815_104200422 | 3300018001 | Freshwater Sediment | LAHGVARQAREVALATLSGKTAIDVAVIDRDGNLLGRAGW
Ga0066655_101929221 | 3300018431 | Grasslands Soil | QAREVALATLSGATAVEVAIVSRDGEFLGRVGAMR
Ga0066655_102638363 | 3300018431 | Grasslands Soil | EALAAAVARRAREVAMATLSGHTAVEVAIVDRGGGFLARIGA
Ga0210405_101618121 | 3300021171 | Soil | GDRSTALAGQVAARAREVALATLSGKTAVEVAIVDRAGDFLARVGAEA
Ga0210394_103196751 | 3300021420 | Soil | ARRAREVALATLSGATAVEIAIVDRGGEFLARVGA
Ga0210402_115307401 | 3300021478 | Soil | LAGGRRKALARLVAARAREMALATLSGRTEIEVAVVDRTGEFLARVGG
Ga0126371_116331602 | 3300021560 | Tropical Forest Soil | EIAGRARKIALRMLGGRTAVEVAIVDRQGEVLARGR
Ga0242655_102551351 | 3300022532 | Soil | IAGRAREVALATICGKTAVEVAIVDRQGSFLARVGG
Ga0224549_10169611 | 3300022840 | Soil | NVLADGVARRAREVALATLSGATEVEVAIVDRGGEFLSRVGG
Ga0207692_104647421 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | ARAIAGRAREVALATICGKTAVEVAIVDRQGGVLARVGT
Ga0207663_102752291 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | AAAVARQAREVALATLSGATAVEVAIVGRDGEFLARVGAAS
Ga0207676_109823271 | 3300026095 | Switchgrass Rhizosphere | PALAEAVARQAREVALATLSGDTAVEVAIVNRDGRLLALVGGAA
Ga0209265_12121481 | 3300026308 | Soil | TIAWRAREVALATICGKTAVEIAIVDRLGNFLARVGG
Ga0209159_11705792 | 3300026343 | Soil | ERREALAEAVGWQAREVALATLSGATAVEVAIVGRDGEFLGRVGAMT
Ga0257147_10511832 | 3300026475 | Soil | REALAHGIAAQAREVALATLCGKTAIEVAIIDRQGSFLARVGG
Ga0208981_10326163 | 3300027669 | Forest Soil | AAAIARQAREVALATLSGATAVEIAVVDRAGDFLARVGAEA
Ga0209810_12011392 | 3300027773 | Surface Soil | AREVALATLSGDTAVEVAIVDRGGALLARVGQPAEAAR
Ga0209167_105434782 | 3300027867 | Surface Soil | ALAHAVAGRAREVALATLAGGISVEVAVVDRDGGLLARAGG
Ga0209283_109558632 | 3300027875 | Vadose Zone Soil | YGTALAAAVARQAREVALATLCGDTLVEVAIVDRGGDFLARVGGP
Ga0209415_101536631 | 3300027905 | Peatlands Soil | ADGVARRARAVALATLSGATAVEVAIVDRGGDFLARAGAEVKR
Ga0209415_105343442 | 3300027905 | Peatlands Soil | LRTALARRVAEGCREVALATLAGGIAVDVAVFDREGSFLAGTEA
Ga0209415_105343452 | 3300027905 | Peatlands Soil | LRTALARRVAEGCREVALATLAGGIAVDVAVFDREGSLLAGTEA
Ga0307305_103964681 | 3300028807 | Soil | VGWQAREVALATLSGATAVEVVIVGRNGEFLGRVGAIT
Ga0307482_12896401 | 3300030730 | Hardwood Forest Soil | RAIAGRAREVALATISGKTAVEVAIVDRQGTFLACVGM
Ga0302308_106387002 | 3300031027 | Palsa | ARRAREVALATLSGATEVEVAIVDRGGEFLSRVGG
Ga0170824_1007767721 | 3300031231 | Forest Soil | VAARAREMALATLSGRTEIEVAVVDRTGEFLARVGG
Ga0302325_122120361 | 3300031234 | Palsa | GTWAPALADGVARRAREVALATLSGTVAVEVAIVDRDGDFLARVGT
Ga0302326_134321891 | 3300031525 | Palsa | WAPALADGVARRAREVALATLSGTVAVEVAIVDRDGDFLARVGT
Ga0318573_100515781 | 3300031564 | Soil | LAGGRREPLAAMVAARAREVALATLCGKTAVEVAIVDRQGGFLARVGG
Ga0318515_105266251 | 3300031572 | Soil | EALASLVAARAREVALATLSGRTAVEVAIVDREAEFLARVGG
Ga0310915_101920973 | 3300031573 | Soil | VAARAREVALAALSGETAVEVAIVDREGEFLARVGG
Ga0318555_101302641 | 3300031640 | Soil | GAAEILALAGDRRGALARAVAVRAREVALATLSGRTVVEIAIVDRDGGFLARVDE
Ga0318561_100481061 | 3300031679 | Soil | GALARAVAVRAREVALATLSGRTVVEIAIVDRDGGFLARVDE
Ga0318561_100948123 | 3300031679 | Soil | LARLVAGGAREVALATLSGKTAVEVAIVDRQGCFLARVGG
Ga0318572_108313861 | 3300031681 | Soil | DLARLVAARAREVALATLSGKTAVEVAIVDRDAEFLARVGG
Ga0318572_108671411 | 3300031681 | Soil | LVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG
Ga0318560_107034552 | 3300031682 | Soil | LARLVAARAREVAVATLSGKTTLEVAIVDRGGEFLARVGG
Ga0307476_109454072 | 3300031715 | Hardwood Forest Soil | LEAAGSWAPALADGVARRAREVALATLSGATAVEIAIVDRGGEFLARVGAEGQP
Ga0306917_102218953 | 3300031719 | Soil | LALAATRGEALASLVAARAREVALATLSGRTAVEVAIVDREAEFLARVGG
Ga0318501_105600541 | 3300031736 | Soil | RKRHLARLVAGGAREVALATLSGKTAVEVAIVDRQGCFLARVGG
Ga0307477_108511911 | 3300031753 | Hardwood Forest Soil | LAGGRRPALARDVARRAREVALATLGGGTAIEVAIIERGGDLLALVGE
Ga0307475_103838723 | 3300031754 | Hardwood Forest Soil | ALAGSREGALARLVAVRAREVALATLSGKTAVEVTIVDREGEFLARVGG
Ga0318535_100134301 | 3300031764 | Soil | GRREPLAAMVAARAREVALATLCGKTAVEVAIVDRQGGFLARVGG
Ga0318509_101296043 | 3300031768 | Soil | AGALARLVAARAREVALATLSGKTAVEVAIVDREGIFLARVGW
Ga0318546_103237971 | 3300031771 | Soil | ALAGSRERALARLVAARAREVALAALSGETAVEVAIVDREGEFLARVGG
Ga0318529_105885552 | 3300031792 | Soil | AAHRADLARLVAARAREVALATLSGKTAVEVAIVDRDAEFLARVGG
Ga0318548_101711471 | 3300031793 | Soil | TEGALARLVAAGAREVALATLSGKTAVEVAIVDRQGSFLARVGG
Ga0318503_102279221 | 3300031794 | Soil | ALARLVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG
Ga0318568_105860702 | 3300031819 | Soil | EALASVVAARAREVALATLSGRTAVEVAIVDREAEFLARVGG
Ga0318512_100681953 | 3300031846 | Soil | GTRREALARLVAGGAREVALATLSGKTAVEVAIVDRQGCFLARVGG
Ga0306919_100754671 | 3300031879 | Soil | ARVVAAQAREVALATLCGKTAIEVAIVDRQGSFLARVGG
Ga0306919_106822442 | 3300031879 | Soil | SRERALARLVAARAREVALAALSGETAVEVAIVDREGEFLARVGG
Ga0318544_101692251 | 3300031880 | Soil | VIAGQAREVALATISGKTAVEVAIVDRQGSFLARVGM
Ga0306925_104860841 | 3300031890 | Soil | QKEALAGLVAARAREVALATLSGKTTVEVAIVDREAEFLARVGG
Ga0318520_102068321 | 3300031897 | Soil | RLVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG
Ga0318520_102079783 | 3300031897 | Soil | EALAGLVAAGAREVALATLSGKTAVEVAIVDRQGSFLARVGG
Ga0306921_105081403 | 3300031912 | Soil | SGGKAAALARLVAARAREVALATLSGKTAVEVAIVDREGIFLARVGW
Ga0308174_111429262 | 3300031939 | Soil | EAVARQAREVALAILSGATAVEVAIVGRDSEFLARVGAAS
Ga0310912_101265004 | 3300031941 | Soil | EVLALAEGRHGALARLVAARAREVALATLSGKTAVEVAVVDRAGEFLARVGW
Ga0310910_107395292 | 3300031946 | Soil | LARLVAARAREVALATLSGKTAVEVAIVDREGIFLARVGW
Ga0306926_104595861 | 3300031954 | Soil | NTLARAIAGRAREVALATLCGKTAIEVAVVDRQGSFLARVGV
Ga0307479_116231561 | 3300031962 | Hardwood Forest Soil | LDIAGEWRMPLADGVARGAREVALATLSGATAVEVAIVDRTGEFLARIGHG
Ga0326597_100625671 | 3300031965 | Soil | TVARRAREVALATVSGGIAVEVAIVDRGGAFLARVGGAAWEDIAT
Ga0306922_113392831 | 3300032001 | Soil | AARAREVALATLSGKTTVEVAIVDREAEFLARVGG
Ga0318563_108080311 | 3300032009 | Soil | LALADGLQRDLARLVATRAREVALATLSGKTAVEVAVVDRTGEFLARVGW
Ga0318507_105461502 | 3300032025 | Soil | VAARAREVSLAALSGETAVEVAIVDREGEFLARVGG
Ga0310911_101760823 | 3300032035 | Soil | REGALARLVAGRAREVALATLSGKTAVEVAIVDREGELLARVGG
Ga0318570_100116215 | 3300032054 | Soil | MVAARAREVALATLCGKTAVEVAIVDRQGGFLARVGG
Ga0318533_101995951 | 3300032059 | Soil | LSLAAAHRADLARLVAARAREVALATLSGKTAVEVAIVDRDAEFLARVGG
Ga0318504_100484594 | 3300032063 | Soil | REPLAAMVAARAREVALATLCGKTAVEVAIVDRQGGFLARVGG
Ga0318514_103927992 | 3300032066 | Soil | VVAAQAREVALATLSCDIAVEVAIVDRQGDFLARVGG
Ga0318577_103471152 | 3300032091 | Soil | ARLVAARAREVALATLSGKTAVEVAIVDREGRFLARVGW
Ga0318577_105606531 | 3300032091 | Soil | RADLARLVAARAREVALATLSGKTAVEVAIVDRDAEFLARVGG
Ga0318540_100837933 | 3300032094 | Soil | VAVRAREVALATLSGRTVVEIAIVDRDGGFLARVDE
Ga0306920_1005420851 | 3300032261 | Soil | EGALARLVAARAREVALATLSGKTTLEVAIVDREGEFLARVGG
Ga0306920_1007983411 | 3300032261 | Soil | KTLARAIARRAREVALATLCGKTALEVAVVDRQGSFLARVGV
Ga0335085_112845241 | 3300032770 | Soil | DGVARRAREVALATLSGATAVEVAIIGRAGDVLARVGGEARP
Ga0335079_108069331 | 3300032783 | Soil | RAALVDGIARRARAVALATLSGATAVEVAIVDRGGEFLARVGA
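
Some sequences in the table end with `*` and others do not. A minimal sketch for working with these rows follows, assuming the trailing `*` marks a translated stop codon and should not count as a residue; the page itself does not explain the symbol. The four rows are copied from the table with their columns separated by hand, and `strip_stop` is a hypothetical helper name.

```python
from statistics import mean

# (protein ID, sample taxon ID, habitat, sequence), copied from the table.
ROWS = [
    ("Ga0063454_1014295752", "3300004081", "Soil",
     "AGELREALACAIAARAREVALATICGKSAVEVAIVDRQGGFLARVGA*"),
    ("Ga0066675_108320652", "3300005187", "Soil",
     "LARAVAGRAREVALATLCGKTTVELAVVDRQGGFVARVGG*"),
    ("Ga0182036_104388583", "3300016270", "Soil",
     "AAQAREVALATLCGKTAIEVAIVDRQGSFLARVGG"),
    ("Ga0318572_108671411", "3300031681", "Soil",
     "LVAARAREVALATLSGKTTVEVAIVDREGEFLARVGG"),
]


def strip_stop(seq: str) -> str:
    """Drop a trailing '*' (assumed stop-codon marker) before counting residues."""
    return seq.rstrip("*")


lengths = []
for protein_id, taxon_id, habitat, seq in ROWS:
    residues = strip_stop(seq)
    lengths.append(len(residues))
    has_stop = seq.endswith("*")
    print(f"{protein_id}\t{taxon_id}\t{habitat}\t{len(residues)} aa\tstop={has_stop}")

# Mean over these four sample rows only, not over the whole family.
print(f"mean length of sample rows: {mean(lengths):.1f}")
```

Running the same loop over all 115 rows would give a figure to compare against the average sequence length reported in the overview, with the caveat that the page does not say whether that average counts the `*`.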




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice