NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F097786

Metagenome / Metatranscriptome Family F097786

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097786
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 44 residues
Representative Sequence NPPYGADEYLREAYGAIHQGIAAAGAGYVEVARLTPERMAQ
Number of Associated Samples 91
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 2.94 %
% of genes near scaffold ends (potentially truncated) 93.27 %
% of genes from short scaffolds (< 2000 bps) 90.38 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (68.269 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(14.423 % of family members)
Environment Ontology (ENVO) Unclassified
(22.115 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.038 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58
1JGI12269J14319_102822171
2JGI20180J14839_10137911
3JGIcombinedJ26739_1003467971
4Ga0062385_108747391
5Ga0062386_1012420713
6Ga0062388_1021836171
7Ga0066819_10076071
8Ga0066823_101232592
9Ga0070761_109241351
10Ga0070764_105190493
11Ga0075028_1002302781
12Ga0075028_1006789871
13Ga0075028_1008798661
14Ga0075029_1004727553
15Ga0075029_1006029463
16Ga0075029_1007804371
17Ga0075019_103275151
18Ga0070712_1011033543
19Ga0116105_10632931
20Ga0116135_13831503
21Ga0116228_109197171
22Ga0074044_108908901
23Ga0126356_101267161
24Ga0137393_117877382
25Ga0137360_118317881
26Ga0137416_122670301
27Ga0168317_10704881
28Ga0181536_103693123
29Ga0181525_107143493
30Ga0182030_106300771
31Ga0137405_14084302
32Ga0167652_10895741
33Ga0167631_10221661
34Ga0167668_10311903
35Ga0167668_10725503
36Ga0167650_11146783
37Ga0187847_102222613
38Ga0187782_110736891
39Ga0187859_104676223
40Ga0193728_10840771
41Ga0210407_111599981
42Ga0210403_101205115
43Ga0210395_108972451
44Ga0210406_100804561
45Ga0210396_112914681
46Ga0210388_103441934
47Ga0210393_114206991
48Ga0210387_100347491
49Ga0210387_109807601
50Ga0210394_103428324
51Ga0210394_108830523
52Ga0210394_116076021
53Ga0210390_103245331
54Ga0210398_113207391
55Ga0212123_104510283
56Ga0242657_10865391
57Ga0224557_11990841
58Ga0208935_10382052
59Ga0208691_10646263
60Ga0208589_10286791
61Ga0209421_10285363
62Ga0209167_106918281
63Ga0209275_105931423
64Ga0209380_100530681
65Ga0265355_10013491
66Ga0302220_102705521
67Ga0302225_101873623
68Ga0302228_102905933
69Ga0302229_100576741
70Ga0308309_101285271
71Ga0311361_109927321
72Ga0311362_101857811
73Ga0311352_101956421
74Ga0311330_107021381
75Ga0311371_116124473
76Ga0311336_103032074
77Ga0302302_12887912
78Ga0302270_100783235
79Ga0302195_104981752
80Ga0302192_100949544
81Ga0302183_101554092
82Ga0311355_111872853
83Ga0310038_104778351
84Ga0265750_10510831
85Ga0302308_102914153
86Ga0170834_1018454521
87Ga0170834_1035533732
88Ga0170823_150943371
89Ga0302307_102991423
90Ga0302325_112733323
91Ga0265340_104222862
92Ga0170818_1077236913
93Ga0170818_1106117373
94Ga0307373_101003571
95Ga0310686_1083512591
96Ga0310686_1160312493
97Ga0307474_103004791
98Ga0307474_115042851
99Ga0307471_1015991483
100Ga0307471_1024265713
101Ga0348332_117187683
102Ga0334828_139019_14_172
103Ga0334790_111094_711_839
104Ga0370514_066857_786_908
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Fibrous Signal Peptide: No Secondary Structure distribution: α-helix: 28.99%    β-sheet: 0.00%    Coil/Unstructured: 71.01%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540NPPYGADEYLREAYGAIHQGIAAAGAGYVEVARLTPERMAQSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
68.3%31.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Bog
Peatland
Iron-Sulfur Acid Spring
Watersheds
Soil
Soil
Vadose Zone Soil
Glacier Forefield Soil
Surface Soil
Peatlands Soil
Arctic Peat Soil
Soil
Forest Soil
Hardwood Forest Soil
Untreated Peat Soil
Tropical Peatland
Bog Forest Soil
Bog
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Soil
Fen
Palsa
Bog
Plant Litter
Weathered Mine Tailings
Rhizosphere
Host-Associated
Boreal Forest Soil
2.9%3.8%6.7%3.8%3.8%4.8%14.4%4.8%3.8%4.8%2.9%11.5%5.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12269J14319_1028221713300001356Peatlands SoilGSGLMIVNPPFGADEYLATAYAAIHRGIAAPGAGYVEVARLTPEKL*
JGI20180J14839_101379113300001413Arctic Peat SoilPDDAGLGLAGSGLMIVNPPYGADEFLTAAYGAIHRGIAAAGAGYVEVARLTPEKL*
JGIcombinedJ26739_10034679713300002245Forest SoilLVNPPYGADDFLREAYLAIHASLAEAGRGYVEVARLTAERMAQ*
Ga0062385_1087473913300004080Bog Forest SoilAGVGLAGSGLMIVNPPFGADEYLREAYEAIQRGIAAPNSGYVEVARLTPERMAQ*
Ga0062386_10124207133300004152Bog Forest SoilPPYGADEYLQRSYGAIHAGLTDSSSGYVEVTRLTGERVAQ*
Ga0062388_10218361713300004635Bog Forest SoilAGSGLMIVNPPYGADEYLRRAYAAIHEGIALPGAGYVEVARLTPERMAL*
Ga0066819_100760713300005148SoilAGSGLMIVNPPYGADDYLNDAYAAIHKGIAAPGSGYVEIARLTPERMAQ*
Ga0066823_1012325923300005163SoilSGLMIVNPPYGADDYLNDAYAAIHKGIAAPGSGYVEIARLTPERMAQ*
Ga0070761_1092413513300005591SoilNPPHGADEYLTGAYAAIHQGIATSGSGYVEVARLTPERMAQ*
Ga0070764_1051904933300005712SoilDNYLSDAYAAIHKVIASSGSGYVEVTRLTPERMAQ*
Ga0075028_10023027813300006050WatershedsPDDAGVGLAGSGLMIVNPPFGTDEYLTRAYATIREGIATPKAGYVEVGRLTPERMAQ*
Ga0075028_10067898713300006050WatershedsVVNPPYGADDFLMQAYGVIHGAVAAAGAGYVEVARLTPELMAK*
Ga0075028_10087986613300006050WatershedsGVGLAGSGLMIVNPPYGADDYLNDAYAAIHKGIAAPGSGYVEIARLTPERMAQ*
Ga0075029_10047275533300006052WatershedsIVNPPYGADDHLREAYTAVHRAIAAAGAGYVEVDRLTPERMAQ*
Ga0075029_10060294633300006052WatershedsVGLAGSGLMIANPPYGADEFLGEAYARLHELIGTPAMGYVEVARLTPERVAS*
Ga0075029_10078043713300006052WatershedsGLLIVNPPYGADDHLLEAYSAIHSAVAAAGSGYVEVARLTPERVAQ*
Ga0075019_1032751513300006086WatershedsLIVNPPYGADDHLREAYTAVHRAIAAAGAGYVEVDRLTPERMAQ*
Ga0070712_10110335433300006175Corn, Switchgrass And Miscanthus RhizosphereAGVGLAGSGLMIVNPPYGADDYLNDAYAAIHKGIAAPGSGYVEIARLTPERMAQ*
Ga0116105_106329313300009624PeatlandYGADEHLTAAYAAIHRGVAPATDAGYVEVARLTPEQL*
Ga0116135_138315033300009665PeatlandGVGLAGSGLIIVNPPYGADEFLASAYSAVHRILAPSGSGYVEVARLTPERMAQ*
Ga0116228_1091971713300009701Host-AssociatedNPPYGADTYLQQAYSAIHGAIADPGAGYVEVTRLTPERIAR*
Ga0074044_1089089013300010343Bog Forest SoilGSGLIIVNPPYGADDYLAGAYAAIHQGIATPGAGYVEVARLTAERAGH*
Ga0126356_1012671613300010877Boreal Forest SoilAGSGLLIVNPPYGADQFLRDAYGAIHQAIATAGAGYVEVARLTPERMAQ*
Ga0137393_1178773823300011271Vadose Zone SoilAGSGLMVVNPPYGADDFLNQAYEVIHGAVAAAGAGYVEVARLTPELMAK*
Ga0137360_1183178813300012361Vadose Zone SoilAGVGLAGSGLMIVNPPHGTDEYLRDAYAAIHSGIAVPGAGYVEVGRLTPERMAQ*
Ga0137416_1226703013300012927Vadose Zone SoilIVNPPFGTDAYLRAAYAAIHAGIAASGAGYVAVTRLTPERMAQ*
Ga0168317_107048813300012982Weathered Mine TailingsVNPPYGADDYLAAAYAAIHRILAPSGAGYVEVARLTPERMAQ*
Ga0181536_1036931233300014638BogPPYGADEYLQNAYTTVHQGIAAPGSGYVEVARLTRERMAQ*
Ga0181525_1071434933300014654BogDYLSGAYAAVHQGIAARGAGYVEVGRLTPERMAQ*
Ga0182030_1063007713300014838BogDAGVGLAGSGLAIVNPPHGTDEYLSDAYAAIHRGIAPTGAGYVEVARLTPERMAQ*
Ga0137405_140843023300015053Vadose Zone SoilVIVNPPYGADQFLRDAYTAIHRVIAAAGAGYVEVARLTPERVAQ*
Ga0167652_108957413300015164Glacier Forefield SoilPYGTDSYLREAYAAVHDAVASAGAGYVEVARLTPERMSQ*
Ga0167631_102216613300015168Glacier Forefield SoilGSGLMIVNPPYGADQFLRDAYAAIHQGIGAPGAGYVEVGRLTPERMAH*
Ga0167668_103119033300015193Glacier Forefield SoilLLIVNPPYGADDYLRNAYAMIHRGIAAPGAGYVEVGRLTAERMAQ*
Ga0167668_107255033300015193Glacier Forefield SoilDDFLQQAYGSIHSAVAAAGAGYVEVARLTPELVAK*
Ga0167650_111467833300015203Glacier Forefield SoilPPYGADDHLRDAYTAIHRGIATPGAGYVEVGRLTPERMAQ*
Ga0187847_1022226133300017948PeatlandLAGSGLMIVNPPFGADEYLASAYAAIHRGIAAADAGYVEVARLTPEKL
Ga0187782_1107368913300017975Tropical PeatlandVNPPFGADEFLASAYAAVHEALAMAGAGYVEVARLTPEQMA
Ga0187859_1046762233300018047PeatlandNPPFGTDADLKSAYGVIHGALAPAAGAGYVEVARLTPERMAQ
Ga0193728_108407713300019890SoilGSGLMIVNPPYGTDEYLREAYAAIHSGIAASGAGYVEVARLTPERMAQ
Ga0210407_1115999813300020579SoilPYGTDEYLQRAYGAIHGALGDSGVGYVEVARLTAERVAQ
Ga0210403_1012051153300020580SoilNPPYGADEYLREAYGAIHQGIAAAGAGYVEVARLTPERMAQ
Ga0210395_1089724513300020582SoilDEFFRDAYGAIHQAIAAAGAGYVEVARLTPERLAQ
Ga0210406_1008045613300021168SoilADQFLRDAYGAIHQAIAPAGAGYVEVARLTPERMAQ
Ga0210396_1129146813300021180SoilGLAGSGLMIVNPPYGTDEYLTDAYAAIHQGIAAPGAGYVEVARLTAERVAH
Ga0210388_1034419343300021181SoilVNPPYGADEFLSAAYSAVHRTLAPAGSGYVEVERLTPERMAQ
Ga0210393_1142069913300021401SoilLMVVNPPYGADDFLKQAYEAIHGRVAAAGAGYVEVARLTPELMAQ
Ga0210387_1003474913300021405SoilIVNPPYGADDFFRDAYGAIHQAIAAAGAGYVEVARLTPERLAQ
Ga0210387_1098076013300021405SoilVNPPYGADQFLRDAYGVIHQAIAAPGAGYVEVDRLTPERMAQ
Ga0210394_1034283243300021420SoilPPYGADEFLSGAYAAIHRGIATPGAGYVEVARLTPERMAQ
Ga0210394_1088305233300021420SoilLAGSGLMIVNPPYGADEYLRRAYTAIHKGIASPGAGYVEVARLTPERMAL
Ga0210394_1160760213300021420SoilGSGLMIVNPPYGTDDYLRDAYTAIHSGIAAPGAGYVEVARLTPERMAQ
Ga0210390_1032453313300021474SoilLIVNPPYGADEYLRRAYTAIHEGIAVPGAGYVEVARLTPERMAL
Ga0210398_1132073913300021477SoilGADQHLADAYSAIHGGIATPGAGYVEVGRLTPERVAQ
Ga0212123_1045102833300022557Iron-Sulfur Acid SpringDEYLRAAYTAIHAGIAASGAGYVEVARLTPERMAQ
Ga0242657_108653913300022722SoilVNPPYGADEYLRAAYGAIHQGIAAAGAGYVEVARLTPEGMAQ
Ga0224557_119908413300023101SoilGADDYLQEAYAAIHGAIATAGSGYVEVARLTPERMAQ
Ga0208935_103820523300025414PeatlandYGADEHLTAAYAAIHRGVAPATDAGYVEVARLTPEQL
Ga0208691_106462633300025612PeatlandGADEFLASAYSAVHRILAPSGSGYVEVARLTPERMAQ
Ga0208589_102867913300025634Arctic Peat SoilAGSGLLIVNPPYGADEYLSEAYTAIHRGIAVPGAGYVEVVRLTPERVAQ
Ga0209421_102853633300027432Forest SoilGLMIVNPPYGADGFLRDAYGAIHQGIAAPGAGYVEVARLTPERMAQ
Ga0209167_1069182813300027867Surface SoilAGSGLLIVNPPYGADEYLRRAYTAIHEGIALPGAGYVDVTRLTPERMAQ
Ga0209275_1059314233300027884SoilYGTDEYLTDAYAAIHQGIAAPGAGYVEVARLTAERVAH
Ga0209380_1005306813300027889SoilPYGADDFLRQAYQAIHASLAEAGRGYVEVARLTAERMAQ
Ga0265355_100134913300028036RhizosphereMIVNPPYGADQFLRDAYSAIHQAIAAPGAGYVEVARLTPERMAH
Ga0302220_1027055213300028742PalsaASVGLAGSGLLLVNPPYGADDFLREAYLAIHASLAEAGRGYVEVARLTPERMAQ
Ga0302225_1018736233300028780PalsaLMVINPPFGADEYLRAAYTAIHAGLAAAGSGYVEVDRLTPERIN
Ga0302228_1029059333300028808PalsaAGSGLLLVNPPYGADDFLREAYAAIHDSLAEAGRGYVEVARLTPERMAQ
Ga0302229_1005767413300028879PalsaSVGLAGSGLLLVNPPYGADDFLREAYLAIHASLAEAGRGYVEVARLTPERMAQ
Ga0308309_1012852713300028906SoilGLMIVNPPYGADQYLRDAYGAIHHAIAVPGAGYVEVVRLTPERLAQ
Ga0311361_1099273213300029911BogMLINPPFGADEYLRAAYTAIHAGIAAAGSGYVEVNRLTPERIG
Ga0311362_1018578113300029913BogGLVMVNPPHGTDEFLAAAYSAIHRTLAAPDAGYVEVARLTAERMAQ
Ga0311352_1019564213300029944PalsaGVGLAGSGLMIVNPPYGTDEYLTDAYAAIHQGIAAPGAGYVEVARLTAERVAH
Ga0311330_1070213813300029945BogGADEALADAYTAIHSALAAPGAGYVEVARLTSERMAQ
Ga0311371_1161244733300029951PalsaPPFGADADLKIAYGVIHGALAPAGAGYVEVARLTPERMAQ
Ga0311336_1030320743300029990FenDAFLKQAYEVIHGAVAASGAGYVEVARLTVELMAK
Ga0302302_128879123300029997PalsaVGLAGSGLMVINPPFGADEYLRAAYTAIHAGLAAAGSGYVEVDRLTPERIN
Ga0302270_1007832353300030011BogSGLMIVNPPYGTDEHLRRAYEAIHRAIAATGAGYVEVARLTPERMAQ
Ga0302195_1049817523300030051BogLAGSGLMIVNPPYGTDEHLRRAYEAIHRAIAATGAGYVEVARLTPERMAQ
Ga0302192_1009495443300030507BogPFGADDYLRQAYTAIHASVASAGHGYVEVTRLTPERMA
Ga0302183_1015540923300030509PalsaMIVNPPYGADQYLRDAYGAIHHAIAVPGAGYVEVARLTPERMAQ
Ga0311355_1118728533300030580PalsaGSGLMLVNPPFGADEYLLEAYTAIHAAVATADAGYVEVARLTPERMAH
Ga0310038_1047783513300030707Peatlands SoilGLMIVNPPFGADEYLATAYAAIHRGIAAPGAGYVEVARLTPEKL
Ga0265750_105108313300030813SoilAGSGLMIVNPPYGADQFLRDAYGAIHEAIAAPGAGYVEVARLTPERMAQ
Ga0302308_1029141533300031027PalsaSGLLLVNPPYGADDFLREAYLAIHASLAEAGRGYVEVARLTPERMAQ
Ga0170834_10184545213300031057Forest SoilMIVNPPYGADQFLRDAYGAIHRGIAAAGAGYVEVARLTPERLAQ
Ga0170834_10355337323300031057Forest SoilIVNPPYGADQFLRDAYGAIHRGIAAAGAGYVEVARLTPERMAH
Ga0170823_1509433713300031128Forest SoilLAGSGLMIVNPPYGADQFLRDAYGAIHRGIAAAGAGYVEVARLTPERMAH
Ga0302307_1029914233300031233PalsaMIVNPPYGTDEYLTDAYAAIHQGIAAPGAGYVEVARLTAERVAH
Ga0302325_1127333233300031234PalsaADEFLRQAYEAIHASVAEAGRGYVEVARLTGERMAQ
Ga0265340_1042228623300031247RhizosphereGVGLAGSGLMIVNPPYGADEFLNKAYAAIHKGLATPGAGYVEVARLTPERMAQ
Ga0170818_10772369133300031474Forest SoilDAGVGLAGSGLMIVNPPYGADDYLNDAYAAIHKGIAAPGSGYVEIARLTPERMAQ
Ga0170818_11061173733300031474Forest SoilVNPPYGADEYLSGAYAAIHKGIAAPGAGYVEVARLTPERMAQ
Ga0307373_1010035713300031672SoilPFGIEEPLRAAYAAIHRHLAPAGAGYVEVARLTPERMAQ
Ga0310686_10835125913300031708SoilPPYGADEYLHGAYAAIHAGIAAAASGYVEVARLTPERMAQ
Ga0310686_11603124933300031708SoilFGADEYLRAAYTAIHEGLAAAGSGYVEVDRLTPERIS
Ga0307474_1030047913300031718Hardwood Forest SoilADQFLRDAYGAIHQAIAAPGAGYVEVDRLTPERMAH
Ga0307474_1150428513300031718Hardwood Forest SoilGADRFLRDAYGAIHHGIAALGAGYVEVARLTPERMAQ
Ga0307471_10159914833300032180Hardwood Forest SoilYGADEYLRDAYAVIHRAIAAPGAGYVEVVRLTPERMAQ
Ga0307471_10242657133300032180Hardwood Forest SoilGVGLAGSGLMIVNPPYGADDYLTDAYAAIHKGIAAPGSGYVEIARLTPERMAQ
Ga0348332_1171876833300032515Plant LitterVNPPYGADEYLSEAYAAIHRGIATPGAGYVEVVRLTPERVAQ
Ga0334828_139019_14_1723300033822SoilVGLAGSGLMIVNPPYGTDEYLTEAYTAIHKGIAAPGAGYVEVARLTPERVAQ
Ga0334790_111094_711_8393300033887SoilMIVNPPFGADEYLATAYAAIHRGIAAPGAGYVEVARLTPEKL
Ga0370514_066857_786_9083300034199Untreated Peat SoilPPYGADAFLADAYGELHRGIAQPGAGYVQVARLTPERMPQ


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.