NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097991

Overview

Basic Information
Family ID F097991
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 39 residues
Representative Sequence EADPQVQKAVEAIPQARALYQNARKIVAQRMGGSFDQP
Number of Associated Samples 97
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 86.54 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.52

Hidden Markov Model

Most Common Taxonomy
Group Bacteria (82.692 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (13.462 % of family members)
Environment Ontology (ENVO) Unclassified (26.923 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (53.846 % of family members)
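
The percentages above are given as fractions of family members, so they can be converted back into approximate member counts. The following is a minimal sketch, assuming each percentage is taken over the 104 sequences listed under Basic Information; the function name `members_from_percent` and the dictionary labels are hypothetical and not part of NMPFamsDB.

```python
FAMILY_SIZE = 104  # "Number of Sequences" from Basic Information

# Percentages copied from the Overview section of this page.
reported = {
    "Bacteria (NCBI taxonomy)": 82.692,
    "Forest Soil -> Soil (GOLD)": 13.462,
    "Unclassified (ENVO)": 26.923,
    "Soil, non-saline (EMPO)": 53.846,
}


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole number of members."""
    return round(percent / 100 * total)


for label, percent in reported.items():
    print(f"{label}: about {members_from_percent(percent)} of {FAMILY_SIZE}")
```

Rounding to the nearest integer absorbs the three-decimal truncation of the reported percentages.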




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 40.91%, β-sheet: 0.00%, Coil/Unstructured: 59.09%

Predicted 3D Structure

pTM-score: 0.52

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
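
The low-confidence rule stated above amounts to a threshold check on the pTM-score. A minimal sketch, assuming only the cutoff of 0.7 quoted on this page; the function name `is_low_confidence` is hypothetical and does not come from NMPFamsDB.

```python
PTM_CUTOFF = 0.7  # cutoff quoted on this page: pTM < 0.7 means low confidence


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score falls below the cutoff."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score must lie in [0, 1]")
    return ptm < cutoff


# Family F097991 reports a pTM-score of 0.52.
print(is_low_confidence(0.52))
```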



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 82.7%
Unclassified: 17.3%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Peatland; Bog Forest Soil; Bog; Freshwater Sediment; Iron-Sulfur Acid Spring; Watersheds; Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Agricultural Soil; Forest Soil; Hardwood Forest Soil; Tropical Peatland; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1007609531 | 3300000364 | Soil | PQVXKGVEAVPQARALYQNARKIVAQRTGASIDQP*
INPhiseqgaiiFebDRAFT_1007628011 | 3300000364 | Soil | LLEADPQVQKGVEAIPQARALYQNARKVVAQRTGAGIDQP*
JGIcombinedJ26739_1006717853 | 3300002245 | Forest Soil | EADPQVQKAVDAIPQARALYQNARKIVAQRTGTPIDQP*
Ga0066679_100805351 | 3300005176 | Soil | KVLLEADPQVQKAVEAIPQARALYQNVKKIVAQRSGGAIEQP*
Ga0066388_1010519121 | 3300005332 | Tropical Forest Soil | SEGSKVLIEADPQVQAAMDALPQARTLYQSARKIVAQRTGTTTLDQP*
Ga0066686_101693323 | 3300005446 | Soil | LEADPQVQKAVEAVPQAGALYQKVRRIVAQRTGGAAEQP*
Ga0070706_1011922881 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | SDPQVQKAVEAIPQARALYENARKIVAQRAGLAPGQTQP*
Ga0070707_1016169431 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | ELEDDLQVQKAVESIPQARALYENARKVIAQRTGAQNTFR*
Ga0073909_101549793 | 3300005526 | Surface Soil | PQVQKAVDAIPQARALYLNAKRITAQRTGGAIDQP*
Ga0070697_1019862833 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LEADPQLQKGVEAMPQARALYQNARKIVAQRTGGTIDQP*
Ga0070730_108059542 | 3300005537 | Surface Soil | MLEADPQVQKAVEAIPQARALYQNARKIVAQRTGATFDQP*
Ga0070731_100289471 | 3300005538 | Surface Soil | VQKAVEAVPQARALYQNARKIVAQRAGTSSIDQP*
Ga0070693_1006678642 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | ELEADPQVQKAVDAIPQARALYENARKVVAQRMGGASVDQP*
Ga0066670_107376323 | 3300005560 | Soil | KVLLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGNSIERP*
Ga0066706_114382781 | 3300005598 | Soil | LLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGGPVEQP*
Ga0066903_1056750971 | 3300005764 | Tropical Forest Soil | QVQKAVEAVPQARALYQNAHKIMAQRMGGSNDQP*
Ga0075018_100419571 | 3300006172 | Watersheds | VLLEADPQVQKAVEAIPQARALYQNARKIVAQRTGERGDQP*
Ga0070765_1002632801 | 3300006176 | Soil | EADPQVQKAVEAVPQARALYQNVRKIVAQRAGGNLDQP*
Ga0074064_117306803 | 3300006603 | Soil | KVLLEADPQVQKGIEAIPQARALYQNARKIVAQRNGGSLDQP*
Ga0073928_100869525 | 3300006893 | Iron-Sulfur Acid Spring | VLLEADPQVQKAVDSIPQARALYQNARKIVAQRTGTPIDQP*
Ga0099829_106765633 | 3300009038 | Vadose Zone Soil | LLEADPQVQKAVEAVPEARALYQKTRRIVAQRTGGAVEQP*
Ga0099830_117308492 | 3300009088 | Vadose Zone Soil | DDVQLQKAIESIPQARALYENARRVIAQRSSNGQSPR*
Ga0099827_100515481 | 3300009090 | Vadose Zone Soil | LSVFGLPDSVKVELEADPQVQKAVESIPQARALYESVRKVIAQRSGEPVSKP*
Ga0066709_1018194971 | 3300009137 | Grasslands Soil | ADPQVQKAVEAIPQARALYQNAKKIVAQRAGGTIEQP*
Ga0105248_108906711 | 3300009177 | Switchgrass Rhizosphere | QVQKGVEAVPQARALYQNARKIVAQRAGASIDQP*
Ga0116137_10550752 | 3300009549 | Peatland | LLEADPQVQKAIEAIPQARALYENARKVVAQRTGASPDQP*
Ga0105249_120679201 | 3300009553 | Switchgrass Rhizosphere | PQVQKAVEAVPQARALYQNARKIVAQRTGAGIDQP*
Ga0134064_100117361 | 3300010325 | Grasslands Soil | QVQKAVEAIPQARALYQNAKKIVAQRAGEPVEQP*
Ga0074046_105224522 | 3300010339 | Bog Forest Soil | ADPQVQKAIEAIPQARALYENARKVVAQRAGTTPDQP*
Ga0074045_100206291 | 3300010341 | Bog Forest Soil | ADPQVQKAVEAIPQARALYDNARKVVAQRAGATPTDQP*
Ga0126378_124601532 | 3300010361 | Tropical Forest Soil | QVQAAVDAIPKARALYQNAKKIIAQRSGGAVEQP*
Ga0126377_111778553 | 3300010362 | Tropical Forest Soil | DPQVQKAIESVPQARALYQNARKIVAQRNGGSIDQP*
Ga0137379_109670842 | 3300012209 | Vadose Zone Soil | FKVLLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGGPVEQP*
Ga0137385_103114494 | 3300012359 | Vadose Zone Soil | LEADPQVQKAVEAVPQARALYQNARRIVAQRTGGAVEQP*
Ga0137361_101278235 | 3300012362 | Vadose Zone Soil | GDKVLLAADPQVQKAVESIPQARALYENARKVVAQRNGSGHDQP*
Ga0137361_104632441 | 3300012362 | Vadose Zone Soil | GTKVQLEADPQVQKAVEAIPQARALYETARKIVAQRAGLTPDQP*
Ga0137397_105815432 | 3300012685 | Vadose Zone Soil | FLSAYGQAEGTKVQLESDPQVQKAVEAIPAARALYESARKIVAQRAGLAPAQP*
Ga0157289_104377871 | 3300012903 | Soil | PQVQKAVEAVPQARALYQNAKKIVAQRTGAGIDQP*
Ga0137413_106627982 | 3300012924 | Vadose Zone Soil | QLESDPQVQKAVEAIPQARALYENARKIVAQRAGLAPAQP*
Ga0137404_102616152 | 3300012929 | Vadose Zone Soil | QVQKAVEAIPQARALYENARKIVAQRAGLAPAQP*
Ga0137407_104150363 | 3300012930 | Vadose Zone Soil | SAYGTQEGDKVLLAADPQVQKAVDSIPHARALYENARKVVAQRNGSNHDQP*
Ga0137410_101864083 | 3300012944 | Vadose Zone Soil | VLLEADPQVQKAVESIPQARALYQNARKIVAQRNGGSLDQP*
Ga0126369_112332391 | 3300012971 | Tropical Forest Soil | QVQKAVEAVPQARALYQNARKIVAQRTGESPDQP*
Ga0181539_11009711 | 3300014151 | Bog | QMQKAIEAIPQARALYENARKVVAQRSGSSPNQP*
Ga0182032_101895731 | 3300016357 | Soil | EADPQVQKGVEAVPQARALYQNARKIVAQRAGGGVDQP
Ga0182040_110064471 | 3300016387 | Soil | EADPQVQKAVEAVPQARALYENARKVVAQRNGTSDQP
Ga0182040_111250221 | 3300016387 | Soil | ADPQVQKGVEAVPQARALYQNARKIVAQRAGGGVDQP
Ga0187818_104447981 | 3300017823 | Freshwater Sediment | DPQVQKAVEALPEARALYQNARKIVAQRMGGSFNQP
Ga0187824_102958101 | 3300017927 | Freshwater Sediment | QVQKAVEAIPQARALYENARKIVAQRSGSNVHFNQP
Ga0187825_104055712 | 3300017930 | Freshwater Sediment | PQVQKAVEAIPQARALYQNARKIVAQRTGATFDQP
Ga0187803_102618861 | 3300017934 | Freshwater Sediment | PQVQKAVEAIPQARALYENARKVVAQRSGGSPDQP
Ga0187817_110195112 | 3300017955 | Freshwater Sediment | FKVLLEADPQVQRAVDAVPQARALYENARKIIALRNGSTAHFNQP
Ga0187777_101826871 | 3300017974 | Tropical Peatland | EGLKVLLEADPQVQAAVDAIPKARALYQNAKKIIAQRSGSAVEQP
Ga0187886_11775721 | 3300018018 | Peatland | DPQVQKAIEAIPQARALYENARKIVAQRAGASPDQP
Ga0187869_104904162 | 3300018030 | Peatland | LEADPQVQKAIEAIPQARALYENARKVVAQRTGASPDQP
Ga0187784_109758602 | 3300018062 | Tropical Peatland | FKVLLDADPQVQKAIDAVPQARALYENARKVVAQKSGSNIRFNQP
Ga0187771_109037601 | 3300018088 | Tropical Peatland | PQVQKAIEAVPQARALYQNARRIVAQRSGASPDQP
Ga0066669_101578711 | 3300018482 | Grasslands Soil | LEGDPQVQKAVEAVPQARALYQNAHKIMAQRMGGSVDQP
Ga0193726_12456022 | 3300020021 | Soil | AAAPQDQKAVEATPQARALYQNARKIVAQRTGGSIDQP
Ga0210407_108321891 | 3300020579 | Soil | AADPQVQKAVESIPQARALYENARKVVAQRSGSGHDQP
Ga0210403_107185021 | 3300020580 | Soil | LKVLLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGGAIDQP
Ga0210403_110100371 | 3300020580 | Soil | KVLLEADPQVQKAVDTIPQARTLYENARKIVAQRTGGSFEHP
Ga0210404_100358001 | 3300021088 | Soil | DPQVQKAVEAIPQARALYENARKIVAQRAGLTSNQP
Ga0210400_101799211 | 3300021170 | Soil | LEADPQVQKAVDSIPQARALYENARKVVAQRMGNTPDQP
Ga0210385_102601803 | 3300021402 | Soil | DPQVQKAVDSIPQARALYENARKIVAQRMGGNAPDQP
Ga0210397_114663562 | 3300021403 | Soil | VLLAADPQVQKAVDSIPQARALYENARKVVAQRNGSNHDQP
Ga0210384_105879461 | 3300021432 | Soil | DPQVQKAVESIPQARALYENARKIVAQRNGSGHDQP
Ga0210391_114146311 | 3300021433 | Soil | EADPQVQKAVDSIPQARALYENARKVVAQRMGNTPDQP
Ga0210390_112339151 | 3300021474 | Soil | DPQVQKAVDSIPQARALYENARRIVAQRMGNTPDQP
Ga0242654_103158971 | 3300022726 | Soil | GDKVLLAADPQVQKAIESIPQARALYENARKVVAQRNGSGHDQP
Ga0209236_10046739 | 3300026298 | Grasslands Soil | VLLEADPQVQKAVEVIPQARALYQNARKIIAQRMGESVDQP
Ga0209236_11176833 | 3300026298 | Grasslands Soil | PDSVKVELEADPQVQKAVESIPQARALYENVRKVIAQRSGEPVSKP
Ga0209239_11924551 | 3300026310 | Grasslands Soil | PQVQKAVEAIPQARALYQNAKKIVAQRAGNSIERP
Ga0209154_10469911 | 3300026317 | Soil | DPQVQKAVEAVPQARALYQNAHKIMAQRMGGSADRPY
Ga0209160_11278393 | 3300026532 | Soil | LLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGGTIEQP
Ga0209388_10056863 | 3300027655 | Vadose Zone Soil | EADPQVQKAVEAIPQARALYETARKIVAQRAGLTPDQP
Ga0209626_10460193 | 3300027684 | Forest Soil | VLLEADPQVQKAVEAIPQARALYQNAKKIVAQRAGGPVEQP
Ga0209447_100037019 | 3300027701 | Bog Forest Soil | PQVMKAIEAIPQARALYQNARKVVAQRTSGSLDQP
Ga0209038_101282861 | 3300027737 | Bog Forest Soil | ELEADRQVQASIEAVPQARALYENARKIMADRQGMTTFRP
Ga0209772_101533071 | 3300027768 | Bog Forest Soil | ESVLLLADPQVEKAAESVPQARALYQNARKIVAQRSGGNTLEQP
Ga0209074_103976632 | 3300027787 | Agricultural Soil | ESDPQVQKAVEAIPQARALYENARKIVAQRAGLRQDRP
Ga0209656_104889431 | 3300027812 | Bog Forest Soil | QVEKAAESVPQARALYQNARKIVAQRTGTPTLDQP
Ga0209039_100441763 | 3300027825 | Bog Forest Soil | ILLEADPQVQKAIEAIPQARALYENARKVVAQRSGATPDQP
Ga0209039_101793502 | 3300027825 | Bog Forest Soil | ILLEADPQVQKAIEAIPQARALYENARKVVAQRAGASPDQP
Ga0209283_100662171 | 3300027875 | Vadose Zone Soil | VELEADPQVQKAVDSIPQARALYENVRKVIAQRSGEPVSKP
Ga0209068_103387991 | 3300027894 | Watersheds | LLEADPQVQAAVDAIPQARALYQNVKKIIAQRSGAAVEQP
Ga0209526_102913773 | 3300028047 | Forest Soil | LLEADPQVQKAVDAIPQARALYQNARKIVAQRTGTPIDQP
Ga0138301_18900691 | 3300031022 | Soil | PQVQKAVDSIPQARALYQNARKIVAQRTGAPIDQP
Ga0170822_140578701 | 3300031122 | Forest Soil | GLLEADPQLQKGIEAMPQARALYQNARKIVAQRAGAGSDQP
Ga0170824_1190132902 | 3300031231 | Forest Soil | LEADPQVQKAVEALPQARALYENARKVVAQRNGGGSDQP
Ga0306917_111215721 | 3300031719 | Soil | ADPQVQKGVEAVPQARALYQNARKIVAQRAGGSVDQP
Ga0307469_123755152 | 3300031720 | Hardwood Forest Soil | VLLAADPQVQKAVESIPQARALYENARKVVAQRSGSGHDQP
Ga0306918_113416251 | 3300031744 | Soil | FKVLLEADPQVQRAIEAVPQARALYENARKVVAQGNAATSHFKQP
Ga0307477_100591291 | 3300031753 | Hardwood Forest Soil | EADPQVQKAVEAIPQARALYQNARKIVAQRMGGSFDQP
Ga0307475_102434371 | 3300031754 | Hardwood Forest Soil | DPQVQKAVEAIPQARALYENARKIVAQRAGVTPDQP
Ga0307475_112916631 | 3300031754 | Hardwood Forest Soil | LLEADPQVQKAVEAIPQARALYQNAKRIVAQRAGGTIEQP
Ga0307478_100973935 | 3300031823 | Hardwood Forest Soil | PQVQAAVEAIPQARALYENARKIVAQRSGGANDQP
Ga0307479_109156641 | 3300031962 | Hardwood Forest Soil | DPQVQKAVEAIPQARALYENARKIVAQRAGITPNQP
Ga0318524_106623052 | 3300032067 | Soil | ADPQVQKAIESIPQARALYENARKVVAQRNGADHDQP
Ga0335081_106375001 | 3300032892 | Soil | LKVLVEADPQVDKAIESIPQARALYQNARKIIAQRNGGASPDQP
Ga0335069_126705431 | 3300032893 | Soil | EADPQVQKAMEAIPQARALYENARKVVAQRAGNTAHLNQP
Ga0335071_120230031 | 3300032897 | Soil | LLSSDPQVQKAIESVPQARALYENARKVVAQRNGAGPDQP
Ga0335077_113640472 | 3300033158 | Soil | LLEADPQVQKAVEALPQARALYENARKVVAQRSGGGADQP
Ga0310914_116687241 | 3300033289 | Soil | DPQVQKAIESIPQARALYENARKVVAQRNGADHDQP
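
The family sequences listed above can be exported for downstream tools. The following is a minimal sketch that writes a few records as FASTA and reports their mean length. It uses only the first three entries of the list, and it assumes that the trailing `*` on some sequences is a stop-codon marker that should be dropped; the helper names `clean` and `to_fasta` are hypothetical.

```python
from statistics import mean

# First three entries of the list above: (protein ID, sequence as listed).
rows = [
    ("INPhiseqgaiiFebDRAFT_1007609531", "PQVXKGVEAVPQARALYQNARKIVAQRTGASIDQP*"),
    ("INPhiseqgaiiFebDRAFT_1007628011", "LLEADPQVQKGVEAIPQARALYQNARKVVAQRTGAGIDQP*"),
    ("JGIcombinedJ26739_1006717853", "EADPQVQKAVDAIPQARALYQNARKIVAQRTGTPIDQP*"),
]


def clean(seq: str) -> str:
    """Drop any trailing '*' (assumed here to be a stop-codon marker)."""
    return seq.rstrip("*")


def to_fasta(records) -> str:
    """Render (ID, sequence) pairs as FASTA text, two lines per record."""
    return "".join(f">{pid}\n{clean(seq)}\n" for pid, seq in records)


lengths = [len(clean(seq)) for _, seq in rows]
print(to_fasta(rows), end="")
print("mean length of these records:", round(mean(lengths), 2))
```

Extending `rows` to all 104 entries would allow a comparison with the average sequence length reported under Basic Information.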




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice