NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F087688


Overview

Basic Information
Family ID F087688
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 42 residues
Representative Sequence AEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Number of Associated Samples 102
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 2.00 %
% of genes near scaffold ends (potentially truncated) 89.09 %
% of genes from short scaffolds (< 2000 bps) 80.00 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.31


Most Common Taxonomy
Group Bacteria (90.909 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(11.818 % of family members)
Environment Ontology (ENVO) Unclassified
(20.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.818 % of family members)
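
The percentages above can be turned back into member counts. The following is a minimal sketch, assuming every percentage is taken over the 110 sequences of the family; the dictionary labels and the helper name `percent_to_count` are illustrative and not part of NMPFamsDB.

```python
# Assumption: each percentage is relative to the 110 family members.
FAMILY_SIZE = 110

# Percentages copied from the Overview section; labels are illustrative.
reported_percentages = {
    "Taxonomy: Bacteria": 90.909,
    "GOLD: Grasslands Soil": 11.818,
    "ENVO: Unclassified": 20.000,
    "EMPO: Soil (non-saline)": 51.818,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole member count."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} members")
```

Under that assumption, each reported value rounds to a whole number of members, which is a quick consistency check on the figures.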




Multiple Sequence Alignments

Full Alignment: alignment of all 110 sequences in the family (MSAViewer; sequence IDs as listed under Family Sequences).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 27.54%, β-sheet: 0.00%, Coil/Unstructured: 72.46%

Predicted 3D Structure

pTM-score: 0.31

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
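
The cutoff in the note above can be expressed as a simple check. This is a minimal sketch, assuming only the pTM < 0.7 rule quoted in the note; the function name `is_low_confidence` is hypothetical and does not come from NMPFamsDB or AlphaFold2.

```python
# Cutoff quoted in the "Low Quality Model" note (pTM < 0.7).
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F087688.
family_ptm = 0.31
print(is_low_confidence(family_ptm))
```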



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 90.9%
Unclassified: 9.1%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Peatland; Bog; Tropical Peatland; Freshwater Sediment; Watersheds; Groundwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Glacier Forefield Soil; Grasslands Soil; Surface Soil; Peatlands Soil; Agricultural Soil; Arctic Peat Soil; Hardwood Forest Soil; Bog Forest Soil; Permafrost Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Arabidopsis Rhizosphere; Corn Rhizosphere; Miscanthus Rhizosphere; Switchgrass Rhizosphere; Populus Rhizosphere
Labeled slice percentages (habitat assignment not recoverable from the chart text): 3.6%, 8.2%, 5.5%, 3.6%, 4.5%, 11.8%, 8.2%, 4.5%, 6.4%, 6.4%, 3.6%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0066680_101971521 | 3300005174 | Soil | AEDKKLLLDIISSEKKWIAPKEGGRDPLATIAEPRKSAINL*
Ga0070708_1001655081 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | KDKEVLLDIIRNEKKWIKEKELARDPLSTIAEPRKSAINL*
Ga0070731_104533902 | 3300005538 | Surface Soil | VMPSADDKKLLLEIIANENKWIAEKEGARDPFSTIGEPRKSAINL*
Ga0066697_105664131 | 3300005540 | Soil | TPEDKTLLLDIIRNEKKWIVAKEGGRDPLATIGEPRKSAINL*
Ga0070732_100407631 | 3300005542 | Surface Soil | LLEIIANDKKWIAEKEGARDPFATIGEPRKSAINL*
Ga0066700_110868322 | 3300005559 | Soil | LDIIANEKKWIAEKEGARDPFETIAEPRKSAINL*
Ga0066693_103398982 | 3300005566 | Soil | FEAMPSAEDKKLLLEIIANEKNWIAEKEGARDPFETIGEPRKSAINL*
Ga0066694_101566651 | 3300005574 | Soil | AEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL*
Ga0070762_108658301 | 3300005602 | Soil | ADDKKLLLELIANNKHWIAEKEGARDPFLTIGEPRKSAINL*
Ga0070766_109719731 | 3300005921 | Soil | ELLLDIIRNEKKWIKEKEGARDPLTTITEPRKSAINI*
Ga0066789_101141241 | 3300005994 | Soil | LLDIIGMEKKWIVAKEGGRDPLETIAEPRKSAINL*
Ga0070717_101383014 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | KLLLDIIANEKKWIAEKQGARDPFETIAEPRKSAINL*
Ga0066656_109323741 | 3300006034 | Soil | YMPSAEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL*
Ga0075029_1001537493 | 3300006052 | Watersheds | HLAECLPSAADKALLLDIIHNEKKWIKEKEGARDPFATIGEPRKSAINL*
Ga0075030_1014655602 | 3300006162 | Watersheds | DKELLLDILRNEKKWIKEKEGARDPFATIGEPRRSAINL*
Ga0075018_106246502 | 3300006172 | Watersheds | IIGAEKKWIAPRIGGRDPLATIGEVRKNAINAEK*
Ga0068871_1009268471 | 3300006358 | Miscanthus Rhizosphere | DKKLLLELIGNEKGWIAEKEGARDPFLTIGEPRKSAINL*
Ga0075521_102917921 | 3300006642 | Arctic Peat Soil | LLDIIRNEKKWIKEKEGARDPLSTIGEPRKSAINI*
Ga0066659_105184672 | 3300006797 | Soil | EFLPSAEDKKLLLEIIANEKKWIAEKEGARDPFETIAEPRKSAINL*
Ga0079220_108198512 | 3300006806 | Agricultural Soil | AADKQLLVDIIANEKQWIAEKTGARDPFETIGQPRKNAINL*
Ga0075425_1030747792 | 3300006854 | Populus Rhizosphere | YAQQLDDFLPTAADKQLLVDIIANEKQWIAEKTGARDPFETIGQPRKNAINL*
Ga0075436_1002643473 | 3300006914 | Populus Rhizosphere | PTAGDKKLLLEIIANEKKWIAEKEGARDPFETIAEPRKSAINI*
Ga0099829_109445522 | 3300009038 | Vadose Zone Soil | KEYMPTAEDRKLLLDIINNEKKWIVSKEGGRDPLATIAEPRKSAINL*
Ga0116214_10300255 | 3300009520 | Peatlands Soil | LLDIIRNEKKWIKEKEGARDPLSTIAEPRKSAINI*
Ga0116222_12523482 | 3300009521 | Peatlands Soil | LKETLASEKKWITPRTGARDPFETIAEPRKMAINV*
Ga0116218_11536051 | 3300009522 | Peatlands Soil | SADKKLLKETLASEKKWITPRTGARDPFETIAEPRKMAINV*
Ga0116110_12171262 | 3300009643 | Peatland | ELLLDIIRNEKKWIKEKTGARDPLSTIEEPRKSAINL*
Ga0105855_12043902 | 3300009649 | Permafrost Soil | MPNAADKKLLMDIIGSEKKWIAPLMGGRHALVTIGEVRKKAI
Ga0116217_107682192 | 3300009700 | Peatlands Soil | LLLDIIRNEKKWIKEKEGARDPLSTIAEPRKSAINI*
Ga0134063_101210302 | 3300010335 | Grasslands Soil | MPSAADKKLLLDIIGTEKKWIAPKEGARDPFETIAEPRKSAINL*
Ga0134062_107714091 | 3300010337 | Grasslands Soil | EEHLKEYMPSAADKKLLLDIIGTEKKWIAPKEGARDPFETIAEPRKSAINL*
Ga0074044_109954041 | 3300010343 | Bog Forest Soil | WLPTAADKKLLKETLANEKKWITPRTGARDPFETIAEPRKMAINVP*
Ga0126376_121825262 | 3300010359 | Tropical Forest Soil | DKKLLLEIIANEKKWIVEKEGARDPFETIAEPRKSAINI*
Ga0126376_131209142 | 3300010359 | Tropical Forest Soil | LEIIANEKKWITEKEGTRDPLETIGEPRKSAINL*
Ga0126378_111643291 | 3300010361 | Tropical Forest Soil | EDKKLLLEIIANEKNWIAPKEGARDPFETIAEPRKSAINL*
Ga0126378_122885951 | 3300010361 | Tropical Forest Soil | MPTAADKKLLLDIIGSEKKWIAPRNARDPLLTIGEVRKNAINAVQ*
Ga0126381_1005160281 | 3300010376 | Tropical Forest Soil | EWLPNADDKKLLLEVIANEKNWIAPRVGVRDPFETIGQPRKSAINI*
Ga0137399_116481431 | 3300012203 | Vadose Zone Soil | LLLDIIRNEKKWIKEKEGARDPLSTIGEPRKQAINI*
Ga0137376_106967751 | 3300012208 | Vadose Zone Soil | KILLLDIIRNEKKWIVAKEGGRDPLATIAEPRKSAINL*
Ga0137367_107612721 | 3300012353 | Vadose Zone Soil | VLLDILRNEKKWIKEKEGARDPLATIGEPRKSAVNL*
Ga0137390_117995991 | 3300012363 | Vadose Zone Soil | LDIIRNEKKWIKEKEGARDPLSTIGEPRKQAINI*
Ga0137397_102319773 | 3300012685 | Vadose Zone Soil | FGEVMPTAEDKSVLLEIIANEKKWIVEKEGARDPFQTIGEPRKSAINL*
Ga0137394_110183452 | 3300012922 | Vadose Zone Soil | EDRKLLLEIIANEKKWIAEKEGARDPFQTIGEPRKSAINL*
Ga0137410_107104302 | 3300012944 | Vadose Zone Soil | TAEDKKLLLETIANEKNWIAEKEGARDPFSTIGEPRKSAINL*
Ga0134077_102282732 | 3300012972 | Grasslands Soil | EQHLSKYMPSAEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL*
Ga0181538_103813531 | 3300014162 | Bog | LDIIRNEKKWIKEKTGARDPLSTIEEPRKSAINL*
Ga0163163_106599621 | 3300014325 | Switchgrass Rhizosphere | EHLESVMPTADDKKLLLELIANEKGWIAEKEGARDPFLTIGEPRKSAINL*
Ga0157376_112966391 | 3300014969 | Miscanthus Rhizosphere | TADDKKLLLELIANEKGWIAEKEGARDPFLTIGEPRKSAINL*
Ga0167668_10901752 | 3300015193 | Glacier Forefield Soil | LDIIRNEKKWIKEKAGARDPLSTITEPRKSAINI*
Ga0137409_102521373 | 3300015245 | Vadose Zone Soil | PHFHEVMPTAEDRKLLLEIIANEKKWIAEKEGARDPFQTIGEPRKSAINL*
Ga0132255_1012939451 | 3300015374 | Arabidopsis Rhizosphere | LDIIANEKKWIKEKEGARDPLSSIGEPRKSAINL*
Ga0132255_1041729501 | 3300015374 | Arabidopsis Rhizosphere | TAEDKKLLLELIAYEKKWIAEKEGARDPFETIGEPRKSAINL*
Ga0134074_12604271 | 3300017657 | Grasslands Soil | AEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0187825_102437832 | 3300017930 | Freshwater Sediment | DKKLLLEIIASEKKWIAEKEGARDPFATIGEPRRSAINL
Ga0187848_103938622 | 3300017935 | Peatland | QDKALLLDIIRNEKKWIKGKEGARDPFATIAEPRRSAINL
Ga0187783_101442741 | 3300017970 | Tropical Peatland | LLKETLATEKKWITPRTGARDPFETIAEPRKAAINVP
Ga0187781_106608622 | 3300017972 | Tropical Peatland | LPSAKDKELLLDIIRNEKKWIKEKTGARDPFSTIAEPRKSAINL
Ga0187815_100878893 | 3300018001 | Freshwater Sediment | LKETLANEKKWITPRTGARDPFETIAEPRKAAINVP
Ga0187804_101189623 | 3300018006 | Freshwater Sediment | LLDIIRNEKKWIKEKEGARDPLSTIGEPRKSAVNL
Ga0187864_102760351 | 3300018022 | Peatland | LLLDIIRNEKKWIKEKTGARDPLSTIEEPRKSAINL
Ga0187867_104538832 | 3300018033 | Peatland | VREVLPGLDDRKLLLDTLSTEKKWIVAKEGGRDPLATISEPRRSAINL
Ga0187784_101193814 | 3300018062 | Tropical Peatland | TAADKKLLKETLATEKKWITPRTGARDPFETIAEPRKAAINI
Ga0187771_113876761 | 3300018088 | Tropical Peatland | PTAADKKLLKETIASEKKWITPRTGARDPFETIAEPRKAAINVP
Ga0187774_100360551 | 3300018089 | Tropical Peatland | AEDKKHLLDLITNEKRWIAGKQGARDPFATIGEPRRSAINL
Ga0187774_103071921 | 3300018089 | Tropical Peatland | YDRQVSEWLPTAADKKLLKETLANEKKWITPRTGARDPFETIAEPRRAAINV
Ga0187770_101651763 | 3300018090 | Tropical Peatland | KLLKETLATEKKWITPRTGARDPFETIAEPRKAAINI
Ga0066669_110167112 | 3300018482 | Grasslands Soil | EQHLSKYMPSAEDRRLLLAIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0066669_112926231 | 3300018482 | Grasslands Soil | FLPSAEDKKLLLEIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0182031_12498261 | 3300019787 | Bog | QQHCQENLPGAKDKELLLDIIRNEKKWIKEREEGARDPLSTISEPRKSAINI
Ga0193751_12066001 | 3300019888 | Soil | ALLLDIIRNEKKWIKEKTGARDPLSTMGEPRKSAINI
Ga0210399_114975002 | 3300020581 | Soil | LLLETIANEKHWIAEKEGARDPFLTIGEPRKSAINL
Ga0210401_111479922 | 3300020583 | Soil | ELLLDIIRNEKKWIKEKAGARDPLSTISEPRKSAINL
Ga0210406_103474253 | 3300021168 | Soil | LLLDIIRNEKKWIKEKEGARDPLSTIGEPRKQAINI
Ga0210390_104411552 | 3300021474 | Soil | HFQEVMPSADDKKLLLEIIANENKWITEKEGARDPFSTIGEPRKSAINL
Ga0210398_109615981 | 3300021477 | Soil | DQHLAEVLPSADDKKLLLELIANNKHWIAEKEGARDPFLTIGEPRKSAINL
Ga0126371_117133232 | 3300021560 | Tropical Forest Soil | QFLPTAEDKKRLLEIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0222622_114585752 | 3300022756 | Groundwater Sediment | MPTAEDKTLLLDIIRNEKRWIVAKEGGRDPLATISEPRKSAINL
Ga0207656_104201442 | 3300025321 | Corn Rhizosphere | LLLDIIASEPSWIVPKIGARDPFETITEPRRSAINL
Ga0207685_100034171 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | MPTAEDKKLLLEIIANEKNWIAEKEGARDPFATIGEPRKSAINL
Ga0207665_104291561 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | LLDIIRNEKKWIVAKEGGRDPLATIGEPRKSAINL
Ga0209686_11394941 | 3300026315 | Soil | EYMPTPEDKTLLLDIIRNEKKWIVAKEGGRDPLATIGEPRKSAINL
Ga0209801_12498462 | 3300026326 | Soil | YMPSAADKKLLLDIIGTEKKWIAPKEGARDPFETIAEPRKSAINL
Ga0209473_10106922 | 3300026330 | Soil | LKDYMPSAEDKKLLLDIIANEKRWIAAKEGARDPFETIAEPRKSAINL
Ga0209267_12123262 | 3300026331 | Soil | QHLSEYMPSAEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0209159_10569323 | 3300026343 | Soil | DKKLLLDIIGTEKKWIAPKEGARDPFETIAEPRKSAINL
Ga0209378_11072853 | 3300026528 | Soil | KYMPSAEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0209474_102184841 | 3300026550 | Soil | EQHLSEYMPSAEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL
Ga0209579_102208512 | 3300027869 | Surface Soil | VMPSADDKKLLLEIIANENKWIAEKEGARDPFSTIGEPRKSAINL
Ga0209275_106345221 | 3300027884 | Soil | ADDKKLLLELIANNKHWIAEKEGARDPFLTIGEPRKSAINL
Ga0209698_112854092 | 3300027911 | Watersheds | KDKELLLDILRNEKKWIKEKEGARDPFATIGEPRRSAINL
Ga0222749_107231352 | 3300029636 | Soil | TAEDKKLLLEIIANDKKWVAEKEGTRDPFATIGEPRKSAINL
Ga0307468_1001329003 | 3300031740 | Hardwood Forest Soil | KKLLLEIIVNEKKWIAEKEGARDPFATIGEPRKSAINL
Ga0307475_110896511 | 3300031754 | Hardwood Forest Soil | EDKKLLLEIIANEKRWIAEKEGARDPFETIGEPRKSAINL
Ga0307473_101743141 | 3300031820 | Hardwood Forest Soil | TLLLDIIRNEEKWIVAKEGGRDPLATISEPRKSAINL
Ga0307473_112961692 | 3300031820 | Hardwood Forest Soil | QLEIIANEKKWIAEKEGARDPFATIGEPRKSAINL
Ga0310917_111625092 | 3300031833 | Soil | LLLELIANEKHWIAEKEGARDPFLTIGEPRKSAINL
Ga0318520_107067702 | 3300031897 | Soil | KLLLEIIANEKKWIAEKEGSRDPFATIGEPRKSAINL
Ga0306926_114559471 | 3300031954 | Soil | LLLEIILNEKNWIVAKENARDPLATIAEPRKSAINL
Ga0318545_101865352 | 3300032042 | Soil | HFAEAMPTAEDKKLLLEIIANEKKWIAEKEGSRDPFATIGEPRKSAINL
Ga0311301_103036375 | 3300032160 | Peatlands Soil | AGEWLPTAADKKLLKETLANEKKWITPRTGARDPFETIAEPRKMAINV
Ga0307471_1042393122 | 3300032180 | Hardwood Forest Soil | EEHLKEYMPSAADKKLLLDIIGTEKKWIAPKEGARDPFETIAEPRKSAINL
Ga0335079_103895731 | 3300032783 | Soil | RAYREHTAECLPSAADKAHLLDIIRNEKSWIKEKEGARDPFATIGEPRRSAINL
Ga0335078_101416744 | 3300032805 | Soil | EVLPSAKDKASLLDVLRNEKKWIKEKQGARDPFATIGEPRRSAINL
Ga0335078_121156992 | 3300032805 | Soil | RLLPTAEDKKLLLEIIANEKQWIAPKVGARDPFETIGEPRKSAINL
Ga0335072_1001135114 | 3300032898 | Soil | VLPSAKDKALLLDVLRNEKKWIKEKEGARDPFATIGEPRRSAINL
Ga0335076_100841471 | 3300032955 | Soil | DKALLLDVLRNEKKWIKEKEGARDPFATIGEPRRSAINL
Ga0335077_101071655 | 3300033158 | Soil | EDKAYERQVSEWLPTAADKKLLKETLATEKKWITPRTGARDPFETIAEPRKGAINVS
Ga0335077_110024751 | 3300033158 | Soil | LKEALPTAADKQLLLDIISTEKKWIAEKEGARDPFETIGEPRKSAINL
Ga0314871_023116_410_529 | 3300033809 | Peatland | DKKLLLEIIANEKKWIAEKEGARDPFATIGEPRKSAINL
Ga0334847_002818_2_112 | 3300033826 | Soil | LLLDIIRNEKKWIKEKEGARDPLATIGEPRKSAINI
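
Some sequences above end in `*` and others do not, which matters when comparing lengths. The sketch below measures residue length and tallies habitats for three entries copied from the table. It assumes the trailing `*` is a translated stop-codon marker, as is conventional in protein FASTA output; the names `records` and `residue_length` are illustrative only.

```python
from collections import Counter

# Three entries copied from the table above:
# (protein ID, sample taxon ID, habitat, sequence).
records = [
    ("Ga0066694_101566651", "3300005574", "Soil",
     "AEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL*"),
    ("Ga0070732_100407631", "3300005542", "Surface Soil",
     "LLEIIANDKKWIAEKEGARDPFATIGEPRKSAINL*"),
    ("Ga0134074_12604271", "3300017657", "Grasslands Soil",
     "AEDKKLLLDIIANEKKWIAEKEGARDPFETIAEPRKSAINL"),
]


def residue_length(sequence: str) -> int:
    """Count residues, ignoring a trailing '*' (assumed stop-codon marker)."""
    return len(sequence.rstrip("*"))


lengths = {protein_id: residue_length(seq) for protein_id, _, _, seq in records}
habitat_counts = Counter(habitat for _, _, habitat, _ in records)

for protein_id, n in lengths.items():
    print(f"{protein_id}: {n} residues")
print(dict(habitat_counts))
```

Stripping the marker first keeps entries with and without `*` comparable, for instance when checking individual members against the reported average length of 42 residues.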




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice