NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F091658

Overview

Basic Information
Family ID: F091658
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 107
Average Sequence Length: 46 residues
Representative Sequence: GSTASTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Number of Associated Samples: 78
Number of Associated Scaffolds: 107

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 4.81 %
% of genes near scaffold ends (potentially truncated): 84.11 %
% of genes from short scaffolds (< 2000 bps): 88.79 %
Associated GOLD sequencing projects: 72
AlphaFold2 3D model prediction: No

Most Common Taxonomy
Group: Unclassified (57.944 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (57.944 % of family members)
Environment Ontology (ENVO): Unclassified (79.439 % of family members)
Earth Microbiome Project Ontology (EMPO): Unclassified (57.944 % of family members)
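
The percentages above are stated as shares of family members, so they can be turned back into approximate member counts. The following is a minimal sketch in Python, assuming the denominator is the 107 sequences listed under Basic Information; the dictionary labels and the function name `members_from_percent` are hypothetical and not part of NMPFamsDB.

```python
# Convert "% of family members" figures into approximate member counts.
# Assumption: every percentage is taken over the 107 sequences of the family.
FAMILY_SIZE = 107  # "Number of Sequences" from Basic Information

# Percentages as reported on the page; the keys are illustrative labels only.
reported_percentages = {
    "Taxonomy: Unclassified": 57.944,
    "GOLD ecosystem: Forest Soil -> Soil": 57.944,
    "ENVO: Unclassified": 79.439,
    "EMPO: Unclassified": 57.944,
}


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported percentage."""
    return round(percent / 100 * family_size)


member_counts = {
    label: members_from_percent(percent)
    for label, percent in reported_percentages.items()
}

for label, count in member_counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer is a design choice that absorbs the three-decimal rounding of the reported percentages.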



Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 14.58%, β-sheet 0.00%, Coil/Unstructured 85.42%

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 42.1%
Unclassified: 57.9%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types (chart legend): Vadose Zone Soil, Tropical Forest Soil, Soil, Soil, Forest Soil, Soil, Tropical Rainforest Soil
Labeled shares: 14.0%, 57.9%, 23.4%



Associated Samples


Geographical Distribution

Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
Ga0008090_157029921  3300005363  Tropical Rainforest Soil  MRGSMASTPSNIQPHVYVWVGFRDHIVQRMEFNDHEDHEKNWNGADDCAC*
Ga0066659_115577143  3300006797  Soil  MRGSTASTPSNVQPHAYVCVGFRDHIVQRMEFDDHEDH
Ga0126373_110003542  3300010048  Tropical Forest Soil  VARLTIRGSTASTPSNSQPLAYVCVGFRDHIVQRMEFDDHEGHEKN*
Ga0126373_114594471  3300010048  Tropical Forest Soil  HAYVCIGFHDHIVQRMEFDDHEDHEKNWNDTDDCAC*
Ga0126373_116243111  3300010048  Tropical Forest Soil  HAYVCVGFRDHIVQRMEFDDHFDHQKNWNDADDCAC*
Ga0126373_119419682  3300010048  Tropical Forest Soil  QPHAYVCVGFRDHIVQRMEFDDHENHQKSWNDADDCAC*
Ga0126373_123336382  3300010048  Tropical Forest Soil  MRGSTSSTPSNINPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC*
Ga0126376_128926611  3300010359  Tropical Forest Soil  KHQPHAYVCVGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC*
Ga0126372_122472222  3300010360  Tropical Forest Soil  MRGSTALTPSNINPHAYVCVGFRDHIVQRMEFDDHEDHQKNWSDSDDCPC*
Ga0126379_112482781  3300010366  Tropical Forest Soil  VARLTMRGSTASTPSNIQPHVYVCVGFRDHIVQRMEFNDHEDHEKN
Ga0126381_1009194121  3300010376  Tropical Forest Soil  VTRLRVRGSTVSTPSNIQPHAYVCIGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC*
Ga0126381_1016620073  3300010376  Tropical Forest Soil  PTNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNETDDCAC*
Ga0126381_1034508402  3300010376  Tropical Forest Soil  VARLTMRGSTASMPTNIQPHAYVCVGFRDHIVQRMEFDDNEDHDK
Ga0126383_129472932  3300010398  Tropical Forest Soil  STQSNINPHAYVCVGFRDHVVQRMEFDDHEDHDKNWNDTDDCAC*
Ga0137397_100698421  3300012685  Vadose Zone Soil  AYVCIGFRDHIVQRMEFDNHEDHQKSWNDTDDCAC*
Ga0182041_122046522  3300016294  Soil  AYVCVGFRDHIVQRMEFDDHENHQKNWNDTDDCAC
Ga0182035_103099771  3300016341  Soil  VARITMRGSTTSTPSNIQPHGYVCVGFRDHIVQRMEFDDHEDHQKNWNDADDCAC
Ga0182035_110297261  3300016341  Soil  TTAMPTNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0182035_113900392  3300016341  Soil  TSSTPSNINPHAYVCVGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0182035_120949471  3300016341  Soil  TRLTMRGSTASSQRNIQPHAYICVGLRDKIVKRKEFDNHEDHQKNWNDTDDCAC
Ga0182032_120635081  3300016357  Soil  HAYVCVGFRDHIVQRMEFDDHEDYQKNWNDADDCGC
Ga0182034_107189814  3300016371  Soil  SVARLTMRGSTASTPSNIQPHAYVCVGFRDDIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0182040_118102492  3300016387  Soil  LTMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0182037_105125251  3300016404  Soil  IQPHAYVCIGYRDHIVHRMEFDDHAVRQKNWNDTDDCAC
Ga0182039_112775352  3300016422  Soil  NIQPHAYVCVGFRDHIVQRMEFDDHENHQKNWNDTDDCAC
Ga0182039_114877492  3300016422  Soil  FHSVARLTMRGSTSSTPTNINPHAYVCVGFRDHIVQRMEFDDHENHERNWNDTDDCAC
Ga0182038_116623762  3300016445  Soil  TMRGSTAPMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHERNWNDTDDCAC
Ga0182038_117990541  3300016445  Soil  PHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0210407_104602691  3300020579  Soil  IRASTVSMPDNIQPHAYVCIGFRDHIVQRMEFDNHEDHQKSWNDTDDCAC
Ga0210408_111212191  3300021178  Soil  RGSAASLPGNLQPHGYVCVGFQDHIVQRMEFDDHEDHQRNWSDTDDCGC
Ga0126371_116601443  3300021560  Tropical Forest Soil  VARLTMRGSTASTPTNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNETDDCAC
Ga0126371_124964391  3300021560  Tropical Forest Soil  ISPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0126371_132546912  3300021560  Tropical Forest Soil  MRGSTSSTPSNINPHAYVCLGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0170820_150057511  3300031446  Forest Soil  RFHSVARLTMRGSTASMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0170818_1091772853  3300031474  Forest Soil  TNIQPHAYVCVGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0318534_104691233  3300031544  Soil  INPHAYVCVGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0318528_105482331  3300031561  Soil  MRGSTSSTPSNINPHAYVCVGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0318528_107945992  3300031561  Soil  MRGSTASTPSNIQPHAYICVGFRDHIVQRMEFDNHEDHQKNWNDTDDCAC
Ga0310915_103500233  3300031573  Soil  AYVCVGFRDHIVQRMEFDNHEDHQKNWNDTDDCAC
Ga0318555_101983683  3300031640  Soil  TMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318561_107683181  3300031679  Soil  SVVRLRMRGSTVSTPSNIQPHAYVCIGYRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318572_101343353  3300031681  Soil  MRGSTSSTPNNINPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCDC
Ga0318572_101874403  3300031681  Soil  HSVARLTMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318560_100335761  3300031682  Soil  ASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318560_100442161  3300031682  Soil  SNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318496_101807133  3300031713  Soil  PMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDYAC
Ga0306917_110938041  3300031719  Soil  VPRLTMRRSTASTPTNIPPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318493_106382901  3300031723  Soil  VPRLYIRGITVSMPGNIQPHAYICIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0318500_100070661  3300031724  Soil  PMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318500_102393443  3300031724  Soil  RFHSVARLTMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318501_108215122  3300031736  Soil  HAYVCIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0306918_102121791  3300031744  Soil  STVAMAGNIQPHANVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0306918_107767772  3300031744  Soil  QPHTYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318502_102402693  3300031747  Soil  LTMRGSTVAMPGNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318492_106377062  3300031748  Soil  PTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318535_103257912  3300031764  Soil  IPGNLQPHAYVCIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0318554_101996061  3300031765  Soil  AKLTMRGSTASTPTNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318546_100444102  3300031771  Soil  MPGNIQPHAYLCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318546_101653423  3300031771  Soil  VARLTMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318566_102568881  3300031779  Soil  RGSTTSTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318552_104788401  3300031782  Soil  LTMRGSTAPMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEMNWNDTDDCAC
Ga0318529_101810461  3300031792  Soil  NIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318548_103928602  3300031793  Soil  TSSTSGNIQPHAYVCVGFRDHMVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318503_100645283  3300031794  Soil  AYVCIGFRDHIVQRMEFDDHEDHEKNWNGTDDCAC
Ga0318557_100194723  3300031795  Soil  APMPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318550_106464341  3300031797  Soil  MRGSTASTPTNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318497_106443541  3300031805  Soil  SGNIQPHAYVCVGFRDHIVQRMEFDDREDHQKNWNDTDDCAC
Ga0318499_103907772  3300031832  Soil  AYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCDC
Ga0318517_104720132  3300031835  Soil  LYIRGITVSILGNIQPHAYVCIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0318512_104989562  3300031846  Soil  MRGSTSSTPNNINPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318527_101912561  3300031859  Soil  VSTPSNIQPHAYVCIGYRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318544_100584753  3300031880  Soil  MRGSTSSTPNNINPHAYVCVGFRDHIVQRMEFEDHEDHEKNWNDTDDCDC
Ga0306925_104549001  3300031890  Soil  NPHAYVCVGFRDHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0306925_112837543  3300031890  Soil  PGNIQPHAYVCVGFRDHIVQRMEFDNHEDHQKNWNDTDDCAC
Ga0318536_101645802  3300031893  Soil  STPNNINPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCDC
Ga0318536_105304691  3300031893  Soil  SSTPTNINPHAYVCVGFRDHIVQRMEFDDHENHERNWNDTDACAC
Ga0318551_103338633  3300031896  Soil  MRGSTASTPSNIQPHAYICVGFRDHIVQRMEFDNHEDHQKNWN
Ga0318551_105196721  3300031896  Soil  ASTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0318551_109599601  3300031896  Soil  AYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0306923_125160522  3300031910  Soil  MRGSTVAMPGNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0306921_100286484  3300031912  Soil  MRGSTVSTPSNIQPHAYVCAGFRDHIVQRMEFDDHEDYQKNWNDADDCGC
Ga0306921_101130275  3300031912  Soil  MRGSTASTPSNIQPHAYICVGFRDHIVQRMEFDDHEDHQKNLNDTDDCACQRLA
Ga0306921_107502631  3300031912  Soil  SNINPHAYVCVGFRDHIVQRMEFDDHEDHERNWNDTDDCAC
Ga0310912_110081872  3300031941  Soil  QPHAYVCIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0310913_102124971  3300031945  Soil  PHVYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0310913_107828061  3300031945  Soil  STASTPTNIPPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0310910_103536191  3300031946  Soil  LTMRGSTSSTPSNINPHAYVCVGFRDHIVQRMEFDDHEDHERNWNDTDDCAC
Ga0306926_112710521  3300031954  Soil  MRGSTVSTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDYQKNWN
Ga0318530_104778581  3300031959  Soil  VPKLYIRGITVSIPGNIQPHAYVCIGFRDHIVQRMEFDDHAVRQKNWNDTDDCAC
Ga0318531_100123084  3300031981  Soil  AYICVGFRNHIVQRMEFDDHEDHDKNWNDTDDCAC
Ga0306922_107241363  3300032001  Soil  SVARITMRGSTASTPTNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0306922_121824862  3300032001  Soil  INPHAYVCVGFRGHIVQRMEFDDHEDHEKNWSDADDCAC
Ga0318559_101877361  3300032039  Soil  PGNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318559_106231151  3300032039  Soil  FHSVARLTMRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318545_103501791  3300032042  Soil  RLTMRGSTVPTSSNIQPHAYVCVGFRDHIVQRMEFDDHEDNERNWHDTDDCAC
Ga0318558_102511001  3300032044  Soil  MRGSTSSTPNNINPHAYVCVGFRDHIVQRMEFDDHEDREKNWNHTDDCAC
Ga0318570_103780351  3300032054  Soil  TVRGSTVSTSSNIQPHAYVCVGFRDHIVQRMEFDDHQDHERNWNDTDDCAC
Ga0318575_101975933  3300032055  Soil  SPSCCVGFCDHIVQRMEFDDYEDHQKNWNDTDGCAC
Ga0318533_102626193  3300032059  Soil  MRGSTVSTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDYQKNWNDADDCGC
Ga0318505_101322251  3300032060  Soil  YAVARLTMRGSTASTSTNIQPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318514_107480981  3300032066  Soil  RGSTVSTPSNIQPHAYVCIGYRDHIVQRMEFDDHEDHEKNWNGTDDCAC
Ga0318524_105035382  3300032067  Soil  IQPHAYVCIGFRDHIVQRMEFDDYAVRQKNWNDTDDCAC
Ga0318525_105919112  3300032089  Soil  VARLTMRGSTVSMPGNIQPHVYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
Ga0318518_100781313  3300032090  Soil  MRGSTASTPTNIQPHVYVCVGFRDHIVQRMEFDDHEDHAKNWNDTNDCAC
Ga0318540_100463611  3300032094  Soil  PSCCVGFCDHIVQRMEFDDYEDHQKNWNDTDGCAC
Ga0310914_107409311  3300033289  Soil  GSTASTPSNIQPHAYVCVGFRDHIVQRMEFDDHEDHEKNWNDTDDCAC
Ga0310914_113429692  3300033289  Soil  NIPPHAYVCVGFRDHIVQRMEFDDHEDHQKNWNDTDDCAC
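
Records like the ones listed above can be summarised programmatically, for instance to tally habitats or measure member lengths. The sketch below is illustrative only: it hard-codes four rows from the listing, assumes each sample taxon ID is the 10-digit number that follows the protein ID, and assumes a trailing `*` marks a translated stop codon rather than a residue. The `FamilyMember` type and the helper names are hypothetical.

```python
from collections import Counter
from typing import NamedTuple


class FamilyMember(NamedTuple):
    protein_id: str
    sample_taxon_id: str
    habitat: str
    sequence: str


# Four rows taken from the Family Sequences listing. The field boundaries
# are an assumption: 10-digit sample taxon ID, habitat name ending in "Soil".
MEMBERS = [
    FamilyMember("Ga0008090_157029921", "3300005363", "Tropical Rainforest Soil",
                 "MRGSMASTPSNIQPHVYVWVGFRDHIVQRMEFNDHEDHEKNWNGADDCAC*"),
    FamilyMember("Ga0066659_115577143", "3300006797", "Soil",
                 "MRGSTASTPSNVQPHAYVCVGFRDHIVQRMEFDDHEDH"),
    FamilyMember("Ga0126373_114594471", "3300010048", "Tropical Forest Soil",
                 "HAYVCIGFHDHIVQRMEFDDHEDHEKNWNDTDDCAC*"),
    FamilyMember("Ga0137397_100698421", "3300012685", "Vadose Zone Soil",
                 "AYVCIGFRDHIVQRMEFDNHEDHQKSWNDTDDCAC*"),
]


def residues(member: FamilyMember) -> str:
    """Return the sequence without a trailing '*' (assumed stop marker)."""
    return member.sequence.rstrip("*")


def has_stop_marker(member: FamilyMember) -> bool:
    """True if the listed sequence ends with '*'."""
    return member.sequence.endswith("*")


# Number of members per habitat and residue count per protein ID.
habitat_counts = Counter(member.habitat for member in MEMBERS)
lengths = {member.protein_id: len(residues(member)) for member in MEMBERS}

for member in MEMBERS:
    marker = "with" if has_stop_marker(member) else "without"
    print(f"{member.protein_id}: {lengths[member.protein_id]} residues, "
          f"{marker} stop marker, {member.habitat}")
```

Extending `MEMBERS` to all 107 rows would allow the habitat tally to be compared with the shares reported under Environmental Properties.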


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice