NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103355


Overview

Basic Information
Family ID F103355
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 48 residues
Representative Sequence MLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Number of Associated Samples 96
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.99 %
% of genes near scaffold ends (potentially truncated) 46.53 %
% of genes from short scaffolds (< 2000 bps) 82.18 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Bacteria (63.366 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater (25.743 % of family members)
Environment Ontology (ENVO) Unclassified (53.465 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (92.079 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive view rendered with MSAViewer; the aligned residues are not reproduced here). The 101 member sequences are listed under Family Sequences.



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 91.67%, β-sheet: 0.00%, Coil/Unstructured: 8.33%
Feature Viewer (interactive, not reproduced here): tracks for Sequence, α-helices, β-strands, Coil, SS Conf. score and Signal Peptide over the representative sequence MLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (interactive chart, not reproduced here)
Categories: All Organisms, Unclassified
Values, in the order extracted: 66.3%, 33.7%

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Visualization (interactive chart, not reproduced here)
Habitat types listed in the chart: Seawater, Aqueous, Marine Surface Water, Freshwater To Marine Saline Gradient, Marine, Estuarine, Salt Marsh, Enviromental, Estuarine Water, Pelagic Marine, Water, Sediment, Ocean Water
Values, in the order extracted: 3.0%, 10.9%, 25.7%, 11.9%, 4.0%, 3.0%, 8.9%, 3.0%, 10.9%, 6.9%
The extracted text does not preserve which value belongs to which habitat type.



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).






Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
P_2C_Liq_1_UnCtyDRAFT_10097131 | 3300000418 | Enviromental | KKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
JGI20160J14292_101756221 | 3300001349 | Pelagic Marine | MLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI*
JGI24928J39210_10152462 | 3300002754 | Marine | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII*
JGI26080J50196_10161953 | 3300003345 | Marine | RYHSKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
JGI26081J50195_10876803 | 3300003346 | Marine | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQ
JGI26088J50261_10061073 | 3300003409 | Marine | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
JGI26260J51721_10070481 | 3300003580 | Marine | MLCAACILWFGILLNFAILTMQAGLILARLNEENKLAIKRQRNWEKTI*
JGI26253J51717_10054463 | 3300003583 | Marine | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKVI*
Ga0078893_118821232 | 3300005837 | Marine Surface Water | KNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI*
Ga0075462_100297522 | 3300006027 | Aqueous | MLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0075466_10191312 | 3300006029 | Aqueous | MLCAACILWFGILLNFAILTMQAGLILARLNEENKLAIKRQRNWEKVI*
Ga0075502_10994262 | 3300006357 | Aqueous | MLCAACILWFCTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0075516_12408631 | 3300006384 | Aqueous | MLCAACILWFGTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0075488_15912812 | 3300006397 | Aqueous | MLCAACILWFGVLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0075467_100230792 | 3300006803 | Aqueous | MLCAACILWFATLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0102951_12190781 | 3300007725 | Water | ILWFGILLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0075480_100645604 | 3300008012 | Aqueous | NFAILTMQAGLILARLNEENKLAIKRQRNWEKVI*
Ga0104261_10398182 | 3300008956 | Ocean Water | QNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII*
Ga0115566_107655122 | 3300009071 | Pelagic Marine | VVSQYHIKKIKIKNMLCAACILWFGILLNFAILTAQAGLILARLNEGNKLAIKRQRNWEKTI*
Ga0115549_10432482 | 3300009074 | Pelagic Marine | MLCAACILWFCTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0102814_102776081 | 3300009079 | Estuarine | MLCVAYILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKVI*
Ga0102815_106786192 | 3300009080 | Estuarine | MLCAACILWFGILLNFAILTVQAGLILARLNEGNMLAIKRQRNWEKTI*
Ga0118687_100119604 | 3300009124 | Sediment | MLCAACILWFGTLLNFAILTQQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0115551_10915803 | 3300009193 | Pelagic Marine | ILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0115551_14702912 | 3300009193 | Pelagic Marine | MLCAACILWFGTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEK
Ga0115554_10763612 | 3300009472 | Pelagic Marine | MLCAACILWFGTLLNFAILTVQVGLILARLNEENKLAIKRQRNWEKTI*
Ga0115564_102087493 | 3300009505 | Pelagic Marine | MLCAACILWFGILLNFAISTVQAGLILARLNEGNMLAIKRQRNWEKTI*
Ga0115102_109485171 | 3300009606 | Marine | LSDVEILLKKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII*
Ga0129324_102154562 | 3300010368 | Freshwater To Marine Saline Gradient | MLCAACILWYGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0129329_10170243 | 3300012470 | Aqueous | MLCAACILWSGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0129328_10066673 | 3300012472 | Aqueous | MLRYHSKKIKNMLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI*
Ga0182045_13595243 | 3300016726 | Salt Marsh | FGILLNFAILTEQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0182091_11462991 | 3300016766 | Salt Marsh | KKIKNMLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI
Ga0182046_11848411 | 3300016776 | Salt Marsh | MLCAACILWFGTLLNFAILTMQAGLILARLNEENKLAIKRQRNWEKVI
Ga0182095_15559311 | 3300016791 | Salt Marsh | LLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0181607_106537992 | 3300017950 | Salt Marsh | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEK
Ga0181561_101240002 | 3300018410 | Salt Marsh | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0181560_101304502 | 3300018413 | Salt Marsh | FGTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKVI
Ga0181553_105372442 | 3300018416 | Salt Marsh | CILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0180037_10103791 | 3300019214 | Estuarine | MLKYYSKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0181562_101874131 | 3300019459 | Salt Marsh | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKVI
Ga0206125_100795563 | 3300020165 | Seawater | MLCAACILWFGILLNFAILTVQAGLILARLNEENKLAI
Ga0206125_101619532 | 3300020165 | Seawater | MLCAACILWFCTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI
Ga0206125_103411362 | 3300020165 | Seawater | VVSQYHIKKIKIKNMLCAACILWFGILLNFAILTAQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0206127_11531822 | 3300020169 | Seawater | MLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI
Ga0206130_102812652 | 3300020187 | Seawater | MLCAACILWFGILLNFAILTVQAGLILARLNEGNMLAIKRQRN
Ga0206130_104174291 | 3300020187 | Seawater | MLCAACILWFGIILNFAILTVQAGLILARLNEENKLAIKRQRN
Ga0211504_10277163 | 3300020347 | Marine | MLCAALILWSGVLLNLAILTVKAGLILARLNEGNKLAIKRQRNWEKII
Ga0211505_10111752 | 3300020352 | Marine | VILQYHSKKFKNMLCAALILWSGVLLNLAILTVKAGLILARLNEGNKLAIKRQRNWEKII
Ga0206678_104349592 | 3300021084 | Seawater | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0206682_103201101 | 3300021185 | Seawater | KKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0206123_102873422 | 3300021365 | Seawater | VILRYRSKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKVI
Ga0222717_104360421 | 3300021957 | Estuarine Water | MLCAACILWFGILLNFAILTVQAGLILARLNEGNKL
Ga0222715_101623581 | 3300021960 | Estuarine Water | ITQKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
(restricted) Ga0233426_100616082 | 3300022920 | Seawater | MLCAACILWFGILLNFAILTMQAGLILARLNEENKLAIKRQRNWEKVI
(restricted) Ga0233432_101199192 | 3300023109 | Seawater | MLCAACILWFGILLNFAILTVQAGLILARLNEGNMLAIKRQRNWEKTI
Ga0228688_1035892 | 3300023565 | Seawater | LCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0228679_10124962 | 3300023566 | Seawater | MLCAACILWSGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0228697_1215062 | 3300023674 | Seawater | MLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKII
Ga0228685_10216462 | 3300023701 | Seawater | VVSQYHIKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKII
Ga0233399_10194451 | 3300024231 | Seawater | SILWSGGLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0228653_10411382 | 3300024237 | Seawater | IKNMLCAACILWSGILLNFAILTVQAGLILARLNEGNMLAIKRQRNWEKTI
(restricted) Ga0233444_101931361 | 3300024264 | Seawater | MLCAACILWFDTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0228661_10351632 | 3300024266 | Seawater | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAI
Ga0228660_10384872 | 3300024291 | Seawater | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQQNWEK
Ga0244775_105939172 | 3300024346 | Estuarine | IKNMLCAACILWFGILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0244775_112550071 | 3300024346 | Estuarine | MLCVASILWSGGLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0233393_10139842 | 3300024413 | Seawater | VVSQYHIKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0208148_11278652 | 3300025508 | Aqueous | YHSKKIKNMLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKVI
Ga0209405_10234773 | 3300025620 | Pelagic Marine | VVSQYHIKKIKNMLCAACILWFGILLNFAILTAQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0209716_10306171 | 3300025626 | Pelagic Marine | ILLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209136_10181034 | 3300025636 | Marine | ACILWFGILLNFAILTMQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209251_10245112 | 3300025668 | Marine | MLCAACILWFDTLLNFAPLTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209306_10306931 | 3300025680 | Pelagic Marine | CILWFGTLLNFAILTLQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209602_10181131 | 3300025704 | Pelagic Marine | MLCAACILWFGTLLNFAILTLQAGLILARLNEENKLAIKR
Ga0208150_10350602 | 3300025751 | Aqueous | MLCAACILWFCTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209137_10297414 | 3300025767 | Marine | CAACILWFDTLLNFAILTVQAGLILARLNEENKLAIKRQRNWEKTI
Ga0209307_10638471 | 3300025832 | Pelagic Marine | FGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0247565_10048933 | 3300026406 | Seawater | LSDVEILLKKKFKNMLCVASILWSGGLLNFAILTVKACLILAKLNEGNKLAIKRQRNWEKII
Ga0247589_10102191 | 3300026407 | Seawater | MLKYYSKKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0247580_10934582 | 3300026423 | Seawater | IAISHKKIKNMLCAASILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0247556_11130821 | 3300026427 | Seawater | SILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0247591_10369162 | 3300026434 | Seawater | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWETII
Ga0247577_10273902 | 3300026437 | Seawater | MLKYYSKKKFKNMLCVASILWSGVLLNFATLTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0247600_10151651 | 3300026461 | Seawater | LSDVEILLKKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0247603_10165821 | 3300026468 | Seawater | LLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0228641_10679822 | 3300026491 | Seawater | FKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0228604_10759822 | 3300026506 | Seawater | MLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQ
Ga0208954_10487921 | 3300027081 | Marine | KFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0208950_10024187 | 3300027413 | Marine | ASILWSGGLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0208948_10730332 | 3300027501 | Marine | MLCVASILWSGGLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKI
Ga0247563_10243491 | 3300028095 | Seawater | MLCAASILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKII
Ga0247586_11159292 | 3300028102 | Seawater | LNFAILTVQAGLILARLNEGNKLAIKRQRNWEKII
Ga0233397_10493701 | 3300028111 | Seawater | IKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0256417_10544252 | 3300028233 | Seawater | ILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0247597_10410311 | 3300028334 | Seawater | LSGVEILLKKKFKNMLCVASILWSGVLLNFAILTVKAGLILAKLNEGNKLAIKRQRNWEKII
Ga0233394_10660362 | 3300028391 | Seawater | NISLNILSGIAISHKKIKNMLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0228627_10717522 | 3300028414 | Seawater | MLCAASILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0315320_100568952 | 3300031851 | Seawater | MLCAACILWFGILLNFAILTVQAGLILARLNEGNKLAIKRQRNWEKTI
Ga0315315_101979742 | 3300032073 | Seawater | MLCVASILWSGVLLNFAILTVKAGLFLAKLNEGNKLAIKRQRNWEKII
Ga0314692_103150901 | 3300032754 | Seawater | MLCAACILWFVLLLNFAILTMQAGLILARLNEENKLAIKRQRNWEKVI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice