NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F105496


Overview

Basic Information
Family ID: F105496
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 100
Average Sequence Length: 46 residues
Representative Sequence: STNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Number of Associated Samples: 87
Number of Associated Scaffolds: 100

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 0.00 %
% of genes near scaffold ends (potentially truncated): 96.00 %
% of genes from short scaffolds (< 2000 bps): 94.00 %
Associated GOLD sequencing projects: 77
AlphaFold2 3D model prediction: No
Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Bacteria (98.000 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine (23.000 % of family members)
Environment Ontology (ENVO): Unclassified (79.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline) (80.000 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all 100 sequences in the family, shown in an interactive MSAViewer panel. The member IDs match those listed under Family Sequences.



Structure & Topology

Predicted Topology & Secondary Structure

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 10.64%; β-sheet: 14.89%; Coil/Unstructured: 74.47%
Sequence: STNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Feature Viewer tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy chart (interactive). Labels shown: All Organisms; Unclassified (98.0%).

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat chart (interactive). Habitat types shown: Marine, Seawater, Sackhole Brine, Estuary Water, Estuarine, Marine Estuarine, Methane Seep Mesocosm, Pelagic Marine, Saline Lake, Saline Water, Pond Water, Water, Polar Marine.
Slice values shown: 23.0%, 3.0%, 3.0%, 15.0%, 10.0%, 11.0%, 12.0%, 5.0%, 3.0%.



Associated Samples







Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
none_1351561 | 2236876001 | Marine Estuarine | LYLGQLQINPIILGGLGWEVKNSKVLESEIPQILQGWIIDLSD
DelMOSum2011_101967451 | 3300000115 | Marine | INPIILGGVGWVVKNSKVLESEIPQILQGWIIDSSD*
DelMOSum2011_102150281 | 3300000115 | Marine | VLTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
SI39nov09_135mDRAFT_10034546 | 3300000153 | Marine | VCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
SI39nov09_10mDRAFT_10203102 | 3300000199 | Marine | NPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
SI36aug09_120mDRAFT_10689182 | 3300000239 | Marine | NLFDLGQLQINLSSDGVGWEVKNSKVLESEIPQILQDRVIDLND*
LPfeb09P261000mDRAFT_10327202 | 3300000250 | Marine | LSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
SI34jun09_100mDRAFT_10295511 | 3300000254 | Marine | CSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
JGI20160J14292_100835091 | 3300001349 | Pelagic Marine | LCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
JGI20158J14315_100134518 | 3300001355 | Pelagic Marine | INPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
GOScombined01_1009043381 | 3300002040 | Marine | IGSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
JGI26243J51142_10691762 | 3300003501 | Marine | LFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
JGI26248J51725_10359273 | 3300003589 | Marine | STNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
JGI26248J51725_10510241 | 3300003589 | Marine | NSFLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
JGI26262J51727_11073521 | 3300003602 | Marine | LSTFNNSLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
Ga0073579_11616871 | 3300005239 | Marine | NKSFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
Ga0075119_10419664 | 3300005931 | Saline Lake | LLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
Ga0075447_102782872 | 3300006191 | Marine | LQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0075445_102296542 | 3300006193 | Marine | LFDLGQLQINPIILGGVGWEVKNSKVLESDIPQILQGWIIDLND*
Ga0075444_103015511 | 3300006947 | Marine | SLSYSTNLIDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLNDQLIF*
Ga0102871_11091982 | 3300007620 | Estuarine | STFNSTLSLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
Ga0102827_10030786 | 3300007715 | Estuarine | SCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
Ga0102954_10034766 | 3300007778 | Water | SCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLSD*
Ga0105744_10621301 | 3300007863 | Estuary Water | TLNSSLSLSYSTNLFDLGQLQINPIILGGVGWEVKNSKVFESEIPQILQGWIIDLND*
Ga0102907_11305532 | 3300007962 | Estuarine | NNSLSLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLNDLLIF*
Ga0105349_102520972 | 3300008253 | Methane Seep Mesocosm | STFNSSLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
Ga0102909_11303372 | 3300009050 | Estuarine | LSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRVIDLSD*
Ga0102892_10328681 | 3300009057 | Estuarine | STFNSSLSLSCTTNHFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
Ga0102814_102397281 | 3300009079 | Estuarine | NLFDLGQLQINPIILGGVGWEVKNSKILESEIPQILQGWIIDLND*
Ga0102815_106297531 | 3300009080 | Estuarine | SLLCPTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD*
Ga0114995_104297852 | 3300009172 | Marine | TNLFDFGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLND*
Ga0114995_104841262 | 3300009172 | Marine | DLGQLQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLSD*
Ga0114996_106489221 | 3300009173 | Marine | QINPIILGGVGWAVKNSKILESEIPQILQDRVIDLND*
Ga0114994_110599912 | 3300009420 | Marine | LIDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLND*
Ga0114998_104408062 | 3300009422 | Marine | QLQINPIILGGVGCEVKNSKVLESEIPQILQGWIIDLND*
Ga0114997_104967141 | 3300009425 | Marine | GQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115008_104832391 | 3300009436 | Marine | TFNSFLSLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115559_11086431 | 3300009438 | Pelagic Marine | DFGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115563_12387371 | 3300009442 | Pelagic Marine | TFNSSLSLLFSTNLFDLGQLQINPIILGGVGCEVKNSKVLESEIPQILQGWIIDLND*
Ga0115554_13238751 | 3300009472 | Pelagic Marine | LSLLCSTNLFDFGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115555_13448881 | 3300009476 | Pelagic Marine | LGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD*
Ga0115003_107113201 | 3300009512 | Marine | GQLQINPIILGGAGWEVKNSKVLESEIPQILQGWIIDLNG*
Ga0115003_107139282 | 3300009512 | Marine | LRLSNSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115003_107855652 | 3300009512 | Marine | DLGQLQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLND*
Ga0115004_105646871 | 3300009526 | Marine | NLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLND*
Ga0115000_108320792 | 3300009705 | Marine | NSSLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND*
Ga0115001_104093942 | 3300009785 | Marine | LQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLSD*
Ga0115001_108923962 | 3300009785 | Marine | DLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLNYQLIF*
Ga0114999_110325162 | 3300009786 | Marine | QLQINPIIFGGVGWVVKNSKVLESEIPQILQGWIIDLND*
Ga0138265_11976071 | 3300012408 | Polar Marine | SYSTNLIDLGQLQINPIILGGVGWEVKNSKVLESQIPQILQGWIIDLNDQLIF*
Ga0181398_11497742 | 3300017725 | Seawater | RLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD
Ga0211684_10482662 | 3300020304 | Marine | NLIDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0211681_10461522 | 3300020309 | Marine | DLGQLQINPIILGGLGWEVKNSKVLESEIPQILQGWIIDLND
Ga0211686_104663982 | 3300020382 | Marine | TNLFELGQLQINPIILGGVGWEVKNSKVFESEIPQILQGWIIDLND
Ga0211687_103019731 | 3300020396 | Marine | GQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0206682_103543362 | 3300021185 | Seawater | CSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLNDLLIF
Ga0222691_10325511 | 3300022851 | Saline Water | INPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
(restricted) Ga0233426_102357862 | 3300022920 | Seawater | LQINPIILGGLGWAVKNFKVLESEIPQILQGWIIDSND
(restricted) Ga0233427_100743271 | 3300022933 | Seawater | FNSFLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
(restricted) Ga0233439_104103791 | 3300024261 | Seawater | LSLLRSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0244775_112720342 | 3300024346 | Estuarine | TFNSFLSLSCLTNLFDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLSD
Ga0244776_103118043 | 3300024348 | Estuarine | LSLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLNDLLIF
Ga0209556_11021351 | 3300025547 | Marine | STNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD
Ga0209658_10850292 | 3300025592 | Marine | GQLQINPIILGGVGWEVKNSKVLESEIPQILQDRVIDLSD
Ga0209041_11739781 | 3300025623 | Marine | TNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD
Ga0209360_11578972 | 3300025665 | Marine | INPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD
Ga0209661_11351572 | 3300025700 | Marine | SCSTNLFDLGQLQINPIIXGGVGWEVKNSKVLESEIPQILQDRVIDLSD
Ga0209252_11412582 | 3300025719 | Marine | NPIILGGVGWEVKNSKVLESEIPQILQDRIIDLSD
Ga0209308_103650781 | 3300025869 | Pelagic Marine | NPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0209632_103718602 | 3300025886 | Pelagic Marine | SLSCSTNLFDLGQLQINPINLGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0209932_10270474 | 3300026183 | Pond Water | NSFLSLLCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0208803_10103551 | 3300027232 | Estuarine | SLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0208680_10738632 | 3300027253 | Estuarine | LQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0209482_11295711 | 3300027668 | Marine | LQINPIILGGVGWEVKNSKVLESKIPQILQGCIIDLND
Ga0209816_10252245 | 3300027704 | Marine | TNLFDLGQLQINPIILGGVGXEVKNSKVLESKIPQILQGWIIDLND
Ga0209816_10750931 | 3300027704 | Marine | STNLFDLGQLQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLND
Ga0209816_11574231 | 3300027704 | Marine | SLSLSCSTNLFDLGQLQINPIILGGVEWEVKNSKVLESEIPQILQGWIIDLND
Ga0209192_103195622 | 3300027752 | Marine | GQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0209502_102431821 | 3300027780 | Marine | LSVSCSTNLIDLGQLQINPIILGGVGWEVKNSNVLESEIPQILQGWIIDLND
Ga0209502_102894412 | 3300027780 | Marine | INPIILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0209830_104202852 | 3300027791 | Marine | FDLGQLQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLND
Ga0209091_102501841 | 3300027801 | Marine | QIKPIIFGGVGCEVKNSKVLKSEIPQILQGWIIGLND
Ga0209090_102340501 | 3300027813 | Marine | STNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0209035_105636042 | 3300027827 | Marine | TNLFDLGQLHINPTILGGVGWEVKNSKVLESEIPQILQGWIIDLSD
Ga0209402_106490372 | 3300027847 | Marine | LPCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0257106_10830651 | 3300028194 | Marine | QLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0257110_12547222 | 3300028197 | Marine | GQLQINPIILGGVGWAVKNSKVLESEIPQILQDRVIDLND
Ga0308010_12352611 | 3300031510 | Marine | LQINPIILGGLGWEVKNSKVLKSEIPQILQGWIIDLND
Ga0307488_101807451 | 3300031519 | Sackhole Brine | SLSCSTNLFDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLSD
Ga0307996_11224171 | 3300031589 | Marine | FDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLNDQLIF
Ga0308007_101700591 | 3300031599 | Marine | GQLQIKPIILGGVGCEVKNSKVLESEIPQILQDWIIDLND
Ga0308004_101320241 | 3300031630 | Marine | LGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0308004_102274572 | 3300031630 | Marine | STNLFDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLNDQLIF
Ga0308004_104080192 | 3300031630 | Marine | YSTNLFDLGQLQINPIILGGVGWEVKNSRVLESEIPQILQGWIIDLNDQLIF
Ga0308018_101604941 | 3300031655 | Marine | NPIIFGGVGWEVKNSKVLESDIPQILQGWIIDLND
Ga0307986_101685801 | 3300031659 | Marine | TFNSSLSLSYSTNLIDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLNDQLI
Ga0307986_102618652 | 3300031659 | Marine | INPIILGGVGWVVKNSKVLESEIPQILQGWIIDLND
Ga0307986_103981021 | 3300031659 | Marine | STNLFDLGQLQINPIILGGVGWEVKNSRVLESEIPQILQGWIIDLNDQLIF
Ga0308016_101778731 | 3300031695 | Marine | CSTNLFDFGQLQINPIILGGVGWEVKNSKVLESEIPQILQGWIIDLND
Ga0307998_12270211 | 3300031702 | Marine | TNLIDLGQLQINPIILGGVGWEVKNSKVLESETPQILQGWIIDLNDQLIF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice