NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F079460

Metagenome Family F079460

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F079460
Family Type Metagenome
Number of Sequences 115
Average Sequence Length 39 residues
Representative Sequence MSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Number of Associated Samples 60
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 11.30 %
% of genes near scaffold ends (potentially truncated) 48.70 %
% of genes from short scaffolds (< 2000 bps) 84.35 %
Associated GOLD sequencing projects 53
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (52.174 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(28.696 % of family members)
Environment Ontology (ENVO) Unclassified
(49.565 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(73.913 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70
1AF_2010_repII_A01DRAFT_10198321
2AF_2010_repII_A1DRAFT_100138663
3AF_2010_repII_A1DRAFT_101329753
4AF_2010_repII_A100DRAFT_10188432
5Ga0066388_1028849563
6Ga0066903_1002910614
7Ga0066903_1003005844
8Ga0066903_1014784193
9Ga0066903_1018786933
10Ga0066903_1026872602
11Ga0066903_1027330663
12Ga0066903_1036071103
13Ga0066903_1040090031
14Ga0066903_1055522222
15Ga0066903_1059502341
16Ga0066903_1063950822
17Ga0066903_1088032761
18Ga0070717_102254022
19Ga0126374_106502522
20Ga0126374_111516272
21Ga0126380_113935682
22Ga0126384_105690271
23Ga0126384_116086272
24Ga0126382_119223661
25Ga0126373_110141552
26Ga0126373_120820381
27Ga0126373_123037432
28Ga0126370_101246754
29Ga0126370_125109082
30Ga0126376_101265271
31Ga0126378_103072561
32Ga0126378_107382542
33Ga0126378_130543732
34Ga0126379_110916283
35Ga0126379_119424823
36Ga0126379_121143051
37Ga0126381_1002527251
38Ga0126381_1003542781
39Ga0126381_1005134784
40Ga0126381_1012864161
41Ga0126381_1039515332
42Ga0126383_103583532
43Ga0126383_105047873
44Ga0124850_10369181
45Ga0124850_11092901
46Ga0124844_12443381
47Ga0150983_150452302
48Ga0137377_110828682
49Ga0164303_104079601
50Ga0126369_104167664
51Ga0126369_111650161
52Ga0126369_111935562
53Ga0126369_134135072
54Ga0182041_100525114
55Ga0182041_101519745
56Ga0182033_100910761
57Ga0182032_108138891
58Ga0182032_109946062
59Ga0182034_100880893
60Ga0182034_102167522
61Ga0182034_102903551
62Ga0182034_113738151
63Ga0182034_116749291
64Ga0182037_101598123
65Ga0182037_102235141
66Ga0182037_115175862
67Ga0182038_101762355
68Ga0126371_101753935
69Ga0126371_102950321
70Ga0126371_115334963
71Ga0126371_136410111
72Ga0207700_114772592
73Ga0318516_102259922
74Ga0318541_102973811
75Ga0318573_102596141
76Ga0318515_102066981
77Ga0310915_100315331
78Ga0310915_100542803
79Ga0310915_101367392
80Ga0310915_101861073
81Ga0310915_110769932
82Ga0306917_103080792
83Ga0306917_103918983
84Ga0318500_105306191
85Ga0318501_108380061
86Ga0306918_104010951
87Ga0306918_105873262
88Ga0306918_114032051
89Ga0318554_105712571
90Ga0318546_100493791
91Ga0318547_106550632
92Ga0318576_100865024
93Ga0318512_105177402
94Ga0306919_100432602
95Ga0306919_104181421
96Ga0306925_100275456
97Ga0306925_120201951
98Ga0306923_115974481
99Ga0306921_107698541
100Ga0310916_104669761
101Ga0310913_103545011
102Ga0310913_112928241
103Ga0310910_109107371
104Ga0306922_103851981
105Ga0318507_100795592
106Ga0318559_103029501
107Ga0318532_100474601
108Ga0318506_104917211
109Ga0318540_105581321
110Ga0306920_10001444811
111Ga0306920_1001815031
112Ga0306920_1026228951
113Ga0306920_1031347392
114Ga0310914_101448311
115Ga0310914_103448532
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 83.78%    β-sheet: 0.00%    Coil/Unstructured: 16.22%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVGExtracel.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
52.2%47.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Vadose Zone Soil
Tropical Forest Soil
Soil
Forest Soil
Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
28.7%23.5%3.5%26.1%13.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AF_2010_repII_A01DRAFT_101983213300000580Forest SoilMSLRQFIPAVIVGLLLAALLLWLTLGWLLDVITLPVG*
AF_2010_repII_A1DRAFT_1001386633300000597Forest SoilMSLRQFLPAVIAGLLLAALLLWLTLGWLLDAITLPVG*
AF_2010_repII_A1DRAFT_1013297533300000597Forest SoilMSPRQFFIPAVIVGLLLAALLLSLTLGWLLDAYAACWLA
AF_2010_repII_A100DRAFT_101884323300000655Forest SoilMSLKQFIPAVIVGLLVAALLLWFTLGWLLDAITLPVG*
Ga0066388_10288495633300005332Tropical Forest SoilMLERQRIRTGPKRFIPAVIAGLLLAALLLWLTLGWLLDAITLPA
Ga0066903_10029106143300005764Tropical Forest SoilMSLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0066903_10030058443300005764Tropical Forest SoilMSLRQFIPTVIVGLLLSALLLWLTLGWLLDAITLPVG*
Ga0066903_10147841933300005764Tropical Forest SoilMSPRQFILAVTAGLLLSALLLWLTLAWLLNAITLPVG*
Ga0066903_10187869333300005764Tropical Forest SoilMSPKLFIPGVIVGLLLAALLLWLMLGFLLDEITLPVG*
Ga0066903_10268726023300005764Tropical Forest SoilMSLKQFIPAVIVGLLLSALLLWLTLGWLLDAVTLPVG*
Ga0066903_10273306633300005764Tropical Forest SoilLCSLVMSLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0066903_10360711033300005764Tropical Forest SoilMSPRQFFIPAVIVGLLLAALLMWLTLAPLLDAITLPVG*
Ga0066903_10400900313300005764Tropical Forest SoilVLGVFLRLSLRQFIPAVIVGLLLAALLLWLTLGWLLDAI
Ga0066903_10555222223300005764Tropical Forest SoilMSLRQFIPAVIVGLLLAALLLWLTLAWLLDAITLPVG*
Ga0066903_10595023413300005764Tropical Forest SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG*
Ga0066903_10639508223300005764Tropical Forest SoilVQQFIPAVIAGLLLAALLLWLTLGWLLNTITLPVG*
Ga0066903_10880327613300005764Tropical Forest SoilVNRMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLRVG*
Ga0070717_1022540223300006028Corn, Switchgrass And Miscanthus RhizosphereMSLRQFIPAVIVGLLLAALLLWFTLGWLLDAITLPVG*
Ga0126374_1065025223300009792Tropical Forest SoilMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126374_1115162723300009792Tropical Forest SoilMSPRQFFIPAVIVGLLLAALLLSLTLGWLLDAITLPVG*
Ga0126380_1139356823300010043Tropical Forest SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLQVRDTKA*
Ga0126384_1056902713300010046Tropical Forest SoilMSPKRFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126384_1160862723300010046Tropical Forest SoilMSLRQFIPAVIVGLLVAALLLWFTLGWLLDAITLPVG*
Ga0126382_1192236613300010047Tropical Forest SoilHRMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126373_1101415523300010048Tropical Forest SoilMSPKRFIPGVIAGLLLAALLLWLTLGWLLDAITLPAG*
Ga0126373_1208203813300010048Tropical Forest SoilGDRMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126373_1230374323300010048Tropical Forest SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLNTITLPVG*
Ga0126370_1012467543300010358Tropical Forest SoilMSPKRFIPAVIAGLLLAALLLWLTLGWLLDAITLPAG*
Ga0126370_1251090823300010358Tropical Forest SoilMSLRQFILAVIVGLLLAALLLWLTLARLLDAITLPVG*
Ga0126376_1012652713300010359Tropical Forest SoilVSLRQFILAVIVGLLLAALLLWLTLAWLLDAITLPVG*
Ga0126378_1030725613300010361Tropical Forest SoilMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPV
Ga0126378_1073825423300010361Tropical Forest SoilMSLRQFIPAIIAGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126378_1305437323300010361Tropical Forest SoilRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126379_1109162833300010366Tropical Forest SoilMSLRQFIPAVIVGLLLATLLLWLTLGWLLDAITLPVG*
Ga0126379_1194248233300010366Tropical Forest SoilMSPKRFIPAVIAGLLVAALLLWLTLGWLLDAITLPAG*
Ga0126379_1211430513300010366Tropical Forest SoilMSLWQFIPAVIAGLLLAALLLWFTLGWLLTTITLPVG*
Ga0126381_10025272513300010376Tropical Forest SoilMSLKQFIPAVIVGLLLAALLLWLTLGWLLNAITLPVG*
Ga0126381_10035427813300010376Tropical Forest SoilHHASHHMSLRQFIPAVIVGLLLAALLLWLALGWLLDAITLPVG*
Ga0126381_10051347843300010376Tropical Forest SoilMSLKRFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126381_10128641613300010376Tropical Forest SoilRMSPKRFIPAVIAGLLVAALLLWLTLGWLLDAITLPAG*
Ga0126381_10395153323300010376Tropical Forest SoilMSLRQFIPAVIVGLLLTALLLWLTLGWLLDAITLPVG*
Ga0126383_1035835323300010398Tropical Forest SoilMSLRHFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG*
Ga0126383_1050478733300010398Tropical Forest SoilMNLRSFIPAVIVGLLLAALLLWLTLGWLLDAITLSAG*
Ga0124850_103691813300010863Tropical Forest SoilKQFIPAVIVGLLVAALLLWFTLGWLLDAITLPVG*
Ga0124850_110929013300010863Tropical Forest SoilMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLRVG*
Ga0124844_124433813300010868Tropical Forest SoilMSLRQFILAVTAGLLLSALLLWLTLARLLDAITLPAG*
Ga0150983_1504523023300011120Forest SoilMSLRQFIPAIIVGLLLAALLLWLTLAWLLDAITLPVG*
Ga0137377_1108286823300012211Vadose Zone SoilMSLKHFIPAVIVGLLVAALLLWLTLGWLLNAITLPVG*
Ga0164303_1040796013300012957SoilMSLRQFIPAIIVGLLLASLLLWLTLAWLLDAITIPVG*
Ga0126369_1041676643300012971Tropical Forest SoilLIREQFIPAVIVGLLLAALLLWLTLGWLLDAITLPAG*
Ga0126369_1116501613300012971Tropical Forest SoilMSLRQFIPVVIVGLLLAALLLWLTLAWLLDAITLPVG*
Ga0126369_1119355623300012971Tropical Forest SoilMSLRHFIPAVIAGLLLAAVPLWLTLDWLLNAITMPVG*
Ga0126369_1341350723300012971Tropical Forest SoilMSPRQFFIPAVIVGLLLAALLLSLTLGWLLDAITL
Ga0182041_1005251143300016294SoilLRQFILAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0182041_1015197453300016294SoilMTAVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVCPPSIVDIA
Ga0182033_1009107613300016319SoilESHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITMPVG
Ga0182032_1081388913300016357SoilRHEHRSARCMSLKQFIPAVIVGLLVAVLLLWLTLGWLLDAITLPVG
Ga0182032_1099460623300016357SoilESHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0182034_1008808933300016371SoilVRSFTNSHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLHAITLPVG
Ga0182034_1021675223300016371SoilHESHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITMPVG
Ga0182034_1029035513300016371SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPV
Ga0182034_1137381513300016371SoilCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0182034_1167492913300016371SoilMGLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPV
Ga0182037_1015981233300016404SoilLRQFIPAVIAGLLLAALLLWLTLGWLLHAITLPVG
Ga0182037_1022351413300016404SoilVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVCPPSIVDIA
Ga0182037_1151758623300016404SoilMNSHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0182038_1017623553300016445SoilHRMSRRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVCPPSIVDIA
Ga0126371_1017539353300021560Tropical Forest SoilVSLRQFILAVIVGLLLAALLLWLTLAWLLDAITLPVG
Ga0126371_1029503213300021560Tropical Forest SoilMSLRQFLPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0126371_1153349633300021560Tropical Forest SoilMSLKQFIPAVIVGLLVAALLLWFTLGWLLDAITLPVG
Ga0126371_1364101113300021560Tropical Forest SoilMSLRQFIPVVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0207700_1147725923300025928Corn, Switchgrass And Miscanthus RhizosphereMSLRQFIPAIIVGLLLAALLLWLTLAWLLDAITLPVG
Ga0318516_1022599223300031543SoilMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0318541_1029738113300031545SoilMSLRHFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318573_1025961413300031564SoilMGLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318515_1020669813300031572SoilHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0310915_1003153313300031573SoilRFGCGVRSFTNSHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLHAITLPVG
Ga0310915_1005428033300031573SoilMSRRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVCPPSIVDIA
Ga0310915_1013673923300031573SoilMSLKQFIPAVIVGLLLAALLLWLTLAWLLDAITLPVG
Ga0310915_1018610733300031573SoilNSHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLSVG
Ga0310915_1107699323300031573SoilSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0306917_1030807923300031719SoilMSLRQFIPAVIVGLLLAALLLWLTLAWLLDAITLPVG
Ga0306917_1039189833300031719SoilCMSLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318500_1053061913300031724SoilLPQDVGDRMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318501_1083800613300031736SoilSARCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0306918_1040109513300031744SoilDVDLCSLVMSLRHFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0306918_1058732623300031744SoilMSPRLFIPAVIVGLLLAALLLWLTLGWLLDAITLPAG
Ga0306918_1140320513300031744SoilARCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0318554_1057125713300031765SoilLRHFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318546_1004937913300031771SoilRMGLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318547_1065506323300031781SoilMSLRQFILAVTAGLLLSALLLWLTLAWLLDAITLPVG
Ga0318576_1008650243300031796SoilYRHRMSRRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318512_1051774023300031846SoilLTCVRSFTSLRQFIPAVIAGLLLTALLLWLTLGWLLDAITLPV
Ga0306919_1004326023300031879SoilMSLRQFILAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0306919_1041814213300031879SoilDLCSLVMSLRHFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0306925_1002754563300031890SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITMPVG
Ga0306925_1202019513300031890SoilLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0306923_1159744813300031910SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLSVG
Ga0306921_1076985413300031912SoilMSLKQFIPAVIVGLLVAALLLSLTLGWLLDAITLPVG
Ga0310916_1046697613300031942SoilVFAMNSHRMSLRQFIPAAIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0310913_1035450113300031945SoilMSRRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVC
Ga0310913_1129282413300031945SoilHRVRHEHRSARCMSLKQFIPAVIVGLLVAVLLLWLTLGWLLDAITLPVG
Ga0310910_1091073713300031946SoilRSARCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0306922_1038519813300032001SoilAHRMGLKQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVG
Ga0318507_1007955923300032025SoilSHRARHERRSARCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0318559_1030295013300032039SoilTSLRQFIPAVIAGLLLTALLLWLTLGWLLDAITLPVG
Ga0318532_1004746013300032051SoilARHERRSARCMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLPVG
Ga0318506_1049172113300032052SoilMSRRQFIPAVIVGLLLAALLLWLTLGWLLDAITLPVGWRRVCPTSI
Ga0318540_1055813213300032094SoilMSPRLFIPAVIVGLLLAALLLWLTLGWLLDAITLP
Ga0306920_100014448113300032261SoilMSLRQFIPAVIAGLLLAALLLWLTLGWLLHAITLPVG
Ga0306920_10018150313300032261SoilHRMSLRQFIPAVIAGLLLAALLLWLTLGWLLDAITMPVG
Ga0306920_10262289513300032261SoilMSLKQFIPAVIVGLLVAVLLLWLTLGWLLDAITLPVG
Ga0306920_10313473923300032261SoilMSLKQFIPAVIVGLLVAALLLWLTLGWLLDAITLP
Ga0310914_1014483113300033289SoilISLRQFIPAVIAGLLLAALLLWLTLGWLLDAITLPVG
Ga0310914_1034485323300033289SoilMSLRQFIPAVIVGLLLAALLLWLTLGWLLDAITLSVG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.