NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F102148

Overview

Basic Information
Family ID: F102148
Family Type: Metagenome
Number of Sequences: 102
Average Sequence Length: 46 residues
Representative Sequence: MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAAD
Number of Associated Samples: 89
Number of Associated Scaffolds: 102

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 95.10%
% of genes near scaffold ends (potentially truncated): 100.00%
% of genes from short scaffolds (< 2000 bp): 90.20%
Associated GOLD sequencing projects: 85
AlphaFold2 3D model prediction: No


Most Common Taxonomy
Group: Unclassified (50.980% of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (36.274% of family members)
Environment Ontology (ENVO): Unclassified (47.059% of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (48.039% of family members)
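
The percentages reported in the Overview can be read back as approximate member counts. The following is a minimal sketch, assuming each percentage is a share of the 102 family members; the page does not state the denominators, and the label strings and the `implied_count` helper are illustrative only.

```python
# Assumption: every percentage below is a share of the 102 family members.
FAMILY_SIZE = 102

# Values copied from the Overview section; the labels are shortened for illustration.
reported_percentages = {
    "genes with valid RBS motifs": 95.10,
    "genes from short scaffolds": 90.20,
    "Unclassified taxonomy": 50.980,
    "GOLD Forest Soil ecosystem": 36.274,
    "ENVO Unclassified": 47.059,
    "EMPO Soil (non-saline)": 48.039,
}


def implied_count(percentage: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members matching a percentage."""
    return round(percentage / 100 * total)


implied_counts = {
    label: implied_count(pct) for label, pct in reported_percentages.items()
}

for label, count in implied_counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE}")
```

Under that assumption, each reported value corresponds to a whole number of members, which is a quick sanity check on the figures.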




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
Alignment viewer (MSAViewer): 102 sequences, labelled 1 to 102 with the Protein IDs listed under Family Sequences, in the same order; the column ruler is numbered up to 66.



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 17.78%, β-sheet: 0.00%, Coil/Unstructured: 82.22%
Feature Viewer (positions 1 to 45): MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAAD
Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, Disordered Regions



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 49.0%
Unclassified: 51.0%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Groundwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Agricultural Soil; Populus Rhizosphere; Miscanthus Rhizosphere; Rhizosphere Soil; Corn Rhizosphere
Slice labels shown in the chart (not matched to habitats in the extracted text): 6.9%, 3.9%, 13.7%, 36.3%, 13.7%, 6.9%, 2.9%, 2.9%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12701J14581_10077962 | 3300001369 | Forest Soil | MSAHAPDKPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQ
Ga0070688_1014347762 | 3300005365 | Switchgrass Rhizosphere | MSGYAPDKSSTIVPFAPTPKAPPEANVVADDSGRTIIALLQKA
Ga0070714_1014024261 | 3300005435 | Agricultural Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQK
Ga0070711_1007782062 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAHAPDEPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQK
Ga0066905_1016399392 | 3300005713 | Tropical Forest Soil | MSAYAPPDKAAGTIVPFAPTPKAQPEVAADESGRTIVAMLQKAA
Ga0066905_1017612221 | 3300005713 | Tropical Forest Soil | MSAYDAPDNAGTIVPFAPTPKGQQDTGAAADDSGRTIVALLQKAADMAN
Ga0066905_1018089321 | 3300005713 | Tropical Forest Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAAD
Ga0066903_1022003341 | 3300005764 | Tropical Forest Soil | MSGYAPDKSTTIIPFAPTPKPQPPEANVVADDSGRTIISLLQKAAE
Ga0066903_1034661831 | 3300005764 | Tropical Forest Soil | MSAYDDAPDKAGTIVPFAATPKGQQDTGTAADDSGRTIVALLQKAADMANEDCK
Ga0066903_1077433431 | 3300005764 | Tropical Forest Soil | MSGYVPDKSNTIIPFAPTPKTQPPEGNVVADDSGRTIISLLQKAA
Ga0066653_106803202 | 3300006791 | Soil | MSAYVPDKAGTIVPFAPAPKGQHDTGIIADDSGRTIVAMLQKAADMANADCKRA
Ga0111539_105955223 | 3300009094 | Populus Rhizosphere | MSAYAPPDKAAGTIVPFAPTPKAQPEVAADESGRTIVAMLQ
Ga0075418_112612381 | 3300009100 | Populus Rhizosphere | MSAYAPPDKAAGTIVPFAPTPKAQPEVAADESGRTIVAMLQKAADMAKEDC
Ga0105242_127715712 | 3300009176 | Miscanthus Rhizosphere | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVA
Ga0126374_117755731 | 3300009792 | Tropical Forest Soil | MSGYASDPGSTIVPFAPTPKGQPDAGIVADDSGRTIVALLQKAAEMAKQDC
Ga0126380_106167183 | 3300010043 | Tropical Forest Soil | MSAYAPPDKAAGTIVPFAPTPKAQPDVAADESGRTI
Ga0126384_107913861 | 3300010046 | Tropical Forest Soil | MSAYVSDKAGTIVPFAPAPKGQHDTGIMADESGRTI
Ga0126382_101770923 | 3300010047 | Tropical Forest Soil | MSAYVPDNAGTIVPFAPKGQQDTGAAADDSGRTIVALLQKAAHMANEDCKRAMD
Ga0126382_119567171 | 3300010047 | Tropical Forest Soil | MSAYDAPDKAGTIVPFAPTPKGQQDTGAAADDSGRTIVALLQKAADM
Ga0126372_109600823 | 3300010360 | Tropical Forest Soil | MSAYDDAPDKAGTIVPFAATPKGQQDTGTAADDSGGTIVALLQ
Ga0126378_115711072 | 3300010361 | Tropical Forest Soil | MSGYDAPDQAGTIVPFAPTPKGQQDTGAAADDSGRTIVALLQKAADM
Ga0126378_122602061 | 3300010361 | Tropical Forest Soil | MSAYDAPDKAGTIVPFAPAPKGQQDTGTIADDSGRTIVAMLQKAADMANEDCKRAMD
Ga0126379_118571372 | 3300010366 | Tropical Forest Soil | MSAYTPDKAGTIVPFAPAPKGQHDTGTVADDSGRT
Ga0126383_122867752 | 3300010398 | Tropical Forest Soil | MSGFASDQGGTIVPFAPAPKAQPDGRIVADDSGRSIVALLQKAADMA
Ga0124850_11132942 | 3300010863 | Tropical Forest Soil | MSAYDAPDKAGTIVPFAPAPKGQQDTDSIADDSGRTIVALLQKAADMANEDCKRAMDL
Ga0137365_110686541 | 3300012201 | Vadose Zone Soil | MGGVSAMSAYVPDKAGTIVPFAPAPKGQHDTGIIADDSGRTIVAMLQKAADMANEDCKRAMD
Ga0137376_100438165 | 3300012208 | Vadose Zone Soil | MSAYVPNKAGTIVPFAPAPKGQHDTGIIADDSGRTIVAMLQKAADM
Ga0137385_115476332 | 3300012359 | Vadose Zone Soil | MGGVSAMSAYVPDKAGTIVPFAPAPKGQHDTGIIADDSGLTIVAMLQKAADMANEDC
Ga0157295_103395101 | 3300012906 | Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTI
Ga0137396_105791983 | 3300012918 | Vadose Zone Soil | MGGVSAMSAYVPNKAGTIVPFAPAPKGQHDTGIIADDSGRTIVAMLQKA
Ga0126375_100499504 | 3300012948 | Tropical Forest Soil | MSAYDDAPDKAGTIVPFAPTPKGQQDTGAAADDSGRTIVALLQKAADMANEDCKRA
Ga0126375_113291153 | 3300012948 | Tropical Forest Soil | MSGYAPDKSTTIIPFAPTPKPQPPEANVVADDSGRTIISLLQKAAEMAKDD
Ga0164300_102019151 | 3300012951 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTGIAADDSGRTIVAMLQKAADMANEDCKRAMD
Ga0126369_120012361 | 3300012971 | Tropical Forest Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAEVDDSGRTIVAMLQKAADMANEDCKRAM
Ga0126369_135170042 | 3300012971 | Tropical Forest Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALL
Ga0164305_110587282 | 3300012989 | Soil | MGGVSAMSAYVPDKAGTIVPFAPAPKGQHDTGILADDSGRTIVAMLQKAADMANEDC
Ga0164305_116957342 | 3300012989 | Soil | MSAHAPDEPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQKAAEM
Ga0182036_101758051 | 3300016270 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTGIVADDSGRTIVAMLQKAADMANEDCKRAM
Ga0182041_122548251 | 3300016294 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSGRTIVAMLQKAADMANEDCKRAMDL
Ga0182032_109337413 | 3300016357 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTAAVVDDSGRTIVALLQKAA
Ga0182034_117539281 | 3300016371 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCKRA
Ga0182034_118190071 | 3300016371 | Soil | MSAYDVPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQKAAD
Ga0182040_102756731 | 3300016387 | Soil | MSVQSSDRPATVVPFAPTPKVQPDANIIADDSGRTIVAMLQKAAELAKED
Ga0182040_115226172 | 3300016387 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTAIVADDSGRTIVAMLQKAADMANEDCKRAMDL
Ga0182040_116235041 | 3300016387 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTGIVADDSGRTIVAMLQK
Ga0182038_117848761 | 3300016445 | Soil | MSAYVPDKAGTIVPFAPAPKGQHDTAIVADDSGRTIMA
Ga0184624_104157562 | 3300018073 | Groundwater Sediment | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVAMLQKAADM
Ga0210406_110650071 | 3300021168 | Soil | MSAHAPDKPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQKAAEMA
Ga0210386_117544492 | 3300021406 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTGIAADDSGRTIVAM
Ga0210392_106755082 | 3300021475 | Soil | MSAHAPDKPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQKAAEMAKDDCARA
Ga0207663_106177191 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAHAPDEPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQKAAEMAKDD
Ga0207700_105723211 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MSGYAPDKSSTIVPFAPTPKAPPEANVVADDSGRTIIALLQKAAEMAK
Ga0207679_116508181 | 3300025945 | Corn Rhizosphere | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTII
Ga0207981_10063514 | 3300027560 | Soil | MSAYAPDKAATVVPFAPTPKGEQPEVVADESGRTIVALLQKAAEMAKE
Ga0208990_11538871 | 3300027663 | Forest Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQKA
Ga0208981_10021886 | 3300027669 | Forest Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQKAA
Ga0268266_107317771 | 3300028379 | Switchgrass Rhizosphere | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQKAAEMAKED
Ga0307282_100588101 | 3300028784 | Soil | MSAYAPDKAATVVPFAPTPKGQHPEVVADESGRTIVALLQKAADMAKEDC
Ga0307294_100116404 | 3300028810 | Soil | MSAYAPDKAATVVPFAPTPKGQQPEVLADESGRTIVAMLKKA
Ga0307501_101489381 | 3300031152 | Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQKAAEMAKEDC
Ga0318538_107450931 | 3300031546 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSGRTIVAMLQKA
Ga0318571_104401971 | 3300031549 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQRAADMANE
Ga0310915_105568713 | 3300031573 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCKR
Ga0318560_107419901 | 3300031682 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSERTIVAMLQK
Ga0318496_100684801 | 3300031713 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMAN
Ga0306917_108156532 | 3300031719 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDC
Ga0318500_100060446 | 3300031724 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTAIVADDSGRTIVAMLQKAADMA
Ga0318501_106716722 | 3300031736 | Soil | MSAYDAADKAGTIVPFAPAPKGQQDTDSIADDSGRTIVAMLQKAADMANE
Ga0318537_100627511 | 3300031763 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSERTIVAMLQKAAD
Ga0318535_100178664 | 3300031764 | Soil | MSVHASDRPATVVPFAPAAKGQPDGSIVADDSGRTIVALLQKAAELAKED
Ga0318509_101067611 | 3300031768 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQRAA
Ga0318498_104726011 | 3300031778 | Soil | MSAYVPDKAGTIVPFAPAPKGQHDTGIVADDSGRTIVAMLQKAADMANEDC
Ga0318529_100846471 | 3300031792 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQRAAD
Ga0318503_100774413 | 3300031794 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTAAVVDDSGRTIVALL
Ga0318557_100101931 | 3300031795 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCN
Ga0318576_105681581 | 3300031796 | Soil | MSAYVPDKAGTIVPFAPAPKGQHDTGIVADDSGRTIVAMLQKA
Ga0318523_103125973 | 3300031798 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCKRAMD
Ga0318511_102182323 | 3300031845 | Soil | MSVHAAERPATVVPFAPAKGQPDANIVADDSGRTIVALLQKAAELAKE
Ga0318495_101380311 | 3300031860 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDC
Ga0318536_103704082 | 3300031893 | Soil | MSAYDAADKAGTIVPFAPAPKGQQDTDSIADDSGRTIVAMLQKAADM
Ga0318551_108071001 | 3300031896 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIL
Ga0306921_113136321 | 3300031912 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCKRAM
Ga0310916_114920871 | 3300031942 | Soil | MSAYVPDKAGTIVPFAPAPKGQHDTGIVADDSGRTIVAMLQK
Ga0310910_110795221 | 3300031946 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANDTATT
Ga0310909_108332083 | 3300031947 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANE
Ga0306922_101102171 | 3300032001 | Soil | MSGYDAPDQAGTIVPFAPTPKWQQDTGAAADDSGRTIVALLQKAAD
Ga0306922_107567813 | 3300032001 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAA
Ga0318563_100211846 | 3300032009 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANE
Ga0318569_105408702 | 3300032010 | Soil | MSAYAPDKAVTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAADMANEDCKRAM
Ga0310902_111042272 | 3300032012 | Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIVALLQKAAEMAK
Ga0310911_108688471 | 3300032035 | Soil | MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIV
Ga0318549_101892513 | 3300032041 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQRAADMA
Ga0318532_103436921 | 3300032051 | Soil | MSAYDAADKAGTIVPFAPAPKGQQDTDSIADDSGRTI
Ga0318506_105201262 | 3300032052 | Soil | MSAYAPDKAGTIVPFAPAPKGQHDTAIVADDSGRTIVAMLQKAADMANEDCK
Ga0318504_100718093 | 3300032063 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQ
Ga0318514_100902783 | 3300032066 | Soil | MSAYDAPDKAGTIVPFAPAPKGQHDTGTIADDSGRTIVAMLQRAADMANEDCKRA
Ga0306924_126223982 | 3300032076 | Soil | MGGVSAMSAYAPDKAGTIVPFAPAPKGQHDTGIAA
Ga0318525_105245672 | 3300032089 | Soil | MSEYAPDKAGTIVPFAPAPKGQHDTGIIADDSGRTIVAM
Ga0318518_106850321 | 3300032090 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSERTIVAML
Ga0318540_102766791 | 3300032094 | Soil | MSAYDAPDKAGTIVPFAPAPKGQQKTGTAADDSGRTIVA
Ga0310889_106593631 | 3300032179 | Soil | MSAYAPDKAATVVPFAPTPKGKQPEVVADESGRTIIALLQKAAD
Ga0373958_0098337_1_117 | 3300034819 | Rhizosphere Soil | MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIIALL
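
Rows of the Family Sequences table can be converted to FASTA records for use with downstream tools. The following is a minimal sketch using three rows copied from the table. The header layout (`taxon=`, `habitat=`) and the `to_fasta` helper are illustrative choices, not a format defined by NMPFamsDB.

```python
# Three rows copied from the Family Sequences table:
# (protein ID, sample taxon ID, habitat, sequence).
rows = [
    ("JGI12701J14581_10077962", "3300001369", "Forest Soil",
     "MSAHAPDKPATIVPFAPMSKGQAEGHIVADDSGRTIVALLQ"),
    ("Ga0066905_1018089321", "3300005713", "Tropical Forest Soil",
     "MSAYAPDKAGTIVPFAPTPKGQQDTGAVVDDSGRTIVALLQKAAD"),
    ("Ga0373958_0098337_1_117", "3300034819", "Rhizosphere Soil",
     "MSAYAPDKAATVVPFAPTPKGQQPEVVADESGRTIIALL"),
]


def to_fasta(records, width=60):
    """Format (id, taxon, habitat, sequence) tuples as FASTA text.

    The header fields after the ID are an arbitrary, hypothetical layout.
    """
    lines = []
    for protein_id, taxon_id, habitat, sequence in records:
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        # Wrap the sequence at a fixed line width.
        lines.extend(
            sequence[i:i + width] for i in range(0, len(sequence), width)
        )
    return "\n".join(lines) + "\n"


fasta_text = to_fasta(rows)
lengths = [len(sequence) for _, _, _, sequence in rows]
mean_length = sum(lengths) / len(lengths)

print(fasta_text)
print(f"Mean length of these {len(rows)} sequences: {mean_length:.1f} residues")
```

The mean printed here covers only the three sample rows. Reproducing the family-wide average reported under Basic Information would require all 102 sequences.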




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice