NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F099803

Metagenome Family F099803

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F099803
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 46 residues
Representative Sequence AYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Number of Associated Samples 73
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 6.80 %
% of genes from short scaffolds (< 2000 bps) 5.83 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (95.146 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(31.068 % of family members)
Environment Ontology (ENVO) Unclassified
(50.485 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.311 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.
1Ga0066690_110447961
2Ga0066388_1016006383
3Ga0066388_1025726681
4Ga0066388_1043546131
5Ga0066388_1076798762
6Ga0070711_1009166712
7Ga0070711_1009166742
8Ga0066903_1025084101
9Ga0066903_1049468572
10Ga0066903_1049716652
11Ga0066903_1062838731
12Ga0066903_1075554101
13Ga0066903_1086898102
14Ga0075023_1000461721
15Ga0075024_1004364352
16Ga0075018_104022761
17Ga0070716_1018242361
18Ga0070712_1005398852
19Ga0079220_112694101
20Ga0116215_15344262
21Ga0126373_103398501
22Ga0126373_106876213
23Ga0126373_120708582
24Ga0126370_100100546
25Ga0126370_123004671
26Ga0126370_123981412
27Ga0126372_114434102
28Ga0126372_117041881
29Ga0126378_109667803
30Ga0126379_137245171
31Ga0126383_116647531
32Ga0126383_119118871
33Ga0137363_109367731
34Ga0126369_106085243
35Ga0126369_115867782
36Ga0182041_108486641
37Ga0182033_105833513
38Ga0182033_120112241
39Ga0182035_100198061
40Ga0182035_110007441
41Ga0182035_112201472
42Ga0182032_119171051
43Ga0182032_119650721
44Ga0182034_107965961
45Ga0182040_105465561
46Ga0182037_111926862
47Ga0182039_119320142
48Ga0182039_120649591
49Ga0187802_103062791
50Ga0187819_106098731
51Ga0187817_102682531
52Ga0187779_106756052
53Ga0187783_110388191
54Ga0187777_104712682
55Ga0210383_110728191
56Ga0210402_113975691
57Ga0126371_102537891
58Ga0207745_10184782
59Ga0207836_10334351
60Ga0209583_104344661
61Ga0170820_127915423
62Ga0318541_103788752
63Ga0318538_105658251
64Ga0318542_101606431
65Ga0318561_100135031
66Ga0318561_103448371
67Ga0310686_1048147163
68Ga0307474_115058002
69Ga0306917_102560203
70Ga0318501_100170795
71Ga0306918_107956082
72Ga0306918_108592801
73Ga0318543_100267791
74Ga0318529_101818141
75Ga0318503_103072661
76Ga0318557_101076291
77Ga0318550_100655731
78Ga0318523_102649482
79Ga0318497_103416291
80Ga0307478_106154463
81Ga0318517_101003252
82Ga0306919_101402472
83Ga0306925_113756152
84Ga0306925_122030542
85Ga0318520_102065861
86Ga0306923_100986711
87Ga0310912_100228711
88Ga0310912_104718153
89Ga0310916_107372083
90Ga0310916_110013791
91Ga0310910_113737871
92Ga0310910_115035141
93Ga0310909_114580001
94Ga0310909_115767431
95Ga0318530_104347642
96Ga0318559_101932813
97Ga0318559_104889211
98Ga0318532_101294871
99Ga0318513_102485342
100Ga0318514_103852002
101Ga0306920_1001446836
102Ga0306920_1015523673
103Ga0310914_118303441
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 63.04%    β-sheet: 0.00%    Coil/Unstructured: 36.96%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530354045AYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTTSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
4.9%95.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Peatlands Soil
Agricultural Soil
Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Tropical Peatland
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
2.9%3.9%14.6%31.1%21.4%2.9%11.7%3.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066690_1104479613300005177SoilWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGRST*
Ga0066388_10160063833300005332Tropical Forest SoilYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0066388_10257266813300005332Tropical Forest SoilEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0066388_10435461313300005332Tropical Forest SoilAADSVPDTVAACQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT*
Ga0066388_10767987623300005332Tropical Forest SoilWLNEEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGTTT*
Ga0070711_10091667123300005439Corn, Switchgrass And Miscanthus RhizosphereDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGMTT*
Ga0070711_10091667423300005439Corn, Switchgrass And Miscanthus RhizosphereDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT*
Ga0066903_10250841013300005764Tropical Forest SoilAVAAYQEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT*
Ga0066903_10494685723300005764Tropical Forest SoilAAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWANVGMTT*
Ga0066903_10497166523300005764Tropical Forest SoilEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0066903_10628387313300005764Tropical Forest SoilARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0066903_10755541013300005764Tropical Forest SoilYQEWLNEEMGARAEDARLLMSNGQKFMDASSRLLSNSWTSTRATT*
Ga0066903_10868981023300005764Tropical Forest SoilMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0075023_10004617213300006041WatershedsARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT*
Ga0075024_10043643523300006047WatershedsAYQEWLNEEMGARAEDARLLMSNGQKFMDTSSRFLSSGWTSAGTTT*
Ga0075018_1040227613300006172WatershedsEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGMTT*
Ga0070716_10182423613300006173Corn, Switchgrass And Miscanthus RhizosphereAAYQEWLNEEMGARAQDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0070712_10053988523300006175Corn, Switchgrass And Miscanthus RhizospherePDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT*
Ga0079220_1126941013300006806Agricultural SoilPLIQFLTPLRPQEWLNEEMGARAQDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0116215_153442623300009672Peatlands SoilQEWLSEEMGARAEDARRLMSNGQKFMDTSTRLLSNGWSSVSTTT*
Ga0126373_1033985013300010048Tropical Forest SoilQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0126373_1068762133300010048Tropical Forest SoilVPDAVAAYQEWLNEEMSARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT*
Ga0126373_1207085823300010048Tropical Forest SoilYQEWLNEEMSARAEDARLLMSSGQKFMDTTSRFLSSGWTSAGMTT*
Ga0126370_1001005463300010358Tropical Forest SoilACQEWLNEEMGARAEDARLLMSNGQKFTDATSRYLSSGWTNVGTTT*
Ga0126370_1230046713300010358Tropical Forest SoilSVPDAVAAYQEWLNEEMGARAEDARLLMSSGQKFVDASSRFLSSSWTSASTTT*
Ga0126370_1239814123300010358Tropical Forest SoilEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT*
Ga0126372_1144341023300010360Tropical Forest SoilHSVPDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT*
Ga0126372_1170418813300010360Tropical Forest SoilEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT*
Ga0126378_1096678033300010361Tropical Forest SoilAYQEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT*
Ga0126379_1372451713300010366Tropical Forest SoilLAAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0126383_1166475313300010398Tropical Forest SoilWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT*
Ga0126383_1191188713300010398Tropical Forest SoilTAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRYLSSGWTNVGTTT*
Ga0137363_1093677313300012202Vadose Zone SoilMSEVFGGGAAYQEWLNEEMGARAEDARLLMSNEQKFMDTTSRFLSSGWTNAGMST*
Ga0126369_1060852433300012971Tropical Forest SoilDAVAAYQEWLNEEMSARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT*
Ga0126369_1158677823300012971Tropical Forest SoilDAVAAYQEWLNEEMSARAEDARLLMSSGQKFMDTTSRFLSSGWTSAGMTT*
Ga0182041_1084866413300016294SoilDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT
Ga0182033_1058335133300016319SoilDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGMTT
Ga0182033_1201122413300016319SoilYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWANAGMTT
Ga0182035_1001980613300016341SoilNARAEDARLLMSNGQKFMDATSRFLSSGWTNANMTT
Ga0182035_1100074413300016341SoilSVPDAVAAYQEWLNEEMGARAEDARLLMSSGQKFMDASSRFLSSSWTSASTTT
Ga0182035_1122014723300016341SoilYQEGLSEEMSARAEDARLLMSNGQKLIDASSRFLSSGWTNAGTTT
Ga0182032_1191710513300016357SoilMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVAMTT
Ga0182032_1196507213300016357SoilYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTSVGMST
Ga0182034_1079659613300016371SoilEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT
Ga0182040_1054655613300016387SoilTDAHSVPDAVAAYQEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT
Ga0182037_1119268623300016404SoilLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGMTT
Ga0182039_1193201423300016422SoilDAVTAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0182039_1206495913300016422SoilEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0187802_1030627913300017822Freshwater SedimentNDEMGVRAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0187819_1060987313300017943Freshwater SedimentDEMGVRAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0187817_1026825313300017955Freshwater SedimentPDAVAAYQEWLNDEMGVRAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0187779_1067560523300017959Tropical PeatlandYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0187783_1103881913300017970Tropical PeatlandYQEWLNEEVGARAEDARLLMSSGQKFMDASSRFLSTGWTSAGPTT
Ga0187777_1047126823300017974Tropical PeatlandAAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0210383_1107281913300021407SoilDSVPDAVAACQEWLNEEMSARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0210402_1139756913300021478SoilAACQEWLNEEMSARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0126371_1025378913300021560Tropical Forest SoilAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0207745_101847823300026889Tropical Forest SoilQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0207836_103343513300026932Tropical Forest SoilEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0209583_1043446613300027910WatershedsQEWLNEEMGARAEDARLLMSNGQKFMDTSSRFLSSGWTSAGTTT
Ga0170820_1279154233300031446Forest SoilSVPDAVAAYQEWLNEEMGARAEDARLLMSNGQKFMDTSSRFLSSGWTNAGMTT
Ga0318541_1037887523300031545SoilVPDAVAAYQEWLNDEMGARAEDARLLMSYGQKFMDTTSRFLSSGWTNAGMTT
Ga0318538_1056582513300031546SoilVAAYQEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT
Ga0318542_1016064313300031668SoilAAYQEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT
Ga0318561_1001350313300031679SoilAYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTSVGMST
Ga0318561_1034483713300031679SoilVPDTVAACQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT
Ga0310686_10481471633300031708SoilPDAIAAYQEWLSEEMSARAEDARRLIFNGQKFMNTGSRLMSNGWTNVSS
Ga0307474_1150580023300031718Hardwood Forest SoilADSVPDAVAACQEWLNEEMSARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMTT
Ga0306917_1025602033300031719SoilPDAVATYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTSVGMST
Ga0318501_1001707953300031736SoilITAYQEWLSEEMNARAEDARLLMSNGQKFMDATSRFLSSGWTNANMTT
Ga0306918_1079560823300031744SoilSVPDTVAAYQERLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTSVGMTT
Ga0306918_1085928013300031744SoilWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNAGMTT
Ga0318543_1002677913300031777SoilADSVPDTVAACQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT
Ga0318529_1018181413300031792SoilEWLSEEMNARAEDARLLMSNGQKFMDATSRFLSSGWTNANMTT
Ga0318503_1030726613300031794SoilLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT
Ga0318557_1010762913300031795SoilAYQEWLSEEMNARAEDARLLMSNGQKFMDATSRFLSSGWTNANMTT
Ga0318550_1006557313300031797SoilWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0318523_1026494823300031798SoilYEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT
Ga0318497_1034162913300031805SoilVAACQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT
Ga0307478_1061544633300031823Hardwood Forest SoilAADSVPDALAAYQEWLNEEMGARAQDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0318517_1010032523300031835SoilEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTNVGMTT
Ga0306919_1014024723300031879SoilQEWLNEEMGARAEDARLLMSNGQKFTDATSRYLSSGWTNVGTTT
Ga0306925_1137561523300031890SoilEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0306925_1220305423300031890SoilTDAHSVPDAVAAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0318520_1020658613300031897SoilAAYQEWLNEEMGARAEDARLLMSSGQKFMDASSRFLSSSWTSASTTT
Ga0306923_1009867113300031910SoilDAHSVPDAVAAYQEWLNEEMGARAEDARLLMSSGQKFMDASSRFLSSSWTSASTTT
Ga0310912_1002287113300031941SoilMNARAEDARLLMSNGQKFMDATSRFLSSGWTNANMTT
Ga0310912_1047181533300031941SoilPDAVAAYQEWLNDEMGARAEDARLLMSNGQKFMNTTSRFLSSGGTNVGMTT
Ga0310916_1073720833300031942SoilSVPDAVAAYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0310916_1100137913300031942SoilTADSVPEALAAYQEWLNEEMGARAEDARLLMSNGQKFTDATSRFLSSGWTNVGMTT
Ga0310910_1137378713300031946SoilEEMGARAEDARLLMSNGQKFMDTTSRFLSSSWTSFGMTT
Ga0310910_1150351413300031946SoilLNEEMGARAEDARLLMSSGQKFVDASSRFLSSGWTSASTTT
Ga0310909_1145800013300031947SoilADSVPDAVAAYQEWLNEEMGARAEDVRLLMSNGQKFMDTTSRFLSSGWTSASMTT
Ga0310909_1157674313300031947SoilQEWLNEEMGARAEDARLLMSSGQKFVDASSRFLSSGWTSASTTT
Ga0318530_1043476423300031959SoilLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0318559_1019328133300032039SoilWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT
Ga0318559_1048892113300032039SoilDAVAAYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0318532_1012948713300032051SoilVAAYQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT
Ga0318513_1024853423300032065SoilEWLNDEMGARAEDARLLMSSGQKFMDTTSRFLSSGWTNVGMTT
Ga0318514_1038520023300032066SoilDAVAAYQEWLNEEMGARAEDARLLMSNGQKFMDTTSRFLSSGWTSVGMST
Ga0306920_10014468363300032261SoilSVPDAVAAYQEWLNEEMGARAEDARLLMSSGQKFVDASSRFLSSGWTSARTTT
Ga0306920_10155236733300032261SoilLNDEVGARAEDARLLMSNGQKFMDTTSRFLSSGWTNVGMST
Ga0310914_1183034413300033289SoilQEWLNDEMGARAEDARLLMSNGQKFMDTTSRFLSSGGTNVGMTT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.