NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F103967



Overview

Basic Information
Family ID F103967
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 49 residues
Representative Sequence QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Number of Associated Samples 79
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 29.70 %
% of genes from short scaffolds (< 2000 bps) 29.70 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[HMM logo, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (70.297 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(50.495 % of family members)
Environment Ontology (ENVO) Unclassified
(69.307 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(51.485 % of family members)
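
The percentages in this overview can be read back as member counts. The sketch below assumes each percentage is a share of the 101 family sequences. The page states "% of family members" for the taxonomy and ecosystem figures; treating the "% of genes" figure the same way (one gene per member) is an assumption. The helper name `members_from_percent` is illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported for family F103967

# Percentages copied from the Overview section of this page.
reported = {
    "Unclassified taxonomy": 70.297,
    "GOLD: Forest Soil -> Soil": 50.495,
    "ENVO: Unclassified": 69.307,
    "EMPO: Unclassified": 51.485,
    "Genes near scaffold ends": 29.70,  # assumption: one gene per family member
}


def members_from_percent(percent, total=FAMILY_SIZE):
    """Return the whole-number member count implied by a percentage of `total`."""
    return round(percent / 100 * total)


counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, every reported percentage corresponds to a whole number of sequences, which is a quick consistency check on the figures.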




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Alignment view of the 101 family sequences, labeled by protein ID; rendered with MSAViewer. The sequences are listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 81.25%, β-sheet: 0.00%, Coil/Unstructured: 18.75%
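
The secondary-structure percentages can be turned into residue counts. This is a minimal sketch that assumes the distribution was computed over the 48-residue representative sequence shown in the Overview; the page does not say which sequence the prediction used.

```python
# Representative sequence copied from the Basic Information block.
REPRESENTATIVE = "QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP"

# Percentages copied from "Secondary Structure distribution".
distribution = {
    "alpha-helix": 81.25,
    "beta-sheet": 0.00,
    "coil/unstructured": 18.75,
}

length = len(REPRESENTATIVE)

# Assumption: each percentage is a share of the representative sequence length.
residues = {state: round(pct / 100 * length) for state, pct in distribution.items()}

for state, n in residues.items():
    print(f"{state}: {n} of {length} residues")
```

Both non-zero percentages map to whole residues of the 48-residue sequence, which supports the assumption without proving it.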
[Feature Viewer plot of the representative sequence QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP. Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, TM segments, Topol. domains (Extracel./Cytopl.).]



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 29.7%
Unclassified: 70.3%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Habitat chart. Labels: Soil; Tropical Forest Soil; Grasslands Soil; Forest Soil; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Populus Rhizosphere. Slice values as extracted, not matched to labels: 12.9%, 3.0%, 50.5%, 3.0%, 18.8%, 4.0%.]



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AF_2010_repII_A01DRAFT_10254101 | 3300000580 | Forest Soil | AQAGSSWSTYLLLLLGAALAAASTMWFFLRMASIFARRAANSRMDMSNP*
AF_2010_repII_A1DRAFT_100665724 | 3300000597 | Forest Soil | AQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFAXRXTNSRLHMSNP*
AF_2010_repII_A100DRAFT_10098711 | 3300000655 | Forest Soil | AESSWSTYLLLLLGAALAAASAVWFFLRMPSIFARRAANSRMHMSNP*
Ga0066395_109103622 | 3300004633 | Tropical Forest Soil | YLLLILGAALAATAVWFFTKLTAVYARRAAGPPMHMRSEW*
Ga0066388_1061609221 | 3300005332 | Tropical Forest Soil | AESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP*
Ga0066692_104044681 | 3300005555 | Soil | AGEAHEPAHAAATVESPWITYLLLILGAALAAASAMWLFSGMRSPFARQAANPRMHMSKS
Ga0066704_106370742 | 3300005557 | Soil | EAHEPAHAAATVESPWITYLLLILGAALAAASAMWLFSGMRSPFARQAANPRMHMSKS*
Ga0066903_1011465302 | 3300005764 | Tropical Forest Soil | MSSIVPPRKSSWSSYLSLLLGAALAAASAMWFFLRVASIFARRAANLRMHMSNP*
Ga0066903_1075586161 | 3300005764 | Tropical Forest Soil | QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP*
Ga0070715_103155751 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | ESSWSTYLLLILGAALAVAAAVWFFAKLTAVYARRAAGPRLHMRSEW*
Ga0070712_1017136652 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | AESSWSTYLLLLLGAALAAASATWFLVKLSPVYARRAAGPRMHTSEWQ*
Ga0075425_1000354671 | 3300006854 | Populus Rhizosphere | LLILGAALAAASATWFLVKMAPVYARRAAGPRMHTSEWQ*
Ga0126384_106089391 | 3300010046 | Tropical Forest Soil | TESSWSTYLLWILGAALAAASAMWFFVKMAPVYARRAAGPRMHMRNQ*
Ga0126384_108731891 | 3300010046 | Tropical Forest Soil | AQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP*
Ga0126382_101076333 | 3300010047 | Tropical Forest Soil | ETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP*
Ga0126382_124899971 | 3300010047 | Tropical Forest Soil | STYLLLLMLGAALAAAAGLWFFAKMTPVYARRAAGPRVHMPSEW*
Ga0126373_120633951 | 3300010048 | Tropical Forest Soil | AAATVSTESSWSTYLLLIFAALAAASATWFLVKMAPVYARRAAGPRMHTSELQ*
Ga0134080_100501861 | 3300010333 | Grasslands Soil | AHAAAIVESPWITYLLLILGAALAAASAMWLFSGMRSPFARQAANPRMHMSKS*
Ga0126370_103419231 | 3300010358 | Tropical Forest Soil | SWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP*
Ga0126370_126086403 | 3300010358 | Tropical Forest Soil | AAETGQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP*
Ga0126378_124731352 | 3300010361 | Tropical Forest Soil | AAETAQAESYWLLLLGAALAAASATWFLVKMSPVYARRAERPRTL*
Ga0126377_127823761 | 3300010362 | Tropical Forest Soil | SWITYLLLILGAALAAACAFWFFSRTTSLFARRAANPHMSNS*
Ga0126379_112417392 | 3300010366 | Tropical Forest Soil | ESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP*
Ga0126381_1044516771 | 3300010376 | Tropical Forest Soil | STYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP*
Ga0126383_130736832 | 3300010398 | Tropical Forest Soil | NELDRAAETAQAESSWSTYLLLLLGAALAAASATWFLRRAAGPRMHTSEWQ*
Ga0126375_113208341 | 3300012948 | Tropical Forest Soil | SWSTYLLLIFAALAAASAMWFLVKMAPVYARRAAGPRMHTSERQ*
Ga0182041_106557991 | 3300016294 | Soil | STYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0182033_110355521 | 3300016319 | Soil | LDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0182035_109236311 | 3300016341 | Soil | ETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0182035_111557701 | 3300016341 | Soil | EVNELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0182032_111912541 | 3300016357 | Soil | AESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0182032_113417162 | 3300016357 | Soil | DRAAETAQAESSWSTYLLLLLGAALAAASAMWFLLRMASIFARRAANSRMHMSNP
Ga0182034_114720501 | 3300016371 | Soil | WSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0182037_114077492 | 3300016404 | Soil | AAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0182037_115498671 | 3300016404 | Soil | AESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0182037_116890372 | 3300016404 | Soil | AAATVSTESSWSTYLLLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0182037_117435001 | 3300016404 | Soil | LDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0066662_100844564 | 3300018468 | Grasslands Soil | DAGEAHEPDHAAATVESSWITYLLLILGAALAAASAMWLFSGMRSPFARQAANPRMHMSK
Ga0209239_10254184 | 3300026310 | Grasslands Soil | VSTESSWSTYLLLILGAALAAASATWFLVKMAPVYARRAAGPRTTSEWQ
Ga0209056_100308546 | 3300026538 | Soil | GEAHEPDHAAATVESSWITYLLLILGAALAAASAMWLFSGMRSPFARQAANPRMHMSKS
Ga0307278_101507572 | 3300028878 | Soil | WITYLLLILGAALAAASVMWFFSRMTSMLARRAANPRMHMSNS
Ga0318541_103888011 | 3300031545 | Soil | AETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318538_100641031 | 3300031546 | Soil | AAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318571_103550771 | 3300031549 | Soil | RAAATVSTESSWSTYLLLIFAALAAASATWFLVKMAPVYARRAAGPRMHTSERQ
Ga0318573_101017194 | 3300031564 | Soil | WSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318515_101705733 | 3300031572 | Soil | TYLLLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0318515_105291262 | 3300031572 | Soil | TYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0310915_106459161 | 3300031573 | Soil | ESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318542_100502481 | 3300031668 | Soil | ARKRNAAATVSTESSWSTYLLLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0318572_109605841 | 3300031681 | Soil | NEVNELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0318496_106251951 | 3300031713 | Soil | VNELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318493_103958481 | 3300031723 | Soil | LLLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0318493_106495763 | 3300031723 | Soil | YLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0318493_107703181 | 3300031723 | Soil | YLLLLLGAALAAASAMWFFLRMASIFARRAANSGMHMSNP
Ga0318500_100496061 | 3300031724 | Soil | SWSTYLLLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0318500_102169801 | 3300031724 | Soil | QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318501_100641611 | 3300031736 | Soil | AESSWSTYLLLLLGAALAAASAMWFLLRMASIFARRAANSRMHMSNP
Ga0318537_100606594 | 3300031763 | Soil | QAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318537_102789761 | 3300031763 | Soil | LLLLLGAALAAASAMWFFLRMASIFARRAANPRVHMSNP
Ga0318554_100705834 | 3300031765 | Soil | SWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318554_105992192 | 3300031765 | Soil | RAAETAQAESSWSTYLLLLLGAVLAAASAMWFLVKMAPVYARRAAGPRMHTSEWQ
Ga0318509_104746381 | 3300031768 | Soil | ESSWTTYLLLLLGAALAAASAMWFFLRMASIFARRAANPRMHMSNP
Ga0318509_105447333 | 3300031768 | Soil | VNELDRAAETGQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0318521_100466131 | 3300031770 | Soil | TAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318521_102037291 | 3300031770 | Soil | AQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318543_101864432 | 3300031777 | Soil | TETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318547_104285532 | 3300031781 | Soil | NELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318529_103012262 | 3300031792 | Soil | DRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318567_102366051 | 3300031821 | Soil | PNEVNELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318499_100615103 | 3300031832 | Soil | AETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0310917_106106193 | 3300031833 | Soil | AAETGQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0318517_100306951 | 3300031835 | Soil | AQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0306919_103030453 | 3300031879 | Soil | SWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0306919_107112871 | 3300031879 | Soil | SVSTKSSWSTYLLLIFAALAAASAMWFLVKMAPVYARRAAGPRMHTSERQ
Ga0318544_101762632 | 3300031880 | Soil | ELDRVAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0306925_116732731 | 3300031890 | Soil | AAATVSTSWSTYLLLILGAALAAASAMWFFVKMAPVYARRAAGPRMHMRNQ
Ga0318522_102334992 | 3300031894 | Soil | TYLLLLLGAVLAAASAMWFLVKMAPVYARRAAGPRMHTSEWQ
Ga0318551_108252601 | 3300031896 | Soil | ELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0306923_118953922 | 3300031910 | Soil | TYLLLILGAALAAASAVWFFVKMIPVYARRAAGPRMHMRNQWL
Ga0310912_101029831 | 3300031941 | Soil | AQVESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0310916_108602982 | 3300031942 | Soil | ELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0310916_113354202 | 3300031942 | Soil | SWITYLLLILGAALAAACAFWFFSRTTSLFARRAENPRSA
Ga0310913_106681411 | 3300031945 | Soil | AETTQAESSWTTYLLLLLGAALAAASAMWFFLRMASIFARRAANPRMHMSNP
Ga0310913_108572813 | 3300031945 | Soil | LDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0306926_118509501 | 3300031954 | Soil | STYLLLLLGAALAAASAMWFLLRMASIFARRAANSRMHMSNP
Ga0318530_102093913 | 3300031959 | Soil | DRAAATVSTESSWSTYLLLIFAALAAASATWFLVKMAPVYARRAAGPRMHTSERQ
Ga0306922_102490831 | 3300032001 | Soil | YLLLLLGAALAAASAMWFFLRMASIFARRAANSRMHMSNP
Ga0318507_100062391 | 3300032025 | Soil | VPDESSWITYLLLILGAALAAACAFWFFSRTTSLFARRAENPRSA
Ga0318549_103652361 | 3300032041 | Soil | LLILGAALAAASATWFLVKMPPVYARRAAGLRMHTSEWQ
Ga0318558_105709121 | 3300032044 | Soil | TYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0318506_104079861 | 3300032052 | Soil | SWSTYLLLIFAALAAASATWFLVKMAPVYARRAAGPRMHTSERQ
Ga0318533_103494891 | 3300032059 | Soil | SWSTYLLLLLGAALAAASAMWFFLRMASIFARRAANLRMHMSNP
Ga0318510_101734252 | 3300032064 | Soil | VNELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318553_105357243 | 3300032068 | Soil | AATVSTESFWSTYLLLIFAALAAASATWFLVKMAPVYARRAAGPRMHTSERQ
Ga0306924_101299841 | 3300032076 | Soil | AAAVPAETSWITYLLLILGAALAAASAVWFFSRMTPMFARRAANPRLRMSNS
Ga0306924_122909121 | 3300032076 | Soil | AAAVSTESSWSTYLLLILGAALAAASAVWFFVKMIPVYARRAAGPRMHMRNQWL
Ga0318518_101207051 | 3300032090 | Soil | ITYLLLILGAALAAASAVWFFSRMTPMFARRAANPRLRMSNS
Ga0318518_104944141 | 3300032090 | Soil | NELDRAAETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP
Ga0318577_105642172 | 3300032091 | Soil | AETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRAAGPRMHTS
Ga0307471_1042296082 | 3300032180 | Hardwood Forest Soil | SWSTYLLLILGAALAAASAMWLFVKMAPVYARRAAGPRMHMRNQ
Ga0310914_113217431 | 3300033289 | Soil | SAAVPDESSWITYLLLILGAALAAACAFWFFSRTTSLFARRAANPRMSNS
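
Some sequences in this table end with `*` and others do not. The sketch below assumes the trailing `*` marks a translated stop codon, a common convention in gene-caller output that this page does not confirm, and strips it before measuring length. The three entries are copied from the table; `clean_sequence` is a hypothetical helper name.

```python
def clean_sequence(raw):
    """Split a table sequence into (residues, has_stop_marker).

    Assumption: a trailing '*' is a stop-codon marker, not a residue.
    """
    has_stop = raw.endswith("*")
    return raw.rstrip("*"), has_stop


# Three family members copied from the Family Sequences table.
rows = {
    "AF_2010_repII_A01DRAFT_10254101": "AQAGSSWSTYLLLLLGAALAAASTMWFFLRMASIFARRAANSRMDMSNP*",
    "Ga0066395_109103622": "YLLLILGAALAATAVWFFTKLTAVYARRAAGPPMHMRSEW*",
    "Ga0318541_103888011": "AETAQAESSWSTYLLLLLGAALAAASAMWFFLRMASIFARRATNSRLHMSNP",
}

summary = {}
for protein_id, raw in rows.items():
    residues, has_stop = clean_sequence(raw)
    summary[protein_id] = (len(residues), has_stop)
    print(f"{protein_id}: {len(residues)} residues, stop marker: {has_stop}")
```

Counting residues without the marker keeps lengths comparable between entries that carry `*` and entries that do not.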




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"