NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F092819


Overview

Basic Information
Family ID: F092819
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 107
Average Sequence Length: 42 residues
Representative Sequence: MSAPSVQVETRGAVALVTLNRPESANTLNLQMAMDLLAA
Number of Associated Samples: 105
Number of Associated Scaffolds: 107

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 96.26 %
% of genes near scaffold ends (potentially truncated): 99.07 %
% of genes from short scaffolds (< 2000 bps): 85.98 %
Associated GOLD sequencing projects: 101
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.37


Most Common Taxonomy
Group: Bacteria (94.393 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (23.364 % of family members)
Environment Ontology (ENVO): Unclassified (24.299 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (44.860 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.
[Interactive alignment view (MSAViewer) of the 107 member sequences, positions 1-50; the sequence IDs and sequences are listed under Family Sequences.]



Structure & Topology

Predicted Topology & Secondary Structure

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 17.91%, β-sheet: 20.90%, Coil/Unstructured: 61.19%
[Feature Viewer: representative sequence MSAPSVQVETRGAVALVTLNRPESANTLNLQMAMDLLAA with tracks for α-helices, β-strands, coil, and secondary-structure confidence score.]

Predicted 3D Structure

[Interactive 3D structure viewer (PDBe Molstar) with a per-residue confidence (pLDDT) legend in bins 0-50, 51-70, 71-90, 91-100.]
pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
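
The cutoff quoted in the note above can be written as a one-line check. This is only an illustrative sketch: the function name `is_low_confidence` and the constant `PTM_CUTOFF` are hypothetical and not part of NMPFamsDB; only the 0.7 threshold and the 0.37 score come from this page.

```python
# Hypothetical helper illustrating the "pTM < 0.7" rule quoted on this page.
PTM_CUTOFF = 0.7  # threshold stated in the Low Quality Model note


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score is strictly below the cutoff."""
    return ptm_score < cutoff


# Family F092819 reports a pTM-score of 0.37.
print(is_low_confidence(0.37))
```

With the score reported here (0.37) the check flags the model as low confidence; a score exactly equal to 0.7 would not be flagged under this strict comparison.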



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart of family members by NCBI taxonomy. Legend entries: All Organisms, Unclassified. Slice labels: 94.4%, 5.6%.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart of family members by habitat type. Legend entries (duplicates merged): Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Compost; Surface Soil; Peatlands Soil; Grasslands Soil; Forest Soil; Hardwood Forest Soil; Tropical Peatland; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Palsa; Plant Litter; Miscanthus Rhizosphere; Ectomycorrhiza; Corn Rhizosphere; Avena Fatua Rhizosphere; Green-Waste Compost; Boreal Forest Soil. Slice labels, in extraction order and not matched to legend entries: 2.8%, 2.8%, 5.6%, 2.8%, 4.7%, 2.8%, 23.4%, 4.7%, 5.6%, 4.7%, 3.7%, 2.8%, 2.8%, 2.8%.]



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GBAN_1578580 | 2029527000 | Green-Waste Compost | MSTPTVNVETRGAVALVTLNRPDVSNTLNLQVAMDLLA
ICChiseqgaiiDRAFT_24295011 | 3300000033 | Soil | MSAPTVEVQSRGAVAIVTINRPEVSNTLNLQTAMDLL
JGI1027J12803_1012336581 | 3300000955 | Soil | MSAPVVQVETRGAVALVILNRPESGNALNLQVAMDLLAAAMTCARNAAV
Ga0068869_1020111302 | 3300005334 | Miscanthus Rhizosphere | MSAPTVQVESRGAVTVVTLNRPDSSNTLNIQMAMDLLAAAMT
Ga0070709_110841991 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAPCVQVETRGTVAVVTLNRPDTSNAINLQTAMDLLAAAMTCARNNT
Ga0070713_1023462732 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MSTPSVQVETRGAVALVTLNRPDGANTLNLEMAMDLLAAAMTCGR
Ga0068867_1008783791 | 3300005459 | Miscanthus Rhizosphere | MSTTTVEVESRGAVAVVTINRPESGNALSLEVGMDLMAAA
Ga0066697_103922383 | 3300005540 | Soil | MAAATVEVETRGAVAIVTLNRPQSANTLNLQMAMDL
Ga0070733_109609592 | 3300005541 | Surface Soil | MAAPAVQVETRGAVALVTLNRPDSGNALNLQMAMDLMAA
Ga0070686_1003056503 | 3300005544 | Switchgrass Rhizosphere | MSAPLVEMETRGSVAVITLNRPDLSNTLNLQMAMDLLAAAMTCGRNSAVR
Ga0066700_100125761 | 3300005559 | Soil | MAAASVEVETRGAVALVTLNRPESANTLNLQMAMDLLAAAMTCARN
Ga0070664_1000233401 | 3300005564 | Corn Rhizosphere | MSTSSVQVDTRGAVAVVTLNRPESSNTLNLEMAMDLLAAAMTCGRNPA
Ga0066706_104023471 | 3300005598 | Soil | MAPASVEVETRGAVALVTLNRPQSSNTLNLRMAMDLLAAAMT
Ga0066903_1013398504 | 3300005764 | Tropical Forest Soil | MSAPAVQVETRGAVALVTLNRPESGNALNLQVAMDLLA
Ga0068863_1008890501 | 3300005841 | Switchgrass Rhizosphere | MSAPTVQVESRGAVAIVTINRPEVSNTLNLQTAMDLLAAAMTCGRN
Ga0070715_100125551 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | VSAPTVQVETRGAVALVTLNRPENANTLNLQMAMD
Ga0075021_102813711 | 3300006354 | Watersheds | VSTSSVQVETRGAVALVTLNRPESANTLNLEMAMDLLAAALTCARN
Ga0066710_1021921163 | 3300009012 | Grasslands Soil | MAAASVEVETRGAVALVTLNRPESANTLNLQMAMDLL
Ga0105241_116796301 | 3300009174 | Corn Rhizosphere | MSAPLVEMETRGSVAVITLNRPDLSNTLNLQMAMDLLAAAMTCGRNS
Ga0105237_109702651 | 3300009545 | Corn Rhizosphere | MSTSSVQVETRGAVAVVTLNRPDSSNTLNLEMAMDLLAAAMTCG
Ga0126380_119855331 | 3300010043 | Tropical Forest Soil | MATVTVEVETRGPVALVTLNRPDSANTLNLQVAMDLL
Ga0126373_108654011 | 3300010048 | Tropical Forest Soil | VSAANVQVETRGAVALVTLNRPEHSNTLNLQLAMDLLAA
Ga0126370_109979041 | 3300010358 | Tropical Forest Soil | MSAPSVQAETHGAVALVTLNRPEHSNTLNLQMAMDLLAAAMACARN
Ga0136449_1019481233 | 3300010379 | Peatlands Soil | MSAPSVQVETHGAVALVTLNRPEHSNTLNLQMAMDLLAAAMACAR
Ga0134124_119474831 | 3300010397 | Terrestrial Soil | MSAPTVQVESRGAVTVVTLNRPDSSNTLNIQMAMDLL
Ga0134121_100497156 | 3300010401 | Terrestrial Soil | MSAPVVQVETRGAVALVILNRPESGNALNLQVAMDLLAA
Ga0126357_13336202 | 3300010864 | Boreal Forest Soil | MSTSSVQVETRGAVALVTLNRPDSANTFNLEMAMDLLAAAM
Ga0150983_147220673 | 3300011120 | Forest Soil | MSAPVVQVDTRGAVALVTLNRPDSGNALNLQVAMDLLAAAMTCAR
Ga0137399_104522441 | 3300012203 | Vadose Zone Soil | MAAASIEVETRGAVALVTLNRPESANTLNLQMAMDL
Ga0137384_104365231 | 3300012357 | Vadose Zone Soil | MAAATVEVETRGAVAVVTLNRPQSANTLNLQMAMDLLAAAMACA
Ga0150984_1167203302 | 3300012469 | Avena Fatua Rhizosphere | MSAPTVQVESRGAVAIVTINRPDVSNTLNLQTAMDLLAAAMTCGRNSGVR
Ga0137397_103579441 | 3300012685 | Vadose Zone Soil | MSTPSVQVETRGPVALVRLNRPESSNAINLQMAMDLLAAAMTC
Ga0137416_105434123 | 3300012927 | Vadose Zone Soil | MAAASIEVETRGAVALVTLNRPESANTLNLQMAMDLLAAAMTCARNAA
Ga0164305_100098021 | 3300012989 | Soil | VSAPTVQVETRGAVALVTLNRPENANTLNLQMAMDLLAAALACARNAAV
Ga0163163_124028181 | 3300014325 | Switchgrass Rhizosphere | MSTPSVQVETRGAVALVTLNRPDSANTLNLETAMDLLAAAMTCARNPA
Ga0182032_116053471 | 3300016357 | Soil | MSAPAVQVETRGAVALVTLNRPESGNAINLQVAMDLLAAAMT
Ga0182037_100468926 | 3300016404 | Soil | MSAPAVQVETRGAVALVTLNRPESGNALNLQVAMDLLAAAMT
Ga0182744_12492222 | 3300017553 | Compost | MSAETVEVESRGAVAIVTINRPDASNTLNLQTAMDLLAAAMTCGRNSAV
Ga0187824_101437963 | 3300017927 | Freshwater Sediment | MSAPSVQVETRGAVALVTLNRPESGNALNLRMAMDLLAAAMTCAR
Ga0187825_101986121 | 3300017930 | Freshwater Sediment | MSAPSVQVETRGAVALVTLNRPESGNALNLRMAMDLLAAAMT
Ga0187809_103293871 | 3300017937 | Freshwater Sediment | MSAPSVQVETRGAVALVTLNRPESGNALNLRMAMDLL
Ga0187778_101761821 | 3300017961 | Tropical Peatland | MSASTVSVDTRGAVAIITLNRPDNANTLNLQMGMDLLA
Ga0187776_104603261 | 3300017966 | Tropical Peatland | VSVPSVQLETRGAVALVTLNRPDHSNTLNLQMAMDLLAAAMAC
Ga0187783_113865541 | 3300017970 | Tropical Peatland | VSAPSVQVETRGAVALVTLNRPDNGNAINLQMAMDLLAAALTCAGNTSVR
Ga0187781_110682042 | 3300017972 | Tropical Peatland | MAAPAVQVETRGAVALVTLNRPDSGNALNLQMAMDLLA
Ga0187780_100416581 | 3300017973 | Tropical Peatland | MSAASVQVETHGAVALVTLNRPEHSNTLNLQMAMDLLAAAMACARNAAVRA
Ga0190272_126007871 | 3300018429 | Soil | MSTSTIDVETRGAVALVTINRPDSSNTLNLQVAMDLLAAA
Ga0066655_100167231 | 3300018431 | Grasslands Soil | MAAATVEVETRGAVALVTLNRPESANTLNLQMAMDLLAAAMTCARNAA
Ga0210399_114840052 | 3300020581 | Soil | MSAPSVQVETHGAVALVTLNRPEHSNTLNLQMAMDLLA
Ga0210406_108536201 | 3300021168 | Soil | MSTSSVQVETRGAVALVTLNRPDSSNTLNLEMAMDLLAAAMTCGRNPA
Ga0210408_110852372 | 3300021178 | Soil | VSTASVQVETHGAVALVRLNRPEHANTLNLQMAMDLLAAAMACAR
Ga0210388_110703063 | 3300021181 | Soil | VAAPTVQVETRGAVALVTLNRPESANTLNLQMGMDLLAAALACA
Ga0210393_101670631 | 3300021401 | Soil | VAAPTVQVETRGAVALVTLNRPESANTLNLQMGMDLLAAALAC
Ga0210385_113750111 | 3300021402 | Soil | MSAPTVQVETRGAVALVTLNRPDSANTLNLQMAMDLLA
Ga0210389_115588271 | 3300021404 | Soil | VSAPTVQVETRGAVALVTLNRPESANTLNLQMGMDLLAAAL
Ga0210383_102655271 | 3300021407 | Soil | MSAPSVQVETHGAVALVTLNRPEHSNTLNLQMAMDLLAAAMAC
Ga0242658_12234252 | 3300022530 | Soil | MSAPNVHVETRGPVALISLNRPESANTINLQTAMDLLAA
Ga0247758_10434823 | 3300023079 | Plant Litter | MSAPSVQVESRGAVAVVTINRPDVSNTLNLQTAMDLLAA
Ga0247771_11506952 | 3300023267 | Plant Litter | MSAPSVQVESRGAVAVVTINRPDVSNTLNLQTAMDL
Ga0247784_10163264 | 3300023270 | Plant Litter | MSAPTVQVESRGPVAVITLNRPDVSNTLNLQTAMDLLAAAMTCGRNSAV
Ga0209341_103467983 | 3300025325 | Soil | MSQSTVNVETRGAVALVTIDRPDDANTLNVQVGMD
Ga0207680_113744881 | 3300025903 | Switchgrass Rhizosphere | MSTSSVQVETRGAVAVVTLNRPESSNTLNLEMAMDLLAAAM
Ga0207671_105180223 | 3300025914 | Corn Rhizosphere | MSAPSVQVETRGRVALVTLNRPDSSNTINLQMAMDLLAAAM
Ga0207700_117797062 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MSTPSVQVETRGAVALVTLNRPDGANTLNLEMAMDLLAAAMTCGRNP
Ga0207711_104367241 | 3300025941 | Switchgrass Rhizosphere | VSAPTVQVETRGAVALVTLNRPENANTLNLQMAMDLLAAALACARNAAVR
Ga0209236_11469131 | 3300026298 | Grasslands Soil | MAAATVEVETRGAVAVVTLNRPQSANTLNLQMAMDLLAAAMTC
Ga0179593_11437291 | 3300026555 | Vadose Zone Soil | MAPASVEVETRGAVALVTLNRPQSSNTLNLQMAMDLLAAAMTCA
Ga0179593_17793991 | 3300026555 | Vadose Zone Soil | MAPASVEVRTRGAVAMGTLKPHAAPKRPLQMSMDLLAAAMTCAATPR
Ga0207505_1037942 | 3300027459 | Soil | VAAPSVQVETRGAVALVTLNRPDSANTLNLQMAMDLLA
Ga0209420_11999911 | 3300027648 | Forest Soil | MSTATVNVETRGAVAVVTLNRPAQSNTLSLQMGMDLLAA
Ga0209380_103479163 | 3300027889 | Soil | MATATVEVETQGAVALVTLNRPDSNNTLNLQMAMDLLAAS
Ga0209168_101929221 | 3300027986 | Surface Soil | MSVPSVQVETRGPVALVTLNRPDSANTINLQMAMDL
Ga0308309_116045362 | 3300028906 | Soil | MSTATVDVETRGAVAIVTLNRPAQSNTLSLQMGMDLLAAAMTCARSTAV
Ga0311339_115099561 | 3300029999 | Palsa | MTNSSVYVETSGAAALVTLNRPDSANTLDLQAAMDLLSVAMTCARNPTVRV
Ga0302179_101101581 | 3300030058 | Palsa | MSTATVDVETRGAVAIVTLNRPAQSNTLSLQMGMDLLAAAMTCARS
Ga0302311_101523081 | 3300030739 | Palsa | MSAPTVQVETRGPVALVRLNRPESSNAINLQMAMDLLAAAMT
Ga0170834_1047040383 | 3300031057 | Forest Soil | VSTASVQVETHGAVALVTLNRPEHSNTLNLQMARA
Ga0170823_149386721 | 3300031128 | Forest Soil | MSTPSVQVETRGAVALVTLNRPESANTLNLQMGMDL
Ga0307508_103036574 | 3300031616 | Ectomycorrhiza | MSTPSVQVETRGPVALVRLNRPESSNTINLQMAMDLLAAAMTCARNNNV
Ga0318496_106416942 | 3300031713 | Soil | VSAPTVQVETRGSVALVTFNRPESGNTLNLQMAMDLLAAAMTCARNAA
Ga0318493_101079521 | 3300031723 | Soil | MSAPAVQVETRGAVALVTLNRPESGNALNLQVAMDLLAAAMTCARN
Ga0318493_107236421 | 3300031723 | Soil | MSAPAVQVETHGAVALVILNRPESGNAINLQVAMDLL
Ga0307468_1009216111 | 3300031740 | Hardwood Forest Soil | VSAPTVQVETRGAVALVTLNRPENANTLNLQMAMDL
Ga0306918_115123561 | 3300031744 | Soil | MSAPAVQVETRGAVALVTLNRPESGNALNLQVAMDLLAAAM
Ga0318492_101463524 | 3300031748 | Soil | MSAPAVQVETRGAVALVILNRPESGNALNLRVAMDLLAAAMTCARNAA
Ga0318494_102392314 | 3300031751 | Soil | MSAPAVQVETRGAVALVILDRPESGNALNLQVAMDLLA
Ga0318535_105428692 | 3300031764 | Soil | MSAATVQVETHGAVALVTLNRPDNSNTLNLQMAMDLLAAALTCARNAAVR
Ga0318509_100563551 | 3300031768 | Soil | MSAPVVQVETRGAVALVTLNRPESGNALNLQVAMDLL
Ga0318521_102653421 | 3300031770 | Soil | VSAPTVQVETRGSVALVTFNRPESGNTLNLQMAMDLLAA
Ga0310917_101471781 | 3300031833 | Soil | MSAPAVQVETRGAVALVTLNRPESGNALNLQVAMDLL
Ga0318517_102018031 | 3300031835 | Soil | MSAPAVQVETHGAVALVILNRPESGNAINLQVAMDLLAAAMTCARN
Ga0318511_100715451 | 3300031845 | Soil | MSAPAVQVETHGAVALVILNRPESGNAINLQVAMDLLAAAMTCARNAAVRA
Ga0306925_111362121 | 3300031890 | Soil | MSAPAVQVETRGAVALVILNRPESGNALNLQVAMD
Ga0318522_103453411 | 3300031894 | Soil | MSAPAVQVETRGAVALVILNRPESGNALNLRVAMDLL
Ga0310913_103476054 | 3300031945 | Soil | MSAPAVQVETRGAVALVILNRPESGNALNLRVAMDLLAAAMTC
Ga0310909_110594211 | 3300031947 | Soil | MSAPAVQVETRGAVALVTLNRPESGNAINLQVAMDLLAAAMTCARNAAVR
Ga0307479_111064143 | 3300031962 | Hardwood Forest Soil | MSAPVVQVDTRGAVALVTLNRPDSGNALNLQVAMDL
Ga0318549_100213055 | 3300032041 | Soil | MSAPAVQVETHGAVALVILNRPESGNAINLQVAMDLLAAAMTCARNAAVR
Ga0318533_102508794 | 3300032059 | Soil | MSAPVVQVETRGAVALVTLNRPDSGNALNLQVAMDLLAAA
Ga0306920_1002761601 | 3300032261 | Soil | MSAPAVQVETRGAVALVILNRPDSGNALNLQVAMDLLAAAMT
Ga0335085_100731377 | 3300032770 | Soil | MSAPSVQVETRGAVALVTLSRPESANTLNLQMAMDLLAAAL
Ga0335079_116335293 | 3300032783 | Soil | MSASTVEVDTRGAVTIITLNRPDDGNALNLQMGMD
Ga0335080_102040745 | 3300032828 | Soil | MSAPSVQVETRGAVALVTLNRPESANTLNLQMGMDLLAAALACARNAEVR
Ga0335081_104872404 | 3300032892 | Soil | MSAPSVQVETRGAVALVTLNRPESANTLNLQMAMDLLAA
Ga0335072_109405101 | 3300032898 | Soil | MSAPCVHVETRGPVALVRLNRPESANTIDLQTAMDLLAAAMTCSRN
Ga0335076_100578511 | 3300032955 | Soil | VSVASVQVETHGAVALVTLNRPEHSNTLNLQMAMDLLAAA
Ga0310811_114560821 | 3300033475 | Soil | MSAPSVQVETRGPVALVTLNRPESSNTLNLQMAMDLLAA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice