NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F089321

Metagenome / Metatranscriptome Family F089321

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F089321
Family Type Metagenome / Metatranscriptome
Number of Sequences 109
Average Sequence Length 38 residues
Representative Sequence MRLFIALDIDDVIRERIARFVEGVRNFAPDARWVKPE
Number of Associated Samples 101
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 50.93 %
% of genes near scaffold ends (potentially truncated) 96.33 %
% of genes from short scaffolds (< 2000 bps) 89.91 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.248 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(7.339 % of family members)
Environment Ontology (ENVO) Unclassified
(18.349 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.459 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50
1Ga0066816_10156211
2Ga0066671_101993952
3Ga0070714_1020177781
4Ga0070713_1021744941
5Ga0070699_1006330982
6Ga0070679_1003475812
7Ga0066697_104252761
8Ga0066692_109769722
9Ga0066704_101493472
10Ga0066705_100778601
11Ga0066903_1028871461
12Ga0075275_10421561
13Ga0075277_10943361
14Ga0075279_100284232
15Ga0066789_100477191
16Ga0075019_109506071
17Ga0070716_1011634742
18Ga0075021_102022101
19Ga0066709_1040162362
20Ga0116218_15731821
21Ga0105237_100977423
22Ga0116122_10351621
23Ga0116216_103679422
24Ga0126374_107902552
25Ga0116223_102682172
26Ga0126373_102706971
27Ga0126373_108101362
28Ga0126370_110702782
29Ga0126372_110668172
30Ga0126379_136846121
31Ga0126383_127410952
32Ga0137393_114217532
33Ga0137360_111966402
34Ga0134087_102690632
35Ga0164305_118123681
36Ga0181523_100697891
37Ga0182039_118741762
38Ga0182038_118366742
39Ga0181505_102377761
40Ga0187818_100461513
41Ga0187778_109370172
42Ga0187776_105419901
43Ga0187780_102816691
44Ga0187822_102631711
45Ga0187816_100400591
46Ga0187816_102336331
47Ga0187810_101784322
48Ga0187788_103138062
49Ga0187863_100082338
50Ga0187766_100498931
51Ga0187773_106180321
52Ga0187770_105533881
53Ga0066667_110097381
54Ga0210401_101225233
55Ga0210401_102958711
56Ga0210406_104376942
57Ga0210385_114663702
58Ga0126371_131809052
59Ga0242657_12379101
60Ga0228598_10186151
61Ga0210114_10991392
62Ga0207642_103770171
63Ga0207645_109482861
64Ga0207702_119681401
65Ga0207648_110017991
66Ga0209761_10743453
67Ga0209806_11361272
68Ga0209807_13377481
69Ga0209058_12114801
70Ga0209626_11679221
71Ga0209910_100328402
72Ga0209580_100912752
73Ga0209167_106653481
74Ga0209068_101891722
75Ga0209068_107451491
76Ga0209624_108064102
77Ga0209067_101710002
78Ga0302147_100573722
79Ga0302233_102663172
80Ga0302219_100173803
81Ga0302266_102589361
82Ga0308309_106418491
83Ga0311329_110253651
84Ga0302150_103589641
85Ga0311338_102025303
86Ga0302300_11087452
87Ga0302286_105563102
88Ga0311370_103504723
89Ga0318572_109383932
90Ga0307474_111112632
91Ga0307475_109944532
92Ga0318546_101267651
93Ga0307478_115483661
94Ga0310917_103710231
95Ga0307410_113283742
96Ga0307410_113707682
97Ga0306926_107670711
98Ga0307470_102273932
99Ga0307471_1006138181
100Ga0307471_1021216051
101Ga0348332_108410887
102Ga0335085_101076411
103Ga0335079_110538141
104Ga0335079_117307882
105Ga0335079_118470072
106Ga0335078_109262911
107Ga0335075_105629482
108Ga0335083_103111771
109Ga0326728_100935885
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 26.15%    β-sheet: 0.00%    Coil/Unstructured: 73.85%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MRLFIALDIDDVIRERIARFVEGVRNFAPDARWVKPESequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
97.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog
Peatland
Freshwater Sediment
Natural And Restored Wetlands
Watersheds
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Rice Paddy Soil
Tropical Peatland
Thawing Permafrost
Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Agricultural Soil
Fen
Palsa
Bog
Peat Soil
Plant Litter
Corn Rhizosphere
Miscanthus Rhizosphere
Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
4.6%4.6%7.3%7.3%7.3%5.5%6.4%6.4%4.6%3.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066816_101562113300005158SoilMRLFIALDIDDQIRQRIGRFLEGVSGFSPDARWVPLESLHITL
Ga0066671_1019939523300005184SoilMRLFVALDIADEIRQRIARFMEGVREFAPEPGWVREESLH
Ga0070714_10201777813300005435Agricultural SoilMRLFVALDIDERIARFMEGVRNFASDAGWAKEESLHVTLKF
Ga0070713_10217449413300005436Corn, Switchgrass And Miscanthus RhizosphereMRLFVALDIDDPIRERIARFVVGLRNFAPDARWAKRNRCT*
Ga0070699_10063309823300005518Corn, Switchgrass And Miscanthus RhizosphereLTMRIFIAVDIDDVIRERLRLFMEGVRGFAVDARWVRSE*
Ga0070679_10034758123300005530Corn RhizosphereMRLFVALDIDDPIRERIARFVEGVRNFAPDGRWAKEESLHV
Ga0066697_1042527613300005540SoilMRIFVALDLDDDIRQRIQTFMDGVQNFSPDARWVNAASLHVT
Ga0066692_1097697223300005555SoilMRIFIALDIDDAIRERIARFVEGVSSFAPDARWAKPE
Ga0066704_1014934723300005557SoilMRLFVALDIADEIRQRIARFMEGVREFAPEPRWVREESLHV
Ga0066705_1007786013300005569SoilMRVFVALDVDNAIREQLQLFMDGVSGFAPDARWIRPESMHVTL
Ga0066903_10288714613300005764Tropical Forest SoilMRLFVALDIDQSIRERIVRFVDGLHNFAPDARWMKPESMHV
Ga0075275_104215613300005892Rice Paddy SoilMRLFVAFDIDPDIRERLARFLDGVREFAPDARWVRAESL
Ga0075277_109433613300005895Rice Paddy SoilMRLFVALDIADEVRERIGRYVEGVQNFAPEARWVKE
Ga0075279_1002842323300005903Rice Paddy SoilMRLFIALDIDDAIRERLAKFVEGVRGFAPDVRFVGVESLHIT
Ga0066789_1004771913300005994SoilMRLFIALDIDDAIRDRITRFVEGVTGFAPDARWAKPE
Ga0075019_1095060713300006086WatershedsMRLFLALDIDEAIRQRIERFLEGVHPFAPDARWAKPE
Ga0070716_10116347423300006173Corn, Switchgrass And Miscanthus RhizosphereMRLFIALEIDGAIRERVARFIEGVGPFAPDARWVTSESLHI
Ga0075021_1020221013300006354WatershedsMRLFIALDIDGAIRERIVRFVDGVREFAPDARWVPP
Ga0066709_10401623623300009137Grasslands SoilMRIFIALDIDDSIRERIKRFMEGVRGFDPEARWVRH
Ga0116218_157318213300009522Peatlands SoilMRLFIALDIADAVRERLARFTEGVQAFAPDARWAK
Ga0105237_1009774233300009545Corn RhizosphereMRLFVALDIEETIRERIASFVKEVCPLAPGVRWVASEPCT*
Ga0116122_103516213300009639PeatlandMRLFIALDIDDAIRERIARFIEGVQGFAPEARWVKP
Ga0116216_1036794223300009698Peatlands SoilMRLFIALDIDDAIRERIARFVEGVSGFAPDARWAKPE
Ga0126374_1079025523300009792Tropical Forest SoilMAAFMRLFIALDIDDGIRERISRFVESVRSFSPDARW
Ga0116223_1026821723300009839Peatlands SoilMRLFIAVDIDDAIRERIARFIEGVQGFAPDARWVK
Ga0126373_1027069713300010048Tropical Forest SoilMRLFIALEIDEAIRQRIARFTEGVRGFAPDARWVKEESLH
Ga0126373_1081013623300010048Tropical Forest SoilMRIFIGLDLENSIRERIRRFMDGVRGFAPDARWTRPESL
Ga0126370_1107027823300010358Tropical Forest SoilMRIFVALDIDEDIRRRIVGFVDDLRPYARDARWVKPES
Ga0126372_1106681723300010360Tropical Forest SoilMRLFVALDIDDLIRERIARFVEGVCNFAPEARWVKPESLH
Ga0126379_1368461213300010366Tropical Forest SoilMRIFIALDLDDAIRERIDRFIEGVRGFAPGARWVLPES
Ga0126383_1274109523300010398Tropical Forest SoilMRVFIALDIDQSIRERITRFLDGVREFAPDARWVRTE
Ga0137393_1142175323300011271Vadose Zone SoilMRIFIALDIDDAIRDRISRFMDGVREFAPDARWVRP
Ga0137360_1119664023300012361Vadose Zone SoilMRLFIALDIDDPIRERITRFADEVRNFSPDARWVKL
Ga0134087_1026906323300012977Grasslands SoilMRIFVALDLDDGIRQRIQRFMDGVQNFAPDARWVNAAS
Ga0164305_1181236813300012989SoilLLNVRLFIALDIDEEIRQRIGRFLDGVSGFAADARWV
Ga0181523_1006978913300014165BogMRLFVALDIDGAIRGRIAQFMDGMRGFAPDARWVSAES
Ga0182039_1187417623300016422SoilMRLFVALDIDDAIRERIVRFVEVVHPFAPDARWVKPESMHVTL
Ga0182038_1183667423300016445SoilMRLFLALDIDDAIRDRLTRFLEGVRNFAPDARWVKPESL
Ga0181505_1023777613300016750PeatlandMRLFIALDIDDAIRERIARFLEGVSGFVPDARWAKPES
Ga0187818_1004615133300017823Freshwater SedimentMRLFVALDIDSAIRAKIAQFMEGVREFAPDARRISAESL
Ga0187778_1093701723300017961Tropical PeatlandMRLFIALDIDDAIRECIARFVEGVQGFAPDARWVKPES
Ga0187776_1054199013300017966Tropical PeatlandMRLFVALEIDPEIRARIAQFMDGVREFAPDARWVSAES
Ga0187780_1028166913300017973Tropical PeatlandMRLFVALDIDAEIRARIAQFMDGVCAFAPDARWVSAESLHLTLK
Ga0187822_1026317113300017994Freshwater SedimentMRLFIALDIDDVIRERIARFVEGVRNFAPDARWVKPE
Ga0187816_1004005913300017995Freshwater SedimentMRLFIALDLDPSIRHRIAQFMDGVRGFAPDARWVSAE
Ga0187816_1023363313300017995Freshwater SedimentMRLFVALDIDDGIRERITRFVDGVRNFAPDARWMKPESL
Ga0187810_1017843223300018012Freshwater SedimentMRLFIALDLDDAIRERIARFVEGVSNFAPDARWVKPESLH
Ga0187788_1031380623300018032Tropical PeatlandMRLFLALDIDPEIRSRIAEFMDGVRGFAPDARWVS
Ga0187863_1000823383300018034PeatlandMRLFLALDIDDAIRERITRFVDGVRNFSPDARWMQPE
Ga0187766_1004989313300018058Tropical PeatlandMRLFVALDLHEEIRQRIARFVEGVGEFAPQPRWVSPQSLH
Ga0187773_1061803213300018064Tropical PeatlandMRLFVALDIDPPIRRRIAQFMDGVREFAPDARWVSAESL
Ga0187770_1055338813300018090Tropical PeatlandMRIFIALDIDDAIRERIARFMDGVREFAPDARWVKPE
Ga0066667_1100973813300018433Grasslands SoilMFAMRLFVALDIDEAIRERITRFLDAVEDLAPDARW
Ga0210401_1012252333300020583SoilMRLFVAFDIDDNIRDRIVRFLDGVRGFAPDARWARPESL
Ga0210401_1029587113300020583SoilMRLFIALDIDDAIRGRIARFVEGVSGFALDARWAKPESMHV
Ga0210406_1043769423300021168SoilMRLFIALDIDDGIRAQIAQFIEAVQAFAPEARWMKPESLLV
Ga0210385_1146637023300021402SoilMRLFIALDIDEAVRERIARFVEGVTGFAADARWMRPE
Ga0126371_1318090523300021560Tropical Forest SoilMRLFIALDIDHSIRERIARFMEGVRNFAPDARWMKEE
Ga0242657_123791013300022722SoilMRIFVALDIDAAIRQRIQRFMEGVSGFAPDARWVR
Ga0228598_101861513300024227RhizosphereMRLFIALDIDDEIRERIARFAEGVSGFAPDARWARPDSL
Ga0210114_109913923300025795Natural And Restored WetlandsMRLFVGIDIEPAIRERISKFVEGVRNFAPDVRWVNAETFH
Ga0207642_1037701713300025899Miscanthus RhizosphereMRLFVALDIDDAIRERIALFQDGVGGFAPDAKWVRAESLHITL
Ga0207645_1094828613300025907Miscanthus RhizosphereMRLFVALDLADPIRERIQQFMEGVRGFAPDVRWVTPESL
Ga0207702_1196814013300026078Corn RhizosphereMRLFIALDIDEEIRRRIERFVEGVRGFAPDVRFVGPQSFHVTLK
Ga0207648_1100179913300026089Miscanthus RhizosphereMRIFVALDIDDAIRSRIQRFMEGVQEFAPDVRWVRP
Ga0209761_107434533300026313Grasslands SoilMRIFIALDIDDSIRERIKRFMEGVRGFDPEARWVRHESLHIT
Ga0209806_113612723300026529SoilMRIFIALDVEDSIRQRIARFMEGVRGFAPDVRWVR
Ga0209807_133774813300026530SoilMRVFVALDVDNAIREQLQLFMDGVSGFAPDARWIRP
Ga0209058_121148013300026536SoilMRIFVALDLDDGIRQRIQRFMDGVQNFAPDARWVNA
Ga0209626_116792213300027684Forest SoilMRLFVALDVDDAIRGRIAGFMDGVRGFAPDARWLE
Ga0209910_1003284023300027803Thawing PermafrostMRLFVALDIDDAIRSRIARFLDGVREFAPDARWARPESL
Ga0209580_1009127523300027842Surface SoilMRLFIALDIDDAIRERLTGFLEGVHNFAPDARWVKPESL
Ga0209167_1066534813300027867Surface SoilMRIFVALDIDDAIRQRILRFMEGVSGFAPDARWVRL
Ga0209068_1018917223300027894WatershedsMRLFIALDIDGAIRERIVRFVDGVREFAPDARWVPPE
Ga0209068_1074514913300027894WatershedsMRIFVALHIDEAIRERIQRFMDGVRGFAADAHWARPE
Ga0209624_1080641023300027895Forest SoilMRLFVALDINDDIRNRIARYLEGVRGFAPAVRWMRS
Ga0209067_1017100023300027898WatershedsMRLFLALDIDDAIRARISRFVEGVRNFAPDARWAKEE
Ga0302147_1005737223300028566BogMRLFVALDIDDAIRSRIARFLDGVREFAPDARWAR
Ga0302233_1026631723300028746PalsaMRLFIALDITDAIRGRIARFVEGVTGFAPDARGAK
Ga0302219_1001738033300028747PalsaMRLFVALDIDDVIRSRIARFLDGVREFAPDARWARSESL
Ga0302266_1025893613300028779BogMRLFVALDIDEAIRVRIARFLDGVRGFAPEARWVRIE
Ga0308309_1064184913300028906SoilMRLFVALDIDDGIRSRIARFLDGVRGFAPDVRWARPEAL
Ga0311329_1102536513300029907BogMRLFIALDLDNEIRNRIARFLDGVCEFASDARWARP
Ga0302150_1035896413300029956BogMRLFVALDIDDDIRGSIARFIDEFRDVAAQARWVKPESLH
Ga0311338_1020253033300030007PalsaMRLFVALDLDDNLRSRIVRFVEGVRGFAPEARWARPESL
Ga0302300_110874523300030042PalsaMRLFVALDIDDVIRSRIARFLDGVREFAPDARWAR
Ga0302286_1055631023300030047FenMRIFVALDIEDIISQRIARFMDGVREFAPDARWVR
Ga0311370_1035047233300030503PalsaMRLFIAVDLDDAIRERISLFMEGVRPFAPDARWLKPESL
Ga0318572_1093839323300031681SoilMRLFIALDIDDAIRERITRFVEGVRSFAPDGRWVKPES
Ga0307474_1111126323300031718Hardwood Forest SoilMRVFVALDVDDAIRSRIARFLDGVRGFAPDARWVKPES
Ga0307475_1099445323300031754Hardwood Forest SoilVRLFIALDIDDAIRERMTGFMDGLRGFAPDVRWVRTE
Ga0318546_1012676513300031771SoilMRIFIALDLDDPIRERIDHFIEGVRGFAPDARWVLPESLHI
Ga0307478_1154836613300031823Hardwood Forest SoilMRLFIALDIDEAVRERIARFVEGVSGFAADARWMRPES
Ga0310917_1037102313300031833SoilMRIFIALDLDDPIRERIDHFIEGVRGFAPDARWVLPESLH
Ga0307410_1132837423300031852RhizosphereMRLFVGIDIEPAIRERISKFVDGVRNFAPDVRWVNPETFHV
Ga0307410_1137076823300031852RhizosphereMRLFVGIDIEPAIRERISKFVEGVRNFAPDVRWVNVET
Ga0306926_1076707113300031954SoilMRLFLALDIDDAIRDRLTRFLEGVRNFAPDARWVKPE
Ga0307470_1022739323300032174Hardwood Forest SoilMRLFVALDIDDSIRSRITRFLDGLRGFAPDARWVRSES
Ga0307471_10061381813300032180Hardwood Forest SoilMRIFVALDIDDAIRNRIQRFMDGVRGFAPDARWVR
Ga0307471_10212160513300032180Hardwood Forest SoilMRIFIALDIEDVVRDRIRRFMDGVREFAPDARWVR
Ga0348332_1084108873300032515Plant LitterMRLFIALDIDDEIRERIARFAEGVSGFAPDARWARPDS
Ga0335085_1010764113300032770SoilMRLFVALDIDNAIRERIVRFVEGVNPFAPEARWLKPESMHVT
Ga0335079_1105381413300032783SoilMRLFIALDIVDAIRGRIGRFLEGVRNFAPDARWVRDESLHV
Ga0335079_1173078823300032783SoilMRLFLALDIDGAIRERIARFMDGVRGFAPDARWIQPESLH
Ga0335079_1184700723300032783SoilMRLFIALDIDDAIRERIVRFLDGVQGFAPDARWVK
Ga0335078_1092629113300032805SoilMARLPMRLFVALDIDPEIRNRIAQFMDGVRGFAPEARWVSVESLHLT
Ga0335075_1056294823300032896SoilMRLFVALDLDESVREKIARFMDGVCGLAPEARWIQPESLH
Ga0335083_1031117713300032954SoilMRLFVALDIPSEIRERITRFVEGLVRFSPDANWVKPA
Ga0326728_1009358853300033402Peat SoilMRLFVALDIDFAVRERIAGFMAEVQKLAPEARWVKPESFHIT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.