NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F086871

Metagenome / Metatranscriptome Family F086871

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F086871
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 39 residues
Representative Sequence MDLNLTAEELAFRDELRAWLASNVPKDWNEWREKPLEE
Number of Associated Samples 94
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 98.18 %
% of genes from short scaffolds (< 2000 bps) 87.27 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (71.818 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(25.455 % of family members)
Environment Ontology (ENVO) Unclassified
(28.182 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(57.273 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54
1JGI12694J13545_10091621
2JGI12704J13340_10047961
3JGI12635J15846_108788561
4JGIcombinedJ26739_1015108171
5Ga0062385_103504471
6Ga0062389_1027198192
7Ga0073909_101888561
8Ga0070730_108521801
9Ga0070732_107642751
10Ga0068855_1003153533
11Ga0066702_100610411
12Ga0066903_1073398223
13Ga0066659_102570991
14Ga0099830_102700481
15Ga0099830_117402351
16Ga0126373_127863031
17Ga0126373_132001912
18Ga0074046_108035792
19Ga0126378_134230391
20Ga0105239_102870953
21Ga0126361_106005551
22Ga0137392_103923762
23Ga0137393_101243111
24Ga0137388_112911982
25Ga0137360_112834872
26Ga0137390_104853702
27Ga0137405_12480811
28Ga0182041_115271512
29Ga0182037_108972031
30Ga0187812_10414932
31Ga0187823_101970081
32Ga0187805_101774052
33Ga0187859_100202721
34Ga0187784_110733322
35Ga0187772_104908782
36Ga0066662_109396072
37Ga0137408_11371093
38Ga0179592_102879791
39Ga0210407_114415651
40Ga0210403_103077232
41Ga0210403_110088951
42Ga0210399_105453811
43Ga0210399_106515222
44Ga0215015_101835441
45Ga0210404_101332242
46Ga0210397_100796081
47Ga0210387_106434982
48Ga0210386_112742382
49Ga0210394_106035281
50Ga0210394_109479382
51Ga0210394_110549912
52Ga0210394_114775982
53Ga0210391_101515021
54Ga0210402_111600401
55Ga0210402_119278932
56Ga0210409_101034061
57Ga0210409_103236391
58Ga0242662_103131411
59Ga0208691_11171872
60Ga0207692_107867882
61Ga0207692_108872522
62Ga0207685_101105603
63Ga0207654_106563562
64Ga0207707_105335591
65Ga0207707_110160101
66Ga0207665_109167982
67Ga0257150_10612691
68Ga0257160_10890922
69Ga0257181_10359242
70Ga0209648_106377773
71Ga0209730_10403882
72Ga0209115_10054913
73Ga0209625_10092441
74Ga0209076_11749081
75Ga0209217_10425972
76Ga0209388_10329951
77Ga0209118_10353642
78Ga0209333_11861622
79Ga0209580_100145091
80Ga0209166_102573211
81Ga0209701_104808442
82Ga0209067_101944011
83Ga0302231_101084761
84Ga0302221_104441672
85Ga0311352_100304971
86Ga0311371_108272602
87Ga0302304_101266131
88Ga0311370_108707182
89Ga0311355_107295542
90Ga0311355_110135062
91Ga0170834_1070960141
92Ga0310686_1012806472
93Ga0310686_1085879863
94Ga0307476_101729742
95Ga0307477_100136191
96Ga0307477_102302902
97Ga0318546_107639712
98Ga0318548_101658182
99Ga0318497_100550043
100Ga0310913_108081251
101Ga0306926_100250461
102Ga0318505_104232702
103Ga0306920_1001361291
104Ga0335085_110258361
105Ga0335079_105718982
106Ga0335079_108197022
107Ga0335079_112329711
108Ga0335078_126422022
109Ga0335080_120988562
110Ga0310914_108879662
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 33.33%    β-sheet: 0.00%    Coil/Unstructured: 66.67%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MDLNLTAEELAFRDELRAWLASNVPKDWNEWREKPLEESequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
71.8%28.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Peatland
Freshwater Sediment
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Bog Forest Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Palsa
Corn Rhizosphere
Corn Rhizosphere
Boreal Forest Soil
11.8%4.5%25.5%3.6%5.5%9.1%3.6%7.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12694J13545_100916213300001166Forest SoilMDLNLTSEEMQFRDELRSWLAAHVPSDWAERREESLESRFE
JGI12704J13340_100479613300001170Forest SoilMDLNLTAEELTFRDELRAWLASNVPKDWSDWREKPMEESFP
JGI12635J15846_1087885613300001593Forest SoilMDLNLTADELQFRDELRSWLTANLPKDWDEWREKPIEVSFPYL
JGIcombinedJ26739_10151081713300002245Forest SoilMDLKLTPEESSFRDELRAWLATNAPKDWAEWREKPLEE
Ga0062385_1035044713300004080Bog Forest SoilMDLRLTAEESAFRDELRAWLAINVPRDWSEWREKPIEESYPYLRA
Ga0062389_10271981923300004092Bog Forest SoilMDLNLSTDEQTFQDELRSWLETNVPREWEHAREQSLDVRFEFHR
Ga0073909_1018885613300005526Surface SoilMDLNLSPEEIKFRDELRTWLAANAPKDWDERREESMESRFEYLKRWQR
Ga0070730_1085218013300005537Surface SoilMDLNLSPEEIKFRDELRNWLSANAPKDWDERREESM
Ga0070732_1076427513300005542Surface SoilMDLKLSLEELQFRDELRAWLRANVPRDWGEWREKPIE
Ga0068855_10031535333300005563Corn RhizosphereMDLNLSPDEIKFRDELRTWLAANAPTDWDERREES
Ga0066702_1006104113300005575SoilMDLSLSPDEIKFRDVLRTWLAGNAPTDWDERREESMESRFEYLKRWQ
Ga0066903_10733982233300005764Tropical Forest SoilMDLNLSADELKFRDELRAWLAANVPKDWDEYRDESLEARF
Ga0066659_1025709913300006797SoilMDLKLTAEELKFRDELRAWLKSNVPKDWDEWREETLEARF
Ga0099830_1027004813300009088Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWREKP
Ga0099830_1174023513300009088Vadose Zone SoilMDLNLTLEEKQFRDELRIWLEANVPKDWSEWREKPIEESFTYL
Ga0126373_1278630313300010048Tropical Forest SoilMDLNLTREEVAFRDAFRSWLASNVPNDWSRWREKPLEESFSYL
Ga0126373_1320019123300010048Tropical Forest SoilMDLNLTREEVAFRDEFRSWLASNVPNDWSRWREKPLE
Ga0074046_1080357923300010339Bog Forest SoilMDLNLSSEERQFRDEFRGWLEANVPKDWPEWREKPL
Ga0126378_1342303913300010361Tropical Forest SoilMDLNLSADELKFRDELRAWLAANVPKDWNEHREESLEAR
Ga0105239_1028709533300010375Corn RhizosphereMDLNLSPDEIKFRDELRTWLSANAPTDWDERREESMESRFEYL
Ga0126361_1060055513300010876Boreal Forest SoilMDLKLTPEESSFRDELRSWLAANAPKDWAEWREKPLEES
Ga0137392_1039237623300011269Vadose Zone SoilMNLNLSPEELQFRDELREWLRANVPRDWSEWREKPIEESFPYLRAW
Ga0137393_1012431113300011271Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWREKPIEE
Ga0137388_1129119823300012189Vadose Zone SoilMDLNLSTEELKFRDELRAWLTANVPRDWDERREESLEVRFDYLK
Ga0137360_1128348723300012361Vadose Zone SoilMNLNLSPEELQFRDELRKWLRANVPRDWSEWREKPIEESF
Ga0137390_1048537023300012363Vadose Zone SoilMDLNLTKEEIAFRDELRAWLKASVPKDWDERRESPM
Ga0137405_124808113300015053Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWRGVA*
Ga0182041_1152715123300016294SoilMDLNLTPDEAAFRDELRLWLAANVPTDGNTWREKSLEESFP
Ga0182037_1089720313300016404SoilMDLNLNNEEKQFRDELRSWLEANAPKDWAEWRERPL
Ga0187812_104149323300017821Freshwater SedimentVDLNLTPGERQFRDDLRAWLAVHVPKDWNEWREKPIEVSF
Ga0187823_1019700813300017993Freshwater SedimentMDLNLNAEERQFRDELRAWLEANTPKDWSDWREKPLEESF
Ga0187805_1017740523300018007Freshwater SedimentVDLNLNAEEKQFRDELRAWLSANVPKDWAEWREKPLE
Ga0187859_1002027213300018047PeatlandMDLNLTPEELTFRDELRAWLASNVPKDWKEWREKPMEESF
Ga0187784_1107333223300018062Tropical PeatlandMDLNLSAEERSFRDEFRTWLEANVPRDWPEWREKPLE
Ga0187772_1049087823300018085Tropical PeatlandMDLNLNAEERQFRDELRAWLEMHVPKDWSEWREKPLEESFPYLR
Ga0066662_1093960723300018468Grasslands SoilMDLNLTAGELKFRDELRAWLATNVPKDWEEWREESLEAR
Ga0137408_113710933300019789Vadose Zone SoilMDLNLSPDEIKFRDELRSWLSANAPTDWDERREESMESRFEYL
Ga0179592_1028797913300020199Vadose Zone SoilMDLNLTAGESAFREELRAWLAANAPKDWNEWREKPLE
Ga0210407_1144156513300020579SoilMDLKLTPEESSFRDELRAWLATNAPKDWAEWREKPLE
Ga0210403_1030772323300020580SoilMDLKLTPEESSFRDELRAWLATNAPKDWAEWREKPL
Ga0210403_1100889513300020580SoilMDLNLTADEKLFRDELRSWLALNAPKDWPAWQNKPLEE
Ga0210399_1054538113300020581SoilMDLKLTAEELAFRDELRAWLASNIPKDWEEWREKPIEES
Ga0210399_1065152223300020581SoilMDLNLTADESAFRDELRAWLASHVPKDWDAWREKPMEESF
Ga0215015_1018354413300021046SoilMDLNLSVEELRFRDELRAWLLANAPRDWSEWPVSY
Ga0210404_1013322423300021088SoilMDLNLTAEELAFRDELRGWLAANAPKDWNEWREKPLEES
Ga0210397_1007960813300021403SoilMDLNLTADELQFRDELRSWLASNVPKDWNEWREKP
Ga0210387_1064349823300021405SoilMDLNLTADEKVFRDELRSWLAANAPKDWPVWQNKP
Ga0210386_1127423823300021406SoilMDLNLTAEELAFRDELRSWLASHVPKDWDEWREKHM
Ga0210394_1060352813300021420SoilMDLNLTAEELAFRDELRSWLASNLPIDWEEWREKPIEESFP
Ga0210394_1094793823300021420SoilMDLNLTSEEMQFRDELRSWLTANVPTDWAERREESLE
Ga0210394_1105499123300021420SoilMDLNLTPDELQFRDELRSWLATNVPKDWNEWREKPI
Ga0210394_1147759823300021420SoilMDLNLNTEEKQFRDELRAWLDANVPKDWAQWREKPLEEVFPYH
Ga0210391_1015150213300021433SoilMDLKLTAEESAFRDEFCAWLTSNVPKDWTEWREKPIEES
Ga0210402_1116004013300021478SoilMDLKLSREELQFRDELRAWLAANLPRDWGEWREKPIEESFSYLR
Ga0210402_1192789323300021478SoilMDLNLTAEELAFRDELRAWLATNVPKDWNEWREKP
Ga0210409_1010340613300021559SoilMDLNLTAEEKAFRDELRAWLASNVPRDWSEWREKPIE
Ga0210409_1032363913300021559SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWREKPIEES
Ga0242662_1031314113300022533SoilMDLNLTAEELAFRDELRAWLASNVPKDWNEWREKPLEE
Ga0208691_111718723300025612PeatlandMDLNLTPEELTFRDELRAWLASNVPKDWKEWREKPMEES
Ga0207692_1078678823300025898Corn, Switchgrass And Miscanthus RhizosphereMDLNLNPAETKFRDELRAWLTANVPKDWDERREESLE
Ga0207692_1088725223300025898Corn, Switchgrass And Miscanthus RhizosphereMDLNLSPDEIKFRDELRSWLAANAPKDWDERREESMESRFEY
Ga0207685_1011056033300025905Corn, Switchgrass And Miscanthus RhizosphereMDLNLSPEEIKFRDELRSWLATNAPKDWDERREES
Ga0207654_1065635623300025911Corn RhizosphereMDLNLSPDEIKFRDELRTWLAANAPSDWDERREESMESRFEYLKRWQRPSTQPA
Ga0207707_1053355913300025912Corn RhizosphereMDLNLSPDEIKFRDELRTWLAANAPTDWDERREESMESRFEYLK
Ga0207707_1101601013300025912Corn RhizosphereMDLNLSPDEIKFRDELRTWLAANAPTDWDERREESMESRFEYL
Ga0207665_1091679823300025939Corn, Switchgrass And Miscanthus RhizosphereMDLNLSPEEIKFRDELRAWLAANVPKDWDERREESLESRF
Ga0257150_106126913300026356SoilMDLNLTPDELQFRDELRSWLATNVPKDWNEWREKPIEES
Ga0257160_108909223300026489SoilMDLNLNAEESAFRDELGAWLASNVPKDWNQWREKP
Ga0257181_103592423300026499SoilMDLNLAAEESAFRDEFRAWLAANAPKDWNEWCEKPLEESFPYL
Ga0209648_1063777733300026551Grasslands SoilMDLNLSADELKFRDELRAWLAANVPKDWDEHREESLEA
Ga0209730_104038823300027034Forest SoilMDLNLTAEELAFRDELRAWLAVNVPRDWNEWREKPLEESFPYL
Ga0209115_100549133300027567Forest SoilMDLNLTKEELSFRDELRAWLANNLPKDWNEWREKP
Ga0209625_100924413300027635Forest SoilMDLKLTPEESSFRDELRAWLATNAPKDWAEWREKPLEESFPYL
Ga0209076_117490813300027643Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWREKPIEESFPY
Ga0209217_104259723300027651Forest SoilMNLNLNAEESTFRDELRLWLASNVPKDWNMWSEKPIEESFAY
Ga0209388_103299513300027655Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNEWREKPIEESFP
Ga0209118_103536423300027674Forest SoilMDLNLTAEEMAFRDELRAWLASNAPKDWNEWREKPLEESF
Ga0209333_118616223300027676Forest SoilVDLNLTSEEMQFRDELRSWLTANVPTDWAERREES
Ga0209580_1001450913300027842Surface SoilMDLNLSPEEIKFRDELRSWLSANAPTDWDERREESMESRFEYLK
Ga0209166_1025732113300027857Surface SoilMDLNLSPDEIKFRDELRTWLAANAPTDWDERREESMESRFEYLKRWQR
Ga0209701_1048084423300027862Vadose Zone SoilMDLNLTTEELSFRDELRAWLVSNVPKDWNERREKP
Ga0209067_1019440113300027898WatershedsMDLKLTAEELAFRDELRTWLASNVPADWSEGREKP
Ga0302231_1010847613300028775PalsaMDLNLTEQELSFRDELRAWLAANLPKDWSEWREKPI
Ga0302221_1044416723300028806PalsaMDLNLTDEELKFRDELRAWLASNVPKDWKEWREKPM
Ga0311352_1003049713300029944PalsaMDLSLTAEESAFRDQLRAWLASNVPKDWSEWRDKPMEES
Ga0311371_1082726023300029951PalsaMDLNLTPDEKQFRDELRTWLAANTPKDWPEWQNKPLEESYPYL
Ga0302304_1012661313300029993PalsaMDLNLTDEELKFRDELRAWLASNVPKDWKEWREKPMEES
Ga0311370_1087071823300030503PalsaMDLNLTADEKLFRDELRSWLAANAPKDWPAWQNKPLEESYPY
Ga0311355_1072955423300030580PalsaVDLNLTSDEMQFRDELRSWLTANVPADWAERREES
Ga0311355_1101350623300030580PalsaMDLNLTDEELKFRDELRAWLASNVPKDWKEWREKPMEESFP
Ga0170834_10709601413300031057Forest SoilMDLNLTPDEAAFRDELRPWLAANVPKDWSTWREKPLEES
Ga0310686_10128064723300031708SoilMDLNLTPDELQFRDELRSWLATNVPKDWNEWREKPIE
Ga0310686_10858798633300031708SoilMDLNLTAEELTFRDELRAWLASNVPKDWAEWREKPIE
Ga0307476_1017297423300031715Hardwood Forest SoilMDLNLTPEETKFRDELRTWLEANVPKDWGEWREKPLEESFP
Ga0307477_1001361913300031753Hardwood Forest SoilMDLNLTGEEVAFRDEFRSWLGINAPKDWSSWREKPLEESFA
Ga0307477_1023029023300031753Hardwood Forest SoilMDLKLSLEELQFREELRAWLGANLPRDWGEWREKPIEESFP
Ga0318546_1076397123300031771SoilMDLNLTRDEVAFRDELRSWLAANVPEDWSSWREKPLEE
Ga0318548_1016581823300031793SoilMDLNLTREEVAFRDEFRSWLASNVPNDWRRWREKPLEES
Ga0318497_1005500433300031805SoilMDLNLTREEVAFRDEFRSWLASNVPNDWSRWREKPLEE
Ga0310913_1080812513300031945SoilMDLNLTREEAAFRDEFRSWLATNVPRDWSAWREKPLEESF
Ga0306926_1002504613300031954SoilMDLNLTREETAFRDELRAWLAGNVPKDWSSWREKPLAVSFPYLR
Ga0318505_1042327023300032060SoilMDLTLNDEEKEFRNELRAWLEANAPNDWAEWREKPLEESFPYLR
Ga0306920_10013612913300032261SoilMDLNLSADELKFRDELRAWLAANVPKDWNEHREESLEV
Ga0335085_1102583613300032770SoilMDLNLTTEEKQFRDELRAWLEANVPKDWGEWREKP
Ga0335079_1057189823300032783SoilMDLNLSAEEREFRDEFRGWLEANVPKDWPVWREKPLEESF
Ga0335079_1081970223300032783SoilMDLNLSPDELKFRDELRAWLETNVPREWDEAREESLD
Ga0335079_1123297113300032783SoilMDLNLNAEERQFRDELRAWLEANTPKDWSDWREKPLE
Ga0335078_1264220223300032805SoilMDLNLNAEELAFRGELRAWLEANVPKDWREWREKPLEE
Ga0335080_1209885623300032828SoilMDLNLNAEERQFRDELRAWLEANTPKDWSDWREKPLEES
Ga0310914_1088796623300033289SoilMDLKLTTEELAFRNELRAWLEANIPTDWSEWREKPLDESF


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.