NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F082941


Overview

Basic Information
Family ID F082941
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 40 residues
Representative Sequence QATINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Number of Associated Samples 104
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 95.58 %
% of genes from short scaffolds (< 2000 bps) 92.92 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.50

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (90.265 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (30.088 % of family members)
Environment Ontology (ENVO) Unclassified (33.628 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (46.018 % of family members)
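
The percentages in the overview can be read back as whole-number member counts. The sketch below assumes every percentage is taken over the 113 sequences of the family (the "Number of Sequences" value); the page does not state the denominator, and the function name `members_from_percent` is purely illustrative.

```python
# Assumption: all percentages are relative to the 113 family members.
FAMILY_SIZE = 113


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole member count."""
    return round(percent / 100 * total)


# Percentages copied from the overview; the labels are shortened for display.
reported = {
    "Bacteria": 90.265,
    "GOLD: Forest Soil / Soil": 30.088,
    "ENVO: Unclassified": 33.628,
    "EMPO: Soil (non-saline)": 46.018,
    "Near scaffold ends": 95.58,
    "Short scaffolds (< 2000 bps)": 92.92,
}

for label, pct in reported.items():
    count = members_from_percent(pct)
    print(f"{label}: {pct} % is about {count} of {FAMILY_SIZE}")
```

Under that assumption, 90.265 % corresponds to 102 members and 30.088 % to 34 members, which reproduces the reported figures to the displayed precision.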




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (113 sequences; interactive view, powered by MSAViewer). The member IDs are listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 39.39%, β-sheet: 0.00%, Coil/Unstructured: 60.61%

Predicted 3D Structure

pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
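
The cutoff in the note above can be written as a one-line check. This is only a sketch of the stated rule (pTM < 0.7 means low confidence); the function name and constant are illustrative and not part of NMPFamsDB.

```python
# Cutoff quoted in the note above: models with pTM < 0.7 are low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


# pTM-score reported for family F082941.
print(is_low_confidence(0.50))
```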



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart of family members by NCBI taxonomy (powered by ApexCharts). Legend entries: All Organisms; Unclassified. Slice values: 90.3 %, 9.7 %.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Chart of family members by habitat type (powered by ApexCharts). Habitat types in the legend: Bog Forest Soil; Freshwater Sediment; Groundwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Serpentine Soil; Grasslands Soil; Surface Soil; Peatlands Soil; Forest Soil; Hardwood Forest Soil; Rice Paddy Soil; Tropical Peatland; Permafrost; Corn, Switchgrass And Miscanthus Rhizosphere; Miscanthus Rhizosphere; Switchgrass Rhizosphere; Plant Roots; Rhizosphere; Boreal Forest Soil.
Labeled slice values: 15.9 %, 4.4 %, 3.5 %, 30.1 %, 8.8 %, 5.3 %.



Associated Samples






Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1007462183 | 3300002245 | Forest Soil | RHAEMLPNAAVSVEDLQGANAALRRLERFWIRAGDFVQRPPQFAA*
JGIcombinedJ26739_1016627931 | 3300002245 | Forest Soil | IEMLGQTTITDTDLQGVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0062387_1005126811 | 3300004091 | Bog Forest Soil | HIEMLSHTTVTDENLQSVIVTLGRLERFWVRAGDLVQRPGRFAA*
Ga0062389_1041286171 | 3300004092 | Bog Forest Soil | NQTAITDEDLTAATVTLRRLERFWIRAGDLVQRPGQYAA*
Ga0066677_103498053 | 3300005171 | Soil | EMLNQTTITTADLEGVVVTLRRLERFWIRASDLLQRPGQFAA*
Ga0070711_1017698142 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | LPQAALSAEDLQGAGATLRRLERFWIRAGDFVQRPPQFAA*
Ga0070678_1010597491 | 3300005456 | Miscanthus Rhizosphere | RHVDLLAQTAVSDEDLAAVTLTLRRLERFWIRAGDLVVQRPGQFAA*
Ga0070741_105610461 | 3300005529 | Surface Soil | AVSGEDLDTVIVTLRRLERFWIRAGDLVQRPGQYAA*
Ga0066701_105562191 | 3300005552 | Soil | TTSDLEGVVVTLRRLERFWIRASDLLQRPGQFAA*
Ga0066702_103116173 | 3300005575 | Soil | HIEMLGQTTITEADLQTVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0066903_1077738441 | 3300005764 | Tropical Forest Soil | QRHAAMLPQAGFTAEDLQDGATMLRRLERFWIRAGDVVQRSPQFAA*
Ga0075274_10934402 | 3300005901 | Rice Paddy Soil | VTGDDLQNVIVTLRRLERFWIRASDLVQRPGQFAA*
Ga0070717_111020641 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | HIEMLGETRITDSDLQGVVTTLRRLERFWLRDSDLS*
Ga0070765_1022066432 | 3300006176 | Soil | QTAITEDDLQNATVTLRRLERFWIRAGDLVQRPGQFAA*
Ga0099794_106869152 | 3300007265 | Vadose Zone Soil | EIGVASVEELDAMNGVLRRLERFWVRAGDLVQRPGQFAA*
Ga0099829_106586611 | 3300009038 | Vadose Zone Soil | TAVTDGDLDSVIVTLRRLERFWVRAGDLVQRPGRFAA*
Ga0099829_107507021 | 3300009038 | Vadose Zone Soil | MLNQTTITDADLQGVVVTLRRLERFWMRASDLLQRAGQSAA*
Ga0099828_101996651 | 3300009089 | Vadose Zone Soil | IELLSQTAVTDGDLDSVIVTLRRLERFWVRAGDLVQRPGRFAA*
Ga0099792_110548411 | 3300009143 | Vadose Zone Soil | RHVEMLGQTAITDADLQGVAVTLGRLERFWMRASDLLQQPGRFAA*
Ga0116219_1000659714 | 3300009824 | Peatlands Soil | LNQTAITGEDLTAATVTLRRLERFWIRAGDLVQRPGQYAA*
Ga0126315_111466072 | 3300010038 | Serpentine Soil | VEMLAQTAVTEDDLKNVMVTLRRLERFWIRASDLVQRPGNNFAAA*
Ga0126373_129641641 | 3300010048 | Tropical Forest Soil | ATINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA*
Ga0134062_106646361 | 3300010337 | Grasslands Soil | TTITNADLEGVVVTLRRLERFWIRASDLLQRPGQFAA*
Ga0074044_104348221 | 3300010343 | Bog Forest Soil | TLNMDDLQAVGVTLRRLERFWVRAADLVQRPPRFAA*
Ga0126378_105085911 | 3300010361 | Tropical Forest Soil | HHAEMLPRAGVAAEDLQGTNATLRRLERFWIRAGDLVQRPPQLAA*
Ga0126378_108360712 | 3300010361 | Tropical Forest Soil | GLLSQAAVSVEDLQAVGVTLGRLERFWIRASDLVQRPLQFAA*
Ga0126379_109909861 | 3300010366 | Tropical Forest Soil | HAAMLPQAGLTAEDLQDGATMLRRLERFWIRAGDVVQRPPQFAA*
Ga0124850_10810411 | 3300010863 | Tropical Forest Soil | MLSQATINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA*
Ga0126350_100867432 | 3300010880 | Boreal Forest Soil | LDQTPITGPDLQNSLVTLRRLERFWVRAADLAQRPRQRAA*
Ga0137383_105625821 | 3300012199 | Vadose Zone Soil | TDENLQEVIVTLRRLERFWVRAGDLVQRPGRFAA*
Ga0137399_104955091 | 3300012203 | Vadose Zone Soil | TLNMEDLQTVGATLRRLERFWIRASDLVQRPPQFAA*
Ga0137399_113627301 | 3300012203 | Vadose Zone Soil | QTTITNADLEGVVVTLRRLERFWIRASDISLQPEQFAA*
Ga0137376_109174461 | 3300012208 | Vadose Zone Soil | HIEMLSQTAISDTDPEGVVVTLRRLERFWMRAGDPLQRPGQFAA*
Ga0137379_105487483 | 3300012209 | Vadose Zone Soil | AITDGDLQGVAVTLRQLERFWIRASDISQQPAQFAA*
Ga0137378_106659601 | 3300012210 | Vadose Zone Soil | RHVEMLGQTAITDGDLQGAAVTLRQLERFWIRASDISQQPAQFAA*
Ga0137377_110827801 | 3300012211 | Vadose Zone Soil | MHRRHVEMLSQTEISDTDLQGVGVTFRRLKRFWIRASDLLQQPGQFAA*
Ga0137390_109272663 | 3300012363 | Vadose Zone Soil | VMHKRHVEMLGQTAITDGDLQGVAVTLRQLERFWIRASDISQQPAQFAA*
Ga0137358_107494402 | 3300012582 | Vadose Zone Soil | VEMLNQTTITDADLQGVGVTLRRLERFWIRASDLSQRPGQFAA*
Ga0137394_107054563 | 3300012922 | Vadose Zone Soil | SAEDLQGAGGTLRRLERFWIRAGDFVQRPPQFAA*
Ga0137416_102746471 | 3300012927 | Vadose Zone Soil | TLSADDLQAAGVTLRRLERFWIRAADLVQRPPQFAA*
Ga0137410_110964181 | 3300012944 | Vadose Zone Soil | LNMEDLQTVGATLRRLERFWIRASDLVQRPPQFAA*
Ga0126369_115442482 | 3300012971 | Tropical Forest Soil | MLPAGLASENLQDANATLRRLERFWIRAGDLVQRPAQVAA*
Ga0164309_110950811 | 3300012984 | Soil | TEADLQTVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0164304_110173042 | 3300012986 | Soil | MPKMLPNAAVSVEDLQGANAALRRLERFWIRAGDFVQHPPQFAA*
Ga0134081_102984141 | 3300014150 | Grasslands Soil | TTITEADLQTVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0134079_102022863 | 3300014166 | Grasslands Soil | RHIEMLGQTTITESDLQGVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0163163_125616642 | 3300014325 | Switchgrass Rhizosphere | HRRHVDLLAQTDISDENLTAVTLTLRRLERFWVRAGDPVVQRQGQFAA*
Ga0182024_127767611 | 3300014501 | Permafrost | IELLSQTAVTTEDLDAVLVTLGRLERFWIRAGDLVQRPGQFAA*
Ga0137403_105829731 | 3300015264 | Vadose Zone Soil | TITNADLEGVVVTLRRLERFWIRASDLLQRPGQFAA*
Ga0134073_102325981 | 3300015356 | Grasslands Soil | ITEADLQTVVTTLRRLERFWIRASDLVQRPGQFAA*
Ga0182041_120298792 | 3300016294 | Soil | QAALSAEDLQGAGATLRRLERFWIRAGDLVQRPPQFAA
Ga0182034_100292673 | 3300016371 | Soil | LGADDLQAAGVTLRRLERFWIRAADLVQRPPPQFAA
Ga0182040_106409162 | 3300016387 | Soil | LVGADDLQAVGVTLRRLERFWIRAGDMMHRPPQFAA
Ga0182038_101215243 | 3300016445 | Soil | RHAEMLPQALVGADDLQAVGVTLRRLERFWIRAGDMMQRPPQFAA
Ga0187816_103590661 | 3300017995 | Freshwater Sediment | LLPQAAVAMDDLQSVGVTLRRLERFWIRASDLVQRPPQFAA
Ga0184629_103941061 | 3300018084 | Groundwater Sediment | MHDHHIQLLSQTTPVTDENLQEVIVTLRRLERFWVRAGDLVQRPGQFAAA
Ga0187769_115220792 | 3300018086 | Tropical Peatland | AITEEDLQAATVTLRRLERFWIRAGDLVQRPGQFAA
Ga0193728_13289052 | 3300019890 | Soil | LDVMARRHVEMLTETGITDGDLQGVVGTLHRLERFWIRAGDLLQQPGQFAA
Ga0210404_103917561 | 3300021088 | Soil | RHVEMLAQTAITDADLQGVAVTLRRLERFWVRASDISQQPGRFAA
Ga0210408_112804341 | 3300021178 | Soil | TTITSADLDGVVVTLRRLERFWIRASDLLQRPGQFAA
Ga0213872_104362622 | 3300021361 | Rhizosphere | AMLSQAALSAEDLQDVAGTLRRLERFWIRAGDVVQRPPLFAA
Ga0213876_103208623 | 3300021384 | Plant Roots | EMLSQTTVTDEDLASVTTTLRRLERFWIRAGDLVQRPGQFAA
Ga0210393_108575831 | 3300021401 | Soil | PNAAVSVEDLQGANAALRRLERFWIGAGDFVQRPPQFAA
Ga0210402_105253223 | 3300021478 | Soil | QAALSTEDLQDAGATLRRLERFWIRAGDFVQRPPQFAA
Ga0210402_107508753 | 3300021478 | Soil | SQAPINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0242654_102464221 | 3300022726 | Soil | PQATINMDDLQAVGVTLRRLERFWVRAADLVQRPPQFAA
Ga0228598_11071941 | 3300024227 | Rhizosphere | PQALVGAEDLQAASVTLRRLERFWIRAGDMIQRPPQFAA
Ga0209686_12297141 | 3300026315 | Soil | QAAVSAEDLQGASATLRRLERFWIRAGDFVQRPPQFAA
Ga0207787_10009945 | 3300026908 | Tropical Forest Soil | MLSRTAVSADDLQAAGVTLRRLERFWMRATDLRQRPQ
Ga0208859_10006165 | 3300027069 | Forest Soil | MMRRLDLTDNERAVLITTLRRLERFWIRAGDFVQRPPQFAA
Ga0209219_11493042 | 3300027565 | Forest Soil | HQRHAETLSQTAITDADLQGAVGTLRRLERFWMLATDFSSRPGQFAA
Ga0209446_10943051 | 3300027698 | Bog Forest Soil | RHAEMLPNAAVSVEDLQGANAALRRLERFWIRAGDFVQRPPQFAA
Ga0209006_108603482 | 3300027908 | Forest Soil | LGQTTITDTDLQGVVTTLRRLERFWIRASDLVQRPGQFAA
Ga0209526_103161112 | 3300028047 | Forest Soil | MHQRHAETLSQTAITDADLQGAVGTLRRLERFWMLATDFSSRPGQFAA
Ga0170820_101799252 | 3300031446 | Forest Soil | APINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0318541_107661152 | 3300031545 | Soil | AVSVEDLQAVGVTLGRLERFWIRASDLVQRPLQFAA
Ga0318528_107065741 | 3300031561 | Soil | GGLSLEDLQTASGTLRRLERFWIRTSDLVQRRPQFAA
Ga0318528_107317082 | 3300031561 | Soil | QALVGADDLQAVGVTLRRLERFWIRAGDMMQRPPQFAA
Ga0318573_104575312 | 3300031564 | Soil | LVGADDLQAVGVTLRRLERFWIRAGDMMQRPPQFAA
Ga0318573_105351271 | 3300031564 | Soil | MLPQAALSADDLQAAVTLRRLERFWIRASDLVQRPQFAA
Ga0318573_107199201 | 3300031564 | Soil | LEMLPQATINMDDLQAVGVTLRRLERFWVRAADLVQRPPQFAA
Ga0318515_101443423 | 3300031572 | Soil | LNIDDLQAVGVTLRRLERFWIRAGDLMQRPPQFAA
Ga0318555_100360423 | 3300031640 | Soil | EMLSQAGLNIDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0306917_104518641 | 3300031719 | Soil | LNMDDLQATGVTLRRLERFWIRAGDLAQRPPQFAA
Ga0306918_109090382 | 3300031744 | Soil | QATINIDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0318502_102493381 | 3300031747 | Soil | PQGGLSLEDLQTASGTLRRLERFWIRTSDLVQRRPQFAA
Ga0307475_109589052 | 3300031754 | Hardwood Forest Soil | QTTITDADLQGVVVTLRRLERFWIRASDLLQRPGQFAA
Ga0318537_102149771 | 3300031763 | Soil | LEMLPQATLNMDDLQAVGGTLRRLERFWIRAGDLAQRPPQFAA
Ga0318566_104290112 | 3300031779 | Soil | QATINMDDLQAVGVTLRRLERFWVRAADLVQRPPQFAA
Ga0318552_106424861 | 3300031782 | Soil | SQAALNMDDLQAVGVTLRRLERFWIRAGDLAQRPPQFAA
Ga0318557_101425103 | 3300031795 | Soil | MLPQGGLSLEDLQTASGTLRRLERFWIRTSDLVQRRPQFAA
Ga0318550_103228282 | 3300031797 | Soil | QTALGADDLQAAGVTLRRLERFWIRAADLVQRPPPQFAA
Ga0307478_100998204 | 3300031823 | Hardwood Forest Soil | IELLKQTAVTEGDLDGVVTTLRRLERFWIRASDLVQRPGQFAA
Ga0318499_101898641 | 3300031832 | Soil | AGLNIDDLQAVGVTLRRLERFWIRAGDLMQRPPQFAA
Ga0310917_105333601 | 3300031833 | Soil | ALNIDDLQATGVTLRRLERFWIRASDLVQRPPQFAA
Ga0318512_105591012 | 3300031846 | Soil | ATINIDDLQAVGVTLRRLERFWIRASDLVQRPPQFAA
Ga0318527_100129981 | 3300031859 | Soil | LNIDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0306925_102364441 | 3300031890 | Soil | HLEMLPQATLNMDDLQAVGGTLRRLERFWIRAGDLAQRPPQFAA
Ga0318536_106374432 | 3300031893 | Soil | LEMLPQATLNMDDLQAVGVTLRRLERFWIRAGDLAQWPPQFAA
Ga0310909_110940351 | 3300031947 | Soil | VGADDLQAVGVTLRRLERFWIRAGDMMQRPPQFAA
Ga0318530_103382081 | 3300031959 | Soil | LNMDDLQAVGVTLRRLERFWIRASDLVQRPPRFAA
Ga0318569_102706083 | 3300032010 | Soil | SQATLNMDDLQAVGVTLRRLERFWIRASDLVQRPPRFAA
Ga0318556_103225601 | 3300032043 | Soil | MLPQATLNMDDLQAVGVTLRRLERFWIRAGDLAQWPPQFAA
Ga0318575_105193762 | 3300032055 | Soil | PPGGLNVEDLQATSATLRRLERFWIRTSDLVQRRPQFAA
Ga0318533_106790162 | 3300032059 | Soil | LNMDDLQAVGGTLRRLERFWIRAGDLAQRPPQFAA
Ga0318553_101794941 | 3300032068 | Soil | AEMLPPGGLNVEDLQATSATLRRLERFWIRTSDLVQRRPQFAA
Ga0306924_126153681 | 3300032076 | Soil | TEMLVPGAVAVGDLDALCATLRRLERFWIRAGDMVQRPPQAAA
Ga0318518_105335281 | 3300032090 | Soil | QAGLNIDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0318577_105232131 | 3300032091 | Soil | SQTALGADDLQAAGVTLRRLERFWIRAADLVQRPPPQFAA
Ga0306920_1008678093 | 3300032261 | Soil | QATINMDDLQAVGVTLRRLERFWIRAGDLVQRPPQFAA
Ga0306920_1024155763 | 3300032261 | Soil | HAELLPQAAVSAEDLDTANTALRRLERFWIRAGDLVLRPPQFAA
Ga0335085_107323151 | 3300032770 | Soil | EMLNQTAITDEDLAAATVTLRRLERFWIRAGDLVQRPGQYAA
Ga0335073_106335174 | 3300033134 | Soil | HVEMLNQTAITDDDLTATTVTLRRLERFWIRAGDLVQRPGQYAAA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice