NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F093675


Overview

Basic Information
Family ID F093675
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 40 residues
Representative Sequence MAPFIAIVLSIGILTLLGIAAGFGIDSRPSYGDDHAR
Number of Associated Samples 95
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 31.82 %
% of genes near scaffold ends (potentially truncated) 14.15 %
% of genes from short scaffolds (< 2000 bps) 16.98 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47

Hidden Markov Model
[HMM logo rendered with Skylign; not reproduced.]

Most Common Taxonomy
Group Unclassified (79.245 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(21.698 % of family members)
Environment Ontology (ENVO) Unclassified
(28.302 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(37.736 % of family members)
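
The percentages above are given as shares of family members, so they can be converted back to whole-number counts. The following is a minimal sketch, assuming each of these percentages uses all 106 sequences of the family as its denominator (the page does not say so explicitly). The name `members_from_percent` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 106  # "Number of Sequences" from Basic Information

def members_from_percent(percent, total=FAMILY_SIZE):
    """Return the whole-number member count closest to `percent` % of `total`."""
    return round(percent / 100 * total)

# Percentages quoted above; the short labels are made up for this sketch.
reported = {
    "Taxonomy: Unclassified": 79.245,
    "GOLD: Soil": 21.698,
    "ENVO: Unclassified": 28.302,
    "EMPO: Soil (non-saline)": 37.736,
}

for label, percent in reported.items():
    count = members_from_percent(percent)
    # Recompute the percentage from the count as a round-trip check.
    recomputed = count / FAMILY_SIZE * 100
    print(f"{label}: {count} of {FAMILY_SIZE} members ({recomputed:.3f} %)")
```

Under that assumption the four values correspond to 84, 23, 30 and 40 members respectively.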




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment (MSAViewer) not reproduced. The viewer listed the 106 member sequences by protein ID, numbered 1 to 106, in the same order as under Family Sequences, over alignment columns numbered up to 62.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution:
α-helix: 41.54%
β-sheet: 0.00%
Coil/Unstructured: 58.46%
[Feature Viewer tracks for the representative sequence MAPFIAIVLSIGILTLLGIAAGFGIDSRPSYGDDHAR: α-helices, β-strands, coil, SS confidence score, signal peptide; not reproduced.]

Predicted 3D Structure

Per-residue confidence (pLDDT) legend bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.47
[Interactive 3D structure viewer (PDBe Molstar) not reproduced.]

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
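
The confidence rule stated above and the pLDDT legend bins can be written as two small checks. This is only a sketch: the 0.7 cutoff and the four bins come from the page, while the function names and the handling of fractional pLDDT values that fall between two bins (upper bounds treated as inclusive) are assumptions.

```python
def plddt_bin(plddt):
    """Map a per-residue pLDDT value (0-100) to one of the four legend bins.

    Assumption: upper bounds are inclusive, so 50.5 falls into "51-70".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"

def is_low_confidence(ptm, cutoff=0.7):
    """Apply the stated rule: a model with pTM below the cutoff is low confidence."""
    return ptm < cutoff

# The model for this family has pTM-score 0.47, so it is flagged.
print(is_low_confidence(0.47))
```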



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Distribution of family members (chart data):
All Organisms: 20.8%
Unclassified: 79.2%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat labels shown in the chart:
Groundwater Sediment
Polar Desert Sand
Estuarine
Natural And Restored Wetlands
Soil
Sediment (Intertidal)
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Serpentine Soil
Glacier Forefield Soil
Grasslands Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Agricultural Soil
Permafrost
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Untreated Peat Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Soil
Avena Fatua Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Rhizosphere
Rhizosphere
Chart slice percentages (their pairing with the labels above is not recoverable from the page text): 5.7%, 2.8%, 21.7%, 4.7%, 5.7%, 2.8%, 2.8%, 2.8%, 3.8%, 5.7%, 3.8%, 3.8%, 2.8%, 2.8%



Associated Samples


Geographical Distribution
[Interactive map (OpenStreetMap) not reproduced.]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10214J12806_106324062 | 3300000891 | Soil | MAPIITIILMIGLMTALGIAAVLWGVDSRPSYGDDHAR*
JGI10216J12902_1000338102 | 3300000956 | Soil | MAPIITIVLMVGLLTALGIAAVLWGVDTRPSYGDDHAR*
JGI10216J12902_1065631942 | 3300000956 | Soil | MAPMIAMVLSMGIATALGIFAIQFGADSRPTYGDDHAR*
JGI10216J12902_1107135371 | 3300000956 | Soil | RRCRMAPFIAIVLSIGILTLLGIAAGFGFDSRPSYGDDHAR*
A305W6_10045255 | 3300001330 | Permafrost | MAPFIAIVLSIGIMAAFGIAGAIWGTDSRPSYGDDHAR*
Ga0055486_100851361 | 3300004071 | Natural And Restored Wetlands | RGYGRRCRMAPYIAIVLSIGLLALIGFAAIGWGIDSRPSYSDDHIR*
Ga0062593_1028521202 | 3300004114 | Soil | RGYGRDRRCRMAPFIAMLVSIGITTLFGIAALAFGADSRPSYRDDHAR*
Ga0063455_1004860521 | 3300004153 | Soil | KGLLRDRRCRMAPFIAMLLSIGITTVFGIAALAFGTDSRPTYLDDHAR*
Ga0062589_1017437992 | 3300004156 | Soil | RGYGRERRCRMAPFIAMIMSIGIGTVLGIFALTFGTDSRPTYGDDHAR*
Ga0070683_1001055493 | 3300005329 | Corn Rhizosphere | MTLEGVTRRRCLMAPFIIMIVLSIGLMALFGIAGGSGIDSRPSYGDDHAR*
Ga0070691_101255761 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | RKKTQKGLRERKRRCLMAPLIAMIASIGIASVFGMIAFTFGSDSRPTYGDDHAR*
Ga0070694_1018498801 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | EQPRRGYGTDRRRRMAPFIVMIVSIGITTVFGIAALAFGVDSRPTYGDDHAR*
Ga0070708_1020883131 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | RRCRMAPFIAIILSIGIATLIGIAAAGFGADTRPTYRDDHAR*
Ga0070697_1007325233 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | RRCRMAPLIAIILTIGLLIVLGIAAQDWGTDSRPSYGDDHAR*
Ga0066695_101010033 | 3300005553 | Soil | MAPLIAIILTIGILILLGIAAQDWGADSRPTYGDDHAR*
Ga0066698_103603682 | 3300005558 | Soil | MAPFIGIILWIGITIVLGIAAVTFGTDSTPTYGDDHAR*
Ga0068866_109255421 | 3300005718 | Miscanthus Rhizosphere | MAPFIAIILSIGIATLIGIAAAGFGADTRPTYRDDHAR*
Ga0068861_1005105593 | 3300005719 | Switchgrass Rhizosphere | MAPFIAIVLSIGILTLLGIAAGFGIDSRPSYGDDHAR*
Ga0074473_110136002 | 3300005830 | Sediment (Intertidal) | MAPFIAIILTIGLMTLLGVAASVGADSRPSYGDDHTR*
Ga0068858_1022031142 | 3300005842 | Switchgrass Rhizosphere | MAPFIMIVLSIGLLALFGLAAGTGIDSRPSYGDDHAR*
Ga0068860_1013691041 | 3300005843 | Switchgrass Rhizosphere | MAPFIMIVLSIGLLALFGLAAGSGIDSRPSYGDDHAR*
Ga0079222_109508171 | 3300006755 | Agricultural Soil | MAPFIMIVLSIGLMGLLGIASVGWGIDSRPSYGDDHAR*
Ga0075428_1008954481 | 3300006844 | Populus Rhizosphere | MAPIITIVLMVGLLTALGIAAVLWGVDSRPSYGDDHAR*
Ga0079217_104422952 | 3300006876 | Agricultural Soil | MVPFIAIVLSIGILTALGIAAVLWGADSRPTYGDDHAR*
Ga0075426_106837313 | 3300006903 | Populus Rhizosphere | PFIAIVLSIGIMTAFGIAAAIWGTDSRPTYRDDHAR*
Ga0079219_117188572 | 3300006954 | Agricultural Soil | MAPFIMIVLSIGLMALLGFASAGWGTDSRPSYGDDHAR*
Ga0111538_109288063 | 3300009156 | Populus Rhizosphere | REEPKRGYSRDRRCRMAPFIALLLSIGITTVIGMAALTFGADSRPSYSDDHAR*
Ga0126313_117580592 | 3300009840 | Serpentine Soil | MAPFIIMIVLSVGLMALLGIAGGVGIDSRPSYGDDHAR*
Ga0126312_113298171 | 3300010041 | Serpentine Soil | MVAIVSIVLMVGIMSALGIAAGLWGVDSRPSYGDDHAR*
Ga0126311_109839232 | 3300010045 | Serpentine Soil | MIAMILSIGLTTILGFAALTWGVDSRPSIGDDHAR*
Ga0127483_12154631 | 3300010142 | Grasslands Soil | RERRCRMAPLIAIILTIGILILLGIAAQDWGADSRPTYGDDHAR*
Ga0126377_100310343 | 3300010362 | Tropical Forest Soil | MAPYIAILMSIGITTLIGIAALAFGTDSRPTYGDDHAR*
Ga0134125_109019371 | 3300010371 | Terrestrial Soil | MAPFIAIILSIGLTTLIGIAAAGFGADTRPTYRDDHAR*
Ga0134124_101488054 | 3300010397 | Terrestrial Soil | MAPFIAIVLLIGIMSALGIAAAMWGTDSRPTYVDDHAR*
Ga0134127_101665094 | 3300010399 | Terrestrial Soil | MAPFIAIILSIGFATLIGIAAAGFGADTRPTYRDDHAR*
Ga0134127_101690904 | 3300010399 | Terrestrial Soil | TKKRLREKERRCRMASLIFIALMIGIMTSLGIAAVLYGNDSRPSYGDDHAR*
Ga0134127_125394632 | 3300010399 | Terrestrial Soil | MAPIITIVLMVGIMTALGVAATYWGVDSRPSYRDDHAR*
Ga0134122_124740631 | 3300010400 | Terrestrial Soil | MAPFIAIVLSIGILTLLGMLAGFGTDSRPSYGDDHAR*
Ga0137437_12140932 | 3300011442 | Soil | MALFIAIVLLIGIMAAFGIAAAVWGTDSRPSYGDDHAR*
Ga0136621_12352102 | 3300012092 | Polar Desert Sand | MAPFIAIILSMGIMTALGIAAALWGTASRPSYGDDHAR*
Ga0137376_100914514 | 3300012208 | Vadose Zone Soil | MAPLIAIILTIGILILLGIAAQDWGADSRPTYRDDHAR*
Ga0150985_1107587271 | 3300012212 | Avena Fatua Rhizosphere | TQKGLLRDRRCRMAPFIAMLLSIGITTVFGIAALAFGTDSRPTYLDDHAR*
Ga0134061_12139961 | 3300012399 | Grasslands Soil | PRRGYGKERRCRMAPLIAIIFTIGILILLGIAAQDWGADSRPTYGDDHAR*
Ga0136633_12958571 | 3300012527 | Polar Desert Sand | MAPFIAIVLSIGIMTALGIAAFLWGFDSRPTYGDDHTR*
Ga0137396_112577541 | 3300012918 | Vadose Zone Soil | MAPLIAIIMTIGILVLLGIAAQNWGEDSRPTYGDDHAR*
Ga0137410_120317041 | 3300012944 | Vadose Zone Soil | ERRCRMAPLIAIIMTIGILILLGIAAQDWGEDSRPTYGDDHAR*
Ga0164304_113236892 | 3300012986 | Soil | MAPFIAIVLSIGIMTAFGIAAAIWGTDSRPTYGDDHAR*
Ga0163163_128917612 | 3300014325 | Switchgrass Rhizosphere | MAPFMMIVLSIGLLALFGLAAGSGIDSRPSYGDDHAR*
Ga0182000_105629491 | 3300014487 | Soil | MAPIITIVLMVGIMTALGVAAALWGVDSRPSYGDDHAR*
Ga0182008_102366982 | 3300014497 | Rhizosphere | MAPFIIMIVLSIGLMALFGIAGGSGIDSRPSYGDDHAR*
Ga0182008_107341351 | 3300014497 | Rhizosphere | MAPFIAIVLSIGIMSAVGIAAALWGTDSRPSYGDDHAR*
Ga0167655_10064396 | 3300015086 | Glacier Forefield Soil | MAPFIAIVLSIGIMTLLGIAAGIGVDSRPSYGDDHAR*
Ga0167653_10374392 | 3300015162 | Glacier Forefield Soil | MAPFIAIVLLIGIMCALGIAAAMWGTDSRPTYADDHAR*
Ga0167646_10173663 | 3300015192 | Glacier Forefield Soil | MAPFIAIVLSIGILSLLGIFAGFGIDSRPSYGDDHAR*
Ga0137418_107167252 | 3300015241 | Vadose Zone Soil | MAPLIAIIMTIGILILLGIAAQDWGEDSRPTYGDDHAR*
Ga0180075_11042561 | 3300015252 | Soil | MAPFIAIVLLIGIMAAFGIAAAVWGTDSRPSYGDDHAR*
Ga0183260_101197962 | 3300017787 | Polar Desert Sand | MAPFIAIFLSMGIMTALGIAAALWGTDSRPSYGDDHAR
Ga0184621_101105541 | 3300018054 | Groundwater Sediment | MAPFIAVILTIGLLIVLGIAAQDWGTDSRPTYGDDHAR
Ga0184632_101012332 | 3300018075 | Groundwater Sediment | MAPLIAIVLTIGIMTAIGIAALTWGVDTTPSYGDDHAR
Ga0190265_112640092 | 3300018422 | Soil | MAPFIAIVLSMGILTALGIAAVLWGADSRPTYRDDHAR
Ga0190272_104155812 | 3300018429 | Soil | MAPFIAIVLSMGILTALGIAAVLWGADSRPTYGDDHAR
Ga0190272_132468142 | 3300018429 | Soil | MAPFIAIILSMGILTALGIAAVLWGADSRPTYGDDHAR
Ga0066655_105326201 | 3300018431 | Grasslands Soil | LIAIILTIGILILLGIAAQDWGADSRPTYGDDHAR
Ga0190275_102157334 | 3300018432 | Soil | MAPMITIVLMVGLLTALGIAAVLWGVDTRPSYGDDHAR
Ga0066667_109573442 | 3300018433 | Grasslands Soil | MAPLIAIILTIGILILLGIAAQDWGADSRPTYGDDHAR
Ga0190268_110030381 | 3300018466 | Soil | MAPIITIVLIVGLMTALGIAAVLWGVDSRPSYGDDHAR
Ga0190268_116605501 | 3300018466 | Soil | MAPIITIVLMVGLLTALGIAAVLWGVDTRPSYGDDHAR
Ga0190270_111493532 | 3300018469 | Soil | MAPIITIFLIVGLMTALGIAAVLWGVDTRPSYGDDHAR
Ga0190274_127724351 | 3300018476 | Soil | MAPIITIVLMVGLMTALGIAAVLWGVDSRPSYGDDHAR
Ga0184644_15386951 | 3300019269 | Groundwater Sediment | MAPFIAIVLSIGILTLLGMAAGFGIDSRPSYGDDHAR
Ga0190264_109249803 | 3300019377 | Soil | MAPIITIFLIVGLMAALGIAAVLWGVDTRPSYGDDHAR
Ga0190264_120879902 | 3300019377 | Soil | MAPIITIGLILGLMTALGIAAVLWGVDSRPSYGDDHAR
Ga0190267_100213363 | 3300019767 | Soil | MAPIITIVLMVGLMTALGIAAVLWGVDTRPSYGDDHAR
Ga0206350_103413691 | 3300020080 | Corn, Switchgrass And Miscanthus Rhizosphere | EPKRGYSRDRRCRMAPFIALLLSIGITTVIGMAALTFGADSRPSYSDDHAR
Ga0179584_14643653 | 3300021151 | Vadose Zone Soil | PRHGLRSEKRRCRMAPFIAFVLSIGILTAFGIAAAIWGTDSRSTYRDDHAR
Ga0193699_104948101 | 3300021363 | Soil | MAPFIAIVLSIGLLTLLGMLAGFGIDSRPSYGDDHAR
Ga0210334_101515933 | 3300021859 | Estuarine | MAPFIAIILTIGLMTLLGVAASVGADSRPSYGDDHTR
Ga0222624_15555711 | 3300021951 | Groundwater Sediment | MAPFIAIVLSIGILTLLGMLAGFGTDSRPSYGDDHAR
Ga0222625_12420043 | 3300022195 | Groundwater Sediment | MAPIITIVLTIGIMTALGTIAALWGVDSRPSYGDDHAR
Ga0222625_14839552 | 3300022195 | Groundwater Sediment | MAPIITIFLIVGLMTALGIAAVLWGVDSRPSYGDDHAR
Ga0222625_17231341 | 3300022195 | Groundwater Sediment | MAPFIAIILTLGILIVLGIAAQDWGADSRPTYGDDHARSSETTST
Ga0222625_17772061 | 3300022195 | Groundwater Sediment | SPRRGYGRERRCRMAPFILMIVPIGILVILGIAALTFGTDSRPTIGDDHAR
Ga0222622_113903042 | 3300022756 | Groundwater Sediment | RRCRMAPFIAVILTIGLLIVLGIAAQDWGTDSRPTYGDDHAR
Ga0193714_10285362 | 3300023058 | Soil | MAPFIAIVLSIGILTAFGIVAAIWGTDSRPTYRDDHAR
Ga0209640_112187971 | 3300025324 | Soil | MAPLITIVLSIGILALFGFAALRWGVDSRPTYRDDHAR
Ga0207660_10000002790 | 3300025917 | Corn Rhizosphere | MAPFIAIILSIGFATLIGIAAAGFGADTRPTYRDDHAR
Ga0207662_112265221 | 3300025918 | Switchgrass Rhizosphere | GGYARRCRMAPFIMIVLSIGLLALFGLAAGSGIDSRPSYGDDHAR
Ga0210067_10142672 | 3300025947 | Natural And Restored Wetlands | MAPYIAIVLSIGLLALIGFAAIGWGIDSRPSYSDDHIR
Ga0207625_1010743 | 3300027457 | Soil | MAPFIAIVLSIGIMIAFGIAAAIWGSDSRPTYGDDHAR
Ga0209818_11963892 | 3300027637 | Agricultural Soil | MVPFIAIVLSIGILTALGIAAVLWGADSRPTYGDDHAR
Ga0268264_121047261 | 3300028381 | Switchgrass Rhizosphere | GLRERKRRCRMAPLIAMIASIGIASVFGMIAYTFGTDSRPTYGDDHAR
Ga0247818_106711962 | 3300028589 | Soil | MAPIITIILMIGLMTALGIAAVLWGVDSRPSYGDDHAR
Ga0307322_101659162 | 3300028710 | Soil | MAPFIAIVLSIGILTLFGIAAGFGIDSRPSYGDDHAR
Ga0307280_100971072 | 3300028768 | Soil | MAPFIAIVLSIGIMVAFGIAGAIWGTDSRPSYGDDHAR
Ga0307282_101798702 | 3300028784 | Soil | MAPFIAIILTIGVLIVLGIAAQDWGADSRPSYGDDHTR
Ga0307296_100318404 | 3300028819 | Soil | MAPFIAVILTIGLLIVLGIVAQDWGTDSRPTYGDDHAR
Ga0307277_100746531 | 3300028881 | Soil | MAPFIAIILTIGILIVLGIAAQDWGTDSRPSYGDD
Ga0307308_102496303 | 3300028884 | Soil | RRGYGRDRRCRMAPFIAIILTIGILIVLGIAAQDWGADSRPTYGDDHARSSETTST
Ga0308203_10139721 | 3300030829 | Soil | RGYGKDRRCRMAPFIAMIASIGIGTVFGILAFTFGTDSRPTYGDDHAR
Ga0307408_1008522172 | 3300031548 | Rhizosphere | MAPFIMIVLSIGLLALIGTAGLGWGSDSRPSYGDDHAR
Ga0310813_123263081 | 3300031716 | Soil | MTLKGVTQRRCRMAPFIIMIVLSIGLMALLGIAGGMGIDSRPSYGDDHAR
Ga0307468_1020680992 | 3300031740 | Hardwood Forest Soil | MAPFIAIVLSIGILTLLGIAAGIGIDSRPSYGDDHAR
Ga0326597_116612982 | 3300031965 | Soil | MAPIIAIVLSIGILTAFGIAAVLWGVDSRPTIGDDHARWAGPSPR
Ga0307416_1012274531 | 3300032002 | Rhizosphere | MAPIITIVLMVGLLTALGMAALLWGVDTRPSYGDDHAR
Ga0307411_105348632 | 3300032005 | Rhizosphere | MAPMIAMVLSMGIATALGIFAIQFGVDSRPTYGDDHTR
Ga0316598_204772_402_515 | 3300034652 | Untreated Peat Soil | MAPFIAIVLSIGILTLLGIAAGIGFDSRPTYVDDHAR
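
The sequence list can be summarized programmatically, for example by tallying habitats, measuring lengths, and finding members identical to the representative sequence. Below is a minimal sketch over five rows copied by hand from the list above, not the full family. It assumes that a trailing `*` marks the end of the translation rather than a residue. `ROWS`, `residues` and the other names are illustrative only.

```python
from collections import Counter

# Representative sequence from the Overview section.
REPRESENTATIVE = "MAPFIAIVLSIGILTLLGIAAGFGIDSRPSYGDDHAR"

# (protein ID, sample taxon ID, habitat, sequence): a hand-copied subset.
ROWS = [
    ("JGI10214J12806_106324062", "3300000891", "Soil",
     "MAPIITIILMIGLMTALGIAAVLWGVDSRPSYGDDHAR*"),
    ("JGI10216J12902_1107135371", "3300000956", "Soil",
     "RRCRMAPFIAIVLSIGILTLLGIAAGFGFDSRPSYGDDHAR*"),
    ("Ga0068861_1005105593", "3300005719", "Switchgrass Rhizosphere",
     "MAPFIAIVLSIGILTLLGIAAGFGIDSRPSYGDDHAR*"),
    ("Ga0184621_101105541", "3300018054", "Groundwater Sediment",
     "MAPFIAVILTIGLLIVLGIAAQDWGTDSRPTYGDDHAR"),
    ("Ga0307277_100746531", "3300028881", "Soil",
     "MAPFIAIILTIGILIVLGIAAQDWGTDSRPSYGDD"),
]

def residues(sequence):
    """Drop a trailing '*' (assumed here to be a terminator, not a residue)."""
    return sequence.rstrip("*")

# Number of rows per habitat label.
habitat_counts = Counter(habitat for _, _, habitat, _ in ROWS)

# Residue count per protein ID.
lengths = {pid: len(residues(seq)) for pid, _, _, seq in ROWS}

# Protein IDs whose sequence equals the representative exactly.
exact_matches = [pid for pid, _, _, seq in ROWS
                 if residues(seq) == REPRESENTATIVE]

print(habitat_counts.most_common())
print(lengths)
print(exact_matches)
```

Running the same steps over all 106 rows would give family-wide habitat counts and a length distribution to compare against the reported average of 40 residues.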




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice