NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105962

Metagenome / Metatranscriptome Family F105962

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105962
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 47 residues
Representative Sequence QLEKRIAAHPETADAALDAAENAARMLWKALDGIEARRTAMAAA
Number of Associated Samples 81
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.60

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(16.000 % of family members)
Environment Ontology (ENVO) Unclassified
(26.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.
1JGI1027J11758_126439031
2JGI1027J11758_126522011
3JGIcombinedJ13530_1018656111
4JGI12269J14319_102039581
5Ga0070698_1009285713
6Ga0070735_106639782
7Ga0070732_110261842
8Ga0066707_106355112
9Ga0070766_105469872
10Ga0075029_1001107493
11Ga0075019_106877121
12Ga0070715_106616202
13Ga0075014_1003427951
14Ga0075014_1007722831
15Ga0099829_111706391
16Ga0116128_11987171
17Ga0116222_12882461
18Ga0116105_10096344
19Ga0116217_100135911
20Ga0116219_102948353
21Ga0116219_105688651
22Ga0126381_1012640201
23Ga0126383_100565995
24Ga0126383_126102511
25Ga0126383_130286922
26Ga0126383_134319811
27Ga0137383_111717321
28Ga0137382_101229391
29Ga0181531_102408131
30Ga0181535_106802061
31Ga0181522_107807102
32Ga0132255_10000165423
33Ga0132255_1053519492
34Ga0187808_103706112
35Ga0187819_108333532
36Ga0187879_101835601
37Ga0187847_103639571
38Ga0187817_101422994
39Ga0187817_107740541
40Ga0187816_105221142
41Ga0187863_107504391
42Ga0187890_105958701
43Ga0187784_100370756
44Ga0187784_101087161
45Ga0193713_11486612
46Ga0210407_103046732
47Ga0210403_114583212
48Ga0210399_102602783
49Ga0210395_107499761
50Ga0210395_107549542
51Ga0210404_102493831
52Ga0210405_108943312
53Ga0210396_110646602
54Ga0210388_101103694
55Ga0210393_108128123
56Ga0210389_107047142
57Ga0210386_106680803
58Ga0210383_105084703
59Ga0210391_103014752
60Ga0210398_113489702
61Ga0224541_10248902
62Ga0208936_10014726
63Ga0208935_10143161
64Ga0208192_10975251
65Ga0207663_104791013
66Ga0207700_111174491
67Ga0207700_111963181
68Ga0207700_120261642
69Ga0209422_10292481
70Ga0209580_106872272
71Ga0209169_100902373
72Ga0209380_104551161
73Ga0209380_104559812
74Ga0209415_108800312
75Ga0209415_109988201
76Ga0209698_100461077
77Ga0302303_101145111
78Ga0302225_104967101
79Ga0308309_103139462
80Ga0308309_110803833
81Ga0311368_110422531
82Ga0311339_112951862
83Ga0302282_13182731
84Ga0311353_100730491
85Ga0316363_100697381
86Ga0265760_100867613
87Ga0265760_103225702
88Ga0302307_102798883
89Ga0310686_1011688451
90Ga0307476_101318284
91Ga0307476_104813811
92Ga0307476_107231871
93Ga0307476_110035382
94Ga0307474_101878823
95Ga0318544_103736461
96Ga0307479_113013833
97Ga0307471_1027274442
98Ga0335076_104333531
99Ga0335084_111143572
100Ga0371490_11825901
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 56.94%    β-sheet: 0.00%    Coil/Unstructured: 43.06%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540QLEKRIAAHPETADAALDAAENAARMLWKALDGIEARRTAMAAASequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.60
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog
Peatland
Freshwater Sediment
Wetland
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Surface Soil
Peatlands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Fen
Palsa
Peat Soil
Arabidopsis Rhizosphere
4.0%3.0%5.0%5.0%5.0%4.0%3.0%5.0%3.0%8.0%3.0%16.0%7.0%6.0%6.0%6.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J11758_1264390313300000789SoilDVHHARVWRGLLEKRIAAHPEAADAALDAAENAARLLWQALDGIDSRRNALAIA*
JGI1027J11758_1265220113300000789SoilHHARVLRGLLEKRIAAHPEAADAALDAAENAARLLWQALDGIDSRRNALAIA*
JGIcombinedJ13530_10186561113300001213WetlandAHPEQTEAALQAAENAAQWLWQALDGIERRRQQQLAA*
JGI12269J14319_1020395813300001356Peatlands SoilANPETAPAALDAAENAAKALWRALDGIEARRMALAA*
Ga0070698_10092857133300005471Corn, Switchgrass And Miscanthus RhizosphereLEKRIAAHPEAADAALDAAENAAHALWQALDGINSRRNATAVA*
Ga0070735_1066397823300005534Surface SoilIWRTQLENRINAHPQAADAALEAAEKAARMLWKALDGIEARRVAVAAA*
Ga0070732_1102618423300005542Surface SoilHANVWRKQLEKRIVAHPQAAESALDAAENAARMLWKALDGIEARRTAMAVA*
Ga0066707_1063551123300005556SoilFSLHTTADVYHSRAWRSQLEKRVAANTDSVEKTLDAGEDAARMLWRALDGIEARRTAMAA
Ga0070766_1054698723300005921SoilNVWRQQLENRIAANPETAQAALDAAENTAKLLWQALDGIEAARMTHAA*
Ga0075029_10011074933300006052WatershedsLEKRIAAHPESADAALDAAENAARMLWQALDGIDARRTAPALV*
Ga0075019_1068771213300006086WatershedsPEAADAALDAAENAARMLWQALDGINSRRHALAIA*
Ga0070715_1066162023300006163Corn, Switchgrass And Miscanthus RhizosphereLLEKRVAAHPEAADAALDAAENAARALWQALDGINSRRNAITAA*
Ga0075014_10034279513300006174WatershedsAQHPEAAEAALNAAENAAHMLWQALDGIENRRLAAA*
Ga0075014_10077228313300006174WatershedsYHSNVWRNQLEKRIEAHPEAADAALDAAENAARMLWRALDGIDARRTAMAVA*
Ga0099829_1117063913300009038Vadose Zone SoilSQLEQRVAANPDSADATLDAAENAARMLWRALDGIEARRRTVAA*
Ga0116128_119871713300009518PeatlandSNIWREQLAKRIAADPEKADGALDAGENAARMLWQALDGIETARMTYAV*
Ga0116222_128824613300009521Peatlands SoilRIAAHPETADAALDAAENAALMLWKALDGINARRMATE*
Ga0116105_100963443300009624PeatlandIYHSNVWRKQLENRIAAHPETADAALDAAENAAKMLWRALDGIEAARTTFAA*
Ga0116217_1001359113300009700Peatlands SoilANPAQAEPALAAAENTAQALWRALDGIEARRQAAHAHFA*
Ga0116219_1029483533300009824Peatlands SoilAAHPETADAALDAAENAARMLWQALDGIEARRTAMAIA*
Ga0116219_1056886513300009824Peatlands SoilPEAADAALDAAENAARMLWKALDGIEARRTAMAIA*
Ga0126381_10126402013300010376Tropical Forest SoilARVWRNLLEKRIAQHPEVADASLDAAENAARMLWEALDGIDARRHTLAIA*
Ga0126383_1005659953300010398Tropical Forest SoilEKRIEARPQTAEAALDAAENAAKKLWQALDGIDARRHAASVA*
Ga0126383_1261025113300010398Tropical Forest SoilKRIAAHPENADAALDAAENAAQALWRALDGIEARRTATAVA*
Ga0126383_1302869223300010398Tropical Forest SoilLEKRIAANPEAAHAALDAAENAARMLWQALDGIDARRTALAIA*
Ga0126383_1343198113300010398Tropical Forest SoilLEKRIAERPELADAALDAAENASRMLWQALDGIDARRNALAIA*
Ga0137383_1117173213300012199Vadose Zone SoilPEAADAALDAAESAARMLWKALDGINARRMTMAAA*
Ga0137382_1012293913300012200Vadose Zone SoilRVAAHPEAADAALDAAENAARALWQALDGINSRRNAITAA*
Ga0181531_1024081313300014169BogNVWRGQLEKRLEANPEAAAAALMAAENTAGMLWRALDGIDAARMACAA*
Ga0181535_1068020613300014199BogHATADVYHSNVWRSQLEKRIAAKPEAADGALDAAENAARMLWKALDGINARRTAMAAV*
Ga0181522_1078071023300014657BogWRKQLEKRVAAHPQAADAALDAAENAARALWQALDGIEARRMAAA*
Ga0132255_100001654233300015374Arabidopsis RhizosphereEHPEAADAALDAAENAARMLWQALDGIDAMRHSVAVA*
Ga0132255_10535194923300015374Arabidopsis RhizosphereSKVWRKQLEKRIASHPETADAALDAAENAACMLWKALDGIDARRTAKAAA*
Ga0187808_1037061123300017942Freshwater SedimentGKRIAAHPEAAEAALDAAENAARMLWKALDGIEARRTALAVA
Ga0187819_1083335323300017943Freshwater SedimentATADVYHSNVWRQQLEKRIAAHPEAADAALDTAENAARMLWKALDGIESRRRAVAAA
Ga0187879_1018356013300017946PeatlandHATADIYHSNVWRKQLENRIAAHPETADAALDAAENAAKMLWRALDGIEAARTTFAA
Ga0187847_1036395713300017948PeatlandTTADVYHSQVWRKQLAKRVAANPEIAEKALAAAEAAARKLWQALDGIESRRMAAA
Ga0187817_1014229943300017955Freshwater SedimentEAHPETADAALDAAENAARMLWKALDGIDARRMAMAVA
Ga0187817_1077405413300017955Freshwater SedimentLEKRIEAHPETADAALDAAENAARMLWKALDGIDARRMAMAVA
Ga0187816_1052211423300017995Freshwater SedimentISARPEAADAALDAAGKAAWMLWKALDGIEARRMAAAVA
Ga0187863_1075043913300018034PeatlandDVHHSNVWHKQLEKRIAAHPESTDAALDAAENAARMLWKALDGINARRMATA
Ga0187890_1059587013300018044PeatlandNPEKAEGALDEGENAARMLWHALDGIETARMAYAM
Ga0187784_1003707563300018062Tropical PeatlandSRVWRSQLEKRITANPGAANVALDAAEKTARMLWKALDGIEARRGAFAAA
Ga0187784_1010871613300018062Tropical PeatlandTADVYHSNVWRKQLEKRIAESPEAASRALNAGENAARMLWKALDGIEARRMAAAVA
Ga0193713_114866123300019882SoilNVWREQLGKRVAVNPATADKALASAENAARALWVALDGIEATRLLKAA
Ga0210407_1030467323300020579SoilHSQVWRKQLEKHIAAHPEAADAALDAAENAARMLWKALDGIDRRRMATA
Ga0210403_1145832123300020580SoilLEKRIAAHPETADAALDAAENAARMLWKTLDGIEARRAAMAVA
Ga0210399_1026027833300020581SoilRTQLEKRITAHPEAADRALDAAENAARMLWKALDGIEARRTAMAVA
Ga0210395_1074997613300020582SoilNQLEKRILANPETAEAALNAAENAARTLWQALDGIEVRRIAAAA
Ga0210395_1075495423300020582SoilHSNVWRNQLEKQLKANPDAAQPALDAAENAAKILWQALDGIETARTSCAA
Ga0210404_1024938313300021088SoilAAHPEAADAALDAAENAARALWQALDGIDSRRNAIAVA
Ga0210405_1089433123300021171SoilAAHPESAGAALDAAENAARMLWKALDGIDARRTVMAVA
Ga0210396_1106466023300021180SoilQLEKRIAAHPENGDAALDAAESAARMLWQAMDGIEARRTASAVA
Ga0210388_1011036943300021181SoilRKQLEKRIAAHPEAADAALDAAENAARTLWKALDGIEARRTELAAA
Ga0210393_1081281233300021401SoilQLAKRIAADPEKAEGALHEGENAGRMLWQALDGIETARLAHAM
Ga0210389_1070471423300021404SoilSLHATADVYHSNVWRKQLEKRIAAHPESAGAALDAAENAARMLWKALDGIDARRTVMAVA
Ga0210386_1066808033300021406SoilVHHSNVWRRQLEKRVATNPEAANAAWDSAENTARMLWRVLDGIEAARTACMA
Ga0210383_1050847033300021407SoilHSKVWRKQLENRITESPEAVQAALDAAEKTAKLLWQALDGIEAARLALAA
Ga0210391_1030147523300021433SoilEKRLDANPETAPAALDAAENAAKALWRALDGIEARRMATAAA
Ga0210398_1134897023300021477SoilNVWRKQLEKRIAAHPESAGAALDAAENAARMLWKALDGIDARRTVMAVA
Ga0224541_102489023300022521SoilSNVWHKQLEKRIAANPETAQAALDAAEKTAQLLWKALDGIEAARMTYAA
Ga0208936_100147263300025404PeatlandDIFHSNVWRKQLEKRIAANPETAQAALDAAEKTAQLLWKALDGIEAARMTYAA
Ga0208935_101431613300025414PeatlandIYHSNVWRKQLENRIAAHPETADAALDAAENAAKMLWRALDGIEAARTTFAA
Ga0208192_109752513300025477PeatlandHAHVWREQLEKRLAAKPETAPAALDAAENTARLLWQALDGIDAARMTCAA
Ga0207663_1047910133300025916Corn, Switchgrass And Miscanthus RhizosphereRVWRGLLEKRVAAHPEAADAALDAAENAARALWQALDGINSRRNAITAA
Ga0207700_1111744913300025928Corn, Switchgrass And Miscanthus RhizosphereDPETAQAALNAAENAAHMLWKALDGIEARRTANAAA
Ga0207700_1119631813300025928Corn, Switchgrass And Miscanthus RhizosphereEKRIAAKPEAAEAALDAAEDAARWLWQALDGIDARRTAVAAA
Ga0207700_1202616423300025928Corn, Switchgrass And Miscanthus RhizosphereRDQLEKRVAAHPETAGAALDAAENAARMLWKALDGIEARRTAMAA
Ga0209422_102924813300027629Forest SoilLHATADIYHSNVWRMQLQKRLAANPEAAPAALSAAEQTAKLLWQALDGINAARTAFAA
Ga0209580_1068722723300027842Surface SoilHANVWRKQLEKRIVAHPQAAESALDAAENAARMLWKALDGIEARRTAMAVA
Ga0209169_1009023733300027879SoilKRILANPETAEAALNAAENAARTLWQALDGIEVRRIAAAA
Ga0209380_1045511613300027889SoilHANVWRQQLENRIAANPETAQAALDAAENTAKLLWQALDGIEAARMTHAA
Ga0209380_1045598123300027889SoilLDANPETAPAALDAAENAAQALWRALDGIEARRMTTAA
Ga0209415_1088003123300027905Peatlands SoilDVRHSKVWRNLLDKLLAADPAQAAAALDAAENAALALWRALDGIEARRLASHCAN
Ga0209415_1099882013300027905Peatlands SoilYFALHATADVYHSKVWRQQLENRIEANPEAAQAALDAAENTAKLLWQTLDGIEAARLALA
Ga0209698_1004610773300027911WatershedsHHARVWRGLLEKRIAANPEAADAALDAAENAARVLWQALDGIDARRSALAIA
Ga0302303_1011451113300028776PalsaHATADIFHSNVWHKQLEKRIAANPETAQAALDAAEKTAQLLWKALDGIEAARMTYAA
Ga0302225_1049671013300028780PalsaHATADVYHSNVWRKQLEKRIEANPETADAALDAAENSARMLWKALDGIEARRLAVAA
Ga0308309_1031394623300028906SoilSRVWRNQLEKRILANPETAEAALNAAENAARTLWQALDGIEVRRIAAAA
Ga0308309_1108038333300028906SoilIAAHPEAADAALDAAENAARALWQALDGIDSRRNAITVA
Ga0311368_1104225313300029882PalsaNPETADAALDAAENSARMLWKALDGSEARRLAVAA
Ga0311339_1129518623300029999PalsaALHATADIFHSNVWRKQLEKRIAANPETAQAALDAAEKTAQLLWKALDGIEAARLTYAA
Ga0302282_131827313300030045FenYHSNVWRKQLENRIAAHPETAQAALDAAENAAKMLWRALDGIEAARTTFAA
Ga0311353_1007304913300030399PalsaEKRLEANPEAAAAALMAAENTAGRLWRALDGIDAVRMACAA
Ga0316363_1006973813300030659Peatlands SoilIYHSNVWRTQLEKRIAAHPEAAEGALDAAENAAKMLWQALDGIEAARMSYAV
Ga0265760_1008676133300031090SoilYHSNVWRNQLEKRIEAHPETADAALDAAENTARMLWKALDGIDARRMAVAA
Ga0265760_1032257023300031090SoilWRSQLEKRIEAHPETADAALDAAENAARMLWKALDGIDTRRRAMAVA
Ga0302307_1027988833300031233PalsaLEKRIAANPETASGALDAAENAAQILWKVLDGIEARRESLAA
Ga0310686_10116884513300031708SoilQLEKRIGAHPESADDALDAAENAARMLWKALDGIEARRMALAAA
Ga0307476_1013182843300031715Hardwood Forest SoilRGLLEKRVAAHPEAADAALDAAENAARALWQALDGINSRRNAITAA
Ga0307476_1048138113300031715Hardwood Forest SoilVWRGLLEKRIAAHPEAADAALDAAENAARALWQALDGINSRRNAITAA
Ga0307476_1072318713300031715Hardwood Forest SoilIAAHPEAADTALEAAENAARMLWKALDGIESRRRAVAAA
Ga0307476_1100353823300031715Hardwood Forest SoilEHPEAAEPALDAAENAARMLWQALDGIDARRNAVAIA
Ga0307474_1018788233300031718Hardwood Forest SoilADEKTCSYFSLHATADVYHSNVWRKQLQKRIEAHPETADAALDAAENTARMLWKALDGINARRMTAV
Ga0318544_1037364613300031880SoilDVHHARVWRSLLKKRITEHPEAADAALDAAESAARMLWQALDGIDARRNAVAIA
Ga0307479_1130138333300031962Hardwood Forest SoilQLEKRIAAHPETADAALDAAENAARMLWKALDGIEARRTAMAAA
Ga0307471_10272744423300032180Hardwood Forest SoilVWRKQLEKRIAAHPETADAALDAAENAARMLWMALDGIEARRTAVAVA
Ga0335076_1043335313300032955SoilTLHTTADVYHSKVWKKLLEKRVTAHPETAARALDAAENAARALWKALDGIEARRTAALSA
Ga0335084_1111435723300033004SoilRGLLEKRIDEHSEAADAALDAAENAARLLWQALDGIDARRQAMAIA
Ga0371490_118259013300033561Peat SoilYHSQVWRKQLEKRVAANPEAAEKALSAAEGAARKLWEALDGIEARCMAAA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.