NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103696



Overview

Basic Information
Family ID F103696
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 43 residues
Representative Sequence VGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREIHQ
Number of Associated Samples 89
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.01 %
% of genes from short scaffolds (< 2000 bps) 99.01 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.21

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.010 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil (9.901 % of family members)
Environment Ontology (ENVO) Unclassified (23.762 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (41.584 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all 101 sequences in the family (interactive MSAViewer display, alignment columns 1 to 54). The sequence labels are the Protein IDs listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 8.70%, Coil/Unstructured: 91.30%
Feature Viewer (interactive display): per-residue tracks for the representative sequence VGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREIHQ, showing Sequence, α-helices, β-strands, Coil, and SS Conf. score.

Predicted 3D Structure

Structure Viewer


Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.21
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
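
The confidence figures above can be read programmatically. The following is a minimal sketch, not NMPFamsDB code: it assumes the four pLDDT bands shown in the legend and the pTM < 0.7 cutoff quoted in the note, and the function names `plddt_band` and `is_low_confidence_model` are hypothetical.

```python
# Hypothetical helpers; the bands and the 0.7 cutoff are taken from this page.
PTM_CUTOFF = 0.7  # models with pTM below this are labelled low confidence


def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to one of the legend bands."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score <= 50:
        return "0-50"
    if score <= 70:
        return "51-70"
    if score <= 90:
        return "71-90"
    return "91-100"


def is_low_confidence_model(ptm: float) -> bool:
    """Return True when the pTM-score falls below the page's 0.7 cutoff."""
    return ptm < PTM_CUTOFF


if __name__ == "__main__":
    # pTM-score reported for family F103696
    print(is_low_confidence_model(0.21))
    print(plddt_band(42.0))
```

With the pTM-score of 0.21 reported for this family, `is_low_confidence_model` returns `True`, which matches the "Low Quality Model" label.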



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

All Organisms
Unclassified
100.0%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Pie chart of associated habitat types (interactive ApexCharts display).
Habitat labels shown: Groundwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Surface Soil; Switchgrass Rhizosphere; Agricultural Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Forest Soil; Hardwood Forest Soil; Tropical Peatland; Exposed Rock; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Corn Rhizosphere; Arabidopsis Thaliana Rhizosphere; Avena Fatua Rhizosphere.
Slice percentages shown: 3.0%, 3.0%, 8.9%, 3.0%, 9.9%, 3.0%, 5.0%, 5.0%, 3.0%, 3.0%, 5.0%, 5.9%, 3.0%, 3.0%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0063356_1044349892 | 3300004463 | Arabidopsis Thaliana Rhizosphere | AYVGTWKGDKKIFQLVEKPTRSPFNDFNEDICVWSETKREVRP*
Ga0066395_101719433 | 3300004633 | Tropical Forest Soil | YVGTWKGDKKIFQLAEKTTRSAYNDFPEDICVWSETKRPVLQ*
Ga0066683_107577712 | 3300005172 | Soil | TWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKRPVLQ*
Ga0065712_107586731 | 3300005290 | Miscanthus Rhizosphere | DPKAYVGTWKGDKKIFRLTEKPPRSEYNDLPEGICVWSETKREIHP*
Ga0065705_103012243 | 3300005294 | Switchgrass Rhizosphere | ITDPKAYVGTWKGDKKIFQLVEKPTRSPFNDFNENICAWSETKRPVLQ*
Ga0066388_1065658422 | 3300005332 | Tropical Forest Soil | YAGTWKGDKKIFQLVEKPARSEFNDLPESICVWSETKREIHP*
Ga0070669_1004695143 | 3300005353 | Switchgrass Rhizosphere | NITDPKTYVGTWKGDKKIFQLVEKPTRSPFNDFNEDICVWSETKREVRP*
Ga0070688_1005059571 | 3300005365 | Switchgrass Rhizosphere | YVGTWKGDKKIYQLVEKPTRSAFNDFNEDICVWSENKRPVLQ*
Ga0070709_114214381 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | GTWKGDKKVFQRIENPDRSDYHDLPEGICVWSDTKREVHP*
Ga0070694_1003262211 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | TYVGTWKGDKKIFQLVEKPTRSAYSDFNENICVWSETKREVRP*
Ga0070685_107875891 | 3300005466 | Switchgrass Rhizosphere | YVGTWKGDKKIFQLVEKPTRSAYNDFNENICVWSETKREVHP*
Ga0070734_108368702 | 3300005533 | Surface Soil | PWKGDKKIFRLVEDPTHAAYKDLPEGICVWSETKREIHP*
Ga0070672_1018113301 | 3300005543 | Miscanthus Rhizosphere | GDKKIFQLVENPTGSAFNDLPEDICAWSETRREIPQ*
Ga0070695_1014273082 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | YVGTWKGDKKIFQLVEKPTRSAFNDLPEDICVWSEAKREIHQ*
Ga0070704_1002375263 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | TWKGDKKIFQLVEKPTRSAYNDFPEDICVWSETKREVHR*
Ga0068859_1021990282 | 3300005617 | Switchgrass Rhizosphere | MAYVGTWKGDKKIFQLVEKPTRSAFNDFNENICVWSENKREVHP*
Ga0068864_1006626591 | 3300005618 | Switchgrass Rhizosphere | TYVGTWKGDKKIFQLVEKPTHDFPEDICAWSETKRPVLQ*
Ga0068870_112903082 | 3300005840 | Miscanthus Rhizosphere | GDKKIFQLVEKPTRSAYNDFNENICVWSETKRPVLQ*
Ga0068858_1017140511 | 3300005842 | Switchgrass Rhizosphere | KGDKKIFQLVEKPTRSAYNDFPEDICVWSETKREVHP*
Ga0066652_1011631822 | 3300006046 | Soil | TYVGTWKGDKKILRLAEKSTHSAYNDFPEDICAWSETKRPVLQ*
Ga0075028_1008683091 | 3300006050 | Watersheds | GTWKGDKKIFQLVEKPARSEYNDLRENICAWSESKSKGGLR*
Ga0075015_1003121033 | 3300006102 | Watersheds | WKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKRPVLQ*
Ga0075014_1009832421 | 3300006174 | Watersheds | YVGTWKGDKKIFRLAEKPARSDYNDLPEDICAWSETKRPVNP*
Ga0070765_1007837233 | 3300006176 | Soil | WKGDKKIFQLVEKPARSEYNDLPEGICVWSETKREIHP*
Ga0079222_121437201 | 3300006755 | Agricultural Soil | WKGDKKIFRLVEKTKSAFNDFPEDICVWSETKREIHP*
Ga0079220_112863091 | 3300006806 | Agricultural Soil | KGDKKIFQLVEKPTRSTFNDFNENICAWSETKRPVLQ*
Ga0099793_107208271 | 3300007258 | Vadose Zone Soil | VGTWKGDKKIFRLAEKPTRSAYNDFPEDICVWSETKRPVLQ*
Ga0105248_106191761 | 3300009177 | Switchgrass Rhizosphere | ITDPQTYVGTWKGDKKIFQLVEKPTRSAFNDLPEDICAWSETKREIHE*
Ga0105248_115365421 | 3300009177 | Switchgrass Rhizosphere | KKIFRLVEKPTRSAFGDLPEDICVWSDTKRPVQEP*
Ga0105249_112986553 | 3300009553 | Switchgrass Rhizosphere | TWKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKREVLQ*
Ga0126382_109725262 | 3300010047 | Tropical Forest Soil | PKAYVGTWKGDKKIFQLAEQSTRSAYNDFPEDICVWSETKRPVLK*
Ga0126373_112144931 | 3300010048 | Tropical Forest Soil | PKAYTGTWKGDKKIFQLVEKPARSEYNDLPEGICVWSETKREIHP*
Ga0134080_103070031 | 3300010333 | Grasslands Soil | KKIFQLVEKPTRSAFNDFPEDICVWSETKRPVLQ*
Ga0126372_130480992 | 3300010360 | Tropical Forest Soil | TLNITDPKAYVGTWKGDKKIFRLAEKPARSDYNDLPEDICVWSETKRPVNP*
Ga0126378_121831292 | 3300010361 | Tropical Forest Soil | GTWKGDKKIFLLVEKPALSDYNDLPESICVWSETKREIHP*
Ga0126378_123778402 | 3300010361 | Tropical Forest Soil | TVPKVHVATAKADKKIFQLVEKPTRSPFNDFPENICVWSETKRPVLQ*
Ga0126377_107372301 | 3300010362 | Tropical Forest Soil | ITDPQIYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREVHP*
Ga0126377_108203311 | 3300010362 | Tropical Forest Soil | PKAYVGIWKGDKKIFQLAEKSTRSTYNDFPEDICVWSETKREIHQ*
Ga0126377_133867962 | 3300010362 | Tropical Forest Soil | AYVGTWKGDKKIFQLAEKSTRSAYNDFPEDICVWSETKRPVLK*
Ga0126383_123658062 | 3300010398 | Tropical Forest Soil | VGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREIHQ*
Ga0134127_136752532 | 3300010399 | Terrestrial Soil | VGTWKGDKKIFQLVEKPTRSAYNDFNENICVWSETKREVHP*
Ga0150983_164504291 | 3300011120 | Forest Soil | ITDPKTYVGTWKGDKKIFRLADKPARSDYNDLPEDICVWSETKRPVNP*
Ga0137388_107278423 | 3300012189 | Vadose Zone Soil | WKGDKKIFQLTEKPTRSAYNDFPEDICVWSETKRPVLQ*
Ga0150984_1126770302 | 3300012469 | Avena Fatua Rhizosphere | GTWRGDKKIFQLAEKPTRSAYNDFPEDICVWSETKRQVLQ*
Ga0150984_1182561631 | 3300012469 | Avena Fatua Rhizosphere | LNITDLKAYVGTWKGDKKIFQLAEKPARSAFNDFPEDICVWSETKRPVLE*
Ga0137398_102912811 | 3300012683 | Vadose Zone Soil | PMAYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSENKREVHP*
Ga0157305_101333332 | 3300012891 | Soil | WKGDKKIFQLVEKPTRSPYNDFNENICVWSETKRPVLQ*
Ga0126375_108048062 | 3300012948 | Tropical Forest Soil | KTYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREIHQ*
Ga0164305_103225021 | 3300012989 | Soil | TLNITDPKAYVGTWKGDKKIFLLAEKTTRSVYNDFPEDICVWSETKRPVLK*
Ga0157378_116837712 | 3300013297 | Miscanthus Rhizosphere | YVGTWKGDKKIFQLVEKPTHDFPEDICAWSETKRPVLQ*
Ga0163162_125989101 | 3300013306 | Switchgrass Rhizosphere | YVGTWKGDKKIFQLVEKPTHDFPEDICVWSETKRPVLQ*
Ga0157379_107755371 | 3300014968 | Switchgrass Rhizosphere | KKIFQLAEKPPRSEFNDLPEGICVWSETKREVHP*
Ga0157379_116470591 | 3300014968 | Switchgrass Rhizosphere | AYVGTWKGDKKIFQLVEKPAHDFPEDICVWSEAKRPVLP*
Ga0132258_122846743 | 3300015371 | Arabidopsis Rhizosphere | DPKAYVGTWKGDKKIFQLVEKPTRSAFNDFNENICVWSETKREVRP*
Ga0132255_1040860501 | 3300015374 | Arabidopsis Rhizosphere | KAYVGTWKGDKKIFQLVEKPTRSAFNDFNEDICVWSETKREVHP*
Ga0182035_101326041 | 3300016341 | Soil | GTWKGDKKIFQLVEKPARSQYNDFPEDICVWSETKRPIHQ
Ga0182037_110745452 | 3300016404 | Soil | WKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKRPVLK
Ga0182037_113904212 | 3300016404 | Soil | WKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKREIHP
Ga0184604_100438621 | 3300018000 | Groundwater Sediment | NITDPQTYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICAWSETKREIPQ
Ga0184605_101115541 | 3300018027 | Groundwater Sediment | DKKTFQLVEKPTRSAFNDFPEDICASSETKREIHQ
Ga0184626_102983441 | 3300018053 | Groundwater Sediment | PDTYVGTWQGDKKIFQLVEKPTRSAYNDFNENICVWSETKREVHP
Ga0187773_110782622 | 3300018064 | Tropical Peatland | YVGTWKGDKKIFRLAEKPARSDYNDLPEDICVWSETKRPVNP
Ga0187772_111519171 | 3300018085 | Tropical Peatland | PKTYTGTWKGDPKIFQLVDKPARSEYNDLPEGICVWSETKREIHP
Ga0190270_102995083 | 3300018469 | Soil | YLGTWQGDKKIFQLVENPTGSEFNDLPEDICAWSETKREIPQ
Ga0190274_107214691 | 3300018476 | Soil | LNITDPKTYVGTWKGDKKIFQLVEKPARSAYNDFNENICVWSETKREVHP
Ga0066669_110106971 | 3300018482 | Grasslands Soil | LGTWKGDKKIFQLVEKPIRSAFNDFPEDICAWSETKREIHQ
Ga0066669_112435601 | 3300018482 | Grasslands Soil | AYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSENKREVHP
Ga0173481_101189341 | 3300019356 | Soil | ITDPQTYVGTWKGDKKIFQLVEKPTRSAFNDLPEDICVWSDTKREIPE
Ga0193754_10322481 | 3300019872 | Soil | LNITDPMAYVGTWKGDKKIFQLVEKPTRSAFNDFNENICVWSENKREVHP
Ga0213881_104999051 | 3300021374 | Exposed Rock | DPKTYVGTWKGDKKIFRLAERPARSDNNDLPEDICVWSETKRPVNP
Ga0210384_104775933 | 3300021432 | Soil | PKTYVGTWKGDKKIFQLVEKPPRSEFNDLPEGICVWSETKREIHP
Ga0242662_101174841 | 3300022533 | Soil | GTWKGDKKIFQLVEKPVRSEFNDLRENICAWSEKHPKGGQR
Ga0242657_10861423 | 3300022722 | Soil | WQGDKKIFQLAATPTRSAFNDFPEDICVWSETKRPVLQ
Ga0247752_10587551 | 3300023071 | Soil | LNITDPKTYVGTWKGDKKIFQLVEKPNHDFPEDICVWSENKRPVLQ
Ga0207699_113317182 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | KTYVGTWKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKRPVLQ
Ga0207643_102000711 | 3300025908 | Miscanthus Rhizosphere | DPRTYAGTWKGDKKIFQLVEKPTRSPYNDFNENICVWSETKRPVLQ
Ga0207681_103724261 | 3300025923 | Switchgrass Rhizosphere | LNITDPKTYVGTWKGDKKIFQLVEKPTRSAYNDFNEGICVWSETKREVHP
Ga0207681_118361862 | 3300025923 | Switchgrass Rhizosphere | EMVLNITDPKTYVGTWKGDKKIFQLVEKPTHDFPEDICAWSETKRPVLQ
Ga0207701_103639063 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | KGDKKIFQLVEKPTRSGFNDFPEDICVWSETKRPVQEP
Ga0207701_109645651 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | GDKKIFQLVEKPTRSAFNDFNENICVWSENKREVHP
Ga0207703_108395123 | 3300026035 | Switchgrass Rhizosphere | TDPMAYVGTWKGDKKIFQLVERPTRSAFNDFNENICVWSENKREVHP
Ga0207703_124042291 | 3300026035 | Switchgrass Rhizosphere | ILNITDPKTYVGTWKGDKKIFQLVEKPTRSAFGDFPEDICAWSETKREIHP
Ga0207674_113689902 | 3300026116 | Corn Rhizosphere | TDPKAYVGTWKGDKKIFQLVEKPTRSAYNDFNENICVWSETKRPVLQ
Ga0209422_11420321 | 3300027629 | Forest Soil | MILNIIYPKTYVGNWKGDKKIFQLVEKPARSEFNDLPENICAFLK
Ga0209118_10873091 | 3300027674 | Forest Soil | KAYVGTWKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKRPVLQ
Ga0209465_101274433 | 3300027874 | Tropical Forest Soil | WKGDKKIFQLAEKSTRSAYNDFPEDICVWSETKRPVLK
Ga0209168_105561862 | 3300027986 | Surface Soil | GDKKIFHLAEKPAGSEFNDLPENICAWSETKRPVNP
Ga0268264_121648011 | 3300028381 | Switchgrass Rhizosphere | VGTWKGDKKIFQLVEKPTRSAFNDFNENICVWSENKREVHP
Ga0307282_101176393 | 3300028784 | Soil | WNGDKKIFQLVEKPTRSAFNDFPEDICVWSETRREIPQ
Ga0308197_101315201 | 3300031093 | Soil | KAYVGTWKGDKKIFQLVEKPTRSAYNDFPEDICVWSETKREVLQ
Ga0308187_103474261 | 3300031114 | Soil | DPKTYVGTWKGDKKIFQLAEKPTRSAYNDFPEDICVWSETKREVHP
Ga0170824_1145947713 | 3300031231 | Forest Soil | GDKKIFQLAEKPARSEYNDLPEGICVWSETKREIHP
Ga0307468_1005474311 | 3300031740 | Hardwood Forest Soil | ITDPQTYVGTWKGDKKIFQLVEKPTRSAFNDFPEDICAWSETKREIHE
Ga0306925_106565223 | 3300031890 | Soil | WKGDKKIFQLVEKPSRSAFNDFPEDICVWSETKREIHP
Ga0306925_116241801 | 3300031890 | Soil | DPKAYVGTWKGDKKIFQLTEKSTHSAYNDFPEGICVWSETKRPVLK
Ga0308176_105731283 | 3300031996 | Soil | NITDPTAYVGTWKGDKKIFQLAEKPPRSDFNDLPEGICVWSETKREIHP
Ga0308173_100667094 | 3300032074 | Soil | VGTWKGDKKMFQLAEKPPRSDFNDLPEGICVWSETKREIHP
Ga0310889_103120442 | 3300032179 | Soil | YVGTWKGDKKIFQLVEKPTRSAYNDFNEGICVWSETKREVHP
Ga0335077_118431872 | 3300033158 | Soil | TAVWKGDKKIFQLVEKPARSAYKDLPENICVWSERKREVHP
Ga0314780_173582_390_545 | 3300034659 | Soil | TLNITDPQTYVGTWKGDKKIFQLVEKPTRSAYNDFNENICVWSETKRPVLK
Ga0314792_252161_394_516 | 3300034667 | Soil | GTWKGDKKIFQLVEKPTRSAYNDFNENICVWSETKREVRK
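
Length statistics such as the "Average Sequence Length" in the Overview can be recomputed from the sequences listed above. The sketch below is illustrative only: it assumes that a trailing `*` marks a translated stop codon and should not be counted as a residue, the helper names are hypothetical, and the three sample sequences are copied from this table (they are not the full family, so their mean differs from the family-wide figure).

```python
from statistics import mean


def residue_length(sequence: str) -> int:
    """Count residues, ignoring a trailing '*' (assumed stop-codon marker)."""
    return len(sequence.rstrip("*"))


def mean_residue_length(sequences: list[str]) -> float:
    """Average residue count over a non-empty list of protein sequences."""
    if not sequences:
        raise ValueError("at least one sequence is required")
    return mean(residue_length(s) for s in sequences)


# Three entries copied from the Family Sequences table (illustrative subset).
SAMPLE = [
    "VGTWKGDKKIFQLVEKPTRSAFNDFPEDICVWSETKREIHQ*",  # Ga0126383_123658062
    "KKIFQLVEKPTRSAFNDFPEDICVWSETKRPVLQ*",         # Ga0134080_103070031
    "GDKKIFQLVEKPTRSAFNDFNENICVWSENKREVHP",        # Ga0207701_109645651
]

if __name__ == "__main__":
    for seq in SAMPLE:
        print(residue_length(seq), seq)
    print("mean length of the sample:", mean_residue_length(SAMPLE))
```

Stripping the marker before counting keeps entries with and without the trailing `*` comparable.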




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice