NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome Family F097584


Overview

Basic Information
Family ID F097584
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 39 residues
Representative Sequence KDGRIRFAGPDRVRIERAAPTLSDRVGLVREFLGKLT
Number of Associated Samples 100
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 97.12 %
% of genes from short scaffolds (< 2000 bps) 91.35 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47

Hidden Markov Model

Most Common Taxonomy
Group Bacteria (86.538 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen (11.539 % of family members)
Environment Ontology (ENVO) Unclassified (35.577 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.385 % of family members)
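
The percentages reported in this overview can be converted back into approximate member counts. The sketch below assumes that every percentage is a fraction of the 104 sequences in the family; the page does not state the denominator explicitly, and the function name and dictionary labels are illustrative only.

```python
FAMILY_SIZE = 104  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value into a whole-number count.

    Assumption: the percentage is taken over all sequences in the family.
    """
    return round(family_size * percent / 100)


# Percentages copied from the overview section of this page.
reported = {
    "Bacteria (NCBI taxonomy)": 86.538,
    "Fen (GOLD ecosystem)": 11.539,
    "Unclassified (ENVO)": 35.577,
    "Plant rhizosphere (EMPO)": 40.385,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} sequences")
```

Under this assumption the 11.539 % Fen figure corresponds to 12 sequences, which agrees with the 12 rows whose habitat is Fen in the Family Sequences table.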




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 36.92%, β-sheet: 0.00%, Coil/Unstructured: 63.08%

Predicted 3D Structure

pTM-score: 0.47
Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
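
The low-confidence rule quoted above can be written as a one-line check. This is only a sketch: the 0.7 cutoff comes from the note, while the function name and the "high" label for scores at or above the cutoff are assumptions, since the page only names the low-confidence case.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the "Low Quality Model" note


def model_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> str:
    """Label a predicted 3D model by its pTM-score.

    "low" follows the page's rule (pTM < 0.7); "high" is an assumed
    label for everything else.
    """
    return "low" if ptm_score < cutoff else "high"


# pTM-score reported for family F097584
print(model_confidence(0.47))
```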



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 86.5%
Unclassified: 13.5%

Associated Scaffolds






Environmental Properties

Associated Habitat Types




Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E4A_11143280 | 2170459003 | Grass Soil | VQQDGRYRFAGPDRVRIERAAPTLEERVALVEAFLGRLT
WSSedB2BaDRAFT_10168011 | 3300000312 | Wetland | PPFDPGKLILLVQRDGRIRFAGPDRVRIERGAPALTERFALVRDFLARLA*
JGI12635J15846_102414743 | 3300001593 | Forest Soil | GRYRFAGPDRVRIERAAPTLEERVALVEGFLGRLT*
Ga0055499_101057321 | 3300004047 | Natural And Restored Wetlands | GKLILNMQRDGRTRFAGPDRVRIDRASPALADRVALVRSFLAQLA*
Ga0062386_1015812622 | 3300004152 | Bog Forest Soil | IQKDGRYRLAGQDRVRIERAAPSLEERVALVREFLGRLA*
Ga0066677_102019552 | 3300005171 | Soil | IQKDGRHRFAGQDRVRIERAAPSLEERSALVREFLGRLA*
Ga0066679_100291984 | 3300005176 | Soil | VQQDGRYRFAGPDRVRIERAAPTLEERVALVESFLGRLT*
Ga0070666_108649941 | 3300005335 | Switchgrass Rhizosphere | LEVQRDGRTRFAGPDRIRLERAAPTLDDRVALVRDFLGRL*
Ga0070705_1006848101 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | LVQRDGRIRFAGQDRVRIERAAPALAERVGLVVDFLGRLR*
Ga0070663_1001514781 | 3300005455 | Corn Rhizosphere | PPFDAGKLIALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR*
Ga0068867_1015673321 | 3300005459 | Miscanthus Rhizosphere | VAMIRNDGRLRFAGPDRIRIDRAAPTLADRVTLVKEFVGRIG*
Ga0070695_1012757231 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | AIVQKDGRVRFAGPDKVRIECAAPALADRVALVRDFVGKLT*
Ga0070696_1000124361 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | IALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR*
Ga0066700_108510502 | 3300005559 | Soil | RHRFAGQNRVRIERAAPSLDQRVALVREFLGRLS*
Ga0066708_104760672 | 3300005576 | Soil | IQKDRRHKLAGPDKVRIEREAPTIEDRVAVVREFLGRLA*
Ga0068857_1006765991 | 3300005577 | Corn Rhizosphere | RIRFAGPDRIRIERAAPTLNDRVAVVRDFLGKLT*
Ga0068854_1022282612 | 3300005578 | Corn Rhizosphere | QRDGRTRFAGPDRVRIDRAAPTLADRVALVRSFLGQLA*
Ga0068859_1001558541 | 3300005617 | Switchgrass Rhizosphere | MIRNDGRLRFAGPDRIRIDRAAPTLADRVTLVKEFVGRIG*
Ga0068861_1000294256 | 3300005719 | Switchgrass Rhizosphere | GRVRFAGPDKVRIECAAPALADRVALVRDFAGKLT*
Ga0068851_104160971 | 3300005834 | Corn Rhizosphere | DGRIRFAGPDRVRIDRGAPTLAERVALVRDFLAKLA*
Ga0068870_103276834 | 3300005840 | Miscanthus Rhizosphere | LIRFAGPDRIRIERAAPALAERVALVREFIERLK*
Ga0097621_10000137418 | 3300006237 | Miscanthus Rhizosphere | DGRVRFAGPDKVRIECAAPALADRVALVRDFAGKLT*
Ga0075425_1024630562 | 3300006854 | Populus Rhizosphere | LVQRDGRMRFAGPDRLRIEQAAPTLDERVTLVRDFVARLK*
Ga0075434_1005343331 | 3300006871 | Populus Rhizosphere | RMRFAGPDRIRIEQAAPTLDERVTLVRDFLTRLK*
Ga0075434_1006319362 | 3300006871 | Populus Rhizosphere | IPIIRNDGRVRFAGPDRLRIDRAAPTLAERVSLVKEFIGRLN*
Ga0075426_112194071 | 3300006903 | Populus Rhizosphere | AGKLIALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR*
Ga0075435_1000220885 | 3300007076 | Populus Rhizosphere | GRHRFAGQNRVRIDRAAPTLDERVALVREFLGRLA*
Ga0075418_124379012 | 3300009100 | Populus Rhizosphere | GRTRFAGPDRVRLERAAPTLDERYALVRDFLNRL*
Ga0066709_1045655491 | 3300009137 | Grasslands Soil | KLISLVQRDGHYRFAGQDRVRIERAAPELADRVVLVEEFLGKLK*
Ga0105243_124227701 | 3300009148 | Miscanthus Rhizosphere | GRIRFHGQDRIRIERGAPTLADRVALVRDFLGKLQ*
Ga0126374_113949021 | 3300009792 | Tropical Forest Soil | RVRFAGPDRIRIDRPAPTLAERVALVKEFIGRLA*
Ga0131092_113985841 | 3300009870 | Activated Sludge | RTRFSGPDRVRIDRAAPMLDERIALVREFLGTLA*
Ga0134067_103161142 | 3300010321 | Grasslands Soil | DGRHRFAGQDRVRIERAAPTLEERSALVREFLARLA*
Ga0126379_101366454 | 3300010366 | Tropical Forest Soil | QKDGRHRLAGQDRVRIERAAPALEERVALVREFLSRLS*
Ga0126383_116843822 | 3300010398 | Tropical Forest Soil | GRMRFAGPDRIRIDRAAPTLDERVALVRDFLARLR*
Ga0126383_134515701 | 3300010398 | Tropical Forest Soil | KDGRYRLAGPDKVRIERAAPMLDERVTSVREFLATLS*
Ga0134121_114347832 | 3300010401 | Terrestrial Soil | ALMKVDGHIRFAGPNRVRIERGAPGLPDRVALIRDFLARLV*
Ga0120134_10348211 | 3300012004 | Permafrost | QKDGRVRFAGPDRVRIERAAATLADRIALVREFIGTLV*
Ga0136632_105347812 | 3300012093 | Polar Desert Sand | VQKDGRIRFAGNDKIRIERGAPMLAERVALVQDFLVGLK*
Ga0137363_104542502 | 3300012202 | Vadose Zone Soil | LVQRDGRYRFAGQDRVRIERAAPMLGDRVALVEEFLTKLR*
Ga0137370_103658771 | 3300012285 | Vadose Zone Soil | GRHRFAGQNRVRIERAAPSLDERVALVREFLGRLA*
Ga0137366_105227371 | 3300012354 | Vadose Zone Soil | KDGRHRFAGQDRVRIERAAPTLEERSALVREFLARLA*
Ga0137384_113413521 | 3300012357 | Vadose Zone Soil | RYRFAGADRVRIERAAPSLDERVALVEEFLSKVS*
Ga0136613_105375631 | 3300012681 | Polar Desert Sand | FDAANLITRVQKDGRIRFAGNDKIRIERGAPMLAERVALVQDFLVGLK*
Ga0164241_110708621 | 3300012943 | Soil | DGRIRFAGPDRIRIDHAAPGLVERIALVRDFLGKLG*
Ga0157375_105470712 | 3300013308 | Miscanthus Rhizosphere | RNDGRVRFAGPDRVRIDRAAPTLAERVALVKEFVAKLA*
Ga0075340_11264312 | 3300014304 | Natural And Restored Wetlands | LDGRIRFAGPDRIRIERAAPALADRFALVKDFIARLA*
Ga0163163_124714931 | 3300014325 | Switchgrass Rhizosphere | QKDVRIRFAGPDRVRIERAAPTLSERIALVRDFLASLR*
Ga0157380_134183871 | 3300014326 | Switchgrass Rhizosphere | EVQRDGRTRFAGPDRIRLERAAPTLDDRVALVRDFLGRL*
Ga0157379_100788313 | 3300014968 | Switchgrass Rhizosphere | KDGRVRFAGPDRIRIERAAPTLAERVAVIREFLEMLR*
Ga0132255_1044748481 | 3300015374 | Arabidopsis Rhizosphere | VQKDGRIRFAGPDRVRIDRAAPELADRVGLVREFLGRLD*
Ga0190264_120397721 | 3300019377 | Soil | DGRVRFAGPDRVRIERAAPALADRVTLVKEFLAKVSGSDQRPAA
Ga0224510_102671591 | 3300022309 | Sediment | GRTRFAGPDRVRIDRAAPALAERAALVREFFARLR
Ga0207680_109261892 | 3300025903 | Switchgrass Rhizosphere | NDGRLRFAGPDRIRIDRAAPTLADRVTLVKEFVGRIG
Ga0207660_106536152 | 3300025917 | Corn Rhizosphere | KLIALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGKLR
Ga0207644_118476711 | 3300025931 | Switchgrass Rhizosphere | DGHIRFAGPDRIRIERAAPALAERVALVREFIERLK
Ga0207689_116810651 | 3300025942 | Miscanthus Rhizosphere | AGKLIALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR
Ga0207712_106970522 | 3300025961 | Switchgrass Rhizosphere | AEEFHDGRIRFAGQDRVRIERAAPALAERVGLVVEFLGRLR
Ga0207668_110657812 | 3300025972 | Switchgrass Rhizosphere | MAQSIEVQRDGRTRFAGPDRIRLERAAPTLDDRVALVRDFLGRL
Ga0210117_10338492 | 3300025985 | Natural And Restored Wetlands | REDGRVRFAGPDRVKIECAAPTLADRVALVREFAGRLA
Ga0207703_121537011 | 3300026035 | Switchgrass Rhizosphere | LIALIQQDGRYRFAGQDRVRIERAAPTVEDRVGLIEEFLGRLR
Ga0207639_116764692 | 3300026041 | Corn Rhizosphere | LLVQKDGRIRFAGPDRVRIERAAPTLADRVTLVGDFLGRLV
Ga0207674_105928211 | 3300026116 | Corn Rhizosphere | KDGRIRFAGPDRVRIERAAPTLSDRVGLVREFLGKLT
Ga0209863_100939711 | 3300026281 | Prmafrost Soil | RDGRVRFHGQDRIRIERGAPTLIDRVALVREFLGKLK
Ga0209890_102150412 | 3300026291 | Soil | LIALVQQDGRYRFAGPDRIRIERAAPSLEERVTLVESFLGRLA
Ga0209808_12258182 | 3300026523 | Soil | IQKDRRHKLAGPDKVRIEREAPTIEDRVAVVREFLGRLA
Ga0209807_12604881 | 3300026530 | Soil | LIVQRDGRIRFAGQDRVRIERAAPTLAERVALVEDFLGRLR
Ga0209999_10286812 | 3300027543 | Arabidopsis Thaliana Rhizosphere | QKDGRIRFAGPDRIRIERGAPTLNERVGLVRDFLTKLV
Ga0208991_11298271 | 3300027681 | Forest Soil | GRYRFAGPDRIRIERAAPTLEERVALVEAFLGRLT
Ga0209797_104654251 | 3300027831 | Wetland Sediment | DGRFRFAGPDRLRIEWPTRTLDDRVTLVKDFLKKLA
Ga0209496_102844362 | 3300027890 | Wetland | VQRDGRFRFAGPDRVRIEKAAPVLAERVALVREFLGRLA
Ga0268266_115456102 | 3300028379 | Switchgrass Rhizosphere | VQKDGRVRFAGPDRIRIERAAPTLAERVAVIREFLEMLR
Ga0268265_121409272 | 3300028380 | Switchgrass Rhizosphere | MQRDGRTRFAGPDRVRIDRAAPTLADRVALVRSFLGQLA
Ga0302160_101671771 | 3300028665 | Fen | GRIRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK
Ga0311332_117815312 | 3300029984 | Fen | GRIRFHGQDRIRIERGAPTLVDRVNLVREFLGRLG
Ga0311334_108131412 | 3300029987 | Fen | LILLVQKDGRIRFAGQDRIRIERGAPTLGDRIALVGDFLRRLA
Ga0311365_100531661 | 3300029989 | Fen | LVQKDGRIRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK
Ga0311337_117376572 | 3300030000 | Fen | LEQKDGRSRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK
Ga0302172_102271722 | 3300030003 | Fen | DGRYRFAGQDRVRIERAAPTIDDRLVLVKEFLGKLQ
Ga0311349_108309252 | 3300030294 | Fen | RDGRYRFAGQDRVRIERAAPSIDDRLALVQEFLGRLA
Ga0311349_114319592 | 3300030294 | Fen | QKDGRIRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK
Ga0311366_105700431 | 3300030943 | Fen | DGRYRFAGQDRVRIERAAPTIDDRLALVQEFLGKLQ
Ga0311366_115260282 | 3300030943 | Fen | KDGRIRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK
Ga0307497_102579261 | 3300031226 | Soil | LIQQDGRYRFAGQDRVRIERAAPTVEDRVGLIEEFLGRLR
Ga0302323_1017878842 | 3300031232 | Fen | GRIRFAGPDRVRIERAAPSVAERVALVRDFLGRLQ
Ga0307506_105302201 | 3300031366 | Soil | RDGRVRFAGPDRLRIDRAAPTLGERVALVRGFLEKLT
Ga0310813_103840031 | 3300031716 | Soil | KLIALIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR
Ga0302321_1014943871 | 3300031726 | Fen | LILLVQKDGRIRFHGQDRIRIERGAPTLVDRVNLVREFLGRLG
Ga0310907_101891891 | 3300031847 | Soil | RLILHMQRDGRTRFAGPDRVRIERAAPALDERVALVRGFLATLA
Ga0315297_116325132 | 3300031873 | Sediment | VQKDGRIRFAGPDRVRIERAAPMLADRFALVKDFIARLA
Ga0315274_105866572 | 3300031999 | Sediment | MQEDGRVRFAGPDRVRIERAAPTLADRFALVRDFVARLA
Ga0310897_100964173 | 3300032003 | Soil | QRDGRTRFAGPDRVRIERAAPALDERVALVRGFLATLA
Ga0318514_103344772 | 3300032066 | Soil | GTHRLAGPNRVRIERAAPSLNERAVLVREFLSQLG
Ga0308173_117050992 | 3300032074 | Soil | KDGRIRFAGPDRLRIESAAPALSDRVTLVRDFLAKLA
Ga0318518_106904781 | 3300032090 | Soil | GRYRFAGQDRVRIERAAPELADRVATVGEFLEKLR
Ga0310895_103536471 | 3300032122 | Soil | PARLILHMQRDGRTRFAGPDRVRIERAAPALDERVALVRGFLATLA
Ga0307472_1014440602 | 3300032205 | Hardwood Forest Soil | DGRIRFAGPERIRIERAAPTLQERVALLRDFLGKVR
Ga0315270_103975352 | 3300032275 | Sediment | QKDGRIRFHGPDRVRIERAAPALADRVALLREFLGRLA
Ga0335083_107435392 | 3300032954 | Soil | GRVRFAGPDRVRIDRAAPALAERVALVKEFIGKLA
Ga0318519_104266461 | 3300033290 | Soil | RSRRTRFAGPDRVRLERAAPTLDERVQLVRDFLSRL
Ga0310810_105211523 | 3300033412 | Soil | LIQQDGRYRFAGQDRVRIERAAPAVEDRVGLIEEFLGRLR
Ga0316603_107036961 | 3300033413 | Soil | QKDGRIRFAGPDRVRIDRAAPTLAERIMLVKDFLGRLG
Ga0316601_1013935122 | 3300033419 | Soil | LMVQKDGRIRFAGPDRLRIERAAPTLSDRAALIKEFLGRLG
Ga0326723_0200105_3_140 | 3300034090 | Peat Soil | TKLITLVQRDGRYRFAGQDRVRIERAAPDLEDRVALVEDFLGKLT
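
Each row of the table above carries four fields: protein ID, sample taxon ID, habitat, and sequence. The sketch below shows one way such records could be written out in FASTA format. It rests on several assumptions: the `FamilyMember` type, the `to_fasta` helper, and the header layout are hypothetical; the trailing `*` on some sequences is treated as a stop-codon marker (a common convention that the page itself does not explain); and the three example records are rows from the table, split into their fields by hand.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (hypothetical container)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


def to_fasta(members: Iterable[FamilyMember]) -> str:
    """Render records as FASTA text, one header line and one sequence line each."""
    lines = []
    for m in members:
        # Assumption: a trailing "*" marks a stop codon and is not a residue.
        residues = m.sequence.rstrip("*")
        lines.append(f">{m.protein_id} taxon={m.taxon_id} habitat={m.habitat}")
        lines.append(residues)
    return "\n".join(lines) + "\n"


# Three rows taken from the table, including the representative sequence.
examples = [
    FamilyMember("Ga0207674_105928211", "3300026116", "Corn Rhizosphere",
                 "KDGRIRFAGPDRVRIERAAPTLSDRVGLVREFLGKLT"),
    FamilyMember("Ga0302160_101671771", "3300028665", "Fen",
                 "GRIRFHGQDRIRIERGAPTLVDRVALVRDFLGKLK"),
    FamilyMember("Ga0066677_102019552", "3300005171", "Soil",
                 "IQKDGRHRFAGQDRVRIERAAPSLEERSALVREFLGRLA*"),
]

fasta = to_fasta(examples)
print(fasta)
```

Keeping the habitat in the header line is a design choice for this sketch; tools that expect a bare identifier after `>` would need only the protein ID.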




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"