NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F103757

Metagenome / Metatranscriptome Family F103757

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103757
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 43 residues
Representative Sequence FEQLYNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASV
Number of Associated Samples 92
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 94.06 %
% of genes from short scaffolds (< 2000 bps) 98.02 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score0.50

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (61.386 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(12.871 % of family members)
Environment Ontology (ENVO) Unclassified
(33.663 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(51.485 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.
1JGI10216J12902_1122111591
2C688J35102_1208633413
3Ga0063454_1012628491
4Ga0062593_1000220961
5Ga0062590_1001855883
6Ga0070660_1007186052
7Ga0070691_103154681
8Ga0070673_1008993541
9Ga0070703_102619232
10Ga0070701_106424872
11Ga0070685_111902491
12Ga0070707_1019700582
13Ga0068855_1010056811
14Ga0070664_1017019312
15Ga0066903_1064174611
16Ga0068851_101473562
17Ga0068862_1006160992
18Ga0070716_1009666773
19Ga0070716_1012133552
20Ga0070712_1013371901
21Ga0074059_101955901
22Ga0079221_107061721
23Ga0075428_1017314211
24Ga0075430_1010798121
25Ga0074063_133304993
26Ga0079218_117005272
27Ga0111539_101281215
28Ga0066709_1035295511
29Ga0114129_126820092
30Ga0105243_105285341
31Ga0105242_113462211
32Ga0126307_102313162
33Ga0126307_108161122
34Ga0126308_111583232
35Ga0126314_107325281
36Ga0126310_100769792
37Ga0126311_115967051
38Ga0134084_100969443
39Ga0105239_132950132
40Ga0137376_114605882
41Ga0137370_105053601
42Ga0137371_113578382
43Ga0150984_1160439612
44Ga0164300_101663472
45Ga0164309_104333551
46Ga0164307_103549232
47Ga0164306_101186841
48Ga0164306_109763732
49Ga0157369_121797102
50Ga0157374_105825693
51Ga0157378_110427532
52Ga0163162_124412312
53Ga0134081_103485572
54Ga0182000_106626532
55Ga0157376_109531452
56Ga0134073_103874002
57Ga0132258_117296073
58Ga0132255_1057212091
59Ga0190275_110643931
60Ga0190270_106639353
61Ga0190270_113632871
62Ga0207642_106530692
63Ga0207688_105954512
64Ga0207685_105840161
65Ga0207654_104470811
66Ga0207657_105119792
67Ga0207646_111878861
68Ga0207687_117839861
69Ga0207700_107145102
70Ga0207700_111416221
71Ga0207644_109066891
72Ga0207704_118136821
73Ga0207665_108312951
74Ga0207689_102440571
75Ga0207661_119819761
76Ga0207639_117266502
77Ga0207678_102578832
78Ga0207708_107398011
79Ga0209074_100363891
80Ga0209254_107033751
81Ga0247818_111735421
82Ga0247818_112059592
83Ga0247820_111135281
84Ga0247819_108713962
85Ga0247827_109453651
86Ga0247826_104127361
87Ga0307506_104099681
88Ga0318534_107176171
89Ga0318538_103173562
90Ga0310887_102588022
91Ga0307405_112680471
92Ga0318529_104006101
93Ga0307473_108561751
94Ga0306925_103287121
95Ga0308176_120712711
96Ga0308176_128667741
97Ga0307416_1005555252
98Ga0307416_1007607292
99Ga0307416_1025840051
100Ga0315275_120744951
101Ga0247830_101944342
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 52.17%    β-sheet: 0.00%    Coil/Unstructured: 47.83%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540FEQLYNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASVSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.50
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
61.4%38.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Lake Sediment
Sediment
Soil
Soil
Vadose Zone Soil
Serpentine Soil
Grasslands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
Avena Fatua Rhizosphere
8.9%3.0%3.0%5.9%3.0%3.0%3.0%9.9%12.9%3.0%3.0%4.0%4.0%3.0%3.0%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_11221115913300000956SoilLVLPRDLFEQLFNGEDAVSRGFLDVIQKDLVATLRETLRPCARLAASV*
C688J35102_12086334133300002568SoilFARLFHSEDAVSRVFLEAIQRDLLATLRQALRPYARLTASV*
Ga0063454_10126284913300004081SoilAILLVLPRDVFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPYARLTASV*
Ga0062593_10002209613300004114SoilILLVLPRDPFDKLFNGEDAVSRGLLDVIQRDLTATLRETFRPCARLAASV*
Ga0062590_10018558833300004157SoilPFDKLFNGEDAVSRGLLDVIQRDLTATLRETFRPCARLAASV*
Ga0070660_10071860523300005339Corn RhizosphereVLPRDVFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPLARLTASV*
Ga0070691_1031546813300005341Corn, Switchgrass And Miscanthus RhizosphereFDQLFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0070673_10089935413300005364Switchgrass RhizosphereLPREPFDQLFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0070703_1026192323300005406Corn, Switchgrass And Miscanthus RhizosphereQLFHREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0070701_1064248723300005438Corn, Switchgrass And Miscanthus RhizosphereRAILLVLPRDPFDQLFNGEDAVSRGLLDVIQRDLTATLRETFRPCARLAASI*
Ga0070685_1119024913300005466Switchgrass RhizosphereEHFDQLFHKEDAVARVFLEAIQRDLLASLRQTLRPAARLAASV*
Ga0070707_10197005823300005468Corn, Switchgrass And Miscanthus RhizosphereMAEQLFNAEDAVSRVFLDVIQRDLVATLRETSRPNARLSASV*
Ga0068855_10100568113300005563Corn RhizosphereLPREPFDQLFHREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0070664_10170193123300005564Corn RhizosphereFEQLYNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASV*
Ga0066903_10641746113300005764Tropical Forest SoilLLVLARDPFEQLYNRNDTIARLFLDVLLRDLVATVRLTLRPHARLAASV*
Ga0068851_1014735623300005834Corn RhizosphereEDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0068862_10061609923300005844Switchgrass RhizosphereVFARLFHSEDAVSRVFLEAIQRDLLAMLRQTLRPYARLTASV*
Ga0070716_10096667733300006173Corn, Switchgrass And Miscanthus RhizosphereAVTRERSLLLVLPRAHFEQLFNGENAVSRVFLEVLQRDQVATLRQTLRPHARLAASL*
Ga0070716_10121335523300006173Corn, Switchgrass And Miscanthus RhizosphereVLPSDRFTQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL*
Ga0070712_10133719013300006175Corn, Switchgrass And Miscanthus RhizosphereREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0074059_1019559013300006578SoilPRDPFEQLFNGENAVSRVFLEVLQRDQVATLRQTLRPHARLAASL*
Ga0079221_1070617213300006804Agricultural SoilVLPRDRFTQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL*
Ga0075428_10173142113300006844Populus RhizospherePRDLFGKLFDGEDAVSRGFLDVIQKDLMATLRETLRPCARLAARAP*
Ga0075430_10107981213300006846Populus RhizosphereVTPSSSFLDREDAVSRVFLDVIQLDLVATLRQHARLAASV*
Ga0074063_1333049933300006953SoilHGPFEQLYNGEDAVSRVFLDVIRRELVATLRQTLRQQARLEASV*
Ga0079218_1170052723300007004Agricultural SoilVPRDLFEQLFNGEDAVSRGFLDVIQKDLMATLRDTLRPCARLAASG*
Ga0111539_1012812153300009094Populus RhizosphereVLPRDVFARLFHSEDAVSRVFLEAIQRDLLAMLRQTLRPYARLTASV*
Ga0066709_10352955113300009137Grasslands SoilPRDPFEQLFNGEDAVSRVFLDVIQRDLVATLRQTLRPNARLAASV*
Ga0114129_1268200923300009147Populus RhizosphereVTPSSSFLDGEDAVSRVFLDVIQLDLVATLRQTLRPHARLAASL*
Ga0105243_1052853413300009148Miscanthus RhizosphereQLFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0105242_1134622113300009176Miscanthus RhizosphereVLPRGPFDQLFNGEDAVSRVFLDVIQRELVATLRQTLRQHARLAASV*
Ga0126307_1023131623300009789Serpentine SoilLLLVVRGDLFQRLFNGEDAVSRGFLDVIQRDLMATLRETLRPCARLAASGS*
Ga0126307_1081611223300009789Serpentine SoilGEDAVSRGFLDVIQKDLVATLRETLRPCARLAASG*
Ga0126308_1115832323300010040Serpentine SoilFDKDDAVGRVFLDAIQRDLLATLRQALRPYARVAASV*
Ga0126314_1073252813300010042Serpentine SoilEQLFNGEDAVSRGFLDVIQKDLVATLRETLRPCARLAASG*
Ga0126310_1007697923300010044Serpentine SoilFNGEDAVSRVFLDVIQRELVAALRQPLRQYARLAASV*
Ga0126311_1159670513300010045Serpentine SoilLLYKEDAVGRVFLEAIQRDLLATVRRTLRPCARLAASL*
Ga0134084_1009694433300010322Grasslands SoilPSDVFEQLFHREDAISRVFLEAIQRDLLVTLRHALRPSARLASSV*
Ga0105239_1329501323300010375Corn RhizospherePRDHFEQLYNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASV*
Ga0137376_1146058823300012208Vadose Zone SoilLFNGEDAVSRVFLDVIHRELVATLRQTLRPHARLAASV*
Ga0137370_1050536013300012285Vadose Zone SoilLAREPFGQLFAREDTISEVFLDVILRDLAATLRQTLRPHARLASSV*
Ga0137371_1135783823300012356Vadose Zone SoilVLARESFGQLFEGEDTISEVFLDVILRDLAATLRQTLRPHARLASSV*
Ga0150984_11604396123300012469Avena Fatua RhizosphereGEDAVSRVFLDVIRRDLLATLRQTLRPHARLAASV*
Ga0164300_1016634723300012951SoilERSLLLVLPREPFEQLFHREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0164309_1043335513300012984SoilFDTLFAGEDAVSRVILDVIQADLVATLRQTLRPQARLSASL*
Ga0164307_1035492323300012987SoilFDRRFHNEDAVSRVFLEAIHRDLLATLRQTLRPCARLTPSV*
Ga0164306_1011868413300012988SoilEQLFHREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0164306_1097637323300012988SoilFVLPSDRFTQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL*
Ga0157369_1217971023300013105Corn RhizosphereFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0157374_1058256933300013296Miscanthus RhizosphereRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV*
Ga0157378_1104275323300013297Miscanthus RhizosphereLAIARENFAELFHREDAIARVFLDLLLRELVATLRITLRPHARLAASI*
Ga0163162_1244123123300013306Switchgrass RhizosphereFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL*
Ga0134081_1034855723300014150Grasslands SoilLVLPRGPFEELFNGEDAVSRGLLDVIQRDLMATLRATFRPCARLAASV*
Ga0182000_1066265323300014487SoilITRERAVLLVLPRDLFEPLFNGDDAVSRGFLDVIQKDLLATLRETLRPCARLAASL*
Ga0157376_1095314523300014969Miscanthus RhizosphereREDAIARVFLDLLLRELVATLGLTLRPHARLAASV*
Ga0134073_1038740023300015356Grasslands SoilVLLLVLARDPFEQLFNRDDAISHVFLDVIQRDMVATLRQTLRPHARLAASL*
Ga0132258_1172960733300015371Arabidopsis RhizosphereLVLPREPFDQLFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASL*
Ga0132255_10572120913300015374Arabidopsis RhizosphereEDAVSRVFLEAIHRDLLATLRQTLRPCARLTESV*
Ga0190275_1106439313300018432SoilLFARLFAGEDAVSGGFLDVVRNDLMASLRESLRPAARLAASG
Ga0190270_1066393533300018469SoilERSLLLVIPQGLFDQLLQGEDAVSRSFLDVIQKEMIATLRETLRPCARLGASV
Ga0190270_1136328713300018469SoilLFNGEDAVSRGFLDVIQKDLMATLRETLRPCARLAASR
Ga0207642_1065306923300025899Miscanthus RhizosphereVFDQLYRREDAVGRVFVDAIQRDLLVTVRQTLRPYARLASSV
Ga0207688_1059545123300025901Corn, Switchgrass And Miscanthus RhizosphereFDGEDAVSRGFLDVIQKDLMATLRETLRPCARLAARAP
Ga0207685_1058401613300025905Corn, Switchgrass And Miscanthus RhizosphereHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL
Ga0207654_1044708113300025911Corn RhizosphereQLFRREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV
Ga0207657_1051197923300025919Corn RhizosphereILLVLPRDVFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPLARLTASV
Ga0207646_1118788613300025922Corn, Switchgrass And Miscanthus RhizosphereMAEQLFNAEDAVSRVFLDVIQRDLVATLRETSRPNARLSASV
Ga0207687_1178398613300025927Miscanthus RhizosphereTLILSIPRENFTELFHREDTIARVFLDLLLRDLVATLRLTLRPHARLAASV
Ga0207700_1071451023300025928Corn, Switchgrass And Miscanthus RhizospherePRDRFAQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL
Ga0207700_1114162213300025928Corn, Switchgrass And Miscanthus RhizosphereFTQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL
Ga0207644_1090668913300025931Switchgrass RhizosphereAQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL
Ga0207704_1181368213300025938Miscanthus RhizosphereLLVIPHELFGRLFAGEDAVSRGFLDVIQRDLMTALRETLRPRARLAASV
Ga0207665_1083129513300025939Corn, Switchgrass And Miscanthus RhizosphereDRFTQLFHGEDAVSRVFLDVIQRELVATLRQTLRPHARLAASL
Ga0207689_1024405713300025942Miscanthus RhizosphereGVFARLFHSEDAVSRVFLEAIQRDLLAMLRQTLRPYARLTASV
Ga0207661_1198197613300025944Corn RhizosphereGLLLVLPRDPFEQLFNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASV
Ga0207639_1172665023300026041Corn RhizosphereGEDAVSRGFLDVIQKDLMATLRETLRPCARLAARAP
Ga0207678_1025788323300026067Corn RhizosphereGEDAVSRGFLDVIQRDLMTALRETLRPRARLAASV
Ga0207708_1073980113300026075Corn, Switchgrass And Miscanthus RhizosphereRDVFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPLARLTASV
Ga0209074_1003638913300027787Agricultural SoilEPFDQLFHREDAISRVFLDVIQRDLVATLRQSLRPHARLAASV
Ga0209254_1070337513300027897Freshwater Lake SedimentTLFTAEDPISRVFLDVIQRDLVATLRQTLSPLARLVASRRA
Ga0247818_1117354213300028589SoilKLFDGEDAVSRGFLDVIQKDLMATLRETLRPCARLAARAP
Ga0247818_1120595923300028589SoilEPLFHREDAVSRVFLDAIQRDLLATLRQTLRPYARLAASV
Ga0247820_1111352813300028597SoilLYRREDAVGRVFVDAIQRDLLVTVRQTLRPYARLASSV
Ga0247819_1087139623300028608SoilVLPRDVFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPLARLTASV
Ga0247827_1094536513300028889SoilGEDTVSRGFLDVIQKDLMATLRETLRPCARLAASR
Ga0247826_1041273613300030336SoilDLFARLFNGEDAVSRGFLDVIQKDLTTTLRETLRPYARLGASI
Ga0307506_1040996813300031366SoilHFEQLYNGEDAVSRVFLDVIQRDLVATLRETLRPHARLAASV
Ga0318534_1071761713300031544SoilLLILPGNVFDHLFYREDAVSRVFLEAIQRDLLVALRQALRQWARLRASA
Ga0318538_1031735623300031546SoilLFYREDAVSRVFLEAIQRDLLVAVRQALSQWARLRASV
Ga0310887_1025880223300031547SoilLVLPRDLFARLFHSEDAVSRVFLEAIQRDLLATLRQTLRPYARLTASV
Ga0307405_1126804713300031731RhizosphereDHFGQLFHKEDAVARVFLEAIQRDLLASLRQTLRPCARLAASV
Ga0318529_1040061013300031792SoilPFEQLYNRHDTIARLFIEMLLRDLMTTVRSTLRPHARLAASV
Ga0307473_1085617513300031820Hardwood Forest SoilRLFHSEDAVSRVFLEAIQRDLLATLRQALRPYARLTASV
Ga0306925_1032871213300031890SoilLLVVPGNVFDHLFYREDAVSRVFLEAIQRDLLVSLRQALRQWARLRASA
Ga0308176_1207127113300031996SoilFTGEDAVSRVILDVIQADLVATLRQTLRPQARLAASV
Ga0308176_1286677413300031996SoilRACLLVIPHDLFERLFEGEDAVSRGFLDVIQRDLMTALRETLRPRARQAASAPPPTGDR
Ga0307416_10055552523300032002RhizosphereLLLVLQRDVFDQLFHKEDAVSRVFLEAIQRDLLATLRRTLRPCARLAASL
Ga0307416_10076072923300032002RhizosphereVIPRELFEQLFNGEDAVSRGFIDVIQKELVATLRQTLRQHARIAASV
Ga0307416_10258400513300032002RhizosphereFEGEDAVSRGFLDVIQKDLMTALRETLRPRARLAASV
Ga0315275_1207449513300032401SedimentPRGPFEQLFNGEDAVSRVFLDVILRDLVATLRQTLRPHARLAASV
Ga0247830_1019443423300033551SoilHDVFDQLYRREDAVGRVFVDAIQRDLLVTVRQTLRPYARLASSV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.