NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103376


Overview

Basic Information
Family ID F103376
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 44 residues
Representative Sequence DAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Number of Associated Samples 96
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.01 %
% of genes from short scaffolds (< 2000 bps) 96.04 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.27

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
[HMM logo, rendered with Skylign.]

Most Common Taxonomy
Group Bacteria (84.158 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment (11.881 % of family members)
Environment Ontology (ENVO) Unclassified (37.624 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.594 % of family members)
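
The percentages in this overview are shares of the 101 family members, so they can be converted back into member counts. The following is a minimal sketch, assuming each printed percentage is simply count / 101 rounded to three decimals; the dictionary labels and the function name `member_count` are illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported for family F103376

# Percentages as printed on the page (labels are shortened for illustration).
reported_percent = {
    "Bacteria (NCBI taxonomy)": 84.158,
    "GOLD: Freshwater lake sediment": 11.881,
    "ENVO: Unclassified": 37.624,
    "EMPO: Plant rhizosphere": 40.594,
}


def member_count(percent, family_size=FAMILY_SIZE):
    """Nearest whole number of members matching a reported percentage."""
    return round(percent * family_size / 100)


counts = {label: member_count(p) for label, p in reported_percent.items()}

for label, n in counts.items():
    # Recompute the percentage from the count to check the round trip.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.3f} %)")
```

Under that assumption the counts come out as 85, 12, 38 and 41 members, and each recomputed percentage matches the printed value to three decimals.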




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment (MSAViewer) of the 101 family sequences. Sequence IDs and sequences are listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 41.79%, β-sheet: 0.00%, Coil/Unstructured: 58.21%
[Feature Viewer tracks for the representative sequence: sequence, α-helices, β-strands, coil, secondary-structure confidence score, disordered regions.]

Predicted 3D Structure

pTM-score: 0.27

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
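
The low-confidence rule quoted above can be written as a one-line check. This is only a sketch of the stated cutoff (pTM < 0.7); the function name `is_low_confidence` is hypothetical, and how the database treats scores at or above 0.7 is not described on this page.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted on the page: "low confidence model (pTM < 0.7)"


def is_low_confidence(ptm_score: float) -> bool:
    """True when a model's pTM-score falls below the quoted cutoff."""
    return ptm_score < LOW_CONFIDENCE_PTM


# pTM-score reported for family F103376.
print(is_low_confidence(0.27))
```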



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Chart of NCBI taxonomy at the top level. Labels: All Organisms, Unclassified. Values: 84.2 %, 15.8 %.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Chart of habitat types of the associated samples. Habitat labels: Wetland; Freshwater Lake Sediment; Sediment; Bog Forest Soil; Wetland Sediment; Estuarine; Natural And Restored Wetlands; Sediment (Intertidal); Groundwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Agricultural Soil; Grasslands Soil; Hardwood Forest Soil; Untreated Peat Soil; Tropical Peatland; Corn, Switchgrass And Miscanthus Rhizosphere; Corn Rhizosphere; Fen; Arabidopsis Rhizosphere; Switchgrass Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Rhizosphere; Avena Fatua Rhizosphere. Slice values shown: 11.9 %, 3.0 %, 4.0 %, 6.9 %, 5.9 %, 3.0 %, 5.9 %, 4.0 %, 4.0 %, 4.0 %, 3.0 %.]



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0055437_102502901 | 3300004009 | Natural And Restored Wetlands | ADAAIVSRFETRPPAGLYGAPPGRQKAVIRDPELLRRLTI*
Ga0062595_1000887691 | 3300004479 | Soil | GDQVSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0062383_101376701 | 3300004778 | Wetland Sediment | EVNVTFDKSCSDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI*
Ga0066673_106906091 | 3300005175 | Soil | ADAAVLSRFDSNPPAGLYGAPLSRQKSVIRDPATLKKLTT*
Ga0068998_100669061 | 3300005213 | Natural And Restored Wetlands | AQNCADPAILSRLETNPPAGLYGAPPDRQKALIKDPETLRRLAI*
Ga0070682_1004505311 | 3300005337 | Corn Rhizosphere | VSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0068868_1023040641 | 3300005338 | Miscanthus Rhizosphere | AMFGQPCLSLGRIAGEEVDVIFAKACTEPSVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI*
Ga0070691_102502813 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | GIAGDQVSIAFDRRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0070671_1017652351 | 3300005355 | Switchgrass Rhizosphere | RCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0070667_1009726313 | 3300005367 | Switchgrass Rhizosphere | AGDQVSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0066699_100706623 | 3300005561 | Soil | VVFDKSCADPSVLGRLETNPPAGLYAAPPSRQRTVIKNPEMLRKLTV*
Ga0068855_1012179512 | 3300005563 | Corn Rhizosphere | DARCADAAVLSRLETNPPAGLYGAPPARLKTIVVNPDTRRKLTI*
Ga0066691_103616221 | 3300005586 | Soil | PSVLGRLETNPPAGLYGAPLSRQKTVIKNPEMLRKLTV*
Ga0074472_106464562 | 3300005833 | Sediment (Intertidal) | LNAKFGQTCLALGRTAGDEVNVTFDKSCGDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI*
Ga0074470_114586411 | 3300005836 | Sediment (Intertidal) | ILSRFETNPPAGLYGAPPGRQKAVIRDPETLRRLTV*
Ga0068870_101657453 | 3300005840 | Miscanthus Rhizosphere | VLSRLETNPPAGLYGAPLSRQKAVIKDPEALKKLMI*
Ga0068863_1012611191 | 3300005841 | Switchgrass Rhizosphere | VLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI*
Ga0068860_1003366641 | 3300005843 | Switchgrass Rhizosphere | KACTEPSVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI*
Ga0070717_120065682 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | FDRRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0066696_104726331 | 3300006032 | Soil | VLNRLETNPPAGLYGAPLSRQKTVIKNPETLRKLTI*
Ga0075021_102447082 | 3300006354 | Watersheds | ATSGETVTVTFDKGCADATVLSRFESNPPAGLYGAPLSRQKSVIKDPATLKKLTT*
Ga0074050_118577401 | 3300006577 | Soil | KGCAEASVLSRLETNPPAGLYAAPLPRQKALIKDPETLKRLSI*
Ga0079222_110790882 | 3300006755 | Agricultural Soil | ALDKRCTEASVLNRLETNPPAGLYSAPPARQKTVIRNPDLLRKITI*
Ga0066658_103688262 | 3300006794 | Soil | LSRLETNPPAGLYGAPAARQKTVVKNPETLRKLMI*
Ga0066658_104218832 | 3300006794 | Soil | FDKSCADPSVLGRLETNPPAGLYGAPLSRQKTVIKNPEMLRKLTV*
Ga0075425_1009929171 | 3300006854 | Populus Rhizosphere | GLARNRLETNPPAGLYGAPPARQKAVIKNPEVLRKLVV*
Ga0075435_1003002671 | 3300007076 | Populus Rhizosphere | SILDKLETNPPAGLYGAPPGRQKTVIKNPETLRKLSI*
Ga0105251_106032822 | 3300009011 | Switchgrass Rhizosphere | CFSLGRIAGEEVSVIFAKGCTEPSVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI
Ga0115026_107720761 | 3300009111 | Wetland | ALSRLESNPPAGLYGAPPTRQKSVIRNPQTLSKLGVSA*
Ga0115027_103230251 | 3300009131 | Wetland | KFDPRCADPSVLSRLETNPPAGLYGAPPGRQKSVVKNPDTLRKLTTI*
Ga0105243_102697993 | 3300009148 | Miscanthus Rhizosphere | LARLDTNPPASLYGAPAARQKAVIKNAETLKKLMI*
Ga0105241_109493641 | 3300009174 | Corn Rhizosphere | GVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0105238_107851303 | 3300009551 | Corn Rhizosphere | SRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0105249_131102842 | 3300009553 | Switchgrass Rhizosphere | DAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0074044_109348342 | 3300010343 | Bog Forest Soil | LAGVSGAEVSVVFDKRCAEPSVLNRFETNPPAGLYGAPPGRQKAVLKNPESLRKLSI*
Ga0134125_115592121 | 3300010371 | Terrestrial Soil | DQVSIAFDRRCADASVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0105239_104671794 | 3300010375 | Corn Rhizosphere | CFSVRGIAGDQVSIAFDRRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0134121_127965341 | 3300010401 | Terrestrial Soil | LRARLETNPPAGLYRAPEGRRKAIIKNPEMLRKLV*
Ga0137362_102164811 | 3300012205 | Vadose Zone Soil | KRCAEASVLNRLETNPPAGLYGAPLSRQKTVIKNPEMLRRLTI*
Ga0137377_111774683 | 3300012211 | Vadose Zone Soil | SVLLRLETNPPAGLYGAPAGRQKTVVKNPETLRKLMI*
Ga0137366_108764753 | 3300012354 | Vadose Zone Soil | VTGDPVANKFDKSCADPSVLGRIETNPPAGLYGAPLSSQKTVIKNPEMLRKLTI*
Ga0137384_102056422 | 3300012357 | Vadose Zone Soil | AVSGERVSITFDKRCAEASVLNRLETNPPAGLYGAPLSRQKTVIKNPEMLRKLTI*
Ga0150984_1115766211 | 3300012469 | Avena Fatua Rhizosphere | RVSITFDASCADPSILGRLETNPPAGLYGAPLSRQKTVIKNPEMLRKLTV*
Ga0164303_109409291 | 3300012957 | Soil | RARFETNPPAALYGAPPARQKAIIKNPDTLRKLV*
Ga0164306_101852651 | 3300012988 | Soil | RVSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0157374_125920122 | 3300013296 | Miscanthus Rhizosphere | SVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI*
Ga0163162_105875092 | 3300013306 | Switchgrass Rhizosphere | DGPILSRLDTNLPAGLYSAPPGRQKDVISDPETLKKLTI*
Ga0182008_107864942 | 3300014497 | Rhizosphere | ARCADASVLSRLETNPPAGLYGAPPARLKAIVANPDTRKKLTI*
Ga0132257_1027755073 | 3300015373 | Arabidopsis Rhizosphere | LSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI*
Ga0163161_104522222 | 3300017792 | Switchgrass Rhizosphere | ADAPILSRLDTNLPAGLYSAPPGRQKDVISDPETLKKLTI
Ga0190266_102298611 | 3300017965 | Soil | ADATTRLETLPPAGLYGAPPARQRAVIRSADTLRKLTV
Ga0187777_101165281 | 3300017974 | Tropical Peatland | VLNRLETNPPAGLYGAPPGRQKTVIRNPETLRKITL
Ga0184625_106329201 | 3300018081 | Groundwater Sediment | LNRLETNPPAGLYGAPPGRQKTVIKNPETLRKLSI
Ga0187772_114237501 | 3300018085 | Tropical Peatland | IARFDTNPPAALYEAPPARQKWVLKDPEAVKKLSM
Ga0066669_113680182 | 3300018482 | Grasslands Soil | GTIAGERVAIVFDPRCADPSVLGRLETNPPAGLYGAPPSRQKTVIKNPEMLRKLTV
Ga0210335_12601623 | 3300021346 | Estuarine | VTFDKSCGNPQILSRFETNPPAGLYGAPPGRQKALISDPETLRRLTI
Ga0207671_108117131 | 3300025914 | Corn Rhizosphere | AGDQVSIAFDRRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0207700_102885222 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | VLSRLETNPPAGLYAAPPARQKALIANPEARRKLMI
Ga0207690_102499461 | 3300025932 | Corn Rhizosphere | AGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0207690_113102422 | 3300025932 | Corn Rhizosphere | QVSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0207709_101314854 | 3300025935 | Miscanthus Rhizosphere | LARLDTNPPASLYGAPAARQKAVIKNAETLKKLMI
Ga0207691_101911724 | 3300025940 | Miscanthus Rhizosphere | IAGEEVDVIFAKACTEPSVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI
Ga0207679_112969351 | 3300025945 | Corn Rhizosphere | ARCADAAVLSRLETNPPAGLYGAPPARLKTIVVNPDTRRKLTI
Ga0207651_121435431 | 3300025960 | Switchgrass Rhizosphere | PGGRLETNPPAGLYGAPPARQKSIIKNPDTLRKLV
Ga0207658_121354762 | 3300025986 | Switchgrass Rhizosphere | LSRFDTNPPAGLYGAPAPRQKALIKNPETLKQLSM
Ga0207676_100773546 | 3300026095 | Switchgrass Rhizosphere | EVDVIFAKACTEPSVLSRLETNPPAGLYGAPPGRQKALIKDPETLRRLSI
Ga0207683_111848552 | 3300026121 | Miscanthus Rhizosphere | GVASEQVTVVFDKSCSDASVMSRFETSPPAGLYSAPLPRQKAVIKDPETLRRVSI
Ga0209155_12232182 | 3300026316 | Soil | DAAVLSRFDSNPPAGLYGAPLSRQKSVIRDPATLKKLTT
Ga0209059_11386631 | 3300026527 | Soil | DTSILSRLETNPPAGLYGAPAGRQKTVVKNPETLRKLMI
Ga0209474_104574893 | 3300026550 | Soil | VLNRLETNPPAGLYGAPLSRQKTVIKNPETLRKLTI
Ga0209040_102577551 | 3300027824 | Bog Forest Soil | VLAGASGTEVSVVFDKRCADAAVLNRFETNPPAGLYGAPPGRQKAVLKNPEALRKLAT
Ga0209668_106057193 | 3300027899 | Freshwater Lake Sediment | FDKSCGDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI
Ga0268264_111327693 | 3300028381 | Switchgrass Rhizosphere | DQVSIAFDSRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0302298_102231711 | 3300029980 | Fen | TAGDEVGVVFDARCNDATVLGGLDTNPPAGLYGAPPGRQKAVVRNPETRKKLMI
Ga0311365_112148162 | 3300029989 | Fen | DATVLGGLDTNPPAGLYGAPPGRQKAVVRNPETRKKLMI
Ga0311348_104633522 | 3300030019 | Fen | DPAVLSRLESYPPAGLYGATPGRQKTVTRNPETLRKLTT
Ga0311333_103330951 | 3300030114 | Fen | DEVGVVFDARCNDATVLGGLDTNPPAGLYGAPPGRQKAVVRNPETRKKLMI
Ga0311349_101965143 | 3300030294 | Fen | VLSRLETNPPAALYGAPLSRQKAVIKDPEALKKLMI
Ga0302321_1032727062 | 3300031726 | Fen | RCGDAAILSRLETNPPAGLYGAPLARQRAVIKDPEVLKKLMI
Ga0308175_1022237622 | 3300031938 | Soil | GQECFSVRGIAGDQVSIAFDRRCADAGVLSRLETNPPAGLYGAPPGRQRTVVKDPELLRKLTI
Ga0315278_103756811 | 3300031997 | Sediment | GQTCLALGRTAGDEVNVTFDKSCSDPAILARLETNPPAGLYGAPPDRQKALIKDPETLRRLTI
Ga0315278_104944491 | 3300031997 | Sediment | TGDEVSVIFDKNCGEPTILSRLETNPPAGLYGAPPSRQKALIKDPETLRRLTI
Ga0315278_105184033 | 3300031997 | Sediment | GEAAVLSRLETNPPAGLYGAPPGRQKAVLRDPETLRKLVI
Ga0315274_119037231 | 3300031999 | Sediment | CFSLGPIAGYEVNVVFAPNCTEPAILARLETNPPAGLYGAPTARQKALIKDPETLRRLVI
Ga0308173_110046593 | 3300032074 | Soil | VVSRLETNPPASLYNAPAQRQKAIVKNPDTLKKLLI
Ga0315292_105035553 | 3300032143 | Sediment | EILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI
Ga0315281_114121221 | 3300032163 | Sediment | AVLSRLETNPPAGLYGAPSGRQKAVIRDPETLRRLTI
Ga0315283_103161951 | 3300032164 | Sediment | GRIAGDEVNVNFDKNCGDPTILSRLETNPPAGLYGAPPSRQKALIKDPETLRRLTI
Ga0315268_111021011 | 3300032173 | Sediment | LGNVAGDQVSVTFDQQRCGDAAVLSRFETNPPAGLYGAPPGRKKAVIRDPETLRRLTI
Ga0315268_112993321 | 3300032173 | Sediment | AVLSRLETNPPAALYGAPLSRQKAVIKDPEALKKLMI
Ga0307471_1017430832 | 3300032180 | Hardwood Forest Soil | NSAVLGRLESNPPAGLYGAPPQRQKSVIKDPGTLRKLST
Ga0315270_111404392 | 3300032275 | Sediment | AVVFDKQRCGDAAVLSRLETNPPAGLYGAPPGRQKAVIRDPEMLRRLTI
Ga0315287_112094293 | 3300032397 | Sediment | TFDKSCSDPAILARLETNPPAGLYGAPPDRQKALIKDPETLRRLTI
Ga0315275_115633371 | 3300032401 | Sediment | NVTFDKSCSDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI
Ga0316603_100495521 | 3300033413 | Soil | VLSRLESNPPAGLYGASPTRQKSVIRNPQTLSKLGVSA
Ga0316625_1026186922 | 3300033418 | Soil | DAGVLSRLESNPPAGLYGASPTRQKSVIRNPQTLSKLGVSA
Ga0316629_116822211 | 3300033483 | Soil | EVNVTFDKSCSDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI
Ga0316624_107120751 | 3300033486 | Soil | DGLVRARLESNPPAGLYGAPPGRQKAVIKNPEMLRKLVV
Ga0316628_1000502228 | 3300033513 | Soil | AFDARCADGLARGRLESNPPAGLYGAPPGRQKAVIKNPELLRKLVV
Ga0316616_1005995201 | 3300033521 | Soil | SRLETNPPAALYGAPPARQKSVIRNPETLSKLGVTA
Ga0370509_0328909_1_111 | 3300034159 | Untreated Peat Soil | MGRFETAPPAGLYGAPPARQKSVIKNPETLKRITTI
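
Rows copied from this page as plain text can arrive with the four columns run together (protein ID, sample taxon ID, habitat, sequence, with no separators). The sketch below shows one way to split such a raw row. It rests on assumptions drawn only from the rows of this family, not from any documented NMPFamsDB format: protein IDs start with "Ga", sample taxon IDs are 10 digits beginning with "3300", habitat names end in a lowercase letter or ")", and sequences are uppercase letters with an optional trailing "*" (whose meaning the page does not state). The names `ROW` and `parse_row` are hypothetical.

```python
import re

# Pattern built from the assumptions stated above; it is not an official format.
ROW = re.compile(
    r"^(?P<protein_id>Ga\d+_[\d_]+)"         # e.g. Ga0055437_102502901
    r"(?P<taxon_id>3300\d{6})"               # 10-digit sample taxon ID
    r"(?P<habitat>[A-Z][A-Za-z ,()]*[a-z)])"  # habitat ends in lowercase or ')'
    r"(?P<sequence>[A-Z]+)"                  # amino-acid letters
    r"(?P<asterisk>\*?)$"                    # optional trailing '*'
)


def parse_row(line):
    """Split one fused table row into its four fields."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line!r}")
    fields = match.groupdict()
    # Keep the '*' as a flag instead of leaving it inside the sequence.
    fields["ends_with_asterisk"] = fields.pop("asterisk") == "*"
    return fields


raw = ("Ga0074472_1064645623300005833Sediment (Intertidal)"
       "LNAKFGQTCLALGRTAGDEVNVTFDKSCGDPAILARLETNPPAGLYGAPPGRQKALIKDPETLRRLTI*")
row = parse_row(raw)
print(row["protein_id"], row["taxon_id"], row["habitat"], len(row["sequence"]))
```

The greedy habitat group backtracks to the last lowercase letter or ")" in the row, which is what separates a habitat such as "Soil" from the uppercase sequence that follows it. A row whose habitat ended in an uppercase letter would break this assumption and should be checked by hand.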




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice