NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F096083


Overview

Basic Information
Family ID: F096083
Family Type: Metagenome
Number of Sequences: 105
Average Sequence Length: 43 residues
Representative Sequence: LEVTRIEPGAYDEKVYGPGIGIVREQALTGTSEFAQLVSVTG
Number of Associated Samples: 88
Number of Associated Scaffolds: 105

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 4.95 %
% of genes near scaffold ends (potentially truncated): 88.57 %
% of genes from short scaffolds (< 2000 bp): 91.43 %
Associated GOLD sequencing projects: 83
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.57

Hidden Markov Model: sequence logo (rendered with Skylign)

Most Common Taxonomy
Group: Bacteria (77.143 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere (16.191 % of family members)
Environment Ontology (ENVO): Unclassified (24.762 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (44.762 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive MSAViewer display; the 105 row labels are the Protein IDs listed under Family Sequences).



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 0.00%, β-sheet 34.29%, coil/unstructured 65.71%
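The distribution above is the share of residues assigned to each secondary-structure state. A minimal Python sketch of that calculation follows, assuming a per-residue state string that uses the common three-state codes H (α-helix), E (β-strand) and C (coil). The function name and the example string are hypothetical; the actual per-residue assignment for this family is not reproduced on this page.

```python
from collections import Counter


def ss_distribution(states: str) -> dict[str, float]:
    """Return the percentage of residues in each three-state class (H, E, C)."""
    if not states:
        raise ValueError("empty secondary-structure string")
    counts = Counter(states.upper())
    total = len(states)
    return {code: round(100 * counts.get(code, 0) / total, 2) for code in "HEC"}


# Hypothetical 35-residue assignment: 12 strand residues and 23 coil residues,
# chosen only so that the result matches the percentages quoted above.
example = "C" * 5 + "E" * 6 + "C" * 8 + "E" * 6 + "C" * 10
print(ss_distribution(example))
```

With this illustrative input the helix share is 0.0, the strand share 34.29 and the coil share 65.71.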

Predicted 3D Structure

pTM-score: 0.57

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
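The cutoff quoted above can be expressed as a small check. This is only a sketch: the 0.7 threshold comes from the note above, while the function name is hypothetical and the page does not say how models at or above the cutoff are labeled.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above (pTM < 0.7 is low confidence)


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM-score is expected to lie between 0 and 1")
    return ptm < threshold


print(is_low_confidence(0.57))  # pTM-score reported for family F096083
```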



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic distribution of family members:
All Organisms: 77.1%
Unclassified: 22.9%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in chart order, with the share of family members where the chart prints a value (values matched to labels using the habitat counts under Family Sequences; repeated names are separate chart entries):
Groundwater Sediment: 2.9%
Groundwater Sediment: 2.9%
Groundwater Sediment
Soil: 8.6%
Vadose Zone Soil: 12.4%
Terrestrial Soil
Tropical Forest Soil: 5.7%
Grasslands Soil: 4.8%
Agricultural Soil
Soil: 13.3%
Grasslands Soil: 6.7%
Grass Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil: 4.8%
Corn, Switchgrass And Miscanthus Rhizosphere: 8.6%
Corn Rhizosphere
Arabidopsis Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere: 16.2%
Miscanthus Rhizosphere



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
E1_03596590 | 2170459002 | Grass Soil | TSLEATRIEPGLYDKKVYAAGIGIVLEVAVTGPTEIAQLVSMSG
JGI10216J12902_1263765461 | 3300000956 | Soil | MLEATRIEPGAYDQKVYAPGIGIVQEKALTGTPEFAELVSVSG*
F14TB_1005402901 | 3300001431 | Soil | LESTRIEPGAYDEKVYGPGIGIVSEHALTGDNEVAQLVSVSG*
Ga0066683_106700801 | 3300005172 | Soil | LRNTLTTLEATRVEPGLYDQKIYAPGIGMVLEQALTGPTEFAKLVSVTGP*
Ga0066690_105673672 | 3300005177 | Soil | TSLEATRIEPGAYDRKIYAPGIGMVREEALTGAPENAELVSVSG*
Ga0066685_100773955 | 3300005180 | Soil | LRNTLTTLEATRVEPGLYDQKIYAPGIGLVLEQALTGPTEFAKLVSVTGP*
Ga0066678_104940002 | 3300005181 | Soil | ATRLEPGAYDQKVYAPGIGIAREQALTGAPEFAELVSVTG*
Ga0066675_111698452 | 3300005187 | Soil | LEATRLEPGAYDQKVYAPGIGIAREQALTGASEFAELVSVTG*
Ga0070713_1019888371 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | NALTTLEATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVTG*
Ga0070710_102175961 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | TRLEPGAYDQKVYAPGLGIVREQALTGAPEFAELVSVTG*
Ga0070705_1015708511 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | NVLTSLEATRVEPGSYDKKVYGPGIGIVREQALTGTSEFAQLVSVSG*
Ga0066682_102768851 | 3300005450 | Soil | LRNTLTTLEATRIEPGLYDQKIYAPGVGLVVEQALTGPTEFAKLVRVTGP*
Ga0070707_1001128743 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VPYGTLKNVLTTLEATRLVPGAYDQKIYAPGIGIVREQALTGAQELAELVSISG*
Ga0070696_1001702993 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | ALTTLEATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVTG*
Ga0070704_1000197526 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | TRVEAGAYDEKVYGPGIGIVSERSLTGPAEVAQLVSVSG*
Ga0066661_102469091 | 3300005554 | Soil | VEPGIYDQKIYGPGLGIVVEQSLTGPTEYAKLVSVSG*
Ga0066670_109747531 | 3300005560 | Soil | FRHILVSLEFTRLEPRVIDQKIYAPGIGIVQEKALTGAPEFAELVSVSG*
Ga0066699_109930491 | 3300005561 | Soil | EPGAFDKKVYGPGIGIVLERALSGPPEVARLVSVQG*
Ga0066705_105501772 | 3300005569 | Soil | LEATRIEPGAYDRKVYAPGVGMIREEALTGATEYAELVGVSG*
Ga0066905_1006920843 | 3300005713 | Tropical Forest Soil | EPGAYDEKVYGPGIGIVSERSLTGPMEFAQLVSMNG*
Ga0066903_1010879392 | 3300005764 | Tropical Forest Soil | LTTLEATQIEAGAYDEKIYGPGIGIVSERSLTGSNEVAQLVSVSP*
Ga0066903_1036685392 | 3300005764 | Tropical Forest Soil | VPDGTFKNVLTSLEATRIEPGAYDRKVYAPGIGMIREEALTGSPEFAELVSVGG*
Ga0066903_1068123612 | 3300005764 | Tropical Forest Soil | TLEATRIEPGAYDRKVYAPGIGIVREQALTGAPETAELVSVTG*
Ga0066903_1085526992 | 3300005764 | Tropical Forest Soil | TLESTRLEPGAYDEKIYGPGIGIVSERSLTGPNELAQLVSVSG*
Ga0066696_110665152 | 3300006032 | Soil | LTSLEATRIEPGAYDRKVYAPGVGMIREEALTGATEYAELVGVSG*
Ga0075432_103381752 | 3300006058 | Populus Rhizosphere | LKNALTTLEATRLEPGAYDQKVYAAGLGIVREQALSGAPEFAELVSVTG*
Ga0070715_109119292 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | NALTTLEATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVIG*
Ga0075427_100199702 | 3300006194 | Populus Rhizosphere | HHVLTSLESTRVEPGAYDEKIYGLGIGIVRERALTGTSEFAQLVSVSG*
Ga0079221_102724291 | 3300006804 | Agricultural Soil | ATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVTG*
Ga0075431_1011149302 | 3300006847 | Populus Rhizosphere | VRNVLTTLEATRVEAGAYDEKVYGPGIGIVSERSLTGPAEFAQLVSVTG*
Ga0075431_1019087551 | 3300006847 | Populus Rhizosphere | VRNVLTTLEATRVEAGAYDEKVYGPGIGIVSERSLTGPAEVAQLVSVTG*
Ga0075433_107530442 | 3300006852 | Populus Rhizosphere | KVRNVLTTLESTRLEPGGYDEKIYAPGIGIVSERSLTGNEVAQLVSVSG*
Ga0075433_107992453 | 3300006852 | Populus Rhizosphere | LEATRIEPGAYDEKTYGPGIGIVSERSLTGPNEFAQLVSVSG*
Ga0075433_117098381 | 3300006852 | Populus Rhizosphere | LTTLESTQLEPGAYDEKVYGPGIGIVSERSLTGPNELAQLVSVSR*
Ga0075425_1020861142 | 3300006854 | Populus Rhizosphere | EATRLEPGAYDQKVYGPGIGIVSEQSLTGPNEYATLVSVTG*
Ga0075424_1009900991 | 3300006904 | Populus Rhizosphere | LTTLESTRVEPGAYDEKIYGPGIGIVRERALTGTSEFAQLVSVSG*
Ga0075436_1004774482 | 3300006914 | Populus Rhizosphere | RIEPKVVDQKVYAPGIGIVFERALSGPPEIAKLVSVTG*
Ga0099791_105879001 | 3300007255 | Vadose Zone Soil | LEATRLEPGAYDQKAYAPGIGIVLEQSLTGANEFARLESVSGP*
Ga0066710_1012395192 | 3300009012 | Grasslands Soil | VRGVLTTLEATRVEPGAYDQKIYGRGIGIVVEHALTGEPEIAKLVSMTG
Ga0111539_105284504 | 3300009094 | Populus Rhizosphere | RIEPGAYDQKVYAPGIGIVSEQSLTGPNEFAQLVSVSG*
Ga0066709_1011794491 | 3300009137 | Grasslands Soil | TSLEFARIEPGVVDQKIYGPGIGIVSETALTGPQEIAKLVSMTG*
Ga0066709_1014199101 | 3300009137 | Grasslands Soil | TLEATQVEPGIYDQKIYGPGIVIVVEQSLTGPNEYAKLVSVTG*
Ga0066709_1015594641 | 3300009137 | Grasslands Soil | RLEPGAYDQKVYAPGIGIAREQALIGAPEFAELVSVTG*
Ga0114129_113294082 | 3300009147 | Populus Rhizosphere | EPGAYDQKVYGPGLGIVLEQALSGPPEVAKLVSISGP*
Ga0114129_120611592 | 3300009147 | Populus Rhizosphere | AGAYDEKVYGPGIGIVSERSLTGPAEVAQLVSVTG*
Ga0111538_117392531 | 3300009156 | Populus Rhizosphere | LEATAVEPGAYDQKVYGPGLGIVLEQALSGPPEVAKLVSVSGP*
Ga0075423_103334971 | 3300009162 | Populus Rhizosphere | EPGAYDEKIYAPGIGIVSERSLTGSNEFAQLVSASN*
Ga0075423_119133222 | 3300009162 | Populus Rhizosphere | GAYDEKVYGPGIGIVSERSLTGPAEVAQLVSVTG*
Ga0075423_127028651 | 3300009162 | Populus Rhizosphere | EPGSYDLKIYAPGIGIVLEQSLSGPNETAKLVSITGP*
Ga0126374_109850331 | 3300009792 | Tropical Forest Soil | EPGAYDEKVYGPGIGIVTEKALTGPAEFAQLVSVSP*
Ga0126380_108981062 | 3300010043 | Tropical Forest Soil | LRNVLTTLEATQLEPGSYDEKVYGLGIGIVSERSLTGPSEVAQLVSVSG*
Ga0126384_102971661 | 3300010046 | Tropical Forest Soil | KVRNVLTTLEATRIEPGAYDEKIYGPGIGIVSERSLTGSNEVAQLVSVSQ*
Ga0134088_105456121 | 3300010304 | Grasslands Soil | LYDQKIYARGVGIVLEQSLTGPTEIAKLVSVSGP*
Ga0134062_102798573 | 3300010337 | Grasslands Soil | TLEATRLEPGAYDQKVYAPGIGIAREQALTGTPEFAELVSVNG*
Ga0134062_102962651 | 3300010337 | Grasslands Soil | RLKNVLTTLEATQLEPGAYDQKVYAPGLGIVREQALTGAAEFAELVSVSG*
Ga0126372_123564741 | 3300010360 | Tropical Forest Soil | TLESTQLEAGAYDEKVYGPGIGIVSERALTGEPETAQLVSVSG*
Ga0126372_127899891 | 3300010360 | Tropical Forest Soil | TLESTRLEPGAYDEKVYGPGIGIVSERALTGPTETAQLVSISG*
Ga0126377_126683872 | 3300010362 | Tropical Forest Soil | ESTRLEPGAYDEKVYGPGIGIVSERSLTGPTEYALLVSVSG*
Ga0134128_103472353 | 3300010373 | Terrestrial Soil | EATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVTG*
Ga0134122_108487921 | 3300010400 | Terrestrial Soil | TRIEPGAYDEKVYGTGIGIVSERALTGDTEVAQLVSVSG*
Ga0105246_102749951 | 3300011119 | Miscanthus Rhizosphere | GAYDEKVYGPGIGIVSEHALTGDNEVAKLVSVSG*
Ga0137364_100946501 | 3300012198 | Vadose Zone Soil | VEPGIYDQKIYGPGIGIVVEQSLTGPNEYAKLVSVTG*
Ga0137381_104234821 | 3300012207 | Vadose Zone Soil | LEAARIEPGHYDLKIYAPGIGIVLEQALTGPTETAQLVSVTGP*
Ga0137376_100234811 | 3300012208 | Vadose Zone Soil | GAYDHKVYAPGVGMIREEALTGATEYAELVSVSG*
Ga0137376_107746342 | 3300012208 | Vadose Zone Soil | EAGAYDQKVYAPGIGIVREQALTGAPEFAELVSVSG*
Ga0137372_102789561 | 3300012350 | Vadose Zone Soil | RLEPGAYDQKVYAPGIGIAREQALTGEPEFAELVSVTG*
Ga0137369_108727053 | 3300012355 | Vadose Zone Soil | NVLTTLEATQIEPGAYDQKVYGPGIGIVSEQSLTGPNEVAQLVSVTG*
Ga0137359_113196421 | 3300012923 | Vadose Zone Soil | RNALTTLEATQVEPGSYDRKVYAPGIGIALEEAITGTPERAELVSVTGP*
Ga0137404_107458581 | 3300012929 | Vadose Zone Soil | PGAYDEKIYGPGIGIVIERSLTGPSEYAQLVSVSG*
Ga0137410_110815102 | 3300012944 | Vadose Zone Soil | LEATRIEPGAYDQKIYAPGIGIVLEQSLTGPTEIAQLVSVTGP*
Ga0137410_116951841 | 3300012944 | Vadose Zone Soil | LVTLEATRIEPGAYDQKIYAPGIGIVLEQALTGGPETAVLVSISGP*
Ga0164298_108332751 | 3300012955 | Soil | TRVEPGSYDEKVYGPGIGIVREQALTGTSEFAQLVSVSG*
Ga0163162_129597012 | 3300013306 | Switchgrass Rhizosphere | IEPGAYDEKVYGPGIGIVSEHALTGDHEVARLVSVSG*
Ga0137409_109476773 | 3300015245 | Vadose Zone Soil | TQVEPGSYDRKVYAPGIGIVLELALTGTPESAQLVSVTGP*
Ga0134072_101497682 | 3300015357 | Grasslands Soil | KLKNVLTTLEATQLEPGAYDQKVYAPGIGIVREQALTGAAEFAELVSVSG*
Ga0134085_103576002 | 3300015359 | Grasslands Soil | RLEPGAYDLKVYAPGIGIAREQALTGTPEFAELVSVTG*
Ga0132257_1006760271 | 3300015373 | Arabidopsis Rhizosphere | HVLTTLESTRIEPGGYDEKVYGPGIGIVSEHALTGDHEVARLVSVSG*
Ga0184638_12958521 | 3300018052 | Groundwater Sediment | TTLEATRIEAGAYDEKVYGPGIGIVSEQSLTGPNEFAQLVSVSG
Ga0184618_103960711 | 3300018071 | Groundwater Sediment | IEPGAYDEKVYGPGIGIVSERALTGTSEVAQLVSVTG
Ga0184625_101463221 | 3300018081 | Groundwater Sediment | VRDVLSTLEATRIEPGAYDEKVYGPGIGIVREQALTGTSEFAQLVSVTG
Ga0066662_113456261 | 3300018468 | Grasslands Soil | VEPGSYDEKVYSPGIGIVREQALTGTSEFAQLVSVTG
Ga0190270_112017323 | 3300018469 | Soil | IEPGAFDEKVYGPGIGIVSERALSGDNEVAQLVSVSA
Ga0066669_119301402 | 3300018482 | Grasslands Soil | VEMRDGKVYAPGVGIVREQALTGETEVAELVSVTG
Ga0210380_105918882 | 3300021082 | Groundwater Sediment | VQHVLTTLESTRIEPGAYDEKVYGPGIGIVSEHALTGDNEVAKLVSVSR
Ga0193699_102982333 | 3300021363 | Soil | LEATQLEPGAYDEKVYGLGIGIVTEKSLAGPNEFAELVSVSG
Ga0222623_102784432 | 3300022694 | Groundwater Sediment | VRSVLTTLEATQIEPGAYDEKVYGPGVGIVSEQSLTGPNEFAQLVSVTG
Ga0222623_103352152 | 3300022694 | Groundwater Sediment | VQHVLSTLESTRIEPGAYDEKVYGPGIGIVSERALTGTSEVAQLVSVTG
Ga0222622_100364261 | 3300022756 | Groundwater Sediment | LEVTRIEPGAYDEKVYGPGIGIVREQALTGTSEFAQLVSVTG
Ga0207652_105474972 | 3300025921 | Corn Rhizosphere | GKVRNVLTSLEATRVEPGSYDEKVYGPGIGIVRERALTGTSEFAQLVSVTG
Ga0207646_100633112 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | VPYGTLKNVLTTLEATRLVPGAYDQKIYAPGIGIVREQALTGAQELAELVSISG
Ga0207646_106907582 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | EATRVEPGSYDEKVYGPGIGIVREQALTGTSEFAQLVSVSG
Ga0207709_112703811 | 3300025935 | Miscanthus Rhizosphere | LTSLEATRVEPGSYDEKVYGPGIGIVRERALTGTSEFAQLVSVTG
Ga0207641_123235741 | 3300026088 | Switchgrass Rhizosphere | TIHHKLSTLEATRIEPGAYDLKVYGPGLGIVLERALSGPPEVARLVRVIEP
Ga0209239_11741272 | 3300026310 | Grasslands Soil | TRLEPGAYDLKVYAPGIGIAREQALSGTPEFAELVSVTG
Ga0209154_13197372 | 3300026317 | Soil | IEPGAYDRKIYAPGIGMVREEALTGAPENAELVSVSG
Ga0209804_12849902 | 3300026335 | Soil | TSLEATRIEPGAYDRKIYAPGIGMVREEALTGAPENAELVSVSG
Ga0268265_114678442 | 3300028380 | Switchgrass Rhizosphere | STRVEPGAYDEKIYGLGIGIVRERALTGTSEFAQLVSVSG
Ga0137415_103530522 | 3300028536 | Vadose Zone Soil | VRFRTREIEPGLYDLKIYAPGIGIVLEQSLTGPTEIARLVSVTGP
Ga0307295_101727831 | 3300028708 | Soil | LEATRLEPGAYDQKIYAPGIGIVREQALTGAPEVAELVSING
Ga0307282_105504972 | 3300028784 | Soil | GRVRNVLTTLEATRLEPGAYDQKVYGPGIGIVLEQSLTGDPEVAKLESVSGP
Ga0307305_101293191 | 3300028807 | Soil | TVRNVLTTLEATQVEPDVYDQKIYGPGIGIVVEQSLTGPNEYAKLVSVTG
Ga0307312_105397512 | 3300028828 | Soil | TRLEAGAYDQKVYAPGIGIVREQALTGTPEFAELVSVSG
Ga0310813_123546162 | 3300031716 | Soil | TTLESTRIEPGAYDEKVYGPGIGIVSEHALTGDNEVARLVSVSG
Ga0307471_1038240711 | 3300032180 | Hardwood Forest Soil | IEAGAYDEKIYGPGIGIVSERSLTGPAEFAQLVSVSG
Ga0310810_108484701 | 3300033412 | Soil | TRIEPGAYDQKIYAPGIGIALEKSLTGPTEIAQLVSVTGP
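The rows above give one protein per line with its sample taxon ID, habitat and sequence. As a rough sketch of how such rows could be summarized, the Python below tallies the share of family members per habitat and strips a trailing `*` from sequences. It assumes the rows have already been split into four fields and that `*` marks a translation stop, as is conventional in protein sequence output. The function names are hypothetical, and the four example rows are copied from the listing above.

```python
from collections import Counter

Row = tuple[str, str, str, str]  # (protein_id, sample_taxon_id, habitat, sequence)


def clean_sequence(seq: str) -> str:
    """Drop a trailing '*' (assumed here to be a stop marker)."""
    return seq.rstrip("*")


def habitat_shares(rows: list[Row]) -> dict[str, float]:
    """Percentage of rows per habitat, most frequent habitat first."""
    counts = Counter(habitat for _, _, habitat, _ in rows)
    total = sum(counts.values())
    return {h: round(100 * n / total, 2) for h, n in counts.most_common()}


rows: list[Row] = [
    ("F14TB_1005402901", "3300001431", "Soil",
     "LESTRIEPGAYDEKVYGPGIGIVSEHALTGDNEVAQLVSVSG*"),
    ("Ga0079221_102724291", "3300006804", "Agricultural Soil",
     "ATRLEPGAYDQKVYAPGLGIVREQALSGAPEFAELVSVTG*"),
    ("Ga0164298_108332751", "3300012955", "Soil",
     "TRVEPGSYDEKVYGPGIGIVREQALTGTSEFAQLVSVSG*"),
    ("Ga0222622_100364261", "3300022756", "Groundwater Sediment",
     "LEVTRIEPGAYDEKVYGPGIGIVREQALTGTSEFAQLVSVTG"),
]

print(habitat_shares(rows))
print([len(clean_sequence(seq)) for *_, seq in rows])
```

Run over all 105 rows, the same tally would give the per-habitat shares for the whole family; the four-row sample here only demonstrates the mechanics.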




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice