NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F097442

Metagenome Family F097442

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F097442
Family Type Metagenome
Number of Sequences 104
Average Sequence Length 46 residues
Representative Sequence DFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE
Number of Associated Samples 99
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.96 %
% of genes near scaffold ends (potentially truncated) 97.12 %
% of genes from short scaffolds (< 2000 bps) 98.08 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(17.308 % of family members)
Environment Ontology (ENVO) Unclassified
(34.615 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(44.231 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58
1JGI10216J12902_1016922801
2A2065W1_116925261
3A2135W6_11025292
4JGI24743J22301_100792702
5soilL2_100148124
6JGI25321J50212_101288101
7Ga0062589_1019369201
8Ga0062590_1021683182
9Ga0066671_101290882
10Ga0066671_104423951
11Ga0066671_108624202
12Ga0066676_107622951
13Ga0070689_1002745101
14Ga0070692_108373671
15Ga0070674_1004079292
16Ga0066682_106372401
17Ga0070678_1008054442
18Ga0070698_1003499692
19Ga0070699_1014620061
20Ga0070697_1003732201
21Ga0070730_107394522
22Ga0070696_1011930992
23Ga0070704_1004917792
24Ga0070704_1013270052
25Ga0066701_102378911
26Ga0066661_108613332
27Ga0066707_106786302
28Ga0066700_103265611
29Ga0066699_102982201
30Ga0066703_104351881
31Ga0066703_108610961
32Ga0068857_1018711631
33Ga0066654_101747802
34Ga0068864_1013456872
35Ga0068870_103031782
36Ga0075279_101144792
37Ga0070717_107022861
38Ga0066656_102557772
39Ga0068871_1003039361
40Ga0066660_107983531
41Ga0075434_1020122691
42Ga0075436_1006901471
43Ga0079219_113005331
44Ga0079218_105438161
45Ga0099828_116347681
46Ga0105245_132961771
47Ga0066709_1015281391
48Ga0114129_116379481
49Ga0075423_124439251
50Ga0105347_14366152
51Ga0126309_108231082
52Ga0134084_104350451
53Ga0134063_104838481
54Ga0134127_129011091
55Ga0134122_120219352
56Ga0134121_105775743
57Ga0137433_11386522
58Ga0120134_10771852
59Ga0137380_108497121
60Ga0137381_107724852
61Ga0137376_117417132
62Ga0137367_107795251
63Ga0137407_112477421
64Ga0134110_101384012
65Ga0157378_131873102
66Ga0120109_10307681
67Ga0157377_112396461
68Ga0120170_10226371
69Ga0157376_124619012
70Ga0137412_112157242
71Ga0134085_101460942
72Ga0134085_104886801
73Ga0132257_1036335242
74Ga0187821_100471751
75Ga0066667_122990062
76Ga0190268_108294021
77Ga0190270_107170281
78Ga0193609_10778872
79Ga0190273_103404041
80Ga0193722_11345272
81Ga0179590_10861221
82Ga0193699_104156752
83Ga0213874_101064232
84Ga0224512_103134091
85Ga0247677_10167212
86Ga0209640_106132571
87Ga0207684_105922532
88Ga0207681_114853592
89Ga0207687_114905701
90Ga0207669_104356371
91Ga0207704_100528342
92Ga0207711_120389212
93Ga0207689_114879662
94Ga0207676_113549791
95Ga0207683_113494802
96Ga0209807_11601973
97Ga0209577_101408021
98Ga0137415_106158932
99Ga0307405_107584181
100Ga0326597_112555232
101Ga0307416_1025801122
102Ga0334961_034140_741_872
103Ga0372943_0930706_409_576
104Ga0372946_0225099_2_136
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 46.67%    β-sheet: 8.89%    Coil/Unstructured: 44.44%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530354045DFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTESequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Serpentine Soil
Grasslands Soil
Surface Soil
Soil
Agricultural Soil
Sugarcane Root And Bulk Soil
Permafrost
Soil
Grasslands Soil
Sub-Biocrust Soil
Soil
Rice Paddy Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Soil
Soil
Deep Subsurface
Arabidopsis Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Miscanthus Rhizosphere
Miscanthus Rhizosphere
Plant Roots
Populus Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
5.8%8.7%2.9%4.8%4.8%17.3%2.9%8.7%2.9%3.8%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_10169228013300000956SoilMVLEENFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGSRYVFTN*
A2065W1_1169252613300001537PermafrostRNYDEILGLFENLRALETQLVGIFVSQETSATRYVFTD*
A2135W6_110252923300001566PermafrostNYDELLGMFENLRSLESQLLGIFVSQETSATRYVFTK*
JGI24743J22301_1007927023300001991Corn, Switchgrass And Miscanthus RhizosphereFRDFKEFDQRNYDEILGLFENLRALETQLVGIFVPQETSATRYVFTE*
soilL2_1001481243300003319Sugarcane Root And Bulk SoilFDQRNYDEILGLFENLRALETQLVGIFVSQETSATRYVFTD*
JGI25321J50212_1012881013300003366Deep SubsurfaceFKEFDQRNYDEILGLFENLRALESQIVGIFVAQEMSGTRYVFTD*
Ga0062589_10193692013300004156SoilVLEENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0062590_10216831823300004157SoilENFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGTRYVFTN*
Ga0066671_1012908823300005184SoilFKEFDQRNYDELLVLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0066671_1044239513300005184SoilNFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVPQETSATRYVFAN*
Ga0066671_1086242023300005184SoilLEDNFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0066676_1076229513300005186SoilNFRDFKEFDQRNYDEILGLFENLRSLESQLVGIFVSQETSATRYVFTE*
Ga0070689_10027451013300005340Switchgrass RhizosphereENFRDFKEFDQRNYDELLGLFENLRSMETQLVGIFVSQETQASRYVFTQ*
Ga0070692_1083736713300005345Corn, Switchgrass And Miscanthus RhizosphereFDQRNYDELLGLFENLRSLEIQLVGIFVPQETSASRYIFTE*
Ga0070674_10040792923300005356Miscanthus RhizosphereDFKEFDQRNYDEILGLFENLRALETQLVGIFVAQETSATRYVFTE*
Ga0066682_1063724013300005450SoilRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0070678_10080544423300005456Miscanthus RhizosphereEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0070698_10034996923300005471Corn, Switchgrass And Miscanthus RhizosphereQRNYDEILGLFENLRALESQLVGIFVSSETTAARYVFTE*
Ga0070699_10146200613300005518Corn, Switchgrass And Miscanthus RhizosphereFKEFDQRNYDEILGLFENLRALEQQLVGIFVSQETSATRYVFTD*
Ga0070697_10037322013300005536Corn, Switchgrass And Miscanthus RhizosphereEKVLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0070730_1073945223300005537Surface SoilFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQEMSATRYVFTE*
Ga0070696_10119309923300005546Corn, Switchgrass And Miscanthus RhizosphereNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVSQETSAARYVFTE*
Ga0070704_10049177923300005549Corn, Switchgrass And Miscanthus RhizosphereFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0070704_10132700523300005549Corn, Switchgrass And Miscanthus RhizosphereEENFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGTRYVFTN*
Ga0066701_1023789113300005552SoilEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0066661_1086133323300005554SoilDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE*
Ga0066707_1067863023300005556SoilENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0066700_1032656113300005559SoilNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0066699_1029822013300005561SoilDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0066703_1043518813300005568SoilRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSAARYVFTE*
Ga0066703_1086109613300005568SoilFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0068857_10187116313300005577Corn RhizosphereNFRDFKEFDQRNYDEILGLFENLRAMETQLVGLFVPQETSATRYVFTE*
Ga0066654_1017478023300005587SoilFDQRNYDELLGLFENLRSLETQLVGMFVPQETSATRYVFTE*
Ga0068864_10134568723300005618Switchgrass RhizosphereRDFKEFDQRNYDEILGLFENLRALEGQLVGIFVSSETSAARYVFTE*
Ga0068870_1030317823300005840Miscanthus RhizosphereKEFDQRNYDEILGLFENLRAMETQLVGIFVPQETSATRYVFTE*
Ga0075279_1011447923300005903Rice Paddy SoilFRDFKEFDQRDYDEILGLFENLRSLETQLLGIFVSQKTTATRYLFTE*
Ga0070717_1070228613300006028Corn, Switchgrass And Miscanthus RhizosphereFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0066656_1025577723300006034SoilGEKVIEDNFRDFKEFDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE*
Ga0068871_10030393613300006358Miscanthus RhizosphereFKEFDQRNYDEILGLFENLRSLESQLVGIFVPQETTASRYVFTE*
Ga0066660_1079835313300006800SoilKEFDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE*
Ga0075434_10201226913300006871Populus RhizosphereEKVLEENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0075436_10069014713300006914Populus RhizosphereDNFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYVFTE*
Ga0079219_1130053313300006954Agricultural SoilDFKEFDQRNYDELLGLFENLRSLETQLVGMFVPQETSATRYVFTE*
Ga0079218_1054381613300007004Agricultural SoilDEILGLFENLRALESQLVGIFVAQETSGSRYVFTN*
Ga0099828_1163476813300009089Vadose Zone SoilDELLGLFENLRSLETQLIGIFVPQETSASRYIFTE*
Ga0105245_1329617713300009098Miscanthus RhizosphereTGEKVLEENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0066709_10152813913300009137Grasslands SoilEKVLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSAARYVFTE*
Ga0114129_1163794813300009147Populus RhizosphereDEILGLFENLRSLETQLVGIFVPQETSAARYVFTE*
Ga0075423_1244392513300009162Populus RhizosphereLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSAARYVFTE*
Ga0105347_143661523300009609SoilEENFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGTRYVFTD*
Ga0126309_1082310823300010039Serpentine SoilQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE*
Ga0134084_1043504513300010322Grasslands SoilDQRNYDEILGLFENLRAMETQLVGIFVPQETSATRYVFTE*
Ga0134063_1048384813300010335Grasslands SoilQRQYDEILGLFENLRSLETQLVGIFVPQETSGSRYVFTE*
Ga0134127_1290110913300010399Terrestrial SoilLEDNFRDFKEFDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE*
Ga0134122_1202193523300010400Terrestrial SoilDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0134121_1057757433300010401Terrestrial SoilRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0137433_113865223300011440SoilFKEFDQRNYDEILGLFENLRALEQQLVGIFVAQETSATRYVFTD*
Ga0120134_107718523300012004PermafrostGEKVLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0137380_1084971213300012206Vadose Zone SoilVLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0137381_1077248523300012207Vadose Zone SoilLEDNFRDFKEFDQRNYDEILGLFENLRALETQLVGIFVPQETSAARYVFTE*
Ga0137376_1174171323300012208Vadose Zone SoilDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSAARYVFTE*
Ga0137367_1077952513300012353Vadose Zone SoilEILGLFENLRSLEQQLVGIFVSQETSATRYVFTD*
Ga0137407_1124774213300012930Vadose Zone SoilEFDQRNYDEILGLFENLRSLETQLVGIFVSQESSAMRYVFTES*
Ga0134110_1013840123300012975Grasslands SoilYDEILGLFENLRSLETQLVGIFVPQETSGSRYVFTE*
Ga0157378_1318731023300013297Miscanthus RhizosphereFRDFKEFDQRNYDEILGLFENLRSLEAQLVGIFVSQETSASRYVFTE*
Ga0120109_103076813300014052PermafrostKVLEDNFRDFKAFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE*
Ga0157377_1123964613300014745Miscanthus RhizosphereDFKEFDQRNYDEILGLFENLRALEGQLVGIFVSSETSAQRYMFTE*
Ga0120170_102263713300014823PermafrostDKVLEENFRDFKEFDQRNYDEILGLFENLRSLEAQLLGIFVSQETSASRYVFTE*
Ga0157376_1246190123300014969Miscanthus RhizosphereDQRNYDEILGLFENLRSMEAQLVGIFVSQETSASRYVFTE*
Ga0137412_1121572423300015242Vadose Zone SoilGEKVLEDNFRDFKEFDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE*
Ga0134085_1014609423300015359Grasslands SoilVEENFRDFKEFDQRNYDEILGLFENLRSLEQQLVGIFVSQETSVTRYVFTD*
Ga0134085_1048868013300015359Grasslands SoilENFRDFKEFDQRNYDEILGLFENLRALETQLVGIFVSQETSATRYVFTE*
Ga0132257_10363352423300015373Arabidopsis RhizosphereFDQRQYDEILGLFENLRSLETQLVGIFVPQETSGSRYVFTE*
Ga0187821_1004717513300017936Freshwater SedimentKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSATRYVFTE
Ga0066667_1229900623300018433Grasslands SoilDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE
Ga0190268_1082940213300018466SoilGEMILEENFRDFKEFDQRNYDEILGLFENLRALEAQLVGIFVAQETSGTRYVFTE
Ga0190270_1071702813300018469SoilRNYDEILGLFENLRSLETQLVGIFVSQETSAARYVFTE
Ga0193609_107788723300018906SoilVEENFRDFKEFDQRHYDEILGLFENLRALESQLVGIFVSQETSASRYVFSR
Ga0190273_1034040413300018920SoilNFRDFKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGSRYVFTN
Ga0193722_113452723300019877SoilNEVLGLFENLRSMETQLVGIFVPQETSATRYVFTE
Ga0179590_108612213300020140Vadose Zone SoilDTGEKVLEENFRDFKEFDQRNYDEILGLFENLRSLEAQLVGIFVSQETSASRYVFTE
Ga0193699_1041567523300021363SoilRNYDEILGLFENLRALESQLIGIFVAQETSGSRYVFTN
Ga0213874_1010642323300021377Plant RootsDSGEKVLEDNFRDFKEFDQRNYDELLGLFENLRSLETQLVGMFVPQETSATRYVFTE
Ga0224512_1031340913300022226SedimentRDFKEYAQRDYDEILGLFENLRSLESQILSIFVAQERSATRFLFVP
Ga0247677_101672123300024245SoilKVLEDNFRDFKEFDQRNYDEILGLFENLRSMETQLVGIFVPQETSATRYVFTE
Ga0209640_1061325713300025324SoilDNFRDFKEFDQRNYDEILGLFENLRSLETQLLGIFVSQETSATRFVFRH
Ga0207684_1059225323300025910Corn, Switchgrass And Miscanthus RhizosphereKVLEENFRDFKEFDQRNYDEILGLFENLRSMEAQLVGIFVSQETSASRYVFTE
Ga0207681_1148535923300025923Switchgrass RhizosphereFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSASRYVFTE
Ga0207687_1149057013300025927Miscanthus RhizosphereTGEKVLEENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE
Ga0207669_1043563713300025937Miscanthus RhizosphereRDVKEFDQRNYDEILGLFENLRALETQLVGIFVAQETSATRYVFTE
Ga0207704_1005283423300025938Miscanthus RhizosphereLEENFRDFKEFDQRNYDEILGLFENLRSMEAQLVGIFVSQETSASRYVFTE
Ga0207711_1203892123300025941Switchgrass RhizosphereNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSASRYVFTE
Ga0207689_1148796623300025942Miscanthus RhizosphereDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE
Ga0207676_1135497913300026095Switchgrass RhizosphereRDFKEFDQRNYDEILGLFENLRALEGQLVGIFVSSETSAARYVFTE
Ga0207683_1134948023300026121Miscanthus RhizosphereENFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSASRYIFTE
Ga0209807_116019733300026530SoilYDEILGLFENLRSLETQLVGIFVPQETSASRYVFTE
Ga0209577_1014080213300026552SoilLEDNFRDFKEFDQRNYDELLGLFENLRSLETQLVGIFVPQETSATRYVFTE
Ga0137415_1061589323300028536Vadose Zone SoilVLEDNFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVPQETSAARYVFTQ
Ga0307405_1075841813300031731RhizosphereEENFRDFKEFDQRNYDEILGLFENLRSLETQLVGIFVSRETSATRYVFTD
Ga0326597_1125552323300031965SoilVEENFRDFKEFDQRNYDEILGLFENLRSLESQLVGIFVPQETSATRYVFTE
Ga0307416_10258011223300032002RhizosphereTGEKVLEENFRDFKEFDQRNYDEILGLFENLRALETQLVGIFVSRETSATRYVFTD
Ga0334961_034140_741_8723300034143Sub-Biocrust SoilKEFDQRNYDEILGLFENLRALESQLVGIFVAQETSGTRYVFTE
Ga0372943_0930706_409_5763300034268SoilGEKVVEENFRDFKEFDQRNYDEILGLFENLRALEAQLVGIFVPQETSASRYVFTD
Ga0372946_0225099_2_1363300034384SoilFKEFDQRNYDEILGLFENLRALETQLVGIFVAQETSGSRYVFTN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.