NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F090752

Metagenome / Metatranscriptome Family F090752

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F090752
Family Type Metagenome / Metatranscriptome
Number of Sequences 108
Average Sequence Length 42 residues
Representative Sequence VWDGMDGPIVERENVALDSQFAVHRYLRLAEKTAVGFHWDW
Number of Associated Samples 101
Number of Associated Scaffolds 108

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 3.70 %
% of genes near scaffold ends (potentially truncated) 97.22 %
% of genes from short scaffolds (< 2000 bps) 90.74 %
Associated GOLD sequencing projects 98
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (51.852 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(27.778 % of family members)
Environment Ontology (ENVO) Unclassified
(27.778 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.
1JGI26340J50214_100065943
2JGIcombinedJ51221_103219802
3Ga0066688_107832162
4Ga0066388_1044533482
5Ga0070668_1002816711
6Ga0070707_1006892832
7Ga0070741_108079312
8Ga0070684_1001534253
9Ga0066697_107601562
10Ga0066670_107537492
11Ga0066693_102173991
12Ga0066696_103111312
13Ga0075029_1008973241
14Ga0075030_1007314201
15Ga0075014_1001605032
16Ga0070712_1003358892
17Ga0070765_1023188881
18Ga0079222_110031502
19Ga0079220_101561591
20Ga0073928_105060362
21Ga0075424_1019702951
22Ga0079219_120242051
23Ga0066710_1006039203
24Ga0105237_104456062
25Ga0116216_106327041
26Ga0126374_111536381
27Ga0116219_101408372
28Ga0134082_102375541
29Ga0126376_129168552
30Ga0126361_100762821
31Ga0126350_103167682
32Ga0137365_102853631
33Ga0137380_106484823
34Ga0157329_10012961
35Ga0157342_10034971
36Ga0137398_106406821
37Ga0164306_101700731
38Ga0164306_117412202
39Ga0157372_119235981
40Ga0134081_104019671
41Ga0134078_101890251
42Ga0181526_110883772
43Ga0182000_100628591
44Ga0182041_109487282
45Ga0182041_119618981
46Ga0182033_111965632
47Ga0182039_113382722
48Ga0182038_101019621
49Ga0187781_108917231
50Ga0187777_100953323
51Ga0187883_106489033
52Ga0187887_103764432
53Ga0187774_102730752
54Ga0210395_113718422
55Ga0210400_110534802
56Ga0210393_106601981
57Ga0210389_102930962
58Ga0210384_107315741
59Ga0210398_100329694
60Ga0222622_106233572
61Ga0247692_10135181
62Ga0207681_101237601
63Ga0209152_103774212
64Ga0208098_10002383
65Ga0207777_10423282
66Ga0209115_11295892
67Ga0209517_106143241
68Ga0209590_103696051
69Ga0302223_103124391
70Ga0307306_100808311
71Ga0307312_111787111
72Ga0308309_114170282
73Ga0311340_110562921
74Ga0302176_101885161
75Ga0311357_110320091
76Ga0302317_104190012
77Ga0265462_111659453
78Ga0302325_103856963
79Ga0318541_103433061
80Ga0307508_108482842
81Ga0318542_103078182
82Ga0318572_105944122
83Ga0310686_1180179651
84Ga0318493_106413502
85Ga0318548_103429191
86Ga0318548_104528251
87Ga0318565_101993721
88Ga0318568_105704122
89Ga0307473_111247741
90Ga0318564_104442591
91Ga0310917_102913451
92Ga0310917_107286621
93Ga0318527_104734362
94Ga0306919_103600571
95Ga0306926_113859441
96Ga0318562_106911881
97Ga0318562_107048231
98Ga0310911_102693932
99Ga0318559_102361251
100Ga0318556_104915122
101Ga0318505_100142053
102Ga0318505_101616022
103Ga0318514_104965782
104Ga0318514_105279732
105Ga0318524_105645182
106Ga0306924_105091862
107Ga0335081_100594216
108Ga0335077_102058311
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 39.02%    β-sheet: 0.00%    Coil/Unstructured: 60.98%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540VWDGMDGPIVERENVALDSQFAVHRYLRLAEKTAVGFHWDWSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
48.1%51.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Bog
Iron-Sulfur Acid Spring
Watersheds
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Palsa
Arabidopsis Rhizosphere
Ectomycorrhiza
Populus Rhizosphere
Corn Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Boreal Forest Soil
2.8%4.6%3.7%2.8%2.8%2.8%5.6%27.8%7.4%2.8%2.8%5.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI26340J50214_1000659433300003368Bog Forest SoilAVWDGMDGPIVERESVTLDSQFAVHRYLRLAEKTTAGFHWDW*
JGIcombinedJ51221_1032198023300003505Forest SoilDGMDGPIVEREPVGLDSQFAVHRYLRLAEQTAVGFHWDW*
Ga0066688_1078321623300005178SoilVEWAVWDGLDGPIVERERVALDYQFAVHRYLRLAEKTAVGFHWDW*
Ga0066388_10445334823300005332Tropical Forest SoilWAVWDGMDGPIVERENVVLDSQFAVHRYLRLAEKTVIGFHWDW*
Ga0070668_10028167113300005347Switchgrass RhizosphereKLPGNVVWAVWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0070707_10068928323300005468Corn, Switchgrass And Miscanthus RhizosphereGNVVWAVWDGMDGPIIERESVALDSQFAVHRYLRLAEQTVVGFHWDW*
Ga0070741_1080793123300005529Surface SoilDGMEGPIVDAQPVTLDSQHAVHRYLRLAERTVAGFHWEW*
Ga0070684_10015342533300005535Corn RhizosphereWAVWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0066697_1076015623300005540SoilPGKLPGNVVWAIWDGMDGPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0066670_1075374923300005560SoilFHPGKLPGNVVWAIWDGMDGPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0066693_1021739913300005566SoilWDGMDGPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0066696_1031113123300006032SoilVWAIWDGMDGPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0075029_10089732413300006052WatershedsVVWAVWDGMDGPIVERERVALDSQFAVHRYLRLAEKTTVGFHWDW*
Ga0075030_10073142013300006162WatershedsVWDGMDGPIVERESVALDSQFAAHRYLRLAEKTTVGFHWDW*
Ga0075014_10016050323300006174WatershedsWAVWDGMDGPIIERENVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0070712_10033588923300006175Corn, Switchgrass And Miscanthus RhizosphereDGMDGPIIERENVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0070765_10231888813300006176SoilAVWDGMDGPIVERQPVVLDSQFAVHRYLRLAEKTVAGFHWDW*
Ga0079222_1100315023300006755Agricultural SoilWAVWDGMDGPIIERENVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0079220_1015615913300006806Agricultural SoilGPIAESERVTLDSQFAVHRYLRLAEKTTVGFHWDW*
Ga0073928_1050603623300006893Iron-Sulfur Acid SpringWAVWDGLDGPIVNREDVTLDSQFAVHRYLRLAEKTAVGFHWDW*
Ga0075424_10197029513300006904Populus RhizosphereKLPGNVVWAVWDGMDGPIIERESVVLDSQFAVHRYLRLAETTVVGFHWDW*
Ga0079219_1202420513300006954Agricultural SoilPIAESEHVTLDSQFAVHRYLRLAEQTTVGFHWDW*
Ga0066710_10060392033300009012Grasslands SoilWDGMDGPIIERENVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0105237_1044560623300009545Corn RhizosphereGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0116216_1063270413300009698Peatlands SoilLPAKVEWAVWDGMDGPIVERESVTLDSQFAVHRYLRLAEKTTVGFHWDW*
Ga0126374_1115363813300009792Tropical Forest SoilPASCRAPSWAVWDGMDGPIIERESVALDSQFAVHRYLRLAENTAVGFHWDW*
Ga0116219_1014083723300009824Peatlands SoilDKLPSNVEWAVWDGLEGAIVERESVALDTQFAVHRYLRLAEKTAVGFHWDW*
Ga0134082_1023755413300010303Grasslands SoilPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW*
Ga0126376_1291685523300010359Tropical Forest SoilVWDGMDGPIIERENVALDSQFAVHRYLRLAEKTAVGFHWDW*
Ga0126361_1007628213300010876Boreal Forest SoilMDGPIVKRESVTLDSQFAVHRYLRLAEKTTVGFHWDWSH*
Ga0126350_1031676823300010880Boreal Forest SoilPVNVVWAVWDGLDGPIVNREDVTLDSQFAVHRYLRLAEKTAVGFHWDW*
Ga0137365_1028536313300012201Vadose Zone SoilFHPGKLPGNVVWAIWDGMDGPIIERESVALDSQFAVHRYLRLAEKTAVGFHWDW*
Ga0137380_1064848233300012206Vadose Zone SoilDGMDGPIAEREHVTLDSQFAVHRYLRLAEKTAVGFHWDW*
Ga0157329_100129613300012491Arabidopsis RhizosphereNVVWAVWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0157342_100349713300012507Arabidopsis RhizosphereMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0137398_1064068213300012683Vadose Zone SoilGIDGPIAESERVTLDSQFAVHRYLRLAEQTTVGFHWDW*
Ga0164306_1017007313300012988SoilKLPGSVVWAVWDGMDSPINEREHVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0164306_1174122023300012988SoilWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0157372_1192359813300013307Corn RhizosphereGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0134081_1040196713300014150Grasslands SoilPIIERESVVLDSQFAVHRYLRLAETTVVGFHWDW*
Ga0134078_1018902513300014157Grasslands SoilLPDSVVWAVWDGMDGPIIDREQVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0181526_1108837723300014200BogMDGPIVERESVTLDSQFAVHRYLRLAEKTTVGFHWDW*
Ga0182000_1006285913300014487SoilPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW*
Ga0182041_1094872823300016294SoilVWAIWDGMDGPIVERERVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0182041_1196189813300016294SoilNVEWAVWDGLDGPIVEREPVVLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0182033_1119656323300016319SoilVAVWDGMDGPIIERESVTLDTQFAVHRYLRLAEKTAVGFHWDW
Ga0182039_1133827223300016422SoilLPRNVVWAVWDGMDGPIVERERVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0182038_1010196213300016445SoilGPIVEREPVALDSQFAVHRYLRLAERTAVGFHWDW
Ga0187781_1089172313300017972Tropical PeatlandNVVWAVWDGMDGPIVEQEKVSLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0187777_1009533233300017974Tropical PeatlandPSDVVWAVWDGMDGPIVEREHVSLDRQFAVHRYLRLAEKTTVGFHWDW
Ga0187883_1064890333300018037PeatlandDGMDGPIAESEQVTLDRQFAVHRYLRLAEQTTVGFHWNW
Ga0187887_1037644323300018043PeatlandDGPIVEREPVELDSQFAVHRYLRLAEKTTVGFHWDW
Ga0187774_1027307523300018089Tropical PeatlandVWDGMDGPIVERENVALDSQFAVHRYLRLAERTVVGFHWDW
Ga0210395_1137184223300020582SoilWAVWDGMQGDVADDQEVVLDSQHSVHKYLRLAERTVVGYHWEW
Ga0210400_1105348023300021170SoilVWDGLEGPIVEREPVALDTQFAVHRYLRLAEQTAVGFHWDW
Ga0210393_1066019813300021401SoilGMEGPIVERESVALDSQFAVHRYLRLAEKTTVGFHWDW
Ga0210389_1029309623300021404SoilLPAQVVWAVWDGLDGPIVDREDVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0210384_1073157413300021432SoilWDGMDGPIIERESVALDSQFAVHRYLRLAEQTVVGFHWDW
Ga0210398_1003296943300021477SoilLPAKVVWAVWDGLDGPIVDREDVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0222622_1062335723300022756Groundwater SedimentNVVWAVWDGMDGPIIERESVVLDSQFAVHRYLRLAETTVVGFHWDW
Ga0247692_101351813300024279SoilDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0207681_1012376013300025923Switchgrass RhizosphereGNVVWAVWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0209152_1037742123300026325SoilDGPIIERESVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0208098_100023833300027172Forest SoilGLDGPIVEREPVALDSQFAVHRYLRLAEQTAVGFHWDW
Ga0207777_104232823300027330Tropical Forest SoilDGLDGPIVQREPVVLDSQFAVHRYLRLAEKTTAGFHWDW
Ga0209115_112958923300027567Forest SoilGPIVEREPVALDSQFAVHRYLRLAEQTAVGFHWDW
Ga0209517_1061432413300027854Peatlands SoilVWAVWDGMDGPIVERESVALDSQFAVHRYLRLAEKTTVGFHWDW
Ga0209590_1036960513300027882Vadose Zone SoilVWDGMDGPIVERERVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0302223_1031243913300028781PalsaDGLDGPIVEREHVALDSQFAVHRYLRLAEKTAAGFHWDW
Ga0307306_1008083113300028782SoilVWDGMDGPIVERESVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0307312_1117871113300028828SoilPGNGVWAVWDGMDGPIIERESVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0308309_1141702823300028906SoilGPIVEREHVALDSQFAVHRYLRLAEKTAAGFHWDW
Ga0311340_1105629213300029943PalsaSIVWAVWDGLDGPIVEREPAILDSQFAVHRYLRLAEQTAVGFHWDW
Ga0302176_1018851613300030057PalsaLPGNVVWAVWDGMDGPIVEREPVVLDSQFAVHRYLRLAEKTVAGFHWDW
Ga0311357_1103200913300030524PalsaVVWAVWDGMDGPIVEREPVVLDSQFAVHRYLRLAEKTVAGFHWDW
Ga0302317_1041900123300030677PalsaWDGMDGPIVEREPVVLDSQFAVHRYLRLAEKTAVGSHWDW
Ga0265462_1116594533300030738SoilDVVWAVWDGMDGPIVDRESVGLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0302325_1038569633300031234PalsaDGLDGPIVEREPAVLDSQFAVHRYLRLAEQTAVGFHWDW
Ga0318541_1034330613300031545SoilDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0307508_1084828423300031616EctomycorrhizaWAVWDGIDGPIAESERVTLDSQFAVHRYLRLAEQTTVGFHWDW
Ga0318542_1030781823300031668SoilWDGMDGPIVERENVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318572_1059441223300031681SoilVWDGLDGPIVEREPVVLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0310686_11801796513300031708SoilEWAVWDGLDGPIVEREPVALDSQFAVHRYLRLAEHTAAGFHWDW
Ga0318493_1064135023300031723SoilPSSVVWAVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0318548_1034291913300031793SoilKLPGSVVWAVWDGMDGPIIERENVSLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318548_1045282513300031793SoilVVWAVWDGMDGPIVEREPVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318565_1019937213300031799SoilLPGSVVWAVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0318568_1057041223300031819SoilVWDGLDGPIVQREPVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0307473_1112477413300031820Hardwood Forest SoilAVWDGMDGPIVEREDVALDSQFAVHRYLRLAETTVVGFHWDW
Ga0318564_1044425913300031831SoilVPGNVEWAIWDGLDGPIVERERVDLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0310917_1029134513300031833SoilKLPGSVVWAVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0310917_1072866213300031833SoilLDGPIVEREPVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318527_1047343623300031859SoilLDGPIVEREPVVLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0306919_1036005713300031879SoilEGSIVEREPVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0306926_1138594413300031954SoilMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0318562_1069118813300032008SoilVWDGLDGPIVQREPVVLDSQFAVHRYLRLAEQTAVGFHWDW
Ga0318562_1070482313300032008SoilGSVVWAVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0310911_1026939323300032035SoilLPSNVVWAVWDGMDGPIIERENVSLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318559_1023612513300032039SoilWAVWDGLDGPIVQREPVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318556_1049151223300032043SoilGMDGPIVEREPVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318505_1001420533300032060SoilVWDGLDGPIVEREPVALDSQFAVHRYLRLAERTAVGFHWDW
Ga0318505_1016160223300032060SoilDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW
Ga0318514_1049657823300032066SoilVDWAVWDGLDGPIVEREPVVLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318514_1052797323300032066SoilVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0318524_1056451823300032067SoilGMDGPIIERESVMLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0306924_1050918623300032076SoilEWAVWDGLDGPIVERETVALDSQFAVHRYLRLAEKTAVGFHWDW
Ga0335081_1005942163300032892SoilMDGPIVEREPVTLDSQFAVHRYLRLAEKTAVGFHWDW
Ga0335077_1020583113300033158SoilWAVWDGMDGPIVERENVALDSQFAVHRYLRLAEKTVVGFHWDW


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.