NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F096105

Metagenome / Metatranscriptome Family F096105

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F096105
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 41 residues
Representative Sequence MEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRA
Number of Associated Samples 89
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 94.29 %
% of genes near scaffold ends (potentially truncated) 94.29 %
% of genes from short scaffolds (< 2000 bps) 92.38 %
Associated GOLD sequencing projects 86
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (82.857 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(34.286 % of family members)
Environment Ontology (ENVO) Unclassified
(35.238 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(39.048 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.
1JGI12635J15846_105454991
2Ga0062595_1020627301
3Ga0066684_103960792
4Ga0070671_1010761002
5Ga0070709_106462592
6Ga0070714_1007054941
7Ga0070710_100010344
8Ga0070710_110098521
9Ga0070700_1009906012
10Ga0066707_104222301
11Ga0068857_1010247422
12Ga0066706_113302151
13Ga0068860_1011953332
14Ga0070717_109686701
15Ga0070717_116196132
16Ga0070715_106768911
17Ga0070765_1006024711
18Ga0070765_1008648982
19Ga0079221_108337192
20Ga0079220_102736642
21Ga0075425_1005246962
22Ga0079219_100210293
23Ga0066709_1008651571
24Ga0126384_102629182
25Ga0134062_107184422
26Ga0126376_127505291
27Ga0126376_129387351
28Ga0126378_121608702
29Ga0126378_130707222
30Ga0126379_115095661
31Ga0126379_123062972
32Ga0134128_118321401
33Ga0126381_1011891032
34Ga0126381_1018719912
35Ga0126347_12897392
36Ga0137776_14331333
37Ga0105246_101635982
38Ga0137382_100779573
39Ga0157369_106572622
40Ga0182036_113427501
41Ga0182041_109673281
42Ga0182033_106342811
43Ga0182040_118921721
44Ga0187814_100995642
45Ga0187819_107716161
46Ga0187779_110572851
47Ga0187782_116684971
48Ga0187766_108077983
49Ga0187766_113643752
50Ga0193723_11039462
51Ga0206356_112191532
52Ga0210408_110555981
53Ga0210385_114108851
54Ga0210383_110738492
55Ga0210402_107278312
56Ga0247671_10721191
57Ga0207692_101852741
58Ga0207699_102876422
59Ga0207663_106129472
60Ga0207660_108202402
61Ga0207646_115996422
62Ga0207664_103851541
63Ga0207658_110734211
64Ga0209265_10937751
65Ga0268265_106622642
66Ga0307282_100069501
67Ga0311355_107734552
68Ga0307499_100374042
69Ga0318538_105707741
70Ga0318571_100834062
71Ga0318571_104475652
72Ga0318515_100536911
73Ga0310915_109669431
74Ga0318555_107251132
75Ga0307469_102564901
76Ga0318493_104358922
77Ga0318493_105591721
78Ga0318493_108853881
79Ga0318492_105849091
80Ga0318494_104294051
81Ga0318494_105052082
82Ga0318521_100326681
83Ga0318546_105764291
84Ga0318547_104175762
85Ga0318547_107386791
86Ga0318565_100297864
87Ga0318497_108727121
88Ga0318568_103524191
89Ga0318564_101062531
90Ga0318517_103679382
91Ga0318511_101519922
92Ga0318511_103360461
93Ga0318512_102457481
94Ga0318536_101232192
95Ga0306923_105900671
96Ga0306923_108827932
97Ga0318569_104942081
98Ga0318569_105001991
99Ga0318559_104446761
100Ga0318549_103296892
101Ga0318556_103241701
102Ga0318570_100402251
103Ga0335078_121342092
104Ga0335083_102264212
105Ga0335077_106554771
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 43.59%    β-sheet: 0.00%    Coil/Unstructured: 56.41%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRASequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
82.9%17.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Soil
Tropical Peatland
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Agricultural Soil
Palsa
Corn Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Boreal Forest Soil
2.9%8.6%2.9%3.8%34.3%5.7%2.9%3.8%10.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1054549913300001593Forest SoilVKHPLEMDAEAMRQAGYATVDALVARLARPEADPVLR
Ga0062595_10206273013300004479SoilVEHPLRMDTEAMRRAGYATVDALVDRLAAPAAGPVLRRASP
Ga0066684_1039607923300005179SoilVNHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRADAA
Ga0070671_10107610023300005355Switchgrass RhizosphereVNHPLAMDAEAMRRAGYATVDALVARLADPEGDPVLRRAGAADM
Ga0070709_1064625923300005434Corn, Switchgrass And Miscanthus RhizosphereVNHPLAMDAEAMRRAGYATVDALVARLADPEGDPVLRRAG
Ga0070714_10070549413300005435Agricultural SoilMEHPLAMDAEAMRRAGYATVDALVARLADPGADPVLRRAGAAEM
Ga0070710_1000103443300005437Corn, Switchgrass And Miscanthus RhizosphereMVDHPLTMDVEAMRRAGYAAVDALAARLADPEGGPVLRRASPAQMRAG*
Ga0070710_1100985213300005437Corn, Switchgrass And Miscanthus RhizosphereVDHPLAMDAETMRRAGYATVDALVARLADPEGDPVLRRAGAADMRTRL
Ga0070700_10099060123300005441Corn, Switchgrass And Miscanthus RhizosphereMEHPLAMDADAMRRAGYATVDALVARLADPGADPVLRRAGAAQM
Ga0066707_1042223013300005556SoilMEHPLAMDAEAMRRAGYATVDALVARLADPGADPVLRRA
Ga0068857_10102474223300005577Corn RhizosphereVDHPLGMDAEAMRRAGYATVDALVARMTEPGAGPVVRRAEPAEMRARL
Ga0066706_1133021513300005598SoilVDHPLAMEVEAMRRAGYAAVDALAARMADPEAEPVLRRASPA
Ga0068860_10119533323300005843Switchgrass RhizosphereMEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRR
Ga0070717_1096867013300006028Corn, Switchgrass And Miscanthus RhizosphereMEHPLAMDAEAMRRAGYATVDALVARLADPQADPVLRRADAADMRARL
Ga0070717_1161961323300006028Corn, Switchgrass And Miscanthus RhizosphereVNHPLAMDAEAMRRAGYATVDALVARLADPEGDPVLRRAGAAD
Ga0070715_1067689113300006163Corn, Switchgrass And Miscanthus RhizosphereMEHPLAMDADAMRRAGYATVDALVARLADPGADPV
Ga0070765_10060247113300006176SoilMEHPLAMDAEAMRHAGYATVDALVARLADPEADPVLRRADAAG
Ga0070765_10086489823300006176SoilVEHPLAMDAEAMRRAGYATVDALVARLADPGADPVL
Ga0079221_1083371923300006804Agricultural SoilMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLRRAGAADM
Ga0079220_1027366423300006806Agricultural SoilVDHPLAMDAETMRRAGYATVDALVARLADPEGDPVLRRAGAADMRTRLGG
Ga0075425_10052469623300006854Populus RhizosphereMEHPLAMDAETMRRAGYATVDALVARLADPGADPVLRRAGA
Ga0079219_1002102933300006954Agricultural SoilMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLRRAG
Ga0066709_10086515713300009137Grasslands SoilMEVEAMRRAGYAAVDALAARMADPEAEPVLRRASPAEMRERLG
Ga0126384_1026291823300010046Tropical Forest SoilMEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRGSASAGGPAGR*
Ga0134062_1071844223300010337Grasslands SoilMEHPLAMDAEAMRRAGYATVDALVARLADPTADPVLRRADAA
Ga0126376_1275052913300010359Tropical Forest SoilMDIEAMRRAGYATVDALVARLADPGADPVLRRADAAEMRS
Ga0126376_1293873513300010359Tropical Forest SoilMEVEAMRRAGYAAVDALAAWMADPEARPVLRRASPAEMR
Ga0126378_1216087023300010361Tropical Forest SoilMEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRAGAAEMRA
Ga0126378_1307072223300010361Tropical Forest SoilMEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRASP
Ga0126379_1150956613300010366Tropical Forest SoilMEHPLAMDAEAMRRAGYATVDALVARLADPGADPV
Ga0126379_1230629723300010366Tropical Forest SoilVEHPLAMDIEEMRRAGYATVDALVARLADPAADPVLRRAD
Ga0134128_1183214013300010373Terrestrial SoilVDHPLAMDAEAMRRAGYATVDALVARLADPEGDPVLRRAG
Ga0126381_10118910323300010376Tropical Forest SoilVEHPLAMDIEEMRRAGYATVDALVARLADPAADPVLRRADA
Ga0126381_10187199123300010376Tropical Forest SoilMEVEAMRRAGYAAVDALAALMADPEARPVLRRASPAEMRE
Ga0126347_128973923300010867Boreal Forest SoilMDVEAMRRAGYATVDALVARLADPEADPVLRRAGAADMRAR
Ga0137776_143313333300010937SedimentMSVEAMRRAGYAAEDALAARMADPEAEPVLRRASPA
Ga0105246_1016359823300011119Miscanthus RhizosphereMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLR
Ga0137382_1007795733300012200Vadose Zone SoilMDAEAMRRAGYATVDALVARLADPEADPVLRRADPAALR
Ga0157369_1065726223300013105Corn RhizosphereMDAEAMRRAGYATVDALVARMTEPGADPVVRRAEPAEMRARLG
Ga0182036_1134275013300016270SoilMNHPLAMDAETMRRAGYATVDALVARLADPEADPVLRRADAAGLRAR
Ga0182041_1096732813300016294SoilMVDHPLAMDAEAMRRTGYAAVNALVARLADPEAEPVL
Ga0182033_1063428113300016319SoilMVDHPLAMDVEAMRRTGYAAVDALVARLADPEAEPV
Ga0182040_1189217213300016387SoilMVDHPLAMDVEAMRRTGYAAVDALVARLADPEAEPVLRRASPAVMRE
Ga0187814_1009956423300017932Freshwater SedimentMKHPLAMHVEDMRRAGYATVDALVARLADPEADPVLLRAEPGSLR
Ga0187819_1077161613300017943Freshwater SedimentMTHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRADAAQMRSRL
Ga0187779_1105728513300017959Tropical PeatlandVEHPLAMDIEAMRRAGYATVDALVARLADPAADPVLRRAGAAEMRSRL
Ga0187782_1166849713300017975Tropical PeatlandVEHPLAMDIQEMRRAGYATVDALVARLADPAGDPVLRRADAATM
Ga0187766_1080779833300018058Tropical PeatlandMEHPLAMDAEAMRRAGYATVDALVARLADPAADPVLRG
Ga0187766_1136437523300018058Tropical PeatlandVDHPLAMDVEAMRRTGYAAVDALAARLADPEADPVLRRASP
Ga0193723_110394623300019879SoilMEHPLAMDVEAMRRAGYATVDALVARLADPEADPV
Ga0206356_1121915323300020070Corn, Switchgrass And Miscanthus RhizosphereMEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRA
Ga0210408_1105559813300021178SoilVEHPLAMDAEAMRRAGYATVDALVARLADPGADPVLRRAEAADMASRL
Ga0210385_1141088513300021402SoilMEHPLAMDAEAMRRAGYATVDALVARLADPAADPVLRRADA
Ga0210383_1107384923300021407SoilVEHPLAMDAEAMRRAGYATVDALVARLADPGADPVLRRAEA
Ga0210402_1072783123300021478SoilMEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRADA
Ga0247671_107211913300024284SoilMEHPLAMDAEAMRRAGYATVDALVARLADPEADPVLRRAGAAD
Ga0207692_1018527413300025898Corn, Switchgrass And Miscanthus RhizosphereMVDHPLTMDVEAMRRAGYAAVDALAARLADPEGGPVLRRASPAQMRAG
Ga0207699_1028764223300025906Corn, Switchgrass And Miscanthus RhizosphereVDHPLGMDAEAMRRAGYATVDALVARMTEPGAGPV
Ga0207663_1061294723300025916Corn, Switchgrass And Miscanthus RhizosphereMEHPLAMDAEAMRRAGYATVDALVARLADPAADPVLRRA
Ga0207660_1082024023300025917Corn RhizosphereMEHPLAMDAEAMRRAGYATVDALVARLADPEADPVL
Ga0207646_1159964223300025922Corn, Switchgrass And Miscanthus RhizosphereLAMDVEAMRRAGYATVDALVARLANPRVDPVLRRATA
Ga0207664_1038515413300025929Agricultural SoilVDHPLAMDAEAMRRAGYATVDALVARLADPESDPV
Ga0207658_1107342113300025986Switchgrass RhizosphereMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLRRAGAAD
Ga0209265_109377513300026308SoilVDHPLAMEVEAMRRAGYAAVDALAARMADPEAEPVLR
Ga0268265_1066226423300028380Switchgrass RhizosphereMEHPLAMDVETMRRAGYATVDALVARLADPEADPVLR
Ga0307282_1000695013300028784SoilMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLRRAGA
Ga0311355_1077345523300030580PalsaVEHPLEMDVEAMRRAGYATVDALVARLADPAADPVLR
Ga0307499_1003740423300031184SoilVEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLR
Ga0318538_1057077413300031546SoilMEHPLAMDIEAMRRAGYATVDALVARLADPGADPVLRRAGAAQ
Ga0318571_1008340623300031549SoilMVDHPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRRPG
Ga0318571_1044756523300031549SoilVEHPLAMDIEAMRRAGYATVDALVARLASPEADPV
Ga0318515_1005369113300031572SoilMVDHPLAMDVEAMRRTGYAAVDALVARLADPGADPVLRRPG
Ga0310915_1096694313300031573SoilVDHPLAMDVEAMRRAGYAAVDALAARMADPEAEPVLRRA
Ga0318555_1072511323300031640SoilMEHPLAMDVEDMRRAGYATVDALVARLADPEADPVLRRADPARLRPLLG
Ga0307469_1025649013300031720Hardwood Forest SoilMEHPLAMDAEAMRRAGYTTVDALVARLADPGADPV
Ga0318493_1043589223300031723SoilHPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRRPG
Ga0318493_1055917213300031723SoilMVDHPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRRASPE
Ga0318493_1088538813300031723SoilMEHPLAMDVEDMRRAGYATVDALVARLADPEADPVLRRADPA
Ga0318492_1058490913300031748SoilMRRAGYAAVDALAARMADPEAEPVLRRASPAEMRERLGGAP
Ga0318494_1042940513300031751SoilGGMVDHPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRRPG
Ga0318494_1050520823300031751SoilVEHPLAMDIEAMRRAGYATVDALVARLASPGADPV
Ga0318521_1003266813300031770SoilMVDHPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRR
Ga0318546_1057642913300031771SoilVEHPLAMDIEAMRRAGYATVDALVARLANPEADPVLRRADVAEMR
Ga0318547_1041757623300031781SoilMNHPLAMDAETMRRAGYATVDALVARLADPEADPVLRRADAA
Ga0318547_1073867913300031781SoilVEQPLAMDVEAMRRAGYATVDALVARLAHPEADPVLRRAEAAQMRS
Ga0318565_1002978643300031799SoilMVDHPLAMDVEAMRRTGYAAVDALVARLTDPGADPVLRRPG
Ga0318497_1087271213300031805SoilMEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRADAAQM
Ga0318568_1035241913300031819SoilVDHPLAMDVEAMRRAGYAAVDALAARMADPEAEPV
Ga0318564_1010625313300031831SoilMVDHPLAMDVEAMRRTGYAAVDALVARLADPEADPVLRRA
Ga0318517_1036793823300031835SoilPLAMDVEAMRRTGYAAVDALVALLADPGADPVLRRPG
Ga0318511_1015199223300031845SoilMEHPLAMDAETMRRAGYATVDALVARLADPGADPVL
Ga0318511_1033604613300031845SoilMNHPLAMDAETMRRAGYATVDALVARLADPEADPVLRRADAAGLRARLG
Ga0318512_1024574813300031846SoilMEHPLAMDAETMRRAGYATVDALVARLADPGADPVLRRAGAA
Ga0318536_1012321923300031893SoilVDHPLAMEVEAMRRAGYAAVDALAARLADPQAIPYFTT
Ga0306923_1059006713300031910SoilMNHPLAMDAETMRRAGYATVDALVARLADPGADPVLRRAGAAAMRARL
Ga0306923_1088279323300031910SoilVEHPLAMDIEAMRRAGYATVDALVARLASPGADPVLRRA
Ga0318569_1049420813300032010SoilMEHPLAMDAEAMRRAGYATVDALVTRLADPGDDPVLRRADAAD
Ga0318569_1050019913300032010SoilMNHPLAMDAETMRRAGYATVDALVARLADPEADPVLRRADAAGLR
Ga0318559_1044467613300032039SoilMEHPLAMDVEAMRRAGYATVDALVARLADPGADPVLRRADA
Ga0318549_1032968923300032041SoilMNHPLAMDAETMRRAGYATVDALVARLADPEADPVLRRADAAGLRARR
Ga0318556_1032417013300032043SoilWWWEGGMVDHPLAMDVEAMRRTGYAAVDALVARLADPGADPVLRRPG
Ga0318570_1004022513300032054SoilVDHPLAMDVEAMRRAGYAAVDALAARMADPEAEPVLR
Ga0335078_1213420923300032805SoilMEHPLAMDVEAMRRAGYATVDALVARLADPEADPVLRRAGAADMRARL
Ga0335083_1022642123300032954SoilMEHPLAMDAEAMRRAGYATVDALVARLADPGAGDG
Ga0335077_1065547713300033158SoilVDHPLAMDVEAMRRTGYAAVDALAARLADPAAGPVLRRASAA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.