NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F101410



Overview

Basic Information
Family ID F101410
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 43 residues
Representative Sequence GRALGLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGP
Number of Associated Samples 91
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 100.00 %
% of genes from short scaffolds (< 2000 bps) 89.22 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.47

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (sequence logo, powered by Skylign)

Most Common Taxonomy
Group Bacteria (77.451 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(27.451 % of family members)
Environment Ontology (ENVO) Unclassified
(23.529 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(38.235 % of family members)
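
The percentages reported in this overview can be read as shares of the family's 102 sequences. The sketch below converts them back into approximate member counts. It assumes every percentage is taken over all 102 members, which the page does not state explicitly; the helper name `percent_to_count` is hypothetical.

```python
# Assumption: each reported percentage is a fraction of all family members.
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information

reported_percentages = {
    "Most common taxonomic group (Bacteria)": 77.451,
    "GOLD ecosystem (Forest Soil -> Soil)": 27.451,
    "ENVO (Unclassified)": 23.529,
    "EMPO (Soil, non-saline)": 38.235,
    "Genes from short scaffolds (< 2000 bps)": 89.22,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole number of members."""
    return round(percent / 100 * total)


for label, pct in reported_percentages.items():
    print(f"{label}: {pct} % ~ {percent_to_count(pct)} of {FAMILY_SIZE}")
```

Under that assumption, 77.451 % corresponds to 79 of the 102 sequences and 27.451 % to 28.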




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive MSAViewer display with 102 rows and a column ruler shown to position 58; the member IDs are listed under Family Sequences).
Powered by MSAViewer



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: No
Secondary Structure distribution: α-helix: 34.72%, β-sheet: 0.00%, Coil/Unstructured: 65.28%
Feature Viewer tracks over the representative sequence GRALGLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGP (position axis labeled every 5 residues, from 5 to 40): Sequence, α-helices, β-strands, Coil, SS Conf. score
Powered by Feature Viewer
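
A secondary-structure distribution like the one reported in this section is simply the share of positions assigned to each state. The sketch below shows that arithmetic. The per-residue string is hypothetical (the actual prediction is not available in the page text) and was chosen only so that its proportions match the reported figures; the H/E/C encoding and the function name `ss_distribution` are also assumptions.

```python
from collections import Counter


def ss_distribution(ss: str) -> dict:
    """Percentage of helix (H), strand (E) and coil (C) positions, rounded to 2 decimals."""
    if not ss:
        raise ValueError("empty secondary-structure string")
    counts = Counter(ss)
    return {state: round(100 * counts.get(state, 0) / len(ss), 2) for state in "HEC"}


# Hypothetical assignment: 25 helix and 47 coil positions out of 72.
example_ss = "C" * 20 + "H" * 25 + "C" * 27
print(ss_distribution(example_ss))
```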

Predicted 3D Structure

Structure Viewer


Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.47
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (chart of NCBI taxonomy)
Labels: All Organisms; Unclassified
Displayed slice values: 77.5%, 22.5%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (chart of associated habitat types)
Habitat labels: Bog Forest Soil; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Forest Soil; Hardwood Forest Soil; Peatland; Tropical Peatland; Corn Rhizosphere; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Switchgrass Rhizosphere
Displayed slice values: 3.9%, 6.9%, 9.8%, 27.5%, 2.9%, 2.9%, 3.9%, 3.9%, 7.8%, 2.9%
Powered by ApexCharts



Associated Samples


Geographical Distribution
Interactive map (powered by OpenStreetMap)




Family Sequences

Protein ID	Sample Taxon ID	Habitat	Sequence
Ga0062389_1045386782	3300004092	Bog Forest Soil	GRGTGRALLLVRELAGFTGRGQAPPADLEPLVDAVRAGQFAQAAGRG*
Ga0066388_1015840722	3300005332	Tropical Forest Soil	RLVRELAGFTARGQAPPGDLEPLVDAVRAGRFARAALPGR*
Ga0066388_1037098692	3300005332	Tropical Forest Soil	RLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGH*
Ga0070666_100889975	3300005335	Switchgrass Rhizosphere	GEGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0070682_1009240222	3300005337	Corn Rhizosphere	GTGRALRLVRELADFTARGQAPPADLEPLVDAVRAGSFARAGLPGR*
Ga0070714_1012273621	3300005435	Agricultural Soil	RFPRLGEGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0070710_105992021	3300005437	Corn, Switchgrass And Miscanthus Rhizosphere	PRLGEGTGRALRLLRELATFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPA*
Ga0070706_1019559691	3300005467	Corn, Switchgrass And Miscanthus Rhizosphere	RFPRLGEGTGRALRLLRELAAFTGHGQAPPADLEPLVDAVRAGRFTDAAGLTP*
Ga0070698_1014377741	3300005471	Corn, Switchgrass And Miscanthus Rhizosphere	EGTGRALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFTDAAGA*
Ga0070699_1014114352	3300005518	Corn, Switchgrass And Miscanthus Rhizosphere	LGEGTGRALRLLRELAAFTGPGQAPPADLEPLVDAVRAGRFTDAAAGLTP*
Ga0070679_1004563822	3300005530	Corn Rhizosphere	PRLGEGTGRALRLLRELAAFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0070672_1002420593	3300005543	Miscanthus Rhizosphere	EGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0070665_1024519052	3300005548	Switchgrass Rhizosphere	RALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG*
Ga0070702_1010817541	3300005615	Corn, Switchgrass And Miscanthus Rhizosphere	GRALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPPQRL*
Ga0066903_1087487991	3300005764	Tropical Forest Soil	GTGRALRLVRELAGFTARGQAPPADLEPLVDAVRAGRLARTALPGQ*
Ga0097621_1000269826	3300006237	Miscanthus Rhizosphere	RALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFANAAG*
Ga0074048_133899943	3300006581	Soil	ALRLVRELVDFTARGETPPAELEPLVDAVRAGRFARAAAGR*
Ga0079222_104357461	3300006755	Agricultural Soil	TGRALRLLRELAAFTGHGQAPPADLEPLVDAVRAGRFTDAAGADAVT*
Ga0079220_119219041	3300006806	Agricultural Soil	LADFTARGQAPPADLEPLVDAVRAGRFVQAAGLG*
Ga0075429_1013832161	3300006880	Populus Rhizosphere	EGTGRAVDLVREFADFTARGQAPPAELEPLVDAVRAGRFAGTALPGP*
Ga0105245_124726092	3300009098	Miscanthus Rhizosphere	LGEGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0105247_113935631	3300009101	Switchgrass Rhizosphere	GTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG*
Ga0126373_121808741	3300010048	Tropical Forest Soil	GRALGLVRQLAGFTARGQAPPPDMEPLVDAVRAGRFAWTALPGQ*
Ga0134111_103470612	3300010329	Grasslands Soil	SRLGEGTGRALRLVRELAGFTAHGQAPPAELEPLVDAVRAGRFARAALPGR*
Ga0126370_100916864	3300010358	Tropical Forest Soil	GEGTGRALGLVRELTDFTARGQAPPAELEPLVDAVRAGRFARTALPGE*
Ga0126376_116091812	3300010359	Tropical Forest Soil	ALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFADAAGLTP*
Ga0126378_100177788	3300010361	Tropical Forest Soil	LRLVRELAGFTARGQAPPADLEPLVEAVRAGRFARAAGLGS*
Ga0126377_108465071	3300010362	Tropical Forest Soil	LREIATFTGHGQAPPADLEPLVDAVRAGRFTDAAGPTP*
Ga0134128_100643651	3300010373	Terrestrial Soil	RALRLVRERAGFTARGQAPPAELEPLVDAVRAGRFAWAALPGE*
Ga0126381_1020892561	3300010376	Tropical Forest Soil	EGTGRALRLVRELVGFTARGQAPPAELEPLVDAVRAGSFARTALPGE*
Ga0126381_1042670072	3300010376	Tropical Forest Soil	RELAGFTARGQAPPPEIEPLVDAVRAGRFARTARPGQ*
Ga0126383_101849822	3300010398	Tropical Forest Soil	AGFTARGQAPPADLEPLVDAVRAGRFARTALPGQ*
Ga0134127_113581422	3300010399	Terrestrial Soil	LLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0124850_11516891	3300010863	Tropical Forest Soil	TGRALRLVRELADFTGHGQAPPADLEPLVDAVRAGRFARAALPDQ*
Ga0137393_103282012	3300011271	Vadose Zone Soil	ALRLLRELAAFTGDGQAPPSDLEPLVDAVRAGRFTDAAGLTP*
Ga0137365_112734741	3300012201	Vadose Zone Soil	LGEGTGRALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFTDAAGLTP*
Ga0137399_103605522	3300012203	Vadose Zone Soil	AAFTGHGQAPPADLELLVDAVRAGRFTDAAGLTP*
Ga0137380_103019151	3300012206	Vadose Zone Soil	RFPRLGEGTGRALRLLRELAAFTGHGQAPPSDLEPLVDAVRAGRFTDAAGLTP*
Ga0137371_103135411	3300012356	Vadose Zone Soil	LLRELAAFTGQGQAPPSDLELLVDAVRAGRFTDAAGLTP*
Ga0137385_110876241	3300012359	Vadose Zone Soil	TGRALRLLRELATFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPT*
Ga0157340_10025912	3300012473	Arabidopsis Rhizosphere	LRLLRELASFTGHGQAPPADLELLVDAVRAGRFTDAAGLTP*
Ga0157338_10030503	3300012515	Arabidopsis Rhizosphere	GEGTGRALHLVRELAAFTGHGQAPPADLEPLVDAVRAGRFTDAAG*
Ga0126375_106151591	3300012948	Tropical Forest Soil	CALRLVRELADFTGRGEAPPEELEPLVDAVRAGRFARAAMPGQ*
Ga0164299_100296405	3300012958	Soil	LLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG*
Ga0164299_108061482	3300012958	Soil	GTGRALRLLRELATFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPA*
Ga0132256_1008212512	3300015372	Arabidopsis Rhizosphere	LVRELADFTARGQAPPADLEPLVDAVRAGSFARTALPGQ*
Ga0182041_104866491	3300016294	Soil	TGRALGLVRELADFTARGQAPPAELEPLVDAVRAGRFARAALPGE
Ga0182032_116899382	3300016357	Soil	GRPLRLVRELTGFTARGQAPPADLEPLVDAVRAGRFAQAAAQ
Ga0182040_103354722	3300016387	Soil	VRELTDFTARGQAPPADLEPLVDAVRAGRFARAALPGE
Ga0187777_102016341	3300017974	Tropical Peatland	TGRALGLVRELVGFAAPGQAPPAELEPLVDAVRAGRFARECR
Ga0187777_108552101	3300017974	Tropical Peatland	GRALGLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGP
Ga0187767_103857512	3300017999	Tropical Peatland	AFRLVRELAGFTARGQAPPADLEPLVDAVRAGRFARTALSGQ
Ga0187766_106995631	3300018058	Tropical Peatland	RLVRELAGFTARGQAPPADLEPLVEAVRAGRFARAGLPGQ
Ga0193692_11030692	3300020000	Soil	EGTGRALRLLRELASFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPPQRL
Ga0206356_115669092	3300020070	Corn, Switchgrass And Miscanthus Rhizosphere	LGEGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFTDAAG
Ga0210401_101596351	3300020583	Soil	LRLVRELAGFTARGQVPPPDIEPLVDAVRAGGFARACGTADDQG
Ga0210408_110005661	3300021178	Soil	RPSRLGDGTGRALHLVRELADFTARGQAPPADLEPLVDAVRAGQFAQAAGRG
Ga0210394_104904162	3300021420	Soil	PRLGEGTGRALRLLRQLAAFTGHGQAPPADLEPLVDAVRAGRFTDAAGLTP
Ga0210394_115108632	3300021420	Soil	RALGLVRELAGFTARGQAPPAELEPLVDAVRAGRFARTALPGG
Ga0210410_102243281	3300021479	Soil	GEGTGRALRLLRELAAFTGHGQAPPADLEPLVDAVRAGRFTDAAGLTP
Ga0210410_115284251	3300021479	Soil	GRAHRLLRELAAFTGHGQAPPSDLEPLVDAVRAGRFTDAAGLTP
Ga0126371_124470351	3300021560	Tropical Forest Soil	VRELAGFTARGQAPPAELEPLVDAVRAGRFALTALPGR
Ga0247670_10202461	3300024283	Soil	RLGEGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG
Ga0207713_10830732	3300025735	Switchgrass Rhizosphere	EGTGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG
Ga0207699_102551633	3300025906	Corn, Switchgrass And Miscanthus Rhizosphere	RALRMLRELATFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPA
Ga0207663_102840202	3300025916	Corn, Switchgrass And Miscanthus Rhizosphere	LGEGTERARRLVRELAGFTAQGQAPPAELEPLVDAVREGRFARTALPG
Ga0207663_113543421	3300025916	Corn, Switchgrass And Miscanthus Rhizosphere	TGRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG
Ga0207687_111878022	3300025927	Miscanthus Rhizosphere	RLLRELASFTGHGQAPPADLELLVDAVRAGRFTDAAGLTP
Ga0207678_119413362	3300026067	Corn Rhizosphere	LADFTARGQAPPADLEPLVDAVRAGSFARAALPGQ
Ga0208637_10347742	3300027401	Soil	RALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFTDAAEPTPT
Ga0209179_10588502	3300027512	Vadose Zone Soil	GEGTGRALRLLRELAAFTGHGQAPPADLELLVDAVRAGRFTDAAGLTP
Ga0268266_103070773	3300028379	Switchgrass Rhizosphere	GEGTSRALRLLRELATFTGHGQAPPADLEPLVDAVRAGRFADAAG
Ga0307312_102185183	3300028828	Soil	ELASFTGHGQAPPADLELLVDAVRAGRFTDAAGLTPPQRL
Ga0170824_1008284961	3300031231	Forest Soil	DSRTRDLAGFTARGESPPAELEPLVDAVRAGRFARAAGPGA
Ga0170819_178470012	3300031469	Forest Soil	EGTGRAFRLVRDLAGFTARGESPPAELEPLVDAVRAGRFARAAGPGA
Ga0318516_106496592	3300031543	Soil	QALRLVRELAGFTARGQIPPADLEPLVDAVRAGRFARAASSG
Ga0318541_106144841	3300031545	Soil	GRAHHLVRELAGFTARGQAPPAELEPLVDAVRAGRFARTALPGA
Ga0318493_102636951	3300031723	Soil	TGRALGLVRELVGFTARGQAPPADIEPLVDAVRMGRFARTALPGQ
Ga0318493_102779092	3300031723	Soil	GLVRELAGFTDRGQAPPPDIEPLVDAVRAGRFARTALPGQ
Ga0318500_100726363	3300031724	Soil	RELTGFTARGQAPPADLEPLVDAVRAGRFAQAAAQ
Ga0318526_100713023	3300031769	Soil	LAAFTGHGQAPPADLEPLVDAVRAGRFAEAAGLAP
Ga0318547_101794542	3300031781	Soil	TGRALDLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGR
Ga0318548_101960062	3300031793	Soil	EGTGQALRLVRELADFTARGQVPPADLEPLVDAVRAGRFARAASPG
Ga0318550_102781443	3300031797	Soil	PRLGEGTGQALRLVRELADFTARGQVPPADLEPLVDAVRAGRFARAASPG
Ga0318523_100158321	3300031798	Soil	PRALHLVRELVGFTARGQAPPAELEPLVDAVRAKRPARTAVPGE
Ga0318523_102263442	3300031798	Soil	ALVLVRELAGFTARGQAPPADLEPLVDAVRAGRFAQAAGRD
Ga0318497_104334791	3300031805	Soil	ARLGEGTGRALHLVRELAGFTARGQAPPADLEPLVDAVRAGQFARAAGLG
Ga0318568_104400842	3300031819	Soil	LVGFTARGQAPPADIEPLVDAVRMGRFARTALPGQ
Ga0318564_103273881	3300031831	Soil	RELADFTARGQAPPAELEPLVDAVRAGRFARAALPGE
Ga0310917_100353043	3300031833	Soil	HLVRELVGFTARGQAPPAELEPLVDAVRAGRFARTAVPGE
Ga0318511_101587422	3300031845	Soil	DLVRELAGFTARGQAPPADLEPLVDAVRAGRFARAALPGR
Ga0318536_101436162	3300031893	Soil	EGTGRAHHLVRELAGFTARGQAPPAELEPLVDAVRAGRFARTALPGA
Ga0310913_106336521	3300031945	Soil	ALGLVRELADFTARGQAPPAELEPLVDAVRAGRFARAALPGE
Ga0318531_100290191	3300031981	Soil	TGRALGLVRELTDFTARGQAPPADLEPLVDAVRAGRFARAALPGE
Ga0307470_101483923	3300032174	Hardwood Forest Soil	RFPCLGEGTGRALRLLRELAAFTGHGQAPPADLEALVDAVRAGRFTDAAGLTP
Ga0307470_114387532	3300032174	Hardwood Forest Soil	LGEGTGRALGLVRELAGFTARGQAPPAELEPLVDAVRAGRFAWAALPGE
Ga0335080_122240921	3300032828	Soil	TGRAFRLARELAGFTARGQAPPADLEPLVDAVRAGRFARAALSGQ
Ga0335076_110771351	3300032955	Soil	ARRLVRELAGFTAQGQAPPAELEPLVDAVREGRFARTALPG
Ga0335073_101282771	3300033134	Soil	LRLVRELAGFTARGQAPPADLEPLVEAVRAGRFAASSGGCR
Ga0310914_111209541	3300033289	Soil	GTGRALGLVRELTDFTARGQAPPAELEPLVDAVRAGRFARTALPGE
Ga0310914_116629012	3300033289	Soil	LVRELAGFTARGQAPPPDIEPLVDAVRAGRFARAALSGQ
Ga0314864_0106986_568_684	3300033805	Peatland	VRELAGFTARGQAPPADLEPLVDAVRAGRFARTALSGQ
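
For downstream analysis, rows like the ones listed above can be split into their four fields and written out as FASTA records. The following is a minimal sketch, not a tool provided by the database. It rests on assumptions inferred from the listing: the Sample Taxon ID is always exactly 10 digits, habitat names end in a lowercase letter, and sequences are uppercase with an optional trailing `*`, which is stripped here as a terminator marker. It accepts fields that are either whitespace-separated or run together. The names `FamilyMember`, `parse_row` and `to_fasta` are hypothetical.

```python
import re
from typing import NamedTuple


class FamilyMember(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumed row layout: protein ID, 10-digit taxon ID, habitat, sequence.
ROW = re.compile(
    r"(Ga\d+_[\d_]*\d)\s*"          # protein ID, e.g. Ga0062389_1045386782
    r"(\d{10})\s*"                  # Sample Taxon ID (assumed to be 10 digits)
    r"([A-Z][A-Za-z, ]*[a-z])\s*"   # habitat, assumed to end in a lowercase letter
    r"([A-Z]+\*?)"                  # uppercase sequence with optional trailing '*'
)


def parse_row(line: str) -> FamilyMember:
    """Split one Family Sequences row into its four fields."""
    match = ROW.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    return FamilyMember(*match.groups())


def to_fasta(member: FamilyMember) -> str:
    """Format a member as a FASTA record, dropping any trailing '*'."""
    header = f">{member.protein_id} taxon={member.taxon_id} habitat={member.habitat}"
    return f"{header}\n{member.sequence.rstrip('*')}\n"


rows = [
    "Ga0062389_10453867823300004092Bog Forest SoilGRGTGRALLLVRELAGFTGRGQAPPADLEPLVDAVRAGQFAQAAGRG*",
    "Ga0314864_0106986_568_684\t3300033805\tPeatland\tVRELAGFTARGQAPPADLEPLVDAVRAGRFARTALSGQ",
]
for row in rows:
    print(to_fasta(parse_row(row)), end="")
```

The greedy protein-ID group relies on regex backtracking to leave exactly 10 digits for the taxon ID, so a row whose taxon ID has a different length would need a different rule.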




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice