NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F105283

Metagenome Family F105283

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105283
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 42 residues
Representative Sequence FLALVLAAGIAWFPRGNGSIADAEREARALASEETARESPLS
Number of Associated Samples 80
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 91.00 %
% of genes from short scaffolds (< 2000 bps) 88.00 %
Associated GOLD sequencing projects 78
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (71.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil
(14.000 % of family members)
Environment Ontology (ENVO) Unclassified
(26.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.
1GPIPI_02888320
2GPIPI_02229470
3E41_10335150
4FA3_10042350
5JGI10216J12902_1091892951
6JGI24036J26619_100371993
7JGIcombinedJ43975_100480411
8Ga0062590_1013545312
9Ga0066398_102067791
10Ga0062595_1018339182
11Ga0062595_1022360071
12Ga0066672_106212871
13Ga0066685_106004752
14Ga0066676_104580392
15Ga0065705_100617701
16Ga0065705_104556551
17Ga0065707_105022701
18Ga0066388_1000088089
19Ga0066388_1000568534
20Ga0066388_1082326351
21Ga0070711_1009214872
22Ga0070705_1016153041
23Ga0070662_1006197031
24Ga0070698_10000240625
25Ga0070698_1000118841
26Ga0070741_110285361
27Ga0066697_107435431
28Ga0066905_1016815383
29Ga0066905_1017988991
30Ga0066903_1022652521
31Ga0066903_1030719782
32Ga0066903_1073943781
33Ga0081539_102049121
34Ga0080027_103969631
35Ga0068871_1009191461
36Ga0075430_1006302271
37Ga0075429_1011382121
38Ga0075424_1001727725
39Ga0079219_100581663
40Ga0079219_120633562
41Ga0075419_102274082
42Ga0075419_108284942
43Ga0105095_102743893
44Ga0099827_118812872
45Ga0105245_123668691
46Ga0066709_1022908022
47Ga0114129_100810441
48Ga0114129_131000462
49Ga0111538_107181611
50Ga0111538_138445892
51Ga0126384_100842482
52Ga0126384_115505282
53Ga0126384_123970401
54Ga0126382_118505931
55Ga0134063_103461142
56Ga0134062_104699371
57Ga0126378_112043211
58Ga0126378_123902242
59Ga0126378_128296892
60Ga0126379_101729022
61Ga0126379_137259912
62Ga0105239_109518023
63Ga0126381_1047615711
64Ga0126383_101597034
65Ga0134122_130233872
66Ga0137382_101891553
67Ga0137363_116455291
68Ga0137399_114544032
69Ga0137380_103262643
70Ga0137376_117133932
71Ga0137370_105909381
72Ga0137372_106442591
73Ga0137413_104494982
74Ga0126375_115441381
75Ga0164300_102438861
76Ga0164299_109601501
77Ga0126369_110359722
78Ga0126369_129303481
79Ga0132255_1021126063
80Ga0182040_110794801
81Ga0184632_100095213
82Ga0193726_12570141
83Ga0193724_10772572
84Ga0193709_10833422
85Ga0222623_101308262
86Ga0207693_103880842
87Ga0207706_109272871
88Ga0209799_11675352
89Ga0307307_102828191
90Ga0307312_104984841
91Ga0170823_156680204
92Ga0265328_100450471
93Ga0170820_136295011
94Ga0310915_103704843
95Ga0318560_106884332
96Ga0318546_107975761
97Ga0307473_106514681
98Ga0307479_113936711
99Ga0318519_105770693
100Ga0318519_106458451
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 37.14%    β-sheet: 0.00%    Coil/Unstructured: 62.86%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540FLALVLAAGIAWFPRGNGSIADAEREARALASEETARESPLSSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
71.0%29.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Grass Soil
Surface Soil
Switchgrass Rhizosphere
Soil
Agricultural Soil
Soil
Grasslands Soil
Grass Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Forest Soil
Soil
Hardwood Forest Soil
Prmafrost Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Arabidopsis Rhizosphere
Tabebuia Heterophylla Rhizosphere
Miscanthus Rhizosphere
Populus Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Rhizosphere
Corn Rhizosphere
7.0%9.0%14.0%3.0%3.0%7.0%5.0%10.0%5.0%9.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_028883202088090014SoilIVFLALVLAAGIAWFPRGXXXXXXXXREARALASEETAREGPLS
GPIPI_022294702088090014SoilLAVVLAAGIAWFPRGSGSIADAXXXXXXLASEETARERPLS
E41_103351502170459005Grass SoilLALVLAAGFAWFPRGKDAIADAEREEQALESEETAREGPL
FA3_100423502170459023Grass SoilVFLALVLAAGIVWFPRGSGSIADAEREARALASEETTRESPLS
JGI10216J12902_10918929513300000956SoilFLALVLAAGIGWFPRGSGSIADAEHEAQALASEETARESPLS*
JGI24036J26619_1003719933300002128Corn, Switchgrass And Miscanthus RhizosphereLVLAAGIAWFPRSKGSIADAEREARALASEETARESPLS*
JGIcombinedJ43975_1004804113300002899SoilLVLAAGIAWFPRGSGSIADAEREARALASKETARESPLS*
Ga0062590_10135453123300004157SoilVFLALVLAAGIAWFPRGNGFIADAKREARALASEETARESPRS*
Ga0066398_1020677913300004268Tropical Forest SoilLLALVLAAGFAWFPRAKGSIADAERGAKALTSEETAREGSSL*
Ga0062595_10183391823300004479SoilLATAIAFLAVVLAAGIVWFPRGSGSIAGAEREARALTSEETARENPLS*
Ga0062595_10223600713300004479SoilMVFLALVLAAGIAWFPRGKGSMADAEHEARALASDETATESPLS*
Ga0066672_1062128713300005167SoilRFSLATAIVFLALVLAAGIAWFPRGSGSIAGAEREAQALASEETARESPLS*
Ga0066685_1060047523300005180SoilFLAVVLAAGIAWFPRGSGSIADAQREARALASEETARESPLS*
Ga0066676_1045803923300005186SoilFLAVVLAAGIAWFPRGSGSIADAQREARPLASEETARESPLS*
Ga0065705_1006177013300005294Switchgrass RhizosphereAMVFLALVLAAGFAWFPRGKGAIADAEREAQAIASEETARESPLS*
Ga0065705_1045565513300005294Switchgrass RhizosphereLALVLAAGFAWFPRGKGVIADAEREAQALESEEAAREGPL*
Ga0065707_1050227013300005295Switchgrass RhizosphereLALVLAAGIAWFPRGSGSIADAEREARALVSEETARESPIS*
Ga0066388_10000880893300005332Tropical Forest SoilMAMVFLALVLAAGFAWFPRGKGAIADAEREAQALESEESTREIPLS*
Ga0066388_10005685343300005332Tropical Forest SoilAMVLLALVLAAGFAWFPRGKGAIADAEREASALESEETAREGPL*
Ga0066388_10823263513300005332Tropical Forest SoilLALVLAAGFAWFPRGKGAIADAEREAQALESEETAREGPL*
Ga0070711_10092148723300005439Corn, Switchgrass And Miscanthus RhizosphereTAIVFLALVLAAGIVWFPRGSGSIAGAEREARALASEETTRESPLS*
Ga0070705_10161530413300005440Corn, Switchgrass And Miscanthus RhizosphereFSLATAMVFLALVLAAGFAWFPRGKGAIADAEREAQALESEETAREGPL*
Ga0070662_10061970313300005457Corn RhizosphereFLALVLAAGIVWFPLGSGSIADAEHEARALASEETTRESPLS*
Ga0070698_100002406253300005471Corn, Switchgrass And Miscanthus RhizosphereALVLAAGFAWFPRAKGSNADAERGAKALASEETARESHLSYD*
Ga0070698_10001188413300005471Corn, Switchgrass And Miscanthus RhizosphereALVLAAGFAWFPRAKGSNADAERGAKALASEETARESHL*
Ga0070741_1102853613300005529Surface SoilLAAGIAWFPRGRGSIAGAEREARTLASEEAASENPAG*
Ga0066697_1074354313300005540SoilAAGIVWFPRGSGSIADAEHEARTLASEETARESPLS*
Ga0066905_10168153833300005713Tropical Forest SoilAAGFAWFPRGKGAIADAERGAKALESEETARESTL*
Ga0066905_10179889913300005713Tropical Forest SoilTAILILALVLAAGYAWFPRGKGGLADARREARAVSSEEAIREPG*
Ga0066903_10226525213300005764Tropical Forest SoilFLALVLAAGIAWFPRGRGSIADAQREARALTSEETARESPLS*
Ga0066903_10307197823300005764Tropical Forest SoilVVLAAGILWFPRGRGSIADAEREARALTSEETARESPLS*
Ga0066903_10739437813300005764Tropical Forest SoilAAGIVWFPRGKGSIVDAEREARALASEETVRESPLS*
Ga0081539_1020491213300005985Tabebuia Heterophylla RhizosphereAAGIAWFPRGSGSIANAEREARALASEETARESPLS*
Ga0080027_1039696313300005993Prmafrost SoilVFLALVLAAGFAWFPRGKGSVADAEREARAIASEENAGLS*
Ga0068871_10091914613300006358Miscanthus RhizosphereSLATAIVFLAIVLAAGIAWFPRGEGSIAGAEREAQALASEETARESPLS*
Ga0075430_10063022713300006846Populus RhizosphereVKIEDRPTAAGFAWFPRGKGADAEREAEALESEETA
Ga0075429_10113821213300006880Populus RhizosphereATAMVLLALVLAAGFVWFPRGKHSIADAEREARAVASEEAVRESLLRRE*
Ga0075424_10017277253300006904Populus RhizosphereFLALVLAAGIAWFPPGKSSIADAEREARALASDETARESPPS*
Ga0079219_1005816633300006954Agricultural SoilFLVLVLAAGIAWFPRGNGSIADAEREARALKSEETARNSSLS*
Ga0079219_1206335623300006954Agricultural SoilVLAAGIAWFPRGKGSMADAERQARALASEETARESPLS*
Ga0075419_1022740823300006969Populus RhizosphereVKIEDRPTAAGFAWFPRGKGADAEREAEALESEETAREGPL*
Ga0075419_1082849423300006969Populus RhizosphereVKIEDRPTAAGFAWFPRGKGAIADAEREEQALESEETAREGPL*
Ga0105095_1027438933300009053Freshwater SedimentLVLALVLAAGVAWFPKGRGGIADAAREASALESEEARRGSPEGAGIG*
Ga0099827_1188128723300009090Vadose Zone SoilFLALVLAAGIAWFPRGNGSIADAEREARALASEETARESPLS*
Ga0105245_1236686913300009098Miscanthus RhizosphereVFLALVLAAGITWFPRGTGSIADAEREARALASEETARESPLS*
Ga0066709_10229080223300009137Grasslands SoilAAGIVWFPRGSGSIAGAEREARTLASEETARESPLS*
Ga0114129_1008104413300009147Populus RhizosphereLVLAAGIGWFPRGKRSIADAEREARALASEESARESPPS*
Ga0114129_1310004623300009147Populus RhizosphereFLALVLAAGFAWFPRGKGAIADAEREAQALESEETAREGPL*
Ga0111538_1071816113300009156Populus RhizosphereFLTLVLAAGFAWFPRGKGAIADAEREARALESEETAREGPL*
Ga0111538_1384458923300009156Populus RhizosphereVFLVVVLAAGIAWFPRGSGSIADAEREARALASEETARESPLS*
Ga0126384_1008424823300010046Tropical Forest SoilMVLLALVLAAGFAWFPRGKGAIADAEREARALESEETAREGPL*
Ga0126384_1155052823300010046Tropical Forest SoilVLLTLVLAAGFAWFPRGKGSIVEAEREAQALTSEEAARESNLS*
Ga0126384_1239704013300010046Tropical Forest SoilLATAMVLLALVLASGFAWFPRGRGAITDAEREAQALESEEAAREGPL*
Ga0126382_1185059313300010047Tropical Forest SoilVLLALVLAAGFAWFPRGKGAITDAEREAQALESEEAAREGPL*
Ga0134063_1034611423300010335Grasslands SoilVLAAGIAWFPRGSGSIAGAEREAQALTSEETARESPLS*
Ga0134062_1046993713300010337Grasslands SoilAAGIAWFPRGSGSIAGAEREAQALTSEETARESPLS*
Ga0126378_1120432113300010361Tropical Forest SoilAIVLAGGIVWFPRGSGSIAGAEREARALASEETARESPPQNR*
Ga0126378_1239022423300010361Tropical Forest SoilAGFVWFPRGSGSIADAEREARALASEESARESPLS*
Ga0126378_1282968923300010361Tropical Forest SoilVFLAVVLAVGIAWFPRGSGSIADAKREARALASEETARESPLS*
Ga0126379_1017290223300010366Tropical Forest SoilMVFLALVLAGGIAWFPRGKGSVADAEREAHALAFEETTRETPLS*
Ga0126379_1372599123300010366Tropical Forest SoilAIAFLTVVLAAGILWFPRGRGSIAGAEREARSLASEETARESPLS*
Ga0105239_1095180233300010375Corn RhizosphereTAGFTWFPRGKGSIVANAEREAQALASEETARESPLS*
Ga0126381_10476157113300010376Tropical Forest SoilAGILWFPRGRGSIADAEREARALTSEETARESPLS*
Ga0126383_1015970343300010398Tropical Forest SoilLVLAAGFAWFPRGKGAIADAEREARALESEETAREGPL*
Ga0134122_1302338723300010400Terrestrial SoilIAFLAVVLAAGIAWFPRGSGSIAGAEREARALASEETARESPLS*
Ga0137382_1018915533300012200Vadose Zone SoilFSLATAIAFLAVVLAAGIVWFPRGSGSIADAQREARVLASEETARESPLS*
Ga0137363_1164552913300012202Vadose Zone SoilMVFLALVLAAGFAWFPRGKGAIADAEREEQALASEETAREGPL*
Ga0137399_1145440323300012203Vadose Zone SoilLAVVLAAGIAWFPRGSGSIANAEREAQALASEETARESPLS*
Ga0137380_1032626433300012206Vadose Zone SoilATAIVFLGLVLAAGIVWFPRGKGSIADAEHEAKALASDETARESPLS*
Ga0137376_1171339323300012208Vadose Zone SoilFLAVVLAAGIAWFPRGSGSIANAEREARALASEETAPESPLS*
Ga0137370_1059093813300012285Vadose Zone SoilVVLAAGIAWFPRGSGSITNAKREARALASEETAPESPLS*
Ga0137372_1064425913300012350Vadose Zone SoilTAIVFLALVLAAGIAWFPRGNGSIADAEREARALASEETARESPLS*
Ga0137413_1044949823300012924Vadose Zone SoilVTAIAFLAVVLAAGIAWFPRGSGSIADAQREARALASEETARESPLS*
Ga0126375_1154413813300012948Tropical Forest SoilIVFLALVLAAGFAWFPRGKGSIADAEREARELASEETARESPLS*
Ga0164300_1024388613300012951SoilLALILAAGIAWFPRGKGSMADAEHEARALASEETARETPLS*
Ga0164299_1096015013300012958SoilLVLAAGIAWFPRGNGSLADAEREARALGSEETARESPRS*
Ga0126369_1103597223300012971Tropical Forest SoilVVLAAGIVWFPRGSGSIAGAEREARALASEETAHENPLS*
Ga0126369_1293034813300012971Tropical Forest SoilFSLATAIVFLGLVLAAGIAWFPRGKGSMADAEHEARALASEETARETPLS*
Ga0132255_10211260633300015374Arabidopsis RhizosphereMVFLTLVLAAGFVWFPRGKGAIADAEREEQALESEETAREGPL*
Ga0182040_1107948013300016387SoilLATAIVFLGLVLAAGILWFPRGGGSIIGAEREARALASEETARESPLS
Ga0184632_1000952133300018075Groundwater SedimentFSLATAMVFLALVLAAGFAWFPRGKGAIADAEREARALASETAREGPL
Ga0193726_125701413300020021SoilVLAAGIAWFPRGSGSIADAEREARTLASEETARESPLS
Ga0193724_107725723300020062SoilATAIVFLTIVLAAGMVWFPRGSGSIADAEREARALASEETARESPLS
Ga0193709_108334223300021411SoilFLAVVLAAGIAWFPRGSGSIADAQREARALASEETARESPLS
Ga0222623_1013082623300022694Groundwater SedimentFLAVVLAAGIAWFPRGSGSIANAEREARALASEETARESPLS
Ga0207693_1038808423300025915Corn, Switchgrass And Miscanthus RhizosphereMVFLALVLAAGFAWFPRGKGAIADAEREAQALESEETAREGPL
Ga0207706_1092728713300025933Corn RhizosphereLAAGIVWFPLGSGSIADAEHEARALASEETTRESPLS
Ga0209799_116753523300027654Tropical Forest SoilLLALVLAAGFAWFPRAKGSIADAERGAKALTSEETAREGSSL
Ga0307307_1028281913300028718SoilVLAAGMVWFPRGSGSIADAEREARTLESEETASESPLS
Ga0307312_1049848413300028828SoilLATAIVFLAVVLAAGIVWFPRGSGSIADAEREARTLESEETASESPLS
Ga0170823_1566802043300031128Forest SoilFLAVVLAAGIAWFPRGSGSIADAEREAQTLASEETARESPLS
Ga0265328_1004504713300031239RhizosphereVVLLCLVLAAGLKWFPRGKGGIADATREQKSLESEESSRAGPESAGLG
Ga0170820_1362950113300031446Forest SoilAIGFLALVLAAGIAWFPRGSGSIADAESEARALASEETARESPLS
Ga0310915_1037048433300031573SoilLVLAAGIAWFPRGKSSIADAEREARALASDETARESPPS
Ga0318560_1068843323300031682SoilIVFLAIVLAAGIAWFPRGSGSITDAKCEARALASEETARESPLS
Ga0318546_1079757613300031771SoilLTLVLAAGITWFPRGKGSMADAEHEARALTSDETARENPLS
Ga0307473_1065146813300031820Hardwood Forest SoilATAILFLTVVLVAGIAWFPRGSGSIADAESEARALASEETARESPLS
Ga0307479_1139367113300031962Hardwood Forest SoilLAVVLAAGIAWFPRGSGSIADAEREARALVSEETARESPLS
Ga0318519_1057706933300033290SoilAAGIAWFPRGSGSITDAKCEARALASEETARESPLS
Ga0318519_1064584513300033290SoilAIVFLAVVLAAGIAWFPRGSGSIAGAEREAQALASEETARESPLS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.