NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F089245

Metagenome Family F089245

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F089245
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 42 residues
Representative Sequence FSAGAVTFQLRAWTDRHEDWAQLRSDLSVAVNEALAREKIAIA
Number of Associated Samples 83
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.92 %
% of genes near scaffold ends (potentially truncated) 99.08 %
% of genes from short scaffolds (< 2000 bps) 90.83 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.44

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (59.633 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(27.523 % of family members)
Environment Ontology (ENVO) Unclassified
(38.532 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.055 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52
1JGI1027J12803_1038388941
2JGI12635J15846_105071663
3JGI12635J15846_106086712
4JGIcombinedJ26739_1015046951
5JGI25614J43888_100032581
6JGI26341J46601_101475562
7Ga0062385_110444601
8Ga0066388_1085944482
9Ga0070710_107782131
10Ga0070706_1005193412
11Ga0070699_1016772303
12Ga0070697_1010231901
13Ga0066903_1005376734
14Ga0066903_1034092812
15Ga0070717_119407851
16Ga0070712_1013402721
17Ga0070765_1018348991
18Ga0126384_124265322
19Ga0126382_116403061
20Ga0126382_120429951
21Ga0131853_113792872
22Ga0074045_103521791
23Ga0074044_107534481
24Ga0126378_114938162
25Ga0126378_115939451
26Ga0126381_1002074914
27Ga0126381_1004720991
28Ga0126381_1009877321
29Ga0126383_103908121
30Ga0126383_124195171
31Ga0153922_10983561
32Ga0137382_110459952
33Ga0137399_101285792
34Ga0137399_110294953
35Ga0137394_101451391
36Ga0164300_102284103
37Ga0126369_119677441
38Ga0181523_107325302
39Ga0182041_103769482
40Ga0182041_114890111
41Ga0182033_108552853
42Ga0182033_113764672
43Ga0182033_116492731
44Ga0182035_107200133
45Ga0182032_107307401
46Ga0182032_107432201
47Ga0182034_103251271
48Ga0182034_106260631
49Ga0182040_101046331
50Ga0182037_100645061
51Ga0182038_105284271
52Ga0182038_108145861
53Ga0182038_116293532
54Ga0187782_105404951
55Ga0184626_102005701
56Ga0190271_137649631
57Ga0179594_102932231
58Ga0210407_113197542
59Ga0210399_115659342
60Ga0210406_109467161
61Ga0210408_111581511
62Ga0210396_100187969
63Ga0213872_103215332
64Ga0213876_105965201
65Ga0210385_111044431
66Ga0210397_104014601
67Ga0210384_102791401
68Ga0213878_101577771
69Ga0213878_103252102
70Ga0210410_110440791
71Ga0126371_103048561
72Ga0207699_113662002
73Ga0257161_10935071
74Ga0209626_11767021
75Ga0209039_101950202
76Ga0209039_102068661
77Ga0209006_111864582
78Ga0302225_103789951
79Ga0318573_106238331
80Ga0310915_100162421
81Ga0318542_107718561
82Ga0307476_106910661
83Ga0306918_101521061
84Ga0306918_113224832
85Ga0307475_110669991
86Ga0318547_104413352
87Ga0310917_103719671
88Ga0306919_102828031
89Ga0306925_104286781
90Ga0306925_109177401
91Ga0318520_109776961
92Ga0306923_123113152
93Ga0306921_102457601
94Ga0306921_108053742
95Ga0310912_105158782
96Ga0310912_114729281
97Ga0310910_106975321
98Ga0310909_106758031
99Ga0306926_102661722
100Ga0306926_119613493
101Ga0307479_103905971
102Ga0307479_115457051
103Ga0318533_105658202
104Ga0306924_102301982
105Ga0306924_104607541
106Ga0306920_1012896211
107Ga0306920_1013419243
108Ga0306920_1013560902
109Ga0310914_100308157
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 49.30%    β-sheet: 0.00%    Coil/Unstructured: 50.70%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540FSAGAVTFQLRAWTDRHEDWAQLRSDLSVAVNEALAREKIAIASequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.44
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
59.6%40.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Bog Forest Soil
Bog
Groundwater Sediment
Soil
Vadose Zone Soil
Tropical Forest Soil
Bulk Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Tropical Peatland
Bog Forest Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Palsa
Termite Gut
Plant Roots
Rhizosphere
Attine Ant Fungus Gardens
3.7%4.6%11.0%20.2%27.5%3.7%4.6%6.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J12803_10383889413300000955SoilAGAITFQLRVWTDQYHQWAQLRSDLSVAVNDALAREKIAIA*
JGI12635J15846_1050716633300001593Forest SoilRLATRAWTDRYQDWSQIRSDLSVAVNDALLRDKIAIA*
JGI12635J15846_1060867123300001593Forest SoilPSPEVYVLSFTGGAVTLQLRAWTDRYQDWAQLRSDLSKAVSGVLAREKIGLT*
JGIcombinedJ26739_10150469513300002245Forest SoilVNFSSGAVTFQLRAWTDRNEEWAQLRSDLSVAVNDALAREKIAIA*
JGI25614J43888_1000325813300002906Grasslands SoilFTAGAVTFQLRAWTDRYQGWAQLRSDLGVAVNDALAREKIAIA*
JGI26341J46601_1014755623300003219Bog Forest SoilVTNFSAGAVTYQLRAWTDRHEDWAQLRSDLSVAVNDALAREKIAIA*
Ga0062385_1104446013300004080Bog Forest SoilFSAGAVTFQLRAWTDRHEDWAQLRSDLSVAVNEALAREKIAIA*
Ga0066388_10859444823300005332Tropical Forest SoilVYVTNFSAGAVTFQLRAWIDRYEDLPQLRSDLSLAVNNALAREKIAIA*
Ga0070710_1077821313300005437Corn, Switchgrass And Miscanthus RhizosphereAITYQLRAWTDRYQDWAQVRSDLAVAVNEALFKEKIAIA*
Ga0070706_10051934123300005467Corn, Switchgrass And Miscanthus RhizospherePQVYVVNFSAGAITFKLRFWTDLDRDWSQLQSDISIAVNDALIREKIAIA*
Ga0070699_10167723033300005518Corn, Switchgrass And Miscanthus RhizosphereFSAGAVTFQLRAWTDRHDDWAQLHSDLSVAVNDVLAREKIAIA*
Ga0070697_10102319013300005536Corn, Switchgrass And Miscanthus RhizosphereTNLSAGAVTFKIRAWTDRHEDWAQLRSDLSLAANEALAREKIAIA*
Ga0066903_10053767343300005764Tropical Forest SoilQLRAWTDRQEDWAQLRSDLSLAVKDALARENIAMA*
Ga0066903_10340928123300005764Tropical Forest SoilVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIT*
Ga0070717_1194078513300006028Corn, Switchgrass And Miscanthus RhizosphereVLPTVVNVTAGAVTFQLRAWTDRYHEWVQLRGDLSVAVNDALAREKIAIA*
Ga0070712_10134027213300006175Corn, Switchgrass And Miscanthus RhizosphereTFQLRAWTDRHEDWAQLRSDLSLAVNEALAREKIAIA*
Ga0070765_10183489913300006176SoilVNFSAGAVTFQLRAWTDRYQDWARVRSDLAVAVNSALVEERIAIA*
Ga0126384_1242653223300010046Tropical Forest SoilAGAVNFQLRAWIDRYEDLAHLRSDLSLALNNALAREKIAIA*
Ga0126382_1164030613300010047Tropical Forest SoilLRAWIDRYEDMAELRSDLSLAVNKALAHEKIAIT*
Ga0126382_1204299513300010047Tropical Forest SoilTFQLRAWIDRYEDSAQLRSDLSLAVNNALAREKIAIA*
Ga0131853_1137928723300010162Termite GutVYAVSFAAGATTFQLRAWTDRYQDWAQLRSDLALAVNDALGREKIALA*
Ga0074045_1035217913300010341Bog Forest SoilFQLRAWTDRHEDWAQLRSDLSVAVNEALAREKIAIV*
Ga0074044_1075344813300010343Bog Forest SoilTYQLRAWTDRHEDWAQLRSDLSVAVNDALAREKIAIA*
Ga0126378_1149381623300010361Tropical Forest SoilTFQLRVWIDRYEDLAQLRSDLSLAAKNALARENIAIT*
Ga0126378_1159394513300010361Tropical Forest SoilQLRAWIDRYEDLAQLRSDLSLEIKDALARENIAIT*
Ga0126381_10020749143300010376Tropical Forest SoilTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA*
Ga0126381_10047209913300010376Tropical Forest SoilAGALTFQLRVWIDRHEDLAQLRSDLSLAAKNALARENIAIA*
Ga0126381_10098773213300010376Tropical Forest SoilLTFQIRVWIDRYEDLAQLRSDLSLAAKNALARENIAIT*
Ga0126383_1039081213300010398Tropical Forest SoilLRAWIDRYEDLAQLRSDLSLAVKNALARESIAIA*
Ga0126383_1241951713300010398Tropical Forest SoilNFSAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIT*
Ga0153922_109835613300012181Attine Ant Fungus GardensTFQLRVWTDRSREWAQLRSDLAVAINSALAREKIAIA*
Ga0137382_1104599523300012200Vadose Zone SoilFQLRVWTDRHEDWAQLRSDLSVAVNDALAREKIAIA*
Ga0137399_1012857923300012203Vadose Zone SoilALTSQLRAGTDRNEEGAQPRNGLSVAVNDALAREKIAIA*
Ga0137399_1102949533300012203Vadose Zone SoilGAITFQLRVWTDQYHEWAQLRSDLSVAVNDALAREKIAIA*
Ga0137394_1014513913300012922Vadose Zone SoilTAGAITFQLRVWTDQYHEWAQLRSDLSVAVNDALAREKIAIA*
Ga0164300_1022841033300012951SoilADAVTFQLRAWTDRHQDWAQVRSDLSIAINDTLARENIAIA*
Ga0126369_1196774413300012971Tropical Forest SoilVSAGALTFQLRAWIDRYEDLAQLRSDLSVAVKNALARENIAIA*
Ga0181523_1073253023300014165BogQVYVLNFSAGAVTFQLRAWTDRHEDWAQLRSDLSVAVNNALAREKIAIA*
Ga0182041_1037694823300016294SoilALTFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0182041_1148901113300016294SoilFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVKNAFARENIAIA
Ga0182033_1085528533300016319SoilRHRAWIDRYEDLAQLRSDLSLAVKNALARESIAIA
Ga0182033_1137646723300016319SoilTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVNNALAREKIAIA
Ga0182033_1164927313300016319SoilVTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVNTALAREKIAIA
Ga0182035_1072001333300016341SoilQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIT
Ga0182032_1073074013300016357SoilTARAVNYPLRSWTARQEDWAQLRSDLSLAVKDALAREKIAIT
Ga0182032_1074322013300016357SoilVYVTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVNNALAREKIAIT
Ga0182034_1032512713300016371SoilTFQLRAWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0182034_1062606313300016371SoilNFSAGALTFQLRAWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0182040_1010463313300016387SoilYVTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVNNALAREKIAIT
Ga0182037_1006450613300016404SoilTFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0182038_1052842713300016445SoilQLRAWIDRYEDLAQLRSDLSLAVNNALAREKIAIA
Ga0182038_1081458613300016445SoilAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIT
Ga0182038_1162935323300016445SoilFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0187782_1054049513300017975Tropical PeatlandTNFSAGAVTFQLRAWTDRQEDWAQLRSDLSLAIQDALARAKIAIA
Ga0184626_1020057013300018053Groundwater SedimentAVTFQLRVWTDRHEAWAQLRSDLSLAINETLSREKIAIA
Ga0190271_1376496313300018481SoilAPQVNVVTLTASAVTLQLRVWTDRHEAWAQLRSDLSIAINDALTREKIAIV
Ga0179594_1029322313300020170Vadose Zone SoilTAGAITFQLRVWTDQYHEWAQLRSDLSVAVNDALAREKIAIA
Ga0210407_1131975423300020579SoilQVYVVNISSGAVTFQLRAWTDRNEEWAQLRSDLSVAVNDALAREKIAIA
Ga0210399_1156593423300020581SoilAGAVTFQLRTWTDRHEDWAQLRSDLSVAVNEALAREKIAIA
Ga0210406_1094671613300021168SoilSVTFQLRAWTDRHEDWTQLRSDLAVAANEALAREKIAIA
Ga0210408_1115815113300021178SoilKIRARTDRHEDWAQLRSDLSLAANEALAREKIAIA
Ga0210396_1001879693300021180SoilLPTVVNVTAGAVTFQLRAWTDRYHEWVQLRGDLSVAVNDALAREKIAIA
Ga0213872_1032153323300021361RhizosphereAGALTFQIRVWIDRYEDLAQLRSDLSLAAKNALARENIAIT
Ga0213876_1059652013300021384Plant RootsSFSAGVLSFQLRAWTDGDEDWAQLRSDLAVAVKDALAREQIALA
Ga0210385_1110444313300021402SoilAGAVTFQLRVWTDRSRQWIQVRSDLALAVNEALARERIAIV
Ga0210397_1040146013300021403SoilVTFQLRAWTDRHEDWAQLRSDLSVAVNNALAREKIAIA
Ga0210384_1027914013300021432SoilQEEDGVRDTAGAVTFQRRPWTDRYQDWARVRSDLAVAVNSALVEQRIAIA
Ga0213878_1015777713300021444Bulk SoilVVNFSAGAVTFQLRAWIDRYEDLAQLRSDLSVAVNNALAREKIVIA
Ga0213878_1032521023300021444Bulk SoilAGAVTFQLRAWTDRYQDWARVRSDLAVAVNSALAQERIAIA
Ga0210410_1104407913300021479SoilTNFSAGAVTFQLRAWTDRHEDWAQLRSDLSVAVNNALAREKIAIA
Ga0126371_1030485613300021560Tropical Forest SoilQVYVINFNAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALVREKIVIA
Ga0207699_1136620023300025906Corn, Switchgrass And Miscanthus RhizosphereTFQLRAWTDRYHEWVQLRGDLSVAVNDALAREKIAIA
Ga0257161_109350713300026508SoilVNFTAGAITFQLRVWTDQYHEWAQLRSDLSIAVNDALAREKIAIA
Ga0209626_117670213300027684Forest SoilTGGAVTLQLRAWTDRYQDWAQLRSDLSKAVSGVLAREKIALT
Ga0209039_1019502023300027825Bog Forest SoilFSAGAVTYQLRAWTDRHEDWAQLRSDLSVAVNDALAREKIAIA
Ga0209039_1020686613300027825Bog Forest SoilFSAGAVTYQLRAWTDRHEDWAQLRSDLSVAVNEALAREKIAIA
Ga0209006_1118645823300027908Forest SoilGAVTLQLRAWTDRYQDWAQLRSDLSKAVSGVLAREKIVLT
Ga0302225_1037899513300028780PalsaAAGAVAFQVRAWTDRYQDWAQVRSDLSVRLGEALLREKITIV
Ga0318573_1062383313300031564SoilVYAVSFAAGATTFQLRAWTDRYQDWAQLRSDLALAVNDALAREKIALA
Ga0310915_1001624213300031573SoilHVYVTNFSAGALTFQLRAWIERYEDLAQLRSDLSLALNNALACEKIAIA
Ga0318542_1077185613300031668SoilFQLRAWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0307476_1069106613300031715Hardwood Forest SoilYVTSFTAGAIVFQLRAWTDRYQDWAQVRSDLALGLSDALAREKIGIA
Ga0306918_1015210613300031744SoilAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIA
Ga0306918_1132248323300031744SoilLTFQLRAWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0307475_1106699913300031754Hardwood Forest SoilVYVTNLSAGAVTFKIRAWTDRHEDWAQLRSDLSLAANEALAREKIAIA
Ga0318547_1044133523300031781SoilFTAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALARETIAIT
Ga0310917_1037196713300031833SoilTNFSAGAVTFQLRAWIDRYEDLAQLRSDLSLAVNNALAREKIAIT
Ga0306919_1028280313300031879SoilGAVSFQLRAWIDRYEDLAQLRSDLSLAVKNTLARDQIAIA
Ga0306925_1042867813300031890SoilAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAMV
Ga0306925_1091774013300031890SoilAGALTFQLRAWIDRYEDLAQLRSDLSLAVKNALARENIAIA
Ga0318520_1097769613300031897SoilVYVTSFTAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALARETIAIT
Ga0306923_1231131523300031910SoilFQLRAWIDRYEDLAQLRSDLSLEVKGALARENIAIA
Ga0306921_1024576013300031912SoilGALTFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0306921_1080537423300031912SoilLTFQLRVWIDRYEDLAQLRSDLSLAAKNALARENIAIA
Ga0310912_1051587823300031941SoilFQLRAWIDRYEDLAQLRSDLSLAVKNALARENVAIA
Ga0310912_1147292813300031941SoilFSAGALTFQLRAWIDRYEDLAQLRSDLSLAVKNALARENIAIA
Ga0310910_1069753213300031946SoilLSFRLRAWIDRYEDLAQLRSDLSLAVKNALARESIAIA
Ga0310909_1067580313300031947SoilSAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIA
Ga0306926_1026617223300031954SoilLTFQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0306926_1196134933300031954SoilYITNFSAGALSFRLRAWIDRYEDLAQLRSDLSLAVKNALARESIAIA
Ga0307479_1039059713300031962Hardwood Forest SoilVTFQLRAWIDRYEDLAQLRSDLSVAVNDALARDKIAIA
Ga0307479_1154570513300031962Hardwood Forest SoilVTFQLRAWTDRYQDWARVRSDLAVAVNSALVEERIAIA
Ga0318533_1056582023300032059SoilSAGALTFQLRAWIDRYEDLAQLRSDLSLEVKNALARENIAIA
Ga0306924_1023019823300032076SoilQLRAWIDRYEDLAQLRSDLSLALNNALASEKIAIA
Ga0306924_1046075413300032076SoilTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0306920_10128962113300032261SoilPSPHVYVTNFSAGALTFQLRAWIDRYEDLAQLRSDLSLALNTALAREKIAIA
Ga0306920_10134192433300032261SoilNFSAGAVTYQLRAWTDRQEDWAQLRSDLSLAVKDALAREKIAIA
Ga0306920_10135609023300032261SoilSAGALTFQLRVWIDRYEDLAQLRSDLSLALNNALAREKIAIA
Ga0310914_1003081573300033289SoilHVYVTNFSAGALTFQLRAWIERYEDLAQLRSDLSLALNNALAREKIAIA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.