NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F095275

Metagenome Family F095275

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F095275
Family Type Metagenome
Number of Sequences 105
Average Sequence Length 43 residues
Representative Sequence LDLTDAERAAFVARDMRRINELGGYLHLVMSVPGLAAH
Number of Associated Samples 100
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 99.05 %
% of genes from short scaffolds (< 2000 bps) 92.38 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score0.44

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (88.571 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(19.048 % of family members)
Environment Ontology (ENVO) Unclassified
(26.667 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.857 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.
1Ga0055465_103125021
2Ga0062590_1022997691
3Ga0063356_1059889632
4Ga0066680_105368441
5Ga0066685_110838051
6Ga0066388_1017807062
7Ga0070689_1014680611
8Ga0070669_1018232161
9Ga0070694_1004085233
10Ga0066686_106735922
11Ga0066686_111112191
12Ga0066687_103935462
13Ga0070707_1010016391
14Ga0070684_1009945361
15Ga0066697_100355564
16Ga0066697_100985664
17Ga0066695_106599271
18Ga0066692_102989522
19Ga0066698_101827544
20Ga0066703_104451191
21Ga0066654_108880381
22Ga0068859_1025294952
23Ga0066653_101531581
24Ga0075431_1015127741
25Ga0075424_1007113273
26Ga0075436_1015146222
27Ga0099794_106460331
28Ga0066710_1017412601
29Ga0114129_101782353
30Ga0114129_132855741
31Ga0111538_133910251
32Ga0075423_126768232
33Ga0114945_105345402
34Ga0105238_113410712
35Ga0126374_108839741
36Ga0105087_10746512
37Ga0105064_10377081
38Ga0126384_101417654
39Ga0126382_115160732
40Ga0134111_104617892
41Ga0126370_103362851
42Ga0126370_107063373
43Ga0126372_100857634
44Ga0126372_112207092
45Ga0137391_109306561
46Ga0137456_11257092
47Ga0137338_11310722
48Ga0137388_113642392
49Ga0137383_101404931
50Ga0137363_117306931
51Ga0137399_102662761
52Ga0137379_114136171
53Ga0137371_112021131
54Ga0157290_100835122
55Ga0137396_113003432
56Ga0137394_1000434414
57Ga0137416_100994021
58Ga0137404_110631102
59Ga0153916_131738011
60Ga0134110_101417861
61Ga0137409_114098081
62Ga0137403_110509232
63Ga0134085_105200821
64Ga0132258_115207171
65Ga0132255_1031330241
66Ga0134112_101021501
67Ga0134083_105726502
68Ga0184608_100956031
69Ga0184634_105205321
70Ga0190272_120563382
71Ga0193723_11533262
72Ga0193730_10103621
73Ga0193755_12015421
74Ga0194131_101043211
75Ga0194120_101512951
76Ga0193695_11135321
77Ga0210073_10193871
78Ga0207647_102978541
79Ga0207709_112352691
80Ga0207668_106729481
81Ga0209055_10115001
82Ga0209152_101256892
83Ga0209802_12626302
84Ga0209803_12545162
85Ga0257173_10079773
86Ga0257171_10399682
87Ga0209690_12091821
88Ga0209157_10384713
89Ga0209156_101346552
90Ga0256865_10901932
91Ga0209726_100969464
92Ga0209481_103525542
93Ga0247828_102708161
94Ga0247818_111588561
95Ga0247820_105222892
96Ga0307504_104197261
97Ga0307312_104133231
98Ga0299907_104283211
99Ga0310888_108038481
100Ga0307469_106867641
101Ga0307473_115522222
102Ga0214473_107280543
103Ga0335079_117119232
104Ga0335083_104892321
105Ga0316628_1037190721
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 39.39%    β-sheet: 0.00%    Coil/Unstructured: 60.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035LDLTDAERAAFVARDMRRINELGGYLHLVMSVPGLAAHSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.44
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
88.6%11.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Lake
Freshwater Wetlands
Groundwater
Natural And Restored Wetlands
Thermal Springs
Soil
Groundwater Sediment
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Groundwater Sand
Arabidopsis Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Thaliana Rhizosphere
Populus Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
9.5%13.3%6.7%4.8%19.0%3.8%2.9%7.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0055465_1031250213300004013Natural And Restored WetlandsADAPAALAGLDLSDAERAAFVARDRRKINELGGYLHLVMSVPGLAAH*
Ga0062590_10229976913300004157SoilALAGADLSADERAAFVARDMRRINELGGYLHLVMSIPGLAAH*
Ga0063356_10598896323300004463Arabidopsis Thaliana RhizosphereAEPQAALADVDLSEEERSAFVARDARRINELGGYLHLVMSVPGIAGH*
Ga0066680_1053684413300005174SoilEADAGTALAGADLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH*
Ga0066685_1108380513300005180SoilTDPEAALAGADLTEAERAAFMARDMRKVNELGGYLHLVMSIPGLAAH*
Ga0066388_10178070623300005332Tropical Forest SoilALAGADLSAEERAAFVARDMRRINELGGYLHLVMSIPGLAAH*
Ga0070689_10146806113300005340Switchgrass RhizosphereAGADLTESERAAFVARDMRRINELGGYLHLVMSIPGHAAH*
Ga0070669_10182321613300005353Switchgrass RhizosphereLAGLDLTEAERAAFVARDTRKINELGGYLHLVMSVPGLAAH*
Ga0070694_10040852333300005444Corn, Switchgrass And Miscanthus RhizosphereAAAVAGADLTESERAAFVARDMRRINELGGYLHLVMSIPGHAAH*
Ga0066686_1067359223300005446SoilPAAVLAGADLSAEERAAFVARDMRRINELGGYLHLVMSIPGLAAH*
Ga0066686_1111121913300005446SoilLDLTDAERAAFLARDMQRINALGGYLHLVMSVPGMAAHVTHPEPE*
Ga0066687_1039354623300005454SoilTADPAAALAGLDLTEPERSAFIARDMRKINELGGYLHLVMSIPGLAAH*
Ga0070707_10100163913300005468Corn, Switchgrass And Miscanthus RhizosphereLADAPSVLAGLDLTEAERAAFIARDMRRINELGGYLHLVMSVPGLAAH*
Ga0070684_10099453613300005535Corn RhizosphereDAAAALASVDLTEAERAAFVNHDMRRINELGGYLHLVMSVPGLAAH*
Ga0066697_1003555643300005540SoilLADADLTEAERAAFLARDMRRINELGGYLHLVLSIPGLAVH*
Ga0066697_1009856643300005540SoilAERSAFVARDMRRINELGGYLHLVMSIPGLAASQRATT*
Ga0066695_1065992713300005553SoilNAERSAFVARDMRRINELGGYLHLVMSIPGLAASQRATT*
Ga0066692_1029895223300005555SoilAAFRARDMERINALGGYLHLVMSVPGMAAHVTHTEPE*
Ga0066698_1018275443300005558SoilEPERSAFIARDMRKINELGGYLHLVMSIPGLAAH*
Ga0066703_1044511913300005568SoilMGPAASIGDLDLTDAERAAFVARDLRKVNELGGYLHLVISIPSLAAR*
Ga0066654_1088803813300005587SoilLADADLTEAERAAFLARDMQRINELGGYLHLVLSIPGLAVH*
Ga0068859_10252949523300005617Switchgrass RhizosphereSRDAAAAVAGADLTESERAAFVARDMRRINELGGYLHLVMSIPGHAAH*
Ga0066653_1015315813300006791SoilLSADERAAFAARDMRRINELGGYLHLVLSIPGLAVHPPRH*
Ga0075431_10151277413300006847Populus RhizosphereRDASAALAGADLSDVERAAFVARDMRAINELGGYLHLVMSIPGLAAH*
Ga0075424_10071132733300006904Populus RhizosphereDPAAAIVDLDLTADERAAFVARDMRRINELGGYLHLVLSIPGLAIHPPRR*
Ga0075436_10151462223300006914Populus RhizosphereREPNAALADVDLTDDERAAFIARDARRINELGGFLHLVMSVPGIAGH*
Ga0099794_1064603313300007265Vadose Zone SoilDADLTHAERSAFVARDMRRINELGGYLHLVMSIPGLAASQRATT*
Ga0066710_10174126013300009012Grasslands SoilGRDLTDAERAAFLARDMERINALGGYLHLVMSVPGMAAHVMHAEHE
Ga0114129_1017823533300009147Populus RhizosphereDLTDDERAAFIARDARRINELGGFLHLVMSVPGIAGH*
Ga0114129_1328557413300009147Populus RhizosphereASALAGLDLTEAEQAAFVARDMRRINELGGYLHLVMSVPGLAAH*
Ga0111538_1339102513300009156Populus RhizosphereADLTEAEREAFVAHDMRRINELGGYLHLVMSIPGLAAH*
Ga0075423_1267682323300009162Populus RhizosphereFDLSDAERAAVLARDLHVLNDLGGYLHLLLSIPGFAPH*
Ga0114945_1053454023300009444Thermal SpringsALAGLDLTEDERAAFLARDMRKLNELGGYLHLVMSVPGMAAHVTHADRH*
Ga0105238_1134107123300009551Corn RhizosphereASVDLTEAERAAFVNHDMRRINELGGYLHLVMSVPGLAAH*
Ga0126374_1088397413300009792Tropical Forest SoilLTEAERAAFVARDMRRINELGGYLHLVMSIPGLAAH*
Ga0105087_107465123300009819Groundwater SandLDLTDAERAAFVARDMRRINELGGYLHLVMSVPGLAAH*
Ga0105064_103770813300009821Groundwater SandDLTEAERAAFVAHDMRKINELGGYLHLVMSIPGLAAHGGRA*
Ga0126384_1014176543300010046Tropical Forest SoilVADADLTDAERSAFVARDMRKINELGGYLHLVMSIPGLAAGQHRPT*
Ga0126382_1151607323300010047Tropical Forest SoilDLTEAERAAFMARDMRKINELGGYLHLVMSIPGLAAH*
Ga0134111_1046178923300010329Grasslands SoilAALAGADLTEAERAAFIARDMRKVNELGGYLHLVMSIPGLAAH*
Ga0126370_1033628513300010358Tropical Forest SoilDADLTDAERSAFLARDMRRINELGGYLHLVMSIPGLAAGQHRPT*
Ga0126370_1070633733300010358Tropical Forest SoilADADLDDAERAAFLTRDMRKINELGGYLHLVMSIPGLAAH*
Ga0126372_1008576343300010360Tropical Forest SoilFGADPAASTADLDLNAEERAAFVARDMRRINELGGYLHLVLSIPGLAVHAPRG*
Ga0126372_1122070923300010360Tropical Forest SoilADARGVLEGLDLTEEERAAFLARDMTRINALGGYLHLVMSVPGLAIHITHPERD*
Ga0137391_1093065613300011270Vadose Zone SoilLAGADLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH*
Ga0137456_112570923300011428SoilDLTDAERAAFVARDMRKINELGGYLHLVMSVPGLAAH*
Ga0137338_113107223300012174SoilDLSDEERRAFVARDARRINELGGYLHLVMSVPGIAGH*
Ga0137388_1136423923300012189Vadose Zone SoilEPEAALADVDLSDDERRAFVARDARRINELGGYLHLVMSVPGIAGH*
Ga0137383_1014049313300012199Vadose Zone SoilALAGADLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH*
Ga0137363_1173069313300012202Vadose Zone SoilAGTALAGADLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH*
Ga0137399_1026627613300012203Vadose Zone SoilPAAALAGLDLTEPERSAFVARDMRKINELGGYLHLVMSIPGLAAH*
Ga0137379_1141361713300012209Vadose Zone SoilPAAALAGLDLTEPERSAFIARDMRKINELGGYLHLVMSVPGLAAH*
Ga0137371_1120211313300012356Vadose Zone SoilALDGLDLTDAERAAFLARDMRRINALGGYLHLVMSVPGMAAHVTHAEPE*
Ga0157290_1008351223300012909SoilRAALADADLTEAEREAFVAHDMRRINELGGYLHLVMSIPGLAAH*
Ga0137396_1130034323300012918Vadose Zone SoilLTAAERAAFEERDMRKINELGGYLHLVMSVPGLAAH*
Ga0137394_10004344143300012922Vadose Zone SoilRSAFVARDMRRINELGGYLHLVMSIPGLAASQRATT*
Ga0137416_1009940213300012927Vadose Zone SoilALAGLDLTDAERAAFEARDMRKINELGGYLHLVMSVPGLAAH*
Ga0137404_1106311023300012929Vadose Zone SoilRAAFLARDMERINALGGYLHLVMSVPGMAAHVTHTEPE*
Ga0153916_1317380113300012964Freshwater WetlandsNEVADLTDAERAAFIARDMRRINGLRGYLHLVMSVPDLAAH*
Ga0134110_1014178613300012975Grasslands SoilLSADERAAFAARDMRRINELGGYLHLVLSIPGLAVHPPRR*
Ga0137409_1140980813300015245Vadose Zone SoilREPEAALADVDLSDDERRAFIARDARRINELGGYLHLVMSVPGIAGH*
Ga0137403_1105092323300015264Vadose Zone SoilGADLTEAERAAFVRRDMRALNELGGYLHLVLSIPGMAAH*
Ga0134085_1052008213300015359Grasslands SoilPEAALAGADLTEAERAAFIARDMRKVNELGGYLHLVMSIPGLAAH*
Ga0132258_1152071713300015371Arabidopsis RhizosphereADARAALAGLDLSEAERAAFVARDVRRINELGGYLHLVMSVPGLAAH*
Ga0132255_10313302413300015374Arabidopsis RhizosphereTRTIADLDLTDPERAAFIARDLRKINELGGYLHLVMSVPGLAVH*
Ga0134112_1010215013300017656Grasslands SoilTRDPGTAVADADLTDAERSAFVARDMRKINELGGYLHLVMSIPGLAASHRATT
Ga0134083_1057265023300017659Grasslands SoilGLDLTDAERAAFLARDMQRINALGGYLHLVMSVPGMAAHVTHPEPE
Ga0184608_1009560313300018028Groundwater SedimentTEAERAAFVARDMRAINELGGYLHLVMSIPGLAAH
Ga0184634_1052053213300018031Groundwater SedimentFEADARAALAGLDLTDAERAAFVARDMRKINELGGYLHLVMSVPGLAAH
Ga0190272_1205633823300018429SoilLAGLDLTDAERAAFEARDMRKINELGGYLHLVMSVPGLAAH
Ga0193723_115332623300019879SoilAGLDLTDAEQAAFEARDMRKINELGGYLHLVMSVPGLAAH
Ga0193730_101036213300020002SoilADAAASLAGLDLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH
Ga0193755_120154213300020004SoilSVLAGLDLTEAERAAFIARDMRRINELGGYLHLVMSVPGLAAH
Ga0194131_1010432113300020193Freshwater LakeLAGADLTDTERAAFVAHDMRKINELGGYLHLVMSIPGLAAH
Ga0194120_1015129513300020198Freshwater LakeDLTDTERAAFVAHDMRKINELGGYLHLVMSIPGLAAH
Ga0193695_111353213300021418SoilAGLDLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH
Ga0210073_101938713300025569Natural And Restored WetlandsLDLTDAERAAFVARDMRKINELGGYLHLVMSVPGLAAH
Ga0207647_1029785413300025904Corn RhizosphereSALADLDLTEDERAAFIARDMRRINELGGYLHLVMSVPGLAAH
Ga0207709_1123526913300025935Miscanthus RhizospherePAQAIADLDLSEVERAAFVERDRRRINELGGYLHLVMSIPGLARH
Ga0207668_1067294813300025972Switchgrass RhizosphereARFVANGADALADLDLSNEERAAFLAHDMRKINELGGYLHLVMSIPGLAGH
Ga0209055_101150013300026309SoilLAGLDLTEPERSAFIARDMRKINELGGYLHLVMSIPGLAAH
Ga0209152_1012568923300026325SoilGERAAFRARDMERINALGGYLHLVMSVPGMAAHVTHTEPE
Ga0209802_126263023300026328SoilEADAGTALAGADLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH
Ga0209803_125451623300026332SoilAAALAGADLTADERAAFVARDMRRINELGGYLHLVMSIPGLAAH
Ga0257173_100797733300026360SoilQADAAAALAGLDLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH
Ga0257171_103996823300026377SoilRAALVGLDLTDAERAAFVARDMRKINELGGYLHLVMSVPGLAAH
Ga0209690_120918213300026524SoilSAFVARDMRRINELGGYLHLVMSIPGLAASQRATT
Ga0209157_103847133300026537SoilDLTEAERAAFIARDMRKVNELGGYLHLVMSIPGLAAH
Ga0209156_1013465523300026547SoilDAERAAFLARDMQRINALGGYLHLVMSVPGMAAHVTHPEP
Ga0256865_109019323300027657SoilDPAAAVAGLDLTDAERAAFIARDMKRINALGGYLHLVMSVPGLAAHVTHPEREA
Ga0209726_1009694643300027815GroundwaterLIGLDLTDAERAAFVARDMRKINELGGYLHLVMSVPGLAAH
Ga0209481_1035255423300027880Populus RhizosphereVVHADLTEAERAAFVARDMRRINELGGYLHLVMSIPGLAAH
Ga0247828_1027081613300028587SoilVDLTEAERAAFVNHDMRRINELGGYLHLVMSVPGLAAH
Ga0247818_1115885613300028589SoilDPGPAIATADLSDAERAAFVARDMRRINELGGYLHLVMSIPGLAAH
Ga0247820_1052228923300028597SoilEALADLDLSNEERAAFLAHDMRKINELGGYLHLVMSIPGLAGH
Ga0307504_1041972613300028792SoilYLADTTGSIVELDLTDAERAAFVARDMRRINELGGYLHLVMSVPGLAAH
Ga0307312_1041332313300028828SoilGLDLTEAERAAFIARDMRRINELGGYLHLVMSVPGLAAH
Ga0299907_1042832113300030006SoilEDAAGALAEADLTDAERAAFVARDMRQINELGGYLHLVMSIPGLAGH
Ga0310888_1080384813300031538SoilADADLTEAERAAFAAHDMRRINELGGYLHLVMSVPGLAAH
Ga0307469_1068676413300031720Hardwood Forest SoilEPEAALANVDLTEDERRAFVARDARRINELGGYLHLVMSVPGIAGH
Ga0307473_1155222223300031820Hardwood Forest SoilLTGIDLTEAERAAFVAHDMRRINELGGYLHLVMSVPGLAAH
Ga0214473_1072805433300031949SoilRAAFIARDMTRINALGGYLHLVMSVPGLAAHVTHAERAD
Ga0335079_1171192323300032783SoilAFLARDMTRINALGGYLHLVMSVPGLAVHITHPERE
Ga0335083_1048923213300032954SoilDEERAAFLARDMTRINALGGYLHLVMSVPGLAVHITHPERE
Ga0316628_10371907213300033513SoilADADLTEAERAAFATRDVRRINELGGYLHLVMSVPGLAAH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.