NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099551


Overview

Basic Information
Family ID F099551
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 43 residues
Representative Sequence MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERM
Number of Associated Samples 87
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 97.09 %
% of genes from short scaffolds (< 2000 bps) 92.23 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.55


Most Common Taxonomy
Group Bacteria (95.146 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (26.214 % of family members)
Environment Ontology (ENVO) Unclassified (33.981 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (42.718 % of family members)
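
The percentages reported above can be turned back into whole-number member counts. The sketch below is only an illustration: it assumes every percentage is taken over the 103 sequences listed under Basic Information, which the page does not state explicitly, and the helper name `members_from_percent` is hypothetical.

```python
def members_from_percent(percent: float, total: int = 103) -> int:
    """Convert a reported percentage into a whole-number count of family members.

    Assumes the percentage was computed over `total` sequences (103 for F099551).
    """
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment, Taxonomy and Ecosystem blocks.
reported = {
    "Bacteria": 95.146,
    "genes near scaffold ends": 97.09,
    "genes from short scaffolds": 92.23,
    "GOLD: Soil": 26.214,
    "ENVO: Unclassified": 33.981,
    "EMPO: Soil (non-saline)": 42.718,
}

counts = {label: members_from_percent(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of 103")
```

Under that assumption, 95.146 % corresponds to 98 of the 103 sequences and 97.09 % to 100 of them.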




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 11.43%, β-sheet: 20.00%, Coil/Unstructured: 68.57%

Predicted 3D Structure

pTM-score: 0.55

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
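
The low-quality rule quoted above can be written as a one-line check. This is a minimal sketch: the 0.7 cutoff and the 0.55 score come from this page, while the function and constant names are hypothetical and not part of any NMPFamsDB tool.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted on the page: pTM < 0.7 is low confidence


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < threshold


family_ptm = 0.55  # pTM-score reported for family F099551
print("low confidence" if is_low_confidence(family_ptm) else "screenable")
```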



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart categories: All Organisms, Unclassified
Chart shares: 96.1%, 3.9%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types shown in the chart: Groundwater Sediment; Soil; Natural And Restored Wetlands; Vadose Zone Soil; Terrestrial Soil; Serpentine Soil; Grasslands Soil; Agricultural Soil; Permafrost; Forest Soil; Hardwood Forest Soil; Rice Paddy Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Corn Rhizosphere; Populus Endosphere; Populus Rhizosphere; Miscanthus Rhizosphere; Rhizosphere Soil; Switchgrass Rhizosphere; Arabidopsis Rhizosphere; Sugar Cane Bagasse Incubating Bioreactor
Slice shares as labeled on the chart: 4.9%, 4.9%, 26.2%, 3.9%, 2.9%, 4.9%, 6.8%, 2.9%, 2.9%, 4.9%, 3.9%, 5.8%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiDRAFT_23919091 | 3300000033 | Soil | MAVQHRCRRHGRPAVASRVRLLPDGTQETEYLCEIDLAEERXSSGF
NODE_03412741 | 3300000156 | Sugar Cane Bagasse Incubating Bioreactor | MAEQHLCQRHGRPAIASRVRLRPDGTQEIEYLCEIDL
AP72_2010_repI_A10DRAFT_10576942 | 3300000651 | Forest Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGG
JGI11643J12802_103118341 | 3300000890 | Soil | MAEQHLCQRHGRPAVASRVRMRPDGTQEVEYLCEIDLAEERMSGRLGRG
JGI1027J12803_1063267881 | 3300000955 | Soil | MAVDQPVCQRHGRPAVASRVRRRPDGTQEVEYLCEIDLAEEQMSS
JGI24738J21930_101140792 | 3300002075 | Corn Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMS
JGI24742J22300_100300721 | 3300002244 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVDQPVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEEQMSSRFG
Ga0055453_100889543 | 3300004006 | Natural And Restored Wetlands | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSSRLD
Ga0062589_1025980811 | 3300004156 | Soil | MATEQHYCERHHRPAVASRVRLRADGSQEVEYLCELDLAEERLSSRGL
Ga0062595_1017181712 | 3300004479 | Soil | VAAEGPLCERHHRPAVASRVRILPDGTRETEYLCEIDLAEE
Ga0070709_111562622 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVEHRCKRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMS
Ga0070714_1013405971 | 3300005435 | Agricultural Soil | MAVEHRCKRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSSR
Ga0070711_1000321621 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSSRLGGSS
Ga0066681_104097371 | 3300005451 | Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSS
Ga0070695_1011663522 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MAEQHLCKRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGSR
Ga0070665_1016220321 | 3300005548 | Switchgrass Rhizosphere | MAAEQHFCERHHRPAVASRVRILADGTRETEYLCELDLA
Ga0070704_1003194563 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGSRS
Ga0066699_108961012 | 3300005561 | Soil | MAVEHLCKRHGRPAVASRVRLRPDGTQEVEYLCEI
Ga0066699_111919922 | 3300005561 | Soil | MAAAEHLCQKHGRPAVATRTRLLPDGTQEVEYLCEI
Ga0070702_100708956 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERM
Ga0075364_101142811 | 3300006051 | Populus Endosphere | MADQPVCQRHGRPAVASRVHTRPDGTQEVEYLCEIDVAEERMR
Ga0075367_110807541 | 3300006178 | Populus Endosphere | MAEQHLCKRHGRPAVASRVRLRPDGTQEVEYVCEIDLA
Ga0079222_111139442 | 3300006755 | Agricultural Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEE
Ga0066660_111977851 | 3300006800 | Soil | MSVEQRCQRHGRPAVATRTRLRPDGTQEVEYLCEIDLAEERMAN
Ga0075424_1028055651 | 3300006904 | Populus Rhizosphere | MAAADQPVCPRHGRPAVASRVRLNPDGTQEVEYLCEIDLAEEQMSSRFGGR
Ga0075436_1015309682 | 3300006914 | Populus Rhizosphere | VAAEGPVCERHHRPAVASRVRILPDGTRETEYLCEIDLAEERMSGRFG
Ga0111539_129431972 | 3300009094 | Populus Rhizosphere | MAEQHLCKRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGSRSL
Ga0105245_111381311 | 3300009098 | Miscanthus Rhizosphere | MAAEQHFCERHHRPAVASRVRILADGTRETEYLCEIDLAEERMSG
Ga0111538_111995962 | 3300009156 | Populus Rhizosphere | MATEQWCQRHNRPAVASRVRRRADGTEQVEYLCEIDLAE
Ga0126313_104411621 | 3300009840 | Serpentine Soil | MAVEHRCQRHGRPAIASRVRLRPDGTQEVEYLCELDVAEERMPGRFGG
Ga0126312_103195323 | 3300010041 | Serpentine Soil | MAIEHRCQRHGRPAVASRVRLRPDGTQETEYLCELD
Ga0126314_109129831 | 3300010042 | Serpentine Soil | MAVEHRCQRHGRPAIASRVRLRPDGTREVEYLCELDVAEERMSGRFGGR
Ga0126314_110443781 | 3300010042 | Serpentine Soil | VATEQHFCERHGRPAVASRVRINPDGTREVEYLCD
Ga0126310_117125492 | 3300010044 | Serpentine Soil | MAEQPRCQRHGRPAVASRIRLRPDGTQETEYLCELDLAEERMS
Ga0134125_126735741 | 3300010371 | Terrestrial Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEE
Ga0134122_103571533 | 3300010400 | Terrestrial Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGSR
Ga0134122_113249161 | 3300010400 | Terrestrial Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERM
Ga0126317_110418711 | 3300011332 | Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCDID
Ga0120148_10784732 | 3300011999 | Permafrost | MAAEQHFCERHHRPAVASRVRTLPDGTQEIEYLCEIDLAAGRIVVARGFADHDYPR*
Ga0137371_109133322 | 3300012356 | Vadose Zone Soil | MAAADQPVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEEQM
Ga0137419_100918123 | 3300012925 | Vadose Zone Soil | MAAEQHHCEKHHRPAVASRVRILPDGTRQTEYLCEIDLAEQ
Ga0162653_1000091573 | 3300012937 | Soil | MSEQPRCPRHGRPAIASRVRLRPDGTQETEYLCEIDLAEERMQRGFGRRSLQLGES*
Ga0134110_102042762 | 3300012975 | Grasslands Soil | MAVEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCDIDLAEERMGRLGGGR
Ga0137418_105102253 | 3300015241 | Vadose Zone Soil | MAAEQHFCERHHRPAVASRVRTRPDGTQEVEYLCEIDLAEERMASRFGPQ*
Ga0134073_100446952 | 3300015356 | Grasslands Soil | MAAEQHLCRRHGRPAVASRVRVRPDGTQEVEYLCDI
Ga0132256_1019812021 | 3300015372 | Arabidopsis Rhizosphere | MAEQHLCKRHGRPAVASRVRLRPDGTQEVEYLCEI
Ga0184605_104035262 | 3300018027 | Groundwater Sediment | MAEQHLCQRHGRPAVASRIRTRPDGTQEVEYLCEIDLAEE
Ga0184617_10773581 | 3300018066 | Groundwater Sediment | MAEQPVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDVAEERMGRLG
Ga0184635_1000135110 | 3300018072 | Groundwater Sediment | MAVDQPVCKRHGRPAVASRVVTRPDGTQEVEYLCELDLAEERMRSPFGG
Ga0184609_103455192 | 3300018076 | Groundwater Sediment | MAVEQPVCKRHGRPAVASRVVTRPDGTQEVEYLCELDLAEERMRSP
Ga0184625_105787941 | 3300018081 | Groundwater Sediment | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSSR
Ga0066662_117171922 | 3300018468 | Grasslands Soil | MSVEHLCQRHGRPAVPTRTRPRPDGTQEVEYLCENDLAEARQAH
Ga0184642_10526941 | 3300019279 | Groundwater Sediment | MAVEQPVCQRHGRPAVAQRIRRLPDGTEEVEYLCELDLAEERMSG
Ga0184642_10625452 | 3300019279 | Groundwater Sediment | MATEQHYCQRHNRPAVASRVRVRPDGTQEVEYLCEIDLAEERM
Ga0184642_13222632 | 3300019279 | Groundwater Sediment | MATEQHYCERHHRPAVASRVRVLPDGTQEVEYLCELDLAE
Ga0193704_10532503 | 3300019867 | Soil | MAEQHLCQRHGRPAVASRVRMRPDGTQEVEYLCEI
Ga0193728_11591641 | 3300019890 | Soil | MSVEHRCQRHGRPAVATRTRLRPDGTQEVEYLCEIDLAE
Ga0193730_10988053 | 3300020002 | Soil | MTVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEE
Ga0193730_11546262 | 3300020002 | Soil | MAAADQPICPRHGRPAVASRVRLNPDGTQEVEYLCEIDLAEEQMSSR
Ga0210382_102891972 | 3300021080 | Groundwater Sediment | MAAADQPVCPRHGRPAVASRVRLRPDGTQEVEYLC
Ga0210382_103277741 | 3300021080 | Groundwater Sediment | MATEQHYCQRHHRPAVASRVRTLPDGTQEVEYLCEIDLAEERM
Ga0179584_11133851 | 3300021151 | Vadose Zone Soil | MAVEHRCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEER
Ga0247743_10488272 | 3300023067 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLA
Ga0247679_10370322 | 3300024251 | Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMG
Ga0207647_106685111 | 3300025904 | Corn Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLG
Ga0207652_106161711 | 3300025921 | Corn Rhizosphere | MAVEHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMGRLGG
Ga0207644_115527471 | 3300025931 | Switchgrass Rhizosphere | MAVDQPVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEEPMSS
Ga0207667_104235761 | 3300025949 | Corn Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGSQEVEYLCEIDLAEERMSGRLGGSR
Ga0207667_114939942 | 3300025949 | Corn Rhizosphere | MAEQHLCKRHGRPAVASRVRLRPDGTQEVEYLCEID
Ga0208777_10204872 | 3300025996 | Rice Paddy Soil | MTAEHLCQRHGRPAVASRVRLRPDGTEEVEYLCEIDLAEERMGRLGG
Ga0207428_108609271 | 3300027907 | Populus Rhizosphere | MADQPVCQRHGRPAVASRVHTRPDGTQEVEYLCEIDV
Ga0207428_111811311 | 3300027907 | Populus Rhizosphere | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAE
Ga0247749_10355481 | 3300027993 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGR
Ga0307295_100082191 | 3300028708 | Soil | MAEQHLCQRHGRPAVASRVRQRPDGTQEVEYLCEIDLAEERMS
Ga0307301_102533871 | 3300028719 | Soil | MAAADQPVCPRHGRPAVASRVRLNPDGTQEVEYLCEIDLAEEQM
Ga0307318_101800321 | 3300028744 | Soil | VATEQHYCQRHHRPAVASRVRTLPDGTEEVEYLCELDIAE
Ga0307280_101611151 | 3300028768 | Soil | MAAADQPVCPRHGRPAVASRVRLNPDGTQEVEYLCEIDLAEEQMS
Ga0307288_104826541 | 3300028778 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDL
Ga0307323_103430182 | 3300028787 | Soil | MAEQHLCQRHGRPAVASRVRQRPDGTQEVEYLCEIDLAEERMSGRLGGSRSL
Ga0307290_100205395 | 3300028791 | Soil | MAVEQPVCQRHGRPAVAQRIRRLPDGTEEVEYLCE
Ga0307290_101949642 | 3300028791 | Soil | MAEQHLCQRHGRPAVASRVRMRPDGTQEVEYLCEIDLAEERMSGRL
Ga0307290_103399902 | 3300028791 | Soil | MAVEHVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMG
Ga0247824_108470202 | 3300028809 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCELDLAEERMS
Ga0307302_100581321 | 3300028814 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGG
Ga0307302_106251792 | 3300028814 | Soil | MAVEHLCQRHGRPAVASRVRLLPDGTQETEYLCELD
Ga0307310_102950132 | 3300028824 | Soil | MASEQHYCQRHNRPAVASRVRTRPDGTQEVEYLCEIDLAEERMQS
Ga0307314_100196914 | 3300028872 | Soil | MAEQHLCQRHGRPAVASRVRMRPDGTQEVEYLCEID
Ga0307286_100211031 | 3300028876 | Soil | MAVEQPVCQRHGRPAVASRVVTRPDGTQGVEYLCELDLA
Ga0307286_100566951 | 3300028876 | Soil | MAEQPRCQRHGRPAVASRVRLRPDGTQETEYLCELD
Ga0307308_101258142 | 3300028884 | Soil | MAAADQPVCAQHGRPAVASRVRLRPDGTQEVEYLCEIDLAEEQMSSRFGGR
Ga0102757_100030131 | 3300030785 | Soil | MASEQHFCARHHRPAIASRVHLRPDGSQEVEYLCELDVAEERMSNRF
Ga0102757_100504213 | 3300030785 | Soil | MAVEHRCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSSR
Ga0308205_10265822 | 3300030830 | Soil | MAVEQHYCQRHGRPAVASRVRILPDGTQETEYLCELDLA
Ga0308205_10660622 | 3300030830 | Soil | MAVEHRCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSTRLGGS
Ga0308206_11577091 | 3300030903 | Soil | VATEQHYCQRHHRPAVASRVRTLPDGSQEVEYLCELDIAEERLSRGF
Ga0308200_11429512 | 3300030905 | Soil | VAAEQHFCERHHRPAVASRVRILPDGTRETEYLCEIDLAEERM
Ga0308200_11673802 | 3300030905 | Soil | VATEQHYCQRHHRPAVASRVRTLPDGTQEVEYLCEI
Ga0308204_101246991 | 3300031092 | Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMGHLG
Ga0307505_102369452 | 3300031455 | Soil | MAVQHRCQRHGRPAVASRVRLLPDGTQETEYLCEIDLAEERLSSGFGN
Ga0307469_120812792 | 3300031720 | Hardwood Forest Soil | MAVDQPVCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEEQMSSRFGGRGS
Ga0310896_106663212 | 3300032211 | Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELDLAEERMSTRL
Ga0247829_108087311 | 3300033550 | Soil | MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERMSGRLGGS
Ga0373950_0004089_2016_2123 | 3300034818 | Rhizosphere Soil | MAVEHLCQRHGRPAVASRVRLRPDGTQETEYLCELD
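
Each row of the table above carries four fields: a protein ID, a sample taxon ID, a habitat name and a protein sequence. The sketch below shows one way to split such a row in Python, whether the fields run together or are separated by whitespace or `|`. It rests on assumptions drawn only from the rows on this page, not from any NMPFamsDB specification: the taxon ID is exactly 10 digits, the habitat name starts with a capital and ends with a lowercase letter, and the sequence is all uppercase with an optional trailing `*`. The names `ROW` and `parse_row` are hypothetical.

```python
import re

# Assumed row layout (inferred from this page only):
#   protein ID, 10-digit taxon ID, habitat name, uppercase sequence.
ROW = re.compile(
    r"^(?P<protein_id>\S+?)[\s|]*"                # shortest ID that lets the rest match
    r"(?P<taxon_id>\d{10})[\s|]*"                 # sample taxon ID, e.g. 3300005615
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])[\s|]*"   # habitat ends at the last lowercase letter
    r"(?P<sequence>[A-Z*]+)$"                     # residues, optional stop marker '*'
)


def parse_row(line: str) -> dict:
    """Split one Family Sequences row into its four fields."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return match.groupdict()


# A row copied from the table, with the fields run together.
fused = (
    "Ga0070702_1007089563300005615"
    "Corn, Switchgrass And Miscanthus Rhizosphere"
    "MAEQHLCQRHGRPAVASRVRLRPDGTQEVEYLCEIDLAEERM"
)
record = parse_row(fused)
length = len(record["sequence"].rstrip("*"))  # residue count without the stop marker
print(record["protein_id"], record["taxon_id"], record["habitat"], length)
```

The lazy match on the protein ID matters here: because IDs end in digits, the pattern relies on the 10-digit taxon ID being followed directly by the capitalised habitat name to find the boundary.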




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice