NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F089371

Metagenome / Metatranscriptome Family F089371

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F089371
Family Type Metagenome / Metatranscriptome
Number of Sequences 109
Average Sequence Length 42 residues
Representative Sequence FLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIESE
Number of Associated Samples 100
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 5.50 %
% of genes near scaffold ends (potentially truncated) 93.58 %
% of genes from short scaffolds (< 2000 bps) 90.83 %
Associated GOLD sequencing projects 97
AlphaFold2 3D model prediction Yes
3D model pTM-score0.16

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.248 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(18.349 % of family members)
Environment Ontology (ENVO) Unclassified
(37.615 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(40.367 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56
1L01_03504680
2L01_04616480
3F47_01432450
4JGI11643J12802_109893591
5JGI11643J12802_112385104
6JGI10220J13317_100260391
7JGI25404J52841_100058095
8Ga0062593_1012356752
9Ga0063356_1043586221
10Ga0062595_1001291222
11Ga0066814_101126171
12Ga0066684_109395241
13Ga0065715_104460611
14Ga0068869_1019562441
15Ga0070682_1005421493
16Ga0070709_102749451
17Ga0070711_1008397271
18Ga0070693_1006219532
19Ga0066698_110140911
20Ga0066703_101610003
21Ga0066705_100518444
22Ga0066691_101870373
23Ga0066903_1067766461
24Ga0080027_104509742
25Ga0066652_1003807751
26Ga0075018_105204582
27Ga0070712_1002801391
28Ga0070765_1021177052
29Ga0099795_103409451
30Ga0099829_107723202
31Ga0066709_1013553901
32Ga0105249_120657441
33Ga0126374_116289351
34Ga0126373_124419271
35Ga0134070_103732631
36Ga0134109_104065041
37Ga0134084_102693691
38Ga0126370_121478091
39Ga0126370_123162931
40Ga0126377_104110951
41Ga0105246_102715033
42Ga0137391_108713532
43Ga0137388_110755752
44Ga0137363_1000405611
45Ga0137363_104397791
46Ga0137363_106107971
47Ga0137399_115923421
48Ga0137372_109866272
49Ga0137367_110804761
50Ga0137366_109455212
51Ga0137371_112024942
52Ga0137368_104665211
53Ga0137359_100889531
54Ga0137359_114216611
55Ga0137419_117493242
56Ga0137404_107882211
57Ga0164300_103683822
58Ga0164302_100057291
59Ga0164306_115972831
60Ga0134081_100488713
61Ga0134078_102114212
62Ga0132258_126805651
63Ga0132258_136051063
64Ga0132257_1036490971
65Ga0132255_1058009891
66Ga0182039_119880562
67Ga0134112_102371363
68Ga0184608_104270931
69Ga0184612_101888031
70Ga0173481_100556353
71Ga0137408_13389143
72Ga0193748_10086643
73Ga0193720_10266852
74Ga0193747_11374472
75Ga0193728_12530442
76Ga0193731_10463491
77Ga0193721_10541382
78Ga0193733_10193983
79Ga0210406_112455022
80Ga0193719_100096626
81Ga0210392_104431791
82Ga0222621_11153171
83Ga0126371_102254941
84Ga0224452_12872791
85Ga0247688_11011071
86Ga0207707_111773292
87Ga0207671_110739532
88Ga0207693_101878983
89Ga0207687_108256851
90Ga0207702_106332691
91Ga0209863_102420002
92Ga0209648_100523034
93Ga0209590_102773661
94Ga0268266_101683052
95Ga0268266_110948382
96Ga0137415_108839461
97Ga0307305_100105817
98Ga0307289_104226641
99Ga0307277_102046582
100Ga0075386_121136911
101Ga0075382_115391141
102Ga0170834_1062632372
103Ga0307469_105752632
104Ga0307469_124836711
105Ga0318510_101886821
106Ga0307471_1020962801
107Ga0307472_1002449753
108Ga0310810_103659953
109Ga0310811_101101725
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 0.00%    Coil/Unstructured: 100.00%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540FLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIESESequenceα-helicesβ-strandsCoilSS Conf. scoreDisordered Regions
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.16
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
97.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Watersheds
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Soil
Soil
Grasslands Soil
Grass Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Prmafrost Soil
Tropical Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Tabebuia Heterophylla Rhizosphere
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
15.6%18.3%5.5%5.5%8.3%5.5%3.7%4.6%3.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
L01_035046802170459006Grass SoilFLSTRTDPRQEVELEASRESDSKITQVEISLPLPKPNIESE
L01_046164802170459006Grass SoilDPRQEVELEASREPDSKITQVEISLPLPKPNIESE
F47_014324502170459009Grass SoilTCTDPRQEVELEASREPDSKITQVEISSPLPEPNIDSE
JGI11643J12802_1098935913300000890SoilDPRQEVELEASREPDSKITHVEISSPLPKPDIESE*
JGI11643J12802_1123851043300000890SoilKITRILFLSTRTDPRQEVELEASREPDSKVTQVEISCPLPKPEIASE*
JGI10220J13317_1002603913300001139SoilDPRLEVELEASREPDSNITQVEISSPLPKPDIENE*
JGI25404J52841_1000580953300003659Tabebuia Heterophylla RhizosphereFLSTRTDPRQEVELEASREPDGKITHVEISSPLPKPNIDDE*
Ga0062593_10123567523300004114SoilSSNGNLTTIMFLSTRTDPVQKVELDARRAPGEKISRIKISSPLFKPDETSE*
Ga0063356_10435862213300004463Arabidopsis Thaliana RhizosphereLSTRTDPRQEVELEASREPDSKITHVEISSPLPKPDIVSE*
Ga0062595_10012912223300004479SoilLSTRTDPRQEVEMEAKREAGGKITEVNISSPLPKPEAASD*
Ga0066814_1011261713300005162SoilKITRILFLSTRTDPRQEVELEASREPDSKITRIEISSPLPKPDIGSE*
Ga0066684_1093952413300005179SoilFLSTRTDPRQEVELEASREPDGKITQVEISSPLPKPEIASE*
Ga0065715_1044606113300005293Miscanthus RhizosphereILFLSTRTDPRQEVELEASREPDSKVTQIEISSPLPKPDIGSE*
Ga0068869_10195624413300005334Miscanthus RhizosphereLFLSTRTDPRQEVELEASREPESKVTQVEISSPLPKPGIGSE*
Ga0070682_10054214933300005337Corn RhizosphereTRILFLSTRTDPRQEVELEASREPGGKVTRIEIASPLPKPDIGSE*
Ga0070709_1027494513300005434Corn, Switchgrass And Miscanthus RhizosphereSTRTDPRQEVELEASREPDSKVTQVEISSPLPKPDIGSE*
Ga0070711_10083972713300005439Corn, Switchgrass And Miscanthus RhizosphereTRILFLSTRTDPRQEVELEASREPESKVTQVEISSPLPKPEIGSE*
Ga0070693_10062195323300005547Corn, Switchgrass And Miscanthus RhizosphereLSTRTDPRQEVELEASREPGGKVTRIEIASPLPKPDIGSE*
Ga0066698_1101409113300005558SoilLSTRTDPRQEVELEASREPDNKITQVEISSPLPKPDIESE*
Ga0066703_1016100033300005568SoilNGKLTTIMFLSTRTDPVQEAKLEASREPGGKITQVEISSPLPKPDIGSE*
Ga0066705_1005184443300005569SoilKITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPVIESQ*
Ga0066691_1018703733300005586SoilFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPVIESQ*
Ga0066903_10677664613300005764Tropical Forest SoilSTRTDPRQEVELEASREPDSKVTQVQISSPLPKPELGSE*
Ga0080027_1045097423300005993Prmafrost SoilVRNNISSNGNLTTIMFLSTRTDPVQKVEMDARRAPGEKISRVEISSPLPKPG
Ga0066652_10038077513300006046SoilDPRQEVELEASREPDSKITKVEISSPLPKPDIESD*
Ga0075018_1052045823300006172WatershedsSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIESE*
Ga0070712_10028013913300006175Corn, Switchgrass And Miscanthus RhizosphereSTRTDPRQEVELEASREPDGKITQVEISSPLPKPNIESE*
Ga0070765_10211770523300006176SoilSTRTDPRQEVELEASREPDGKITQVEISSPLPKPDIEGE*
Ga0099795_1034094513300007788Vadose Zone SoilNPVEKVELEASSEPNSKVTQVEISSPLPKPDIESE*
Ga0099829_1077232023300009038Vadose Zone SoilLFLSTRTDPRQEVELEASREADNKITQVEISSPLPKPDIESE*
Ga0066709_10135539013300009137Grasslands SoilMNVNGKNTRILFLSTRTNPREKVELEASREADSKVTHAEISSPL
Ga0105249_1206574413300009553Switchgrass RhizosphereRTDPRQEVELEASREPDGKITQIEISSPLPKPNIESE*
Ga0126374_1162893513300009792Tropical Forest SoilPRQEVELEASREPDSKITQVEISSPLPKPVVASE*
Ga0126373_1244192713300010048Tropical Forest SoilTRILFLSTRNDPREEVELEASREPGSKVTQVQISSPLPKPEIGSE*
Ga0134070_1037326313300010301Grasslands SoilKITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPNIDSE*
Ga0134109_1040650413300010320Grasslands SoilLFLSTRTDPRQEVELEASRDPDSKVTQVEISSPLPKPEIGSE*
Ga0134084_1026936913300010322Grasslands SoilLFLSTRTDPREEVELEASREPNGKITQVEISSALPKPDFGSE*
Ga0126370_1214780913300010358Tropical Forest SoilTNPGQEVELEASREKDSKVTQVAISSPPPKPDIGSE*
Ga0126370_1231629313300010358Tropical Forest SoilRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDVASE*
Ga0126377_1041109513300010362Tropical Forest SoilDPRQEVELEASREPDAKITQVEISSPLPKPDIDTE*
Ga0105246_1027150333300011119Miscanthus RhizosphereLSTRTDPRQEVELEASREPDSKVTQVQISSPLPKPEIGTE*
Ga0137391_1087135323300011270Vadose Zone SoilILFLSTRTDPKQEVELEASREPGSKVTQVEISSPLPKPEIGSE*
Ga0137388_1107557523300012189Vadose Zone SoilSTRSDPMQEVELEASREPDGKITQVEISSPLPKPDIESE*
Ga0137363_10004056113300012202Vadose Zone SoilMFLSTRTDPLQEVKLEASREPGGKITQIEISFPLPKPDVQGE*
Ga0137363_1043977913300012202Vadose Zone SoilTRTDPRQEVELEASREPNSKVTQIQISSPLPKPEVGSE*
Ga0137363_1061079713300012202Vadose Zone SoilTDPRQEVELAASREQDSKITQVEISSPLPKPDIESE*
Ga0137399_1159234213300012203Vadose Zone SoilKITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIGSE*
Ga0137372_1098662723300012350Vadose Zone SoilSTRTDPRLEVELEASREPDSNITQVEISSPLPKPDIESE*
Ga0137367_1108047613300012353Vadose Zone SoilFLSTRTDPRQEVELEASREPDNKVTQIQISSPLPKPEIGSE*
Ga0137366_1094552123300012354Vadose Zone SoilFLSTRTDPRQEVELEASREPNNKVTQIQISSPLPKPEVGSE*
Ga0137371_1120249423300012356Vadose Zone SoilFLSTLTDLRQEVELEASREPDSKITQIEISSPLPKPDVESE*
Ga0137368_1046652113300012358Vadose Zone SoilITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIESE*
Ga0137359_1008895313300012923Vadose Zone SoilNGKLITITSLSTRSDPVQEVKLEASREADGKVTQVEISSPLPKPDIGSE*
Ga0137359_1142166113300012923Vadose Zone SoilPRQEVELEASREPDSKITQIEISSRLPKPDIESE*
Ga0137419_1174932423300012925Vadose Zone SoilTQTNPVEKVELEASREPNSKVTQVEISSPLPKPDIESE*
Ga0137404_1078822113300012929Vadose Zone SoilTDPRQEVELEASREPDGKITQVEISSPLPKPNIESE*
Ga0164300_1036838223300012951SoilNGKVTRILFLSTRTDPRQEVELEASREPESKVTQVEISSPLPKPEIGSE*
Ga0164302_1000572913300012961SoilGKLMTVTSLSTRSDPVQEVKLEASREADGKITQVEISSPLPKPNIETE*
Ga0164306_1159728313300012988SoilNGKITRILFLSTRTDPRQEVELEASREPDSKITQVQISSPLPKPDIESE*
Ga0134081_1004887133300014150Grasslands SoilVNGKITRILFLSTRTDPREEAELEASREPDSKITRVEISSPLPKPNIESE*
Ga0134078_1021142123300014157Grasslands SoilMFLSTRTDPVQEVKLEAFREPDGKITQVEISSPLPKPDIGSE*
Ga0132258_1268056513300015371Arabidopsis RhizosphereDPRQEVELEASREPESKITQIEISSPLPKPDIESE*
Ga0132258_1360510633300015371Arabidopsis RhizosphereTTRILFLSTRTDPRQEVELEASREPASKVTQVEISSPLPKPEIGSE*
Ga0132257_10364909713300015373Arabidopsis RhizosphereTRTDPRQEEEMEAKRDAGGKTTEVNISSPLPKPEAASD*
Ga0132255_10580098913300015374Arabidopsis RhizosphereNPVEKVELEASREPNSKVTQVEISSPLTKPDIASE*
Ga0182039_1198805623300016422SoilRILFLSTRTDPRQGVELEASREPNSKITQVQISSPLPKPDIGSE
Ga0134112_1023713633300017656Grasslands SoilLFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPNIESE
Ga0184608_1042709313300018028Groundwater SedimentFLSTRTDPRQEVELEASREPDGKITKVEISSPLPKPNIESE
Ga0184612_1018880313300018078Groundwater SedimentTQTDPVERVELEASREPNSKVTQVEISSPLPKPKIASE
Ga0173481_1005563533300019356SoilFLSTRTDPRQEVELEASREPDGKITQIEISSPLPKPNIESE
Ga0137408_133891433300019789Vadose Zone SoilMNVNADHEILFLSTRTDPRQEVELEASREPDSKVTQIEISSPLPKPEIGSE
Ga0193748_100866433300019865SoilSVNGKTTRILFLSTRTDPRQEVELEASLEPGSKVTQVEISSPLPKPEIGSE
Ga0193720_102668523300019868SoilRILFLSTRTDPRQEVELEASRQPDSKVTQVEISSPLPKPEIGSE
Ga0193747_113744723300019885SoilGKITRILSLSTQTDPVERVELEASREPKSKVTQVEISSPLPKPKIGSE
Ga0193728_125304423300019890SoilTTRILFLSTRTDPRQEVELEASREPDSKVTQVEISSPLPKPEIGSE
Ga0193731_104634913300020001SoilSTRTDPRQEVELEASREPDSKITQIEISSPLPKPNIESE
Ga0193721_105413823300020018SoilLFLSTRTDPRQEVELEASREPDGKITQVEISSPLPKPDIESE
Ga0193733_101939833300020022SoilMFLSTRTDPVQKVELAARRAPGEKISRVEISSPLPKPEEAGE
Ga0210406_1124550223300021168SoilRTDPRQEVELEASREPDAKITRVEISSPLPKPNIESE
Ga0193719_1000966263300021344SoilEPVERVELEASREPDGKITQVEISSPLPKPEIESE
Ga0210392_1044317913300021475SoilVNGKITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIGSE
Ga0222621_111531713300021510Groundwater SedimentKVTRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIGSE
Ga0126371_1022549413300021560Tropical Forest SoilGTRTDPRQEVELEASREPESKITQVEISSPLPKPEIDSE
Ga0224452_128727913300022534Groundwater SedimentSTQTDPVERVELEASREPNSKVTQVEISSPLPKPEIGSE
Ga0247688_110110713300024186SoilILFLSTRTDPRQEVELEASREPDSKVTQVQISSPLPKPEIESE
Ga0207707_1117732923300025912Corn RhizosphereNGKITRILFLSTRTDPRQEVELEASREPDSKITQIQISSPLPKPDIESE
Ga0207671_1107395323300025914Corn RhizosphereITRLLFLSTRTDPRQEVELEASREPDSKITQVQISSPLPKPDIESE
Ga0207693_1018789833300025915Corn, Switchgrass And Miscanthus RhizosphereSTRTDPRQEVELEASREPDGKITQVEISSPLPKPNIESE
Ga0207687_1082568513300025927Miscanthus RhizosphereTDPRQEVELEASREPDGKITQIEISSPLPKPNIESE
Ga0207702_1063326913300026078Corn RhizosphereLSTRTDPRQEVELEASREPDSKITQIQISSPLPKPDIESE
Ga0209863_1024200023300026281Prmafrost SoilVRNNISSNGNLTTIMFLSTRTDPVQKVEMDARRAPGEKISRVEISSP
Ga0209648_1005230343300026551Grasslands SoilMFLSTRTGPLQEVKLEASREPGGKITQIEISFPLPKPDVQGE
Ga0209590_1027736613300027882Vadose Zone SoilKLTTIMFLSTRSEPVERVELEASREPDGKITQVEISSPLPKPEIESE
Ga0268266_1016830523300028379Switchgrass RhizosphereDPRQEVKMEAKREAGGKTTEVNISSPLPKPEAASD
Ga0268266_1109483823300028379Switchgrass RhizosphereILFLSTRTDPRQEVELEASREPDGKITQVEISSPLPKPNIESE
Ga0137415_1088394613300028536Vadose Zone SoilGKITRILFLSTRTDPRQEVELEASREPDNKITQVEISSPLAKPNIESE
Ga0307305_1001058173300028807SoilNGKITRILFLSTRTDPRQEVELEASREPDGKITKVEISSPLPKPNIESE
Ga0307289_1042266413300028875SoilFLSTRTDPRQEVELEASREPDSKTTQIEISSPLPKPDIGSE
Ga0307277_1020465823300028881SoilTRILSLSTQTDPVERVELEASRMPNSKVTQVEISSPLPKPKIASE
Ga0075386_1211369113300030916SoilLFLSTRTDPRQEVELEASREPASKVTQVEISSPLPKPEIGSE
Ga0075382_1153911413300030917SoilFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIGNE
Ga0170834_10626323723300031057Forest SoilDPRQEVELEASREPDSKITQVQISSPLPKPDIESE
Ga0307469_1057526323300031720Hardwood Forest SoilMNVNGKITRILFLSTRTDPRQEVELEASREPDSKVT
Ga0307469_1248367113300031720Hardwood Forest SoilTDPRQEVELEASREPDSKITQVEISSPLPKLNIESE
Ga0318510_1018868213300032064SoilRTDPRQEVELEASREPESKITQVKISSPLPKPEIDSE
Ga0307471_10209628013300032180Hardwood Forest SoilDPRQEVELEVSREQDGKITQVEISSPLPKPDIESE
Ga0307472_10024497533300032205Hardwood Forest SoilFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIESE
Ga0310810_1036599533300033412SoilRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPEIGSE
Ga0310811_1011017253300033475SoilITRILFLSTRTDPRQEVELEASREPDSKITQVEISSPLPKPDIVSE


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.