NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101699

Metagenome / Metatranscriptome Family F101699

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101699
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 41 residues
Representative Sequence DGDFEFRRTEYDVPRAAAGYRSMGGDFGEFAARRIERGSD
Number of Associated Samples 94
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 5.88 %
% of genes from short scaffolds (< 2000 bps) 5.88 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.62

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (95.098 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(18.628 % of family members)
Environment Ontology (ENVO) Unclassified
(26.471 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(54.902 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46
14MG_00832310
2C688J18823_105677713
3JGI24738J21930_100833071
4Ga0062595_1013944201
5Ga0062594_1001760473
6Ga0066677_106348621
7Ga0066679_105590612
8Ga0068868_1003779713
9Ga0070688_1009641221
10Ga0070701_109777671
11Ga0070700_1007838013
12Ga0066689_108094772
13Ga0070707_1018452021
14Ga0066694_104296452
15Ga0070717_101983373
16Ga0066652_1007815211
17Ga0070716_1016925131
18Ga0070712_1017765272
19Ga0074051_117845141
20Ga0074055_115597662
21Ga0079222_122927891
22Ga0066653_100521263
23Ga0075436_1001419844
24Ga0099827_111933293
25Ga0075418_112887383
26Ga0066709_1002572014
27Ga0066709_1004849821
28Ga0066709_1012297663
29Ga0105248_128743012
30Ga0105238_109969331
31Ga0126374_101839581
32Ga0126309_107792652
33Ga0126314_101330533
34Ga0126382_115844771
35Ga0126319_14402053
36Ga0134086_100462241
37Ga0134064_103449901
38Ga0134065_101709631
39Ga0134080_104322413
40Ga0126376_111647113
41Ga0105239_101861184
42Ga0150985_1143422561
43Ga0150985_1177056443
44Ga0150984_1016503422
45Ga0150984_1228665731
46Ga0157314_10490301
47Ga0157282_100024754
48Ga0164300_103354241
49Ga0164300_106630942
50Ga0164309_111526373
51Ga0134089_103150041
52Ga0132258_115802994
53Ga0132257_1025631392
54Ga0134083_105683001
55Ga0184608_103183282
56Ga0184619_100857461
57Ga0184640_103642132
58Ga0066655_107562312
59Ga0066669_108553641
60Ga0173482_100900691
61Ga0210382_104546292
62Ga0210382_105283831
63Ga0222622_102290201
64Ga0247795_10145181
65Ga0247790_101151941
66Ga0247664_11129621
67Ga0210120_11192352
68Ga0207687_106093543
69Ga0207700_101805143
70Ga0207691_111882611
71Ga0207689_101717953
72Ga0207667_115758572
73Ga0207648_116593312
74Ga0209687_10326354
75Ga0209267_13153481
76Ga0209803_12724221
77Ga0209159_12354601
78Ga0209795_101213991
79Ga0247683_10041571
80Ga0307313_102301971
81Ga0307301_101602712
82Ga0307301_101884483
83Ga0307317_100232411
84Ga0307315_102915441
85Ga0307318_102437531
86Ga0307280_101980771
87Ga0307320_100517081
88Ga0307305_100720583
89Ga0307310_101635451
90Ga0307312_104869823
91Ga0307300_100147301
92Ga0307277_100239981
93Ga0247826_102514541
94Ga0247826_107167301
95Ga0308204_102385922
96Ga0318516_102177603
97Ga0318534_102119101
98Ga0318563_102087223
99Ga0307471_1032127101
100Ga0334913_088871_505_639
101Ga0373948_0135566_479_604
102Ga0373950_0001343_3070_3189
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 32.35%    β-sheet: 0.00%    Coil/Unstructured: 67.65%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540DGDFEFRRTEYDVPRAAAGYRSMGGDFGEFAARRIERGSDSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.62
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
4.9%95.1%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Soil
Natural And Restored Wetlands
Groundwater Sediment
Groundwater Sediment
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Serpentine Soil
Grasslands Soil
Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Sub-Biocrust Soil
Soil
Hardwood Forest Soil
Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Avena Fatua Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Arabidopsis Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Rhizosphere Soil
Miscanthus Rhizosphere
Avena Fatua Rhizosphere
Agave
Switchgrass, Maize And Mischanthus Litter
2.9%18.6%2.9%5.9%8.8%4.9%4.9%6.9%2.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
4MG_008323102170459019Switchgrass, Maize And Mischanthus LitterRVRFQRTEYDVERAADGVPAMGGDFGEFAARRIERGSD
C688J18823_1056777133300001686SoilDGEFEFRRTEYDNERAADAFRELGGRFGEMVAGRILRGSD*
JGI24738J21930_1008330713300002075Corn RhizosphereRRTDYDVERAAAAYRQMRGDFGEFAANRIERGSD*
Ga0062595_10139442013300004479SoilAVRDDEGRFEFRRTEYDNERSAAAYLALGGGFGEMVARRLERGSD*
Ga0062594_10017604733300005093SoilWDGDFEFRRTEYDVARAAAGYRSMGGDFGEFAAGRIERGSD*
Ga0066677_1063486213300005171SoilWEDDFTFRRTEYDVEAAAAAYRAMGGDFGEFAARRIEKGSD*
Ga0066679_1055906123300005176SoilGEFEFRRTEYDIERAAAGWRKLGGDFGKFAAERIERGRD*
Ga0068868_10037797133300005338Miscanthus RhizosphereWDGDFEFRRSDYDVARAAEGYRSLGGDFGEFAARRIERGSD*
Ga0070688_10096412213300005365Switchgrass RhizosphereTWDGDFTFRRTEYDAEAAAAAYRSMGGDFGEFAAHRIERGSD*
Ga0070701_1097776713300005438Corn, Switchgrass And Miscanthus RhizosphereIQDDDGEFAFRRTDYDVERAAAAYRQMRGDFGEFAANRIERGSD*
Ga0070700_10078380133300005441Corn, Switchgrass And Miscanthus RhizosphereDDHFTFRRTEYDVERAAEAYRGMGGAFGEFAANRILRGSD*
Ga0066689_1080947723300005447SoilWDGDFTFRRTEYDFEAAAAAYRAMGGDFAEFAARRIEKGSD*
Ga0070707_10184520213300005468Corn, Switchgrass And Miscanthus RhizosphereDGDFTFRRTEYDAEAAAAAYRSMGGDFGEFAARRLERGSD*
Ga0066694_1042964523300005574SoilAWATWDGDFEFRRTEYDVARAAAGYRSLAGEFGEFAARRIELGSD*
Ga0070717_1019833733300006028Corn, Switchgrass And Miscanthus RhizosphereVRADDGEFAFRRTEYDAQAAADGYRRLGGEFGEFAARRIERGSD*
Ga0066652_10078152113300006046SoilWAVRGDDGQFAFRRTEYDVERAATAYRELGGDFGQFAADRILRGSD*
Ga0070716_10169251313300006173Corn, Switchgrass And Miscanthus RhizosphereFRRTEYDVERAAAAYRAMGGDFGEFAARRIEKGSD*
Ga0070712_10177652723300006175Corn, Switchgrass And Miscanthus RhizosphereGAFEFRRCEYDVERAAAGYRSMGGNFGEFAARRIEKGSD*
Ga0074051_1178451413300006572SoilGELEFRRTEYDVERAADAYRAMSGRFGPLAARRIERGSD*
Ga0074055_1155976623300006573SoilDGDFEFRRTEYDVPRAAAGYRSMGGDFGEFAARRIERGSD*
Ga0079222_1229278913300006755Agricultural SoilDFELRRTEYDIQRAADGFRALGGDFAEFAANRIERGSD*
Ga0066653_1005212633300006791SoilEFRFERTDYDVERAAEAYRSLGGGFGDMAARRIERGSD*
Ga0075436_10014198443300006914Populus RhizosphereWATWNGDFTFRRTDYDFEAAAAAYRAMGGEFGEFAARRLEKGSD*
Ga0099827_1119332933300009090Vadose Zone SoilWATWDGDFVFRRTEYDVARAAAGYRSLAGEFGEFAARRIERGSD*
Ga0075418_1128873833300009100Populus RhizosphereFRRTEYDVERALAGWRAVPGPFGEMVTHRIEHGSD*
Ga0066709_10025720143300009137Grasslands SoilWDGDFTFRRTEYDVAPAAAAYRAMGGDFGEFAARRIEKGSD*
Ga0066709_10048498213300009137Grasslands SoilDDGEFEFRRTEYDVERAAAAWRKLGGDFGTFAAARIERGSD*
Ga0066709_10122976633300009137Grasslands SoilRRTEYAVARAAEGYRSMGGDFGEFAARRIERGSD*
Ga0105248_1287430123300009177Switchgrass RhizosphereWATWDGDFQLRRTEYDVARAAEGYRALAGDFGEFAAARIERGSD*
Ga0105238_1099693313300009551Corn RhizosphereWDGDFELRRTEYEVARAAAGYRSMGGDFGEFAARRIERGSD*
Ga0126374_1018395813300009792Tropical Forest SoilDFTFRRTAYNAEPAAAAYRSMGGDFGEFAARRIERGSD*
Ga0126309_1077926523300010039Serpentine SoilATWDGDFEFRRTEYDVARAAAGYAGLGGNFGEFAAARIRKGSD*
Ga0126314_1013305333300010042Serpentine SoilFEFRRTEYDTERAAAEWRVLASPWCEQAAGRIERGSD*
Ga0126382_1158447713300010047Tropical Forest SoilRRTEYDVQRAADAYRRMGGSFGEFASRRIERGSD*
Ga0126319_144020533300010147SoilAVRYDDGQFEFRRTEYDNQRAADAYRKLGGEFGEFAARRLERGSD*
Ga0134086_1004622413300010323Grasslands SoilRRTAYDLEPAAAAYRAMGGDFGEFAARRIEKGSD*
Ga0134064_1034499013300010325Grasslands SoilFEFRRTEYDVERAAAAYRAMGGEFGEFAARRIERGSD*
Ga0134065_1017096313300010326Grasslands SoilWATWEGDFAFRRTAYDVEAPAAAYRAMGGDFGEFAARRIEKGSD*
Ga0134080_1043224133300010333Grasslands SoilFERTEYDVQRAADAYRSMGGGFGDMAAGRIERGSD*
Ga0126376_1116471133300010359Tropical Forest SoilVRTPEGDLEFRRTEYDVERAAEAYEALGGRFGRFMGRRIRRGSD*
Ga0105239_1018611843300010375Corn RhizosphereWATVEGDFVLRRTEYDVRRAADAYRALGGRFGEFMGRRIERGSD*
Ga0150985_11434225613300012212Avena Fatua RhizosphereGEFEFRRTEYDNERAAAAFRELGGNFGEMVAGRIERGSD*
Ga0150985_11770564433300012212Avena Fatua RhizosphereDGAFGLRRTEYDVQRAADAYRALGGGFGEMVANRIEKGSD*
Ga0150984_10165034223300012469Avena Fatua RhizosphereAVRTEGEFAFRRTTYDVERAVDGFRRLGGGFGAMVVRRLERGSD*
Ga0150984_12286657313300012469Avena Fatua RhizosphereATWDSDFAFRRTDYDVARAADGYRVMGGNFGEMASRRIERGSD*
Ga0157314_104903013300012500Arabidopsis RhizosphereRRTEYDVERAAAGYRSLDGDIAEFAASRIERGSD*
Ga0157282_1000247543300012904SoilAWATVEGDFVLRRTEYDVRRAADAYRALGGRFGEFMGRRIERGSD*
Ga0164300_1033542413300012951SoilTWNGDFAFRRTEYDVERAAATYRAMGGDFAEFAARRIEKGSD*
Ga0164300_1066309423300012951SoilFRRTEYDNQRAADAYRKLGGEFGEFAARRLERGSD*
Ga0164309_1115263733300012984SoilDFEFRRTEYDVARAAAGYRSMGGEFGEFAATRIERGSD*
Ga0134089_1031500413300015358Grasslands SoilPQPGEFEFRRCEYDVERAADGYRRMGGDFGEMAAGRIERGSD*
Ga0132258_1158029943300015371Arabidopsis RhizosphereDGDFELRRTGYDVARAAAGYRSLGGDFGEFAARRIERGSD*
Ga0132257_10256313923300015373Arabidopsis RhizosphereWDGDFEFRRTEYDVARAAAGYRSMGGDFGEFAARRIERGSD*
Ga0134083_1056830013300017659Grasslands SoilAAWATWAGDFEFRRTEYDVQRAADGYRAMGGDFGEFAATRIERGSD
Ga0184608_1031832823300018028Groundwater SedimentEFHRTAYDVARAAAGYRSMGGEFGEFASRRIERGSD
Ga0184619_1008574613300018061Groundwater SedimentGEFEFRRTEYDVEQAAAGYRSMGGEFGEFAARRIERGSD
Ga0184640_1036421323300018074Groundwater SedimentDFTFRRVEYDWQRAAAGFRRMGGDFGEFAAVRIERGSD
Ga0066655_1075623123300018431Grasslands SoilRPGEFWFERTDYDVERAAEAYRLMGGQFGEFAARRIERGSD
Ga0066669_1085536413300018482Grasslands SoilSAEPGEFEFRRCEYDVERAADGYRRMGGDFGEFAARRIERGSD
Ga0173482_1009006913300019361SoilTFDDDDFTFRRTEYDTQRAADAYRAMGGAFGEMAGNRIERGSD
Ga0210382_1045462923300021080Groundwater SedimentVRTDGGEFEFRRTEYDVEKAAAGYRSMGGEFGEFAARRIERGSD
Ga0210382_1052838313300021080Groundwater SedimentWDGDFDFRRTDYDVARAAAGYRSLAGDFGEFAARRIEKGSD
Ga0222622_1022902013300022756Groundwater SedimentWATRDGDFEFRRTEYDVARAAAGYRSMGGDFGEFAANRIERGSD
Ga0247795_101451813300022899SoilTVEGDFVLRRTEYDVRRAADAYRALGGRFGEFMGRRIERGSD
Ga0247790_1011519413300022915SoilDFILRRTEYDVRRAADAYRALGGRFGEFMGRRIERGSD
Ga0247664_111296213300024232SoilWAVRRDDGDFEFRRAEYDVERAAAGWRTLGSDFGELAARRVERGRD
Ga0210120_111923523300025556Natural And Restored WetlandsAFEFRRTEYDNQRAAAAYRELAGDFGAMAAGRILRGSD
Ga0207687_1060935433300025927Miscanthus RhizosphereFRRTEYDVARAAAGYRSMGGDFSEFAARRIERGSD
Ga0207700_1018051433300025928Corn, Switchgrass And Miscanthus RhizosphereSAWATFDDHFTFRRTEYDVERAAEAYRGMGGAFGEFAANRILRGSD
Ga0207691_1118826113300025940Miscanthus RhizosphereADDGEFAFRRTEYDAQAAADGYRRLGGEFGEFAARRIERGSD
Ga0207689_1017179533300025942Miscanthus RhizosphereAVRDDEGRFEFRRTEYDNERSAAAYLALGGGFGEMVARRLERGSD
Ga0207667_1157585723300025949Corn RhizosphereDGQFEFRRTSYDNERAAHAYRKLGGEFGEFAARRIERGSD
Ga0207648_1165933123300026089Miscanthus RhizosphereFRRTDYDVERAAAAYRQMRGDFGEFAANRIERGSD
Ga0209687_103263543300026322SoilWATWDGDFTFRRTEYDVEPAAAAYRAMGGDFGEFAARRIEKGSD
Ga0209267_131534813300026331SoilQPGEFEFRRCEYDVERAAEAYRAMGGDFGAFAARRIERGSD
Ga0209803_127242213300026332SoilWDGDFTFRRTEYDFEAAAAAYRAMGGDFAEFAARRIEKGSD
Ga0209159_123546013300026343SoilAVSERPGEFLFERTDYDVERAAEAYRLMGGQFGEFAARRIERGSD
Ga0209795_1012139913300027718AgaveLEDGEFAFRRTEYDVERAAEAYRRMGGAFGEMAAARIEKGSD
Ga0247683_100415713300027991SoilTFRRTEYDVERAAEAYRGMGGAFGEFAANRILRGSD
Ga0307313_1023019713300028715SoilEFRRTEYDVEKAAAGYRSMGGEFGEFAARRIVRGSD
Ga0307301_1016027123300028719SoilRTDDGEFQFRRTEYDADSAAAAYRRMGGGFGEMAAKRIEKGSD
Ga0307301_1018844833300028719SoilTWDGDFEFRRTDYDVARAAEGYRSMGGDFGEFAARRIERGSD
Ga0307317_1002324113300028720SoilWAIQDYDGEFAFRRTQYDVERTAAAYRQMPGDFGEFAANRIERGSD
Ga0307315_1029154413300028721SoilDDGVFEFRRTAYDNQRAADAYRKLGGDFGEFAARRLERGSD
Ga0307318_1024375313300028744SoilWAVRTDDGEFQFRRTEYDADSAAEAYRRMGGGFGEMAARRIEKGSD
Ga0307280_1019807713300028768SoilTRDGDFEFRRTEYDVARAAAGYRSMGGDFGEFAANRIEKGSD
Ga0307320_1005170813300028771SoilTWDYDFTFRRTEYDVERAAAAYCALGGAFGEMAARRIEHGSD
Ga0307305_1007205833300028807SoilAVRTDDGEFQFRRTEYDADSAAAAYRRMGGGFGEMAAKRIEKGSD
Ga0307310_1016354513300028824SoilDDGEFQFRRTEYDADSAAEAYRRMGGGFGEMAARRIEKGSD
Ga0307312_1048698233300028828SoilDFEFRRTEYDVARAAEGYRSMGGDFGEFAARRIERGSD
Ga0307300_1001473013300028880SoilTWEGDFAFRRTEYDVERAAAAYRSLGGPFGEMAANRILKGSD
Ga0307277_1002399813300028881SoilFDFRRSAYDVARAAAGYRSMSGEFGQFAAGRIERGSD
Ga0247826_1025145413300030336SoilEGDFVLRRTEYDVRRAAEAYRALGGRFGEFMGRRIERGSD
Ga0247826_1071673013300030336SoilFEFRRTDYDTERAAAGFRTMGGELAEWAANRILRGSD
Ga0308204_1023859223300031092SoilFAFRRTEYDVERAAAAYRQMRGDFGKFAANRIERGSD
Ga0318516_1021776033300031543SoilRDDGEFEFRRTEYDVARAIEGWRRLGAGFPELAARRLELGRD
Ga0318534_1021191013300031544SoilWAVRRDDGEFEFRRTEYDVARAIEGWRRLGAGFPELAARRLELGRD
Ga0318563_1020872233300032009SoilDDGEFEFRRTEYDVARAIEGWRRLGAGFPELAARRLELGRD
Ga0307471_10321271013300032180Hardwood Forest SoilDGHLTFRRTEYDVERAAEAYRGMGGAFGEFAANRILRGSD
Ga0334913_088871_505_6393300034172Sub-Biocrust SoilVRADDGELVFRRTEYDVERAVDAYRRMGGDFGGMAARRLERGSD
Ga0373948_0135566_479_6043300034817Rhizosphere SoilFDGHFTFRRTEYDVERAAEAYRGMGGAFGEFAANRILRGSD
Ga0373950_0001343_3070_31893300034818Rhizosphere SoilGDFTLRRTEYDAEAAAAAYRSMGGDFGEFAARRLERGSD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.