NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F105119


Overview

Basic Information
Family ID F105119
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 43 residues
Representative Sequence MSAVAVPARAQRRNVLASVWAFVRRHVLTVYSILFFAYLLLPI
Number of Associated Samples 91
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 96.00 %
% of genes near scaffold ends (potentially truncated) 97.00 %
% of genes from short scaffolds (< 2000 bps) 88.00 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (58.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(24.000 % of family members)
Environment Ontology (ENVO) Unclassified
(35.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.000 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family. The 100 aligned sequences carry the same IDs, in the same order, as the rows under Family Sequences.



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 46.48%, β-sheet: 0.00%, Coil/Unstructured: 53.52%
Feature Viewer (interactive plot along the 43-residue representative sequence). Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score, TM segments, Topol. domains (visible label: Extracel.).

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.41

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
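
The rule in the note above (pTM < 0.7 means a low confidence model) and the pLDDT legend bins can be written as two small checks. This is a minimal sketch, not the database's own code: the function names are hypothetical, and treating each bin's upper bound as inclusive is an assumption.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the note above

# Upper bounds of the pLDDT legend bins (0-50, 51-70, 71-90, 91-100).
# Treating each upper bound as inclusive is an assumption.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the model falls below the pTM cutoff."""
    return ptm < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to its legend bin label."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: all values up to 100 are binned")


if __name__ == "__main__":
    # pTM-score reported for family F105119
    print(is_low_confidence(0.41))  # True
```

With the reported pTM-score of 0.41, the family falls on the low confidence side of the cutoff, which matches the note.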



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Chart (All Organisms): Unclassified 58.0%; remaining 42.0%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Chart of associated habitat types. Labels shown: Groundwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Unplanted Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Tropical Peatland; Corn Rhizosphere; Switchgrass Rhizosphere; Arabidopsis Rhizosphere; Avena Fatua Rhizosphere; Miscanthus Rhizosphere; Populus Rhizosphere; Rhizosphere.
Slice percentages as extracted (not matched to labels): 24.0%, 4.0%, 6.0%, 4.0%, 10.0%, 3.0%, 3.0%, 3.0%, 3.0%, 3.0%, 3.0%, 3.0%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
ARcpr5yngRDRAFT_0105052 | 3300000043 | Arabidopsis Rhizosphere | MTSVAVEAKQPRRSALSATWAFVKRHVLTVYSLLFFVYLLLPIAIVVVF
JGI10216J12902_1000328662 | 3300000956 | Soil | MSAVAVEARRPRKSALATAWAFVKHHALTVYSLLFFAYLLLPIG
JGI10216J12902_1071883192 | 3300000956 | Soil | MSVVAVRTGRPRRSVLAAVWAFVKRYALAVYSLLFFAYLMLPI
C688J14111_100678473 | 3300001305 | Soil | VQRSRPRRGVLASTWAFVKRHVLTVYSILFFLYLLL
C688J14111_100982881 | 3300001305 | Soil | MSAVAVERHTARPSAVSRVLAFVRRHVLTVYSILFFIYLLLPI
Ga0066680_106004552 | 3300005174 | Soil | MSSVAVPALSRRYTLATTLRFVRRHLLTAYSILFFAYLLLP
Ga0070670_1007513602 | 3300005331 | Switchgrass Rhizosphere | MSLVAVPAQGRASAAVKVWAFVKHHLLAVYSMLFFLYLLLP
Ga0066388_1040472161 | 3300005332 | Tropical Forest Soil | VSAVAVQRSRPKRGVLASVWTFVKRNVLTVYSILFFIYLLLPI
Ga0070689_1018788212 | 3300005340 | Switchgrass Rhizosphere | MSAVAVERARTRRSAFATAWAFVKRHVLTVYSILFFVYLLVPIAVVAVFSFNN
Ga0070700_1011566522 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAVAVERARTRGSALATAWAFVKRHLLTVYSILFFAYLLLPIAV
Ga0070708_1020469792 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LSAVAVQRNRPRRSVLASVWAFVKRNVLTVYSILFFIYLMLPIAVV
Ga0070678_1006209892 | 3300005456 | Miscanthus Rhizosphere | LSAVAVQPNRPRRGVLASTWTFVKRHVLTVYSVLFFVYLLLPIAVVV
Ga0070681_112501541 | 3300005458 | Corn Rhizosphere | MSAVAVERRERRNPLAGIWAFVKRNVLTVYSILFFVYLLLPIAVVIVF
Ga0066697_102750961 | 3300005540 | Soil | MSSIAVPTAREPNVLATVLAFVRRHVLTVYSFLFFAYLLLPIAV
Ga0070664_1020229231 | 3300005564 | Corn Rhizosphere | VSAVAVEGGRARGGALSGAWAFVKRHVLTVYSILFFAYLLLPIAVVAVF
Ga0066905_1006814242 | 3300005713 | Tropical Forest Soil | MSTVAVPAPRERNVLASVWAFVRRHVLTVYSILFFIYLLL
Ga0066903_1005443571 | 3300005764 | Tropical Forest Soil | MSAVAVPARAQRRNVLASVWAFVRRHVLTVYSILFFAYLLLPI
Ga0066651_105284872 | 3300006031 | Soil | MSSIAVPRQRERNVLAAAFGFVRYHVLTVYSLLFFA
Ga0074059_120902512 | 3300006578 | Soil | MSTVAVPARRERRGVLSAVWAFVKRHVLTVYSILFFAYLLLPIAVVVV
Ga0074054_119966121 | 3300006579 | Soil | MTTVVVEAKQPRGSALSATWAFVKRHILTVYSLLFFA
Ga0074049_130372362 | 3300006580 | Soil | MTTAVVEAGRPRRSVLARTWAFVKRHILTVYSLLFFAYL
Ga0074060_118206461 | 3300006604 | Soil | MSTVAVPARRERRGVLSAVWAFVKRHVLTVYSILFFAYLLLPI
Ga0066653_100191981 | 3300006791 | Soil | MSAVAATAQPRARNFAASAWAFVKHHVLTVYSILFFLYLLLP
Ga0066658_103512782 | 3300006794 | Soil | MTSVAVSAPRRRSATSVWAFARRNVLTVYSILFFAYLLL
Ga0075419_105840631 | 3300006969 | Populus Rhizosphere | MSTVAVERATRTNVLSRVWTFVKHHILTVYSILFFVYLLLPIAVVVVFSFNNP
Ga0105251_105242362 | 3300009011 | Switchgrass Rhizosphere | MSAVAVERARTRRSAFATAWAFVKRHVLTVYSILFFVYLLVPIAVVAVFSFNNP
Ga0105248_106550173 | 3300009177 | Switchgrass Rhizosphere | MSAVAGERGRTRGSAFATAWAFVKRHVLTVYSILFFVYLL
Ga0126380_122327041 | 3300010043 | Tropical Forest Soil | LSAVAVQRSRPGRGVLASTWAFVKRHVLTVYSILFFVYLLLPIAVVVL
Ga0127449_10014091 | 3300010117 | Grasslands Soil | MSAVAVERRAGRPSVLARAWAFVRHHLLTVYSILFFAYLLLPIAVVV
Ga0134065_104880261 | 3300010326 | Grasslands Soil | LTSVAVQRSRPRRGVLASTWAFVKRHVLTVYSILFFAY
Ga0134071_106072213 | 3300010336 | Grasslands Soil | MSSIAVPARREGGNALRSVLAFIRRHVLTVYSLLFFAYLLLPIAV
Ga0126376_116686871 | 3300010359 | Tropical Forest Soil | MSAVAVQRGRPKRGVLASTWTFVKRHVLTVYSILFFIYLLLPIAVVNSK*
Ga0105239_101168591 | 3300010375 | Corn Rhizosphere | LSAVAVQPNRPRRGVLASTWTFVKRHVLTVYSVLF
Ga0137364_101811871 | 3300012198 | Vadose Zone Soil | MSSIAVDQRAGRSPAAAALAFVRRHILTAYSVLFFAYLLLPIAVVVVF
Ga0137365_105540441 | 3300012201 | Vadose Zone Soil | MSSIAVSAQREGNVLRTSLAFVRRHVLTVYSLLFFAYLLL
Ga0137380_115861581 | 3300012206 | Vadose Zone Soil | MSSIAVEQRAGRGPAAAALAFVRRHILTVYSVLFFAY
Ga0137378_100660334 | 3300012210 | Vadose Zone Soil | MSAVAVEGGRERHLATTLWAFVRRHLPTLYSFLFFAYLLLPIA
Ga0137378_113500962 | 3300012210 | Vadose Zone Soil | LSAVAVDRSHSRRSTPAAAWAFVKRHVLTLYSALFFAYLM
Ga0137377_114556781 | 3300012211 | Vadose Zone Soil | LSAVAVERSWPRRRVLASTWAFVKRHVLTVYSVLFFLYL
Ga0150985_1152145163 | 3300012212 | Avena Fatua Rhizosphere | VNAVAVERHTVRPNAVSRALAFVRRHVLTVYSILFFVYLLLPIAVVVVFS
Ga0157343_10249581 | 3300012488 | Arabidopsis Rhizosphere | MSAVAATTASPRRNPLAAVWAFVRHHVLTVYSILFF
Ga0157350_10443762 | 3300012499 | Unplanted Soil | MSTVAVERATRTNVLSRVWTFVKHHILTVYSILFFVYLLLPIAVVVVF*
Ga0157285_102700982 | 3300012897 | Soil | MSAVAVERARTRGSALATAWAFVKRHVLTVYSLLF
Ga0157301_102455872 | 3300012911 | Soil | MSAVAVERARTRGSALATAWAFVKRHVLTVYSHLFFAYLLLPIAVI
Ga0164301_112087752 | 3300012960 | Soil | MSAVAVERRERRNPLAGIWAFVKRNVLTVYSILFFVYLLLPIAVVIVFSF
Ga0164309_114933711 | 3300012984 | Soil | MSSIAVPARSRRNTLATTLGFVRRHLLTAYSILFFAYLLLPIAVVILFS
Ga0164304_102226753 | 3300012986 | Soil | MSAVAVEQRRARAGFLTDAWAFVKHHVLTVYSILFFAYLLLPIGVVVLFSFN
Ga0157378_127375092 | 3300013297 | Miscanthus Rhizosphere | LSAVAVQPNRPRRGVLASTWTFVKRHALTVYSVLFFVYLLLP
Ga0134079_101896292 | 3300014166 | Grasslands Soil | MSSIAVPARREGSNALRTVLSFIRRHVLTVYSLLFFVY
Ga0163163_120647052 | 3300014325 | Switchgrass Rhizosphere | MSVVAVRTGRPRRSVLAAVWAFVKRYALAVYSLLFFAYLML
Ga0182008_104074561 | 3300014497 | Rhizosphere | MSAVAVPAGSRVGAAAGVWAFVKRHVLTVYSILFFLYLLLPIGV
Ga0157377_113175231 | 3300014745 | Miscanthus Rhizosphere | MSAVAATTASPRRNPLAAVWAFVRHHVLTVYSILFFV
Ga0157376_117435772 | 3300014969 | Miscanthus Rhizosphere | MSAVAVERSRPKRGVLASTWAFVKRHVLTVYSILFFIYLLLPIAVVVVF
Ga0132256_1036955961 | 3300015372 | Arabidopsis Rhizosphere | MTTATEAIQAPAQRVRTSPLAFVRRHVLTVYSLLVFAYLLLPIVIVV
Ga0132255_1011827101 | 3300015374 | Arabidopsis Rhizosphere | VSAVAAERSRPRRGALASTWAFVKRNVLAAYSILFFIYLLLPIAVVVAFSF
Ga0187785_106081962 | 3300017947 | Tropical Peatland | VSAVAAERPPAGRSFAASTWDLVKRHVLTVYSILFFV
Ga0184617_10916821 | 3300018066 | Groundwater Sediment | MSAVAVEAQRERRSFLAAAWAFVKHHILTLYSILFFAYLLIPIAV
Ga0066667_100145686 | 3300018433 | Grasslands Soil | MSSIAVPRQRERNVLAAAFGFVRYHVLTVYSLLFFAYLL
Ga0066667_107834602 | 3300018433 | Grasslands Soil | MSSAAAAVRSRPSVAAGVWAFARHHVLTVYSILFFLYLLLPIAI
Ga0066669_117315621 | 3300018482 | Grasslands Soil | MSSIAVPRQRERNVLAAAFGFVRYHVLTVYSLLFFAYLLLPIAIVVV
Ga0193704_10402302 | 3300019867 | Soil | MSSIAVPARSRRNPLATTLGFVRRHLLTAYSILFFAYLLLPIAVVILFS
Ga0193722_10427571 | 3300019877 | Soil | MSAVAVATQPRRGLLAAAWAFVRHHLLTLYSFLFFAYLLFPI
Ga0193722_11484791 | 3300019877 | Soil | MSSIAVPARSRRNPLATTLGFVRRHLLTAYSILFFAY
Ga0193755_10864501 | 3300020004 | Soil | MSTVAVPARRERPGFLAAAWAFVKRHVLTVYSILFFAYLLLPIAVVVVFSF
Ga0210381_100878021 | 3300021078 | Groundwater Sediment | MSSIAVDQQAGRRPAAAALAFVRRHILTVYSVLFFAYLLLPIAVV
Ga0222623_100077841 | 3300022694 | Groundwater Sediment | MSSIAVPARSRRNPLATTLGFVRRHLLTAYSILFFAYLL
Ga0247665_10463811 | 3300024219 | Soil | MSAVAVERARTRGSAPATAWAFVKRHVLTVYSVLFFAYLLLPIAVVA
Ga0207654_100356521 | 3300025911 | Corn Rhizosphere | LSAVAVQPNRPRRGVLASTWTFVKRHVLTVYSVLFFVYLLLPIAVV
Ga0207654_104936482 | 3300025911 | Corn Rhizosphere | VSAVAVEGGRARGGALSGAWAFVKRHVLTVYSILFFAYLLLPIAVV
Ga0207657_114553451 | 3300025919 | Corn Rhizosphere | LSAVAVQRNRPRRSVLASVWAFVKRNVLTVYSILFFIY
Ga0207659_103888103 | 3300025926 | Miscanthus Rhizosphere | LSAVAVQRNRPRRSVLASVWAFVKRNVLTVYSILFFIYLMLPIAVVVVFS
Ga0207701_114060851 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAVAVERRERRNPLAGIWAFVKRNVLTVYSILFFVY
Ga0207704_113830331 | 3300025938 | Miscanthus Rhizosphere | LSAVAVQPNRPRRGVLASTWTFVKRHVLTVYSVLFFVNLLLPIAVVVV
Ga0207665_102770863 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAVAVDQRRPRAGFFAAAWAFVKHHVLTVYSILFFGYLLLPIGV
Ga0207661_106561782 | 3300025944 | Corn Rhizosphere | MSAVAMPARQKANPVVSLWAFVRRHTLTVYSVLFF
Ga0207661_110798781 | 3300025944 | Corn Rhizosphere | MSAVAVERARTRGSAPATAWAFVKRHVLTVYSVLFFAY
Ga0207702_114745142 | 3300026078 | Corn Rhizosphere | MSAVAVERARTRGSIFATAWAFVKRHVLTVYSILFFVYLLVPIAVVAV
Ga0207641_122144451 | 3300026088 | Switchgrass Rhizosphere | MSAVAATTASPRRNPLAAVWAFVRHHVLTVYSILFFVYLLLPIAVVVLF
Ga0209468_10602121 | 3300026306 | Soil | MSAVAATAQPRARNFAASAWAFVKHHVLTVYSILFFLYLL
Ga0209472_10720511 | 3300026323 | Soil | MSSIAVPRQRERNVLAAAFGFVRYHVLTVYSLLFFAYLLLPIAIVVVFS
Ga0209057_11941341 | 3300026342 | Soil | MSSIAVPTAREPNVLATVLAFVRRHVLTVYSLLFF
Ga0207591_1012792 | 3300026827 | Soil | MSTVAVERATRTNVLSKVWTFVKHHILTVYSILFFVYLLLPIAV
Ga0307295_101620491 | 3300028708 | Soil | MSSIAVPARSRRNPLATTLGFVRRHLLTAYSILFFAYLLL
Ga0307298_100033311 | 3300028717 | Soil | MSAVAVEPRRERTNLLAKVLAFVRHHLLTAYSLLFFAYLLLPIAVVV
Ga0307307_100926611 | 3300028718 | Soil | LSSIAAPARRERSVLATVLAFVRRKALAVYSLLFFAYLLLPI
Ga0307282_101553741 | 3300028784 | Soil | MSSIAVPRQRERNVLAAALGFVRYHVLTVYSLLFFAYLLLPIAIVVV
Ga0307282_104214571 | 3300028784 | Soil | MSSIAAPVRRERSVLATVLAFVRRHALAAYSLLFFAYLLLPI
Ga0307284_100910521 | 3300028799 | Soil | MSSIAVPVRRQGNALAKTFAFVRRHVLTVYSLLFF
Ga0307284_102599341 | 3300028799 | Soil | LSSIAAPARRERSVLATVLAFVRRHALAAYSLLFFAYLLLPIAIVIAFS
Ga0307294_100049704 | 3300028810 | Soil | MSAVAVERARPNVFSKVWAFVRHHILTAYSILFFVYLLLPIAVVVV
Ga0307292_100233022 | 3300028811 | Soil | MSSIAVTRARERNVLGATLGFVRHHILTVYSLLFFAYLLLPIAIVVAF
Ga0307296_105223552 | 3300028819 | Soil | LSSIAAPARRERSVLATVLAFVRRKALAVYSLLFFAYLLLPIAIV
Ga0307310_100034218 | 3300028824 | Soil | MSAVAVESRPRRHVLAGAWAFVRRHILTVYSVLFF
Ga0307286_101405301 | 3300028876 | Soil | MSTVAVPARRERRGFLSAVWAFVKRHVLTVYSILFFAYLLLPIAVVV
Ga0307300_100122861 | 3300028880 | Soil | MSAVAVSAPRRSRGLARVWAFVRRHVLTVSAMLGFAYLL
Ga0308197_102244481 | 3300031093 | Soil | MSSIAVTRERERNVLGAALGFVRHHILSVYSLLFFA
Ga0307500_102467891 | 3300031198 | Soil | VSSATAVSARPRRSALANVWAFVRRHVLTAYSILFFIY
Ga0308175_1000302341 | 3300031938 | Soil | MSAIAVERARTRGSALATTWAFVKRHVLTVYSILFFAYLL
Ga0308176_104112721 | 3300031996 | Soil | VTTVAIERRPGRPSVLAKTWSFVRRHVLTAYSILFFVYLLLPIA
Ga0247830_117097362 | 3300033551 | Soil | MSTVAVPAGRERPGFLAAAWAFVKRHVLTVYSILFFAYLL
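
For downstream use, entries from the list above can be turned into FASTA records and summarized by length. This is a minimal sketch under stated assumptions: the three records are copied from the list as (protein ID, sample taxon ID, habitat, sequence) tuples, the function names and the header layout are hypothetical, and a trailing `*` is stripped on the assumption that it is the conventional stop marker rather than a residue.

```python
from statistics import mean

# Three entries copied from the list above:
# (protein ID, sample taxon ID, habitat, sequence)
ROWS = [
    ("Ga0066903_1005443571", "3300005764", "Tropical Forest Soil",
     "MSAVAVPARAQRRNVLASVWAFVRRHVLTVYSILFFAYLLLPI"),
    ("Ga0126376_116686871", "3300010359", "Tropical Forest Soil",
     "MSAVAVQRGRPKRGVLASTWTFVKRHVLTVYSILFFIYLLLPIAVVNSK*"),
    ("C688J14111_100678473", "3300001305", "Soil",
     "VQRSRPRRGVLASTWAFVKRHVLTVYSILFFLYLLL"),
]


def clean_sequence(seq: str) -> str:
    """Drop a trailing '*' (assumed stop marker) so only residues remain."""
    return seq.rstrip("*")


def to_fasta(rows) -> str:
    """Render rows as FASTA text; the header layout is an illustrative choice."""
    lines = []
    for protein_id, taxon_id, habitat, seq in rows:
        lines.append(f">{protein_id} taxon={taxon_id} habitat={habitat}")
        lines.append(clean_sequence(seq))
    return "\n".join(lines) + "\n"


def mean_length(rows) -> float:
    """Average residue count over the given rows."""
    return mean(len(clean_sequence(row[3])) for row in rows)


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
    print(f"mean length over {len(ROWS)} rows: {mean_length(ROWS):.1f}")
```

Run over all 100 entries, `mean_length` gives a figure that can be compared with the average sequence length reported under Basic Information.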




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice