NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F105824


Overview

Basic Information
Family ID F105824
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 46 residues
Representative Sequence MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGSSAVRALD
Number of Associated Samples 91
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 58.00 %
% of genes near scaffold ends (potentially truncated) 99.00 %
% of genes from short scaffolds (< 2000 bps) 96.00 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.22

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (85.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (18.000 % of family members)
Environment Ontology (ENVO) Unclassified (26.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (41.000 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all the sequences in the family.
Rows 1 to 100 of the alignment follow the order of the Family Sequences table.
Powered by MSAViewer



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 4.05%, β-sheet 5.41%, Coil/Unstructured 90.54%

Feature Viewer tracks over residues 1 to 46 of the representative sequence (MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGSSAVRALD): Sequence, α-helices, β-strands, Coil, SS Conf. score, Disordered Regions
Powered by Feature Viewer

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.22
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
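
The confidence conventions stated in this section (pLDDT bins of 0-50, 51-70, 71-90 and 91-100, and a pTM cut-off of 0.7 for low confidence models) can be expressed as a small helper. This is a minimal sketch only: the function names are hypothetical and not part of NMPFamsDB, and assigning non-integer pLDDT values between two bins to the upper bin is an assumption.

```python
from bisect import bisect_left

# Upper bounds of the pLDDT bins shown in the legend: 0-50, 51-70, 71-90, 91-100.
PLDDT_UPPER_BOUNDS = [50, 70, 90, 100]
PLDDT_BINS = ["0-50", "51-70", "71-90", "91-100"]

# Cut-off quoted on the page: models with pTM < 0.7 are low confidence.
PTM_CUTOFF = 0.7


def plddt_bin(score: float) -> str:
    """Return the legend bin for a per-residue pLDDT score (hypothetical helper)."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    # bisect_left finds the first bin whose upper bound is >= score.
    return PLDDT_BINS[bisect_left(PLDDT_UPPER_BOUNDS, score)]


def is_low_confidence_model(ptm: float) -> bool:
    """Apply the pTM < 0.7 rule quoted above (hypothetical helper)."""
    return ptm < PTM_CUTOFF


# The pTM-score reported for family F105824 is 0.22.
family_is_low_confidence = is_low_confidence_model(0.22)
```

With the reported pTM-score of 0.22, `family_is_low_confidence` evaluates to true, which agrees with the low quality model notice for this family.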



Gene Neighborhood

Neighboring Pfam domains





Phylogeny

NCBI Taxonomy

All Organisms: 85.0%
Unclassified: 15.0%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart legend: Peatland; Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Grasslands Soil; Peatlands Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Hardwood Forest Soil; Tropical Peatland; Forest Soil; Corn Rhizosphere; Agricultural Soil; Palsa; Miscanthus Rhizosphere; Arabidopsis Rhizosphere; Switchgrass Rhizosphere; Rhizosphere
Labelled slice values (pairing with legend entries not preserved): 3.0%, 5.0%, 3.0%, 8.0%, 3.0%, 5.0%, 3.0%, 18.0%, 4.0%, 4.0%, 5.0%, 4.0%, 3.0%, 4.0%, 3.0%
Powered by ApexCharts



Associated Samples


Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1012028192 | 3300002245 | Forest Soil | MVRQVNSVTVTDRRAAPSVGGSVRLTGVSKVFGRGRTAVRALDQV
JGIcombinedJ26739_1012883761 | 3300002245 | Forest Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRGES
JGIcombinedJ51221_101179792 | 3300003505 | Forest Soil | VSRVTVTDRRTAPPVEGVDGSVRLTGVSKVFGRGSSAVRALDKVSLEV
Ga0066674_101586652 | 3300005166 | Soil | MVRQVSSVTVADRRAALPVRGHRDSVRLTGVSKVFGRGESAVRALDNVSLEVPP
Ga0068868_1007474171 | 3300005338 | Miscanthus Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSLEVPPGE
Ga0070709_104466011 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSLEVP
Ga0070714_1008430501 | 3300005435 | Agricultural Soil | MVRQVNSVTVTDRRAAPPVGGIRESVRLTSVSKVFGRGSSAVRALDQVSLEV
Ga0070679_1008113651 | 3300005530 | Corn Rhizosphere | MVRQVNSVTVTDRRAAPQVGEPVKLTSVSKVFGRGGAAVRALDQ
Ga0066706_102471061 | 3300005598 | Soil | MVRQVNSVTVTDRRAAPPVGSQRDSVRLTAVSKVFGRGESAVRALDNVT
Ga0070717_114264501 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | VNRVTITDRRAAPPVAGAAGSVRLAGVSKVFGRGSSAVRALDDVSLEVA
Ga0075029_1013434281 | 3300006052 | Watersheds | VNRVTVTDRRAAPPIGGVGVSVRLAGVSKVFGRGASAVRALDQVSL
Ga0075019_109343551 | 3300006086 | Watersheds | VNRVTVADRRTAPPVGGVGVSVRLADVSKVFGRGASAVRA
Ga0075018_101664611 | 3300006172 | Watersheds | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGDSAVRALDNV
Ga0070716_1005423921 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MVRQVNSVTVTDRRAAPPVGGIRESVRLTSVSKVF
Ga0074055_100142053 | 3300006573 | Soil | VTVTNRRAAPPSQGSVRLTGVSKVFGRGSSAVRALDQVS
Ga0074047_115838071 | 3300006576 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRA
Ga0074049_128963711 | 3300006580 | Soil | MVRQVSSVTVTDRRAAPPVGGIRESVRLTSVSKVFGRGSSAVRALDQVS
Ga0099793_102715601 | 3300007258 | Vadose Zone Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVCKVFGRGESAVRALDHVTLEVTPGE
Ga0099792_104759092 | 3300009143 | Vadose Zone Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDKVSLE
Ga0105241_122152041 | 3300009174 | Corn Rhizosphere | MVRQVNSVTVTDRRAAPQVGEPVKLTSVSKVFGRGGAA
Ga0099796_100691582 | 3300010159 | Vadose Zone Soil | MVRQVSSVTVTDRRAAPPVGGQRESVRLTAVSKVFG
Ga0134084_102868282 | 3300010322 | Grasslands Soil | MVRQVSSVTVTDRRAAPPIGGQRDAVRLTGVSKVFGRGSSA
Ga0126372_103134043 | 3300010360 | Tropical Forest Soil | MVRQVSSVTVADRRAAPPVGGQRDPVRLTAVSKVFGRGESAV
Ga0134128_108938352 | 3300010373 | Terrestrial Soil | MVRQVSSVTVADRRAAPQAGLRRDSVRLTAVSKVFGRGESAVRALDNVSLEVPPG
Ga0134128_124277332 | 3300010373 | Terrestrial Soil | MVRQVNSVTVTDRRAAPPVGEPVTLTSVSKVFGRGSSAVRALDQVSLEVPPGEFT
Ga0126381_1004229731 | 3300010376 | Tropical Forest Soil | MVRHVSSVTVTDRRAAPPTGHAVRLTGVSKVFGRGSSAVR
Ga0136449_1022504511 | 3300010379 | Peatlands Soil | MVRQVSSVTVTDRRAAPPVAGAVRLTGVSKVFGRGSS
Ga0134123_100401211 | 3300010403 | Terrestrial Soil | MVRQVSSVTVADRRAAPPVGGQRDAVRLTGVSKVFGRGESAV
Ga0137382_104478172 | 3300012200 | Vadose Zone Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVTLE
Ga0137382_112174681 | 3300012200 | Vadose Zone Soil | MVRQVSTVTVTDRRAAPPTGDLREAVRLTDVSKVFGRGSSAVRALDNVSL
Ga0137376_103721901 | 3300012208 | Vadose Zone Soil | MVRQVSTVTVTDRRAAPPTGDLREAVRLTGVSKVFGRGSSAVRALDNVSLEVPP
Ga0137371_111880632 | 3300012356 | Vadose Zone Soil | MVRQVNSVTVTDRRAAPQAGLQRDSVRLTGVSKVFGRGASAVRALDQVSLEVP
Ga0157342_10225701 | 3300012507 | Arabidopsis Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVS
Ga0157371_102998251 | 3300013102 | Corn Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDN
Ga0157369_124873491 | 3300013105 | Corn Rhizosphere | MVRQVNSVTVTDRRAAPQVGEPVKLTSVSKVFGRGGAAVRA
Ga0137418_104420223 | 3300015241 | Vadose Zone Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVTLEVPPGEF
Ga0182005_11325321 | 3300015265 | Rhizosphere | MVRQVNSVTVTDRRAAPPAGGLRGAVRLTGVSKVFGRGESAVRA
Ga0132256_1009454572 | 3300015372 | Arabidopsis Rhizosphere | MVRQVNSVTVTDRRAAPQAGDLTGQGVTVRLTGVSKVFGRGSSAVRALDQVSLEVPPG*
Ga0182041_102733382 | 3300016294 | Soil | VTVADRRAAPPAEGTRDAVRLTGVSKVFGRGSSAVRALDQVSLEV
Ga0182041_120274432 | 3300016294 | Soil | MVRQVNSVTVANRRTAPPAAGVRGAVRLTRVSKVFGRGSSAVRALDQVSLEVPPG
Ga0187779_108067741 | 3300017959 | Tropical Peatland | MVRQVNSVAVSDERSAVGGSVRLTGVSKVFGRGSSAVR
Ga0187816_102700881 | 3300017995 | Freshwater Sediment | VSPVTVTDRPASAVGGSVQLTTVTKVFGRGSSAVRALDQ
Ga0187863_105292421 | 3300018034 | Peatland | MVRQVNSVTVTDRRAAPPVGGPVKLADVSKVFGQGSSAVRALDHISLEVSP
Ga0187769_109368421 | 3300018086 | Tropical Peatland | VTVTDRAAAPAVGEVKGAVRLAGVSKVFGRGSSAVRALDQVS
Ga0066655_105501451 | 3300018431 | Grasslands Soil | MVRQVSSVTVTDRRAAPPVGGSVRLTSVSKVFGRGASAVRALDQVSLEVRP
Ga0173482_101290451 | 3300019361 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVFGRGESAVRALDNVSLEVPPGE
Ga0210399_112928682 | 3300020581 | Soil | MVRQVNSVTVTDRRAAPPTGDLRGMVRLTGVSKVFGRGGSAVRALDQVS
Ga0210404_107602181 | 3300021088 | Soil | MVRQVSSVTVTDRRAAPPVTGQRDSVRLTAVSKVFGRG
Ga0210409_106840942 | 3300021559 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGE
Ga0126371_130145701 | 3300021560 | Tropical Forest Soil | MVRQVNSVTVTDRRAAPPAGGLRGTVRLTSVSKVFGRGSSAVRA
Ga0126371_135464351 | 3300021560 | Tropical Forest Soil | MVRQVSSVTVTDRRAAPPVGRHRDPVRLTAVSKVFGRG
Ga0126371_138192561 | 3300021560 | Tropical Forest Soil | VTVTDRRAAPPTGHAVRLTGVSKVFGRGSSAVRALDQVSLE
Ga0224712_103761681 | 3300022467 | Corn, Switchgrass And Miscanthus Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALD
Ga0242662_102210732 | 3300022533 | Soil | MMRQVNSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRG
Ga0247688_10439842 | 3300024186 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDTV
Ga0247677_10600312 | 3300024245 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESA
Ga0207713_12242211 | 3300025735 | Switchgrass Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGES
Ga0207680_103985502 | 3300025903 | Switchgrass Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVF
Ga0207647_100346521 | 3300025904 | Corn Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAV
Ga0207700_111626821 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MVRQVNSVTVTDRRAAPPVGGSVTLTSVSKVFGRGSSA
Ga0207664_108566911 | 3300025929 | Agricultural Soil | MVRQVNSVTVTDRRAAPPVGGIRESVRLTSVSKVFGRGSSAVRALDQVSLEVPP
Ga0207664_111421701 | 3300025929 | Agricultural Soil | MVRQVNSVTVTDRRAAPPVGGSVTLTSVSKVFGRGSSAVRALDQVSLEVPPG
Ga0207711_106718461 | 3300025941 | Switchgrass Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVFGRGESAVRALDNVS
Ga0207679_111875502 | 3300025945 | Corn Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVF
Ga0207658_106350291 | 3300025986 | Switchgrass Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVFGRGGS
Ga0207639_108218851 | 3300026041 | Corn Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVFGRGG
Ga0207702_111614442 | 3300026078 | Corn Rhizosphere | MVRQVSSVTVADRRAAPPVGGQRDSVRLTGVSKVFGRGESAVR
Ga0209154_12958372 | 3300026317 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDKVS
Ga0257156_10356902 | 3300026498 | Soil | MVRQVSSVTVTDRRAAPPVGGQRDPVRLTAVSKVFGRGESA
Ga0209325_10040911 | 3300027050 | Forest Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGSSAVRALD
Ga0209522_10279422 | 3300027119 | Forest Soil | MVRQVNSVTVTDRRAAPPTGDLRGMVRLTGVSKVFGRGGSAVRALDQVSLEVPP
Ga0209274_106899821 | 3300027853 | Soil | MMRQVNSVAVTDRRAAPPVAGTVRLTGVSKVFGRGNSAVRAL
Ga0209275_104565122 | 3300027884 | Soil | MVRQVSSVTVTDRRAAPSVKGQRDSVRLTGVSKVFGRG
Ga0307303_101803501 | 3300028713 | Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRG
Ga0302232_105712462 | 3300028789 | Palsa | MVRQVISVAVTDRKAAPPVGTSVRLTGVSKVFGRGTSAVRALEDVS
Ga0302306_100618413 | 3300030043 | Palsa | MVRQVSSVAVTERRAPATAVATGAVRLTGVSKVFGRGSSAVQALDQVSLEV
Ga0311355_109501861 | 3300030580 | Palsa | MVRQVSSLAVTDHRVPATAKSTATGAVRLTGVSKVFGRGSSAVRALDQVSLEVA
Ga0308190_10124121 | 3300030993 | Soil | MVRQVSSVTVTDRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSL
Ga0302324_1011409841 | 3300031236 | Palsa | MVRQVSSVTVTDRQTAPPIGTSVRMTGVSKVFGRG
Ga0308194_100291661 | 3300031421 | Soil | MVRQVSSVTVADRRAAPPVGGERDSVRLTAVSKVF
Ga0318516_106041121 | 3300031543 | Soil | MVRQVNSVTVTNRRTAPPAEDVRGAVRLTRASKVFGRGSSAVRALDQVSLEVPPGE
Ga0318561_107993781 | 3300031679 | Soil | MVRQVNSVTVTNRRTAPPAEDVRGAVRLTRASKVFGRGSSAVRALDQVSLEVP
Ga0310813_105678052 | 3300031716 | Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSLEV
Ga0318493_105273611 | 3300031723 | Soil | MVRQVNSVTVTDRRAAPSAESMRGAVRLAGVSKVFGRGSSAEH
Ga0318526_100578431 | 3300031769 | Soil | MVRQVNSVTVTDRRAAPPAESTRGAVRLAGVSKVFGRGSSAVRALDQVSLEVPP
Ga0318498_101875761 | 3300031778 | Soil | MVRQVNSVTVTNRRTAPPAEDVRGAVRLTRVSKVFGRGS
Ga0318508_11867361 | 3300031780 | Soil | MVRQVNSVTVTDRRAAPSAESMRGAVRLAGVSKVFGRGSSA
Ga0307473_115494642 | 3300031820 | Hardwood Forest Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSL
Ga0318551_103728352 | 3300031896 | Soil | MVRQVNSVTVTDRRAAPSAESMRGAVRLAGVSKVFGRGSSAVRALDQVSLEVP
Ga0318520_101881281 | 3300031897 | Soil | MVRQVNSVTVTDRRAAPSAESMRGAVRLAGVSKVFGRGSSAVRALDQVSLE
Ga0310916_105824361 | 3300031942 | Soil | MVRQVNSVTVTDRRAAPPAESMRGAVRLASVSKVFGRGSSAVRALDQVSLEVPPGEFI
Ga0318558_101935971 | 3300032044 | Soil | MVRQVNSVTVTDRRAAPPAESTRGAVRLAGVSKVFG
Ga0318533_110917962 | 3300032059 | Soil | MVRQVNSVTVTDRRAAPPAESMRGAVRLAGVSKVFGRGSSAVRALDQ
Ga0307471_1028375341 | 3300032180 | Hardwood Forest Soil | MVRQVSSVTVADRRAAPPVGGQRDSVRLTAVSKVFGRGESAVRALDNVSLEVPPGEFT
Ga0306920_1008458771 | 3300032261 | Soil | MVRQVNSVTVADRRTAPQAGDLRGQGVTVRLTGVSKVFGRGSSAVRALDQVSLEVPPG
Ga0306920_1010710782 | 3300032261 | Soil | MVRQVSSVTVTDRRAAPPVGGSVRLTSVSKVFGRGSSA
Ga0335078_102215691 | 3300032805 | Soil | VNRVTVTGRRAAPPAEKVKGAVRLTGVSKVFGRGSSAVRALDQVS
Ga0335078_117086101 | 3300032805 | Soil | MVRQVSSVTVTDRRTAPLAGGSVRLASVSKVFGRGGSAVRALDQVSLEVPPGEF
Ga0335080_111243162 | 3300032828 | Soil | MVRQVNSVTVTDRRAAPPAGDLRGHGVAVRLTSVSKVFGRGSSAVRALDQVSLEVPPGE
Ga0335076_110793632 | 3300032955 | Soil | MVRQVNSVMVTDRRAAPQAGGLRGQGATVRLTGVSKVFGRGNSVVRALDQVSL
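
For downstream analysis, rows of the Family Sequences table can be written out as FASTA records. The following is a minimal sketch using three rows copied from the table. The `FamilyMember` type, the `to_fasta` function, and the header layout (`taxon=` and `habitat=` fields) are illustrative assumptions, not the format of the NMPFamsDB download files.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (hypothetical container)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Three rows copied from the table above.
ROWS = [
    FamilyMember("JGIcombinedJ26739_1012028192", "3300002245", "Forest Soil",
                 "MVRQVNSVTVTDRRAAPSVGGSVRLTGVSKVFGRGRTAVRALDQV"),
    FamilyMember("Ga0066674_101586652", "3300005166", "Soil",
                 "MVRQVSSVTVADRRAALPVRGHRDSVRLTGVSKVFGRGESAVRALDNVSLEVPP"),
    FamilyMember("Ga0075019_109343551", "3300006086", "Watersheds",
                 "VNRVTVADRRTAPPVGGVGVSVRLADVSKVFGRGASAVRA"),
]


def to_fasta(members: Iterable[FamilyMember], width: int = 60) -> str:
    """Render members as FASTA text, wrapping sequence lines at `width`."""
    lines = []
    for m in members:
        # Header layout is an assumption chosen for readability.
        lines.append(f">{m.protein_id} taxon={m.taxon_id} habitat={m.habitat}")
        for start in range(0, len(m.sequence), width):
            lines.append(m.sequence[start:start + width])
    return "\n".join(lines) + "\n"


fasta_text = to_fasta(ROWS)
# Mean length of the selected rows, comparable to the family-level
# "Average Sequence Length" reported in the Overview.
mean_length = sum(len(m.sequence) for m in ROWS) / len(ROWS)
```

Keeping the taxon ID and habitat in the header line lets each record be traced back to its sample after the sequences are passed to other tools.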




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice