NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F102129


Overview

Basic Information
Family ID: F102129
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 102
Average Sequence Length: 42 residues
Representative Sequence: IASRYLGPEAGAAYAERGNDDLLIRLEPGDLRAWDFADDL
Number of Associated Samples: 95
Number of Associated Scaffolds: 102

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 2.94 %
% of genes near scaffold ends (potentially truncated): 96.08 %
% of genes from short scaffolds (< 2000 bps): 92.16 %
Associated GOLD sequencing projects: 94
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.39

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group: Bacteria (87.255 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (11.765 % of family members)
Environment Ontology (ENVO): Unclassified (25.490 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (56.863 % of family members)
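
The "% of family members" figures can be read back as whole-number member counts. The sketch below assumes, without confirmation from the page, that each percentage is simply a count divided by the 102 sequences in the family and rounded to three decimals; the function names are illustrative only.

```python
FAMILY_SIZE = 102  # "Number of Sequences" from Basic Information


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Whole-number member count implied by a rounded percentage (assumed count/total)."""
    return round(percent / 100 * total)


def percent_of(count: int, total: int = FAMILY_SIZE, digits: int = 3) -> float:
    """Percentage of the family represented by `count` members."""
    return round(100 * count / total, digits)


# Percentages copied from the Most Common Taxonomy / Ecosystem listings.
reported = {
    "Bacteria": 87.255,
    "GOLD: Soil": 11.765,
    "ENVO: Unclassified": 25.490,
    "EMPO: Soil (non-saline)": 56.863,
}

for label, pct in reported.items():
    n = implied_count(pct)
    print(f"{label}: {pct} % ~ {n} of {FAMILY_SIZE} members (recomputed: {percent_of(n)} %)")
```

Under that assumption the reported values round-trip cleanly, which suggests (but does not prove) that they are plain member fractions.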




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (102 sequences; alignment columns 1-64). Sequence IDs are listed under Family Sequences.
Powered by MSAViewer



Structure & Topology

Predicted Secondary Structure and Topology
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 14.71%, β-sheet: 17.65%, Coil/Unstructured: 67.65%
Feature Viewer tracks (positions 1-40 of the representative sequence IASRYLGPEAGAAYAERGNDDLLIRLEPGDLRAWDFADDL): Sequence, α-helices, β-strands, Coil, SS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.39
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
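
The confidence rule stated above can be written out as a small check. This is a minimal sketch: the 0.7 cutoff and the four pLDDT ranges are taken from this page, while the function names and the handling of fractional scores between the integer bin edges are assumptions made for illustration.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted on the page: pTM < 0.7 means a low confidence model

# Upper bounds and labels of the per-residue confidence bins listed on the page.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float) -> bool:
    """True when a model falls under the page's low-confidence cutoff."""
    return ptm < PTM_THRESHOLD


def plddt_bin(score: float) -> str:
    """Return the label of the bin a pLDDT score falls into.

    Assumption: a fractional score such as 50.5 is placed in the next bin up,
    since the page only lists integer bin edges.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT is expected to lie between 0 and 100")
    for upper, label in PLDDT_BINS:
        if score <= upper:
            return label
    return PLDDT_BINS[-1][1]  # unreachable for valid input; kept for clarity


print(is_low_confidence(0.39))  # pTM-score reported for family F102129
print(plddt_bin(85))
```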



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy breakdown (pie chart): All Organisms 87.3%, Unclassified 12.7%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart legend, in page order (repeated labels kept as listed): Freshwater Sediment; Soil; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Serpentine Soil; Grasslands Soil; Peatlands Soil; Soil; Soil; Agricultural Soil; Arctic Peat Soil; Soil; Grasslands Soil; Soil; Soil; Hardwood Forest Soil; Soil; Soil; Soil; Tropical Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Agricultural Soil; Corn Rhizosphere; Tabebuia Heterophylla Rhizosphere; Switchgrass Rhizosphere; Arabidopsis Rhizosphere; Populus Rhizosphere; Switchgrass Rhizosphere; Miscanthus Rhizosphere; Corn Rhizosphere; Miscanthus Rhizosphere; Switchgrass Rhizosphere; Agave; Boreal Forest Soil
Slice percentages shown in the chart, in page order: 11.8%, 2.9%, 7.8%, 2.9%, 5.9%, 3.9%, 2.9%, 6.9%, 2.9%, 4.9%, 2.9%, 2.9%, 2.9%, 3.9%, 4.9%
Powered by ApexCharts



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
KansclcFeb2_05222990 | 2124908045 | Soil | HSIAIRYLGERDGAAYADGGEDDTLIRLEPGRLRAWDFADQF
JGI10216J12902_1062755042 | 3300000956 | Soil | IRIASRYLGPQAAAEYAESGGDDLIVRLEPGHLRAWDFADSVD*
JGI20190J14840_10111973 | 3300001384 | Arctic Peat Soil | YLGREAGAAYADRAEDDLLIRLEPGDLRAWDFADELS*
C688J18823_104281161 | 3300001686 | Soil | HAVRDIASRYLGDDEGGAYADRGSDDTLIRLEPGDLRAWDFADDL*
C688J35102_1203842593 | 3300002568 | Soil | LAHADASAVRDIASRYLGDDEGSAYADRGSDDTLIRLEPRDLRAWDFADDL*
Ga0062590_1029084321 | 3300004157 | Soil | RYLGREAGIAYVAEGSDDLLVRLEPGTLRAWDFADEFGVSA*
Ga0062591_1029967062 | 3300004643 | Soil | RIAARYLGPEAGQRYAETAGDDLLIRLEPGELRGWDFADDFT*
Ga0066388_1007565403 | 3300005332 | Tropical Forest Soil | KRIASRYLGVEAGSAYAERGGDDLLIRLELGDLRGWDFADDFS*
Ga0070714_1008424602 | 3300005435 | Agricultural Soil | ADVLEIAVRYLGEEEGGAYAAGGEDDTLIRLEPGRLRAWDFVDDL*
Ga0066697_102368131 | 3300005540 | Soil | AVERIAKRYLGSPGGETYAESAGDDLLIRLEPGTLRTWDFADEFEVPG*
Ga0070693_1005433472 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | IAARYLGQEAGERYAESAGDDLLIRLEPGQLRAWDFSDYFA*
Ga0066707_102918843 | 3300005556 | Soil | GGETYAASAGDDLLIRLEPGTLRTWDFADEFEVSG*
Ga0066706_104821511 | 3300005598 | Soil | YLGEEAGAAYAERMRDDVVVRLEPGGLRAWDFSDESA*
Ga0066903_1091174882 | 3300005764 | Tropical Forest Soil | HYLGPEAGPAYAARSGDDLLVRLEPETLRAWDFADEYA*
Ga0081455_100613335 | 3300005937 | Tabebuia Heterophylla Rhizosphere | LGAELGEQYAETDADDLLVRLEPGELRGWDFADDFA*
Ga0081455_104741181 | 3300005937 | Tabebuia Heterophylla Rhizosphere | RIAARYLGPEFGEQYAETDADDLLIRLEPGELRGWDFADDFA*
Ga0066652_1018280642 | 3300006046 | Soil | IAVRYLGERDGAAYADRAGDDTLIRLEPGRLRAWDFADDL*
Ga0074055_115998662 | 3300006573 | Soil | DTVREIAARYLGDVEGGAYADRGYDDTLIRLEPGRLRAWDFADDL*
Ga0074053_117135421 | 3300006575 | Soil | AARYLGDVEGGAYADRGYDDTLIRLEPGRLRAWDFADDL*
Ga0074057_121014193 | 3300006605 | Soil | VIGRAGADTVREIAARYLGDVEGGAYADRGYDDTLIRLEPGRLRAWDFADDL*
Ga0066653_102937463 | 3300006791 | Soil | GPEAGDRYAETAGDDLLIRLEPGELRGWDFADYFA*
Ga0079221_115320542 | 3300006804 | Agricultural Soil | RIAVRYLGPEEGERYVAGGGDDLLIRLEPGKLRAWDFADDF*
Ga0075433_111371821 | 3300006852 | Populus Rhizosphere | RRIATRYLGPEAGQRYGETAGDDLLIRLEPGELRGWDFADDFAG*
Ga0075425_1010425383 | 3300006854 | Populus Rhizosphere | RRIAIRYLGPEAGERYAETAGDDLLIRLEPGKLRGWDFADDFA*
Ga0075424_1016266701 | 3300006904 | Populus Rhizosphere | IVRRIASRYLGPEAGQRYGETAGDDLLIRLEPGELRGWDFADDFAG*
Ga0075424_1026501652 | 3300006904 | Populus Rhizosphere | SIVRRIAARYLGPEAGQRYAETAGDDLLIRLEPGELRGWDFADDFT*
Ga0075436_1007675192 | 3300006914 | Populus Rhizosphere | TEDRSIVRRIAARYLGPEFGEEYAGTDADDLLIRLEPGELRGWDFADDFA*
Ga0079219_120040861 | 3300006954 | Agricultural Soil | PDEGQRYDETAGDDLLIRLEPGELRGWDFADHFAG*
Ga0105245_112497301 | 3300009098 | Miscanthus Rhizosphere | QEAGERYAETGGDDLLIRLEPGELRAWDFADDFA*
Ga0066709_1001074591 | 3300009137 | Grasslands Soil | STPADRSIIRRIATRYLGPEAGDRCAEGGGDDLLIRLEPGELRAWDFADDFG*
Ga0116218_11310861 | 3300009522 | Peatlands Soil | GREAGAAYADSATDDLLIRLEPGELRAWDFADQLS*
Ga0105249_111118973 | 3300009553 | Switchgrass Rhizosphere | RYLGVEEGEKYVSEGYDDTLIRLEPGRLRAWDFVDDM*
Ga0105249_112690371 | 3300009553 | Switchgrass Rhizosphere | VVREIAGRYLGDEEGRAYADGGHDDTLIRLEPGRLRAWDFADDL*
Ga0126374_111049801 | 3300009792 | Tropical Forest Soil | RYLGPEAGGQYADSAGDDLLIRLEPGELRAWDFADDIA*
Ga0126374_114294651 | 3300009792 | Tropical Forest Soil | LGPEGGEQYAETAGDDLLIRLEPGELRAWDFADEFA*
Ga0126309_111938542 | 3300010039 | Serpentine Soil | ADADDVYEIAGRYLGDEEGAAYADRGHDDTLIRLEPGRLRAWDFTDDL*
Ga0134086_100399731 | 3300010323 | Grasslands Soil | PEAGEQYAETEGDDLLIRLEPGELRAWDFADDFV*
Ga0134080_105560731 | 3300010333 | Grasslands Soil | SAGLSTLEDRSIVRRIATRYLGPEAGEQYAETTGGDDLLIRLEPGELRGWDFADDFA*
Ga0134062_104195102 | 3300010337 | Grasslands Soil | RYLGGEEGEAYADRGDDDTLIRLEPGRLRAWDFADDL*
Ga0126370_123383912 | 3300010358 | Tropical Forest Soil | YLGEDEGDAYTEMGEDDTLIRLEPGRLRAWDFADDL*
Ga0126376_112813992 | 3300010359 | Tropical Forest Soil | YLGRETGGAYAGRAADDLLIRLEPGDLRAWDFADELA*
Ga0126378_103421571 | 3300010361 | Tropical Forest Soil | GAEAGAAYAESAGDDLLIRLEPGNLRAWDFADEFS*
Ga0126379_120770802 | 3300010366 | Tropical Forest Soil | AEAGSAYAERGGDDLLIQLEPGDLRGWDFADDFS*
Ga0105239_122018342 | 3300010375 | Corn Rhizosphere | PVDGPAMVARIAGRYLGPQAGGRYAESGNDDLLIRIEPGELRGWDFTDDFD*
Ga0134122_115273832 | 3300010400 | Terrestrial Soil | SRYLGAEGGATYVEEGGDDLVLRLEPGILRTWDFADDLS*
Ga0134121_105913501 | 3300010401 | Terrestrial Soil | LSTPDDRSIVRRIATRYLGPEADEQYAESGGDDLLIRLEPGELRAWDFADDFT*
Ga0134121_112429962 | 3300010401 | Terrestrial Soil | ARYLGQEGAAAYVESGGDDLLVRLEPGVLRTWDFADD*
Ga0126350_114004021 | 3300010880 | Boreal Forest Soil | AVRYLGVEAGEAYVADFSDDVVIRLEPGTVRAWDFADEYT*
Ga0137393_100598351 | 3300011271 | Vadose Zone Soil | VAVRYLGPELGPAYVEGGGDSLLIRLEPGDLRAWDFAEKLGEIERS*
Ga0137364_108161762 | 3300012198 | Vadose Zone Soil | RKGGDRYTASAGDDLLIRLEPGELRAWDFADDFA*
Ga0137365_105883201 | 3300012201 | Vadose Zone Soil | ATRYLGPEAGERYAETSGDDQLIRLEPGELRGWDFADDFS*
Ga0137374_111612281 | 3300012204 | Vadose Zone Soil | RYLGERAGAAYAQTAGDDTVIRLEPGRLRAWDFSDEAPL*
Ga0137381_104221103 | 3300012207 | Vadose Zone Soil | PGAGVAYAERVGDDLLIRLEPGDLRAWDFAGELT*
Ga0137386_111144301 | 3300012351 | Vadose Zone Soil | IATRYLGEEAGAAYAERMRDDVVVRLEPGRFRAWDFSDEGV*
Ga0137369_101198911 | 3300012355 | Vadose Zone Soil | RDGTAYAESAGDDTLIRLAPGRLRAWDFSDEYLA*
Ga0137368_107802301 | 3300012358 | Vadose Zone Soil | IAVRYLGAEEGEAYVASAGDDTLIRLEPGHVRAWDFADDL*
Ga0157313_10148942 | 3300012503 | Arabidopsis Rhizosphere | RYLGPEAGEQYAEGGGDDLLVRLEPGELRAWDFADDFA*
Ga0157303_100075361 | 3300012896 | Soil | ARYLGPEFGEQYAETDADDLLIRLEPGELRGWDFADDFA*
Ga0157293_100117744 | 3300012898 | Soil | VRYLGVEEGEKYVSEGYDDTLIRLEPGHLRAWDFVDDM*
Ga0157301_102055152 | 3300012911 | Soil | RYLGPQAAAEYAESGGDDVIVRLEPGHLRAWDFADSVD*
Ga0164304_116828572 | 3300012986 | Soil | ADADFVRDIAVRYLGDDQGQAYVDRGYDDTLIRLEPGHLRAWDFTDDF*
Ga0157375_128451012 | 3300013308 | Miscanthus Rhizosphere | VVARVAGRYLGGEEGEQYAQTSGDDLLIRLEPGNLRAWDFADFYS*
Ga0134079_103849983 | 3300014166 | Grasslands Soil | EEAGNAIRHIAVRYLGERDGVAYADSAGDDTLIRLEPGRLRAWDFADQL*
Ga0173480_106991812 | 3300015200 | Soil | MSVPVLDIATRYLGAEKGRAYVDKGYDDTLIRLEPEYLRAWDFKDDPEL*
Ga0187816_104379142 | 3300017995 | Freshwater Sediment | LRRGEVAYASRSMSLLLIRLEPGDLRAWDFADELS
Ga0187815_104870002 | 3300018001 | Freshwater Sediment | YLGVEQGEAYAAGAGDDTLIRLEPGMLRAWDFADDY
Ga0066667_111568331 | 3300018433 | Grasslands Soil | RRLATRCRGPEGGDQYAQGGGDDLLIRLEPDALRAWDFADEFA
Ga0066669_114046251 | 3300018482 | Grasslands Soil | IAARYLGERDGAAYAASSGDDTLIRLEPGRLRAWDFADQL
Ga0210399_102718982 | 3300020581 | Soil | SRYLGAEAGAAYAESAGDDLLIRVEPGELRAWDFADEFS
Ga0210388_107310141 | 3300021181 | Soil | IASRYLGLEAGAAYADSADDDLLVRLEPGELRAWDFAGEHS
Ga0210402_104375743 | 3300021478 | Soil | RYLGPRAGAAYADTARDDLLIRLEPGDLRAWDFADEYS
Ga0247791_10933192 | 3300023062 | Soil | GAEVAAAYVRSLTGNDLLVRVEPGELRVWDFADDFPSG
Ga0207656_102937762 | 3300025321 | Corn Rhizosphere | ERSIVGRIAARYLGSEAGEQYAETAGDDVLIRLEPGTLRGWDFADYFAC
Ga0207685_104521562 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | IRYLGETEGAAQAERLADDVVLRLEPGDLRAWDFTDEY
Ga0207685_107737991 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | DGSVVSRIAGRYLGREAGERYAETAGDDLLIRLEPGDLRAWDFSDYFT
Ga0207699_111568981 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | VSRIAARYLGQEAGERYAESAGDDLLIRLEPGQLRAWDFSDYFA
Ga0207644_106355711 | 3300025931 | Switchgrass Rhizosphere | RYLGQEAGERYAETAGDDLLIRLEPGQLRAWDFADDFA
Ga0207709_118263492 | 3300025935 | Miscanthus Rhizosphere | ARYLGPEFGEQYAETDADDLLIRLEPGELRGWDFADDFA
Ga0207675_1004238551 | 3300026118 | Switchgrass Rhizosphere | LGDEEGRAYADGGHDDTLIRLEPGRLRAWDFADDL
Ga0207675_1004570031 | 3300026118 | Switchgrass Rhizosphere | RYLGPDEGQRYDETAGDDLLIRLEPGELRGWDFADHFAG
Ga0209795_101456522 | 3300027718 | Agave | VPEDRSIVRRIATRYLGPEAGEQYAEGAGDDLLIRLEPGELRGWDFADDFA
Ga0209177_103193442 | 3300027775 | Agricultural Soil | ITLEERSIVGRIAARYLGSEAGEQYAETAGDDVLIRLEPGTLRGWDFADYFAC
Ga0307291_10087033 | 3300028707 | Soil | RDIAGRYLGDEEGGAYADRGYDDTLIRLEPGRLRAWDFADDL
Ga0307311_102084501 | 3300028716 | Soil | ADVVRDIAGRYLGDEEGGAYADRGYDDTLIRLEPGRLRAWDFADDL
Ga0307297_103771362 | 3300028754 | Soil | RDIAVRYLGDEEGGAYADRGYDDTLIRLEPGRLRAWDFADDL
Ga0307316_103023301 | 3300028755 | Soil | SRYLGAEAGAAYAEQTTDDTIIRLEPGRLRAWDFSDEAT
Ga0307323_100044187 | 3300028787 | Soil | RYLGDEEGGAYADRGYDDTLIRLEPGRLRAWDFADDL
Ga0307305_103893832 | 3300028807 | Soil | RYLGREKGTAYAAEEADDVLIRLEPGRLRAWDFADEYPTTET
Ga0307278_105321632 | 3300028878 | Soil | ATRYLGPDEGERYGEVGVDDLLIRLEPGELRAWDFADYFA
Ga0307469_122274641 | 3300031720 | Hardwood Forest Soil | LGPEEGERYVESGNDDLLIRLEPGELRAWDFVDSFA
Ga0306918_101935081 | 3300031744 | Soil | VGDAVLRIASHYLGPEAGAAYADSAGDDLLIRLEPGRLRAWDFADELA
Ga0318546_104992311 | 3300031771 | Soil | IASRYLGPEAGAAYAERGNDDLLIRLEPGDLRAWDFADDL
Ga0306919_110136072 | 3300031879 | Soil | SRYLGPEAGAAYAERGYDDLLIRLEPGDLRAWDFADDL
Ga0306921_119552371 | 3300031912 | Soil | VTRIASRYLGPEAGAAYAERGNDDLLIRLEPGDLRAWDFADDL
Ga0308175_1017550992 | 3300031938 | Soil | TTARLSTLEDRSVVARIAARYLGREAGDRYAETAGDDLLIRLEPGDVRAWDFSDTYA
Ga0308174_104792302 | 3300031939 | Soil | RIAARYLGREAGDRYAETAGDDLLIRLEPGDVRAWDFSDSYA
Ga0318504_102119442 | 3300032063 | Soil | YLGPEAGAAYADSAGDDLLIRLEPGRLRAWDFADELA
Ga0310895_105824081 | 3300032122 | Soil | IAGRYLGDEEGRAYADGGHDDTLIRLEPGRLRAWDFADDL
Ga0307472_1010932043 | 3300032205 | Hardwood Forest Soil | RAIAIRYLGEEAGTAYAEQIADETLIRLEAGRVRAWDFADEYS
Ga0335078_101092211 | 3300032805 | Soil | ITSRYLGRQAGAAYAGSAADDLLIRLEPGDLRAWDFAGEPS
Ga0335081_116429901 | 3300032892 | Soil | GGADAIVAIASRYLGPEEGRAYADSASDDTLIRLEPGTIRAWDFSDE
Ga0335083_109079802 | 3300032954 | Soil | YLGRQAGAAYADSGLDDLLIRLEPGELRAWDFADQLS
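
Summary figures such as habitat shares and sequence lengths can be recomputed from the Family Sequences listing. The sketch below uses four rows hand-copied from the listing as (protein ID, sample taxon ID, habitat, sequence) tuples. It assumes that a trailing `*` marks a stop codon and is not a residue, which the page does not state; the function names are hypothetical.

```python
from collections import Counter

# Four rows hand-copied from the Family Sequences listing:
# (protein ID, sample taxon ID, habitat, sequence)
ROWS = [
    ("JGI20190J14840_10111973", "3300001384", "Arctic Peat Soil",
     "YLGREAGAAYADRAEDDLLIRLEPGDLRAWDFADELS*"),
    ("Ga0318546_104992311", "3300031771", "Soil",
     "IASRYLGPEAGAAYAERGNDDLLIRLEPGDLRAWDFADDL"),
    ("Ga0306919_110136072", "3300031879", "Soil",
     "SRYLGPEAGAAYAERGYDDLLIRLEPGDLRAWDFADDL"),
    ("Ga0209795_101456522", "3300027718", "Agave",
     "VPEDRSIVRRIATRYLGPEAGEQYAEGAGDDLLIRLEPGELRGWDFADDFA"),
]


def residue_length(sequence: str) -> int:
    """Length in residues, ignoring a trailing '*' (assumed to mark a stop codon)."""
    return len(sequence.rstrip("*"))


def habitat_percentages(rows):
    """Share of rows per habitat label, as percentages rounded to one decimal."""
    counts = Counter(habitat for _, _, habitat, _ in rows)
    total = sum(counts.values())
    return {habitat: round(100 * n / total, 1) for habitat, n in counts.items()}


def mean_length(rows) -> float:
    """Mean residue length over the given rows."""
    return sum(residue_length(seq) for *_, seq in rows) / len(rows)


print(habitat_percentages(ROWS))
print(f"mean length: {mean_length(ROWS):.1f} residues")
```

Running the same functions over all 102 rows would give family-wide figures. Whether those match the page's own statistics depends on how the database treats the `*` marker and truncated sequences.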




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice