NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F091046

Metagenome Family F091046

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F091046
Family Type Metagenome
Number of Sequences 108
Average Sequence Length 42 residues
Representative Sequence MRRFLCRVVGHRLPRRRRPFFLTERYQRCERCGERVRLRGR
Number of Associated Samples 88
Number of Associated Scaffolds 108

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.74 %
% of genes near scaffold ends (potentially truncated) 21.30 %
% of genes from short scaffolds (< 2000 bps) 78.70 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction Yes
3D model pTM-score0.43

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (71.296 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere
(5.556 % of family members)
Environment Ontology (ENVO) Unclassified
(23.148 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(37.037 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62
1JGI1356J14229_1000722713
2soilL1_100356064
3soilL1_100748443
4soilH2_101690212
5Ga0062593_1001828723
6Ga0062589_1022020032
7Ga0062590_1002370822
8Ga0062590_1016344222
9Ga0062595_1006534011
10Ga0062592_1011527453
11Ga0066676_109946371
12Ga0066675_102071863
13Ga0068996_101685302
14Ga0066388_1082473982
15Ga0070680_1001190913
16Ga0070682_1019237572
17Ga0070708_1018624012
18Ga0070741_100545044
19Ga0070741_105386542
20Ga0066692_103120052
21Ga0066706_112795642
22Ga0081455_100880402
23Ga0073934_102006883
24Ga0079217_117431462
25Ga0075426_112869182
26Ga0075436_1007912822
27Ga0075436_1007998822
28Ga0066709_1001195815
29Ga0066709_1046148982
30Ga0075423_101776502
31Ga0116123_11247251
32Ga0105089_10350551
33Ga0105058_10913952
34Ga0126313_108568693
35Ga0134088_103338642
36Ga0126377_103495852
37Ga0137391_104386861
38Ga0137363_114298322
39Ga0137377_106465093
40Ga0137390_100923643
41Ga0164301_117511902
42Ga0126369_105092732
43Ga0126369_114961962
44Ga0075302_11629732
45Ga0182021_126572982
46Ga0137409_113262462
47Ga0132258_102548223
48Ga0132258_108614524
49Ga0132258_117371082
50Ga0132258_127220392
51Ga0132256_1011735132
52Ga0132256_1022612971
53Ga0132255_1001566174
54Ga0132255_1027496041
55Ga0187788_101205262
56Ga0187788_104633231
57Ga0187765_101848152
58Ga0187765_104674072
59Ga0187765_110072981
60Ga0187773_107610261
61Ga0190265_138415682
62Ga0066667_108565932
63Ga0066667_115956352
64Ga0173481_101332941
65Ga0247786_10768532
66Ga0179589_103713842
67Ga0209521_102514652
68Ga0209642_101490703
69Ga0209324_106160631
70Ga0209172_101406933
71Ga0209323_100703023
72Ga0209519_107598461
73Ga0209640_105863382
74Ga0209341_104581511
75Ga0209342_104388853
76Ga0209342_106520571
77Ga0209751_105006751
78Ga0207657_107535052
79Ga0207679_109216852
80Ga0210117_10888482
81Ga0209378_12815671
82Ga0209897_10733111
83Ga0209514_1000380918
84Ga0209514_100221831
85Ga0265337_11370732
86Ga0265326_102426232
87Ga0247818_105231602
88Ga0247818_110791531
89Ga0265322_101854661
90Ga0265336_100428492
91Ga0265338_100542425
92Ga0265338_101119284
93Ga0247825_100910865
94Ga0307498_100306182
95Ga0302321_1031308542
96Ga0318565_103981171
97Ga0307409_1028112651
98Ga0307416_1025888781
99Ga0310890_114677832
100Ga0315281_100216659
101Ga0307471_1024521332
102Ga0335070_100089934
103Ga0335070_100461272
104Ga0335081_107232322
105Ga0335069_1002268110
106Ga0335077_112612472
107Ga0334722_101116891
108Ga0247830_107423163
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 11.59%    β-sheet: 8.70%    Coil/Unstructured: 79.71%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540MRRFLCRVVGHRLPRRRRPFFLTERYQRCERCGERVRLRGRSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.43
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
71.3%28.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Sediment
Peatland
Groundwater
Natural And Restored Wetlands
Hot Spring Sediment
Soil
Soil
Vadose Zone Soil
Tropical Forest Soil
Serpentine Soil
Grasslands Soil
Surface Soil
Soil
Soil
Agricultural Soil
Sugarcane Root And Bulk Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Soil
Natural And Restored Wetlands
Tropical Peatland
Fen
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Soil
Groundwater Sand
Fen
Arabidopsis Rhizosphere
Tabebuia Heterophylla Rhizosphere
Populus Rhizosphere
Rhizosphere
Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
2.8%3.7%3.7%5.6%2.8%5.6%2.8%4.6%3.7%4.6%4.6%5.6%5.6%2.8%5.6%3.7%5.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1356J14229_10007227133300001380GroundwaterMLHLSRSLVCRIVGHRLPRSKRPFFLTERFQRCERCGERVRAKYR*
soilL1_1003560643300003267Sugarcane Root And Bulk SoilMRRVICRALGHKLPRRRRPFFLTERYQRCERCGERVRLEDR*
soilL1_1007484433300003267Sugarcane Root And Bulk SoilVNILCRVFGHRLPRRRRPFFLTDHFQRCERCGERVRLKDR*
soilH2_1016902123300003324Sugarcane Root And Bulk SoilMRRVICRALGHRLPRRRRPFFLTERYQRCERCGERVRLEDR*
Ga0062593_10018287233300004114SoilVNLRCRIFGHRLPRRKRPFFLTDRFQRCERCGERVALKGR*
Ga0062589_10220200323300004156SoilVTPLRCRLFGHRLPRRKRPFFLIDRFQRCERCGARVPLKDRERP*
Ga0062590_10023708223300004157SoilVSLRCRIFGHRLPRRKRPFFLTDRFQRCERCGERVALKGR*
Ga0062590_10163442223300004157SoilVNWICRVFGHRLPRRRRPFFLTDLFQKCERCGERVRLKDR*
Ga0062595_10065340113300004479SoilMKRAVCRVIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0062592_10115274533300004480SoilVTPLRCRLFGHRLPRRKRPFFLIDRFQRCERCGARVPLKDRDRP*
Ga0066676_1099463713300005186SoilVKLRCRLLGHKLPRRKRPFFLTDRFQRCERCGERVLLKDR*
Ga0066675_1020718633300005187SoilMRRHLCRVLGHKLPRRRRPFFLTERFQRCERCGERVMLKGRE*
Ga0068996_1016853023300005218Natural And Restored WetlandsPYPGYRVGSHMKRALCSVIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0066388_10824739823300005332Tropical Forest SoilMRGTFCRVLGHRLPRRKRPFFLIERYQRCERCGARVRLKDR*
Ga0070680_10011909133300005336Corn RhizosphereMRRLTCRVLGHKLPRRRRPFFLTERFQRCERCGKRIPLKGR*
Ga0070682_10192375723300005337Corn RhizosphereMRRLTCRFLGHKLPRRRRPFFLTERFQRCERCGARVRLRGR*
Ga0070708_10186240123300005445Corn, Switchgrass And Miscanthus RhizosphereMRRAVCRLIGHKLPRRKRPFFLIERYQRCERCGERVRLRGR*
Ga0070741_1005450443300005529Surface SoilMRNVICRVLGHKLPRRKRPFFLIERYQRCERCGERVRLRDR*
Ga0070741_1053865423300005529Surface SoilMRNLICRVLGHKLPRRKRPFFLIERYQRCERCGERVRLRDR*
Ga0066692_1031200523300005555SoilMRRELLCRLVGHKLPRRRRPFFFPERFQRCERCGERVKVRGL*
Ga0066706_1127956423300005598SoilVRRILCTVIGHRLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0081455_1008804023300005937Tabebuia Heterophylla RhizosphereVTLRCRVLGHRLPRRKRPFFLTDRFQRCERCGDRVLLKNR*
Ga0073934_1020068833300006865Hot Spring SedimentVKLRCRLLGHRLPRRKRPFFLTERFQRCERCGRRIRLRDR*
Ga0079217_1174314623300006876Agricultural SoilMLRCRLFGHRLPRRRRPFFLTERYQRCERCGKRVLLRGR*
Ga0075426_1128691823300006903Populus RhizosphereMRRAVCRLIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0075436_10079128223300006914Populus RhizosphereMRRTLCRLIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0075436_10079988223300006914Populus RhizosphereVCRLIGHKLPRRKRPFFLIERYQRCERCGERVRLRGR*
Ga0066709_10011958153300009137Grasslands SoilMRRALCRVIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0066709_10461489823300009137Grasslands SoilMRRILCSVVGHRLPRRKRPFFLIERYQRCERCGERVRLKDR*
Ga0075423_1017765023300009162Populus RhizosphereMKRALCRVIGHKLPRRKRPFFLIERYQRCERCGERVRLKDR*
Ga0116123_112472513300009617PeatlandDDRLAAGRNGMARFICRIVGHRLPRRRRPFFLTERYQRCERCGERVRLRGR*
Ga0105089_103505513300009809Groundwater SandMCRLFGHRLPRRKRPFFVLEPPLQRCERCGKRALVKIRR*
Ga0105058_109139523300009837Groundwater SandMNLLCRLLGHRLPRRKRPFFLTERFQRCERCGKRVRLKDR*
Ga0126313_1085686933300009840Serpentine SoilLCRLVGHRLPRRKRPFFLLEPPLQRCERCGKRVLVRIGR*
Ga0134088_1033386423300010304Grasslands SoilMRRALCRVIGHKLPRRKRPFFLIERYQRCQRCGERVRLKGR*
Ga0126377_1034958523300010362Tropical Forest SoilVLGHRLPRRKRPFFLIERYQRCERCGARVRLKDR*
Ga0137391_1043868613300011270Vadose Zone SoilAGMRRELLCRLVGHKLPRRRRPFFFPERFQRCERCGERVKVRGL*
Ga0137363_1142983223300012202Vadose Zone SoilMRRLTCRILGHKLPRRRRPFFLTERFQRCERCGKRVALKGR*
Ga0137377_1064650933300012211Vadose Zone SoilMRRELLCRLVGHKLPRRRRPFFFPERFQRCERCGERVKLRGQRW*
Ga0137390_1009236433300012363Vadose Zone SoilMRRELLCRLVGHKLPRRRRPFFFPERFQRCERCGERVKVR
Ga0164301_1175119023300012960SoilMRRLTCRFLGHKLPRRRRPFFLTERFQRCERCGKRIPLKGR*
Ga0126369_1050927323300012971Tropical Forest SoilMRRILCSMVGHRLPRRKRPFFLIERYQRCERCGERVRLKGR*
Ga0126369_1149619623300012971Tropical Forest SoilMRRILCSVVGHRLPRRKRPFFLIDRYQRCERCGERVRLKGR*
Ga0075302_116297323300014269Natural And Restored WetlandsMTLRCRVFGHKLLRRKRPFFLTERYQRCDRCGKRIRLKGR*
Ga0182021_1265729823300014502FenMKRVLCRVVGHRLPRRKRPFFLTDRYQRCERCGKRVRLKDR*
Ga0137409_1132624623300015245Vadose Zone SoilMLCNVLGHKLPRRKRPFFLIERYQPCSRCGERVRLKGR*
Ga0132258_1025482233300015371Arabidopsis RhizosphereVNWMCRVLGHRLPRRRRPFFLTDLFQKCERCGERVRLKDR*
Ga0132258_1086145243300015371Arabidopsis RhizosphereMTLRCRLLGHKLPRRKRPFFLTDRFQRCERCGQRVLLKDR*
Ga0132258_1173710823300015371Arabidopsis RhizosphereVKRALCHVLGHRLPRRKRPFFLIERYQRCERCGERVRLKDR*
Ga0132258_1272203923300015371Arabidopsis RhizosphereMNWLCRVFGHRLPKRRRPFFLTDLFQKCERCGERVRLKDR*
Ga0132256_10117351323300015372Arabidopsis RhizosphereRGSLCRVLGHRLPRRKRPFFLIESYQRCERCGARVRLKDR*
Ga0132256_10226129713300015372Arabidopsis RhizosphereVTPLRCRLFGHRLPRRKRPFFLIDRFQRCERCGERVPLKDRDRS*
Ga0132255_10015661743300015374Arabidopsis RhizosphereVNWLCRVFGHRLPRRRRPFFLTDHFQKCERCGERVRLKDR*
Ga0132255_10274960413300015374Arabidopsis RhizosphereAGRVRGSLCRVLGHRLPRRKRPFFLIERYQRCERCGARVRLKDR*
Ga0187788_1012052623300018032Tropical PeatlandMRRFLCRVVGHRLPRRRRPFFLTERYQRCERCGERVRLRGR
Ga0187788_1046332313300018032Tropical PeatlandMRRRFLCRVVGHRLPRRRRPFFLTERYQRCERCGERVRLRGR
Ga0187765_1018481523300018060Tropical PeatlandVKRLLCSVIGHKLPRRKRPFFLIERYQRCDRCGERVRLKGR
Ga0187765_1046740723300018060Tropical PeatlandVKRVLCNVIGHKLPRRKRPFFLIERYQRCDRCGERIRLKGR
Ga0187765_1100729813300018060Tropical PeatlandMRRILCNVVGHRLPRRKRPFFLIERYQRCERCGERVR
Ga0187773_1076102613300018064Tropical PeatlandAAVRRVLCRLVGHRLPRRHRPFFLTERYQRCERCGERVRLRGR
Ga0190265_1384156823300018422SoilVVRCRIFGHRLPRRRRPFFLTERFQRCERCGRRIRLKGR
Ga0066667_1085659323300018433Grasslands SoilVKLRCRLLGHKLPRRKRPFFLTDRFQRCERCGERVLLKDR
Ga0066667_1159563523300018433Grasslands SoilMRRALCRVIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR
Ga0173481_1013329413300019356SoilVSLRCRIFGHRLPRRKRPFFLTDRFQRCERCGERVALKGR
Ga0247786_107685323300022883SoilVNWICRVFGHRLPRRRRPFFLTDLFQKCERCGERVRLKD
Ga0179589_1037138423300024288Vadose Zone SoilMRRLTCRILGHKLPRRRRPFFLTERFQRCERCGKRVALKGR
Ga0209521_1025146523300025164SoilMIRLSRALLCRVLGHRLPRSKRPFFLTERFQRCDRCGERVRAKYR
Ga0209642_1014907033300025167SoilMVRLSRSLLCRVLGHRLPRSKRPFFLTERFQRCERCGERVRAKYR
Ga0209324_1061606313300025174SoilVAGYSPRMIRLSRALLCRVLGHRLPRSKRPFFLTERFQRCDRCGERVRAKYR
Ga0209172_1014069333300025310Hot Spring SedimentVKLRCRLLGHRLPRRKRPFFLTERFQRCERCGKRIRLHDR
Ga0209323_1007030233300025314SoilSPRMVRLSRSLLCRVLGHRLPRSKRPFFLTERFQRCDRCGERVRAKYR
Ga0209519_1075984613300025318SoilMIRLSRSLLCRVLGHRLPRSKRPFFLTERFQRCERCGERVRAKYR
Ga0209640_1058633823300025324SoilMIRLSRSLLCRVLGHRLPRSKRSFFLTERFQRCERCGERVRAKYR
Ga0209341_1045815113300025325SoilVIRLSRSLLCRVLGHRLPRSKRPFFLTERFQRCERCGERVRAKHR
Ga0209342_1043888533300025326SoilMIRLSRSFLCRVLGHRLPRSKRPFFLTERFQRCERCGERVRAKHR
Ga0209342_1065205713300025326SoilRVLGHGLPRSKRPFFLTERFQRCERCGERVRAKHR
Ga0209751_1050067513300025327SoilVSAAAGYSARMIRLSRSLLCRVLGHRLPRSKRPFFLTERFQRCERCGERVRAKYR
Ga0207657_1075350523300025919Corn RhizosphereMNWLCRVFGHRLPKRRRPFFLTDLFQKCERCGERVRLKDR
Ga0207679_1092168523300025945Corn RhizosphereVTPLRCRLFGHRLPRRKRPFFLIDRFQRCERCGARVPLKDRDRP
Ga0210117_108884823300025985Natural And Restored WetlandsPYPGYRVGSHMKRALCSVIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR
Ga0209378_128156713300026528SoilMRRELLCRLVGHKLPRRRRPFFFPERFQRCERCGERVKVRGL
Ga0209897_107331113300027169Groundwater SandMCRLFGHRLPRRKRPFFVLEPPLQRCERCGKRVLVKIRR
Ga0209514_10003809183300027819GroundwaterMLHLSRSLVCRIVGHRLPRSKRPFFLTERFQRCERCGERVRAKYR
Ga0209514_1002218313300027819GroundwaterVAGYSPRMLRLSLSLVCRVFGHRLPRSKRPFFLTERFQRCERCGERVRAKYR
Ga0265337_113707323300028556RhizosphereMPKLLCRVLGHRLPRRRRPFFLTERYQRCERCGERVRLRGR
Ga0265326_1024262323300028558RhizosphereMARFLCRILGHRLPRRRRPFFLTERYQRCERCGERVRLRGR
Ga0247818_1052316023300028589SoilVKLRCRVLGHRLPRRKRPFFLTERFQRCERCGERIRLKDR
Ga0247818_1107915313300028589SoilVNWLCRVLGHRLPRRRRPFFLTDLFQRCERCGERVRLKDR
Ga0265322_1018546613300028654RhizosphereGDDMRFLCRILGHRLPRRRRPFFLTERYQRCERCGERIRLRGR
Ga0265336_1004284923300028666RhizosphereMRFLCRILGHRLPRRRRPFFLTERYQRCERCGERIRLRGR
Ga0265338_1005424253300028800RhizosphereMPKLLCRVLGHRLPRRRRPFFLTERYQRCDRCGERVRLRGR
Ga0265338_1011192843300028800RhizosphereMARFLCRIFGHRLPRRRRPFFLTERYQRCERCGERVRLRDR
Ga0247825_1009108653300028812SoilMKLRCRLLGHRLPRRKRPFFLTELFQRCERCGERIRLKDR
Ga0307498_1003061823300031170SoilVKLRCRILGHRLPRRRRPFFLTERYQRCERCGKRVRLRDR
Ga0302321_10313085423300031726FenMKRVLCRVVGHRLPRRKRPFFLTDRYQRCERCGKRVRLKDR
Ga0318565_1039811713300031799SoilSSVLCRVVGHRLPRRKRPFFITEHFQRCERCGERVLIRDR
Ga0307409_10281126513300031995RhizosphereVNWICRVVGHRLPRRRRPFFLTDLFQRCERCGERVR
Ga0307416_10258887813300032002RhizosphereVNWICRVVGHRLPRRRRPFFLTDLFQRCERCGERVRLKDR
Ga0310890_1146778323300032075SoilVNWLCRVLGHRLPRRRRPFFLTDPFQKCERCGERVRLKD
Ga0315281_1002166593300032163SedimentMALLRRSLLCRVFGHRLPRRKRPFFLTERFQRCERCGERVTAKYL
Ga0307471_10245213323300032180Hardwood Forest SoilMRRAVCRLIGHKLPRRKRPFFLIERYQRCERCGERVRLKGR
Ga0335070_1000899343300032829SoilVPRFLCRVLGHRLPRRRRPFFLTERYQRCERCGERVRLRDR
Ga0335070_1004612723300032829SoilMRRVMCRIVGHRLPRRRRPFFLTERYQRCERCGARVRLRGR
Ga0335081_1072323223300032892SoilMARLLCRILGHRLPRRRRPFFLTERYQRCERCGERVRLRDR
Ga0335069_10022681103300032893SoilMRRVLCRIVGHRLPRRHRPFFLTERYQRCERCGARIRLRGR
Ga0335077_1126124723300033158SoilVRRHLCRVIGHRLPRRRRPFFLTERYQRCERCGERVRL
Ga0334722_1011168913300033233SedimentVNGPAKRFLCTVLGHKLPRRKRPFFLIERHQRCERCGERVRLKGR
Ga0247830_1074231633300033551SoilVRKKGRVLGHRLPRRKRPFFLTERFQRCERCGERIRLKDR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.