NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F102762

Overview

Basic Information
Family ID: F102762
Family Type: Metagenome
Number of Sequences: 101
Average Sequence Length: 41 residues
Representative Sequence: VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Number of Associated Samples: 67
Number of Associated Scaffolds: 101

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 7.92%
% of genes near scaffold ends (potentially truncated): 22.77%
% of genes from short scaffolds (< 2000 bps): 95.05%
Associated GOLD sequencing projects: 64
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.59

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group: Bacteria (82.178% of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (30.693% of family members)
Environment Ontology (ENVO): Unclassified (37.624% of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (53.465% of family members)
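
Because the family has 101 sequences, the percentages reported above can be read back as whole-number member counts. The following is a minimal sketch of that arithmetic, assuming each percentage is a share of all 101 family members; the helper name `members_from_percent` is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported in Basic Information


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage of family members into a whole-number count."""
    return round(percent / 100 * family_size)


# Percentages copied from the overview; the labels are shortened for readability.
reported = {
    "Bacteria (most common taxonomy)": 82.178,
    "GOLD ecosystem: Soil": 30.693,
    "ENVO: Unclassified": 37.624,
    "EMPO: Plant rhizosphere": 53.465,
}

for label, percent in reported.items():
    print(f"{label}: {members_from_percent(percent)} of {FAMILY_SIZE} members")
```

Under that assumption the four shares correspond to 83, 31, 38 and 54 members respectively.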




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive view, powered by MSAViewer). The 101 aligned members are the protein IDs listed under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.24%, β-sheet: 5.88%, Coil/Unstructured: 55.88%
Feature Viewer (interactive) for the representative sequence VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR. Tracks: sequence, α-helices, β-strands, coil, SS confidence score, TM segments, topological domains (extracellular/cytoplasmic).

Predicted 3D Structure

Structure Viewer (interactive, powered by PDBe Molstar). Per-residue confidence (pLDDT) is shown in four bins: 0-50, 51-70, 71-90, 91-100.
pTM-score: 0.59

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
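
The low-confidence rule quoted above is a simple threshold on the pTM-score. A minimal sketch follows, assuming only the cutoff stated on this page (pTM < 0.7); the function name `is_low_confidence` is hypothetical and not part of NMPFamsDB.

```python
PTM_CUTOFF = 0.7  # cutoff quoted on this page for a low-confidence model


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls strictly below the cutoff."""
    return ptm_score < cutoff


# pTM-score reported for family F102762
print(is_low_confidence(0.59))
```

With the reported pTM-score of 0.59 the check returns `True`, which matches the note that this model was not screened against SCOPe or PDB.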



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

NCBI taxonomy chart (interactive, powered by ApexCharts). Labels: All Organisms, Unclassified. Shares shown: 82.2%, 17.8%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat chart (interactive, powered by ApexCharts). Labels: Groundwater Sediment; Soil; Terrestrial Soil; Serpentine Soil; Agricultural Soil; Permafrost; Sub-Biocrust Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Corn Rhizosphere; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Switchgrass Rhizosphere; Tabebuia Heterophylla Rhizosphere; Populus Rhizosphere; Rhizosphere.
Shares shown: 30.7%, 3.0%, 15.8%, 4.0%, 3.0%, 3.0%, 3.0%, 4.0%, 5.0%, 3.0%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
C688J35102_1196165572 | 3300002568 | Soil | VLAHIAGIPVEEALVAAPGLLAGLSMIAAYVRATATRPRASADE*
C688J35102_1203826052 | 3300002568 | Soil | VLAHVAGIPVEEALQAAPALLAGVTVFAGYLRATAARLRR*
Ga0070683_1007275452 | 3300005329 | Corn Rhizosphere | VFAHVGGLPVEEALQAAPAALAALTVLAGYVRATLARPRR*
Ga0070670_1005208072 | 3300005331 | Switchgrass Rhizosphere | VLAHVGGVPVEEALQAAPAALAALTVLGGYVRATLARPRR*
Ga0070682_1001649171 | 3300005337 | Corn Rhizosphere | VFAHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR*
Ga0070692_111294451 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | VFAHVGGLPVEEALQAAPAALAALTVLAGYLRATLARPRR*
Ga0070674_1018046192 | 3300005356 | Miscanthus Rhizosphere | VFAHVAGIPVEEALQAAPALLAGLAVVAGYIRATAARPRR*
Ga0070700_1012929052 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | VLAHVAGIPVEEALPAAPALPAGAAVIAGTSGRPPRGRRR*
Ga0066687_109353222 | 3300005454 | Soil | HSDVLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR*
Ga0070684_1003658263 | 3300005535 | Corn Rhizosphere | DVFAHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR*
Ga0068855_1009053423 | 3300005563 | Corn Rhizosphere | VLAHVAGIPVEEALPAAPALPAGAAVIAGYVRATAAPGR*
Ga0070702_1013408291 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | VFAHVGGVPVEEALQAAPVLLTGLTVFAGYVRATAAGRRRDQ*
Ga0081538_100094542 | 3300005981 | Tabebuia Heterophylla Rhizosphere | VLAHVAGIPVEEALLVAPALLAGVTVIAGYVRATAARPRR*
Ga0075428_1015775372 | 3300006844 | Populus Rhizosphere | VLAHVAGIPVEEALQAAPALLAGLTVFAGYLRATAARTRR*
Ga0079215_101266661 | 3300006894 | Agricultural Soil | VLAHVAGFPVEEALQAAPALLAGLTVLAGYVRATATRPRR*
Ga0079216_118836332 | 3300006918 | Agricultural Soil | VLAHVAGIPVEEALLAAPALLAGVTVLAGYVRAAATRPRR*
Ga0105679_108153753 | 3300007790 | Soil | VLAHVGALPVEEALLAAPALLAWVSTIAGYARAHLVRNG*
Ga0105679_108207562 | 3300007790 | Soil | MFAHIGGVPVEEALTAAPALLASATVLAGYLRATAGRARR*
Ga0111539_106782232 | 3300009094 | Populus Rhizosphere | VFAHVGGIPVEEALQAAPAALAALTVLAGYVRATLARPRR*
Ga0111539_127474982 | 3300009094 | Populus Rhizosphere | PSSDVFAHVGCLPVEEALQAAPAALAAMTVLAGYLRATLARPRR*
Ga0075418_129295542 | 3300009100 | Populus Rhizosphere | VLAHVAGIPVEEALQAAPALLAAVTVIAGYVRAIAARHALAEAEGS
Ga0114129_105097532 | 3300009147 | Populus Rhizosphere | VLAHVAGIPVEEALQAAPGLLAGLTVIAGYIRATAARRRR*
Ga0105237_120991732 | 3300009545 | Corn Rhizosphere | AGTTPTTPSSDVFAHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR*
Ga0105249_130459592 | 3300009553 | Switchgrass Rhizosphere | VLAHVAGIPVEEALLAAPALLAGAAVIAGYVRATAARRRR*
Ga0126307_102204642 | 3300009789 | Serpentine Soil | VLAHIGGIPLEEALQAAPALLTGLTVIAGYLRTTAARPRR*
Ga0126307_103616702 | 3300009789 | Serpentine Soil | VLAHVAGIPVEEALLAAPVLLAGVTAIAGYVRATAARSRR*
Ga0126307_105102412 | 3300009789 | Serpentine Soil | VFAHVAGIPVEEALVAGPALLAGVTLIAGYLRATVARPRR*
Ga0126307_105494572 | 3300009789 | Serpentine Soil | VLAHVAGIPLEEALQAAPALLAGVTVIAGYVRATAARPRR*
Ga0126307_111267992 | 3300009789 | Serpentine Soil | VLAHVAGVPVEEALQAAPALLAAVTMIAGYVRATVARPRR*
Ga0126307_111607682 | 3300009789 | Serpentine Soil | MTVLAHVAGIPVEEALLAAPALLAGVTVIAGYLRATAARPRR*
Ga0126307_113557301 | 3300009789 | Serpentine Soil | VLAHIAGIPVEEALMAAPALLTGLTVIAGYLRASAARPRR*
Ga0126313_100032425 | 3300009840 | Serpentine Soil | VLAHVAGISVEEALPAGVSVIAGDVRATAARRAS*
Ga0126313_100144022 | 3300009840 | Serpentine Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR*
Ga0126313_106117562 | 3300009840 | Serpentine Soil | VLAHVAGVPVEEALLAAPGLMAGVTVVAGYVRATGARLRR*
Ga0126313_109532402 | 3300009840 | Serpentine Soil | VLAHVAGLPVEEALLAEPALVAGVTVAAGYVRATAARPRR*
Ga0126309_103322612 | 3300010039 | Serpentine Soil | VLAHVAGFPVEEALTAAPALLAALTMIAGYVRATAARSRR*
Ga0126308_110306531 | 3300010040 | Serpentine Soil | VLAHVAGIPVEEALLAAPVLLAGVTAIAGYVRATAARPRR*
Ga0126311_118085232 | 3300010045 | Serpentine Soil | VLAHVAGIPIEEALLAAPALLAGLTVIAGYVRAIAARPRR*
Ga0126311_119223582 | 3300010045 | Serpentine Soil | VLAHVAGIPAEEALLAEPALVAGVTVAAGYVRATAARPRR*
Ga0126306_106267642 | 3300010166 | Serpentine Soil | VLAHVGGIPVEEALAAAPALLAGVTVIAGYVRATAARARR*
Ga0134125_106745401 | 3300010371 | Terrestrial Soil | VLAHVAGIPVEEALMAAPALLAGVTMIAGYVRATAARPRR*
Ga0105239_128669972 | 3300010375 | Corn Rhizosphere | VLAHVAGIPVEEALLAAPALLAGAALIAGYVRATAARRRR*
Ga0134127_111685922 | 3300010399 | Terrestrial Soil | AHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR*
Ga0134127_115941812 | 3300010399 | Terrestrial Soil | VLAHVAGIPVEEASMAAPALLAGVTMIAGYVRATAARPRR*
Ga0157310_104112802 | 3300012916 | Soil | VLAHVAGIPVEEALQAAPALLAGATVIAGYIRATAARRR*
Ga0162650_1001047491 | 3300012939 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVLAGYVRATAARPRR*
Ga0164306_118361571 | 3300012988 | Soil | VLAHVAGIAVEEVLLAAPALLAGMTVIVGYVRRAW
Ga0163162_111990392 | 3300013306 | Switchgrass Rhizosphere | PVEEALQAAPVLLTGLTVFAGYVRATAAGRRRDQ*
Ga0182001_100793152 | 3300014488 | Soil | VLAHVAGIPVEEALLAAPALVAAVTVIAGYLRATVARPRR*
Ga0120104_10946381 | 3300014829 | Permafrost | TSTTTHSDVLAHVAGIPIEEALLAAPALLAGVTVIAGYVRATAARPRR*
Ga0132257_1026350031 | 3300015373 | Arabidopsis Rhizosphere | VFAHVGGLPVEEALQAAPAALAALTVLAGHVRATLARPRR*
Ga0184620_100911152 | 3300018051 | Groundwater Sediment | VLAHVAGFPVEEALQAAPALLAGVTVMVGYVRATAARPRR
Ga0190265_102285581 | 3300018422 | Soil | SDVLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0190265_104632941 | 3300018422 | Soil | SDVLAHVAGIPVEEALLAAPALLASVTVIVSYVRATAARPRR
Ga0190265_104686653 | 3300018422 | Soil | VLAHVAGIPVEEALLAAPALLASLTMIAGYVRTAAARPRR
Ga0190265_106625523 | 3300018422 | Soil | VLAHVAGIPVEEALMAAPALLAGVTVIAGYVRATVARPRR
Ga0190265_108162663 | 3300018422 | Soil | AHVAGIPVEEALLAAPALLASVTVIAGYVRAAASRPRR
Ga0190265_122667032 | 3300018422 | Soil | VLAHVAGIPVEEALLAAPALLAGVTAIAGYVRATAARPRR
Ga0190265_125121822 | 3300018422 | Soil | VLAHVAGVPVEEALLAAPALLTGLTVIAGYVRATAARARSRGPRTY
Ga0190265_129441162 | 3300018422 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRALRT
Ga0190272_106926162 | 3300018429 | Soil | TTRSDVFAHVGGIPVEEALRAAPALLAGLTVIAGYLRATAARPRR
Ga0190275_103068871 | 3300018432 | Soil | TTTHSDVLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATATRPRR
Ga0190275_106261121 | 3300018432 | Soil | TTTHSDVLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0190275_107032851 | 3300018432 | Soil | TAHSDVLAHVAGIPVEEALAAAPALLAGLTVIAGYVRATAARPRH
Ga0190275_112693072 | 3300018432 | Soil | VLAHVRGIPVEEALLAAPALLAGVTVIAGYIRATAARPRR
Ga0190275_117547142 | 3300018432 | Soil | MLAHISGIPVEEALQAAPALLASVTVIAGYIRATAARPRR
Ga0190275_130259852 | 3300018432 | Soil | MFAHVAGFPVEEALLAAPALLAGVTVIVGYVRATAARPRRSRRPR
Ga0190268_111963181 | 3300018466 | Soil | VLAHIAGIPVEEALQAAPALLASVTVIAGYVRATAARPRR
Ga0190268_114897802 | 3300018466 | Soil | VLAHIAGFPVEEALQAAPALLAGVTVIAGYLRATAARPRR
Ga0190270_101467972 | 3300018469 | Soil | VLLAHVAGFPVEEALLAAPALLAGVTVIAGYIRATVARPRR
Ga0190270_125130441 | 3300018469 | Soil | MTSTTTHSDVLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0190274_122336522 | 3300018476 | Soil | VVLAHVAGFPVEEALLAAPALLAGVTVIGGYIRATVARPRR
Ga0190274_139946882 | 3300018476 | Soil | VFAHVAGIPLEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0190271_112966912 | 3300018481 | Soil | VLAHVAGIPVEEALLAAPALLAGVAVVAGYVRATAARLRR
Ga0190271_121958832 | 3300018481 | Soil | VLAHVVGIPVEEALLAAPALLAGVTVIAGYVRATAASLRR
Ga0190264_103031972 | 3300019377 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0190264_121198871 | 3300019377 | Soil | VLAHVAGIPVEEALLAVPALLAGATAIAGYVRATAARPRR
Ga0196958_100142564 | 3300020181 | Soil | VLAHVSGIPVEEALLAAPALLAGVTAIAGYVRATAARPRTRRPTD
Ga0207643_106504562 | 3300025908 | Miscanthus Rhizosphere | TTSTTGRSDVLAHVGGVPVEEALQAAPAALAALTVLGGYVRATLARPRR
Ga0207643_109888691 | 3300025908 | Miscanthus Rhizosphere | TTTHSDVLAHVAGIPVEEALPAGVTVIAGCVRHDTRRPR
Ga0207650_104120713 | 3300025925 | Switchgrass Rhizosphere | VLAHVGGVPVEEALQAAPAALAALTVLGGYVRATLARPRR
Ga0207659_105637742 | 3300025926 | Miscanthus Rhizosphere | PTTPRSDVLAHVGGLPVEEALQAAPAALAALTVLAGYVRATLARPRR
Ga0207690_112241121 | 3300025932 | Corn Rhizosphere | VFAHVGGLPVEEALQAAPAALAALTVLAGYVRATLARPRR
Ga0207689_108906452 | 3300025942 | Miscanthus Rhizosphere | VLAHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR
Ga0207679_119602621 | 3300025945 | Corn Rhizosphere | VLAHVAGIPVEEALLAAPALLAGAAVIAGYVRATAARLRR
Ga0207712_106374801 | 3300025961 | Switchgrass Rhizosphere | GTTPTTPRSDVFAHVGGLPVEEALQAAPAALAAMTVLAGYLRATLARPRR
Ga0207648_103457782 | 3300026089 | Miscanthus Rhizosphere | VFAHVGGLPVEEALQAAPAALAALTVLAGYLRATLARPRR
Ga0247818_109506452 | 3300028589 | Soil | VLAHIAGIPVEEALQAAPALLAGATVIAGYVRATAARSRRGIRRGV
Ga0307315_101981902 | 3300028721 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARHRR
Ga0307287_103283042 | 3300028796 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIVGYIRATAARPRR
Ga0307305_103725961 | 3300028807 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRPCPQCTSP
Ga0268242_10679322 | 3300030513 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRQLRSRIAHASA
Ga0307502_100659012 | 3300031164 | Soil | VLAHVAGIPVEEALLAAPALLAGVTVIVGYVRATVARPRR
Ga0307405_114989571 | 3300031731 | Rhizosphere | VFAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR
Ga0307410_104058912 | 3300031852 | Rhizosphere | VLAHAAGIPVEEALLAAPALLAGVTVIAGYVRAAAARPRR
Ga0308175_1007483472 | 3300031938 | Soil | VLAHVAGIPVEESLLAAPALLAGVTVIAAYVRATAARRRR
Ga0308175_1023511031 | 3300031938 | Soil | VLAHVAGIPVEEALMAAPGALAGLTVLAGYVRATLARPRR
Ga0307409_1014292002 | 3300031995 | Rhizosphere | VLAHVAGIPVEEALVAAPALLAGVTVIAGYVRATAARPGR
Ga0308173_112448041 | 3300032074 | Soil | TRSDVFAHVAGIPVEEALTAAPALLAGVTMIAAYVRATAARRRR
Ga0334961_003344_152_277 | 3300034143 | Sub-Biocrust Soil | MLLAHVGIIPLEEALLAAPALLAGATVIAGYLRTIAVRPRR
Ga0334961_030907_214_339 | 3300034143 | Sub-Biocrust Soil | MLLAHVGGIPVEEALMAAPALLAGATVIADYVRAIVARPRR
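
Entries from the table above can be converted to FASTA for use in other tools. The sketch below is one possible way to do that, not an NMPFamsDB export format: the `FamilyMember` type, the `to_fasta` helper and the header layout are all hypothetical, and treating a trailing `*` as a stop-codon marker to be stripped is an assumption. The two sample records are copied from the table.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One table row: protein ID, sample taxon ID, habitat, sequence."""
    protein_id: str
    sample_taxon_id: str
    habitat: str
    sequence: str


def to_fasta(members: Iterable[FamilyMember]) -> str:
    """Render members as FASTA text, one header line and one sequence line each."""
    lines = []
    for member in members:
        # Assumption: a trailing "*" marks a stop codon and is not a residue.
        residues = member.sequence.rstrip("*")
        lines.append(
            f">{member.protein_id} "
            f"sample_taxon_id={member.sample_taxon_id} habitat={member.habitat}"
        )
        lines.append(residues)
    return "\n".join(lines) + "\n"


# Two rows copied from the Family Sequences table.
members = [
    FamilyMember("C688J35102_1203826052", "3300002568", "Soil",
                 "VLAHVAGIPVEEALQAAPALLAGVTVFAGYLRATAARLRR*"),
    FamilyMember("Ga0190264_103031972", "3300019377", "Soil",
                 "VLAHVAGIPVEEALLAAPALLAGVTVIAGYVRATAARPRR"),
]

print(to_fasta(members), end="")
```

Keeping the sample taxon ID and habitat in the header line preserves the link between each sequence and its source sample.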




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice