NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F105825




Overview

Basic Information
Family ID F105825
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 41 residues
Representative Sequence VNFVSSGVVANQTGQLGFKKAFLVRDPDGHAIEIEEK
Number of Associated Samples 84
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 3.00 %
% of genes near scaffold ends (potentially truncated) 95.00 %
% of genes from short scaffolds (< 2000 bps) 94.00 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model (HMM logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (95.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (15.000 % of family members)
Environment Ontology (ENVO) Unclassified (31.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (42.000 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
Interactive view rendered with MSAViewer; the 100 row labels are the Protein IDs listed under Family Sequences, in the same order.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 27.69%, Coil/Unstructured: 72.31%
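A distribution like the one above can be derived from a per-residue secondary-structure string by counting states. The following is a minimal sketch, not the database's own pipeline: the three-letter alphabet (H, E, C), the function name `ss_distribution`, and the example string are all assumptions made for illustration. The example string is made up; it merely has 18 strand positions out of 65, which happens to give the same percentages as reported.

```python
from collections import Counter

# Hypothetical three-state alphabet: H = α-helix, E = β-strand, C = coil.
STATE_NAMES = {"H": "α-helix", "E": "β-sheet", "C": "Coil/Unstructured"}


def ss_distribution(ss: str) -> dict[str, float]:
    """Return the percentage of positions in each state, rounded to 2 decimals."""
    if not ss:
        raise ValueError("empty secondary-structure string")
    counts = Counter(ss)
    unknown = set(counts) - set(STATE_NAMES)
    if unknown:
        raise ValueError(f"unexpected state codes: {sorted(unknown)}")
    return {
        name: round(100 * counts.get(code, 0) / len(ss), 2)
        for code, name in STATE_NAMES.items()
    }


# Made-up 65-position string with 18 strand positions (not the real prediction).
example = "C" * 20 + "E" * 9 + "C" * 10 + "E" * 9 + "C" * 17
print(ss_distribution(example))
```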
Feature Viewer: sequence VNFVSSGVVANQTGQLGFKKAFLVRDPDGHAIEIEEK (axis ticks at positions 5 to 35) with tracks for α-helices, β-strands, coil, and SS confidence score
Powered by Feature Viewer

Predicted 3D Structure


Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
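
The two confidence rules stated on this page (pTM < 0.7 marks a low confidence model; pLDDT is reported in four ranges) can be expressed as a small helper. This is an illustrative sketch only: the function names are hypothetical, and assigning fractional pLDDT values to a range by its upper bound is an assumption, since the page lists integer ranges.

```python
PTM_THRESHOLD = 0.7  # from the page: pTM < 0.7 is a low confidence model

# Upper bound of each pLDDT range shown in the legend, with its label.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float) -> bool:
    """True when the model falls below the pTM threshold."""
    return ptm < PTM_THRESHOLD


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT score to one of the four legend ranges.

    Assumption: a fractional score belongs to the first range whose
    upper bound it does not exceed (e.g. 50.5 falls in "51-70").
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT out of range: {plddt}")
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    raise AssertionError("unreachable: the last bin covers up to 100")


print(is_low_confidence(0.40))  # the pTM-score reported for this family
print(plddt_bin(83.2))
```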



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 95.0%
Unclassified: 5.0%
Powered by ApexCharts

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Peatland
Bog Forest Soil
Bog
Freshwater Sediment
Watersheds
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Bog Forest Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Fen
Exposed Rock
Corn Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Miscanthus Rhizosphere
Plant Roots
Populus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
Chart data labels: 5.0%, 15.0%, 7.0%, 5.0%, 7.0%, 5.0%, 3.0%, 7.0%, 4.0%, 4.0%, 3.0%
Powered by ApexCharts



Associated Samples


Geographical Distribution
Powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12416J11903_1019771 | 3300000703 | Tropical Forest Soil | RNARARFVSSGVVANHMKEIEFSKAFIVRDPDGHPVEIAAQ*
JGI12627J18819_103736651 | 3300001867 | Forest Soil | AQVSFVSSGVVVNHTEQLGFTKAVVVRDPDGHAVEVEQK*
JGI25382J37095_102288281 | 3300002562 | Grasslands Soil | EAKVSFVSSEVVANQNRELGFPKAFVVRDPDGHAIEIKEK*
Ga0062594_1030679271 | 3300005093 | Soil | SAHTKFVSSGLVTESDGQLGFRNALLVRDPDGHPILIEEK*
Ga0066679_101578904 | 3300005176 | Soil | ELGAGKTSFVSSGVVVNQKEQLGFHRAFVIRDPDGHAIELEEK*
Ga0070663_1003895913 | 3300005455 | Corn Rhizosphere | SRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK*
Ga0070678_1014607071 | 3300005456 | Miscanthus Rhizosphere | RQLQSSRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK*
Ga0073909_1000077710 | 3300005526 | Surface Soil | VSSGVVANQTGQLGFKKALLVRDPDGHAIEIEEK*
Ga0070696_1006267321 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | QTSRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK*
Ga0070665_1016405513 | 3300005548 | Switchgrass Rhizosphere | KFVSSGLVTESDGQLGFRNALLVRDPDGHPILIEEK*
Ga0066692_107495851 | 3300005555 | Soil | VNFVSSGVVANQTGQLGFSKALLVRDPDGHAIEIEEK*
Ga0066700_105200071 | 3300005559 | Soil | QLHSGKVNFVSSGVVANQTGQLGFSKALLVRDPDGHAVEIEEK*
Ga0068852_1009458611 | 3300005616 | Corn Rhizosphere | KLVSAHTKFVSSGLVTESDGQLGFRNALLVRDPDGHPVLIEEK*
Ga0066652_1015402241 | 3300006046 | Soil | RVRFVSSGVVANHMQMLDFSKAFLVRDPDGHAIEIAAQ*
Ga0075017_1000310071 | 3300006059 | Watersheds | VSSGVVANQTGQLGFSKALLVRDPDGHAIEIEEK*
Ga0075018_101548043 | 3300006172 | Watersheds | VSSGVIANQNGQLGFSKAFVVRDPDGHAVEIEQK*
Ga0075018_105546161 | 3300006172 | Watersheds | VNFVSSGVVANHTGQLGFSKALLVRDPDGHAIEIEEK*
Ga0070712_1010072163 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | FVSSGVVANQTGELGFSKAFVVRDPDGHAVELEEKNLEEK*
Ga0075021_104414403 | 3300006354 | Watersheds | SSRVNFVSSGVVPNQTGQLGFSKAFVVRDPDGHAVEIEEK*
Ga0068871_1006467923 | 3300006358 | Miscanthus Rhizosphere | SVAKANFVSSGLIVNHNGELGFSAAFIARDPDGHAVEIEQK*
Ga0079221_101134493 | 3300006804 | Agricultural Soil | VSSGLVTESDGQLGFRNALLVRDPDGHPILIEEK*
Ga0075425_1017776462 | 3300006854 | Populus Rhizosphere | MRRPVNCIAGNVIFVSSRVVANQNAQFEFTNAFLVRDAAGHAIEIEQK*
Ga0075425_1025127211 | 3300006854 | Populus Rhizosphere | FVSSGVFIESDGQLGFKRAFLIRDPDGHAILIEEK*
Ga0066710_1015423443 | 3300009012 | Grasslands Soil | TGRVNFVSSGVVANQTGQLGFSKALLVRDPDGHVIEIEEK
Ga0099830_102236011 | 3300009088 | Vadose Zone Soil | FVSSGVVANQKTQLGFNKAFLVRDPDGHTIAIEER*
Ga0126380_112606581 | 3300010043 | Tropical Forest Soil | DHAARDLSSKVNFVSFGVITNQMDKLGCKSAFIVRDPDGHAVEIEQK*
Ga0126373_129807651 | 3300010048 | Tropical Forest Soil | DEAARDLFSAKVNFVSSGVIANQKDELDYKSAFIARDPDGHAMEIEQK*
Ga0134082_103684263 | 3300010303 | Grasslands Soil | KVNFVSSGVVANQNEQLGFRKAFLARDPDGHAIEIEEK*
Ga0134063_104311991 | 3300010335 | Grasslands Soil | KVNFVSSSVVANQNEQLGFRKAFLARDPDGHAIEIEEK*
Ga0074046_102905563 | 3300010339 | Bog Forest Soil | AARQLQTSRVNFVSSGVFVNQTGELGFSKAFLVRDPDGHTVEIEEK*
Ga0126378_105298593 | 3300010361 | Tropical Forest Soil | LYSAKVNFVSSAVIANQKDELGYRTAFIVRDPDGHAVEIEQK*
Ga0126378_112559431 | 3300010361 | Tropical Forest Soil | AARNLFSAKVSFVSSGVIANQKRELGYGSAFIVRDPDGHAIEIEQK*
Ga0126381_1022230383 | 3300010376 | Tropical Forest Soil | TARVNFVSSDVVVNQITQLGFSKAFVVRDPDGHALEIEET*
Ga0126381_1029902541 | 3300010376 | Tropical Forest Soil | TVLVTESTDQAARDLFSAKVNFVSSAEIANQKDELGYRTAFIVRDPDGPAVEIEQK*
Ga0126381_1031120273 | 3300010376 | Tropical Forest Soil | RDLLLAKVNFVSSGVVENQMDKLGYRTAFIVRDPDGHAVEIEQK*
Ga0136449_1008909833 | 3300010379 | Peatlands Soil | SRVNFVSSGVFVNQTGELGFSKAFLVRDPDGHAVEIEEK*
Ga0137391_101969183 | 3300011270 | Vadose Zone Soil | LYGGKVRFVSFGVVANQKGQLGFDKAFLARDPDGRVMAIAER*
Ga0137382_107502511 | 3300012200 | Vadose Zone Soil | HAGNVIFVSSVMVANQNAQLGFAKAFLVRDADEHAIEIEQK*
Ga0137365_104799621 | 3300012201 | Vadose Zone Soil | NFVSSGLIVNHSRELAFTSAFIVRDPDGHAVEVEQK*
Ga0137363_114673381 | 3300012202 | Vadose Zone Soil | FVSSGVVANQTGQLGFNKAFLIRDPDGHVIELEEK*
Ga0137380_107806021 | 3300012206 | Vadose Zone Soil | RRLQSSRVNFVSAGVVANQTGQLGFSKALLVRDPDGHAIEIEEK*
Ga0137377_110151021 | 3300012211 | Vadose Zone Soil | VSPRVVALPDGPLGFREAFLARDPDGHALQFRSR*
Ga0134028_11566753 | 3300012224 | Grasslands Soil | VSSGVVANQNEQLGFRKAFLARDPDGHAIEIEEK*
Ga0137387_103877373 | 3300012349 | Vadose Zone Soil | FVSSGVVVNQTGQLGFNKAFVVRDPDGHPIEIEEK*
Ga0137387_104462563 | 3300012349 | Vadose Zone Soil | QLQSGKVNFVSSGVVANQTGELGFSKALLVRDPDGHAIEIEEK*
Ga0137361_110475441 | 3300012362 | Vadose Zone Soil | SFVSSGVLANPTGQLGFQKALLVRDPDGHTIEIEEK*
Ga0137361_117302503 | 3300012362 | Vadose Zone Soil | NFVSSGVVANQTGQLGFNKAFIVRDPDGHAVELEEK*
Ga0134040_11567651 | 3300012389 | Grasslands Soil | KVNFVSSGVVANQNEQLGFRKAFLARDPDGHAIEVEEK*
Ga0134049_11388481 | 3300012403 | Grasslands Soil | FVSSGVVANPTGEPGFKKALLVRDPDGHVIELEQK*
Ga0137407_115477141 | 3300012930 | Vadose Zone Soil | NCNAGNVIFVSSGVVANQNAQLGFAKAFLVRDADGHAIEIEQK*
Ga0157371_104071001 | 3300013102 | Corn Rhizosphere | VGARVNFVSSGVVANQNNRLGFSKALVVRDPDGHAIEVEQK*
Ga0157371_111164063 | 3300013102 | Corn Rhizosphere | ADQAARNLALAKLNFVSSEVVANHNAQLEFTSAFIVRDPDGHAVEIQQK*
Ga0157374_102446911 | 3300013296 | Miscanthus Rhizosphere | SSRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK*
Ga0157375_129034853 | 3300013308 | Miscanthus Rhizosphere | ARQLQSSRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK*
Ga0181538_101279792 | 3300014162 | Bog | MTRSSAETERVTLVSSGVIANQAGQLGFSKALPVRDRDGHAIEMEEK*
Ga0137405_11280351 | 3300015053 | Vadose Zone Soil | SSGVVANQKGQQLGFNKAFLARDPDGHVMAIAER*
Ga0137409_104407141 | 3300015245 | Vadose Zone Soil | RDRVNFVSSGVIANQTGQLRFSKAFLVRDPDGHAIEIEEK*
Ga0182036_106867494 | 3300016270 | Soil | AQKLSANGTNFVSSGVVPNPTGQIGFSKAFLVRDADGHPIEFEEK
Ga0182037_109476473 | 3300016404 | Soil | LFSAKVNFVSSGVIANQMDELGYRAAFMVRDPDGHAVEIEQK
Ga0182037_115525401 | 3300016404 | Soil | TTQNADGAAQKLSANGTNFVSSGVVPNPTGQIGFSKAFLVRDADGHPIEFEEK
Ga0182038_104739173 | 3300016445 | Soil | LAKVNFVSSGVIANQKDELGFRTALIVRDPDGHAVEIEQK
Ga0187814_104067161 | 3300017932 | Freshwater Sediment | ARSLSAAQVNFVSSGAVVNHMEQLGFTKAVIVRDPDGHAIEVEEK
Ga0187879_102707433 | 3300017946 | Peatland | TFVSSGVVANQTGKLGFSKAFVVRDPDGHAIEIEEK
Ga0187822_103015951 | 3300017994 | Freshwater Sediment | EQAAHDLTSAKVNLVSSRLVANQTEQLGFKSALIVRDPDGHAVEIEQK
Ga0187887_101388241 | 3300018043 | Peatland | TSRVTFVSSGVVANQTGKLGFSRAFVVRDPDGHAIEIEEK
Ga0210406_106779253 | 3300021168 | Soil | NQFVSSGMIANQNGQLEFSKALVIRYSDGHAIEIEQK
Ga0210408_107251642 | 3300021178 | Soil | VAKHTSRVNFVSSGVVADQTTQMGFNKTFLVRDPDGHVVENEEK
Ga0213882_100641611 | 3300021362 | Exposed Rock | LTTGADNAAHNLSSAQVNFVSSGVVVNQIHLLGFSRAFVIRDPDGHAIEIEEK
Ga0213875_102402643 | 3300021388 | Plant Roots | VNFVSSGVVANQIHLLGFSRAFVIRDPDGHAIEIEEK
Ga0207684_104763382 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | HASFVSSGVIAESDGQLGFKKAFVVRDPDGHAILVEEK
Ga0207649_114012282 | 3300025920 | Corn Rhizosphere | FSNAHASFVSSGVVTESDGQLGFKKAFVARDPDGHAILVEEK
Ga0209152_103022703 | 3300026325 | Soil | SNAHASFVSSGVVAESDGQLGFKKAFLARDPDGHAILVEER
Ga0257180_10620761 | 3300026354 | Soil | SFVSSEVVANQNRGLEFTKAFVVRDPDGHAIEIEER
Ga0209056_104237212 | 3300026538 | Soil | GRNLANAKINFVSSGVVVNQNNQLGFSKALVVRDPDGHAIEIEQK
Ga0209805_10206421 | 3300026542 | Soil | NFVSSGVVANQNEQLGFRKAFLARDPDGHAIEIEEK
Ga0209684_10539431 | 3300027527 | Tropical Forest Soil | SAKVNFVSSAVIANQKDELGYRTAFIVRDPDGHAVEIEQK
Ga0209040_101071853 | 3300027824 | Bog Forest Soil | RVTFVSSGVVANHMRMLDFSKAFLVRDPDGHAIEIAAP
Ga0209040_101411692 | 3300027824 | Bog Forest Soil | TSRVNFVSSGVFVNQTGELGFSKAFLVRDPDGHAVEIEEK
Ga0209283_103300471 | 3300027875 | Vadose Zone Soil | YMARVNFVSSGVVVNQKRQLGFNKAFLVRDPDGHAILIAEK
Ga0209698_104271041 | 3300027911 | Watersheds | KTQFVSSGVIANQNGQLGFSKAFTVRDPDGHAIEIEQK
Ga0268265_126463522 | 3300028380 | Switchgrass Rhizosphere | VSSGVVANQTGELGFSKAFVVRDPDGHAVELEEKNLEEK
Ga0268264_102040343 | 3300028381 | Switchgrass Rhizosphere | QSSRVNFVSSGVVANQTGQLGFNKAFIVRDPDGHAVEFEEK
Ga0170824_1103072631 | 3300031231 | Forest Soil | VNFVSSGVVANQTGQLGFSKALLVRDPDGHAIEIEEK
Ga0170824_1113090081 | 3300031231 | Forest Soil | ARQLNSGKVNFVSSGVVTNQTGQLGFSKALLVRDPDGHAVEIEEK
Ga0170824_1176051063 | 3300031231 | Forest Soil | VNFVSSGVVANQTGQLGFKKAFLVRDPDGHAIEIEEK
Ga0302323_1013443583 | 3300031232 | Fen | SAARQLSTAKAIFVSSGVVPNHNPQLGFTKALVVRDPDGHAVELEQK
Ga0318547_105171083 | 3300031781 | Soil | ARDLFLAKVNFVSSGVIANQKDELGFRTALIVRDPDGHAVEIEQK
Ga0306923_107965821 | 3300031910 | Soil | VNFVSSGVIANQKDELGYRAAFIVRDPDGHAVEIEQK
Ga0306921_101027404 | 3300031912 | Soil | LLLAKVNFVSSGVIANQKDELGFRTALIVRDPDGHAVEIEQK
Ga0306922_119744881 | 3300032001 | Soil | AAFVSSGVVANPTGQLGFKKALLVRDPDRHVIELEEK
Ga0311301_108812923 | 3300032160 | Peatlands Soil | SRVNFVSSGVFVNQTGELGFSKAFLVRDPDGHAVEIEEK
Ga0307471_1007848591 | 3300032180 | Hardwood Forest Soil | AAKVKFVSSGVVANHMESLDFSKAFLVRDPDGHAIEIAAQ
Ga0307471_1028125431 | 3300032180 | Hardwood Forest Soil | VDRAVRDLSAAQVNFVSSGIVVNHTEQIGFTNAVMVRDPDGHAVEVEEK
Ga0307471_1029366093 | 3300032180 | Hardwood Forest Soil | LRAGKTSFVSSGVVVNQKEQLGFHRAFVIRDPDGHAIELKEK
Ga0307471_1036020461 | 3300032180 | Hardwood Forest Soil | AARVNFVSSGVVANQKTQLGFNKAFLVRDPDGHTIAIEER
Ga0335079_101821051 | 3300032783 | Soil | LAKTNFVSSGLIANQNRELGFKAAFVVRDPDGHAIEIEEK
Ga0335079_108903901 | 3300032783 | Soil | ELGLAKATFVSSGLIANQTRELGFKAAFVVRDPDGHALEIEEK
Ga0335084_113185381 | 3300033004 | Soil | AKANFVSSGLIANQNKELGFKTAFVVRDPDGHAIEIEEK
Ga0335077_106823431 | 3300033158 | Soil | NFVSSGVIANQMDELGYRNALIVRDPDGHAVEIEQQ
Ga0310914_110274763 | 3300033289 | Soil | SRLVSSGVVVNHTQELEFTKGFLVRDPDGHAIEIAGR
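
Where the four columns of this table arrive run together with no delimiter, as can happen in plain-text copies of the page, the fields can still be separated with a regular expression. The sketch below is illustrative and rests on assumptions drawn only from the rows listed here: every Sample Taxon ID is a 10-digit number starting with 3300, habitat names begin with an uppercase letter and end with a lowercase letter, and sequences are uppercase with an optional trailing `*`. The names `FamilyRow` and `parse_row` are hypothetical.

```python
import re
from typing import NamedTuple


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumptions (true for the rows on this page, not guaranteed elsewhere):
#  - the taxon ID is the 10 digits "3300xxxxxx" directly before the habitat
#  - the habitat ends with the last lowercase letter in the line
#  - the sequence is uppercase, optionally followed by "*"
ROW_RE = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"
    r"(?P<sequence>[A-Z]+\*?)$"
)


def parse_row(line: str) -> FamilyRow:
    """Split one undelimited table row into its four fields."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line!r}")
    return FamilyRow(**match.groupdict())


row = parse_row(
    "Ga0181538_1012797923300014162Bog"
    "MTRSSAETERVTLVSSGVIANQAGQLGFSKALPVRDRDGHAIEMEEK*"
)
print(row.protein_id, row.taxon_id, row.habitat, sep=" | ")
```

Because the protein ID itself ends in digits, the pattern relies on the taxon ID being immediately followed by the habitat's first letter; a row whose taxon ID does not start with 3300 would raise `ValueError` rather than be split incorrectly.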




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice