NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105315

Metagenome / Metatranscriptome Family F105315

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105315
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 37 residues
Representative Sequence MEIALIVFILLVGIGAVVSGRDSRIDETARRRRYLG
Number of Associated Samples 85
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 23.00 %
% of genes near scaffold ends (potentially truncated) 16.00 %
% of genes from short scaffolds (< 2000 bps) 83.00 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.44

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (62.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment
(15.000 % of family members)
Environment Ontology (ENVO) Unclassified
(29.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(24.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46
1LWAnN_06927680
2A2_c1_00722170
3A_all_C_00547390
4JGIcombinedJ13530_1008780691
5Ga0070665_1001196152
6Ga0074472_114309392
7Ga0066652_1011774453
8Ga0075021_100666372
9Ga0074062_120931542
10Ga0074063_137936173
11Ga0105044_101752672
12Ga0105248_130842692
13Ga0114942_14427872
14Ga0105238_115614072
15Ga0105249_128053492
16Ga0131092_101405623
17Ga0134123_108449662
18Ga0151490_16652402
19Ga0150985_1065729302
20Ga0150985_1076667373
21Ga0150984_1055240123
22Ga0157216_100878652
23Ga0157289_100878332
24Ga0157302_101480691
25Ga0164308_106562953
26Ga0157374_111559073
27Ga0075355_10766022
28Ga0075355_11049241
29Ga0163163_102917003
30Ga0157380_113080172
31Ga0157379_125860382
32Ga0157376_113491562
33Ga0167654_10441061
34Ga0167665_10044895
35Ga0167667_10036082
36Ga0167667_10037136
37Ga0167629_10864202
38Ga0163144_100613135
39Ga0163144_108980763
40Ga0132258_140053042
41Ga0136617_100068262
42Ga0163161_116800362
43Ga0190266_105198043
44Ga0184609_105797061
45Ga0190270_110453692
46Ga0190271_100535812
47Ga0190271_107685432
48Ga0163151_100415523
49Ga0163150_101816892
50Ga0210362_15740392
51Ga0193699_104660492
52Ga0182009_103349302
53Ga0247778_12365352
54Ga0247769_11416111
55Ga0233424_100955182
56Ga0233425_104575532
57Ga0207694_111353522
58Ga0207687_108059472
59Ga0207668_116394882
60Ga0207675_1023293343
61Ga0209797_100768282
62Ga0209798_101752631
63Ga0209798_104520812
64Ga0209591_102038753
65Ga0209023_100801343
66Ga0209023_103550663
67Ga0209068_101650071
68Ga0209254_100027055
69Ga0209069_101464262
70Ga0268266_107633003
71Ga0210366_104199822
72Ga0302264_10439492
73Ga0307316_103290192
74Ga0302257_11152712
75Ga0311336_109522371
76Ga0311333_101119142
77Ga0311333_102578662
78Ga0247826_114924812
79Ga0307498_104393602
80Ga0307498_104715941
81Ga0307506_101512303
82Ga0311364_117439492
83Ga0302321_1002898964
84Ga0315290_100547264
85Ga0315290_101981132
86Ga0315290_103555692
87Ga0315290_108807042
88Ga0315297_105894152
89Ga0302322_1032781291
90Ga0315278_108241112
91Ga0315278_110827381
92Ga0315292_103789102
93Ga0315283_101386472
94Ga0315283_101411753
95Ga0315283_108657291
96Ga0315271_101557733
97Ga0315270_104367232
98Ga0315287_107652262
99Ga0315273_117668232
100Ga0370497_0040727_252_389
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 53.12%    β-sheet: 0.00%    Coil/Unstructured: 46.88%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035MEIALIVFILLVGIGAVVSGRDSRIDETARRRRYLGSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.44
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
62.0%38.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater And Sediment
Freshwater Lake Sediment
Freshwater Sediment
Freshwater
Freshwater Microbial Mat
Sediment
Wetland Sediment
Groundwater
Polar Desert Sand
Freshwater
Estuarine
Wetland
Sediment (Intertidal)
Groundwater Sediment
Watersheds
Soil
Soil
Terrestrial Soil
Glacier Forefield Soil
Soil
Soil
Soil
Untreated Peat Soil
Natural And Restored Wetlands
Fen
Plant Litter
Arabidopsis Rhizosphere
Avena Fatua Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Avena Fatua Rhizosphere
Activated Sludge
4.0%15.0%3.0%3.0%12.0%4.0%6.0%8.0%3.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
LWAnN_069276802088090009Freshwater SedimentMEIALIVFILLVGLGAVTSGRDSRIDENARRRRYLG
A2_c1_007221702124908043SoilMEIALIIFILLVGIGAVTIGRDSRIDESARRRRYLG
A_all_C_005473902140918007SoilMAIAVVLFIVLVAVGSVLWGRDSRIDETDRRRRYLG
JGIcombinedJ13530_10087806913300001213WetlandMIRPMEIAILLFIVLVGIGAVLAGADSRIDESARRRRYLG*
Ga0070665_10011961523300005548Switchgrass RhizosphereMEIAVIVFILVVGIGAVVSGRDSRIDETARRRRYLG*
Ga0074472_1143093923300005833Sediment (Intertidal)MEIALLVFILLVGIAAVTSGRDSRIDENARRRRYLG*
Ga0066652_10117744533300006046SoilMEIALIVFILLVGIGALVSGRDSRIDETARRRRYLG*
Ga0075021_1006663723300006354WatershedsMAFALIIFIVLVGLGAVLGGRDSRIDDVARRRRYLG*
Ga0074062_1209315423300006606SoilMAFALIIFIVLVGLGAALVGRDSRVDEVARRRRYLG*
Ga0074063_1379361733300006953SoilMETALIVFMLLVGPLAALFGRDSRIDEKGRRQHYLG*
Ga0105044_1017526723300007521FreshwaterMEIALIVFIVLVAAGSVVAGKDSRIDESARRRRYLG*
Ga0105248_1308426923300009177Switchgrass RhizosphereMEIALIVFILLVGIGAVLAGRDSRIDEKARTRRYLG*
Ga0114942_144278723300009527GroundwaterMEIALIVFIVLVAVGSVLAGKDSRIDESARRRRYLG*
Ga0105238_1156140723300009551Corn RhizosphereMEIALIVFILLVGIGAVVAGRDSRIDETARRRRYLG*
Ga0105249_1280534923300009553Switchgrass RhizosphereMEIALIVFILLVGLGAVAAGRDSRIDETARRRRYLG*
Ga0131092_1014056233300009870Activated SludgeMEIALIVFILLVGPLAALYGRDSRIDERGRRQRYLG*
Ga0134123_1084496623300010403Terrestrial SoilMEIALIVFILLVGIGAVLSGRDSRIDEAARRRRYLG*
Ga0151490_166524023300011107SoilMEIALIVFILLVGIGAVVSGRDSRIDEAARRRRYLG*
Ga0150985_10657293023300012212Avena Fatua RhizosphereMEIALIVFIVLVAIGAVTSGRDSRIDEKARMRRYLG*
Ga0150985_10766673733300012212Avena Fatua RhizospherePILEIALIVFILLVGIGAVVAGRDSRIDETARRRRYLG*
Ga0150984_10552401233300012469Avena Fatua RhizosphereGIALIVLILLIGPLAVLYGRDSRIDERGRRERYLG*
Ga0157216_1008786523300012668Glacier Forefield SoilMIHDDHHVVMAIALIVLILVVGPLAVLLGSDSRIDETSRRRRYLG*
Ga0157289_1008783323300012903SoilMEIAVIVFILVVGIGAVVSGRDSRIDEAARRRRYLG*
Ga0157302_1014806913300012915SoilYLLSHHLRMEIALIVFILLVGIGAVLAGRDSRIDEKARTRRYLG*
Ga0164308_1065629533300012985SoilMEIALIVFILFVGIGAVAFGRDSRIDETARRRRYLG*
Ga0157374_1115590733300013296Miscanthus RhizosphereMEIAVIVFILVVGIGAVVSGRDSRIDETARRRRCLG*
Ga0075355_107660223300014322Natural And Restored WetlandsMEIAMLLFIVLVGIGAVLAGADSRIDESARRRRYLG*
Ga0075355_110492413300014322Natural And Restored WetlandsRPMEIVMLLFIVLVGIGAVFAGADSRIDESARRRRYLG*
Ga0163163_1029170033300014325Switchgrass RhizosphereMEIALIVFILLVGLGAVFAGRDSRIDETARRRRYLG*
Ga0157380_1130801723300014326Switchgrass RhizosphereMEIALIVFILLVGIGAVVGGRDSRIDETARRRRYLG*
Ga0157379_1258603823300014968Switchgrass RhizosphereMEIALIVFILIVGIGAVVAGRDSRIDETARRRRYLG*
Ga0157376_1134915623300014969Miscanthus RhizosphereMEIALIVFILLVGIGAVFAGRDSRIDETARRRRYLG*
Ga0167654_104410613300015084Glacier Forefield SoilMEIAVIVFIVLVALGAVTSGRDSRIDEAARRRRYLG*
Ga0167665_100448953300015163Glacier Forefield SoilMEIVLIVFILLVGIGAVVSGRDSRIDETARRRRYLG*
Ga0167667_100360823300015189Glacier Forefield SoilMEIALIVFIVLVAVGSVYAGKDSRIDESARRRRYLG*
Ga0167667_100371363300015189Glacier Forefield SoilMEIALIVFIVLVALGSVVAGKDSRIDESARRRRYLG*
Ga0167629_108642023300015209Glacier Forefield SoilMEIALIVFIVLVAIGAVTSGRDSRIDETARRRRYLG*
Ga0163144_1006131353300015360Freshwater Microbial MatMEIALLVFIILVALGSVFAGKDSRIDESARGRRYLG*
Ga0163144_1089807633300015360Freshwater Microbial MatMEIALIVFIVLVAAGSVVAGKDSRIDELARRRRYLG*
Ga0132258_1400530423300015371Arabidopsis RhizosphereMEIALIVFIVLVGIGAVVSGRDSRIDEKARTRRYLG*
Ga0136617_1000682623300017789Polar Desert SandMVAEREDERMELVLLGFLLLVGPLAVVFGRDSRIDEVDRRRRYLG
Ga0163161_1168003623300017792Switchgrass RhizosphereMEIAVIVFILVVGIGAVVSGRDSRIDETARRRRYLG
Ga0190266_1051980433300017965SoilMEIALIVFILLVGIGAVVSGRDSRIDETARRRRYLG
Ga0184609_1057970613300018076Groundwater SedimentMEIALIILLLLIGPLAALGGRDSRIDEAARRRHYLG
Ga0190270_1104536923300018469SoilMEIALIVFILLVGIGAVVGGRDSRIDETARRRRYLG
Ga0190271_1005358123300018481SoilMEIALIVFIVLVALGSVFAGKDSRLDESARRRRYLG
Ga0190271_1076854323300018481SoilMEIALIVFIVLVGIGAVVSGRDSRIDETARRRRYLG
Ga0163151_1004155233300020057Freshwater Microbial MatMEIALLVFIILVALGSVFAGKDSRIDESARGRRYLG
Ga0163150_1018168923300020195Freshwater Microbial MatMEIALLVFIILVALGSVLAGKDSRIDESARGRRYLG
Ga0210362_157403923300021329EstuarineMGIALLVFILLVGIAAVTSGRDSRIDENARRRRYLG
Ga0193699_1046604923300021363SoilMEIALIVFILVVGIGAVVAGRDSRIDETARRRRYLG
Ga0182009_1033493023300021445SoilMEIALIVFILVVGIGAVVSGRDSRIDEAARRRRYLG
Ga0247778_123653523300022894Plant LitterMEIALIVFILLVGLGAVFAGRDSRIDETARRRRYLG
Ga0247769_114161113300022904Plant LitterEIALIVFILLVGIGAVVSGRDSRIDEAARRRRYLG
(restricted) Ga0233424_1009551823300023208FreshwaterMGMEIALIVFLLLVGPLALLGGRDSRIDEAERRRHYLG
(restricted) Ga0233425_1045755323300024054FreshwaterMEIALIVFLLLVGPLALLGGRDSRIDEAERRRHYLG
Ga0207694_1113535223300025924Corn RhizosphereMEIALIVFILLVGIGAVVAGRDSRIDETARRRRYLG
Ga0207687_1080594723300025927Miscanthus RhizosphereMEIAVIVFILVVGIGAVVSGRDSRIDEAARRRRYLG
Ga0207668_1163948823300025972Switchgrass RhizosphereMEIALIVFIVLVAVGAVFSGRDSRIDEKARQRRYLG
Ga0207675_10232933433300026118Switchgrass RhizosphereADMEIAVIVFILVVGIGAVVSGRDSRVDEAARRRRYLG
Ga0209797_1007682823300027831Wetland SedimentMEIALILFILLVGIAAVTSGRDSRIDENARRRRYLG
Ga0209798_1017526313300027843Wetland SedimentMEIALLVFILLVGIAAVTSGRDSRIDENARRRRYLG
Ga0209798_1045208123300027843Wetland SedimentMEIALLLFVLLVAIGAVASGSDSRIDEAARRRRYLG
Ga0209591_1020387533300027850FreshwaterMDIALIVFIVLVAAGSVVAGKDSRIDESARRRRYLG
Ga0209023_1008013433300027870Freshwater And SedimentMEIALIVFILLVGIGAVVSGRDSRIDEIARRRRYLG
Ga0209023_1035506633300027870Freshwater And SedimentMQIALIVFIVLVAVGSVFAGKDSRIDESARRRRYLG
Ga0209068_1016500713300027894WatershedsMALALIIFIVIVGLGAVRGGRDSRIDEVARRRRYLG
Ga0209254_1000270553300027897Freshwater Lake SedimentMEIALLVFILLVGLAAVTSGRDSRIDENARRRRYLG
Ga0209069_1014642623300027915WatershedsMMRGMEIALIVFILLVGIGAVVSGRDSRIDETARRRRYLG
Ga0268266_1076330033300028379Switchgrass RhizosphereAAMEIALIVFILLVGIGAVVSGRDSRIDEAARRRRYLG
Ga0210366_1041998223300028420EstuarineMEIALIVFLVLVTVGAVLVGKDSRIDEASRRRRYLG
Ga0302264_104394923300028732FenMEIAVIVFIVLVGLGAVLAGADSRIDEAARRRRYLG
Ga0307316_1032901923300028755SoilSSHHVDMEIVLLLFILLVGVGAVLGGRDSRIDEVARRRRYHG
Ga0302257_111527123300028855FenMRLMEIALIVFILLVGIGAVTVGRDSRIDEAARRRRYLG
Ga0311336_1095223713300029990FenLIVPDRHTVSMEIALIVFMLLVGPLAVLFGHDSRIDERGRRQHYLG
Ga0311333_1011191423300030114FenMMRLMEIALIVFILLVGIGAVTVGRDSRIDEAARRRRYLG
Ga0311333_1025786623300030114FenMEIALIVFMLLVGPLAVLFGHDSRIDERGRRQHYLG
Ga0247826_1149248123300030336SoilMEIALIVFILLVGIGAVLSGRDSRIDEAARRRRYLG
Ga0307498_1043936023300031170SoilMEIAVIVFILLVGLAAVTVGRDSRIDETARRRRYLG
Ga0307498_1047159413300031170SoilIHVRHTVSMEIALIVLFLLVGPAAILAGRDSRIDEAGRRRRYLG
Ga0307506_1015123033300031366SoilMGIALIVFIVLVGIGAVTSGRDSRIDETARQRRYLG
Ga0311364_1174394923300031521FenMRDMEIALMVFILLVGIGAVTVGRDSRIDETARRRRYLG
Ga0302321_10028989643300031726FenMEIALIVFILLVGIGAVTVGRDSRIDETARRRRYLG
Ga0315290_1005472643300031834SedimentMEIALIVFVLLVGLGAVAFGSDSRIDENARRRRYLG
Ga0315290_1019811323300031834SedimentMEIAVIIFIVLVGLGSVAFGRDSRIDENARRRRYLG
Ga0315290_1035556923300031834SedimentMEIALIVFILLVGIGAVVFGRDSRIDEMARRRRYLG
Ga0315290_1088070423300031834SedimentMEIALIVFVLLVGLGAVAFGTDSRIDENARRRRYLG
Ga0315297_1058941523300031873SedimentMEIALIVFIVLVAVGSVFAGKDSRIDESARRRRYLG
Ga0302322_10327812913300031902FenLIIPDRHTVSMEIALIVFMLLVGPLAVLFGHDSRIDERGRRQHYLG
Ga0315278_1082411123300031997SedimentMEIAVIIFILLVGLGSVAFGRDSRIDENARRRRYLG
Ga0315278_1108273813300031997SedimentEIALIVFIVLVAVGSVFAGKDSRIDESARRRRYLG
Ga0315292_1037891023300032143SedimentMEIALIVFVLLIGLVAVTVGSDSRIDENARRRRYLG
Ga0315283_1013864723300032164SedimentMEIALIVFILLVGIAAVTSGRDSRIDENARRRRYLG
Ga0315283_1014117533300032164SedimentMEIALIVFILLVGLGAVVSGRDSRIDENARRRRYLG
Ga0315283_1086572913300032164SedimentSAHMEIALIVFVLLVGLGAVAFGTDSRIDENARRRRYLG
Ga0315271_1015577333300032256SedimentMEIALIVFVLLVAVGSIFAGKDSRIDETARTRRYLG
Ga0315270_1043672323300032275SedimentMEIAVIIFILLVGLGSVAFGRDSRIDENARRCRYLG
Ga0315287_1076522623300032397SedimentMEIALIVFVLLIGLAAVTIGSDSRIDENARRRRYLG
Ga0315273_1176682323300032516SedimentMEIALIVFILLVGIGAVVFGRDSRIDEAARRRRYLG
Ga0370497_0040727_252_3893300034965Untreated Peat SoilMIPVRHTVSMEIALIVFMLLVGPVALLLGRDSRIDEAGRRRNYLG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.