NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F101949

Metagenome / Metatranscriptome Family F101949

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F101949
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 38 residues
Representative Sequence PVAPVVETVAVEVIEQPAPGVITVTEVEETEVRKAS
Number of Associated Samples 70
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 3.12 %
% of genes near scaffold ends (potentially truncated) 91.18 %
% of genes from short scaffolds (< 2000 bps) 86.27 %
Associated GOLD sequencing projects 64
AlphaFold2 3D model prediction Yes
3D model pTM-score0.17

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (58.824 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(29.412 % of family members)
Environment Ontology (ENVO) Unclassified
(54.902 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(56.863 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.
1Ga0066388_1003109521
2Ga0066903_1037479962
3Ga0066903_1066118872
4Ga0066903_1075780972
5Ga0075015_1002393683
6Ga0075015_1008300041
7Ga0070715_105532451
8Ga0075018_102060231
9Ga0126376_101452512
10Ga0126372_108585831
11Ga0126378_102982783
12Ga0126378_109643081
13Ga0126378_115089012
14Ga0126377_109349491
15Ga0126379_110540613
16Ga0126379_130794981
17Ga0126381_1005195781
18Ga0126381_1013325011
19Ga0126381_1039050091
20Ga0126381_1040330052
21Ga0126383_105322694
22Ga0126383_126461211
23Ga0137365_104596891
24Ga0137395_109705441
25Ga0126369_106117391
26Ga0126369_111745841
27Ga0120125_10482802
28Ga0182036_109554322
29Ga0182041_101039881
30Ga0182041_103812561
31Ga0182041_111080741
32Ga0182033_108150911
33Ga0182033_117726361
34Ga0182033_118108231
35Ga0182035_111396091
36Ga0182034_110777201
37Ga0182040_105529431
38Ga0182040_108802081
39Ga0182040_109271891
40Ga0182038_120830532
41Ga0187820_12335261
42Ga0187822_101944022
43Ga0187815_100545101
44Ga0187804_105381032
45Ga0210399_101773153
46Ga0210387_114393651
47Ga0208604_10179162
48Ga0209178_13312791
49Ga0209465_100473113
50Ga0307482_12359181
51Ga0170820_142635072
52Ga0170819_132037963
53Ga0318516_106222872
54Ga0318515_101475093
55Ga0318555_106595981
56Ga0318542_105394592
57Ga0318561_105320081
58Ga0310686_1015864671
59Ga0310686_1199038551
60Ga0318496_101894811
61Ga0306918_104185691
62Ga0318492_108242822
63Ga0318494_109448202
64Ga0307475_100421882
65Ga0307475_109259481
66Ga0318535_101248314
67Ga0318529_104531552
68Ga0318576_105923031
69Ga0318568_109909331
70Ga0306919_101300281
71Ga0306919_103889971
72Ga0306919_109483801
73Ga0318520_102616811
74Ga0306923_101374741
75Ga0306923_101952331
76Ga0306923_106476101
77Ga0306923_110329581
78Ga0306923_116183912
79Ga0306923_122225912
80Ga0306921_109288051
81Ga0306921_113220001
82Ga0310912_115058881
83Ga0310913_110222611
84Ga0306926_113913211
85Ga0306926_119098691
86Ga0318530_104126182
87Ga0307479_102827001
88Ga0318507_103464722
89Ga0310911_101895543
90Ga0318506_103983401
91Ga0318533_101490631
92Ga0318533_103778251
93Ga0318505_104211112
94Ga0318505_104285321
95Ga0318540_100025839
96Ga0307471_1030148992
97Ga0307472_1006858322
98Ga0306920_1000204901
99Ga0306920_1000909961
100Ga0306920_1043794702
101Ga0310914_117358091
102Ga0318519_108854981
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 0.00%    Coil/Unstructured: 100.00%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035PVAPVVETVAVEVIEQPAPGVITVTEVEETEVRKASSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.17
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
41.2%58.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Watersheds
Soil
Vadose Zone Soil
Tropical Forest Soil
Agricultural Soil
Permafrost
Soil
Forest Soil
Soil
Hardwood Forest Soil
Tropical Forest Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
3.9%2.9%15.7%27.5%29.4%5.9%4.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0066388_10031095213300005332Tropical Forest SoilPKVKQPVAPVVETVAAEVIEQPAPDVVTVTEVEATRQVS*
Ga0066903_10374799623300005764Tropical Forest SoilKAARKVKQPATPVIETVAVEVIEQPAPGVIAVTEVEETEIRKAS*
Ga0066903_10661188723300005764Tropical Forest SoilPVAPAVETVAAEVIEQPAPVIAVTEVEETEIRKAS*
Ga0066903_10757809723300005764Tropical Forest SoilKMKQPVAQVVETAAVEVVEQPAPGVITVTEVEEAEVRKAS*
Ga0075015_10023936833300006102WatershedsMKQPVAPAVDTVAVEVIEQSASGVITVTEVEETEVRQAS*
Ga0075015_10083000413300006102WatershedsPVTPVVETVAVEVIEQPAPGVITVTEIEETRKAS*
Ga0070715_1055324513300006163Corn, Switchgrass And Miscanthus RhizosphereKQPIAPVVETVAVEVIEQPAPGVITVTEVEETEIRKAS*
Ga0075018_1020602313300006172WatershedsKVARKVRQPVAPVVEIVAAEVIEQPAPNVITVTEVEETRKAS*
Ga0126376_1014525123300010359Tropical Forest SoilMKKAARPVAPAVETVAVEVIEQPAPDVITVTEVEETELRQAS*
Ga0126372_1085858313300010360Tropical Forest SoilKVKQPIAPVVETVASEVIEHPATGVITEVGETEIRKAS*
Ga0126378_1029827833300010361Tropical Forest SoilAQTRADEKAALPVAPSETVAVEVIEQPAPGVIAVTEVEETELRQAS*
Ga0126378_1096430813300010361Tropical Forest SoilPVAQVVETAAVEVVEQPAPGVITVTEVEEAEVRKAS*
Ga0126378_1150890123300010361Tropical Forest SoilAPVVETVAVGVIEQPAPGVISVTEVEETEIRKAS*
Ga0126377_1093494913300010362Tropical Forest SoilAPVAPVVETVAVEVIEQPAPDVITVTEVEETEIRKAS*
Ga0126379_1105406133300010366Tropical Forest SoilPVAPVVETVAVEVIGQPAPDVITVTEVEETEVRKAS*
Ga0126379_1307949813300010366Tropical Forest SoilKPPAAPAIETAAVEVTELPAPGVITVSEVEETEVRKAS*
Ga0126381_10051957813300010376Tropical Forest SoilIEQPVVTAVETVAVEVIERPAPDVITVTEVEETRKAS*
Ga0126381_10133250113300010376Tropical Forest SoilPVAPAVETVAVEVIEQPAPGVITVTEVEETEVRQVS*
Ga0126381_10390500913300010376Tropical Forest SoilPVTPVIEIAAVEVIAKPAPDVITVTEVEANEVRKAS*
Ga0126381_10403300523300010376Tropical Forest SoilARPVAPVVETVAVEVIEQPAPGVITVMEVEETEVRQAS*
Ga0126383_1053226943300010398Tropical Forest SoilKVKRPVAPVVETVAVEVIGQPAPDVITVTEVGEAEVRKAS*
Ga0126383_1264612113300010398Tropical Forest SoilQPVAPAVETVAAEVIEQPAPGVITVTEVEETEIRKAS*
Ga0137365_1045968913300012201Vadose Zone SoilKAARKVKQAVTPVVETVAVEVLEQPAPTVTEVEETRKAS*
Ga0137395_1097054413300012917Vadose Zone SoilKVKQAVTPVVETVAVDVIEQPAPTVTEVEETRKAS*
Ga0126369_1061173913300012971Tropical Forest SoilKVKPPAAPAIETVAVEVIEQPAPVITVTEVEEAEIRKAS*
Ga0126369_1117458413300012971Tropical Forest SoilPVKKAARPVAPVVGTVAVEVIEQPAPGVITVTEVEETEVR*
Ga0120125_104828023300014056PermafrostVKQPTAPVVETVAVEVIEQPVAVMEVEETLVREVSPDDPQ*
Ga0182036_1095543223300016270SoilARKVKRPVAPVGEIVAAEVMEQAAPGQITVTEIEETEVRKAS
Ga0182041_1010398813300016294SoilRPVAPAVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0182041_1038125613300016294SoilPVAPAVETVTVEVIEQPVPAVTATEVEETEIRQVS
Ga0182041_1110807413300016294SoilARPVAPAVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0182033_1081509113300016319SoilIKQPIVAAVETVAVEVIEQPPPVIAVTEVEETEVRKAS
Ga0182033_1177263613300016319SoilAARPVAPVVGTVAVEVIEQPAPGVITVAEVEETEVRQAS
Ga0182033_1181082313300016319SoilRRIKQPVAPAVETVAAEVIEQPAPVIAVTGVEETEIRKAS
Ga0182035_1113960913300016341SoilAARKVKQPVAPVVETVAVEVIEQPAPSVITVTEVEETRQVS
Ga0182034_1107772013300016371SoilAARKVKEPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRKAS
Ga0182040_1055294313300016387SoilVKKAARPVAPVVGTVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0182040_1088020813300016387SoilPIVAAVETVAVEVIEQPPPVIAVTEVEETEVRKAS
Ga0182040_1092718913300016387SoilVKQPVAPVVLVETVAAEVTEQPAPGMITVTEIEETEVRKAS
Ga0182038_1208305323300016445SoilKQPIAAVVETVAVEAIEQPAPGVITVTEVEETEVRRAG
Ga0187820_123352613300017924Freshwater SedimentTAPAVETVAVEVIEQPAPGVITVTEVEETELRKVS
Ga0187822_1019440223300017994Freshwater SedimentVKQPVAPVIETVAVEAIKQPAPDVITVTEAGETEIRKAS
Ga0187815_1005451013300018001Freshwater SedimentAARKVKQPAAPVVETVAVEMIEQPAPEVVEVTEVEKPEVRKAS
Ga0187804_1053810323300018006Freshwater SedimentAAPTVETVAVEVIEHPAPGVITVTEVEETEVRQAS
Ga0210399_1017731533300020581SoilAVRKVKQPVSSVVETVAVEVIEQPAPGVITVTEVEETRKAS
Ga0210387_1143936513300021405SoilKPSAPSVETVVVDVIEQPAPNVITVTEVEETRKAS
Ga0208604_101791623300027090Forest SoilKQPAAPVVETVAAEAIEQPAPGVIAVTEVEETEVRKAS
Ga0209178_133127913300027725Agricultural SoilPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRKAS
Ga0209465_1004731133300027874Tropical Forest SoilAARRIEQPVVTAVETVAVEMIERPAPDAITVTEVEETRKAS
Ga0307482_123591813300030730Hardwood Forest SoilKVKQPVAPAVETVAVEVVEQPAPGVITVTEVEETRKAS
Ga0170820_1426350723300031446Forest SoilKQPVTPVVETVAVEVLEQPASGVITVTEIEETRKAR
Ga0170819_1320379633300031469Forest SoilCDLQPAAPVVETVAVEVIEQPAPGLITVTEVEETRQVS
Ga0318516_1062228723300031543SoilPVKKAARPVAPVVGTVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0318515_1014750933300031572SoilVKQPVTPVIETAAVEMIEQPAPDVITVTEVEETRQVS
Ga0318555_1065959813300031640SoilKQPVAPVVESVAVEVIEQPAPGVITVTEVEETEIRKAS
Ga0318542_1053945923300031668SoilVKQPVAPVVETVAAEVIEQPAPVTVSEVEEIRKAS
Ga0318561_1053200813300031679SoilAARPVAPAVETVAVEVIEQPAPGVITVTEVEETDVRQAS
Ga0310686_10158646713300031708SoilMKQPPAVETVAVEVVEQPAPGVITVTEVEETELRK
Ga0310686_11990385513300031708SoilSLQQSKNVAVEVIEQPAPGVITVTEVEEAGVRQAS
Ga0318496_1018948113300031713SoilKAARKVKQPIAPVVETVAVEVIEQLAPGVITVTEVEETRQVS
Ga0306918_1041856913300031744SoilVKQPIAPVVETVATEVIEQPAPGIITVTEVEETEVRKAS
Ga0318492_1082428223300031748SoilQPVTPVIETAAVEVIEQPAPDVITVTEVEETEVRKAS
Ga0318494_1094482023300031751SoilKKVARKVRQPVTPVIETASVEMIEQPAPDVITVTEVEETRQVS
Ga0307475_1004218823300031754Hardwood Forest SoilMKQPIAPVVETVAIEVIEQPAPGVITVTEVEETEVRQVN
Ga0307475_1092594813300031754Hardwood Forest SoilRPAAPTVETAAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0318535_1012483143300031764SoilKTARKVKQPIAPVVETVAVEVIEQLAPGVIAVTEVEETRQVS
Ga0318529_1045315523300031792SoilERMKQPVAPILETVAAEVIEQPAPGVIAVTEIEETRQVS
Ga0318576_1059230313300031796SoilVAPVVETVAAEVIEQPAPGQITVTEIEETEVRKAS
Ga0318568_1099093313300031819SoilLPVAPSETVAVEVIEQPAPGVITVTEVEETELRQAS
Ga0306919_1013002813300031879SoilRKERMKQPVTAVVETVAADVIEQPAPGVVAVTEVEETEVRKAS
Ga0306919_1038899713300031879SoilKQPVTPVIETAAVEMIEQPAPDVITVTEVEETRQVS
Ga0306919_1094838013300031879SoilPVTPVIETAAVEMIEQPAPDVITVTEVEKTEVRKAS
Ga0318520_1026168113300031897SoilKTARKVKQPIAPVVEGVTAEVIEQSAPDVVTVTEVEAARQVS
Ga0306923_1013747413300031910SoilKVKQPIAPVVETVAVEVIEQPAPGVITVTEVEETRQVS
Ga0306923_1019523313300031910SoilAARPVAPAVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0306923_1064761013300031910SoilAARPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0306923_1103295813300031910SoilQPIVAAVETVAVEVIEQPPPVIAVTEVEETEVRKAS
Ga0306923_1161839123300031910SoilKAARPVAPAVETVAVEVIEQPAPGVITVTEVEETDVRQAS
Ga0306923_1222259123300031910SoilPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0306921_1092880513300031912SoilALCKSKQPVAPVVETVAAQVIEQPAPGVITVTEVAEISQVS
Ga0306921_1132200013300031912SoilRMKQPVAPVVETVAVEVIEQPAPSVMTVTEAEQTRKAS
Ga0310912_1150588813300031941SoilRPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0310913_1102226113300031945SoilGARKVKPPAAPAIETAAVEVIEQPAPGVITVTEVEETEVRKAS
Ga0306926_1139132113300031954SoilVKQPATPVIETVAVEVIEQPVTVTEIEQAEIRKAS
Ga0306926_1190986913300031954SoilQPAAPAVETVAAEVIEQPAPVIAVTEVEETEIRKAS
Ga0318530_1041261823300031959SoilKKATRKERMKQHVAPVVEIVATEVIEQPAANVITVTEVEETRKAS
Ga0307479_1028270013300031962Hardwood Forest SoilSVKKATRPVAPAVETVAVEVIEQPAPGVITVTEVVDETQVRQAS
Ga0318507_1034647223300032025SoilPVKKAARPVAPVVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0310911_1018955433300032035SoilARKERMKQPVAPILETVAAEVIEQPAPGVIAVTEIEETRQVS
Ga0318506_1039834013300032052SoilVKQPVAPVVESVAVEVIEQPAPGVITVTEVEETEIRKAS
Ga0318533_1014906313300032059SoilVAPAVETVAVEVIEQPAPGVITVTEVEETEVRQAS
Ga0318533_1037782513300032059SoilRKVKQPVTPVIETAAVEMIEQPAPDVITVTEVEETRQVS
Ga0318505_1042111123300032060SoilVAPVVETVVVEVVEQPAPGVTTVTEVEETEIRKAS
Ga0318505_1042853213300032060SoilKKAAPPVAPAVETAAVEVIEQPAPGVTTVTEVEETRQVT
Ga0318540_1000258393300032094SoilKKAALKVKQPIAPVVETVAVEVIEQPAPGVITVTEVEETRQVS
Ga0307471_10301489923300032180Hardwood Forest SoilKVKQPVAPVVETTVAEVIEQPAPDVISVTEVEETDVRKAS
Ga0307472_10068583223300032205Hardwood Forest SoilKVARKVKQAVTPVVETVAVEVIEQPAPGVITVTEIEETRKAS
Ga0306920_10002049013300032261SoilKQPVAPVVETVAAEVIEQPAPDVVTVTEVEATRQVS
Ga0306920_10009099613300032261SoilKQPVAPVVETVAAEVIEQPAPGVVTVTEAEETHQAS
Ga0306920_10437947023300032261SoilVKKVARPVAPAVETVAVEVIEQPAPGVMTVTEVEETEVRQAS
Ga0310914_1173580913300033289SoilAARKERMKQPVAPILETVAAEVIEQPAPGVIAVTEIEETRQVS
Ga0318519_1088549813300033290SoilKERMKQHVAPVVEIVATEVIEQPAANVITVTEVEETRKAS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.