NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F089226

Overview

Basic Information
Family ID: F089226
Family Type: Metagenome
Number of Sequences: 109
Average Sequence Length: 40 residues
Representative Sequence: VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSE
Number of Associated Samples: 84
Number of Associated Scaffolds: 109

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 92.66 %
% of genes near scaffold ends (potentially truncated): 98.17 %
% of genes from short scaffolds (< 2000 bps): 97.25 %
Associated GOLD sequencing projects: 79
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.19

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group: Bacteria (56.881 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (48.624 % of family members)
Environment Ontology (ENVO): Unclassified (68.807 % of family members)
Earth Microbiome Project Ontology (EMPO): Unclassified (48.624 % of family members)
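
The percentages reported above are shares of the family, so each can be converted back to an approximate member count. The following is a minimal Python sketch, assuming that every percentage uses the 109 sequences listed under Basic Information as its denominator. The function name `share_to_count` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
# Convert a reported percentage of family members back to a whole-number count.
# Assumption: every share on the page is computed over the 109 family members.
FAMILY_SIZE = 109


def share_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the member count closest to `percent` of `total`."""
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment, Taxonomy and Ecosystem entries.
reported = {
    "valid RBS motifs": 92.66,
    "near scaffold ends": 98.17,
    "short scaffolds": 97.25,
    "Bacteria": 56.881,
    "GOLD Forest Soil → Soil": 48.624,
    "ENVO Unclassified": 68.807,
}

for label, percent in reported.items():
    count = share_to_count(percent)
    # Recompute the share from the count to check the round trip.
    print(f"{label}: {count} of {FAMILY_SIZE} ({count / FAMILY_SIZE:.3%})")
```

If the recomputed share matches the reported one to the printed precision, the assumed denominator is consistent with the page.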



Multiple Sequence Alignments

Full Alignment: alignment of all 109 sequences in the family (interactive MSAViewer display; the viewer rows are labeled with the protein IDs listed under Family Sequences).


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 8.82%, Coil/Unstructured: 91.18%
Feature Viewer tracks over positions 1-40 of the representative sequence: Sequence, α-helices, β-strands, Coil, SS Conf. score.

Predicted 3D Structure

Structure Viewer

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.19
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
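
The confidence readouts above can also be interpreted programmatically. The following Python sketch is an illustration, not NMPFamsDB's own code: it bins per-residue pLDDT values into the four bands shown in the legend and applies the pTM < 0.7 cutoff quoted in the note. The function names, the handling of fractional scores at band edges, and the example scores are assumptions.

```python
from collections import Counter

# pLDDT bands as listed in the legend: 0-50, 51-70, 71-90, 91-100.
# Assumption: a fractional score belongs to the first band whose upper bound it does not exceed.
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]
PTM_CUTOFF = 0.7  # models with pTM below this value are flagged as low confidence


def plddt_band(score: float) -> str:
    """Map one per-residue pLDDT score to its legend band."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    for upper, label in PLDDT_BANDS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score was range-checked above")


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM-score falls below the cutoff."""
    return ptm < cutoff


def band_counts(scores):
    """Count how many residues fall into each pLDDT band."""
    return Counter(plddt_band(s) for s in scores)


# Hypothetical per-residue scores, for illustration only (not from this family's model).
example_scores = [34.2, 48.9, 55.0, 72.5, 91.3]
print(band_counts(example_scores))
print(is_low_confidence(0.19))  # 0.19 is the pTM-score reported for F089226
```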


Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



Phylogeny

NCBI Taxonomy

NCBI taxonomy distribution (ApexCharts chart): All Organisms 56.9%; Unclassified 43.1%

Associated Scaffolds





Environmental Properties

Associated Habitat Types

Habitat types in the chart legend (ApexCharts chart): Freshwater Sediment; Watersheds; Vadose Zone Soil; Tropical Forest Soil; Soil; Soil; Soil; Hardwood Forest Soil; Tropical Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Termite Gut; Arabidopsis Rhizosphere
Slice labels shown in the chart: 4.6%, 8.3%, 48.6%, 22.9%



Associated Samples


Geographical Distribution



Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0070699_1002917921 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | VVPSYVFAGVGGYYASKQSGLGGVFRRDAADGSDWKHALSEI
Ga0066654_108022052 | 3300005587 | Soil | VAPSYVFAGIGGYYASKQGGLAGVFRCATSGTDVALSRPY*
Ga0066903_1012919671 | 3300005764 | Tropical Forest Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHAL
Ga0066903_1088587062 | 3300005764 | Tropical Forest Soil | VVPSCVFAGLGGYYASKQSGLAGVFRRDAADGSDWKHALREIEAFTVFV
Ga0075024_1003306801 | 3300006047 | Watersheds | VALSYVFAGVGGYYASKQGGLAGVFRCATDGTDWKHALSEIEAF
Ga0070712_1020254801 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | VVPSYIFAGVGGYYASKQGGLAGVFRGAADGSDWKHALSEIEAF
Ga0066653_103245633 | 3300006791 | Soil | VAPSYVFADIGGYYASKQGGLAGVFRCATSGTDVALSRPY*
Ga0066660_111453882 | 3300006800 | Soil | VTLSYVFAGVGGYYASKQGGLAGVFRCAADGTDWKHPLNEI
Ga0126380_105514541 | 3300010043 | Tropical Forest Soil | VSSSYVFAGVGGYYASKQGGLAGVFRCANDATDWKHALSEI
Ga0126384_112744071 | 3300010046 | Tropical Forest Soil | VFAGLGGYYASKQSGLAGVFRRDAADGSDWKHALSE
Ga0126373_108989622 | 3300010048 | Tropical Forest Soil | VASSYVFAGVGGYYASKQGGLAGVFRYATDGSDWKHALSEIE
Ga0131853_108636703 | 3300010162 | Termite Gut | MAQSYIYAGVGGYYASKDQGDLAGVFRGALGEGWTHPLSEREAFTVCVHP
Ga0126381_1013651071 | 3300010376 | Tropical Forest Soil | VFAGVGGYYASKQGGLAGVFRYATDGSDWKHALSEIEAFTVF
Ga0137362_117736221 | 3300012205 | Vadose Zone Soil | VFAGVGGYYASKQSGLAGVFCRDAADGSDWKHALSEIEAFTV
Ga0137377_101306151 | 3300012211 | Vadose Zone Soil | MARSYVFAGVGGYYGSKQSGLAGVFCCATDDTNWKHALS
Ga0137386_109718302 | 3300012351 | Vadose Zone Soil | VFAGVGGYYASKQSGLAGVFRCATDDSNWKHALSEIEAF
Ga0137361_111035401 | 3300012362 | Vadose Zone Soil | VFAGVGGYYASKQSGLAGVFRRDAADGSDWKHALSEIEA
Ga0137396_112463462 | 3300012918 | Vadose Zone Soil | VFAGVGGYYASKQGGLAGVFRCAANGTDWKHTLSEIEAFTVFLHPRDPNL
Ga0126375_118506331 | 3300012948 | Tropical Forest Soil | VFTGVGGYYASKQSGLAGVFRRAADGSDWKHALSEIEAF
Ga0126369_100598023 | 3300012971 | Tropical Forest Soil | VCSAGVGGYYASKQSGLAGVFRRAADGSDWKHALSEIE
Ga0126369_116964331 | 3300012971 | Tropical Forest Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALS
Ga0132256_1030334351 | 3300015372 | Arabidopsis Rhizosphere | VAQSYVFAGVGGYYASTEKGGKAGVFRRAADQSSWD
Ga0182041_107877381 | 3300016294 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHA
Ga0182041_110742621 | 3300016294 | Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDWKH
Ga0182041_111925403 | 3300016294 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHAL
Ga0182033_118670392 | 3300016319 | Soil | VVPSYVFAGVGGYYASKQSGLAGVFRRDAADGSDWK
Ga0182035_109260031 | 3300016341 | Soil | VAPSYVFAGVGGYYANKQGGLAGVFRRATDDTNWKHALSEIEV
Ga0182032_118843142 | 3300016357 | Soil | MARSCVLAGVGGYYASKQGGLAGVFRRSAEGSEWKHALSEI
Ga0182040_101549223 | 3300016387 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWK
Ga0182040_102795354 | 3300016387 | Soil | VAPSDVFAGVGRYSASKQGGLAGVFRCAADGSDWKHALSEV
Ga0182039_111703472 | 3300016422 | Soil | VSSSYVYAGVGGYYASKQGGLAGVFRCATDGTDWKHALSE
Ga0182038_105812173 | 3300016445 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCADATDWKHALSEIE
Ga0187804_102086381 | 3300018006 | Freshwater Sediment | VALSYVFAGVGGYYASKQGGLAGVFRCATDGTDWKHALSE
Ga0210403_111383231 | 3300020580 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCATGGTDWKHALNEIETFTVF
Ga0210404_103837472 | 3300021088 | Soil | MVPSYVFAGVGGYYASKQGGLAGVFRCAANGTDWKHALSEIEAFTVFL
Ga0210386_109382631 | 3300021406 | Soil | VVPSYIFAGVGGYYASKQGGLAGVFRSAADGSDWKH
Ga0126371_121854252 | 3300021560 | Tropical Forest Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDADDSDWKHALSEI
Ga0126371_136558422 | 3300021560 | Tropical Forest Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSE
Ga0207700_106001203 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | VALSYVFAAVGGYYASKQGGLAGVFRCATDGTDWKH
Ga0209625_10243711 | 3300027635 | Forest Soil | VVPSYIFAGVGGYYASKQGGLAGVFRGAADGSDWKHAL
Ga0209465_100769021 | 3300027874 | Tropical Forest Soil | VVSSCVFAGVGGYYASKQSGLAGVFRRAADDSDWKHALSEIEAF
Ga0318541_101929501 | 3300031545 | Soil | VALSYVFAGVGGYYASKQGGLAGVFRYATDGTDWKHALSEIETFTSSSTRGI
Ga0318541_105351851 | 3300031545 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSEIE
Ga0318528_103714071 | 3300031561 | Soil | VVPSYVFTGVGGYYASKQGGLAGVFRRATDGTDWKHALS
Ga0318573_102665501 | 3300031564 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWKH
Ga0310915_108231122 | 3300031573 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKH
Ga0318542_106576081 | 3300031668 | Soil | VAPSYVFAGVGGYYASKQSGLAGVFRCAADGSDWK
Ga0318574_106007551 | 3300031680 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHA
Ga0318572_106079041 | 3300031681 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALS
Ga0318560_104935422 | 3300031682 | Soil | VPWFPSCVFAGVGGYYASKQSGLAGVFRRAADGSDW
Ga0307469_118845192 | 3300031720 | Hardwood Forest Soil | VVRSCVFAGVGGYYASKQSGLAGVFRRDAADGSDWKHALSE
Ga0318501_100995151 | 3300031736 | Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSE
Ga0306918_104630771 | 3300031744 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDAADGSEWKHALSEIEAFT
Ga0318492_102450032 | 3300031748 | Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDWK
Ga0318494_105265081 | 3300031751 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWKHALS
Ga0318509_102822682 | 3300031768 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWKHALSE
Ga0318509_104083143 | 3300031768 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSE
Ga0318521_108162942 | 3300031770 | Soil | VASSYVFAGVGGYYASKQGGLAGVFRCATDGTDWKHALSDIEA
Ga0318547_102167961 | 3300031781 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALSE
Ga0318547_102307923 | 3300031781 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADDSDWK
Ga0318503_101175871 | 3300031794 | Soil | VVPSCVFAGIGGYYASKQSGLAGVFRRAADGSDWKHA
Ga0318557_100201861 | 3300031795 | Soil | MVPSFVCAGVGGYYASKQSGLAGVFRRAADGSDWKHAL
Ga0318576_105731741 | 3300031796 | Soil | VASSYVFAGVGGYYASKQGGLAGVFRYATDGTDWKHALSEIKTFIPI
Ga0318523_102741452 | 3300031798 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAANGSDWKHALSE
Ga0318568_108442882 | 3300031819 | Soil | VALSYVFAGVGGYYASKQGGLAGVFRYATDGTDWKHALSEIETFTSSSTRGIPISSSRVPLMASI
Ga0318568_108881702 | 3300031819 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALSEVETFT
Ga0307473_115138801 | 3300031820 | Hardwood Forest Soil | VVRSCVFAGVGGYYTSKQSGLAGVFRRAADDSDWKHALSEIEAF
Ga0318512_105746792 | 3300031846 | Soil | VVPSYVFAGVGGYYASKQSGLAGVFRRDAADGSDWKHA
Ga0306919_101485183 | 3300031879 | Soil | MARSCVLAGVGGYYASKQGGLAGVFRRSLEGSEWKHVLSEIEAFTV
Ga0306919_105075653 | 3300031879 | Soil | VTQSYVFAGVGGYYASKQGGLAGVFRCSTDSGDWKHALSEVEAF
Ga0306919_111597613 | 3300031879 | Soil | VAPSHVFAGVGGYYANKQDGLAGVFRCATDDTNWKHALSEIEVF
Ga0318544_101587402 | 3300031880 | Soil | VASSYVFAGVGGYYASKQGGLAGVFRCAIDGTDWK
Ga0318536_102777041 | 3300031893 | Soil | VALSYVFAGVGGYYASKQGGLAGVFRYATDGTDWKHALSEIETFT
Ga0318522_101780772 | 3300031894 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSEWK
Ga0318551_103029522 | 3300031896 | Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDNM
Ga0318551_107668201 | 3300031896 | Soil | VVSAHVFAGVGGYYASKQSGLAGVFRRATNGSDWK
Ga0306923_108698701 | 3300031910 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKLALSEIEAFTV
Ga0306923_111285013 | 3300031910 | Soil | VSPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHAL
Ga0306923_115178732 | 3300031910 | Soil | VSPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALS
Ga0306921_106672173 | 3300031912 | Soil | VVASCVFAGVGGYYASKQSGLAGVFRRAADDSDWKH
Ga0306921_108637311 | 3300031912 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSKIE
Ga0306921_116831272 | 3300031912 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSEIEAF
Ga0310912_105771491 | 3300031941 | Soil | VVPSYVFAGVGGYYASKQSGLAGVFRRDAADGSDWKHALSEIEAFTVF
Ga0310912_108629701 | 3300031941 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWKHALSEIEAF
Ga0310912_114027142 | 3300031941 | Soil | VARSYVFAGVGGYYASKQGGLAGVFRCAADGSDWK
Ga0310916_108820341 | 3300031942 | Soil | VAPSCVFAGVGGYYANKQGGFAGVFRRATDDTNWKHALSE
Ga0310916_108837251 | 3300031942 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCAAEGTDWKHALSEVETFTVFV
Ga0310913_110081611 | 3300031945 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDAADGSDWKHALSEIEAFTV
Ga0310910_109365652 | 3300031946 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHALSEI
Ga0310909_104391692 | 3300031947 | Soil | MVPSFVFAGVGGYYASKQSGLAGVFRRAADGSDWKHAL
Ga0306926_115695431 | 3300031954 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADDSDWKH
Ga0318507_103146901 | 3300032025 | Soil | VASSYVFAGVGGYYASKQGGLAGVFRCAIDGTDWKH
Ga0310911_101759981 | 3300032035 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDAADGSEWKH
Ga0318556_104894601 | 3300032043 | Soil | VVPSCVFAGVGGYYASKQTGLAGVFRRAADGSDWKHT
Ga0318558_100538081 | 3300032044 | Soil | VVSSCVLAGVGGYYASNQGGLAGVFRRGVDDADWK
Ga0318532_101064583 | 3300032051 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALSEVET
Ga0318532_101715991 | 3300032051 | Soil | VVPSCVFAGVGGYYASKQTGLAGVFRRAADGSDWKHTLSE
Ga0318533_102262082 | 3300032059 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDAADGSDWKHALSEIEAFT
Ga0318504_101807041 | 3300032063 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRTADGSDWR
Ga0306924_104328992 | 3300032076 | Soil | VASSYVFAGVGGYYAGKQDGLAGVFCRATDGTDWKHALSDIEVF
Ga0318525_102510751 | 3300032089 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWEHA
Ga0318525_104858371 | 3300032089 | Soil | MVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWK
Ga0318540_102076011 | 3300032094 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRAADGRDWKHALSEIEP
Ga0307471_1011366251 | 3300032180 | Hardwood Forest Soil | VTQSYVFAGVGGYYASKQGGLAGVFRCSTNSGDWKHA
Ga0306920_1006030293 | 3300032261 | Soil | VVPSCVFAGVGGYYASKQSGLAGVFRRDAADGSDW
Ga0306920_1028398153 | 3300032261 | Soil | VASSYVFAGVGGYYAIKQGGLAGVFRRATDGTDWKYALSDIE
Ga0306920_1031105881 | 3300032261 | Soil | VAPSYVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALSEV
Ga0310914_108516233 | 3300033289 | Soil | VVPSCVFAGIGGYYASKQSGLAGVFRRAADGSDWK
Ga0310914_110544471 | 3300033289 | Soil | VAPSDVFAGVGGYYASKQGGLAGVFRCAADGSDWKHALSEVETFTV
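
Rows of the table above can be summarized per habitat. The following is a minimal Python sketch that uses four rows copied from the table as inline tuples; the tuple layout and the helper names `habitat_shares` and `residue_length` are illustrative assumptions. A trailing `*` is treated as a stop-codon marker and stripped before measuring length, which is a common convention for translated sequences but is not stated by the source.

```python
from collections import Counter

# Four rows copied from the table: (protein ID, sample taxon ID, habitat, sequence).
rows = [
    ("Ga0066654_108022052", "3300005587", "Soil",
     "VAPSYVFAGIGGYYASKQGGLAGVFRCATSGTDVALSRPY*"),
    ("Ga0066903_1012919671", "3300005764", "Tropical Forest Soil",
     "VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHAL"),
    ("Ga0131853_108636703", "3300010162", "Termite Gut",
     "MAQSYIYAGVGGYYASKDQGDLAGVFRGALGEGWTHPLSEREAFTVCVHP"),
    ("Ga0182041_107877381", "3300016294", "Soil",
     "VVPSCVFAGVGGYYASKQSGLAGVFRRAADGSDWKHA"),
]


def habitat_shares(records):
    """Return {habitat: (count, percent of records)}, most frequent habitat first."""
    counts = Counter(habitat for _, _, habitat, _ in records)
    total = sum(counts.values())
    return {h: (n, 100 * n / total) for h, n in counts.most_common()}


def residue_length(sequence: str) -> int:
    """Length in residues, ignoring a trailing '*' (assumed stop-codon marker)."""
    return len(sequence.rstrip("*"))


for habitat, (n, pct) in habitat_shares(rows).items():
    print(f"{habitat}: {n} ({pct:.1f} %)")

mean_len = sum(residue_length(seq) for *_, seq in rows) / len(rows)
print(f"mean length over these rows: {mean_len:.1f} residues")
```

Run over all 109 rows instead of this sample, the same tally gives per-habitat shares that can be compared with the percentages reported under Most Common Ecosystem.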


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice