NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104833

Metagenome Family F104833

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104833
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 46 residues
Representative Sequence FPAFAHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI
Number of Associated Samples 92
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 19.00 %
% of genes from short scaffolds (< 2000 bps) 19.00 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score0.29

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (81.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(22.000 % of family members)
Environment Ontology (ENVO) Unclassified
(31.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.
1AL20A1W_10606211
2JGI10214J12806_106274884
3JGI24034J26672_101087771
4Ga0062594_1006885782
5Ga0066678_102130732
6Ga0066676_103570072
7Ga0070668_1019411392
8Ga0070688_1008958032
9Ga0070705_1014837331
10Ga0070662_1009440851
11Ga0070699_1001575171
12Ga0066697_104737951
13Ga0070695_1011949131
14Ga0066701_106680792
15Ga0066692_103304422
16Ga0070702_1013792371
17Ga0066652_1006304651
18Ga0066652_1012597712
19Ga0075364_108203091
20Ga0066665_109923441
21Ga0079221_105081681
22Ga0099830_111370842
23Ga0105242_112004071
24Ga0105061_10189621
25Ga0126309_111878812
26Ga0126312_106336542
27Ga0126310_103570982
28Ga0126311_114482142
29Ga0134070_104033401
30Ga0134084_103021771
31Ga0134063_100925151
32Ga0134062_106426082
33Ga0134126_103940851
34Ga0138514_1000959521
35Ga0137391_109621521
36Ga0120157_10280811
37Ga0137364_102460801
38Ga0137364_114540881
39Ga0137382_110619692
40Ga0137362_113463381
41Ga0137376_107125822
42Ga0137387_112491221
43Ga0137366_101200752
44Ga0137371_101460453
45Ga0137371_102996601
46Ga0137368_109166572
47Ga0137358_104969341
48Ga0137394_107464011
49Ga0137359_117767772
50Ga0164299_101725101
51Ga0164301_106918931
52Ga0164302_118436892
53Ga0134087_108229872
54Ga0157374_125150291
55Ga0120172_11183261
56Ga0120155_10211983
57Ga0120155_10734371
58Ga0120149_12011012
59Ga0134081_103278211
60Ga0134081_103294872
61Ga0157380_100554244
62Ga0120170_10280781
63Ga0137409_100415221
64Ga0134072_100996271
65Ga0134089_102662621
66Ga0134085_103212432
67Ga0132256_1011894252
68Ga0184605_104721031
69Ga0193704_10808781
70Ga0193730_10337252
71Ga0193732_10819012
72Ga0193696_10069674
73Ga0210382_104363312
74Ga0210382_105422331
75Ga0193699_103196261
76Ga0222622_1000147312
77Ga0207654_100157105
78Ga0207640_117237951
79Ga0207676_115478732
80Ga0209267_11175592
81Ga0209058_12565051
82Ga0209474_102839301
83Ga0209474_106958972
84Ga0209879_10648912
85Ga0209811_102706302
86Ga0209701_101397381
87Ga0307322_101504312
88Ga0307309_101250462
89Ga0307311_101680131
90Ga0307298_100026961
91Ga0307319_101400352
92Ga0307288_100286252
93Ga0307284_100855332
94Ga0307305_103237882
95Ga0307292_102615811
96Ga0307302_106048312
97Ga0307310_103490822
98Ga0307286_100048181
99Ga0307286_100183931
100Ga0307278_102290692
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 4.05%    β-sheet: 28.38%    Coil/Unstructured: 67.57%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

51015202530354045FPAFAHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRISequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.29
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
19.0%81.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Groundwater Sediment
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Serpentine Soil
Grasslands Soil
Surface Soil
Soil
Agricultural Soil
Permafrost
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Groundwater Sand
Soil
Corn Rhizosphere
Switchgrass Rhizosphere
Corn, Switchgrass And Miscanthus Rhizosphere
Populus Endosphere
Miscanthus Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
Arabidopsis Rhizosphere
22.0%17.0%4.0%10.0%7.0%13.0%4.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
AL20A1W_106062113300000880PermafrostTSTNGSPPVLFPTFLHAAGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI*
JGI10214J12806_1062748843300000891SoilPPLVFPVFAHAAGCGCLASLQVDFAVSVKGTSVGAYELTDTIFLRNSTRI*
JGI24034J26672_1010877713300002239Corn, Switchgrass And Miscanthus RhizosphereTKIADYLTSANAFPAFAHTAGCGCLASLRVDFTISLKGSATIDTYDLGDTIYLRNSTRI*
Ga0062594_10068857823300005093SoilANAFPTFAHTVGCSCLASLRVDFVVSNKGSTVGAYELTDTVFLRNSTRL*
Ga0066678_1021307323300005181SoilTPNVFPAFAHASGCGCLASLQLDFPVSLKGSTIDQYELTDTIYLRNSTRI*
Ga0066676_1035700723300005186SoilPAFAHTAGCGCLASLQLDFPVSLKGSTIGQYELTDTIYLRNSTRI*
Ga0070668_10194113923300005347Switchgrass RhizospherePTFAHTVGCSCLASLRVDFVVSNKGSTVGAYELTDTVFLRNSTRL*
Ga0070688_10089580323300005365Switchgrass RhizosphereFPAFAHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI*
Ga0070705_10148373313300005440Corn, Switchgrass And Miscanthus RhizosphereAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI*
Ga0070662_10094408513300005457Corn RhizosphereDYLTSANAFPTFAHTVGCSCLASLRVDFVVSNKGSTVGAYELTDTVFLRNSTRL*
Ga0070699_10015751713300005518Corn, Switchgrass And Miscanthus RhizosphereAFAHVTGCLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0066697_1047379513300005540SoilCLASLQLDFPISLKGSTTVDLYELTDTVYLRNSTRI*
Ga0070695_10119491313300005545Corn, Switchgrass And Miscanthus RhizosphereAHTVGCQCLASLQVDFAVSNNGSTRDAYELTDTVFLRNSTRL*
Ga0066701_1066807923300005552SoilFSHVSGCGCLASLSVDLPVSVKGSSAGAYRLQDTIFLRNSTRI*
Ga0066692_1033044223300005555SoilAFSHVSGCGCLASLSVDLPVSVKGSSAGAYRLQDTIFLRNSTRI*
Ga0070702_10137923713300005615Corn, Switchgrass And Miscanthus RhizosphereAHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI*
Ga0066652_10063046513300006046SoilPAYVHTVGCSCLASLQLDFNVSNRGSTKDAYELTDTVFLRNSTRL*
Ga0066652_10125977123300006046SoilLASLQLDFPVSLKGSTIGQYELTDTVYLRNSTRI*
Ga0075364_1082030913300006051Populus EndosphereGCLASLEIDFKVSLKGSNVDSYELTDNIFLRNSTRI*
Ga0066665_1099234413300006796SoilNVFPAFAHASGCGCLASLQLDFPVSLKGSTIDQYELTDTIYLRNSTRI*
Ga0079221_1050816813300006804Agricultural SoilGRQLAFAHTVGCQCLASLQVDFAVSNNGSTRDAYELTDTVFLRNSTRL*
Ga0099830_1113708423300009088Vadose Zone SoilAHVTGCLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0105242_1120040713300009176Miscanthus RhizosphereSPPTVFPAFAHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI*
Ga0105061_101896213300009807Groundwater SandDAFPTFLHETGCSCLASLRVDFPVSLMGSTVDAYELTDTIYLRNSTRI*
Ga0126309_1118788123300010039Serpentine SoilLASLQVDFVVSVKGTAVGAYELTDTIFLINSTRI*
Ga0126312_1063365423300010041Serpentine SoilLASLKVDFVVSLKGSAIGSYELTDTIFLRNSTRL*
Ga0126310_1035709823300010044Serpentine SoilVGCSCLASLRVDFVISNKGSTVDAYELTDTIFLRNSTRL*
Ga0126311_1144821423300010045Serpentine SoilDYVSNTSGNVFVAFLHASGCNCLASLEVDFKVSLKGSNIDAYELTDKIFLRNSTRI*
Ga0134070_1040334013300010301Grasslands SoilSGVKFADYLTTANVFPTFAHTAGCGCLASLQLDFPISLKGSTTVDLYELTDTVYLRNSTRI*
Ga0134084_1030217713300010322Grasslands SoilHAVGCSCLASLRVDFLVSNKRSSLDAYELTDTIFLRNSTRL*
Ga0134063_1009251513300010335Grasslands SoilATCGSSGIKYGDYLTTANVFPSFAHASGCGCLASLQLDFPVSLKGSTIDQYELTDTIYLRNSTRI*
Ga0134062_1064260823300010337Grasslands SoilVFPAFAHTAGCGCLASLQLDFPISLKGSTTVDLYELTDTVYLRNSTRI*
Ga0134126_1039408513300010396Terrestrial SoilTTCGSSGTKIADYLISANAFPAFAHAAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI*
Ga0138514_10009595213300011003SoilYLTNANAFPTFAHASGCSCLASLRVDFLISNKGSTVDAYELTDTIFLRNSTRI*
Ga0137391_1096215213300011270Vadose Zone SoilDYLTSGNVFPAFAHVTGCLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0120157_102808113300011994PermafrostVTGCLCLASLQVDFPVSVKGTSIGAYELKDTIFLRNSTRI*
Ga0137364_1024608013300012198Vadose Zone SoilFPQFLHAVGCTCLASLRVDFLVSNKGSSRDAYELTDTIFLRNSTRL*
Ga0137364_1145408813300012198Vadose Zone SoilDYLTTANVFPAFAHTAGCGCLASLQLDFPISLKGSTTVDLYELTDTVYLRNSTRI*
Ga0137382_1106196923300012200Vadose Zone SoilFAHVTGCLCLASQQADFPVSVQGTSVGAYELQDTIFLRNSTRI*
Ga0137362_1134633813300012205Vadose Zone SoilGNAFPAFAHVTGCLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0137376_1071258223300012208Vadose Zone SoilVTGCLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0137387_1124912213300012349Vadose Zone SoilKYLTTGTPFTAFTHVVGCGCLASLSVNLPVSVKGSSVGTYRLQDTIFLRNSTRI*
Ga0137366_1012007523300012354Vadose Zone SoilVTGCLCLASLQVDFNVSVKGTSVGAYDLNDTIFLRNSTRI*
Ga0137371_1014604533300012356Vadose Zone SoilSLRVDFVVSNRGSTVGVQTYELTDTVFLRNSTRI*
Ga0137371_1029966013300012356Vadose Zone SoilNGSAPVVFPAFAHDVGCGCLASLQVDFAVSIKGTAVGAYELTDTIFLRNSTRI*
Ga0137368_1091665723300012358Vadose Zone SoilACAHDTGCNCLASLRVDFAVSVKGNAVGAYELTDTIYLRNSARV*
Ga0137358_1049693413300012582Vadose Zone SoilPAFAHTTGCLCLASLQVDFNVSVKGTSVGSYDLTDTIFLRNSTRI*
Ga0137394_1074640113300012922Vadose Zone SoilLTTAGAFPAFAHTSGCGCLASLQVDFKVSVKGSATIDSYDLADTIYLRNSTRI*
Ga0137359_1177677723300012923Vadose Zone SoilCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI*
Ga0164299_1017251013300012958SoilAFAHTAGCGCLASLQVDFTISLKSSATIDTYDLGDTIYLRNSTRI*
Ga0164301_1069189313300012960SoilHTTGCGCLASLQVDFAVSVKGTAVGAYELTVTIFLRNSTRI*
Ga0164302_1184368923300012961SoilMIADYLSNTTGNVFVAFLHTSGCGCLASLEIDFKVSLKGSNVDSYELTDNIFLRNSTRI*
Ga0134087_1082298723300012977Grasslands SoilCGCIASLSVNLPVSVKGSTVGAYRLQDTIFLRNSTRI*
Ga0157374_1251502913300013296Miscanthus RhizosphereHTAGCGCLASLRVDFTISLKGSATIDTYDLGDTIYLRNSTRI*
Ga0120172_111832613300013765PermafrostFLHDAGCGCLASLQVDFAVSIKGTAIGAYELTDTIFLRNSTRI*
Ga0120155_102119833300013768PermafrostGCGCLASLQVDFAVSIKGTAIGAYELTDTIFLRNSTRI*
Ga0120155_107343713300013768PermafrostTNGSPPVLFPTFLHAAGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI*
Ga0120149_120110123300014058PermafrostQASLQVDFAVSIKGTAVGAYELTDTIFLRNSTRI*
Ga0134081_1032782113300014150Grasslands SoilLHAVGCTCLASLRVDFLVSNKGSSRDAYELTDTIFLRNSTRL*
Ga0134081_1032948723300014150Grasslands SoilDYLTTTNVFPAFAHTAGCGCLASLQLDFPISLKGSTTVDLYELTDTVYLRNSTRI*
Ga0157380_1005542443300014326Switchgrass RhizosphereSSGTKIADYLTSANAFPAFAHTAGCGCLASLRVDFTISLKGSATIDTYDLGDTIYLRNSTRI*
Ga0120170_102807813300014823PermafrostFPAFAHVTGCLCLASLQVDFPVSVKGTSIGAYELKDTIFLRNSSRI*
Ga0137409_1004152213300015245Vadose Zone SoilHTTGCLCLASLQVDFNVSVKGTSVGSYDLTDTIFLRNSTRI*
Ga0134072_1009962713300015357Grasslands SoilDYLTNADAFPTFAHASGCSCLASLRVDFLISNKGSTVDAYELTDTIFLRNSTRI*
Ga0134089_1026626213300015358Grasslands SoilLASLQVDFPVSVKGTSVGAYELTDTIFLRNSTRI*
Ga0134085_1032124323300015359Grasslands SoilVTGCLCLASLQVDFPVSVKGTSVGAYELNDTIFLRNSTRI*
Ga0132256_10118942523300015372Arabidopsis RhizosphereASLASLSVDFEVSIKGSSSVNNYGLADTIFLRNSTRA*
Ga0184605_1047210313300018027Groundwater SedimentCLASLQVDFPVSVKGTSVGAYELKDTIFLRNSTRI
Ga0193704_108087813300019867SoilCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0193730_103372523300020002SoilFAHTAGCGCLASLQVDFTISLKGSATIDTYDLGDTIYLRNSTRI
Ga0193732_108190123300020012SoilPAFAHAAGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI
Ga0193696_100696743300020016SoilTKIADYLISANAFPAFAHTAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0210382_1043633123300021080Groundwater SedimentHDVGCGCLASLQVDFAVSVKGSAVGAYELTDTIFLRNSTRI
Ga0210382_1054223313300021080Groundwater SedimentSTSGSPPIVFPAFAHAAGCGCLASLQVDFAVSVKGSAVGAYELTDTIFLRNSTRI
Ga0193699_1031962613300021363SoilNAFPAFAHTAGCGCLASLKVDFTISLKSSATIGTYDLGDTIYLRNSTRI
Ga0222622_10001473123300022756Groundwater SedimentTPPAASVASLRVDFTVSVKSSTTVDTYDLGDTIYLRNSTRI
Ga0207654_1001571053300025911Corn RhizosphereIADYLTSANAFPAFAHTAGCGCLASLRVDFTISLKGSATIDTYDLGDTIYLRNSTRI
Ga0207640_1172379513300025981Corn RhizospherePAFAHTVGCTCLASLRVDFVVSNKGSTVGAYELTDTVFLRNSTRL
Ga0207676_1154787323300026095Switchgrass RhizosphereHTTGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI
Ga0209267_111755923300026331SoilYLTSGDVFPGFAHTTGCLCLASLQVDFNVSVKGTSAGAYDLTDTIFLRNSTRI
Ga0209058_125650513300026536SoilAGCGCLASLQLDFPVSLKGSTIGQYELTDTIYLRNSTRI
Ga0209474_1028393013300026550SoilSANAFPAFTHTSGCSCLASLRVDFLISNKGSTLDAYELTDTIFLRNSTRI
Ga0209474_1069589723300026550SoilLTTPNVFPAFAHASGCGCLASLQLDFPVSLKGSTIDQYELTDTIYLRNSTRI
Ga0209879_106489123300027056Groundwater SandLTSGNAFPAFAHTSGCGCLASLGVDFIVSVKGSSTVETYELTDTIYLRNSTRI
Ga0209811_1027063023300027821Surface SoilAHTAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0209701_1013973813300027862Vadose Zone SoilLCLASLQVDFNVSVTGTSVGAYDLTDTIFLRNSTRI
Ga0307322_1015043123300028710SoilAFAHTAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0307309_1012504623300028714SoilHAAGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI
Ga0307311_1016801313300028716SoilSANAFPAFAHTAGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0307298_1000269613300028717SoilAFPAFAHTSGCLCLASLRVDFTVSVKSSTTVDTYDLGDTIYLRNSTRI
Ga0307319_1014003523300028722SoilGCGCLASLRVDFTISLKGSATIGTYDLGDTIYLRNSTRI
Ga0307288_1002862523300028778SoilFPAFAHTSGCLCLASLRVDFTVSVKSSTTVDTYDLGDTIYLRNSTRI
Ga0307284_1008553323300028799SoilPLVFPVFAHAAGCGCLASLQVDFAVSVKGTAVGAYELTDTIFLRNSTRI
Ga0307305_1032378823300028807SoilVTGCLCLASLQVDFPVSVKGTSVGAYDLTDTIFLRNSTRI
Ga0307292_1026158113300028811SoilCLASLRVDFVVSNKGSTVGVDTYELTDTVFLRNSTRV
Ga0307302_1060483123300028814SoilDYLTTASAFPAFAHTSGCLCLASLQVDFTVSVKSSTTVDTYDLGDTIYLRNSTRI
Ga0307310_1034908223300028824SoilTMIADYLTTASAFPTFTHTTGSLAALQVDFIVSVKSSATVDSYDLGDTIYLRNSLRA
Ga0307286_1000481813300028876SoilGCGCLASLRVDFTISLKSSATIGTYDLGDTIYLRNSTRI
Ga0307286_1001839313300028876SoilPAFAHTSGCLCLASLRVDFTVSVKSSTTVDTYDLGDTIYLRNSTRI
Ga0307278_1022906923300028878SoilLTSTNGSPPLVFPAFAHDVGCGCLASLKVDFAVSIKGSAVGAYELTDTIFLRNSTRI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.