NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F099704


Overview

Basic Information
Family ID F099704
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 46 residues
Representative Sequence MPDQTNAWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Number of Associated Samples 60
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 32.04 %
% of genes near scaffold ends (potentially truncated) 33.01 %
% of genes from short scaffolds (< 2000 bps) 73.79 %
Associated GOLD sequencing projects 49
AlphaFold2 3D model prediction No
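
The quality percentages above are easier to compare as gene counts. The following is a minimal sketch, assuming each percentage is taken over the 103 family members reported under Basic Information (the page does not state the denominator); the helper name `percent_to_count` is hypothetical.

```python
# Family size as reported under Basic Information.
FAMILY_SIZE = 103

# Percentages copied from the Quality Assessment table.
reported = {
    "valid RBS motif": 32.04,
    "near scaffold end": 33.01,
    "short scaffold (< 2000 bp)": 73.79,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent / 100 * total)


# Assumption: every percentage uses FAMILY_SIZE as its denominator.
counts = {label: percent_to_count(p, FAMILY_SIZE) for label, p in reported.items()}

for label, n in counts.items():
    # Recompute the percentage from the count to check the round trip.
    print(f"{label}: {n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.2f} %)")
```

Under that assumption the counts round-trip to the reported two-decimal percentages, which suggests the denominator is indeed the family size.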


Most Common Taxonomy
Group Archaea (95.146 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (44.660 % of family members)
Environment Ontology (ENVO) Unclassified (49.515 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (52.427 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family (interactive viewer). The 103 aligned members are listed in the same order under Family Sequences.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 41.67%, β-sheet: 20.83%, Coil/Unstructured: 37.50%

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Taxonomic distribution chart (interactive). All Organisms: 95.1%; Unclassified: 4.9%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat distribution chart (interactive). Legend labels: Soil, Vadose Zone Soil, Grasslands Soil, Agricultural Soil, Corn, Switchgrass And Miscanthus Rhizosphere. Shares shown: 44.7%, 32.0%, 15.5%, 2.9%.



Associated Samples


Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_1000003014 | 3300002558 | Grasslands Soil | MPDQTNAWKIRVRKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
JGI25385J37094_100047442 | 3300002558 | Grasslands Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
JGI25385J37094_100189092 | 3300002558 | Grasslands Soil | MPDQTNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
JGI25384J37096_102462171 | 3300002561 | Grasslands Soil | MPEQPNTWRIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
JGI25382J37095_100946391 | 3300002562 | Grasslands Soil | MPDQMNTWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
JGI25382J37095_101801342 | 3300002562 | Grasslands Soil | MPEQMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
JGI25382J37095_101908221 | 3300002562 | Grasslands Soil | MPDQVNAWKIRVKKGNSEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
JGI25382J37095_102541851 | 3300002562 | Grasslands Soil | CAMPDQMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066672_101342944 | 3300005167 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKL
Ga0066672_104937692 | 3300005167 | Soil | MPEQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066672_105467331 | 3300005167 | Soil | NVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0066680_100290395 | 3300005174 | Soil | MNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0066680_101497762 | 3300005174 | Soil | MPDQTNSWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066680_102658263 | 3300005174 | Soil | QMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066680_104375311 | 3300005174 | Soil | MPDQMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066680_106052331 | 3300005174 | Soil | MPEQTNAWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0066676_106410372 | 3300005186 | Soil | MPDQTIPWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066686_101447352 | 3300005446 | Soil | MPEQTNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066686_107624831 | 3300005446 | Soil | AMRDQTNSWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066689_100012832 | 3300005447 | Soil | MNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0070699_1000378936 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDQMSTWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0070697_1001366821 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MRSMPDQMNTWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066697_100835613 | 3300005540 | Soil | RVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066701_105235181 | 3300005552 | Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEE
Ga0066661_100727433 | 3300005554 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0066692_103884482 | 3300005555 | Soil | AWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066700_100173291 | 3300005559 | Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEEL
Ga0066700_110140182 | 3300005559 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066700_110929672 | 3300005559 | Soil | MPDQMNAWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0066706_105046084 | 3300005598 | Soil | MPEQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLA
Ga0070716_1018484822 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDQTNAWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0079222_108831721 | 3300006755 | Agricultural Soil | MPDQMNTWKIRVRKGNYEVEVAGPSPETVQKIFEELVKKYMTKLASSR*
Ga0066659_101766421 | 3300006797 | Soil | MPDQTNTWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0099791_101289281 | 3300007255 | Vadose Zone Soil | MPEQTNSWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0099791_101835774 | 3300007255 | Vadose Zone Soil | MPEQTNVWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTELASSR*
Ga0099793_106421132 | 3300007258 | Vadose Zone Soil | MPDQANAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0099794_100285662 | 3300007265 | Vadose Zone Soil | MPEQMNVWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0099794_101440592 | 3300007265 | Vadose Zone Soil | MPEQTIPWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0099794_103744542 | 3300007265 | Vadose Zone Soil | MNVWKIRVKKGNSEVEVAGPSPEVVQKMFEELVKKYMTKLASSR*
Ga0099829_101823213 | 3300009038 | Vadose Zone Soil | MNVWKIRVKKGNSEVEVAGPSPEVVQKMFEDLVKKYMTKLASSR*
Ga0099829_102492482 | 3300009038 | Vadose Zone Soil | MNTWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0099830_105776951 | 3300009088 | Vadose Zone Soil | MPDQNTAWKIRVKKWNSEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0099830_106875001 | 3300009088 | Vadose Zone Soil | KGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0099828_103962501 | 3300009089 | Vadose Zone Soil | MPDQVNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTRLASSR*
Ga0099828_110362422 | 3300009089 | Vadose Zone Soil | MPEQTNVWKIRVKKGNSEVEVAGPSPEIVQKMFEELVKKYMTKLASSR*
Ga0099828_117224122 | 3300009089 | Vadose Zone Soil | KKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0066709_1013579602 | 3300009137 | Grasslands Soil | MPEQPNTWRIRVKKGNYEVEVAGPSHEIVQKMFDELVKKYMTKLASSR*
Ga0134088_105764681 | 3300010304 | Grasslands Soil | MRDQTNSWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137399_100345413 | 3300012203 | Vadose Zone Soil | MPEQTNAWKIRVKKGNSEVEVAGPSPEIVQRMFEDLVKKYMTKLASSR*
Ga0137362_115364372 | 3300012205 | Vadose Zone Soil | IRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137380_100488472 | 3300012206 | Vadose Zone Soil | MPDQTNTWKIKVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137380_100952541 | 3300012206 | Vadose Zone Soil | NSATAAMPDQTNSWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137380_103964083 | 3300012206 | Vadose Zone Soil | MPDQTNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASS
Ga0137380_106633362 | 3300012206 | Vadose Zone Soil | KIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137380_111956321 | 3300012206 | Vadose Zone Soil | MPDQTNSWKIRVKKGNYEVEVAGPSPEIVQKMFED
Ga0137387_101520291 | 3300012349 | Vadose Zone Soil | WKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137387_109510602 | 3300012349 | Vadose Zone Soil | AMPDQTNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137372_100675981 | 3300012350 | Vadose Zone Soil | WKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0137386_100575952 | 3300012351 | Vadose Zone Soil | MPDESNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137385_101449111 | 3300012359 | Vadose Zone Soil | MPDQTNTWKIRVKKGNSEVEVAGPSPEIVQKMFEALVKNYITKLASSR*
Ga0137385_101459662 | 3300012359 | Vadose Zone Soil | MPDQTNSWKIRVKKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137385_101468251 | 3300012359 | Vadose Zone Soil | MPEQTNTWKIKVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137385_108251421 | 3300012359 | Vadose Zone Soil | TWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137360_104589862 | 3300012361 | Vadose Zone Soil | MPDQTNPWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137395_103010223 | 3300012917 | Vadose Zone Soil | MPEQTNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137396_100466092 | 3300012918 | Vadose Zone Soil | MPEQTNVWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*
Ga0137396_100821042 | 3300012918 | Vadose Zone Soil | MPEQMNIWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0137396_102323603 | 3300012918 | Vadose Zone Soil | MPEQTNAWKIRVKKGNSEVEVAGPSPEIVQRMFEELVKKYMTKLASSR*
Ga0137396_103085972 | 3300012918 | Vadose Zone Soil | NVWKIRGKKGNSEVEVAGPSPQIVQKMFENLVKK*
Ga0137396_103415082 | 3300012918 | Vadose Zone Soil | MNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0137396_106044042 | 3300012918 | Vadose Zone Soil | CAMPEQTNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*
Ga0134083_102395762 | 3300017659 | Grasslands Soil | ELCNCAMPDQTIAWKITVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0066655_102999702 | 3300018431 | Grasslands Soil | MPEQTNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0066655_107612952 | 3300018431 | Grasslands Soil | MPEQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0066655_109322162 | 3300018431 | Grasslands Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0066662_100635422 | 3300018468 | Grasslands Soil | MPDQMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0215015_111075822 | 3300021046 | Soil | MPDQTNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0210404_100409511 | 3300021088 | Soil | MPDQTNAWKIRVRKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0137417_13665371 | 3300024330 | Vadose Zone Soil | MPEQTNVWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209234_11449701 | 3300026295 | Grasslands Soil | MPEQPNTWRIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209236_10828964 | 3300026298 | Grasslands Soil | MNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209236_11115064 | 3300026298 | Grasslands Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEDLVKKYMTKLASSR
Ga0209055_10566443 | 3300026309 | Soil | MPDQTIPWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209055_10678501 | 3300026309 | Soil | MNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR
Ga0209154_10060296 | 3300026317 | Soil | MNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMT
Ga0209154_10344771 | 3300026317 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR
Ga0209154_10712884 | 3300026317 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMT
Ga0209152_1000034222 | 3300026325 | Soil | MPDQMNVWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209802_10049488 | 3300026328 | Soil | MPEQMNAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209690_10690314 | 3300026524 | Soil | MPDQVNAWKIRVKKGNSEVEVAGPSPEIVQKMFEELVKKYMTKLASSR
Ga0209806_10020066 | 3300026529 | Soil | MPDQTNTWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209056_101628184 | 3300026538 | Soil | MPDQTIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLA
Ga0209156_104872892 | 3300026547 | Soil | VKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209588_10567492 | 3300027671 | Vadose Zone Soil | MPEQTIPWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209689_11365642 | 3300027748 | Soil | MPDQTNSWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209180_103322682 | 3300027846 | Vadose Zone Soil | SRAMPDQPIAWKIRVKKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209180_103893222 | 3300027846 | Vadose Zone Soil | MPDQPIAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209701_105001461 | 3300027862 | Vadose Zone Soil | GSANGAMPDQTNSWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
Ga0209283_102063592 | 3300027875 | Vadose Zone Soil | MNVWKIRVKKGNSEVEVAGPSPEVVQKMFEDLVKKYMTKLASSR
Ga0209283_106739821 | 3300027875 | Vadose Zone Soil | KKGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR
Ga0209590_106895982 | 3300027882 | Vadose Zone Soil | KIRVKKGNSEVEVAGPSPEIVQKMFEELVKKYMTKLASSR
Ga0137415_103095691 | 3300028536 | Vadose Zone Soil | MPEQTNAWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKLASSR
Ga0137415_107757591 | 3300028536 | Vadose Zone Soil | MPDQANAWKIRVKKGNSEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR
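
For downstream tools, the family members are often more convenient as FASTA. The following is a minimal sketch using three records copied from the list above. It assumes that a trailing `*` marks a stop codon and can be dropped, and that the sample taxon ID is the 10-digit number beginning with 3300. The FASTA header layout and the function names are hypothetical, not an NMPFamsDB export format.

```python
from statistics import mean

# (protein ID, sample taxon ID, habitat, sequence) copied from the family list.
records = [
    ("JGI25385J37094_1000003014", "3300002558", "Grasslands Soil",
     "MPDQTNAWKIRVRKGNYEVEVAGPSPEIVQKMFEDLVKKYMTKLASSR*"),
    ("Ga0066672_101342944", "3300005167", "Soil",
     "MPDQMNVWKIRVKKGNYEVEVAGPSPEIVQKMFEELVKKYMTKL"),
    ("Ga0099830_106875001", "3300009088", "Vadose Zone Soil",
     "KGNYEVEVAGPSHEIVQKMFEELVKKYMTKLASSR*"),
]


def strip_stop(sequence: str) -> str:
    """Drop a trailing '*' (assumed here to mark a stop codon)."""
    return sequence.rstrip("*")


def to_fasta(rows) -> str:
    """Render rows as FASTA text; the header layout is a hypothetical choice."""
    entries = []
    for protein_id, taxon_id, habitat, sequence in rows:
        header = f">{protein_id} sample={taxon_id} habitat={habitat}"
        entries.append(f"{header}\n{strip_stop(sequence)}")
    return "\n".join(entries) + "\n"


# Residue counts exclude the stop marker.
lengths = [len(strip_stop(seq)) for *_, seq in records]
fasta = to_fasta(records)

print(fasta, end="")
print(f"mean length of these {len(records)} records: {mean(lengths):.1f} residues")
```

Members without a leading M or a trailing `*` are likely fragments, consistent with the share of genes near scaffold ends reported in the Overview, so a length filter may be worth applying before alignment.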




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice