NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099048



Overview

Basic Information
Family ID: F099048
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 103
Average Sequence Length: 43 residues
Representative Sequence: MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Number of Associated Samples: 90
Number of Associated Scaffolds: 103

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 77.67 %
% of genes near scaffold ends (potentially truncated): 33.01 %
% of genes from short scaffolds (< 2000 bps): 88.35 %
Associated GOLD sequencing projects: 80
AlphaFold2 3D model prediction: No
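
The Quality Assessment percentages can be converted back into approximate gene counts. The following is a minimal sketch, assuming each percentage is a fraction of the family's 103 sequences (the page does not state the denominator explicitly). The helper name `count_from_percent` is hypothetical and not part of NMPFamsDB.

```python
def count_from_percent(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` % of `total`."""
    return round(percent / 100 * total)


# "Number of Sequences" from Basic Information.
# Assumption: this is the denominator behind each percentage.
FAMILY_SIZE = 103

# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 77.67,
    "near scaffold end": 33.01,
    "short scaffold (< 2000 bps)": 88.35,
}

counts = {label: count_from_percent(p, FAMILY_SIZE) for label, p in quality.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Rounding to the nearest integer is used because the published values carry only two decimal places.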

Hidden Markov Model

Most Common Taxonomy
Group: Bacteria (93.204 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (34.952 % of family members)
Environment Ontology (ENVO): Unclassified (46.602 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (43.689 % of family members)




Multiple Sequence Alignments

Full Alignment: alignment of all 103 sequences in the family.



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 72.09%, β-sheet: 0.00%, Coil/Unstructured: 27.91%



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Visualization (pie chart; labels: All Organisms, Unclassified; values: 93.2%, 6.8%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Visualization (pie chart of habitat types)
Habitat labels: Groundwater Sediment; Soil; Vadose Zone Soil; Tropical Forest Soil; Grasslands Soil; Switchgrass Rhizosphere; Corn, Switchgrass And Miscanthus Rhizosphere; Forest Soil; Hardwood Forest Soil; Tabebuia Heterophylla Rhizosphere; Switchgrass, Maize And Mischanthus Litter
Labeled slice values: 8.7%, 35.0%, 2.9%, 25.2%, 7.8%, 3.9%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPICI_03177890 | 2088090015 | Soil | MISIAIACALIIYLFGYLDSPNYRGIAWAAKMICHRLTNNQHN
2PV_00526630 | 2170459014 | Switchgrass, Maize And Mischanthus Litter | MISIAIACVLIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
F24TB_113509602 | 3300000550 | Soil | MISISIAYALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
JGI11643J11755_117297792 | 3300000787 | Soil | MISIAXACXLIIXLFGYLDSPNYRGIAWAAKMICHRL
JGI11643J12802_100662842 | 3300000890 | Soil | MISIAIACALIIYLFGYLDSPNYRGIAWAAKMICHRLTNNQHN*
JGI1027J12803_1058414831 | 3300000955 | Soil | ISIACALIIYLFGYLDSVDGRGIASAAKLMCRKLASSKRS*
JGI25617J43924_101850371 | 3300002914 | Grasslands Soil | AIACALIIYLFGYLDSPNYHGIVWAAKMICHRLTNNQHN*
JGI25617J43924_102877441 | 3300002914 | Grasslands Soil | MISISIAWALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0063454_1012659712 | 3300004081 | Soil | PTMISISIACALTIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0062593_1014533921 | 3300004114 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCQKLAGIKRS*
Ga0062590_1009944752 | 3300004157 | Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0066672_100506431 | 3300005167 | Soil | MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0066672_100637214 | 3300005167 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS*
Ga0066688_101512622 | 3300005178 | Soil | MISISIACALIIYLFGYLDSAGGRGIASAAKLMSRKLASSKRS*
Ga0066684_101847423 | 3300005179 | Soil | MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS*
Ga0066684_105602261 | 3300005179 | Soil | MISISIACALIIYLFGYLDSAGRGIASVAKLMCRKLASSKRS*
Ga0065704_103612412 | 3300005289 | Switchgrass Rhizosphere | MISIAIACALIIYLFGYLDSPNYSGIAWAAKMICHRLTNNQHN*
Ga0065707_100261723 | 3300005295 | Switchgrass Rhizosphere | MISIAIACALIIXLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0070713_1004089002 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MISIAIACALIIYLFGYLDSPNYQAMAWAAKMICHRLTNNQHN*
Ga0066686_103813762 | 3300005446 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLAGSKRS*
Ga0070697_1008539581 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MISIAIACALIIYLFGYRDSPNYHGIACAAKMICHRLTNNQHN*
Ga0066701_100791623 | 3300005552 | Soil | MISISIACALIIYLFGYLDSAGGRGIASAAKLMCRKLASSKRS*
Ga0066661_100795591 | 3300005554 | Soil | IACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0066707_103390262 | 3300005556 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0066708_101837502 | 3300005576 | Soil | MISISIAYALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS*
Ga0066706_107055334 | 3300005598 | Soil | MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLASGK
Ga0081540_10864182 | 3300005983 | Tabebuia Heterophylla Rhizosphere | MISISVACALIVYLFGYLDSPNYRGIAWAARMICHRLTNQR*
Ga0066653_104878832 | 3300006791 | Soil | MISISIACALIIYLFGYPDSADGRGIASAAKLMCRKLASGKRS*
Ga0066658_104506672 | 3300006794 | Soil | MISISIACALIIYLFGYLDSAGRGIASVAQLMCRKLASSKRS*
Ga0099793_102811062 | 3300007258 | Vadose Zone Soil | MISISIACALIIYLFGYPDSADGRGIASAAKLMCRKLASSKRS*
Ga0099794_101462532 | 3300007265 | Vadose Zone Soil | MISISIACALTIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0099795_100018432 | 3300007788 | Vadose Zone Soil | MISIAIACGLIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0099829_113778951 | 3300009038 | Vadose Zone Soil | PGGPTMISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0099827_101825133 | 3300009090 | Vadose Zone Soil | MISISIACALVVYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0066709_1014484734 | 3300009137 | Grasslands Soil | IACALIIYLLGYLDSADSRGIASAAKLMCRKLASGKRS*
Ga0126382_102435412 | 3300010047 | Tropical Forest Soil | MISISIACALIIYLFGYLDSPNYHGIAWAAKMVCHRLTKQRN*
Ga0134080_104672182 | 3300010333 | Grasslands Soil | MISISIACALTIYLFGYLDSADGRGIASAAKLMCRKLASGKRS*
Ga0137364_100306066 | 3300012198 | Vadose Zone Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLARSKRS*
Ga0137382_111367291 | 3300012200 | Vadose Zone Soil | MISIAIACALVIYLFGYLDSPNYHGIAWAAKMIRHRLTNNQHN*
Ga0137382_112676711 | 3300012200 | Vadose Zone Soil | MISISIACALIIYLFGYLDSAGRGIASAAKLMCRKLASSKRT*
Ga0137376_102211621 | 3300012208 | Vadose Zone Soil | SIACALIIYLFGYLDSADGRGIASAAKLMCRKLARSKRS*
Ga0137376_117738751 | 3300012208 | Vadose Zone Soil | ISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0137372_102246043 | 3300012350 | Vadose Zone Soil | MISISIACALIIYLFGYLDSADGRGTASAAKLMCRKLASGKRS*
Ga0137366_107354611 | 3300012354 | Vadose Zone Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLICRKLASGKRS*
Ga0137384_105823171 | 3300012357 | Vadose Zone Soil | MISIAIACALVIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN*
Ga0137360_103741903 | 3300012361 | Vadose Zone Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRN*
Ga0137361_103661814 | 3300012362 | Vadose Zone Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRK
Ga0137361_115901522 | 3300012362 | Vadose Zone Soil | MISISIACALVVYLFGYLDSADGRGIASAAKLICRKLASSKRS*
Ga0137358_101860411 | 3300012582 | Vadose Zone Soil | ISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0137358_105571172 | 3300012582 | Vadose Zone Soil | MIMISIAIACALIIYLFGYLDSPNYHGIAWSAKMICHRLTNNQHN*
Ga0137358_107817972 | 3300012582 | Vadose Zone Soil | MISISIAYALIIYLFGYLDSADGRGIASAAKLMCRKLA
Ga0137398_103899721 | 3300012683 | Vadose Zone Soil | IACALIIYLFGYLDSADGRGIASAAKLMCRKLAGSKRS*
Ga0137397_105100383 | 3300012685 | Vadose Zone Soil | MTSISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSRR
Ga0137396_100607803 | 3300012918 | Vadose Zone Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMIRHRLTNNQHN*
Ga0137413_101483392 | 3300012924 | Vadose Zone Soil | MTSISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0137413_113255772 | 3300012924 | Vadose Zone Soil | MISIAIACALIIYLFGYLDSPNYHGIAWSAKMIRHRLTNNQHN*
Ga0137404_113208381 | 3300012929 | Vadose Zone Soil | RKESTMISIAIVCALIIYLFGYLNSPNYHGIAWAAKMICHRLTNNQHN*
Ga0137404_115161792 | 3300012929 | Vadose Zone Soil | MISISIACALVIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0137407_107793352 | 3300012930 | Vadose Zone Soil | MIMISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNN
Ga0137407_118719711 | 3300012930 | Vadose Zone Soil | GPTMISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*
Ga0164309_115083132 | 3300012984 | Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLANNQHN*
Ga0134079_101136951 | 3300014166 | Grasslands Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKR
Ga0137405_12084543 | 3300015053 | Vadose Zone Soil | MISLSIACALIIYLFGYLDSADGRGIASAAKLMCRKLARSKRS*
Ga0137420_14242983 | 3300015054 | Vadose Zone Soil | MISIAIACALIIYLFGYLDSPNYRGIAWAAKMIRHRLTNNQHN*
Ga0137412_101011273 | 3300015242 | Vadose Zone Soil | MISIATACALIIYLFGYLDSPNYHGIAWAAKIICHRLTNNQHN*
Ga0134073_101816092 | 3300015356 | Grasslands Soil | ALIIYLFGYLDSADSRGIASAAKLMCRKLASGKRS*
Ga0163161_118831531 | 3300017792 | Switchgrass Rhizosphere | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0184618_103243222 | 3300018071 | Groundwater Sediment | LPNSEPGGPTMISISIACALIIYLFGYPDSADGRGIASAAKLMCRKLASSKRS
Ga0066655_101777784 | 3300018431 | Grasslands Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Ga0066667_103221513 | 3300018433 | Grasslands Soil | MISISIACALIIYLFGYLDSAGRGIASVAKLMCRKLASSKRS
Ga0066669_101252192 | 3300018482 | Grasslands Soil | MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS
Ga0193756_10338952 | 3300019866 | Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQYN
Ga0193707_10162554 | 3300019881 | Soil | MISISIACALIIYLFGYPDSADGRGIASAAKLMCRKLASSKRS
Ga0193713_10372932 | 3300019882 | Soil | MISISIAWALIIYLFGYPDSADGRGIASAAKLMCRKLASSKRS
Ga0193734_10619232 | 3300020015 | Soil | TVISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0193724_11265331 | 3300020062 | Soil | MISIAIACGLIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0179594_101281773 | 3300020170 | Vadose Zone Soil | MISISIACALVIYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Ga0179592_101005773 | 3300020199 | Vadose Zone Soil | MISISIACALTIYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Ga0179596_101236812 | 3300021086 | Vadose Zone Soil | MISISIACALVVYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Ga0193709_10882571 | 3300021411 | Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNN
Ga0193750_10839011 | 3300021413 | Soil | TMISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0193695_11093021 | 3300021418 | Soil | MISISIAYALIIYLFGYPDSADGRGIASAAKLMCRKLASSKRS
Ga0207701_110420072 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | MISISIAWALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS
Ga0209438_11379512 | 3300026285 | Grasslands Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMIRHRLTNNQHN
Ga0209055_10694892 | 3300026309 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS
Ga0209153_12028641 | 3300026312 | Soil | SEPAGPTMISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS
Ga0209687_13042462 | 3300026322 | Soil | LPNSEPGGQAMISISIACALIIYLFGYLDSAGRGIASVAKLMCRKLASSKRS
Ga0209801_11229263 | 3300026326 | Soil | TMISISIACALIIYLFGYLDSAGGRGIASAAKLMCRKLASSKRS
Ga0209801_11325843 | 3300026326 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLANSKRS
Ga0209267_10974282 | 3300026331 | Soil | MISISIACALIIYLFGYLDSAGGRGIASAAKLMSRKLASSKRS
Ga0209803_12643432 | 3300026332 | Soil | MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLAGSKRS
Ga0209808_10423065 | 3300026523 | Soil | GRTMISISIAYALIIYLFGYLDSADGRGIASAAKLMCRKLASGKRS
Ga0209648_100173545 | 3300026551 | Grasslands Soil | MISIAIACALIIYLFGHLDSPNYHGIVWAAKMICHRLTNNQHN
Ga0209577_100762124 | 3300026552 | Soil | MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLAGSKRS
Ga0179587_102741011 | 3300026557 | Vadose Zone Soil | MISISIACALTIYLFGYLDSADGRGIASAAKLMCR
Ga0209590_105002201 | 3300027882 | Vadose Zone Soil | MISISIACALVVYLFGYLDSADGRGIASAAKLICRKL
Ga0222749_103464153 | 3300029636 | Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTN
Ga0170823_102920363 | 3300031128 | Forest Soil | MISIAIACALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQRN
Ga0170824_1001607252 | 3300031231 | Forest Soil | MISISIACALIIYLFGYLDSADGRGITSAAKLMCRKLAGIKRS
Ga0307475_100340494 | 3300031754 | Hardwood Forest Soil | MITLSIACTLIISLFGGYLDSGNERGIAWAAKKMCHRLTNNKHTRQ
Ga0307475_102415093 | 3300031754 | Hardwood Forest Soil | MISTAIAFALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0307479_109751642 | 3300031962 | Hardwood Forest Soil | MISIAIAIALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
Ga0307472_1014429352 | 3300032205 | Hardwood Forest Soil | ALIIYLFGYLDSPNYHGIAWAAKMICHRLTNNQHN
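
Family members can be compared against the representative sequence listed under Basic Information. The sketch below is illustrative only: it assumes a trailing `*` marks a translation stop (the usual convention in translated protein output, not confirmed by this page), and it compares residues position by position without alignment, which is only meaningful for members that start at the same residue as the representative. The helper names `strip_stop` and `mismatches` are hypothetical.

```python
REPRESENTATIVE = "MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS"

# (protein ID, sequence) pairs copied from three rows of the Family Sequences table.
members = [
    ("Ga0066707_103390262", "MISISIACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*"),
    ("Ga0066672_100506431", "MISISTACALIIYLFGYLDSADGRGIASAAKLMCRKLASSKRS*"),
    ("Ga0137361_103661814", "MISISIACALIIYLFGYLDSADGRGIASAAKLMCRK"),
]


def strip_stop(seq: str) -> str:
    """Drop a trailing '*' (assumed here to be a stop marker)."""
    return seq[:-1] if seq.endswith("*") else seq


def mismatches(a: str, b: str) -> int:
    """Count differing positions over the shared prefix of two unaligned sequences."""
    return sum(1 for x, y in zip(a, b) if x != y)


results = []
for protein_id, raw in members:
    seq = strip_stop(raw)
    results.append((protein_id, len(seq), mismatches(seq, REPRESENTATIVE)))

for protein_id, length, diff in results:
    print(f"{protein_id}: {length} residues, {diff} mismatch(es) vs representative")
```

Members with extra N-terminal residues (for example, sequences beginning `PTMISIS...`) would need a proper pairwise alignment before such a comparison is meaningful.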




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice