NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099720


Overview

Basic Information
Family ID: F099720
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 103
Average Sequence Length: 44 residues
Representative Sequence: MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRRE
Number of Associated Samples: 84
Number of Associated Scaffolds: 103

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 93.20 %
% of genes from short scaffolds (< 2000 bps): 95.15 %
Associated GOLD sequencing projects: 81
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.29

Hidden Markov Model
[HMM sequence logo, rendered with Skylign on the original page.]

Most Common Taxonomy
Group: Bacteria (97.087 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil (23.301 % of family members)
Environment Ontology (ENVO): Unclassified (38.835 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (48.544 % of family members)
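The percentages in the Overview can be read back as member counts. The following is a minimal sketch, assuming every percentage is taken over the 103 sequences of the family; the function name `members_from_percent` is illustrative and not part of NMPFamsDB.

```python
# Sketch: convert the reported percentages back to member counts.
# Assumption: each percentage is computed over the family's 103 sequences.
TOTAL_SEQUENCES = 103


def members_from_percent(percent, total=TOTAL_SEQUENCES):
    """Return the whole number of members closest to `percent` % of `total`."""
    return round(percent * total / 100)


# Values copied from the Overview section of this record.
reported = {
    "Bacteria (most common taxonomy)": 97.087,
    "Tropical Forest Soil (GOLD)": 23.301,
    "Unclassified (ENVO)": 38.835,
    "Soil, non-saline (EMPO)": 48.544,
    "Genes near scaffold ends": 93.20,
    "Genes from short scaffolds": 95.15,
}

for label, percent in reported.items():
    count = members_from_percent(percent)
    recomputed = 100 * count / TOTAL_SEQUENCES
    print(f"{label}: {percent} % ~ {count} of {TOTAL_SEQUENCES} "
          f"(recomputed {recomputed:.3f} %)")
```

Under that assumption, 97.087 % corresponds to 100 of 103 members and 23.301 % to 24 of 103.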




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.
[Interactive alignment view (MSAViewer): 103 rows, one per family member, in the same order as the Family Sequences table, with a column ruler running to position 54. The protein IDs are listed under Family Sequences.]



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 12.50%, β-sheet: 0.00%, Coil/Unstructured: 87.50%
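A distribution of this kind is a per-residue tally. The following is a minimal sketch, assuming the prediction is available as a string with one letter per residue (H for α-helix, E for β-strand, C for coil). Both that encoding and the example string are hypothetical; the example is not the actual prediction for this family.

```python
from collections import Counter


def ss_distribution(ss):
    """Return the percentage of helix, strand and coil positions in `ss`.

    `ss` uses one letter per residue: H (helix), E (strand), C (coil).
    This encoding is an assumption made for the sketch.
    """
    if not ss:
        raise ValueError("empty secondary-structure string")
    counts = Counter(ss)
    n = len(ss)
    return {
        "alpha_helix": 100 * counts["H"] / n,
        "beta_sheet": 100 * counts["E"] / n,
        "coil": 100 * counts["C"] / n,
    }


# Hypothetical 40-position prediction with a single 5-residue helix.
example = "C" * 10 + "H" * 5 + "C" * 25
print(ss_distribution(example))
```

With 5 helical positions out of 40, the sketch gives 12.5 % helix, 0 % strand and 87.5 % coil, the same proportions as reported above.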
[Feature Viewer: tracks for Sequence, α-helices, β-strands, Coil and SS confidence score along the representative sequence MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRRE.]

Predicted 3D Structure

[Structure Viewer: 3D model displayed with PDBe Molstar.]
Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.29

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
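The two confidence measures above can be checked with a few lines. The following is a minimal sketch, assuming the pTM cut-off of 0.7 stated in the note and the four pLDDT bands listed for the viewer. The function names are illustrative, and how non-integer scores at a band edge are assigned is an assumption.

```python
# Upper bound and label of each pLDDT band, as listed for the viewer.
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def plddt_band(score):
    """Return the label of the pLDDT band that contains `score` (0 to 100)."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    for upper, label in PLDDT_BANDS:
        # Assumption: a score above one band's upper bound goes to the next band.
        if score <= upper:
            return label


def is_low_confidence(ptm, threshold=0.7):
    """Return True when the model's pTM-score is below the cut-off."""
    return ptm < threshold


print(is_low_confidence(0.29))  # pTM-score reported for this family
print(plddt_band(45))
```

For this family the pTM-score of 0.29 is below 0.7, so the model is flagged as low confidence.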



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

[Pie chart (ApexCharts) of family members per NCBI taxonomy group. Legend: All Organisms, Unclassified. Slice labels: 97.1%, 2.9%.]

Associated Scaffolds






Environmental Properties

Associated Habitat Types

[Pie chart (ApexCharts) of family members per habitat type. Legend entries: Freshwater Sediment; Groundwater Sediment; Natural And Restored Wetlands; Soil (listed six times); Vadose Zone Soil; Tropical Forest Soil (listed twice); Grasslands Soil; Switchgrass Rhizosphere; Forest Soil; Hardwood Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Groundwater Sand; Peat Soil; Populus Rhizosphere; Switchgrass, Maize And Mischanthus Litter. Slice labels shown: 19.4%, 23.3%, 5.8%, 2.9%, 4.9%, 8.7%, 2.9%, 2.9%, 12.6%.]



Associated Samples


Geographical Distribution
[Map of sample locations, rendered with OpenStreetMap on the original page.]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
2ZMR_00494890 | 2170459016 | Switchgrass, Maize And Mischanthus Litter | MFIETYYDDRATVPGSTSEPMIPYPELSGEELEVWRKYLPMRRQIRNLD
INPhiseqgaiiFebDRAFT_1005742132 | 3300000364 | Soil | MFIETYYDERAAVPGSTSEPMIPYPELLGEELAVWEEYL
KanNP_Total_noBrdU_T14TCDRAFT_10014674 | 3300000596 | Soil | MFIETYYDDRAAVPGSTSEPMIPYPELSGEKLTVWQEYLPIKRPIRNLY
KanNP_Total_noBrdU_T14TCDRAFT_10047741 | 3300000596 | Soil | MFIETYYDDRATVPGSTSEPMIPYPELSGEELEVWRKYLPMRRQIRN
AF_2010_repII_A1DRAFT_101273593 | 3300000597 | Forest Soil | MFIETYYDERATVPGSTNEPMIPYPELSGEELEVWRKYLPIRRPIR
AP72_2010_repI_A100DRAFT_10326123 | 3300000837 | Forest Soil | MFIETYYDERATVPGSTNEPMIPYPELSGEELEVWR
JGI1027J12803_1025818431 | 3300000955 | Soil | MFIETYYDERAPVPGSTIERMIPYPELSGEELTVWQEYLPIN
Ga0055471_100845693 | 3300003987 | Natural And Restored Wetlands | MFIETYYDDRAALPAGTNVPQIPYPELSGEALTVWRRYLPMRRQIRNLH
Ga0066398_101632401 | 3300004268 | Tropical Forest Soil | MFIETYYDDRAAVPASTGEPIIPFPELSGDELTVWRNYLP
Ga0066395_107303711 | 3300004633 | Tropical Forest Soil | MFIQTYYDDRAAVSGSASEPIIPHPELSGEELKVWRNYLPIR
Ga0066809_102402502 | 3300005168 | Soil | MFIETYYDERAAVPGSTSEPMIPYPELLGEELAVWQEYLP
Ga0065705_109577681 | 3300005294 | Switchgrass Rhizosphere | MFIETYYDERAAVPVTTSEPMIPYPELSGEELAVWQAYLPIRREIRKP
Ga0066388_1018472782 | 3300005332 | Tropical Forest Soil | MFIETYYDDRVAVSGSASEPIIPYPELSGEELKVWRNYLPIRREIRNLYTQRF
Ga0070708_1003235981 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MFIETYYDERVPVPGSTSEPLIPYPELSGEELAVWQEYLPIRSDM
Ga0066689_109007832 | 3300005447 | Soil | MFIETYYDERVPVPATPREPMMPYPELSGEELTVWRKYLP
Ga0066699_102998412 | 3300005561 | Soil | MFIETYYDETAAVPATTSEPTIPYPELSGEELTVWRKYLPIRRQVENLRS
Ga0066905_1020998502 | 3300005713 | Tropical Forest Soil | MFVEIYYDDRAPVPGSASEPIIPYLELSGEQLTVWRNYLPM
Ga0066903_1056291031 | 3300005764 | Tropical Forest Soil | MFIETYYDERATVPGSTNEPMIPYPELSGEELEVWRKYL
Ga0070716_1013566361 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MFIETYYDERAPVPGSTSEPLIPYPELSGEELAVWQ
Ga0070712_1006108692 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MFIETYYDERAAVPPTTSEPMIPYPELSGEELAVWQEYLP
Ga0075427_100764071 | 3300006194 | Populus Rhizosphere | MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRREIRNLY
Ga0074055_112313371 | 3300006573 | Soil | MFIETYYDERAAVPVITSEPMIPYPELSGEELAVWQEYLP
Ga0075430_1006978651 | 3300006846 | Populus Rhizosphere | MFIETYYDERAAVPASTSEPVIPYPELSGEELAVWQEYLPMRREIRK
Ga0075433_112267181 | 3300006852 | Populus Rhizosphere | MFIETYYDERAAVPGSTSEPVIPYPELSGEELAVWQEYLPMRREIRN
Ga0075433_119287362 | 3300006852 | Populus Rhizosphere | MFKETYYDDRAAVPGSSSERMIPYPELSGEELTVWQEYLPIKRPIRNL
Ga0075425_1008574351 | 3300006854 | Populus Rhizosphere | MFIETYYDERAPVPGSTSEPLIPYPELSGQELAVWQEYLPVRREMRNLYS
Ga0075424_1019933411 | 3300006904 | Populus Rhizosphere | MFIETYYDERAAVPATTSEPMIPYPELLGEELAVWEEYLPMRREIRNLY
Ga0075419_102882121 | 3300006969 | Populus Rhizosphere | MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRRE
Ga0075418_101089573 | 3300009100 | Populus Rhizosphere | MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRREIR
Ga0114129_106626583 | 3300009147 | Populus Rhizosphere | MFIETYYDDRATVPGSTSEPMIPYPELSGQELEVWRKYL
Ga0114129_134025291 | 3300009147 | Populus Rhizosphere | MFIETYYDERVPVPGSTSEPLIPYPELSGEELTEE*
Ga0105092_100728603 | 3300009157 | Freshwater Sediment | MFIETYYDDRAVPGSTSEPMIPYPELSGEKLTVWQEYLPIKRPIRNLSSPF
Ga0075423_109942862 | 3300009162 | Populus Rhizosphere | MFIETYYDERAPVPGSTSEPLIPYPELSGQELAVWQEYLPVRREMRNLY
Ga0126374_111939521 | 3300009792 | Tropical Forest Soil | MFIETYYDDRAPVPGSTSEPIIPYPELSGEQLTMWRNYLPMRREIRNLD
Ga0105089_10115401 | 3300009809 | Groundwater Sand | MFIETYYDDRAAVPGSTSEPMIPYPELSGEKLTVWQEYLPIKR
Ga0105062_10922231 | 3300009817 | Groundwater Sand | MFIETYYDDRATVPGSTSEPMIPYPELSGEELEVWRKYLPMRRQIRNLDSR
Ga0105085_10417533 | 3300009820 | Groundwater Sand | MFIETYYDDRAAVPGSTSEPVIPYPELSGEELTVWRKYLPMRRQIRKVS
Ga0126380_103784111 | 3300010043 | Tropical Forest Soil | MFIETYYDERAAVPASTSEPMIPYPELSGEELAVWQEYLPM
Ga0126380_116382761 | 3300010043 | Tropical Forest Soil | MFIETYHDDRATALGSTSEPMIPYPELSGEELEVWRKYLPLRRQIRN
Ga0126384_105976252 | 3300010046 | Tropical Forest Soil | MFIETYYDDRAAVSGSASEPIIPYPELSGEELKVWRNYLPIRREIRNLYT
Ga0126384_121244662 | 3300010046 | Tropical Forest Soil | MFIETYHDDRAAAPGSTSEPIIPFPELSGDELTVWRNYLP
Ga0126384_123147341 | 3300010046 | Tropical Forest Soil | MFIETYYDERATVPGSTNEPMIPYPELSGEELEVWRKYLPIRRP
Ga0126382_104785702 | 3300010047 | Tropical Forest Soil | MFIETYYDDRAPVPGSASEPIIPYPELSGEQLTVWRNYLPMRR
Ga0126382_113114491 | 3300010047 | Tropical Forest Soil | MFTETYHDNRAAVPSGSSEPMISILRIVGEALTVWQEYLPIRRPIRNLY
Ga0126382_113572582 | 3300010047 | Tropical Forest Soil | MFIETYYDERALVPCSMSEPVIPYPELSGEELTAWRDYLPIRRPVA
Ga0134063_106457401 | 3300010335 | Grasslands Soil | MFIETYYDDRAAAVSGSTREPTIPYPELSGEELTV*
Ga0126370_110663792 | 3300010358 | Tropical Forest Soil | MFIETYYDYRAPVPGGTSEPIIPYPELSGEQLTVWRNYLPMRREIRNLY
Ga0126376_108434541 | 3300010359 | Tropical Forest Soil | MFIETYHDERAAVPGSTSEPVIPYPELSGEELTVW
Ga0126378_107919502 | 3300010361 | Tropical Forest Soil | MFIETYHDDRAAVPASTSEPIIPYPELSGEELTMWRNYLPI
Ga0126379_125225541 | 3300010366 | Tropical Forest Soil | MFIETYHDDRATALGSTSEPMIPYPELSGEELEVWRKYLPLRRQIRNL
Ga0126381_1012078511 | 3300010376 | Tropical Forest Soil | MFIETYYDDRATVRGSTSEPMIPYPELSGEELEVWRKYLPIRRPIRNLDSR
Ga0126381_1028589271 | 3300010376 | Tropical Forest Soil | MFIETYYDERALVPCSMSEPVIPYPELSGEELTAWRDYLPIRRPVANFASWFN
Ga0126381_1037809812 | 3300010376 | Tropical Forest Soil | MFIETYYDERAAVLTSTSEPMIPYPELSGKELAVWQEYLPI
Ga0126383_122666191 | 3300010398 | Tropical Forest Soil | MFIETYHDDRATALGSTGEPIIPYPELSGEELEVWRKYLPLRR
Ga0137429_10482704 | 3300011437 | Soil | MFIETYYDDRATVPAGTSAPPIPYPELSGDELTVWQQYLPM
Ga0137421_11213373 | 3300012039 | Soil | MFIETYYDDRATVPAGTSAPPIPYPELSGDELTVW
Ga0137383_113173711 | 3300012199 | Vadose Zone Soil | MFIETYYDERAAVPATTSEPMIPYPELLGEELAVWQEYLPMRREIRKPFSR
Ga0137363_106164402 | 3300012202 | Vadose Zone Soil | MFIETYYDERAAVFASTSEPMIPYPELSGEELTVWQEYLPMRRQI
Ga0137362_106305531 | 3300012205 | Vadose Zone Soil | MFIETYYDDRAAVSGNTGEPVIPYPELSGEELTVWRNYL
Ga0137380_103599982 | 3300012206 | Vadose Zone Soil | MFIETYYDETAAVPATTSEPTIPYPELSGEELTVWRKYLPIRRQVENLRS*
Ga0137376_109474263 | 3300012208 | Vadose Zone Soil | MFIETYYDERAAVPATTSEPMIPYPELLGEELAVWQEYLPMRREIRK
Ga0137376_110328251 | 3300012208 | Vadose Zone Soil | MFIETYYDDRAAVSGSTSEPTIPYPELSGEELTVWRK
Ga0137370_109184372 | 3300012285 | Vadose Zone Soil | MFIETYYDDRAAVPGGTSEPIIPYPELSGEKLTVWQEY
Ga0137386_109779031 | 3300012351 | Vadose Zone Soil | MFIETYYDDRAAAVSDSTSEPMIPYPELSGEKLTVW
Ga0137367_106605941 | 3300012353 | Vadose Zone Soil | MFIETYYDERAALPASTSEPMIPYPELSGEELTVWQEY
Ga0137385_105234861 | 3300012359 | Vadose Zone Soil | MFIETYYDDRAAVPGSTSEPMIPYPELSGEKLTV*
Ga0137360_109460311 | 3300012361 | Vadose Zone Soil | MFIETYYDERAPVPGNTSEPMIPYPELSGEALTVWRKYLPMRREIENLDSR
Ga0137361_105924881 | 3300012362 | Vadose Zone Soil | MFIETYYDDRAAVPGSTSEPMIPYPELSGEKLAVWQEYLPIRRPI
Ga0137373_108853081 | 3300012532 | Vadose Zone Soil | MFIETYYDERAPVHGSTSEPVIPYPELSGEALEVWRKYLPMRRPIENF
Ga0137358_108087441 | 3300012582 | Vadose Zone Soil | MFVETYYDARVLVPVGGNEPLIPYPELRGEELTVWRKYLPKQTPLENLRF
Ga0137397_101575133 | 3300012685 | Vadose Zone Soil | MFIETYYDERAAVPGSTSEPMIPYPELSGEELTVWRE
Ga0137397_106448261 | 3300012685 | Vadose Zone Soil | MFIETYYDERAAVPVTTSEPMIPYPELSGEELAVWQE
Ga0137397_106463021 | 3300012685 | Vadose Zone Soil | MFKETYYDDRAAVPGSLSERMIPYPELSGEELTVARILAD*
Ga0137394_100740569 | 3300012922 | Vadose Zone Soil | MFIETYYDDRATLPSSTSGPMIPYPELSGGELEVWRKYLPIR
Ga0137407_102006962 | 3300012930 | Vadose Zone Soil | MFIETYYDERAAVPASTSEPMIPYPELSGEELTVWQEYLPIR
Ga0137407_122058591 | 3300012930 | Vadose Zone Soil | MFIETYHDDRATVPGSTSEPMIPYPELSGEELEVWRKYLP
Ga0126375_102617742 | 3300012948 | Tropical Forest Soil | MFIETYYDDRAPVLGSTSEPVIPYPELSGEELRVWQKYLPMRRGIR
Ga0126375_103593072 | 3300012948 | Tropical Forest Soil | MFIETYYDDRAAVSGSASEPIIPYPELSGEELKVWRNYLPIRREIRNLYTQ
Ga0126375_109840672 | 3300012948 | Tropical Forest Soil | MFIETYYDDRVAVSGSASEPIIPYPELSGEELKVWRNYLPIRREIRNLY
Ga0126375_118733161 | 3300012948 | Tropical Forest Soil | MFIETYHDDRATVPARTSEPIIPYPELSGEELTVWRNYL
Ga0126369_136925251 | 3300012971 | Tropical Forest Soil | MFIETYYDDRAPVPGSASEPIIPYPELSGEQLTVWRNYLPMRREI
Ga0134110_105521332 | 3300012975 | Grasslands Soil | MFIETYYDERVPVPAMPRELMMFYPELSGEELTVWRKYLP
Ga0182038_109691981 | 3300016445 | Soil | MFIETYYDQRAPIPGSTSERTIPYPELSGEELAAWQEYLPIKREMGNLHSL
Ga0210379_102976311 | 3300021081 | Groundwater Sediment | MFIETYYDDRAAVPASTSERMIPYPELSGEKLTVWQEYLPIKRPIRNLYSP
Ga0126371_109361492 | 3300021560 | Tropical Forest Soil | MFIETYYDERAPVPSSTSEPVIPYPELLGEELTVWRNYLPMRREIKNLYTQF
Ga0126371_113943812 | 3300021560 | Tropical Forest Soil | MFIETYYDERATVRGSTTEPMIPYPELSGEELEVWRKYLPIRRPI
Ga0209690_12093232 | 3300026524 | Soil | MFIETYYDDRAAVPGSTSEPMIPYPELSGEKLTVWQE
Ga0209157_12634631 | 3300026537 | Soil | MFIETYYDDRAVATSTSEPVIPYPELSGEELTVWQKYLPIR
Ga0209684_10593601 | 3300027527 | Tropical Forest Soil | MFIETYYDERAAVLMSASEPMIPYPELSGKELAVWREYLPIRREMR
Ga0209466_10273211 | 3300027646 | Tropical Forest Soil | MFIETYYDDRVAVSGSASEPIIPYPELSGEELKVWRNYLPIRREIRNL
Ga0209466_10591581 | 3300027646 | Tropical Forest Soil | MFIETYYDDRAAVLGSASEPIIPYPELSGEELKVWRNYLPIRREIRNL
Ga0209465_105228371 | 3300027874 | Tropical Forest Soil | MFIETYYDDRAPVPGSASEPIIPYPELSGEQLTVWRNYLPMRREIKNLC
Ga0207428_108502771 | 3300027907 | Populus Rhizosphere | MFIETYYDERAAVPGSTSEPVIPYPELFGEELTVWREYLPM
Ga0209382_100994011 | 3300027909 | Populus Rhizosphere | MFKETYYDDRAAIPSSLSERMIPYPELSGEALTVWREYL
Ga0222749_107743091 | 3300029636 | Soil | MFIETYYDDRAALPGSTSEPEIPYPELSGEALTVWQKY
Ga0170818_1019298491 | 3300031474 | Forest Soil | MFIETYYDERTAVPASTNEPMIPYPELSGEELTVKKKIDKSR
Ga0307469_101798334 | 3300031720 | Hardwood Forest Soil | MFIETYYDDRAAVPGSTSETMILYPELSGEKLAVRQEYLAIRRDR
Ga0307473_112775611 | 3300031820 | Hardwood Forest Soil | MFIETYYDERAAVPGSTSEPMIPYPELLGEELAVWQEYLPIRREIRKP
Ga0307470_109570461 | 3300032174 | Hardwood Forest Soil | MFIETYYDERAPVPGSKSEPFIPYPELSGEELAVWQEYLPVRREMKNLYSP
Ga0307471_1012478361 | 3300032180 | Hardwood Forest Soil | MFIETYHDDRATVPGSTSEPMIPYPELSGEELQVWRK
Ga0307472_1006503492 | 3300032205 | Hardwood Forest Soil | MFIETYYDERAPVPGSTSEPLIPYPELFGEELAVWQE
Ga0310914_113863681 | 3300033289 | Soil | MFIETYYDERAPVPGSTSERMIPYPELSGEELTVWRKYLP
Ga0326726_106569272 | 3300033433 | Peat Soil | MFIETYYDDRATVGGASEPMIPYPELSGEELEVWRKYLPMRREIRNLDSRL
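Member sequences can be compared with the representative sequence position by position. The following is a minimal sketch, assuming that members are aligned at their first residue (so insertions and deletions are not handled) and that a trailing "*" marks a stop codon. The function name `prefix_mismatches` is illustrative.

```python
# Representative sequence from the Basic Information section.
REPRESENTATIVE = "MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRRE"


def prefix_mismatches(member, reference=REPRESENTATIVE):
    """Count mismatches over the positions shared by `member` and `reference`.

    Returns (mismatches, overlap_length). A trailing '*' is dropped first,
    on the assumption that it marks a stop codon. No gaps are modelled,
    so an insertion or deletion shifts every later position.
    """
    seq = member.rstrip("*")
    overlap = min(len(seq), len(reference))
    # zip stops at the shorter sequence, i.e. at the shared prefix.
    mismatches = sum(1 for a, b in zip(seq, reference) if a != b)
    return mismatches, overlap


# Two members copied from the Family Sequences table.
members = {
    "Ga0075419_102882121": "MFIETYYDERAAVPGSTSEPMIPYPELSGEELAVWQEYLPMRRE",
    "INPhiseqgaiiFebDRAFT_1005742132": "MFIETYYDERAAVPGSTSEPMIPYPELLGEELAVWEEYL",
}

for protein_id, sequence in members.items():
    mismatches, overlap = prefix_mismatches(sequence)
    print(f"{protein_id}: {mismatches} mismatches over {overlap} positions")
```

The first member is identical to the representative over all 44 positions; the second differs at 2 of its 39 positions. For members with an insertion or deletion, a proper pairwise alignment would be needed instead of this positional comparison.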




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice