NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F082976

Metagenome / Metatranscriptome Family F082976

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F082976
Family Type Metagenome / Metatranscriptome
Number of Sequences 113
Average Sequence Length 39 residues
Representative Sequence LIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP
Number of Associated Samples 101
Number of Associated Scaffolds 113

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 5.31 %
% of genes near scaffold ends (potentially truncated) 89.38 %
% of genes from short scaffolds (< 2000 bps) 87.61 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score0.21

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (80.531 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(28.319 % of family members)
Environment Ontology (ENVO) Unclassified
(27.434 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(50.442 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.
1JGI12631J13338_10017351
2JGI12635J15846_100334236
3JGI20243J16304_1003761
4Ga0062385_105068181
5Ga0062384_1004370243
6Ga0062389_1009647523
7Ga0070713_1009650641
8Ga0066697_100420333
9Ga0070732_103076533
10Ga0066691_104095173
11Ga0066654_100375484
12Ga0070763_101201314
13Ga0070763_101251401
14Ga0070764_105050331
15Ga0070766_100834643
16Ga0075028_1007789512
17Ga0075019_110882042
18Ga0070765_1007380581
19Ga0070765_1013001952
20Ga0099791_100786122
21Ga0099791_100876412
22Ga0099829_117363142
23Ga0099830_110891181
24Ga0099828_105567333
25Ga0099827_105802631
26Ga0116222_14603492
27Ga0116105_12456191
28Ga0116223_106580972
29Ga0126373_108719642
30Ga0134071_102962532
31Ga0134062_105262011
32Ga0126378_122331682
33Ga0136847_119345021
34Ga0137389_114768061
35Ga0137399_103606204
36Ga0137379_102809094
37Ga0137378_115338852
38Ga0137387_113085701
39Ga0137360_109367252
40Ga0137361_118308292
41Ga0137398_102067083
42Ga0137359_113067402
43Ga0137416_115134832
44Ga0137407_107063321
45Ga0137410_100605615
46Ga0134078_100389811
47Ga0134078_101138073
48Ga0181522_108211191
49Ga0137409_100038711
50Ga0137403_106476321
51Ga0187802_102512031
52Ga0187782_110412542
53Ga0179592_101477703
54Ga0210395_102969521
55Ga0210405_100307335
56Ga0210393_103999742
57Ga0210385_100866563
58Ga0210397_101623071
59Ga0210389_100249615
60Ga0210387_107495421
61Ga0210383_108799713
62Ga0210394_105858033
63Ga0210391_110693802
64Ga0210398_107396501
65Ga0242669_10539731
66Ga0242660_12551311
67Ga0242662_102989502
68Ga0212128_107119392
69Ga0137417_13444529
70Ga0137417_14483527
71Ga0209431_103374912
72Ga0209640_104502383
73Ga0208691_10096924
74Ga0208691_10430013
75Ga0209235_12099712
76Ga0209265_12121152
77Ga0209471_11015363
78Ga0257168_11128772
79Ga0209378_11184403
80Ga0209648_100053942
81Ga0209648_108266312
82Ga0179593_11496792
83Ga0209735_11362522
84Ga0209076_10210014
85Ga0209117_10100751
86Ga0209388_10352921
87Ga0209388_11544762
88Ga0208981_10936052
89Ga0209248_101234981
90Ga0209139_100368574
91Ga0209773_102553932
92Ga0209180_105365862
93Ga0209517_106765572
94Ga0209166_103002963
95Ga0209701_102354573
96Ga0209701_102456873
97Ga0209283_100923614
98Ga0209283_108573201
99Ga0265319_12197771
100Ga0222748_11009311
101Ga0302275_102694303
102Ga0247727_108094671
103Ga0307477_105418512
104Ga0307475_113352551
105Ga0307478_116590792
106Ga0214473_117391271
107Ga0311301_125964122
108Ga0307471_1014020243
109Ga0307471_1018642442
110Ga0307471_1042086711
111Ga0348332_135180812
112Ga0335079_117721431
113Ga0335080_104687433
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 10.61%    β-sheet: 0.00%    Coil/Unstructured: 89.39%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

5101520253035LIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTPSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.21
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
80.5%19.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Bog Forest Soil
Bog
Peatland
Freshwater Sediment
Thermal Springs
Watersheds
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Soil
Tropical Peatland
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Bog
Plant Litter
Biofilm
Rhizosphere
5.3%28.3%3.5%3.5%5.3%14.2%5.3%5.3%5.3%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12631J13338_100173513300001131Forest SoilDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP*
JGI12635J15846_1003342363300001593Forest SoilEDGKAKRENIHVSFHDVPSTNYAEAGVLVVDQKRTP*
JGI20243J16304_10037613300001635Forest SoilGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0062385_1050681813300004080Bog Forest SoilERITLALIEDGRAKRENIQVTFVDLPATDYAEAGVTVADQKRTP*
Ga0062384_10043702433300004082Bog Forest SoilTLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0062389_10096475233300004092Bog Forest SoilSVLIEDGRAKRENIHVSFHDLPATNYAEAGVLVADQKRTP*
Ga0070713_10096506413300005436Corn, Switchgrass And Miscanthus RhizosphereERITQALIEDGRAKRESIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0066697_1004203333300005540SoilLIEDGKAKRENIHVSFHDVPAANYAEAGMLVADQKRTP*
Ga0070732_1030765333300005542Surface SoilQTLETEGRAKRENIHVAFLDVPKTDYAEGGVTVADQKRVP*
Ga0066691_1040951733300005586SoilEEGRAKRENIHVAFLDVPATNYAEAGVLVADQKRSP*
Ga0066654_1003754843300005587SoilLIEDGKAKRENIHVSFHDVPATHYAEAGVLVADQKRRP*
Ga0070763_1012013143300005610SoilAERITQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0070763_1012514013300005610SoilALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0070764_1050503313300005712SoilDTLIKEGNAKKENIQVTFVDLPPTDYAEAGVTVADQKRSK*
Ga0070766_1008346433300005921SoilGRAKRENIHVSFHDVPATNYAEAGVLVADQKRTP*
Ga0075028_10077895123300006050WatershedsMGAKRENIHVTFLDVAATNYAEAGVLVVDQKRTP*
Ga0075019_1108820423300006086WatershedsITTVLIEDGRAKRENIHVSFHDVPSTDYAEAGVLVVDQKRTP*
Ga0070765_10073805813300006176SoilKLAERVTLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0070765_10130019523300006176SoilRITLALIEDGRAKREHIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0099791_1007861223300007255Vadose Zone SoilVLIEDGKAKRENIHVSFHDVPATNYAEGGVLVVDQKRTP*
Ga0099791_1008764123300007255Vadose Zone SoilEDGKAKRENIHVSFHDLPSTNYAEAGVLVVDQKRTP*
Ga0099829_1173631423300009038Vadose Zone SoilDGRAKRENIHVAFQDVPATNYAEAGVLVADQKRTP*
Ga0099830_1108911813300009088Vadose Zone SoilVTRALEEEGRAKRENIHVAFLDVPATNYAEAGVLVADQKRSP*
Ga0099828_1055673333300009089Vadose Zone SoilVLIEDGKAKRENIHVSFHDLPSANYAEAGVLVVDQKRAP*
Ga0099827_1058026313300009090Vadose Zone SoilRITQALEEDGRAKRENIHVSFLDVPAANYAEAGVVVADQKRAP*
Ga0116222_146034923300009521Peatlands SoilDGKAKRENIHVSFHDVPSTNYAEAGVLVVDQKRTP*
Ga0116105_124561913300009624PeatlandRITLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0116223_1065809723300009839Peatlands SoilLIEEGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP*
Ga0126373_1087196423300010048Tropical Forest SoilVLIEDGKAKRENIHVSFHDVPATDYAEAGVLVADQKRVP*
Ga0134071_1029625323300010336Grasslands SoilVLIEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRAP*
Ga0134062_1052620113300010337Grasslands SoilEDGRAKRENIHVSFHDVPSTNYAEAGVLVADQKRTP*
Ga0126378_1223316823300010361Tropical Forest SoilLMEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRTP*
Ga0136847_1193450213300010391Freshwater SedimentEDGRAKLENIQVTFHDLPATDYAEAGVMVADQKRTP*
Ga0137389_1147680613300012096Vadose Zone SoilVTNVLIEEGKAKRENIHVSFHDLPSTNYAEAGLLVVDQKRSP*
Ga0137399_1036062043300012203Vadose Zone SoilTNVLIEDGRAKRENIHVSFHDVAPSNYAEAGVLVADQKRTP*
Ga0137379_1028090943300012209Vadose Zone SoilDGRAKRENIHVSFLAVPAANYAEAGVVVADQKRAP*
Ga0137378_1153388523300012210Vadose Zone SoilIEDGKAKRENIHVSFHDVASTNYAEAGVLVVDQKRTP*
Ga0137387_1130857013300012349Vadose Zone SoilGKAKRENIHVTFHDLPSTNYAEAGVLVADQKRTP*
Ga0137360_1093672523300012361Vadose Zone SoilTQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTA*
Ga0137361_1183082923300012362Vadose Zone SoilEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRAP*
Ga0137398_1020670833300012683Vadose Zone SoilTAALIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP*
Ga0137359_1130674023300012923Vadose Zone SoilIEDGKARRENIHVSFHDVPATNYAEAGVLVVDQKRTP*
Ga0137416_1151348323300012927Vadose Zone SoilLIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP*
Ga0137407_1070633213300012930Vadose Zone SoilILETEGRAKRENMHGAFLAVPATNYAEAGITVADQKRTP*
Ga0137410_1006056153300012944Vadose Zone SoilEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRTP*
Ga0134078_1003898113300014157Grasslands SoilIEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRTP*
Ga0134078_1011380733300014157Grasslands SoilVLIEDGQAKRENIHVSFHDVPATNYAEAGILVVDQKRTP*
Ga0181522_1082111913300014657BogEGRAKKENVHVTFVDLPPSDYAEAGVTVEDQRKKN*
Ga0137409_1000387113300015245Vadose Zone SoilTSVLIEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRIP*
Ga0137403_1064763213300015264Vadose Zone SoilVLIEDGKAKRENIHVSFHDVPSTNYAEAGVLVVDQKRTP*
Ga0187802_1025120313300017822Freshwater SedimentITQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRAP
Ga0187782_1104125423300017975Tropical PeatlandIEDGRAKRENIHVAFHDVAATNYAEAGVLVADQKRTP
Ga0179592_1014777033300020199Vadose Zone SoilVLIEDGRAKRENIHVSFHDVPPSNYAEAGILVADQKRTP
Ga0210395_1029695213300020582SoilALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210405_1003073353300021171SoilLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210393_1039997423300021401SoilIAAQITQTLETEGRAKRENIHVAFLDVPKTDYAEGGVTVADQKRVP
Ga0210385_1008665633300021402SoilAVVEDGRAKKENVHVTFVDLPPSDYAEAGVTVEDQRKKN
Ga0210397_1016230713300021403SoilRITQALFEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210389_1002496153300021404SoilDGRAQRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210387_1074954213300021405SoilERITLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210383_1087997133300021407SoilRITQVLIEDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP
Ga0210394_1058580333300021420SoilEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0210391_1106938023300021433SoilTQVLIEDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP
Ga0210398_1073965013300021477SoilEEGRAKKENVHVTFVDLPPSDYAEAGVTVEDQRKKN
Ga0242669_105397313300022528SoilQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0242660_125513113300022531SoilLIEDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKHTP
Ga0242662_1029895023300022533SoilVLVEDGKAKRENIHVSFHDVPPTNYAEAGLLVVDQKRTP
Ga0212128_1071193923300022563Thermal SpringsEDGRAKRENCHVAFVDVPPTDYAEAGVTVADQKRSP
Ga0137417_134445293300024330Vadose Zone SoilVLIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP
Ga0137417_144835273300024330Vadose Zone SoilVLIEDGKAKRENIHVSFHDVPSTNYAEAGVLVVDQKRTP
Ga0209431_1033749123300025313SoilEDGRAKRENVHVAFLDVPATDYAEAGVTVADQKRSP
Ga0209640_1045023833300025324SoilLVEDGRAQRGNIHVTFVDVPATDYAEAGVTVADQKRSP
Ga0208691_100969243300025612PeatlandVLIEDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP
Ga0208691_104300133300025612PeatlandIEDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP
Ga0209235_120997123300026296Grasslands SoilALEEDGRARRENIHVSFLDVPAANYAEAGVVVADQKRAP
Ga0209265_121211523300026308SoilLIEDGKAKRENIHVSFHDVPATHYAEAGVLVADQKRRP
Ga0209471_110153633300026318SoilKALEEEGRAKRENIHVAFLDVPATNYAEAGVLVADQKRSP
Ga0257168_111287723300026514SoilRSGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP
Ga0209378_111844033300026528SoilDGRAKQENIHVAFLDLPSVNYAEAGVLVADQKRTP
Ga0209648_1000539423300026551Grasslands SoilVLIEDGKAKRENIHVSFHDVPSTNYAEAGVLVADQKRTP
Ga0209648_1082663123300026551Grasslands SoilIEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP
Ga0179593_114967923300026555Vadose Zone SoilVLIEDGKAKREHIHSVVHDVPSTNYAEAGVLVADQKKHA
Ga0209735_113625223300027562Forest SoilALIEDGRAKRENVQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0209076_102100143300027643Vadose Zone SoilLEEEGRAKRENIHVAFLDVPATNYAEAGVLVADQKRSP
Ga0209117_101007513300027645Forest SoilVLVEDGKAKRENIHVSFHDLPSTNYAEAGVLVADQKRTP
Ga0209388_103529213300027655Vadose Zone SoilEDGKAKRENIHVSFHDLPSTNYAEAGVLVVDQKRTP
Ga0209388_115447623300027655Vadose Zone SoilVLIEDGKAKRENIHVSFHDVPATNYTEAGVLVVDQKRTP
Ga0208981_109360523300027669Forest SoilDGKAKRENTHVSFHDVPSTNYAEAGVLVVDQKRTP
Ga0209248_1012349813300027729Bog Forest SoilDGRAKREDIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0209139_1003685743300027795Bog Forest SoilITQVLIEDGKAKRENIQVTFVDLPATDYAEAGVTVADQKRTP
Ga0209773_1025539323300027829Bog Forest SoilAERITLALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0209180_1053658623300027846Vadose Zone SoilEPVTKALEEEGRAKRENIHVAFLDVPATNYAEAGVPVADQKRSP
Ga0209517_1067655723300027854Peatlands SoilIEEGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0209166_1030029633300027857Surface SoilIEDGRAKRENIHVAFHDVPATNYAEAGVLVADQKRTP
Ga0209701_1023545733300027862Vadose Zone SoilVLIEDGKAKRENIHVSFHDVPSTNYAEAGVLVMDQKRMP
Ga0209701_1024568733300027862Vadose Zone SoilAVLIEDGRAKRENIHVSFHDVPATDYAEAGVLVVDQKRSP
Ga0209283_1009236143300027875Vadose Zone SoilLIEDGKAKRENIHVSFHDVPSTNYAEAGVLVVDQKRTP
Ga0209283_1085732013300027875Vadose Zone SoilGVLMEDGRAKRENIHVAFQDVPATNYAEAGVLVADQKRTP
Ga0265319_121977713300028563RhizosphereVTQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTT
Ga0222748_110093113300029701SoilLIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTQ
Ga0302275_1026943033300030518BogITNALIEDGKAKRENIQVTFVDLPPTDYSEAGVTVEDQRKRA
Ga0247727_1080946713300031576BiofilmVEEGRAKREHVHVAFLDVPATDYAEAGVTVADQKQSP
Ga0307477_1054185123300031753Hardwood Forest SoilVTQALVEDGRAKRENIHVAFVDLPPTDYAEAGVTVAEQKRTP
Ga0307475_1133525513300031754Hardwood Forest SoilALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRIP
Ga0307478_1165907923300031823Hardwood Forest SoilLIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRTP
Ga0214473_1173912713300031949SoilRVTQALIEDGRAKLENIQVTFHDLPATDYAEAGVMVADQKRTP
Ga0311301_1259641223300032160Peatlands SoilAERITQALIEDGRAKRENIQVTFVDLPPTDYAEAGVTVADQKRAP
Ga0307471_10140202433300032180Hardwood Forest SoilDGKAKRENIHVSFHDVPATNYAEAGVLVVDQKRAP
Ga0307471_10186424423300032180Hardwood Forest SoilMEDGRAKRENIHVTFHDLPPASYAEAGVLVVDQKRTP
Ga0307471_10420867113300032180Hardwood Forest SoilIEDGRAKRENIHVTFVDLPPTDYAEAGVPVADQKRAP
Ga0348332_1351808123300032515Plant LitterVLIEDGKAKRENIHVSFHDVPPTNYAEAGVLVVDQKRTP
Ga0335079_1177214313300032783SoilERVTDTLIAEGAAKREHIQITFVDLPPTDYAEGGVTVADQKRVK
Ga0335080_1046874333300032828SoilTLIAEGAAKREHIQITFVDLPPTDYAEGGVTVADQKRVK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.