NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F086079

Overview

Basic Information
Family ID: F086079
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 111
Average Sequence Length: 44 residues
Representative Sequence: LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR
Number of Associated Samples: 97
Number of Associated Scaffolds: 111

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 100.00 %
% of genes near scaffold ends (potentially truncated): 19.82 %
% of genes from short scaffolds (< 2000 bps): 16.22 %
Associated GOLD sequencing projects: 89
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.47

Hidden Markov Model

Most Common Taxonomy
Group: Unclassified (79.279 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (24.324 % of family members)
Environment Ontology (ENVO): Unclassified (24.324 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (63.063 % of family members)
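
The percentages in the Overview can be turned back into whole numbers of family members. The sketch below assumes that every percentage is a fraction of the 111 sequences listed under Basic Information; the page does not state this, although the reported values are consistent with it. The function name `members_from_percent` is illustrative and not part of NMPFamsDB.

```python
FAMILY_SIZE = 111  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into a whole number of family members.

    Assumption: the percentage is taken over all `total` sequences.
    """
    return round(percent * total / 100)


# Percentages copied from the Overview section.
reported = {
    "genes near scaffold ends": 19.82,
    "genes from short scaffolds": 16.22,
    "Unclassified taxonomy": 79.279,
    "GOLD Forest Soil -> Soil": 24.324,
    "EMPO Soil (non-saline)": 63.063,
}

for label, percent in reported.items():
    print(f"{label}: {members_from_percent(percent)} of {FAMILY_SIZE}")
```

Under that assumption, for example, 19.82 % corresponds to 22 genes and 79.279 % to 88 family members.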



Multiple Sequence Alignments

Full Alignment: alignment of all 111 sequences in the family (interactive view rendered with MSAViewer). The protein IDs of the aligned sequences are listed under Family Sequences.


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary structure distribution: α-helix 27.54%, β-sheet 0.00%, coil/unstructured 72.46%

Predicted 3D Structure

pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
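
The rule in the note above can be written as a one-line check. This is a minimal sketch: the 0.7 cutoff and the 0.47 score come from this page, while the function name `is_low_confidence` is hypothetical and not an NMPFamsDB API.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


family_ptm = 0.47  # pTM-score reported for family F086079
print("low confidence" if is_low_confidence(family_ptm) else "high confidence")
```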



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: 20.7%
Unclassified: 79.3%

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types (chart labels, in the order shown):
Sediment
Watersheds
Vadose Zone Soil
Tropical Forest Soil
Grasslands Soil
Surface Soil
Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Soil
Hardwood Forest Soil
Bog Forest Soil
Soil
Permafrost Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Palsa
Plant Roots
Populus Rhizosphere
Corn Rhizosphere
Rhizosphere

Slice percentages shown on the chart: 23.4%, 11.7%, 5.4%, 24.3%, 3.6%, 4.5%, 4.5%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0005471J37259_1169352 | 3300002681 | Forest Soil | VRNASVLILSDEPEFARLLTACWQAERLAPGITLLSSDLWKDHEA
JGI25386J43895_101410462 | 3300002912 | Grasslands Soil | LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNKQEAPAHDL
JGI25617J43924_100589372 | 3300002914 | Grasslands Soil | LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWR
Ga0062595_1018543642 | 3300004479 | Soil | LPNFNVLILTDETEFARLLTACWQAERQTPAITVLASDLWKEHQDTP
Ga0066689_100772961 | 3300005447 | Soil | LPHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDL
Ga0066689_102980972 | 3300005447 | Soil | LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNK
Ga0070707_1010575051 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | LRHANVVILTDETEFARLLTACWQAERQAPLVTVLTSDLWQEQQA
Ga0070738_101863491 | 3300005531 | Surface Soil | LRHASLLILTDDAEFARLLSACWQAERQAPRITVLSSDL
Ga0070731_105921591 | 3300005538 | Surface Soil | VRSASVLILTDEPEFARLLTACWQAERHAPGITVL
Ga0066707_103568721 | 3300005556 | Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNITVL
Ga0066670_101383201 | 3300005560 | Soil | LRHANVIILTDETEFARLLTACWQAERQAPVITVLT
Ga0066705_106717411 | 3300005569 | Soil | LRSASVLILTDETEFARLLTACWRAERQAPGITVLGTDLWKDH
Ga0070761_110599922 | 3300005591 | Soil | VRSASVLILTDEPEFARLLTACWQAERHAPGITVLSSELW
Ga0070764_100592261 | 3300005712 | Soil | LQNVSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEH
Ga0070716_1007499711 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | LPHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDLV
Ga0075014_1001339272 | 3300006174 | Watersheds | LRNANVVILTDETEFARLLTACWQAERLAPNIAVINSDSWREQDAPAH
Ga0075014_1006725251 | 3300006174 | Watersheds | LRHASVVILTDEAEFARLLTACWQAERRAPAVTVLTSDLWKAQEAPVGDLMVLG
Ga0079222_119692461 | 3300006755 | Agricultural Soil | LRQANVVILTDETEFARLLTACWQAERQAPVVTVLTSELWQEQEAPARDLI
Ga0066658_101457671 | 3300006794 | Soil | LRHANVVILTDESEFARLLTACWQAERQAPNITVLNSNSWREQDAPDHDLV
Ga0075434_1005381271 | 3300006871 | Populus Rhizosphere | LPNFTVLILTDETEFARLLTACWQAERQPPAISVLASNLW
Ga0099795_103117841 | 3300007788 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSGSWREQD
Ga0099828_102228641 | 3300009089 | Vadose Zone Soil | LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDS
Ga0074044_105434652 | 3300010343 | Bog Forest Soil | LRNANVLILTDEAEFARLLTACWQAERHAPRVTVLNSDVWTAQSGPT
Ga0126370_115554622 | 3300010358 | Tropical Forest Soil | LKNANVVILTDESEFARLLSACWHAERHAPAITVLN
Ga0126377_117627831 | 3300010362 | Tropical Forest Soil | LKHASVLILTDETEFARLLTSCWQTERQAPRITVLNS
Ga0134066_103825072 | 3300010364 | Grasslands Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNIAVLNSDSWHEPNAPAHDLV
Ga0137393_105311431 | 3300011271 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWREQDAPAHDLV
Ga0137389_104538121 | 3300012096 | Vadose Zone Soil | LRHANVIILTDETEFARLLTAGWQAERQAPRVTVLTSDLWQEQEAPARD
Ga0137389_105466882 | 3300012096 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPVVTVLTSDLWQEQEAPARD
Ga0137388_119041432 | 3300012189 | Vadose Zone Soil | LRHANVLILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPA
Ga0137383_110216872 | 3300012199 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQTERQAPGVTVLNSDSWQDQDA
Ga0137380_111923242 | 3300012206 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQTERQAPGVTVL
Ga0137386_100462192 | 3300012351 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVINSDSWQEQNAQEHDLVVV*
Ga0137360_102590312 | 3300012361 | Vadose Zone Soil | LRHANVLILTDETEFARLLTACWQAERQAPSITVIS
Ga0137360_118443121 | 3300012361 | Vadose Zone Soil | LRSASVLILTDETDFARLLTACWQAERHAPGITVL
Ga0137361_118788171 | 3300012362 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLVVVG
Ga0137398_107391572 | 3300012683 | Vadose Zone Soil | LILTDETDFARLLTACWQAEKHAPGITVLGSDLWRDHETLPH
Ga0137413_113553501 | 3300012924 | Vadose Zone Soil | LVNSNVLIVTDETEFARLLTSCWQAERQAPGITILGSE
Ga0137419_109281872 | 3300012925 | Vadose Zone Soil | LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWRDHE
Ga0137419_112659581 | 3300012925 | Vadose Zone Soil | LRHANVLVLTDETEFARLLTACWQAERQAPGITVIGSELWREQDVPR
Ga0137418_104507961 | 3300015241 | Vadose Zone Soil | LRHANVVILTDDTEFARLHTACWQAERQPPNITVLNSDLWQEQNTTA
Ga0182034_119674071 | 3300016371 | Soil | LRHANVVILTDETEFARLLTACWQAERHAPRVTVL
Ga0134112_105259591 | 3300017656 | Grasslands Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHEQNAPSH
Ga0137408_14816241 | 3300019789 | Vadose Zone Soil | LANSNVLIVTDETEFARLLTSCWQAERQAPGITILGSELWSEHEEIA
Ga0179594_102845061 | 3300020170 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR
Ga0179592_102438251 | 3300020199 | Vadose Zone Soil | LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDL
Ga0210407_101110291 | 3300020579 | Soil | LRSASVLILTDETDFARLLTACWQAEKHAPGITVLGSDLWK
Ga0210403_111209382 | 3300020580 | Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLN
Ga0210401_106460762 | 3300020583 | Soil | VRNASVLILSDEPEFARLLTACWQAERVAPGITVLSSDLWK
Ga0179596_100706961 | 3300021086 | Vadose Zone Soil | LRNANVIILTDETEFGRLLTACWQTERQAPNITVLSSDLWREQDAPAHDPVVLGP
Ga0210404_100610291 | 3300021088 | Soil | LRHANVVILTDETEFARLLTACWQTERQAPGVTVLTSD
Ga0210404_100640172 | 3300021088 | Soil | LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWREQDAP
Ga0210406_105076701 | 3300021168 | Soil | VRNASVLILSDEPEFARLLTACWQAERLAPGITVLSSDLWK
Ga0210400_107987931 | 3300021170 | Soil | LRHANVVILTDETEFARLLTACWQTERQAPNITVLNSDSWRE
Ga0210400_114847161 | 3300021170 | Soil | LQNSSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEHEAVAHEL
Ga0210405_104858781 | 3300021171 | Soil | VRNASVLILSDEPEFARLLTACWQAERLAPGITVL
Ga0210408_106286942 | 3300021178 | Soil | LRNANVVILTDETEFARLLTACWQAERQAPNIAVVNSDSW
Ga0210388_115289502 | 3300021181 | Soil | VRNASVLILSDEPEFARLLTACWQAERLAPGITVLSSDLWKDHE
Ga0213876_100415601 | 3300021384 | Plant Roots | LQNSSVLIVTDEPEFARLLTACWRAEREVPAITVLGSDVWNEHEGVAHELAVVG
Ga0210393_101346873 | 3300021401 | Soil | LRNANVLILTDESEFARLLTACWQAERQAPGITVLGSESWKQHEAL
Ga0210393_116919312 | 3300021401 | Soil | VRNASVLILSDEPEFARLLTACWQAERLAPGITLLSS
Ga0210385_100640284 | 3300021402 | Soil | LQNFNVLIVTDEPEFARLLTACWRAEREVPAITVLAS
Ga0210397_104970362 | 3300021403 | Soil | LQNVSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNEHEG
Ga0210386_101074323 | 3300021406 | Soil | LPNSNALIVTDETEFARLLTSCWQAERQAPAITVLGSD
Ga0210394_101353561 | 3300021420 | Soil | LRHANVVILTDETEFARLLTACWQAERQAPAITVLGSSLWREHEGTSHDLVVV
Ga0210384_105280032 | 3300021432 | Soil | LRSASVLILTDETDFARLLTACWRAEKHAPGITVL
Ga0210390_113156822 | 3300021474 | Soil | LRNANVLILTDDAEFARLLTACWQTERQAPRVAVLSSDL
Ga0210398_101031741 | 3300021477 | Soil | LQNFNVLIVTDEPEFARLLTACWRAEREVPAITVLASDVWNEHEGVAHELAVV
Ga0210410_111430091 | 3300021479 | Soil | LRHANVVILTDETEFARLLTACWQAEPQAPNITVLNSESWQE
Ga0210409_105499741 | 3300021559 | Soil | LRHANVVILTDETEFARLLTACWQAEPQAPNITVLNSESWQKQDAPAHDLV
Ga0242662_101301022 | 3300022533 | Soil | LRNANVLILTDESEFARLLTACWQAERQAPGITVLGSESWK
Ga0137417_10998861 | 3300024330 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNQAPNITVLN
Ga0137417_14397063 | 3300024330 | Vadose Zone Soil | MRQQSVILTDETEFARFADGCWQAERQAPNITVLIAIVA
Ga0247668_10052301 | 3300024331 | Soil | MPGSFSPLRHANLLILTDDAEFARLLSACWQAERQAPRITVL
Ga0207671_109072801 | 3300025914 | Corn Rhizosphere | LQNASVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWGEH
Ga0209863_100067181 | 3300026281 | Permafrost Soil | LANSNVLIVTDETEFARLLTSCWQAERQAPGITVLGSDLWS
Ga0209839_102317881 | 3300026294 | Soil | LRNANVLILTDEAEFARLLTACWQTERQAPRVTVLNSDVWHGQSGPT
Ga0209240_10489961 | 3300026304 | Grasslands Soil | LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLV
Ga0209131_14110521 | 3300026320 | Grasslands Soil | LRNANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLVVV
Ga0209152_103055991 | 3300026325 | Soil | LRNASVLILTDETDFARLLTSCWQADRHAPAITVLGSELWKNHEVA
Ga0209267_11020332 | 3300026331 | Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHE
Ga0209803_11281492 | 3300026332 | Soil | LRHANIVILTDETEFARLLTACWQGERQMPAITVLGSDLWNKQE
Ga0257161_10890912 | 3300026508 | Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWPEQDTP
Ga0209378_10356724 | 3300026528 | Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNSDSWHEQNAPSHDLVV
Ga0209056_105739501 | 3300026538 | Soil | LRHASVVILTDETEFARLLTACWQAERHAPAVTVLTSDL
Ga0209805_11572131 | 3300026542 | Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNITVLNS
Ga0209805_12759912 | 3300026542 | Soil | LRHANVVILTDDSEFARLLTACWQAERQAPNIAVLNSDSWHEPNAPAHDLVVVGP
Ga0209648_103290892 | 3300026551 | Grasslands Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLV
Ga0209648_104976511 | 3300026551 | Grasslands Soil | LRNANVIILTDETEFGRLLTACWHAERQAPNITVLNSDLWR
Ga0209116_10640663 | 3300027590 | Forest Soil | LPNSNVLIVTDETEFARLLTACWQTERQAPGITVLGSDL
Ga0209329_10093391 | 3300027605 | Forest Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQDAPAHDLVVVGP
Ga0209118_11080381 | 3300027674 | Forest Soil | LRSANVLILSDETDFARLLTACWQAERHAPAITVL
Ga0209074_101728491 | 3300027787 | Agricultural Soil | LRHANVVILTDETEFARLLTACWQAERQAPVVTVLTS
Ga0209274_102020091 | 3300027853 | Soil | VRNASVLILTDEPEFARLLTACWQTERQAPGITVLSSELWKDHEATP
Ga0209274_106769771 | 3300027853 | Soil | VRSASVLILTDEPEFARLLTACWQAERHAPGITVLSSELWK
Ga0209693_101933011 | 3300027855 | Soil | LQNFSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWKEHEGVAHELAVV
Ga0209166_100083536 | 3300027857 | Surface Soil | LRHANVVILTDETEFARLLTACWQADRQAPVVTVLTSDLWQEQQ
Ga0209068_102547271 | 3300027894 | Watersheds | LRNANVVILTDETEFARLLTACWQAERLVPNIAVLNSDSWREQDALVLA
Ga0209488_105684361 | 3300027903 | Vadose Zone Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPTHD
Ga0209006_110486231 | 3300027908 | Forest Soil | LQNSSVLIVTDEPEFARLLTACWRAERDVPAITVLASDVWNE
Ga0137415_105830631 | 3300028536 | Vadose Zone Soil | LRHANVLVLTDETEFARLLTACWQAERQAPGITVIGSE
Ga0137415_110083351 | 3300028536 | Vadose Zone Soil | LANSNVLIVTDETEFARLLTSCWQAERQAPGITILGSELWSEHE
Ga0265338_105921912 | 3300028800 | Rhizosphere | LRNANVLILTDEAEFGRLLSTCWQAERQAPRVTIVDSEH
Ga0222749_106329342 | 3300029636 | Soil | LRHANVVILTDETEFGRLLTACWQAERQAPNITVLNSDSW
Ga0311353_107942451 | 3300030399 | Palsa | LRNANVLILTDEAEFGRLLTTCWRTERQPPRMTILDSDNWHGQE
Ga0307477_103831522 | 3300031753 | Hardwood Forest Soil | LRHANVVILTDETEFARLLTACWQAERQAPNITVLNSDSWREQEAPAHDLVVV
Ga0307475_113602241 | 3300031754 | Hardwood Forest Soil | LRHANVVILTDDTEFARLLTACWQAERQAPNITVLNSDSWHEQDAPALDLIVVGP
Ga0310913_111945921 | 3300031945 | Soil | LRQANVVILTDETEFARLLTACWQAERQAPVVTVLTSDLCQEQQ
Ga0307479_104826012 | 3300031962 | Hardwood Forest Soil | LRSASVLILTDEPDFARLLTACWQAEKHAPGITVLGSDLW
Ga0307479_112779582 | 3300031962 | Hardwood Forest Soil | LRSASVLILTDETDFARLLTACWQAEKHGPGITVMGSD
Ga0315270_111881231 | 3300032275 | Sediment | LRHTNVIVLTDETEFARLLTACWQAERHVPAIAVLNSDLWLRQQHLPCDLLV
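
Once each row is split into its four fields (protein ID, sample taxon ID, habitat, sequence), per-family summaries such as sequence lengths and habitat counts are straightforward to compute. The sketch below uses three rows copied from the table above. The names `ROWS` and `summarize` are illustrative, and treating a trailing `*` as a stop marker that does not count toward length is an assumption, not something the page states.

```python
from collections import Counter
from statistics import mean

# Three rows copied from the Family Sequences table:
# (protein ID, sample taxon ID, habitat, sequence)
ROWS = [
    ("Ga0070731_105921591", "3300005538", "Surface Soil",
     "VRSASVLILTDEPEFARLLTACWQAERHAPGITVL"),
    ("Ga0179594_102845061", "3300020170", "Vadose Zone Soil",
     "LRHANVVILTDETEFARLLTACWQAERQAPNITVLSSDSWR"),
    ("Ga0210403_111209382", "3300020580", "Soil",
     "LRHANVVILTDETEFARLLTACWQAERQAPNITVLN"),
]


def summarize(rows):
    """Return row count, mean sequence length, and habitat counts."""
    # Assumption: a trailing "*" is a stop marker, not a residue.
    lengths = [len(seq.rstrip("*")) for _, _, _, seq in rows]
    habitats = Counter(habitat for _, _, habitat, _ in rows)
    return {"n": len(rows), "mean_length": mean(lengths), "habitats": habitats}


summary = summarize(ROWS)
print(summary["n"], round(summary["mean_length"], 2))
print(summary["habitats"].most_common())
```

Applied to all 111 rows, the same function would yield the family-wide habitat counts and a mean length to compare with the reported average of 44 residues.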




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice