NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F099046


Overview

Basic Information
Family ID F099046
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 43 residues
Representative Sequence TIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR
Number of Associated Samples 89
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.97 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.46

Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.029 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (28.155 % of family members)
Environment Ontology (ENVO) Unclassified (37.864 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (55.340 % of family members)




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary structure distribution: α-helix 38.03%, β-sheet 0.00%, coil/unstructured 61.97%
Feature Viewer
Sequence shown: TIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR
Tracks: Sequence, α-helices, β-strands, Coil, SS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.46
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
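
The cutoff stated above can be written as a one-line check. This is a minimal Python sketch, not NMPFamsDB code: the names `PTM_CUTOFF` and `is_low_confidence` are hypothetical, and only the 0.7 cutoff and the 0.46 score are taken from this page.

```python
# Hypothetical helper: only the 0.7 cutoff and the 0.46 score come from this page.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score is strictly below the cutoff."""
    return ptm_score < cutoff


family_ptm = 0.46  # pTM-score reported for family F099046
print(is_low_confidence(family_ptm))
```

A score equal to the cutoff is treated as not low confidence here, following the strict inequality (pTM < 0.7) in the note.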



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

All Organisms: Unclassified (99.0%)

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Habitat types in the chart: Groundwater Sediment; Wastewater; Soil; Vadose Zone Soil; Terrestrial Soil; Glacier Forefield Soil; Grasslands Soil; Agricultural Soil; Sugarcane Root And Bulk Soil; Permafrost; Hardwood Forest Soil; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Sediment; Arabidopsis Rhizosphere; Miscanthus Rhizosphere; Arabidopsis Thaliana Rhizosphere; Populus Rhizosphere
Slice labels (not matched to habitats in the source): 2.9%, 2.9%, 5.8%, 17.5%, 3.9%, 2.9%, 3.9%, 28.2%, 5.8%, 5.8%



Associated Samples


Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
A10PFW1_116603052 | 3300001538 | Permafrost | VSHPGTIVPAERAAIEALLGAFDALRYDRDAGPVRFVTPAQLAKAYRP*
JGI25382J43887_101403321 | 3300002908 | Grasslands Soil | ATERAAIEALFEAFAPVRYDRDAGPVHFVTLAQLAKAYGR*
soilH2_102709632 | 3300003324 | Sugarcane Root And Bulk Soil | ATIDATERAAIETLFAAFEPYRWDRQHGPLRFVTLSQLAKAIGP*
Ga0063356_1003024651 | 3300004463 | Arabidopsis Thaliana Rhizosphere | RAMTLVSHPSTIDATERAAIEKLFQAFEPYRLDRDKGPLRFVTLSQLAKAYGP*
Ga0062592_1002132132 | 3300004480 | Soil | GTIDTIERAAIEALFHAFDPYRYDRDSGPLRFVTLAQLAKAYGR*
Ga0066683_106312871 | 3300005172 | Soil | RAVTIVSHPSTIDATERGAIAALFSALAPLRYDLDNGPVRFVTLAQLAQAWR*
Ga0066680_101000601 | 3300005174 | Soil | PGTINAAEGAAINSLFDALATLRYDLDKGPLRFVTLAQLAQAWR*
Ga0066673_103432211 | 3300005175 | Soil | IDATERAAITALFDAFAPLRYDRDAGPVRFVTLAQLAKAWR*
Ga0066690_100515751 | 3300005177 | Soil | GTIDATERAAITELFDALASLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066671_106292061 | 3300005184 | Soil | IDATERGAIEALFTAFDPYRYDRDAGPLRFVTAAELAQAYR*
Ga0066676_102581052 | 3300005186 | Soil | TIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0066676_106422031 | 3300005186 | Soil | RRAITIVSHPGTIDATERAAITALFEALTPLRYDQDKGPVRFVTLAQLAQAWP*
Ga0066675_111982281 | 3300005187 | Soil | IDASERAAIEALLGAFAPLRYDRDTGPLRFVTLAQLAKAYGL*
Ga0065715_101582342 | 3300005293 | Miscanthus Rhizosphere | IDTIERAAIEALFHAFDPYRYDRDAGPLRFVTLAQLAKAYGR*
Ga0070691_106202071 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | TERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYRP*
Ga0066686_104734761 | 3300005446 | Soil | RAITIVSHPGTIDATECAAITALFNAFAPLRYDLDNGPVRFVTLAQLAQAWR*
Ga0070706_1003225741 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | TIDATERAAIEALFDAFGPLRYDADRGPVRFVTLAELAEAWR*
Ga0070706_1003317551 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | GTIDATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0070697_1006787091 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | SHPATIDAIERDAIESLFTAFAPLRYDADAGPVRFVTLAQLAQALK*
Ga0070697_1019991632 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LVSHPGTITATERAAIEALFQAFEPVRYDRDAGPVRFVTLAQLAKAYGR*
Ga0066704_101393062 | 3300005557 | Soil | NERAAITALFDAFTPLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066700_102811452 | 3300005559 | Soil | ITIVSHPGTIDATERAAITALFDALAALRYDQDKGPVRFVTLAQLAQAWP*
Ga0066699_102142843 | 3300005561 | Soil | SHPGTIGPAERAAIEALLGAFGPLRYDADAGPVRFVTLAQLATAWGR*
Ga0066699_103663561 | 3300005561 | Soil | AIETLFAAFEPLRYDRGNGPLRFVTLAQLATAYSR*
Ga0066703_102236181 | 3300005568 | Soil | ERAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG*
Ga0066691_104009602 | 3300005586 | Soil | ERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0066706_100985801 | 3300005598 | Soil | TIDATERAAITELFDALASLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066706_102337202 | 3300005598 | Soil | TIDATERAAITELFDALASLRYDLDNGPLRFITLAQLAQAYPY*
Ga0066706_114399371 | 3300005598 | Soil | TIVASERAAIETLFAAFEPLRYDRGNGPLRFVTLAQLAEAYSR*
Ga0068870_105170651 | 3300005840 | Miscanthus Rhizosphere | PATVNATERAAIESLFRSFEPYRYDRDGGPLRFVTLAQLAQAFK*
Ga0066653_105969141 | 3300006791 | Soil | AAIEALFLAFDPVRYDRDAGPVRFVTLAQLAKAYGR*
Ga0066659_101176461 | 3300006797 | Soil | TLVSHPGTIVPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP*
Ga0075425_1003102882 | 3300006854 | Populus Rhizosphere | ERGAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0079215_104134661 | 3300006894 | Agricultural Soil | IEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYGP*
Ga0075424_1023602081 | 3300006904 | Populus Rhizosphere | GTIDATERAAIETLFKAFFPLRYDQDSGPLRFVTLAQLAAAYSR*
Ga0079216_115837071 | 3300006918 | Agricultural Soil | HPSTIDATERAAIEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYGP*
Ga0079218_108255502 | 3300007004 | Agricultural Soil | EKLFQAFEPYRWDRDKGPLRYVTLSQLAKAYSSP*
Ga0066710_1001236695 | 3300009012 | Grasslands Soil | ERAAIEALVKAFAPLRYDRDVGPLRFVTLAEVAKAYSK
Ga0066710_1006498501 | 3300009012 | Grasslands Soil | IVSHPGTIDATERAAIETLLGAFAPVRYDRDTGPLRFVTLAQLAKAYGL
Ga0066710_1012889213 | 3300009012 | Grasslands Soil | GAIEALLGAVGPLRYDADAGPVRFVTLAQLATAWAR
Ga0099827_118890632 | 3300009090 | Vadose Zone Soil | IVSHPGTLDATEPAAITALFEALAPLRYDLDRGPVRFVTLAQLAQAWR*
Ga0066709_1022066862 | 3300009137 | Grasslands Soil | DATERDAIEALFRAFEAYRYDRDAGALRFVTLAQLAKAYAR*
Ga0105164_101217611 | 3300009777 | Wastewater | PGTIDAAERAAIETLLHAFDPFRYDQDRGPVRSITLRELAQVWK*
Ga0134088_102730402 | 3300010304 | Grasslands Soil | IEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0134111_102294031 | 3300010329 | Grasslands Soil | SHPGTIDATERAAITELFDALASLRYDLDNGPLRFITLAQLAQAYPY*
Ga0134126_130995412 | 3300010396 | Terrestrial Soil | RAAITALFDALAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0134124_127090721 | 3300010397 | Terrestrial Soil | TLVSHPATIDATERAAIEKLFAAFEPYRWDRDMGPLRYVTLAQLAKAFGP*
Ga0120163_10859061 | 3300012003 | Permafrost | PGTIDATERAAIAALFNALAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0137399_105607192 | 3300012203 | Vadose Zone Soil | IDVTERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWR*
Ga0137399_109839011 | 3300012203 | Vadose Zone Soil | AIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0137380_113324472 | 3300012206 | Vadose Zone Soil | TVVSHPGTNDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0137381_107552762 | 3300012207 | Vadose Zone Soil | VSHPGTIDATERAAITAIFNALAPFRYDLDQGPLRFVTLAQLAQAWR*
Ga0137376_103171501 | 3300012208 | Vadose Zone Soil | IDATERAAITALFDAFAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0137377_100238871 | 3300012211 | Vadose Zone Soil | HPGTIDATERAATEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0137377_106412772 | 3300012211 | Vadose Zone Soil | HPGTIDATERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWP*
Ga0137367_101564851 | 3300012353 | Vadose Zone Soil | ALTLVSHPGTIDATERAAIEALLGAFTPYRYDSDAGPLRFVTLAQLAKAYSR*
Ga0137367_107468521 | 3300012353 | Vadose Zone Soil | TIDATERAAIETLFRAFEPLRYERDTGPLRFVTLAQLAKAYSP*
Ga0137368_100881693 | 3300012358 | Vadose Zone Soil | TIDATERGAIEKLFAAFEPYRWDRDMGPLRYVTLAQLAKAYGP*
Ga0137397_100497301 | 3300012685 | Vadose Zone Soil | RAAIEALFKAFGPLRYDADSGPVRFVTLAQVAQAFR*
Ga0137396_104046541 | 3300012918 | Vadose Zone Soil | IDATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0137410_103743431 | 3300012944 | Vadose Zone Soil | EVKVARDATERAAITALFNALAPLRYDLDKGPVRFVTLAQLAQAWR*
Ga0120106_10260463 | 3300013427 | Permafrost | AIEALLGAFDPLRYDLDSGPVRFVTLAQLAKAYGP*
Ga0120158_102850281 | 3300013772 | Permafrost | IEALLGAFDALRYDKDSGPVRFVTAAQLAKAYRQ*
Ga0134075_100132982 | 3300014154 | Grasslands Soil | VSHPGTIDATERAAIAALFNALAPLRYDRDSGPLRFVTLAQLAQAWR*
Ga0167657_10343822 | 3300015079 | Glacier Forefield Soil | GTIDATERAAIESLFQSFEPYRYDRDHGPVRFVTLAQLAQALK*
Ga0167650_11111581 | 3300015203 | Glacier Forefield Soil | TIVPAERAAIEALLGAFDGLRYDKDAGPVRFVTAAQLAKAYRQ*
Ga0137409_100429411 | 3300015245 | Vadose Zone Soil | GTIDATERAAITALFNALAPLRYDLDKGPVRFVTLAQLAQAWR*
Ga0132258_131184931 | 3300015371 | Arabidopsis Rhizosphere | LTLVSHPATIDPIERDAIESLFKAFGPLRYEADSGPLRFVTLAQLAQAMK*
Ga0134112_103917251 | 3300017656 | Grasslands Soil | RAITLVSHPGTIDDTERVAIESLFAAFEPLRYDRDEGPLRFVTLAQLARAYAP
Ga0184610_10675301 | 3300017997 | Groundwater Sediment | ERAAIEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYNR
Ga0184604_103734222 | 3300018000 | Groundwater Sediment | TESAAIAALFNALGPLRYDRDSGPVRFVTLAQLAQAWR
Ga0184632_101861952 | 3300018075 | Groundwater Sediment | VSHPSTIDATERGAIETLFRAFDQYRYDRDAGPLRFVTLAQLARAYGP
Ga0215015_104605542 | 3300021046 | Soil | SHPGTFDATERAAIEALFTAFAPLRYDDDAGPVRFVTLTQFAAAYK
Ga0222625_16450601 | 3300022195 | Groundwater Sediment | APLALTIVSHPGTIDATERAAITALFNAFAPLRYDLDKGPVRFVTLAQLAQAWR
Ga0137417_13529882 | 3300024330 | Vadose Zone Soil | VITVVSHPGTIVPAERAAIEALLGAFGPLRYDADAGPVRFVTLAQLAKAYNR
Ga0209642_102485483 | 3300025167 | Soil | GAIEALFRAFDPFRYDRDAGPLRFVTLAQLAKAYAR
Ga0209824_100561721 | 3300025173 | Wastewater | PGTIDATERAAIESLFRALEPYRFDRDRGPVRFITLAQLGKALK
Ga0209824_102181872 | 3300025173 | Wastewater | IVSHPGTIDAAERAAIETLLHAFDPFRYDQDKGPLRFVTLQQLARAWK
Ga0207699_108576413 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | PAERAAIEALLMAFGPLRYDTDSGPVRFVTLAQLAKAYAH
Ga0207670_108575732 | 3300025936 | Switchgrass Rhizosphere | TERAAIEALFKSFEPYRYDRDNGPLRFVTLAQLAQAFK
Ga0207704_103110862 | 3300025938 | Miscanthus Rhizosphere | VSHPGTIDTIERAAIEALFHAFDPYRYDRDSGPLRFVTLAQLAKAYGR
Ga0209238_12731822 | 3300026301 | Grasslands Soil | TIDATERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWR
Ga0209471_11538641 | 3300026318 | Soil | VPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP
Ga0209473_10559941 | 3300026330 | Soil | GPAERAAVEALLGAFGPLRYDADAGPVRFVTLAQLATAWGR
Ga0209803_11020391 | 3300026332 | Soil | ATERAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG
Ga0209160_11402181 | 3300026532 | Soil | VSHPGTIDANERAAITALFDAFRPLRYDLDNGPLRFVTLAQLAQAWR
Ga0209157_10794521 | 3300026537 | Soil | RAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG
Ga0209056_100437255 | 3300026538 | Soil | AITIVSHPGTIDATERAAITALFEALAPLRYDQDKGPVRFVTLAQLAQAWP
Ga0209056_101199313 | 3300026538 | Soil | DATERAAIEALFRAFEPYRYDRDAGPLRFITLAQLAKAYSR
Ga0209376_11008371 | 3300026540 | Soil | VVSHPGTIDATERAAIEALFRAFEPYRYDRDAGPLRFITLAQLAKAYSR
Ga0209161_101112902 | 3300026548 | Soil | SHPGTIDATERAAITALFEALAPLRYDQDKGPVRFVTLAQLAQAWP
Ga0209474_100482175 | 3300026550 | Soil | VSHPGTIVPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP
Ga0209011_11018731 | 3300027678 | Forest Soil | DATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR
Ga0209590_103324012 | 3300027882 | Vadose Zone Soil | ITATERAAIEALFQAFDPVRYDRDAGPVHFVTLAQLAKAYGR
Ga0209590_106005912 | 3300027882 | Vadose Zone Soil | HPGTIDATERAAITALFDAFSPLRYDRHAGPVRFVTLAQLAQAWR
Ga0307301_101215621 | 3300028719 | Soil | IDATERAVITALFNAFAPLRHDLDNGPVRFVTLAQLAQAWR
Ga0307305_100457771 | 3300028807 | Soil | AAITALFEAFAPLRYDRDAGPVRFVTLAQLAQAWR
Ga0307292_100951431 | 3300028811 | Soil | AIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR
Ga0307304_102203811 | 3300028885 | Soil | HPGTIDATERAAIEALFQSFEPYRYDRDNGPLRFVTLAQLERAFK
Ga0307473_106594772 | 3300031820 | Hardwood Forest Soil | IDATERAAITALFDAFAPLRYDQDNGPLRFVTLAQLADAWR
Ga0326597_104957451 | 3300031965 | Soil | STIGATERGAIETLFRAFEPLRYDRDAGPLRFVTLAQLAKAYAR
Ga0307471_1026125471 | 3300032180 | Hardwood Forest Soil | TVVSHPGTIDATERAAIESLFAALEPLRYDRDSGPVRFVTLAQLARAYAR
Ga0364934_0425100_357_503 | 3300034178 | Sediment | VSHPSTIDATERGAIETLFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR
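
As a rough illustration of working with the rows above, the following Python sketch measures sequence length for three rows copied from this table. It assumes each row has already been split into protein ID, sample taxon ID, habitat, and sequence, and it assumes a trailing `*` is a terminator symbol rather than a residue (the page does not define it). The `FamilyMember` type and the `residue_length` function are hypothetical names, not part of NMPFamsDB.

```python
from typing import NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (hypothetical record type)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


def residue_length(member: FamilyMember) -> int:
    """Count residues, ignoring a trailing '*' (assumed to be a terminator mark)."""
    return len(member.sequence.rstrip("*"))


# Three rows copied from the table above.
members = [
    FamilyMember("Ga0066676_102581052", "3300005186", "Soil",
                 "TIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*"),
    FamilyMember("Ga0307292_100951431", "3300028811", "Soil",
                 "AIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR"),
    FamilyMember("Ga0105164_101217611", "3300009777", "Wastewater",
                 "PGTIDAAERAAIETLLHAFDPFRYDQDRGPVRSITLRELAQVWK*"),
]

lengths = {m.protein_id: residue_length(m) for m in members}
for protein_id, length in lengths.items():
    print(protein_id, length)
```

The first row is the family's representative sequence, so its length under this convention can be compared with the average sequence length reported in the Overview.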




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice