NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F103721

Metagenome / Metatranscriptome Family F103721

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F103721
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 47 residues
Representative Sequence MTRGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVSQ
Number of Associated Samples 84
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 50.00 %
% of genes near scaffold ends (potentially truncated) 2.97 %
% of genes from short scaffolds (< 2000 bps) 2.97 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.43

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.040 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(20.792 % of family members)
Environment Ontology (ENVO) Unclassified
(21.782 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(61.386 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.
1FA3_04276240
2FG3_09782000
3JGI12659J15293_100696461
4JGIcombinedJ51221_101026221
5Ga0062384_1011214272
6Ga0062386_1015336191
7Ga0066388_1085459102
8Ga0070695_1016379141
9Ga0066903_1005544954
10Ga0066903_1019648123
11Ga0066903_1039305312
12Ga0075017_1003794932
13Ga0075019_105533122
14Ga0075014_1000192347
15Ga0075014_1006593202
16Ga0070765_1015958981
17Ga0073928_105983272
18Ga0105242_112252882
19Ga0116218_10936354
20Ga0116218_10964453
21Ga0116225_12611681
22Ga0105249_122771491
23Ga0116102_10706673
24Ga0116110_10456421
25Ga0116110_11441072
26Ga0116110_11686202
27Ga0116215_14674742
28Ga0116224_100919782
29Ga0074044_106976272
30Ga0134128_109545061
31Ga0126381_1037125802
32Ga0134126_106390531
33Ga0134122_134075561
34Ga0105246_102279593
35Ga0164299_101456881
36Ga0164301_111999191
37Ga0164305_101327431
38Ga0164305_103036991
39Ga0181526_110608571
40Ga0157379_109581142
41Ga0132256_1016147101
42Ga0182041_118878961
43Ga0182038_118802011
44Ga0187821_103645471
45Ga0187780_106171282
46Ga0187782_101641261
47Ga0187822_100672441
48Ga0187816_104751381
49Ga0187876_12401322
50Ga0187804_103559531
51Ga0187860_13353972
52Ga0187855_104288981
53Ga0187871_107376821
54Ga0187770_103653011
55Ga0210407_108698512
56Ga0210403_101589051
57Ga0210406_100496413
58Ga0210405_101957143
59Ga0210408_110939531
60Ga0210396_112398282
61Ga0210393_105615461
62Ga0210397_111156021
63Ga0210383_110998452
64Ga0210394_110788332
65Ga0210394_112744871
66Ga0210384_106195461
67Ga0210390_108860963
68Ga0210390_115524611
69Ga0210410_105323851
70Ga0210410_106035972
71Ga0210410_117643542
72Ga0242651_10323302
73Ga0212123_106942951
74Ga0212123_107789291
75Ga0228598_10920941
76Ga0209171_105500611
77Ga0207663_109741032
78Ga0207702_115285702
79Ga0209333_10890491
80Ga0209166_105772711
81Ga0209169_107380431
82Ga0209068_105157451
83Ga0209067_100840355
84Ga0209067_105550811
85Ga0209006_112142092
86Ga0268264_120571461
87Ga0308309_105200781
88Ga0170824_1016172943
89Ga0170818_1035878982
90Ga0310686_1014076112
91Ga0310686_1128754012
92Ga0310686_1154344512
93Ga0307477_102362013
94Ga0318546_110710721
95Ga0306919_104441792
96Ga0310913_109783412
97Ga0310913_113150832
98Ga0306926_121848971
99Ga0311301_1004741613
100Ga0306920_1021156811
101Ga0306920_1034110031
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 51.39%    β-sheet: 0.00%    Coil/Unstructured: 48.61%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

510152025303540MTRGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVSQSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.43
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
4.0%96.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Peatland
Bog Forest Soil
Bog
Peatland
Freshwater Sediment
Iron-Sulfur Acid Spring
Watersheds
Soil
Terrestrial Soil
Tropical Forest Soil
Grass Soil
Surface Soil
Peatlands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Tropical Peatland
Bog Forest Soil
Tropical Forest Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
4.0%4.0%4.0%4.0%6.9%6.9%3.0%5.9%20.8%5.9%3.0%4.0%4.0%3.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
FA3_042762402170459023Grass SoilMFYSLHDQGGILRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVAQLKERGDPWR
FG3_097820002189573005Grass SoilMTKGXVLRYAIAFALRRSRKIIRGLKQGLTEEERYAVADHAVAQLK
JGI12659J15293_1006964613300001546Forest SoilVLRYAIAFALMRARKIVRGAKAGLTEQERYAIADH
JGIcombinedJ51221_1010262213300003505Forest SoilMTKGXVLRYAIAFALRRSRKIIRGLKQXLTEEERYAVADHAVAQLKECGDLGG*
Ga0062384_10112142723300004082Bog Forest SoilMTRGDLLRYRIAFALSRAAKIVRGLRQALTEAERYAVADHVV
Ga0062386_10153361913300004152Bog Forest SoilVTRGDVLKYKIAFALRRASKIVRGLRQGLTEPERYAVADHVVD
Ga0066388_10854591023300005332Tropical Forest SoilMTRGEVLRYAIAFALRRSRKIIRGLKEGLTEAERYAVADHAVAPRSQQRA*
Ga0070695_10163791413300005545Corn, Switchgrass And Miscanthus RhizosphereMTRGDVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVVAQLRE
Ga0066903_10055449543300005764Tropical Forest SoilMTKGEVLRYAIAFALRRSRKIICGLKEGLTEEERFAVADHTVEQERAWGSLAVK*
Ga0066903_10196481233300005764Tropical Forest SoilMTRGEVLRYAIAFALRRAKKIIRGLKESLTEEERF
Ga0066903_10393053123300005764Tropical Forest SoilMFYSSMTRGEVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVVAHAE*
Ga0075017_10037949323300006059WatershedsMTRGEVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVV
Ga0075019_1055331223300006086WatershedsMTRGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADH
Ga0075014_10001923473300006174WatershedsMVCSHPMTRGDILRYRIAFALSRASKIIRGLKQGLTEAERYAVADHVVAQLKE
Ga0075014_10065932023300006174WatershedsMTKGEILRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVAQLKERGDLWRPKP*
Ga0070765_10159589813300006176SoilMTRGDVLRDSIAFALRRARKIIRGLKEGLSESERYAVA
Ga0073928_1059832723300006893Iron-Sulfur Acid SpringMTRGEVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVVAQLKDRGGP
Ga0105242_1122528823300009176Miscanthus RhizosphereMTRAVVLRYAIAFALWRARKIIRGLKDGLTEEERYSVAD
Ga0116218_109363543300009522Peatlands SoilMFLWGMTKGEVLRYAIAFALRRSRKIVRGLKEGLTEDERYAVADHAVAQLK
Ga0116218_109644533300009522Peatlands SoilMTKGEILRYAIAFALRRSRKIIRGLKEGLSEDEGMPSQI
Ga0116225_126116813300009524Peatlands SoilMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERY
Ga0105249_1227714913300009553Switchgrass RhizosphereMFYYVMTRGDVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVVAQLRERG
Ga0116102_107066733300009632PeatlandMFSFCSHGEMTRGDVLQYRLAFALSRAVKIVRGLRQGLTEAERY
Ga0116110_104564213300009643PeatlandMFSYHMTRGDVLRYRLAFALSRAAKIVRGLRQGLTEDERYAVADHVV
Ga0116110_114410723300009643PeatlandMTRGDMLRYRLAFALSRAAKIVRGLRQGLTEAERYAVARSPTS
Ga0116110_116862023300009643PeatlandMGSMTRGDVLRYSIAFALRRAAKIVRGLRQGLSEDERYAV
Ga0116215_146747423300009672Peatlands SoilMFLWGMTKGEVLRYAIAFALRRSRKIVRGLKEGLTEDERYAVADHAV
Ga0116224_1009197823300009683Peatlands SoilMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHAVARLKERGETLGD*
Ga0074044_1069762723300010343Bog Forest SoilMFTICSHKSMTRGDVLRYRIAFALSRAVKIVRGLRQGLTEAERYAVADHVVGQ
Ga0134128_1095450613300010373Terrestrial SoilMTRGDVLRYAIAYALIRARKVVRGLQQGLSEDERHAVAEHVVTQLTQGGDPW
Ga0126381_10371258023300010376Tropical Forest SoilMFLWRMTKGEVLRYAIAFPLRRSGKIVRGLKEGLTEDERYVV
Ga0134126_1063905313300010396Terrestrial SoilVAVTIVLFMFYWAMTKGEVLRYAIAFALRRSRKIIRGLKEGMTEEER
Ga0134122_1340755613300010400Terrestrial SoilVAVTIVLFMFYWAMTKGEVLRYAIAFALRRSRKIIRGLKEGMTEGERFAVADHTVSQLK
Ga0105246_1022795933300011119Miscanthus RhizosphereMTRGDVLRYAIAYALMRARKIVRGLKNGLTEEKGYAVADHVVEQ
Ga0164299_1014568813300012958SoilMFFWGMTKGEVLRYAIAFALRCSRKIIRCLKEGLTEEERFAVADQTVAQLKLRGAP
Ga0164301_1119991913300012960SoilMIRGEVLRYAIAFALRRSSKIIRGLKLGLTEEERYAGGRSHRRA
Ga0164305_1013274313300012989SoilMIRGEVLRYAIAFALRRSRKIIRGLKLGLTEEERYAG
Ga0164305_1030369913300012989SoilMVLYMFLLRMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERFAVADHT
Ga0181526_1106085713300014200BogMTRGDVLRYRIAFALMRARKIIRGLKEGLTESERYAVADHVVAQLKAR
Ga0157379_1095811423300014968Switchgrass RhizosphereMLRYAIAYALIRARKIVRGLKYGLTEEERFAVADAAV
Ga0132256_10161471013300015372Arabidopsis RhizosphereLFYHDKGEVLRYAIAFALRPPARSGLKEGLTEEERYAVADHAVAQLKDHGDPW
Ga0182041_1188789613300016294SoilMTRGEMLRYALAYALRRSRKIVRGLEQDLSEEERYAVADHVVAQL
Ga0182038_1188020113300016445SoilLSSKTMTKGEVLRYAIAFALRRSRGLKQGLTEEERYPVADRAIAQLKERGDPGC
Ga0187821_1036454713300017936Freshwater SedimentMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERCAVADHAVAQLKERGDPW
Ga0187780_1061712823300017973Tropical PeatlandMTKGEILRYAIAFALRRSRKIVRGLKEGLTEDERFAVADCTVAQLKER
Ga0187782_1016412613300017975Tropical PeatlandMTRGEVLRYKLAYALGRAVKIVRGLRQGLTEDERYAVADHVVSQLK
Ga0187822_1006724413300017994Freshwater SedimentMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVAQLKERGDPW
Ga0187816_1047513813300017995Freshwater SedimentMTKGEVLKYAIAFALRRSRKIIRGLKEGLTEAERYAVADHAVAQLKERGVPGPCRKLPG
Ga0187876_124013223300018003PeatlandMTRGDVLRYRLAFALIRAGKIVRGLRQGLTEDERYAVAD
Ga0187804_1035595313300018006Freshwater SedimentMTRGDVLRYAIAYALMRARRIVRGLKEGLTEDERYAVADHVVAQLKERGDP
Ga0187860_133539723300018014PeatlandMTRGDILRYSLAFALRRASKVVRGLRQGLSEDERYAVADHVVGQL
Ga0187855_1042889813300018038PeatlandMTRGDVLRYAIAFALRRARKIVRGLKEGLTEAEGFAVVGHVVSQLQDGGDPWG
Ga0187871_1073768213300018042PeatlandMTKGDILRYATAFALIPARKVVRGLKQELTDEDRFAVADHVVSQLKAHGDPWH
Ga0187770_1036530113300018090Tropical PeatlandMTRGDVLRYAIAFALMRARKIVRGLKQTLTEDERHAVAD
Ga0210407_1086985123300020579SoilMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERFAVADHTV
Ga0210403_1015890513300020580SoilMTKGDILRYAIAFALRRSHKIIRGLKEGLTEEERFAVADHAVAQLKDRGDP
Ga0210406_1004964133300021168SoilMTKGEVLKYAVAFALRRSRKIIRGLKQGLTEEERYAVADHAVAQLKERGDP
Ga0210405_1019571433300021171SoilLFPLCSIERMTKGEVLRYAIAFALRRSRKIIRGLKQGLTEEERYAVADHAVAQLK
Ga0210408_1109395313300021178SoilVLRYAIAFALRRSRKIIRGLKQGLTEEERYAVADHA
Ga0210396_1123982823300021180SoilMTKGEVLRYAIAFALRCSRKIIRGLKEGLTEEERFAVADQTVAQ
Ga0210393_1056154613300021401SoilMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERFAVADHTVAQLKQRGDPWRL
Ga0210397_1111560213300021403SoilMFFWGMTKGEVLRYAIAFALRCSRKIIRGLKEGLTEEERFAVADQ
Ga0210383_1109984523300021407SoilVLRYAIAFALRRSRKIVRGLKEGLTEDERYAVADHAVAQLKERG
Ga0210394_1107883323300021420SoilMTKGEVLRYAIAFALRRSRKIIRGLKQGLTEEERYEVA
Ga0210394_1127448713300021420SoilMTRGDVLRYNLAYALIRACKCVRGLRQALTERERYAVADHVVAQLK
Ga0210384_1061954613300021432SoilMTKGEVLRYAIAFALRCSRKIIRGLKEGLTEEERFAVADQ
Ga0210390_1088609633300021474SoilMTKGDILRYAIAFALRRSRKIIRGLKEGLTEEERFAVADRAVAQL
Ga0210390_1155246113300021474SoilMFFWGMTKGEVLRYAIAFALRCSRKIIRGLKEGLTEEERFAVADQTV
Ga0210410_1053238513300021479SoilMVLYMFLLRMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEER
Ga0210410_1060359723300021479SoilMFFWGMTKGEVLRYAIAFALRCSRKIIRGLKEGLTEEERFAVADQTVAQPNSVAT
Ga0210410_1176435423300021479SoilMFYWAMTKGEVLRYAIAFALRRSRKIIRGLKEGMTEEERFAVADHTVSQ
Ga0242651_103233023300022511SoilEVLRYAFALRRSRKIVRGLKEGLSEDERFAVADTLSTS
Ga0212123_1069429513300022557Iron-Sulfur Acid SpringMTRGDVLQYAIAFALRRARKNVRGLREALTEDERYAVADHVVTQA
Ga0212123_1077892913300022557Iron-Sulfur Acid SpringMTRGEVLKYRIAYALMRARKIVRGLKEGLTEDERYAVADHVVRQLKE
Ga0228598_109209413300024227RhizosphereMTKGDVLRYAIAFALRRSRKIVRGLKQGLTEEERYAVADHAV
Ga0209171_1055006113300025320Iron-Sulfur Acid SpringMTKGEVLRYAIAFALRRSRKIIRGLKEGLNEDERGDAVER
Ga0207663_1097410323300025916Corn, Switchgrass And Miscanthus RhizosphereMTRGEVLRYAIAFALRRSRKIIRGLKAGLTEEERYAVAD
Ga0207702_1152857023300026078Corn RhizosphereMTRAVVLRYAIAFALWRARKIIRGLKDGLTEEERYSVADHVVAQLKDR
Ga0209333_108904913300027676Forest SoilMTKGDILRYATAFALIPARKVVRGLKQELTDEDRFAVADHVVSQLKAHGDPWHL
Ga0209166_1057727113300027857Surface SoilMFSSDMTRGDVLRYRLAFALSRASKIVRGLRQGLSEVERYA
Ga0209169_1073804313300027879SoilMTRGDVLRDSIAFALRRARKIIRGLKEGLTEDERYAVAEHVVSQLKRAATLGVWARKRRLAAATQ
Ga0209068_1051574513300027894WatershedsMQDIAACAFVLYMFSWVMTRGEVLRYAIAYALRRSRKIIRGLKEGLTEEERYA
Ga0209067_1008403553300027898WatershedsMTRGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVSQ
Ga0209067_1055508113300027898WatershedsMTKGEVLRYAIAFALRRSRKIIRGLKQELTEEERYA
Ga0209006_1121420923300027908Forest SoilMFFWGMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERFAVADH
Ga0268264_1205714613300028381Switchgrass RhizosphereMFYYVMTRGDVLRYAIAFALRRARKIIRGLKDGLTEEERYAVADHVVAQLRERGD
Ga0308309_1052007813300028906SoilMTKGEVLRYAIAFALRRSRKIIRGLKQGLTEEERYAVADHAVAQLRECGDPWRL
Ga0170824_10161729433300031231Forest SoilMTKGEVLTYAIAFALRRSRKIIRGLKEGLTEEERYAVADHAVAQL
Ga0170818_10358789823300031474Forest SoilVLRYAIAFALRRSRKIIRGLKKGLTEAERHAVADHAVAQLKERGD
Ga0310686_10140761123300031708SoilMTKGEVLRYAIAFALRRSRKIIRGRKEGLTEEERYAVTDHTVAQ
Ga0310686_11287540123300031708SoilLTKGEVLKYAIAFALRHSRKIIRGLKEGRTEEERYAIADHAVAQLKERGDPRGG
Ga0310686_11543445123300031708SoilMTKGEVLRYAIAFALRRSSKIIPDLKEGLTEEERYAVADHAVAQLKE
Ga0307477_1023620133300031753Hardwood Forest SoilLFPLCSIERMTKGEVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHTVAQLKE
Ga0318546_1107107213300031771SoilMTGEEVLRCAIAFAPRRSRKIIRGLKQGLTEDERYAVADQTVALLKERGDT
Ga0306919_1044417923300031879SoilMTKGEILRYAIAFALRRSRKIIRGLKEGLTEDERFAVADHT
Ga0310913_1097834123300031945SoilMTGEEVLRCAIAFAPRRSRKIIRGLKEGLTEDERYTVADQTVALLKERGDT
Ga0310913_1131508323300031945SoilMTKGEVLRYAIAYALRRSRKIIRGLKEGLNEEERFAVADHAVAQLKE
Ga0306926_1218489713300031954SoilMTRGDVLRYAIAFALRRARKIIRGLKQGLTEEERYAVAD
Ga0311301_10047416133300032160Peatlands SoilVLRYAIAFALRRSRKIIRGLKEGLTEEERYAVADHAVARLKERGETLGD
Ga0306920_10211568113300032261SoilMTREEVLRCAIAFALRRSRKIIRGLKDGLTEDERYTVADQTVALLKE
Ga0306920_10341100313300032261SoilMTRGEVLRYRIAFALSRASKIVRGLKHGLSEGERCAVADHVVRQLNERGDP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.