NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F071069

Metagenome / Metatranscriptome Family F071069

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F071069
Family Type Metagenome / Metatranscriptome
Number of Sequences 122
Average Sequence Length 100 residues
Representative Sequence MKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN
Number of Associated Samples 93
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 67.77 %
% of genes near scaffold ends (potentially truncated) 27.05 %
% of genes from short scaffolds (< 2000 bps) 74.59 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.17

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (97.541 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(23.770 % of family members)
Environment Ontology (ENVO) Unclassified
(37.705 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(53.279 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136
1NODE_073916319
2C688J18823_100087174
3C688J18823_100934552
4C688J35102_1199770832
5JGI25614J43888_100092164
6JGI25613J43889_100280071
7JGI25406J46586_101442182
8P52013CM_10141643
9Ga0066677_100676112
10Ga0066673_100671741
11Ga0066673_106818201
12Ga0066684_101363962
13Ga0066678_104810262
14Ga0066678_105744571
15Ga0066671_101190803
16Ga0066671_104559392
17Ga0066675_103912162
18Ga0066675_104418221
19Ga0070703_100564012
20Ga0070714_1016761582
21Ga0070713_1011280531
22Ga0070705_1007731131
23Ga0066682_109176892
24Ga0066687_101323082
25Ga0070699_1000088604
26Ga0073909_101187701
27Ga0070741_1000891520
28Ga0070697_1000268714
29Ga0070695_1002758381
30Ga0066700_100822842
31Ga0066670_103925392
32Ga0066705_103308873
33Ga0066705_107321001
34Ga0066702_102893872
35Ga0066708_101190543
36Ga0066708_108745242
37Ga0066706_110677731
38Ga0081539_1000046210
39Ga0066658_108592831
40Ga0079220_101943342
41Ga0075433_102279693
42Ga0075433_115708921
43Ga0075425_1001498831
44Ga0075425_1002315793
45Ga0075425_1005893492
46Ga0075434_1011976622
47Ga0075426_1000139820
48Ga0075426_101471661
49Ga0075436_1000073998
50Ga0075436_1001743352
51Ga0079219_109799841
52Ga0075435_1009413942
53Ga0066710_1006998113
54Ga0066710_1014497331
55Ga0066709_1007603062
56Ga0066709_1015136851
57Ga0114129_102388863
58Ga0075423_132119441
59Ga0126321_13442722
60Ga0134067_100843903
61Ga0134066_101267641
62Ga0134128_102309862
63Ga0134126_103680532
64Ga0134122_116785162
65Ga0134121_102168002
66Ga0134121_116982152
67Ga0134123_134077082
68Ga0134123_136119982
69Ga0137399_100397396
70Ga0137399_100948792
71Ga0150985_1037164051
72Ga0150985_1060447381
73Ga0150985_1157732732
74Ga0150985_1207110762
75Ga0137366_103836141
76Ga0150984_1069059091
77Ga0150984_1070685301
78Ga0150984_1176150023
79Ga0137397_100113206
80Ga0137396_100221093
81Ga0137394_103665502
82Ga0137394_112453291
83Ga0137359_102887471
84Ga0137419_103977782
85Ga0137416_101191744
86Ga0137416_122344711
87Ga0137410_100113557
88Ga0137410_101214444
89Ga0137410_120274432
90Ga0164303_107131892
91Ga0137420_15011192
92Ga0137409_100567104
93Ga0066667_112438661
94Ga0066667_116078612
95Ga0066662_102178632
96Ga0066669_104548211
97Ga0066669_112708282
98Ga0137417_13128892
99Ga0207700_111033852
100Ga0207664_113716911
101Ga0207665_108697011
102Ga0209238_10648431
103Ga0209055_10164482
104Ga0209153_11819281
105Ga0209152_101191931
106Ga0209057_11240761
107Ga0209690_11503881
108Ga0209059_11151332
109Ga0209806_10179982
110Ga0209161_102440142
111Ga0209474_100293783
112Ga0209474_100598443
113Ga0209577_101715171
114Ga0209074_104940462
115Ga0209811_100656592
116Ga0137415_108261152
117Ga0255312_10235962
118Ga0307469_101771212
119Ga0307468_1002346141
120Ga0307473_100235244
121Ga0307470_100467702
122Ga0307472_1006774412
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 14.84%    β-sheet: 0.00%    Coil/Unstructured: 85.16%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100MKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKNSequenceα-helicesβ-strandsCoilSS Conf. scoreDisordered RegionsSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.17
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
97.5%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Surface Soil
Agricultural Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Ore Pile And Mine Drainage Contaminated Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Sandy Soil
Avena Fatua Rhizosphere
Tabebuia Heterophylla Rhizosphere
Populus Rhizosphere
Avena Fatua Rhizosphere
Sugar Cane Bagasse Incubating Bioreactor
14.8%5.7%23.8%9.8%4.1%4.9%6.6%3.3%10.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
NODE_0739163193300000156Sugar Cane Bagasse Incubating BioreactorMATETRQMKLLAVGIAISLAALAGCDRRPTTPLSPKTDSVSQAQQAGAGSSTTPANAGNPTTGDKREGANPVQGQVDPKHADQHRDFQNNEDGAGPRSSDTKPTMKN*
C688J18823_1000871743300001686SoilMKALTLGMAALIALAAGCDRRPNTPPSPKTDSVSQASPAGAGSTTTPANMGNPTTAEKRDGDNPVQGQVDPKHADQHRDFQNSGDAAGPKSSDTKPTMKN*
C688J18823_1009345523300001686SoilMKLFALGIAISLAAVAGCDRRPTTPPSPKTDSVSQARPAGAGSSTTPANAGIPTTGDKRDGANPVQGQVDPKHADQQRDFQSKSDGAGPRSSDTQPTIKN*
C688J35102_11997708323300002568SoilMKALTLATAAALLALAAGCDRRPTTPPSPKTDSVSQASPAAAGSTTTPANTGNPTNAEKRDGANPVQGQVDPKHADQHRDFQNSGDGAGPKSGDTQPTMKN*
JGI25614J43888_1000921643300002906Grasslands SoilMKXLTLALAVSLALVXGCDRKPTTPPSPKTDSSSVIPQAAAGSSTTPANXGNPTSAEKKEXANPVQGQVXPKAGAXXRDFQQRGDSAGPKSSDTQPTMKN*
JGI25613J43889_1002800713300002907Grasslands SoilAVSLALVGGCDRKPTTPPSPKTDSSSVIPQAAAGSSTTPANAGNPTSAEKKEDANPVQGQVDPKAGAQQRDFQQRGDSAGPKSSDTQPTMKN*
JGI25406J46586_1014421823300003203Tabebuia Heterophylla RhizosphereRHSSCSYGPGASKPEAVMKPLTLGLALALALTAGCDRRPQTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQQRDFQNRGDGAGPKSGDTQPTMKN*
P52013CM_101416433300003465Ore Pile And Mine Drainage Contaminated SoilMKLIAVVAALSLALAAGCDRNTSPTPSPKTDTSSRAPTAAAGSSTTPANLGTPTPAEKRDGSNPQQGQVDPKHADQHRDFQQRGDQAGPQSSDTKPTMKN*
Ga0066677_1006761123300005171SoilMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTVTPANAGNPTSAEKKDGANPVQGQVDPKHADQHRDFRNSGDAAGPRGSDTQPTMKN*
Ga0066673_1006717413300005175SoilMRLLTLGLALSFALVGGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKRDGTNPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN*
Ga0066673_1068182013300005175SoilMKFLTMAVAVSLALAAGCDRKPMTPPSPKTDSSSPIPQAAAGSTMTPANLGAPTTTEKKNGSNPVQGEIDPKHADQRRDFQQRGDGAGPQSSDTKPTMRN*
Ga0066684_1013639623300005179SoilMKILTLALAISLALAAGCDRKPMTPPSPKTDSSSVIPQAAAGSSTTPANMGNPTSAEKRDGANPVQGQVDPKAGAQHRDFQQSGDGAGPKSSDMQPTTKN*
Ga0066678_1048102623300005181SoilMKLLTLAVAISLALAAGCDRKPMTPPSPKTDSSSVIPQAAAGSSTTPANMGNPTSAEKRDGANPVQGQVDPKAGAQHRDFQQSGDGAGPKSSDMQPTTKN*
Ga0066678_1057445713300005181SoilMKFLTMALAISLALAAGCDRKPTTPPSPKTDSSSLVPQAVAGSSTTPANIGTPPAAEKKDGSNPVQGQVDPKHADQHRDFQSRGDAAGPQSSDTKPTMKN*
Ga0066671_1011908033300005184SoilMKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQN
Ga0066671_1045593923300005184SoilGCDRRPTTPPSPKTDSVSQANQAGAGSTATPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNNGQGAGPKSSDTQPTMKN*
Ga0066675_1039121623300005187SoilMKLLALGVAISLAAAAGCERRPTTPPSPKTDSVSQAQQAGAGSTMTPANAGNPTSAEKKQGGNPVQGQVDPKHADQHRDFRNGGDGAGPRGSDTQPTMKN*
Ga0066675_1044182213300005187SoilVKLLALGVAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTTPASAGNPTSAEKKDGANPVQGQVDPKHANQHRDFRNSGDGAGPRGNDTQPTMKN*
Ga0070703_1005640123300005406Corn, Switchgrass And Miscanthus RhizosphereMVHGAGPGLAVGRVMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0070714_10167615823300005435Agricultural SoilMKVLTLGLAIMLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSAEKQRGDNPVSQQVDPSHADQHRDFQNNNDAAGPRSNDTRPTMKN*
Ga0070713_10112805313300005436Corn, Switchgrass And Miscanthus RhizosphereMKVLTLGLAIMLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSGEKQRGDNPVSQQVDPSHADQHRDFQNNNDAAGPRSNDTRPTMKN*
Ga0070705_10077311313300005440Corn, Switchgrass And Miscanthus RhizosphereMKALTLGMAVLLALAAGCDRRPTTPPSPKTDSVSQASPAGAGSSTTPANTGNPTNGEKKDGANPVQGQVDPKHADQQRDFQSSGDGAGPRSNDTQPTMKN*
Ga0066682_1091768923300005450SoilMKLLTLAVAVSLALAAGCDRKPLTPPSPKTDSSSLIPQAAAGSSTTPANIGTPTTAEKKDGSNQHRDFQQRGDGAGPQSSDTKPTMKN*
Ga0066687_1013230823300005454SoilMKLLTLGLALSFALAGGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGDGTGPRSGDTRPTMKN*
Ga0070699_10000886043300005518Corn, Switchgrass And Miscanthus RhizosphereMRLPTLCFAIALALAAGCDRRPSAPPSPKTDSVSPVSPAAAGSTTTPANAGNPTSGEKKEGANPVQGQVDPKHADQHRDFQNSGDGAGPQSSDTKPTMKN*
Ga0073909_1011877013300005526Surface SoilMKALTLGMAVLLALAAGCDRRPTTPPSPKTDSVSQASPAGAGSSTTPANTGNPTNAEKKDGANPVQGQVDPKHADQQRDFQSSGDGAGPRSNDTQPTMKN*
Ga0070741_10008915203300005529Surface SoilMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANAGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0070697_10002687143300005536Corn, Switchgrass And Miscanthus RhizosphereMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0070695_10027583813300005545Corn, Switchgrass And Miscanthus RhizosphereTENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGERREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0066700_1008228423300005559SoilMKFLTMALAISLALAAGCDRKPTTPPSPKTDSSSLIPQAVAGSSTTPANIGAPTAAEKKDGSNQHRDFQSRGDAAGPQSSDTKPTMKN*
Ga0066670_1039253923300005560SoilALLHPGMATEDRQMKLLALGVAISLAAAAGCERRPTTPPSPKTDSVSQAQQAGAGSTMTPANAGNPTSAEKKQGGNPVQGQVDPKHADQHRDFRNGGDGAGPRGSDTQPTMKN*
Ga0066705_1033088733300005569SoilLALAAGCDRKPMTPPSPKTDSSSPISQAAAGSSTTPANIGTPTAGEKKDGSNPVQGQVDPKHADQHRDFQQKGDGAGPQSSDTKPTMKN*
Ga0066705_1073210013300005569SoilALSFALVGGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKRDGTNPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN*
Ga0066702_1028938723300005575SoilMKLLTLGLALSFALVGGCDRRPTTPPSPKTDSVSQANQAAAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN*
Ga0066708_1011905433300005576SoilVKLLALGVAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTTPANAGNPTSAEKKDGANPVQGQVDPKHANQHRDFRNSGDGAGPRGNDTQPTMKN*
Ga0066708_1087452423300005576SoilMKLLTTAVAISLALAAACDRKPTTPPSPKTDSSSPIPQAAAGSSTTPANTGSPTTAENKDCANPVQGQVDPMSGAQHRDFQQKGDGAGPQSSDTKPTMKN*
Ga0066706_1106777313300005598SoilEPMKLLTMALAISLALAAGCERKPTTPPSPKTDSSSPVPQAVAGSSTTPANLGAPTAAEKKNGSNPVQGQVDPKHADQHRDFQSRGDAAGPQSSDTKPTMKN*
Ga0081539_10000462103300005985Tabebuia Heterophylla RhizosphereMKPLTLGLALALALTAGCDRRPQTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQQRDFQNRGDGAGPKSGDTQPTMKN*
Ga0066658_1085928313300006794SoilMKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGERAGPKSGDTQPTMKN*
Ga0079220_1019433423300006806Agricultural SoilMKLLAIGIAISLVALAGCERRPTTPPSPKTDSVSQAQPAGAGSSTTPANMGAPTAGEKRDGDNPVSQQIDPRHADQHRDFQNNEDGAGPRSSDTKPTMKN*
Ga0075433_1022796933300006852Populus RhizosphereMKLLALGVAISLAAAAGCERRPMTPPSPKTDSVSQAPQAGAGSTATPANAGVPTAGEKRDGANPVQGQVDPKHADQHRDFRNDR*
Ga0075433_1157089213300006852Populus RhizosphereMATETAKMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTPANTGVPTTADKRDGVNPVQGQAFRKNEDGAGPRSSDTQPTTKN*
Ga0075425_10014988313300006854Populus RhizosphereMKLLALGVAISLAAAAGCERRPMTPPSPKTDSVSQAPQAGAGSTATPANAGVPTAGEKRDGANPVQGQVDPKHADQQRDFRNDR*
Ga0075425_10023157933300006854Populus RhizosphereMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTLNPVQGQVDPKQADQHRDFRNNEDGAGPRSSDTQPTTKN*
Ga0075425_10058934923300006854Populus RhizosphereMKVLTLGLAIALALATTACDRRPTTPPSPKTDSVSQAPQAGAGSTTTPANLGNPTSGEKQRGDNPVSQQVDPSHADQHRDFQNNNDGAGPRSNDTRPTMKN*
Ga0075434_10119766223300006871Populus RhizosphereMKLLALGVAISLAAAAGCERRPMTPPSPKTDSVSQAPQAGAGSTATPANAGVPTSGEKRDGANPVQGQVDPKHADQQRDFRNDR*
Ga0075426_10001398203300006903Populus RhizosphereMRLSTGCFAIALVLAAGCDRTPSTPPSPKTDSVSPVSPAAAGSTTTPANAGNPTSAEKKEGANPVQGQVDPKHADQHRDFQNSGDGAGPQSSDTKPTMKN*
Ga0075426_1014716613300006903Populus RhizosphereMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGERREGENPVQGQIDPKHADQHRDFQNNEDRAGPRS
Ga0075436_10000739983300006914Populus RhizosphereMKVLTLGLAIALALATTACDRRPTTPPSPKTDSVSQAPQAGAGSTTTQANLGNPTSGEKQRGDNPVSQQVDPSHADQHRDFQNNNDGAGPRSNDTRPTMKN*
Ga0075436_10017433523300006914Populus RhizosphereMVHGAGPGLALERVMPTENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGERREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0079219_1097998413300006954Agricultural SoilMKRLTLGLAVLLALAAGCDRRPTTPPSPKTDSVSQSSPAGAGSSTTPANIGAPTTAEKREGANTVQGQVDPKHADQQRDFQNS
Ga0075435_10094139423300007076Populus RhizosphereGAGPGLAVGRVMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0066710_10069981133300009012Grasslands SoilVKLLALGVAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTTPANAGNPTSAEKKDGANPVQGQVDPKHADQHRDFRNSGDGAGPRGNDTQPTMKN
Ga0066710_10144973313300009012Grasslands SoilMKFLTMALAISLALAAGCDRKPTTPPSPKTDSSSLIPQAVAGSSTTPANIGAPTAAEKKDGSNQHRDFQSRGDAAGPQSSDTKPTMKN
Ga0066709_10076030623300009137Grasslands SoilMATEDRPVKLLALGVAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTTPANAGNPTSAEKKDGANPVQGQVDPKHADQHRDFRNSGDGAGPRGNDTQPTMKN*
Ga0066709_10151368513300009137Grasslands SoilAGCDRKPTTPSPKIDSSSLVPQAAAGSSTTPANLGAPTAAEKKDGSNSVQGQVDPKHADQHRDFQSRGDGAGPQSSDTKPTMKN*
Ga0114129_1023888633300009147Populus RhizosphereMKLLTIALAVSLALAAGCDRKPMTPPSPKTDSSSPIPSAAAGSSTTPANLGTPTAAEKKDGSNPTQGQVDPKAGVQHRDFQHKGDGAGPQSSDTKPTTKD*
Ga0075423_1321194413300009162Populus RhizosphereMKVWTLGLAITLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSGEKQRGDNPVSQQVDPSHADQHRDFQNNNDGAGPRSNDTRPTMKN*
Ga0126321_134427223300010145SoilKPLTLGLALALALTAGCDRRPQTPPSPKTDSVSQANQAGAGSTTTPANTGNPTNAEKKDGANPVQGQVDPKHADQQRDFQNRGDGAGPKSGDTQPTMKN*
Ga0134067_1008439033300010321Grasslands SoilMKLLALGVAISLAAAAGCERRPTTPPSPKTDSVSQAQQAGAGSTMTPANAGNPTSAEKKQGGNPVQGQVDPKHADQHRDFRNSGDGAGPRGNDTQPTMKN*
Ga0134066_1012676413300010364Grasslands SoilMKLLTLAVAISLAALYGCERKATLPPSPKTDSVSQARPAGAGSSTTPANAGIPTTGDKRDGANPVQGQVDPKHADQQRDFQSKSDGAGPRSSDTQPTIKN*
Ga0134128_1023098623300010373Terrestrial SoilMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGERREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0134126_1036805323300010396Terrestrial SoilMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDGAGPRSSDTQPTMKN*
Ga0134122_1167851623300010400Terrestrial SoilMKTEITMKLCTLALAASIALVAGCDRNTTTPPSPKTDSSSVVPQAAAGSSTTPANMGAPTTAQKREGSNPTQGQVDPKHADQHRDFQQSGDDAGPKSSDTKPTMKN*
Ga0134121_1021680023300010401Terrestrial SoilMVHGAGPGLALERVMPTENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0134121_1169821523300010401Terrestrial SoilMKTEITMKLCTLALAASIALVAGCDRNTTTPPSPKTDSSSVVPQAAAGSSTTPANMGAPTTAQKREGSNPTQGQVDPKHADQHRDFQQSGDDAGPQSSDTKPTMKN*
Ga0134123_1340770823300010403Terrestrial SoilMVHGAGPGLALERVMATENRPMKLLTLAFAISLAAAAGCDRRPTTPPSPKTDSVSPAPQAGAGSSTSPANTGVPTTGEKREGENPVQGQIDPKHADQHRDFQN
Ga0134123_1361199823300010403Terrestrial SoilSRPTTPPSPKTDSVSPAPQAGAGSSTTPANTGVPTTGEEREGENPVQGQIDPKHADQHRDFQNNEDRAGPRSSDTQPTMKN*
Ga0137399_1003973963300012203Vadose Zone SoilAAGCDRRPTTPPSPKTDSSSMVPRAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSGDTKPTMKN*
Ga0137399_1009487923300012203Vadose Zone SoilMKLLALGVAVSLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTMPANPGIPTSREKKDGANPVQGQVDPKHADQQRDFPQRGRWRRPAQQ*
Ga0150985_10371640513300012212Avena Fatua RhizosphereMKLFALGIAISLAAVAGCDRRPTTPPSPKTDSVSQARPAGAGASTTPANAGIPTTGDKRDGANPVQGQVDPKHADQQRDFQSKSDGAGPRSSDTQPTIKN*
Ga0150985_10604473813300012212Avena Fatua RhizosphereMKAITLGMAALLALAAGCDRRPTAPPSPKTDSVSQASPAGAGSTTTPANAGNPTKAEKKDGENPVQGQVDPKHADQHRDFQNSGDGAGPRSTDTQPTMKN*
Ga0150985_11577327323300012212Avena Fatua RhizosphereAMRLLTLGLAVSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSSTTPANAGNPTNAEKKDGANPVQGQIDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN*
Ga0150985_12071107623300012212Avena Fatua RhizosphereVAAGFRHANNGDFMKALTLGMAALIALAAGCDRRPNTPPSPKTDSVSQASPAGAGSTTTPANMGNPTTAEKRDGDNPVQGQVDPKHADQHRDFQNSGDAAGPKSSDTKPTMKN*
Ga0137366_1038361413300012354Vadose Zone SoilMKFLTMALAISLALAAGCDRRPMTPPSPKTDSLSPVPQAAAGSSTTPANVGTPTTAQKRDGSNPVQGQVDPKSGAQHRDFQQRGDGAGPQSSDTKPTMKN*
Ga0150984_10690590913300012469Avena Fatua RhizosphereMKTEITMKLCTMALAVSLALVAGCNRNTTTPPSPKTDSSSVVPQAAAGSSTTPANLGTPTTAQRREGENPQQGHADPKHADQHHDFQQRGDDA*
Ga0150984_10706853013300012469Avena Fatua RhizosphereANNGDFMKALTLGMAALIALAAGCDRRPNTPPSPKTDSVSQASPAGAGSTTTPANMGNPTTAEKRDGDNPVQGQVDPKHADQHRDFQNSGDAAGPKSSDTKPTMKN*
Ga0150984_11761500233300012469Avena Fatua RhizosphereTRMAKEDRPMKLFALGIAISLAAVAGCDRRPTTPPSPKTDSVSQARPAGAGSSTTPANAGIPTTGDKRDGANPVQGQVDPKHADQQRDFQSKSDGAGPRSSDTQPTIKN*
Ga0137397_1001132063300012685Vadose Zone SoilMEQPMKLRTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPQAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSSDTKPTMKN*
Ga0137396_1002210933300012918Vadose Zone SoilMKLLALGVAVSLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTTMPANPGIPTSREKKDGANPVQGQVDPKQADQQRDFRNAGDGAGLRSSDTQPTTKN*
Ga0137394_1036655023300012922Vadose Zone SoilMETETHMKLCTLALAISLALVAGCNRNTTTPPSPKTDSSSVVPQAAAGSSTTPANMGAPTAAQKREGANPTQGQVDPKHADQQRDFQQSGDDAGPKSGDSKPTIKN*
Ga0137394_1124532913300012922Vadose Zone SoilTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPQAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSSDTKPTMKN*
Ga0137359_1028874713300012923Vadose Zone SoilMETETHMKLCTLALAISLALVAGCNRNTTTPPSPKTDSSSVVPQAAAGSSTTPANMGAPTAAQKREGANPTQGQVDPKHAGQQRDFQQSGDDAGPKSSDTKPTIKN*
Ga0137419_1039777823300012925Vadose Zone SoilMEQPMKLLTMAVAVSLALAAGCERRPTTPPSPKTDSSSMVPQAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSGDTKPTMKN*
Ga0137416_1011917443300012927Vadose Zone SoilMEQPMKLLTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPRAAAGSSATPSNAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAG
Ga0137416_1223447113300012927Vadose Zone SoilMKLLTLALAVSLALVAGCDRKPTTPPSPKTDSSSVIPQAAAGSSTTPANVGNPTSAEKKEGANPVQGQVDPKAGAQRRDFQQRGDSTGPKSSDTQPTMKN*
Ga0137410_1001135573300012944Vadose Zone SoilMKFWPMAVAVSLALVAGCDRRPTTPPSPKTDISSATPTTAEKREGSNPVQGQVDPKSGAQHRDFQQRGEGAGPSSSDTKPTMKN*
Ga0137410_1012144443300012944Vadose Zone SoilMEQPMKLLTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPRAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSSDTKPTMKN*
Ga0137410_1202744323300012944Vadose Zone SoilMKLLRLALAVSLALVAGCDRKPTTTPSPKTDSSSVIPQAAAGSSTTPANVGNPTSAEKKEGANPVQGQVDPRVGAQRRDFQQRGDSTGPKSSDTQPTMKN*
Ga0164303_1071318923300012957SoilVLLALAAGCDRRPTTPPSPKTDSVSQASPAGAGSSTTPANTGNPTNAEKKDGANPVQGQVDPKHADQQRDFQSSGDGAGPRSNDTQPTMKN*
Ga0137420_150111923300015054Vadose Zone SoilMKLLALGVAVSLAAAGCERRPTTPPSPKTDSVSQAQQAGAGSSTTPANTGIPTNREKKDGANPVQGQVDPKQADQQRDFRNAGDGAGLRSSDTQPTTRN*
Ga0137409_1005671043300015245Vadose Zone SoilMEQPMKLLTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPRAAAGSSATPANAGTPTTAEKREGSNPMQGEIDPKSGAQHRDFQQKGDAAGPQSSDTKPTMKN*
Ga0066667_1124386613300018433Grasslands SoilQALTLGMAAAFLALAVGCDRRPTTPPSPRTDSVSQASPAAAGSTTTPANAGNPTNAEKRDGANPVQGQVDPKHADQHRDFQNSGDGAGPKSGDTQPTMKN
Ga0066667_1160786123300018433Grasslands SoilMKALTLAMAAALLALAAGCDRRPNTPPSPKTDSVSQASPAGAGSTTTPANMGNPTTTEKRDGENPVQGQVDPKHADQHRDF
Ga0066662_1021786323300018468Grasslands SoilMKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN
Ga0066669_1045482113300018482Grasslands SoilMRLLTLGLALSFALVGGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKRDGTNPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN
Ga0066669_1127082823300018482Grasslands SoilMKLLTLALAVSLALVAGCDRKPTTPPSPKTDSSSAIPQAAAGSSTTPANAGNPTSAEKKDGANPVQGQVDPKAGAQHRDFQQGGDGAGPRSSDTQPTTKN
Ga0137417_131288923300024330Vadose Zone SoilMEQPMKLLTMAVAISLALAAGCDRRPTTPPSPKTDSSSMVPRAAAGSSATPANAGTPTTAEKREGSNPVQGEIDPKSGAQHRDFQQKGDAAGPQSSDTKPTMKN
Ga0207700_1110338523300025928Corn, Switchgrass And Miscanthus RhizosphereMKVLTLGLAIMLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSGEKQRGDNPVCQQVDPSHADQHRDFQNNNDAAGPRSNDTRPTMKN
Ga0207664_1137169113300025929Agricultural SoilMKVLTLGLAIMLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSAEKQRGDNPVSQQVDPSHADQHRDFQNNNDAAGPRSNDTRPTMKN
Ga0207665_1086970113300025939Corn, Switchgrass And Miscanthus RhizosphereMKVLTLGLAIMLALATTACDRRPTTPPSPKTDNVSQAPQAGAGSSTTPANLGNPTSAEKARGDNPVSQQIDPSHADQHRDFQNNNDAAGPRSSDTRPTMKN
Ga0209238_106484313300026301Grasslands SoilPEIAMKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGDGTGPRSGDTRPTMKN
Ga0209055_101644823300026309SoilMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTVTPANAGNPTSAEKKDGANPVQGQVDPKHADQHRDFRNSGDAAGPRGSDTQPTMKN
Ga0209153_118192813300026312SoilMKLLTLGLALSFALVAGCDRRPTTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGDGTGPRSGDTRPTMKN
Ga0209152_1011919313300026325SoilAIAAALVAGCDRNTSTTPSPKTDTTSAIPQAAAGGSATPANAGTPSKAERREGENPQQGQVDPKHSEQHRDFQSDADGKGPRSSDTQPTIKN
Ga0209057_112407613300026342SoilMKLLALGVAISLAAAAGCERRPTTPPSPKTDSVSQAQQAGAGSTMTPANAGNPTSAEKKQGGNPVQGQVDPKHADQHRDFRNGGDGAGPRGSDTQPTMKN
Ga0209690_115038813300026524SoilVVGRTRFTARMVHAQAGPSFAGFHRALTGEPMKLLTLAVAVSLALAAGCDRKPLTPPSPKTDSSSLLPQAAAGSSTTPANIGTPTTAEKKDGSNPVQGQVDPKHADQQRDFQSRGDGAGPLSSDTKPTMRN
Ga0209059_111513323300026527SoilMKLLTLGLAVSFALVGGCDRRPTTPPSPKTDSVSQANQAAAGSTTTPANAGNPTNAEKKDGANPVKGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN
Ga0209806_101799823300026529SoilMATEDRQMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQQAGAGSTVTPANAGNPTSAEKKDGANPVQGQVDPKHADQHRDFRNSGDAAGPRGSDTQPTMKN
Ga0209161_1024401423300026548SoilMKLLTMALAISLALAAGCERKPTTPPSPKTDSSSPVPQAVAGSSTTPANLGAPTAAEKKNGSNPVQGQVDPKHADQHRDFQSRGDAAGPQSSDTKPTMKN
Ga0209474_1002937833300026550SoilMKLLALGVAISLAALAGCERRPTTPPSPKTDSVSQAQQAGAGSTMTPANAGNPTSAEKKQGGNPVQGQVDPKHADQHRDFRNGGDGAGPRGSDTQPTMKN
Ga0209474_1005984433300026550SoilMKPFTLGLALALALTAGCDRRPQTPPSPKTDSVSQANQAGAGSTTTPANAGNPTNGEKKDGANPVQGQVDPKHADQHRDFQNRGDGAGPKSGDTQPTMKN
Ga0209577_1017151713300026552SoilRRPTTPPSPKTDSVSQANQAAAGSTTTPANAGNPTNAEKKDGANPVQGQVDPKHADQHRDFQNSGQGAGPKSSDTQPTMKN
Ga0209074_1049404623300027787Agricultural SoilAISLVALAGCERRPTTPPSPKTDSVSQAQPAGAGSSTTPANMGAPTAGEKRDGGNPVSQQIDPRHADQHRDFQNNEDGAGPRSSDTKPTMKN
Ga0209811_1006565923300027821Surface SoilMKALTLGMAVLLALAAGCDRRPTTPPSPKTDSVSQASPAGAGSSTTPANTGNPTNAEKKDGANPVQGQVDPKHADQQRDFQSSGDGAGPRSNDTQPTMKN
Ga0137415_1082611523300028536Vadose Zone SoilMKLLALGVAVSLAAAAGCERRPTTPPSPKTDSVSQAQQAGAGSTTMPANPGIPTSREKKDGANPVQGQVDPKHADQQRDFPQRGRWRRPAQQ
(restricted) Ga0255312_102359623300031248Sandy SoilMERTMKLLITALAVTLALVAGCNRNTTTPPSPKTDTSSIVPQAAAGSSSTPANAGTPSTAEKREGSNPVQGQVDPKDANQQRDFKSDDTKPTTKN
Ga0307469_1017712123300031720Hardwood Forest SoilMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTPANTGVPTTADKRDGGDGANPVQGQVDPKQADQHRDFRNNEDGAGPRSSDTQPTTKN
Ga0307468_10023461413300031740Hardwood Forest SoilMQTETDMKLCTLALAISLALVAGCNRNTTTPPSPKTDSSSVVPRAAAGSSTTPANTAAPTAAQKREGSNPTQGQVDPKEPNQHRDFQSSDTTKN
Ga0307473_1002352443300031820Hardwood Forest SoilMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTPANTGVPTTADKRDGANPVQGQADPKQADQHRDFRNNEDGAGPHSSDTQPTTKN
Ga0307470_1004677023300032174Hardwood Forest SoilMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTPANTGVPTTADKRDGANPVQGQADPKQADQHRDFRNNEDGAGPRSSDTQPTTKN
Ga0307472_10067744123300032205Hardwood Forest SoilMATETAKMKLLALGIAISLAAAAGCDRRPTTPPSPKTDSVSQAQPAAAGSSTTPANAGIPTTGEKRQGANPVQGQVDPQQADQHRDFRSNEDGAGPRSSDTQPTTKN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.