NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F075386

Metagenome / Metatranscriptome Family F075386

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F075386
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 109 residues
Representative Sequence MWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESI
Number of Associated Samples 91
Number of Associated Scaffolds 117

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 77.78 %
% of genes near scaffold ends (potentially truncated) 36.97 %
% of genes from short scaffolds (< 2000 bps) 79.83 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (51.261 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(16.807 % of family members)
Environment Ontology (ENVO) Unclassified
(21.008 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.941 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150
1A5_c1_00012260
2JGI12630J15595_100243112
3JGI12630J15595_100308362
4JGIcombinedJ26739_1000916166
5JGIcombinedJ26739_1008401101
6JGIcombinedJ26739_1011148311
7JGIcombinedJ26739_1014188801
8Ga0063356_1021999261
9Ga0066673_100497393
10Ga0066673_101168153
11Ga0066684_104410862
12Ga0066678_107497301
13Ga0066671_100826731
14Ga0070709_101202873
15Ga0070709_101389411
16Ga0070709_101879391
17Ga0070714_1003533322
18Ga0070714_1016440771
19Ga0070713_1002225953
20Ga0070708_1000180745
21Ga0066687_101198113
22Ga0070697_1016815421
23Ga0066707_104356651
24Ga0066699_101746333
25Ga0066705_106474422
26Ga0066708_103104913
27Ga0066790_100082554
28Ga0066790_100082556
29Ga0075023_1005699961
30Ga0075029_1007289541
31Ga0075017_1011235571
32Ga0075015_1004478431
33Ga0070715_100745031
34Ga0075018_101377223
35Ga0070716_1006118482
36Ga0075014_1006921522
37Ga0066659_109150141
38Ga0079220_100960463
39Ga0079219_102613872
40Ga0099794_102405902
41Ga0066710_1018272612
42Ga0099830_100667473
43Ga0137393_100167792
44Ga0137389_114761951
45Ga0137388_108880301
46Ga0137399_101854301
47Ga0137360_104098491
48Ga0137410_116583041
49Ga0134110_104198872
50Ga0137412_103222972
51Ga0187825_101028222
52Ga0187822_103533451
53Ga0066667_122033861
54Ga0193715_10320802
55Ga0193707_10018755
56Ga0193707_10065434
57Ga0193747_10260852
58Ga0193747_10805872
59Ga0193729_11012871
60Ga0193728_10101166
61Ga0193718_11195791
62Ga0193731_10338391
63Ga0193732_10036261
64Ga0210407_106998911
65Ga0210407_113768501
66Ga0210403_105503892
67Ga0210399_110797411
68Ga0210401_100143387
69Ga0210401_102622982
70Ga0210401_112782481
71Ga0179596_104668931
72Ga0210404_106857451
73Ga0210404_107978221
74Ga0179584_13982932
75Ga0210408_106014862
76Ga0210408_114100981
77Ga0193719_100560132
78Ga0193719_101337321
79Ga0210386_112391541
80Ga0210383_116074351
81Ga0210394_103120861
82Ga0210384_1000320615
83Ga0210384_103509321
84Ga0210384_103541402
85Ga0210391_111030631
86Ga0210410_104570591
87Ga0207685_100348712
88Ga0207665_105833021
89Ga0207689_109772932
90Ga0209839_100232062
91Ga0209839_100232064
92Ga0209155_10618451
93Ga0209155_11488042
94Ga0209687_10998042
95Ga0209473_10402233
96Ga0209808_12428011
97Ga0209805_13773992
98Ga0209117_11987341
99Ga0209217_100006616
100Ga0209217_10825782
101Ga0209009_10658962
102Ga0209283_100775953
103Ga0209488_100769553
104Ga0209006_100824523
105Ga0209698_103228742
106Ga0209526_100122928
107Ga0209526_100159472
108Ga0209526_101072594
109Ga0209526_103479481
110Ga0209526_103611931
111Ga0257175_10513052
112Ga0170824_1288188691
113Ga0308194_102755102
114Ga0307473_100734263
115Ga0307479_101052734
116Ga0307479_109819441
117Ga0307479_114099472
118Ga0307471_1009894281
119Ga0307472_1015251371
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 52.71%    β-sheet: 0.00%    Coil/Unstructured: 47.29%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100MWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESISequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.48
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
51.3%48.7%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater Sediment
Watersheds
Soil
Vadose Zone Soil
Grasslands Soil
Agricultural Soil
Soil
Soil
Grasslands Soil
Soil
Forest Soil
Hardwood Forest Soil
Soil
Soil
Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Agricultural Soil
Arabidopsis Thaliana Rhizosphere
Miscanthus Rhizosphere
5.9%10.9%10.9%10.9%16.8%5.0%3.4%3.4%13.4%8.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
A5_c1_000122602124908044SoilMWKKEDGVGPQPTLTVTMYTEAMNKFTKSATAFMEHVHLLTEARDAYEEAITASTALRNSLDAGDQTLRSLITKLEQVVSTHFGEPFPDKKKPEPMRVE
JGI12630J15595_1002431123300001545Forest SoilMWKKEDSVTTQPIPTMAMYTEAMNKFTKSAKDFMEHVHLLTEARDAYQEAVTASKALRNSLDAGDQTLRSLMTQLEQVVNVHLGEPTPDRKKPELVKTEASRVNSDSVGVVRTFLP*
JGI12630J15595_1003083623300001545Forest SoilMSDAMWKKEDGMGTQLTPTWAIYAEAMNRFTKSATAFIEHAHLLTEARDAYQEAMAASTALRKGLDAGDHTLRSLRAQLAQVVYDHLDQPALDRKKPELVRVESTKAKNEGTGTARMFP*
JGIcombinedJ26739_10009161663300002245Forest SoilMNQATWKKEEGEGARLTPSWAMYAEAMDRFTKSATAFMEHVHLLTEARTAYEEAMTASAALRSRLDAGDQTLRSLREQLARVVNDHLDEPTLDRKKPELLTGSGGWKTFP*
JGIcombinedJ26739_10084011013300002245Forest SoilMSEAMWRKETGVSTPPKPTMAMYTDAMNKFTKSATAFMEHVPLLTEARDAYQTAISASTALRNSLDAGDQALRSLMSQLEQVVSTHMSEPVPDRKRPELVKAEPIRTNGASTATSG
JGIcombinedJ26739_10111483113300002245Forest SoilMGESMWRKEDGVSTPPMPTMATYTDAMNKFTKSATAFMEHVHLLTEARDAYQTAISASTALRKSLDAGDQALRSLMSQLEQVVSTHMGEPVPDRKRPELVKAEPIKTNGESTATSGK
JGIcombinedJ26739_10141888013300002245Forest SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGA
Ga0063356_10219992613300004463Arabidopsis Thaliana RhizosphereMWKKEDGLAPQPTLTVTMYTEAMNKFTKSATAFMEQVHFLTEARDAYEEAMTASTELRNSLDAGDQTLRSLMTQLEQVVNTHFGGPALDKKKPESMKVEAAG*
Ga0066673_1004973933300005175SoilTEDGVSNQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP*
Ga0066673_1011681533300005175SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRNSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGESTGTITRALP*
Ga0066684_1044108623300005179SoilMNEAMLKTEDGVSTQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP*
Ga0066678_1074973013300005181SoilMNDAMWKKEDSVGTQLTPTWAIYADAMNRFTQSATAFMEHVHLLTEAREAYEEAIKASMALRNSLDSGDQTLRSLRSQLARVVNDHLDEPAFDRKKPELLKSNGAKAFP*
Ga0066671_1008267313300005184SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRSSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGESTGTI
Ga0070709_1012028733300005434Corn, Switchgrass And Miscanthus RhizosphereMSAAIWKREDGVSPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPDKKRPEPVKAEATG*
Ga0070709_1013894113300005434Corn, Switchgrass And Miscanthus RhizosphereMNDAMWKKEDSVGTQLTPNWAIYADAMNRFTQSATAFMEHVHLLTEAREAYEEAIKASMALRNSLDSGDQTLRSLRSQLARVVNDHLDEPAFDRKKPELLKSNGAKAFP*
Ga0070709_1018793913300005434Corn, Switchgrass And Miscanthus RhizosphereMNAAMWKKEDGVPAQPTPTLATYTEAMNKFTKSATAFMEHVHLLTEAQEAYREAMNASAAMRNSLDAGDKTLRGLMTQLEQVVSDHLGEPPLEKKKPESSKVEPIRVNGDGVVNTPFP*
Ga0070714_10035333223300005435Agricultural SoilMWKKEDGPGPQSTLAVTTYAEAMNKFTKSATAFMEHVHLLTEARDAYQQAMTASAALRNTLDAGDETLRSLILQLEQVVSTHLGEPSLEHKSNDAAKSESSRAIKEITAA*
Ga0070714_10164407713300005435Agricultural SoilMNAAMWKKEDGVPAQPTPTLATYTEAMNKFTKSATAFMEHVHLLTEAQEAYREAMNASAAMRNSLDAGDKTLRGLMTQLEQVVSDHLGEPPLEKKKPESSKVEPIRVNGDGMVNTPFP*
Ga0070713_10022259533300005436Corn, Switchgrass And Miscanthus RhizosphereEAMNKFTKSATAFMEHVHLLTEARDAYQQAMTASAALRNTLDAGDETLRSLILQLEQVVSTHLGEPSLEHKSNDAAKSESSRAIKEITAA*
Ga0070708_10001807453300005445Corn, Switchgrass And Miscanthus RhizosphereMSAAMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGMYP*
Ga0066687_1011981133300005454SoilMNEVMLKTEDGVSTQVAPTWTLYADAMNRFTKSATAFMEHVHLLTEARDAYEEAMRASTALRNSLDAGDQTLRSLRTQLARVINDHLDQPAFDRKKPELLKSTGAGKAFP*
Ga0070697_10168154213300005536Corn, Switchgrass And Miscanthus RhizosphereMSAAIWKREDGVNPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPEKKRPEPVKAEATG*
Ga0066707_1043566513300005556SoilKREDGVSPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMTASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPDKKRPEPVKAEATG*
Ga0066699_1017463333300005561SoilMNEAMLKTEDGVSNQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP*
Ga0066705_1064744223300005569SoilMSAAIWKREDGVSPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAP
Ga0066708_1031049133300005576SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRNSLDAGDQALRSLMTQMEQVINAHLSEAALDKK
Ga0066790_1000825543300005995SoilMWKKDDVVGSQPTLTVSMYTEAMNKFTKSATAFMEQVHLLTEARDAYEEAMTASTALRNSLDAGDHTLRSLMTQLEQVVNTHFAEPVPDKKKPEAMKVEATG*
Ga0066790_1000825563300005995SoilMNGAMWKKEDGVGAELTPIWTRYAEAMFRFSKSATAFMGHVHLLTEARAAYLEAMTASTALRNSLDAGDKTLRSLRAQLAQVVNDRLDESTLARKKPELLKNIASAKVFP*
Ga0075023_10056999613300006041WatershedsATMWKKEDGAGAQPTTLTMYTEAMNKFTKSASVFMEQVHLLTEARDAYEEAMRASTALRNSLDAGDQTLRSLITQLEQVVNTHFGEPGPDKKNPEPMKIEATG*
Ga0075029_10072895413300006052WatershedsMSAAMWKKEEGISPQPTSTLATYTEAMNKFTHASTAFMEHVHLLTEAREAYQEAMNASAALRNSLDAGDKSLRGLMTQLEQVVNAHLGDPNLDRKKPEGIRVEPIRGNGDSMGVVRTTSLP*
Ga0075017_10112355713300006059WatershedsMQPTPTLAMYTEAVNKFTRSASAFMQHVHLLTEARDAYREAMTASTMLRRSLDAGDQTLRSLMTQLEQVVNEHFGEPALDKKKPE
Ga0075015_10044784313300006102WatershedsMWKKEEGVNTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEAREAYLEAMNASAALRNSLDAGDKTLRSLMGQLERVVTDHLGEPPLDKKKPEPTRIESIRANSDAMGMLRTPLP*
Ga0070715_1007450313300006163Corn, Switchgrass And Miscanthus RhizosphereMWKKEDGPGPQSTLAVTTYAEAMNKFTKSATAFMEHVHLLTEARDAYQQAMTASAALRNTLDAGDETLRSLILQLEQVVSTHLGEPSLEHKSNDATKSESSRAIK
Ga0075018_1013772233300006172WatershedsMWKKEDNVGMQPTPTLAMYTEAVNKFTRSASAFMQHVHLLTEARDAYREAMTASTMLRRSLDAGDQTLRSLMTQLEQVVNEHFGEPALDKKKPELVKDATRVDSANIGGGRTSIP*
Ga0070716_10061184823300006173Corn, Switchgrass And Miscanthus RhizosphereMSAAIWKREDGVSPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPEKKRPEPVKAEATG*
Ga0075014_10069215223300006174WatershedsMKSENGADPQSTPAVTVYTEAMNKFTTSATAYMEQVQLLTEAGDAYQEAMAASNALRNNLDASDQTLQSLMTQLEQVVNTHLSE
Ga0066659_1091501413300006797SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRSSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGESTATITRALP*
Ga0079220_1009604633300006806Agricultural SoilMSAAMWKKEDGVPAQPTPTLATYTEAMNKFTKSATAFMEHVHLLTEAQEAYREAMNASAAMRNSLDAGDKTLRGLMTQLEQVVSDHLGEPPLEKKKPESSKVEPIRVNGDGMVNTPFP*
Ga0079219_1026138723300006954Agricultural SoilMNAAMWKEEDGVPAQPTPTLATYTEAMNKFTKSATAFMEHVHLLTEAQEAYREAMNASAAMRNSLDAGDKTLRGLMTQLEQVVSDHLGEPPLEKKKPESSKVEPIRVNGDGMVNTPFP*
Ga0099794_1024059023300007265Vadose Zone SoilMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGMYP*
Ga0066710_10182726123300009012Grasslands SoilMSTQPTPTLAMYTEAMNKFTKCATAFMEHVHLLTEARIAYEEAMTSSRALRNSLDAASQALRCLMTQMKQVINAHLSEAALDKKRPELVKVEPTRTNGESTGTITRALP
Ga0099830_1006674733300009088Vadose Zone SoilMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPALDKRKPELVKAESIREKNEGTGTGGMYP*
Ga0137393_1001677923300011271Vadose Zone SoilMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGIYP*
Ga0137389_1147619513300012096Vadose Zone SoilAAMWKKEDGVGPQPTPTVMIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASMALRNSLDAGDETLRSLMTQLEQVVNDHLGEPVLDKKKPELVKAESTRAKNEGTGTSGMFP*
Ga0137388_1088803013300012189Vadose Zone SoilEAMDKFTKSATAFMEHVHLLNEARDAYQEAVSASSTIRRSLDASDQALRSLMTQLEQVVNDHLGEPALERKKLELVKAEATEQAARTPAAT*
Ga0137399_1018543013300012203Vadose Zone SoilEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGMYP*
Ga0137360_1040984913300012361Vadose Zone SoilMSAAMWKREDGVNTPPAPTLAMYTEAMNKFTKAAEAFMEHVHLLTEAREAYQEAMSSSAALRSSLDAGDKTLRSLMLQLEQVVSAHLGEPPVDKKKSEPTKVEPIRANNESVGVVRTSFP
Ga0137410_1165830413300012944Vadose Zone SoilMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESI
Ga0134110_1041988723300012975Grasslands SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRNSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGEST
Ga0137412_1032229723300015242Vadose Zone SoilMTAAIWKREEGVPQAAPTMTMYTEAMNKFTKSATAFMEQVHLLTEAREAYEEAISASTALRKSLDAGDQTLRSLMTQLEQVVTTHFAEPHPDKKRPEIVRSEATRAANEGNGSVGTMLP*
Ga0187825_1010282223300017930Freshwater SedimentMSAAMWKKEDSVSAPPAPTLATYTEAMNKFTKAATAFMEHVHLLTEAREAYQEAMSSSAALRSSLDAGDKTLRSLMIQLEQVVNDHVGEPPVDRKKPEPTKVEPIRTNNDSAAAVRTTSF
Ga0187822_1035334513300017994Freshwater SedimentMSAAMWKKEDSVSAPPAPTLATYTEAMNKFTKAATAFMEHVHLLTEAREAYQEAMSSSAALRSSLDAGDKTLRSLMIQLEQVVNDHVGEPPVDRKKPEPTKVEPIRTNNDSAAA
Ga0066667_1220338613300018433Grasslands SoilMNEAMLKTEDGVSNQVAPTWPMYADAMNRFTNSATAFMEHVQLLTEARDAYEEAMRASTVLRHSLDAGDQTLRSLRSQLARVINAHLDQPTLDKKKPELLKSTGAAKAFP
Ga0193715_103208023300019878SoilMSAAVWKKEDGVGLQPTPTVTMYTEAMNKFTKSATAFMDQVHILTEARDAYQEAMAASTALRERLDAGDQTLRSLMTQLEQVVSAHLGEHARDRKRPEPVKVEANGTNGENTDFARTFL
Ga0193707_100187553300019881SoilMSAAVWKKEDGVGLQPTPTVTMYTEAMNKFTKSATAFMEQAHILTEARDAYQEAMAASTALRERLDAGDQTLRSLMTQLEQVVSAHLGEHVRDRKRPEPVKVEANGTNGENTDFARTFL
Ga0193707_100654343300019881SoilMGSQPPPTVTMYTEAMNKFTKSATAFMDQVHLLTEARDAYQEAIAASTALRNSLDAGDETLRSLMNQLEQVVNAHLGDPIPDKKRPELMKVQATG
Ga0193747_102608523300019885SoilMGSQPPPTVTMYTEAMNKFTKSATAFMDQVHLLTEARDAYQEAIAASTALRNSLDAGDETLRSLMNQLEQVVNAHLGDPIPDRKRPELMKVQATG
Ga0193747_108058723300019885SoilMSAAIWKREDGVGPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPDKKRPEPVKAEATG
Ga0193729_110128713300019887SoilMNDAMWKKEDSVGTQLTPNWAIYADAMNRFTQAATAFMEHVHLLTEAREAYEEAIKASMALRNSLDSGDQTLRSLRSQLARVVNDHLDEPALDRKKPELLKSNGGKAFP
Ga0193728_101011663300019890SoilMSAAIWKREDGVGPQPTPTVTMYTYEINKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPDKKRPEPVKAEATG
Ga0193718_111957913300019999SoilMSAAVWKKEDGVGLQPTPTVTMYTEAMNKFTKSATAFMDQVHILTEARDAYQEAMAASTALRERLDAGDQTLRSLMTQLEQVVSAHLGEHVRDRKRP
Ga0193731_103383913300020001SoilLIMLALVAQEGELRNPMSAAVWKKEDGVGLQPTPTVTMYTEAMNKFTKSATAFMDQVHILTEARDAYQEAMAASTALRERLDAGDQTLRSLMTQLEQVVSAHLGEHARDRKRPEPVKVEANGTNGENTDFARTFL
Ga0193732_100362613300020012SoilEAMNKFTKSATAFMDQVHLLTEARDAYQEAIAASTALRNSLDAGDETLRSLMNQLEQVVNAHLGDPIPDRKRPELMKVQATG
Ga0210407_1069989113300020579SoilPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSLP
Ga0210407_1137685013300020579SoilMWKKEDDVGPQPTLTVTMYIEAMNKFTKSASAFIEQVHLLTEARDAYEEATRASTALRNSLDANDQTLRSLITQLEQVVNTHFGEPIP
Ga0210403_1055038923300020580SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMIASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSL
Ga0210399_1107974113300020581SoilVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSLP
Ga0210401_1001433873300020583SoilMWKKEDGASPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSL
Ga0210401_1026229823300020583SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNID
Ga0210401_1127824813300020583SoilMWKETTWKKADALSSQPTPTMAMYTEAMDKFTKSATAFMEHVHLLNEARDAYHEAVSASSTIRRSLDASDQALRSLMTQLEQVVNDHLGEPALERKKLELVKAEATRTSGENTSNVSKLP
Ga0179596_1046689313300021086Vadose Zone SoilMSAAMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGMYP
Ga0210404_1068574513300021088SoilMWKKEDGVSTPPAPTMAVYTEAMNNFTKSATAFMEHVHLLTEARDAYQTAMTASTALRDSLDAGDQALRTLMTQLEQVVGVHLGEPALDKKKPEAVKADAIRTNV
Ga0210404_1079782213300021088SoilMWKKEDDVGPQPTLTVTMYIEAMNKFTKSASAFIEQVHLLTEARDAYEEATRASTALRNSLDANDQTLRSLITQLEQVVNTHFGEPIPDKKKPEPMKLEETG
Ga0179584_139829323300021151Vadose Zone SoilVPQAAPTMTMYTEAMNKFTKSATAFMEQVHLLTEAREAYEEAISASTALRKSLDAGDQTLRSLMTQLEQVVTTHFAEPHPDKKRPETVRSEATRAAHEGNGSAGTMLP
Ga0210408_1060148623300021178SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSL
Ga0210408_1141009813300021178SoilDGAGAQPTTLTMYTEAMNKFTKSASAFMEQVHLLTEARNAYEEAMTASTALRNSLDAGDQTLRSLFTQLEQVVNTHFGEPGPDKRNPEPMKVEATG
Ga0193719_1005601323300021344SoilMGSQPPPTVTMYTEAMNKFTKSATAFMDQVHLLTEARDAYQGAIAASTALRNSLDAGDETLRSLMNQLEQVVNAHLGDPIPDRKRPELMKVQATG
Ga0193719_1013373213300021344SoilMSAAVWKKEDGVGLQPTPTVTMYTEAMNKFTKSATAFMDQVHILTEARDAYQEAMAASTALRERLDAGDQTLRSLMTQLEQVVSAHLGEHARDRK
Ga0210386_1123915413300021406SoilTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSLP
Ga0210383_1160743513300021407SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVR
Ga0210394_1031208613300021420SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTT
Ga0210384_10003206153300021432SoilMWKKEDGASPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDN
Ga0210384_1035093213300021432SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDNIGAVRTTSL
Ga0210384_1035414023300021432SoilMWKKEDGVSTPPTPTMAVYTEAMNNFTKSATAFMEHVHLLTEARDAYQTAMTASTSLRDSLDAGDQALRTLMTQLEQVVGVHLGEPALDKKKPEAVKADAIRTNVLL
Ga0210391_1110306313300021433SoilEFQTEMSAAMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSLP
Ga0210410_1045705913300021479SoilALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSLP
Ga0207685_1003487123300025905Corn, Switchgrass And Miscanthus RhizosphereMWKKEDGPGPQSTLAVTTYAEAMNKFTKSATAFMEHVHLLTEARDAYQQAMTASAALRNTLDAGDETLRSLILQLEQVVSTHLGEPSLEHKSNDATKSESSRAIKEITAA
Ga0207665_1058330213300025939Corn, Switchgrass And Miscanthus RhizosphereMSAAIWKREDGVSPQPTPTVTMYTDAMNKFTKSATAFMEQVHLLTEARDAYQEAMAASKGLRDSLDAGDQTLRSLMTQLEQVVNTHLGDPAPEKKRPEPVKAEATG
Ga0207689_1097729323300025942Miscanthus RhizospherePPPSMATYTDAMNKFTKAATAFMDHVHLLSEARDAYQAAMTASTALRNSLETGDQALRSLMEQMEQVVSAHLGEPCPDKKKVERLEPTARAPQLDEASSVKI
Ga0209839_1002320623300026294SoilMWKKDDVVGSQPTLTVSMYTEAMNKFTKSATAFMEQVHLLTEARDAYEEAMTASTALRNSLDAGDHTLRSLMTQLEQVVNTHFAEPVPDKKKPEAMKVEATG
Ga0209839_1002320643300026294SoilMNGAMWKKEDGVGAELTPIWTRYAEAMFRFSKSATAFMGHVHLLTEARAAYLEAMTASTALRNSLDAGDKTLRSLRAQLAQVVNDRLDESTLARKKPELLKNIASAKVFP
Ga0209155_106184513300026316SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRNSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGESTGTITRALP
Ga0209155_114880423300026316SoilKTEDGVSNQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP
Ga0209687_109980423300026322SoilMWKKEDGMSTQPTPTLAMYTEAMNKFTKSATAFMEHVHLLTEARDAYEDAMTTSRALRSSLDAGDQALRSLMTQMEQVINAHLSEAALDKKRPELVKVEPTRTNGESTGTITRALP
Ga0209473_104022333300026330SoilMNEAMLKTEDGVSNQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP
Ga0209808_124280113300026523SoilMLKTEDGVSTQVAPTWTMYADAMNRFTKSATAFMEHVQLLTEARDAYEEAMRASTALRHSLDAGDQTLRSLRTQLARVINDHLDQPTLDKKKPELLKSTGAAKAFP
Ga0209805_137739923300026542SoilMLKTEDGVSTQVAPTWTLYADAMNRFTKSATAFMEHVHLLTEARDAYEEAMRASTALRNSLDAGDQTLRSLRTQLARVINDHLDQPAFDRKKPELLKSTGAGKAFP
Ga0209117_119873413300027645Forest SoilMWKRDYVGPQPTLTVTMYTEATNRFTKSATAFMEQVHLLTEARAAYEEAMRVSTALRNSLDAGDQTLRSLITQLEQVVNTHVAGPVPDEKKPEPMKVEATG
Ga0209217_1000066163300027651Forest SoilMNQATWKKEEGEGARLTPSWAMYAEAMDRFTKSATAFMEHVHLLTEARTAYEEAMTASAALRSRLDAGDQTLRSLREQLARVVNDHLDEPTLDRKKPELLTGSGGWKTFP
Ga0209217_108257823300027651Forest SoilMSEAMWRKETGVSTPPKPTMAMYTDAMNKFTKSATAFMEHVPLLTEARDAYQTAISASTALRNSLDAGDQALRSLMSQLEQVVSTHMSEPVPDRKRPELVKAEPIRTNGASTATSGKFLP
Ga0209009_106589623300027667Forest SoilMWKKEDGASPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRSSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRSNNDSIGAVRTTSL
Ga0209283_1007759533300027875Vadose Zone SoilMSAAMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPALDKRKPELVKAESIREKNEGTGTGGMYP
Ga0209488_1007695533300027903Vadose Zone SoilMSAAMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIRE
Ga0209006_1008245233300027908Forest SoilMWKKEDGATPQPTPTLATHTDALNKFTKSATAFMEHVHLLTEAREAYQEAMTASAALRNSLDAGDKALRSLMAQLEQVVSAHLGEPPVDKNIDKKKMEPAKVEPIRFNNDSIGAVRTTSL
Ga0209698_1032287423300027911WatershedsMSAAMWKKEEGISPQPTSTLATYTEAMNKFTHASTAFMEHVHLLTEAREAYQEAMNASAALRNSLDAGDKSLRGLMTQLEQVVNAHLGDPNLDRKKPEGIRVEPIRGNGDSMGVVRTTSL
Ga0209526_1001229283300028047Forest SoilMWKDENDISTQPAPTLATYTEAMNEFTRSATAFMDHVHLLTAARDAYEDAMTTSRALRNSLDASDQTLRALMIQMEQVINAHLGEAAPEKERPEPLKVEATRTYGQNAGNALRSLP
Ga0209526_1001594723300028047Forest SoilMWKKEDSVTTQPIPTMAMYTEAMNKFTKSAKDFMEHVHLLTEARDAYQEAVTASKALRNSLDAGDQTLRSLMTQLEQVVNVHLGEPTPDRKKPELVKTEASRVNSDSVGVVRTFLP
Ga0209526_1010725943300028047Forest SoilMWRKEDGVSTPPMPTMATYTDAMNKFTKSATAFMEHVHLLTEARDAYQTAISASTALRNSLDAGDQALRSLMSQLEQVVSTHMGEPVPDRKRPELVKAEPIRTNGESTATSGKFLP
Ga0209526_1034794813300028047Forest SoilMNDAMWKKEDSVGTQLTPNWAIYADALNRFTHSATAFMEHVHLLTEAREAYEEAMKASIALRNTLDSGDQTLRSLRSQLARVVNDHLDEPAFDRKK
Ga0209526_1036119313300028047Forest SoilMWKKEDSVTTQAMPTMAMYTEAMNKFTKSAKDFMEHVHLLPEARDAYQEAMTVSKALRNSLDAGDQTLRSLMTQLEQVVNVHLGEPTPDRKKPELVKTEATRVNSDSVGGVRTFLP
Ga0257175_105130523300028673SoilMSAAMWKKEDGVGPQPIPTVTIYTEAMNKFTKSATAFMEQVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREK
Ga0170824_12881886913300031231Forest SoilMTAAIWKREEGVIPQAAPTMTMYTEAMNKFTKSATAFMEQVHLLTEAREAYEEAIAASTALRKSLDAGDQTLRSLMTQLEQVVTTHFAEPHPDKKRPETVKGEATRAATEGNGSAGTMLP
Ga0308194_1027551023300031421SoilMGSQPPPTVTMYTEAMNKFTKSATAFMDQVHLLTEARDAYQEAIAASTALRNSLDAGDETLRSLMNQLEQVVNAHLGDPIPDRK
Ga0307473_1007342633300031820Hardwood Forest SoilMNEATWKKEEGAGALLTPTWTMYAEAMNKFTKSAKAFLEHVHLLTEAGAAYEEAMTASAALRSSLDSGDQTLRSLSAQLEQVVNAHLEEPTLGRKKPELLKSSDGWKTFP
Ga0307479_1010527343300031962Hardwood Forest SoilMNGAAWKKEDGVGAELTPIWATYAEAMNRFTKSATAFMGNVHFLTEARAAYLEAMTASTALRNSLDAGDQTLRSLQAQLAQVVNDHLDELTLDRKKPELLKRTGSAKGFP
Ga0307479_1098194413300031962Hardwood Forest SoilMWKETTWKKADAMSSQPTPTMAMYTEAMDKFTKSATAFMEHVHLLNEARDAYHEAVSASSTIRRSLDASDQALRSLMTQLEQVVNDHLGEPALERKKLELVKAEATRTSGENTSNVSKLP
Ga0307479_1140994723300031962Hardwood Forest SoilRSLTILGRQRIMFGISCRTKGIEATMWKKEADVGAQPTLTVTMYIEAMNKFTKSASAFIEQVHLLTEARDAYEEATRASTALRNILDANDQTLRSLITQLEQVVNTHFGEPPDKKKPEPMKLEATG
Ga0307471_10098942813300032180Hardwood Forest SoilMWKDTTWKKADAMSSQPTPTMAMYTEAMDKFTKSATAFMEHVHLLNEARDAYHEAVSASSTIRRSLDASDQALRSLMTQLEQVVNDHLGEPALERKKLELVKAEATRTSGENTSNVSKLP
Ga0307472_10152513713300032205Hardwood Forest SoilMSSQPTPTMAMYTEAMDKFTKSATAFMEHVHLLNEARDAYHEAVSASSTIRRSLDASDQALRSLMTQLEQVVNDHLGEPALERKKLELVKAEATRTSGENTSNVSKLP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.