NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F078507

Metagenome Family F078507

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F078507
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 130 residues
Representative Sequence MKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSHEDP
Number of Associated Samples 87
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 32.76 %
% of genes near scaffold ends (potentially truncated) 31.03 %
% of genes from short scaffolds (< 2000 bps) 84.48 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(32.759 % of family members)
Environment Ontology (ENVO) Unclassified
(49.138 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.586 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.
1JGI25382J43887_103178371
2Ga0066674_101507072
3Ga0066674_101625481
4Ga0066680_106686961
5Ga0066685_101908071
6Ga0066685_103822961
7Ga0066678_101886161
8Ga0066676_100686792
9Ga0066676_110442701
10Ga0070690_1011576062
11Ga0070690_1012107541
12Ga0070688_1004497512
13Ga0070708_1011091361
14Ga0066682_101495992
15Ga0066682_101866262
16Ga0070707_1019694021
17Ga0070697_1007064011
18Ga0066692_102514961
19Ga0066707_109780271
20Ga0066704_105765841
21Ga0066698_110510181
22Ga0066705_104662222
23Ga0066705_104669591
24Ga0066694_103730662
25Ga0066708_108434982
26Ga0068859_1010636961
27Ga0068863_1001262582
28Ga0066652_1001032962
29Ga0066652_1015697211
30Ga0066653_106431801
31Ga0066665_103877482
32Ga0066665_112125731
33Ga0066665_112591491
34Ga0066659_100995922
35Ga0066659_115981931
36Ga0066660_112097931
37Ga0066710_1000645653
38Ga0066710_1004872472
39Ga0066710_1032827632
40Ga0111539_101125462
41Ga0111539_101373952
42Ga0066709_1000181142
43Ga0066709_1000763802
44Ga0111538_107711732
45Ga0075423_113085991
46Ga0134088_102488071
47Ga0134086_102139741
48Ga0134062_102441932
49Ga0134066_101377131
50Ga0134127_136410031
51Ga0134123_106503562
52Ga0137388_113286562
53Ga0137383_102342041
54Ga0137383_103405061
55Ga0137365_106599521
56Ga0137363_108437201
57Ga0137374_108947241
58Ga0137380_100394002
59Ga0137380_101302742
60Ga0137380_107780281
61Ga0137376_104054242
62Ga0137376_116395531
63Ga0137378_108996531
64Ga0137377_102435692
65Ga0137377_106554801
66Ga0137377_106774821
67Ga0137377_116647152
68Ga0137370_106451041
69Ga0137372_102810182
70Ga0137367_109929541
71Ga0137366_100938632
72Ga0137369_108611701
73Ga0137371_100490351
74Ga0137371_105724041
75Ga0137384_106785031
76Ga0137384_109862691
77Ga0137368_106739671
78Ga0137375_105260902
79Ga0137361_103320471
80Ga0137373_100741563
81Ga0137395_103771441
82Ga0137394_114509181
83Ga0137359_109059782
84Ga0137404_113891001
85Ga0134110_102563851
86Ga0134079_100090002
87Ga0137403_110589711
88Ga0134085_103054412
89Ga0134085_104108121
90Ga0132258_101108054
91Ga0132258_123174752
92Ga0134083_103353082
93Ga0134083_105480111
94Ga0184618_104504951
95Ga0066655_102923881
96Ga0066667_119573111
97Ga0066662_107008371
98Ga0190270_106738631
99Ga0210382_100500952
100Ga0193719_102338001
101Ga0247691_10331431
102Ga0247661_10720131
103Ga0207662_107375252
104Ga0207670_100093783
105Ga0209469_11553621
106Ga0209055_12971371
107Ga0209266_10565072
108Ga0209266_11505901
109Ga0209375_10771664
110Ga0209375_11596791
111Ga0209056_101197043
112Ga0209056_103086881
113Ga0209376_10117846
114Ga0209161_102685242
115Ga0209474_105697112
116Ga0307471_1016168571
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: Yes Secondary Structure distribution: α-helix: 87.69%    β-sheet: 0.00%    Coil/Unstructured: 12.31%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100110120130MKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSHEDPCytopl.Extracel.Sequenceα-helicesβ-strandsCoilSS Conf. scoreSignal PeptideTM segmentsTopol. domains
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
100.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Groundwater Sediment
Soil
Vadose Zone Soil
Terrestrial Soil
Grasslands Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
29.3%8.6%32.8%7.8%4.3%3.4%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25382J43887_1031783713300002908Grasslands SoilMRADRKESLHKWMNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWTNDFAAVREKATLNETSDHKLMLELGNHLASIESKLELLMGEHQRDRKLQSGHDGP*
Ga0066674_1015070723300005166SoilVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066674_1016254813300005166SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTIKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLATVHEKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP*
Ga0066680_1066869613300005174SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSTSSETNDSKLMLELGHRLSSIESKLELLMGDYRRDRKLPGSHEDP*
Ga0066685_1019080713300005180SoilRQAKSEAMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066685_1038229613300005180SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0066678_1018861613300005181SoilVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDFSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066676_1006867923300005186SoilLMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066676_1104427013300005186SoilVGTREIVKQAREMRADRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFQTVAASDARLKAHQDYLNEVLKRWVNDFATVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKK
Ga0070690_10115760623300005330Switchgrass RhizosphereMRAVRKESLRKWVNLLVPALSLLLAGLMGPIAFTLRSTVKDMVRQELASNETVAASDARLKAHQDYLNEILKRWANDFAAVREKAAFSETNDHKLMLELDNRLSSIESKLELLMREHRKMQNNHGDP*
Ga0070690_10121075413300005330Switchgrass RhizosphereMKVVQKALLQKWVNVLVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAILREKVASGETNDSKVILEIGHRLSSIESKLELLMRDQLRDKKSQSGQDGP*
Ga0070688_10044975123300005365Switchgrass RhizosphereMRAVRKESLRKWVNLLVPALSLLLAGLMGPIAFTLRSTVKDMVRQELASNETVAASDARLKAHQDYLNEILKRWANDFAAVREKAALSETNDHKLMLELGNRLSSIESKLELLMGEKRKPQNSHGDP*
Ga0070708_10110913613300005445Corn, Switchgrass And Miscanthus RhizosphereMRADRKESLRRWVNVLVPALSLLLACLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHQDYLNEVLKRWATDLALVREKAAMSETNDHKLMLELGYRLSSIESKLELLMGEQKRERKLESSHGDP*
Ga0066682_1014959923300005450SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTIKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLATVHEKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP*
Ga0066682_1018662623300005450SoilMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0070707_10196940213300005468Corn, Switchgrass And Miscanthus RhizosphereNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSAIESKLEVLMGDYRRDRKLPSSHEDP*
Ga0070697_10070640113300005536Corn, Switchgrass And Miscanthus RhizosphereMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSAIESKLEVLMGDYRRDRKLPSSHEDP*
Ga0066692_1025149613300005555SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKL
Ga0066707_1097802713300005556SoilERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0066704_1057658413300005557SoilMIEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASSQTVAASEARLKAHQDYLNEVLKRWANDFAAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSRGSHGDP*
Ga0066698_1105101813300005558SoilNRRLKNRVVTGQMVEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFQTVAASDARLKAHQDYLNEVLKRWVNDFATVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSQGSHGDP*
Ga0066705_1046622223300005569SoilMKTERKETLRKWVNVSVPVLSLLLAGLMGPVAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGE
Ga0066705_1046695913300005569SoilWPWATLSNSNRDMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASNETVAATDARLKAQQDYLNEVLKRWANDLAIVREKSAVSETNDHKLMLELGYRLSSIESKLELLTEEQRREKKLPGSHGDP*
Ga0066694_1037306623300005574SoilNCRTTTRDMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0066708_1084349823300005576SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGH
Ga0068859_10106369613300005617Switchgrass RhizosphereMKVVQKAVLQKWVNVLVPVLSLSLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAALREKVASGETNDYKVILEIGHRLSSIESKLELLTGNQQRERKSQGGQDGP*
Ga0068863_10012625823300005841Switchgrass RhizosphereMKVVQKAVLQKWVNVLVPVLSLSLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAILREKVASGETNDSKVILEIGHRLSSIESKLELLMRDQLRDKKSQSGQDGP*
Ga0066652_10010329623300006046SoilMRADRKESLRRWVNVVVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHEDYLNEVLKRWANDLALVREKAAVSETNDHKLMLDLGNRLSSIESKLEILMEEHKRERKLESSHGDP*
Ga0066652_10156972113300006046SoilVETSEIVKEVRDMKAVRKESLRKWANVLVPALSLLLAGLMGPVAFTLKSTVKDLVRQELASNETVAASDARLKAHQDYLNEVLKRWANDLATVHEKSASSETNDYKLMLELGHRLSSIESKLEVLMG
Ga0066653_1064318013300006791SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLATVHEKSASSETNDYKLMLELGHRLSSIESKLELLMAEHQREKKLRSGHEDP*
Ga0066665_1038774823300006796SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASNETVAATDARLKAHQDYLNEVLKRWANDLAVVREKSAVSETNDHKLMLELGDRLSSIESKLEVLMGEQRRE
Ga0066665_1121257313300006796SoilMRADQKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIE
Ga0066665_1125914913300006796SoilMKTERKESLRKWVNVSVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMG
Ga0066659_1009959223300006797SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0066659_1159819313300006797SoilTICARPNRKLMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066660_1120979313300006800SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKVLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSTSSETNDSKLMLELGHRLSSIESKLELLMGDYRRDRKLPGSHEDP*
Ga0066710_10006456533300009012Grasslands SoilMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDFSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0066710_10048724723300009012Grasslands SoilMVEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELAGSQTVAASEARLKAHQDYLNEVLKRWANDFAAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSRGSHGDP
Ga0066710_10328276323300009012Grasslands SoilVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFQTVAASDARLKAHQDYLNEVLKRWVNDFATVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSQGSHGDP
Ga0111539_1011254623300009094Populus RhizosphereMKIIQKAFLQKWVNVLVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFATLREKVASGETNDSKVILEIGHRLSSIESKLELLMRNQLRERNSQSGQDGP*
Ga0111539_1013739523300009094Populus RhizosphereMRADRKESLRRWVNVAVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHQDYLNEVLKRWASDLAAVRDKAAVSETNDHKLMLDLGNRLSSIESKLEILMGENKREKKLESSHGGP*
Ga0066709_10001811423300009137Grasslands SoilMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDFSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0066709_10007638023300009137Grasslands SoilMIEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELAGSQTVAASEARLKAHQDYLNEVLKRWANDFAAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSRGSHGDP*
Ga0111538_1077117323300009156Populus RhizosphereMKIIQKAFLQKWVNVLVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEGLKRWADDFATLREKVASGETNDSKVILEIGHRLSSIESKLELLMRNQLRERNSQSGQDGP*
Ga0075423_1130859913300009162Populus RhizosphereMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFETVAASDARLKAHQDYLNEVLKRWANDLAAVREKAAMSETNDHKLMLDLGNRLASIESKLEVLMGEHKHDKKLQSSHGDP*
Ga0134088_1024880713300010304Grasslands SoilKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLATVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQSGHDGP*
Ga0134086_1021397413300010323Grasslands SoilMKTERKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSYETVTAADARLKAHQDYLNEVLKRWANDLATVHEKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0134062_1024419323300010337Grasslands SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0134066_1013771313300010364Grasslands SoilMRADRKESLRRWVNVVVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHEDYLNEVLKRWANDLALVREKAAVSETNEHKLMLDLGNRLSSIESKLEILMEEHKRERKLESSHGDP*
Ga0134127_1364100313300010399Terrestrial SoilMKVVQKALLQKWVNVLVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFATLREKVASGETNDSKVILEIGHRLSSIESKLELLMRNQLRERNSQSGQDGP*
Ga0134123_1065035623300010403Terrestrial SoilMKVVQKALLQKWVNVLVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAALREKVASGETNDYKVILEIGHRLSSIESKLELLTGNQQRERKSQGGQDGP*
Ga0137388_1132865623300012189Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWGNDFAAVREKSASSETNDYKLMLELGHRLSSIESKLSVLMGEYQRDRKLQ
Ga0137383_1023420413300012199Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMADHQREKKSQSSHDNP*
Ga0137383_1034050613300012199Vadose Zone SoilNVLVPALSLLLAGLMGPIAFTLKSTVKDLVREELASNETVAATDARLKAHQDYLNEVLKRWANDLAIVREKSAVSETNDHKLMLELGYRLSSIESKLELLTGEQRREKKLPGSHGDP*
Ga0137365_1065995213300012201Vadose Zone SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYQTVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLPSSHEDP*
Ga0137363_1084372013300012202Vadose Zone SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSHEDP*
Ga0137374_1089472413300012204Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAIREKSASSDTNDYKLILELRHRLSSIESKLEVLMGEYRRERKLQGSHEDP*
Ga0137380_1003940023300012206Vadose Zone SoilMVEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASTQTVAASEARLKAHQDYLNEVLKRWANDFAAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQRAKKPQGSHGDP*
Ga0137380_1013027423300012206Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTMKDLVRQELASYETVAASDARLKAHQDYLNEVIKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMGEYRRDRKLQSSHDGP*
Ga0137380_1077802813300012206Vadose Zone SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASNETVAATDARLKAHQDYLNEVLKRWANDLAVVREKSAVSETNDHKLMLELGDRLSSIESKLEVLMGEQRREKKLPSSHG
Ga0137376_1040542423300012208Vadose Zone SoilMKAERKETLRKWVNLSVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELTSFETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP*
Ga0137376_1163955313300012208Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQEYLNEVLKRWANDFAAVREKAGLNETNDNKLMLELSHHLSSIEAKLELLMGERQRDRKLQNSHEDP*
Ga0137378_1089965313300012210Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLAAVREKSTSSETNDSKLMLELGHRLSSIESKLEVLMGEYRRDRKLPSSHEDP*
Ga0137377_1024356923300012211Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWSNDFAAVREKSAASDTNDYKLILELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0137377_1065548013300012211Vadose Zone SoilMRTDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMGEYRRDRKVQSSHDGP*
Ga0137377_1067748213300012211Vadose Zone SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASNETVAATDARLKAHQDYLNEVLKRWANDLAIVREKSAVSETNDHKLMLELGDRLSSIESKLEVLMGEQRREKKLPSSHGDL*
Ga0137377_1166471523300012211Vadose Zone SoilMKTERKESLRKWVNVSVPVLSLLLAGLMGPIAFTIKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDLATVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP*
Ga0137370_1064510413300012285Vadose Zone SoilMKTERKETLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMGE
Ga0137372_1028101823300012350Vadose Zone SoilMRADQKESLRRGVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTRNETVAASDARLKAHQDYLNEVLKRWANDLALVREKAAVSETNDHKLMLELGNRLSSIESKLELLMGEQKREKKLPSRHGDP*
Ga0137367_1099295413300012353Vadose Zone SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLELLMGEYRRDRKVQSSHDGP*
Ga0137366_1009386323300012354Vadose Zone SoilMKVVQKESLRKWVNVLVPALSLLLAGLMGPIAFTLRSTVKDLVRQELASYETVAASDARLKAHQEYLNEVLKRWANDFAAVREKAGLNETNDNKLMLELSHHLSSIEAKLELLMGERQRDRKLQNNHEDP*
Ga0137369_1086117013300012355Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSVSSETNDYKLMMELGHRLSSIESKLELLMGEYQRDRKLQSGHANP*
Ga0137371_1004903513300012356Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASDTNDYKLILELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP*
Ga0137371_1057240413300012356Vadose Zone SoilMIEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASSQTVAASEARLKAHQDYLNEVLKRWVNDFATVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSQGSHGDP*
Ga0137384_1067850313300012357Vadose Zone SoilKCVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASTQTVAASEARLKAHQDYLNEVLKRWANDFAAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQRAKKPQGSHGDP
Ga0137384_1098626913300012357Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIESKLELLMVEYRRDRKWQSSHDGP*
Ga0137368_1067396713300012358Vadose Zone SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYQTVAASDARLKAHQDYLNEVLKRWANDFAAVREKSVSSETNDYKLMMELGHRLSSIESKLELLMGEYQRDRKLQSGHENP*
Ga0137375_1052609023300012360Vadose Zone SoilMKVVQKESLRKWVNVLVPALSLMLAGLMGPIAFTLKSTVKDLVRQELASNETVAASDARLKAHQDYLNEVLKRWATDFAALRERTALNETNDYSLMLELRRRLSSIESKLELLTGQYQRDKNRPDSHNDP*
Ga0137361_1033204713300012362Vadose Zone SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSREDP*
Ga0137373_1007415633300012532Vadose Zone SoilMRADQKESLRRGVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTRNETVAASDARLKAHQDYLNEVLKRWANDLALVREKAAVSETNDHKLMLELGNRLSSIESKLELLMGEQKREKKLPSSHGDP*
Ga0137395_1037714413300012917Vadose Zone SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSI
Ga0137394_1145091813300012922Vadose Zone SoilMRADRKESLRKWVNVLVPALSLLLAGLMGPIAFTLRSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMQELGHRLSSIESKLELLTREYQRD
Ga0137359_1090597823300012923Vadose Zone SoilMVEQDGAMKVDRKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASFQTVAASDARLKAHQDYLNEVVKQWVNDFATVREKVALRETNDYRVMLELDNRLSSIESKLELLMEDHQREKKSPGSHGDP*
Ga0137404_1138910013300012929Vadose Zone SoilMKAEREESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSREDP*
Ga0134110_1025638513300012975Grasslands SoilLLAGLMGPVAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP*
Ga0134079_1000900023300014166Grasslands SoilVETSEIVKEVRDMKAVRKESLRKWANVLVPALSLLLAGLMGPVAFTLKSTVKDLVRQELASNETVAASDARLKAHQDYLNEVLKRWANDFAAVREKAALSDTNDHKLMLELGNRLSSIETKLELLMGEHRKLQNSHGDP*
Ga0137403_1105897113300015264Vadose Zone SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLQSSREDP*
Ga0134085_1030544123300015359Grasslands SoilMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP*
Ga0134085_1041081213300015359Grasslands SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSYETVAASDARLKAHQDYLNEVLKRWANDLATVHEKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQSGHDGP*
Ga0132258_1011080543300015371Arabidopsis RhizosphereMKVVQKALLQKWVNVMVPVLSLLLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFATLREKVASGETNDSKVILEIGHRLSSIESKLELLMGNQQRDAKSQSGQNGP*
Ga0132258_1231747523300015371Arabidopsis RhizosphereVETTVVANEDRDMRAIRKESFRRWVSVLVPALSLFLAALMGPIAFTLKSTVKDLVRQELAGNETIASSDARLKAHQDYLNEVLRRWANDFAAVREKAAVSETNDHKLMLELGNRLSSIESKLELLMEAQRKLQYTHGDP*
Ga0134083_1033530823300017659Grasslands SoilGLMGPIAFTLKSTVKDLVRQELAGSQTVAASEARLKAHQDYLNEILKRWANDFSAVREKVALRETNDYRVMLELGNRLSSIESKLELLMEDHQREKKSQGSHGDP
Ga0134083_1054801113300017659Grasslands SoilLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLELLMGEHQREKKLRSGHEDP
Ga0184618_1045049513300018071Groundwater SedimentSLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLELLTREHQRERK
Ga0066655_1029238813300018431Grasslands SoilVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSILKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0066667_1195731113300018433Grasslands SoilYTLNRQSDYLRQAKSEAMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0066662_1070083713300018468Grasslands SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSTSSETNDSKLMLELGHRLSSIESKLELLMGDYRRDRKLPGSHEDP
Ga0190270_1067386313300018469SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELARFETVAASDARLKAHQDYLNEVLKRWAHDFAAVREKAASSETNDHKLMLEVDNRLSSIESKLELLAQEHQRDRKLRSGHKNP
Ga0210382_1005009523300021080Groundwater SedimentMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGNRLSSIELKLELLTREYQRDRKLP
Ga0193719_1023380013300021344SoilMKAERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLPSSHEGP
Ga0247691_103314313300024222SoilMRADRKESLRRWVNVIVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHQDYLNEVLKRWANDLALVREKAAVSETNDHKLMLDLGNRLSSIESKLEILMGEHKRERKLESSHGDP
Ga0247661_107201313300024254SoilRKESLRRWVNVIVPALSLLLAGLMGPIAFTLKSTVKDLVRQELTSNETVAASDARLKAHQDYLNEVLKRWANDLALVREKAAVSETNDHKLMLDLGNRLSSIESKLEILMGEHKRERKLESSHGDP
Ga0207662_1073752523300025918Switchgrass RhizosphereMKVVQKAVLQKWVNVLVPVLSLSLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAALREKVASGETNDSKVILEIGHRLSSIESKLELLMRNQLRERNSQSGQDGP
Ga0207670_1000937833300025936Switchgrass RhizosphereMKVVQKAVLQKWVNVLVPVLSLSLAGLMGPIAFTLKSTVKDLVRQELASKETVAASDARLKAHEDYLNEVLKRWANDFAILREKVASGETNDSKVILEIGHRLSSIESKLELLMRDQLRDKKSQSGQDGP
Ga0209469_115536213300026307SoilMRYAVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLL
Ga0209055_129713713300026309SoilLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSTSSETNDSKLMLELGHRLSSIESKLELLMGDYRRDRKLPGSHEDP
Ga0209266_105650723300026327SoilVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0209266_115059013300026327SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEV
Ga0209375_107716643300026329SoilVAGYSRVRSAPYTLNRQSDYLRQAKSEANEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0209375_115967913300026329SoilMKTERKESLRKWVNVMVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSAASETNDYKLMLELGHRLSSIESKLEVLMGEYQRDRKLQSGHEGP
Ga0209056_1011970433300026538SoilVNSSVNTSRIVEQVVSMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0209056_1030868813300026538SoilMRADQKESLRKWVNVLVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASNETVAATDARLKAHQDYLNEVLKRWANDLAVVREKSAVSETNDHKLMLELGDRLSSIESKLEVLMGEQRREKKLPSSHGDL
Ga0209376_101178463300026540SoilMKRNEQRESFRKWLTLAVAILALLPAYLMGPIAFTFKTTVKDLLRQELAGYEAATASDSRLKAHQDYLNETLKRWGNDFAAAREKTASSETNDYKLMLDVSNRLSSIESKLDLLMRESQRDRKLQNGSGP
Ga0209161_1026852423300026548SoilLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP
Ga0209474_1056971123300026550SoilMKTERKETLRKWVNVSVPVLSLLLAGLMGPVAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSDTNDYKLMLELGHRLSSIESKLEVLMGEYRRERQLQGSHDGP
Ga0307471_10161685713300032180Hardwood Forest SoilMKAERKESLRKWVNVSVPALSLLLAGLMGPIAFTLKSTVKDLVRQELASYETVAASDARLKAHQDYLNEVLKRWANDFAAVREKSASSETNDYKLMLELGHRLSSIESKLEVLMGEYRRDRKLPSSHEDP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.