NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F079148

Metagenome Family F079148

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F079148
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 121 residues
Representative Sequence MTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQ
Number of Associated Samples 86
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.21 %
% of genes near scaffold ends (potentially truncated) 99.14 %
% of genes from short scaffolds (< 2000 bps) 81.03 %
Associated GOLD sequencing projects 75
AlphaFold2 3D model prediction Yes
3D model pTM-score0.42

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (92.241 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(51.724 % of family members)
Environment Ontology (ENVO) Unclassified
(45.690 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(51.724 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176
1JGI12673J13574_10074062
2JGI12635J15846_102293391
3JGI25613J43889_101712531
4JGI25617J43924_101158241
5JGI25616J43925_100020567
6JGI25616J43925_103684051
7Ga0066672_104341632
8Ga0066683_1000001018
9Ga0066680_109226281
10Ga0066685_100590002
11Ga0066701_109692901
12Ga0066695_107804332
13Ga0066707_110194421
14Ga0066656_100722671
15Ga0075029_1004315141
16Ga0075018_107773252
17Ga0066658_101411452
18Ga0066665_113212361
19Ga0066659_107538802
20Ga0079220_110196191
21Ga0099793_107149142
22Ga0099794_101634312
23Ga0099794_102098961
24Ga0066710_1000476371
25Ga0066710_1040735071
26Ga0099829_100129893
27Ga0099830_100894492
28Ga0099828_110701752
29Ga0099828_115566252
30Ga0099827_100520003
31Ga0066709_10000119812
32Ga0099792_105475952
33Ga0137392_100449781
34Ga0137392_104384361
35Ga0137389_102429921
36Ga0137389_113014452
37Ga0137388_103224201
38Ga0137388_105908012
39Ga0137388_112151732
40Ga0137388_116475381
41Ga0137388_117860131
42Ga0137388_118417582
43Ga0137388_120403521
44Ga0137364_114197051
45Ga0137383_112149552
46Ga0137362_103786861
47Ga0137362_115601432
48Ga0137362_115941581
49Ga0137381_100639271
50Ga0137377_110591492
51Ga0137370_102359302
52Ga0137384_113969011
53Ga0137385_112439532
54Ga0137360_105749222
55Ga0137361_100213954
56Ga0137361_107081622
57Ga0137390_114951002
58Ga0137390_118118742
59Ga0137358_105289262
60Ga0137396_100300061
61Ga0137396_104137862
62Ga0137396_104398902
63Ga0137394_101029681
64Ga0137416_100178021
65Ga0137404_118127391
66Ga0137407_122141691
67Ga0137411_12960846
68Ga0066655_111971971
69Ga0193730_11736681
70Ga0179594_101617202
71Ga0179592_103812091
72Ga0210399_110757092
73Ga0210408_114269511
74Ga0210394_110979761
75Ga0210402_103625362
76Ga0209238_11996582
77Ga0209240_11450801
78Ga0209761_12754762
79Ga0209155_10025121
80Ga0209131_10794691
81Ga0209377_11616832
82Ga0257157_10039532
83Ga0209807_12367121
84Ga0209807_12373411
85Ga0209157_13104521
86Ga0209056_102669282
87Ga0209161_100264873
88Ga0209161_104041391
89Ga0209474_102288962
90Ga0209648_105676641
91Ga0209577_100849912
92Ga0179593_12191091
93Ga0179587_101404321
94Ga0208575_10196631
95Ga0209179_11192421
96Ga0209735_10878872
97Ga0209076_10334561
98Ga0209076_11797782
99Ga0209588_11183981
100Ga0209588_11447421
101Ga0209073_105010622
102Ga0209180_100727461
103Ga0209180_105142021
104Ga0209701_100110651
105Ga0209701_104897441
106Ga0209283_102705422
107Ga0209283_103190631
108Ga0209283_109401332
109Ga0307469_110177172
110Ga0307475_103954651
111Ga0307475_113190071
112Ga0307479_102652962
113Ga0307479_110554151
114Ga0307479_114170582
115Ga0307472_1001895982
116Ga0307472_1009099521
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 16.55%    β-sheet: 20.69%    Coil/Unstructured: 62.76%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100110MTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQSequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.42
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
92.2%7.8%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Watersheds
Soil
Vadose Zone Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Hardwood Forest Soil
Soil
Forest Soil
Soil
51.7%17.2%11.2%4.3%6.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12673J13574_100740623300001167Forest SoilMFLLAVAALAAGDKRINLLPKLHPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKTAIHALGRFLTLDSGVWLKKPGDKKPNWDKQRVDPSGKSV
JGI12635J15846_1022933913300001593Forest SoilMRRQLRRIAGMFLLAAATLCAGDRRVHLLPKLQPSQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQEMGNKAVIHARGQFLTLDSVLKAPGDKKPDGDKQRVDPDGKSIEFTISSDGSVN
JGI25613J43889_1017125313300002907Grasslands SoilMTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQRVDPEG
JGI25617J43924_1011582413300002914Grasslands SoilMKFKLQLIAGLLLLVGAPLCVGDRRAHLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQLDAHGLLRIEILDVQETGSKAAIHARGQFLTLDSGVWLKGPHEKRPDWDKQSVHPHDRSIEFTISP
JGI25616J43925_1000205673300002917Grasslands SoilMTHKLQRTAGLLLLAGTTLCAGDRRVNLIPKLQPGQTITYLIRFQSDKIVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPR
JGI25616J43925_1036840513300002917Grasslands SoilMRRQLQRIAGMFLLAAAALCAGDTRVHLLPKLRPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQPAGSKAAIHARGQFLTLDSGVWLKAPGGKKPVGDKQ
Ga0066672_1043416323300005167SoilMRRNLLPIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKER
Ga0066683_10000010183300005172SoilLKRNLVPIAGLSLLAAATLCAGDKRVHFLPRLQPGQTITYLIRFQSDKNVKTESNVVAPMAPDAAQIDAHGLLRLEILGVQQSSSSAAIHLRGQFLTFEPNVQPR
Ga0066680_1092262813300005174SoilMRRNLLAIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKER
Ga0066685_1005900023300005180SoilLSLLAAATLCAGDKRVHFLPRLQPGQTITYLIRFQSDKNVKTESNVVAPMAPDAAQIDAHGLLRLEILGVQQSSSSAAIHLRGQFLTFEPNVQPRTPE
Ga0066701_1096929013300005552SoilMRRNLLPIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRVEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKERVDPNGKTIEFTISPDG
Ga0066695_1078043323300005553SoilMQMKRKPQRTAGLLLLAAVTLCAGDRRVSLLPKLQPGQTLIYLIRFQSDKTVKTESKVVAPMAPNATQIDAHGLLLVEVLDVQPAGAKAMIHARGQFLTLDAGAWLNKPGDNKSGWDRQ
Ga0066707_1101944213300005556SoilMKCKLQRVARLFLLAAAILDAGDKRVHLLPKLQHGQVITYLIRFKADKNVKSESNVAVPMAPNAAQVNAHGLLRVEIMDVKEMGVRPEIRARAKFLALDSGVLPKRPGDKKSDDKKTARDKQLVYPDGKTIEFTISPDGSVN
Ga0066656_1007226713300006034SoilMKRKLQLTAGMFVLTVFTAGAGDKRVNLLPKLHSGQTITYLIRYRTDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKNPNWDRQRVDPQGKRIEF
Ga0075029_10043151413300006052WatershedsMTHKPQWMLGGLLLVAFSLGATDRRVHLLPKLQPGQTIIYLIRFRSDKSVKTESKVVAPMAPDDVGLDAHGLLRVEILDVHETGGNVTIHARGQFLTTDYGAPVK
Ga0075018_1077732523300006172WatershedsVLLLAAASLGAGDRRVHLLPKLQPGQIITYLIRFQSDKTVKTESKVVAPMAPDAAQLDAHGLLRVEILNVQETAGNATIRARSRFLTPNAGAAIKAPNEKNPDMNNLRE
Ga0066658_1014114523300006794SoilMRRNLLPIVVLLLLAAAPLPAADRRTRFLPQLQPGQTITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKERVDRNGKT
Ga0066665_1132123613300006796SoilMTCRLQWIAGLLLLAAATVCAGDRRVQLIPKLQPGQTITYLIHFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPDWDKQRVD
Ga0066659_1075388023300006797SoilMRRNLLPIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKERVDPNGKTIEFTISPDGSANEV
Ga0079220_1101961913300006806Agricultural SoilMRHHLQRTTSVFFLAAVSLFAGDRRVHLLPQLQPGQVLTYLIRFQSEKNIKTESRVVAPMAPDASQIDAHGLLRVEILDVQQTSGKSAIHARGQFLTLDSGVWLKQPGEKNPNWD
Ga0099793_1071491423300007258Vadose Zone SoilPQRTAGLLLLAAVTLCAGDRRVSLLPKLQPGQTLIYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLLVEVLDVQPAGAKAMIHARGQFLTLDSGVWLKKPGDNKSG*
Ga0099794_1016343123300007265Vadose Zone SoilMMRRQLQRIAALLLLAVATLGAADKRINLLPKLQPGQTITYLIRFQTDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQTSGKAAIHARGRFLTLDSGVRLKAPGDKKSDGDKQRVDPDGKSIELTISSD
Ga0099794_1020989613300007265Vadose Zone SoilMFLLAAATLCAGDRRVHLLPKLHPGQTITYLIRFQSDKTVKTESNVVAPMAPNAAQIDAHGLLRVEILDVQPAGRKAAIHARGQFLTLDSGVWL
Ga0066710_10004763713300009012Grasslands SoilLKRNLVPIAGLSLLAAATLCAGDKRVHFLPRLQPGQTITYLIRFQSDKNVKTESNVVAPMAPDAVQIDAHGLLRLEILGVQQSSSSAAIHLRGQFLTFE
Ga0066710_10407350713300009012Grasslands SoilMRRNLLPIIVLLLLAAATLCAADRRAHFLPQLQPGQTITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLILDTGVWIKGPKDKKPNWDKERVDPNGKTIEFTISPD
Ga0099829_1001298933300009038Vadose Zone SoilMKRKLQRLAGVFLLAAATLGAGDKRVHLLPKLQCGQIIIYLIRFQADKNVKSESKVVAPLAPNAAQIDAHGLLRIEVLDVKQIDAKAEIRARAKFLNLDSGVWLKKPGQKKPAWDAQLVDPDGKTIEFSISPVGSVNDLK
Ga0099830_1008944923300009088Vadose Zone SoilMRRKLQRTAGLLLLAVATLCAGDKRINLIPKLQPRQTITYLIRFQSDKTIKTESKVVAPMAPDAAQIDAHGLLRVEILDVQEIGGKVGIHARAQFLTLDTGVWVKGPGDK
Ga0099828_1107017523300009089Vadose Zone SoilMARNLMKQRISLVVGVVLLAGTIAAAGDRRPDLFPKLQPGQTLTYLIRFQSDKKIKTESKVALAMAPSAVQLDAHGLLRVEILDVKASGGKPVVHARARFLTLDSGAWLKKPGDKKPHWDLQRVDPAGKTIDFAISP
Ga0099828_1155662523300009089Vadose Zone SoilMRSKLQRTVGLLLLAVATLCAGDKRINLIPKLQPRQTITYLIRFQSDKTIKTESKVVAPMAPDAAQIDAHGLLRVEILDVQEIGGKVGIHARAQFLTLDTGVWVKGPGDKKPDWDKQRVDPNGKSIEFTISLDGSVN
Ga0099827_1005200033300009090Vadose Zone SoilMRHQLQRIAGLFLLAVATLCAGDRRTNLLPRLQFGQTITYLIRFQSDKTVKTESKIVAPMAPNAAQIDAHGLLRVEILDVQRQGSKAAIHARGRFLTLDSGVWLKRPGDKKPDWDK
Ga0066709_100001198123300009137Grasslands SoilLKRNLVPIAGLSLLAAATLCAGDKRVHFLPRLQPGQTITYLIRFQSDKNVKTESNVVAPMAPDAAQIDAHGLLRLEILGVQQSSSSAAIHLRGQFLTFEPNVQPRTPE
Ga0099792_1054759523300009143Vadose Zone SoilMKRKLQRLAGVFLLAAATLGAGDKRVHLLPKLQCGQIIIYLIRFQADKNVKSESKVVAPLAPNAAQIDAHGLLRMEVLDVKQIDAKAEIRARAKFLNLDSGVWLKKPGQNKPAWDAQLVDPDGKTIEFTISP
Ga0137392_1004497813300011269Vadose Zone SoilMMRRQLQRIAALFLLAVATLGAADKRINLLPKLQPGQTITYLIRFQTDKTVKTESKVVAPMAPNAVQIDAHGLLRVEILDVQQTSGKAAIHARGRFLTLDSGVWLKK
Ga0137392_1043843613300011269Vadose Zone SoilMTCKLQWIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSNKTVRTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGKFLILDTGVWIKGPRDKKPNWDKLRVDPEGKSIEFTISPDG
Ga0137389_1024299213300012096Vadose Zone SoilMKHKLRRTAVLLFLAAAALGAGGSRVHLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQEIGSKSAIHARGQFLTLDSGVWVKGPGDKKPDWDKQRVD
Ga0137389_1130144523300012096Vadose Zone SoilMKRKLQRLAGVFLLAAATLGAGDKRVHLLPKLQCGQIIIYLIRFQADKNVKSESKVVAPLAPNAAQIDAHGLLRMEILDVKEINSKPEIHARAAFLTLDSGVLLKKPSDKKSADKK
Ga0137388_1032242013300012189Vadose Zone SoilMKHKLRRTAVLLFLAAAALGAGGSRVHLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQEIGSKSAIHARGQFLTLDSGV
Ga0137388_1059080123300012189Vadose Zone SoilMTRKLSTIAALLILPAASLCAGDRRVNFLPQLHSGQTITYLVRFQSDKNVKTESSVVTPLAPNAAQVDAHGLLRIEVLDVQQTATRAAIHMRGQFLTLDSGAWLKAPGEKKPNWDKQRVDPKDKSIEFTISADGSV
Ga0137388_1121517323300012189Vadose Zone SoilMRRKLQRTAGLLLLAVATLRAGDKRINLIPKLQPRQTITYLIRFQSDKTIKTESKVVAPMAPDAAQIDAHGLLRVEILDVQEIGGKVGIHARAQFLTLDTGVWVKGPGDKKPDWDKQRVDPNGKSIEFTISLDGSVN
Ga0137388_1164753813300012189Vadose Zone SoilMLLFLAAASLGAGGGRVHLLPQLHCGQTFTYLIRFQSDKTVKTESNVVAPMTPNAAQIDAHGLLRVEILDLQEMGSKAAVHARGQFLTLDSGVWLKGPNDKKPDGDNQSVDPTGKIIEFAISPDGS
Ga0137388_1178601313300012189Vadose Zone SoilMKRKLQLTAGMFLLGVFTAGAGDKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKKPDWDRQRVDPQGKRIAFTISPDGSVEK
Ga0137388_1184175823300012189Vadose Zone SoilMMRKLQPIGGLLLFAAATLCAGDKRVRFLPQLQSGQTITYLVRFQSDKNVKTESSVVTPMAPNAAQVDAHGLLRIEILDVQQTATRAAIHMRGQFLTLDSGVWLKGQDEKKPNWDKQQVNPK
Ga0137388_1204035213300012189Vadose Zone SoilMRRKLQRSAGLLLLAAATLCAGEKRINLMPKLQFGQTITYLIRFQSDKTIKTESKVVAPMAPNAAQIDAHGLLRVEILDVQEIAGKAVVHARAQFLTLDTAVWVKGPGDKKPDWDKQRVDPN
Ga0137364_1141970513300012198Vadose Zone SoilMQMKREPQRTAGLLLLAAVTLSAGDRRVSLLPKLQPGQTLIYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLLVEVLDVQPAGAKAMIHARGQFLTLDSG
Ga0137383_1121495523300012199Vadose Zone SoilLRTEKAPDTLETAASAMKRILQLTAGMFLLAVFTAGAGGKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKKPEWD
Ga0137362_1037868613300012205Vadose Zone SoilLPESRTEKAPDTLETAASAMKRILQLTAGMFLLAVFTAGAGDKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKNPNWD
Ga0137362_1156014323300012205Vadose Zone SoilMKRILQLTAGIFLLAVFTAGAGEKRVNLLPKLRLGQTITYLIRYRSDKSVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKNPNWD
Ga0137362_1159415813300012205Vadose Zone SoilMTRKLSTIFALLILPAAAICAGDRRVNFLPQLQSGQTITYLVRFQSDKNVKTESSVVTPLAPNAAQVDAHGLLRLEVLDVQQTATRAAIHMRGQFLTLDSGVWLKPPGEKKPNWDKQR
Ga0137381_1006392713300012207Vadose Zone SoilMRRNLVPIAGLLLVAAATLCAGDRRVHLLPQLQPGQTVIYLIRFQSDKTVKTESRVIAPMVPNAAQMDAHGLLRVEILGVQQTATRPAIHARGQFLTLDTGVWIKEPRDKKPNWDKQRVDPEGKTIEFTI
Ga0137377_1105914923300012211Vadose Zone SoilMRTKIQRIAGLLILAAATLCAGDRRINLLPRLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQPGSKAAIHARGRFLTLDSGVWLKRPGDKKPDWDKQRVDPHGKSIDFTISPDGSVNEV
Ga0137370_1023593023300012285Vadose Zone SoilMQMKREPQRTAGLLLLAAVTLSAGDRRVSLLPKLQPGQTLIYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLLVEVLDVQPAGAKAMIHARGQFLTLDSGVWLKKPGD
Ga0137384_1139690113300012357Vadose Zone SoilMKRILQLTAGMFLLAVFTAGAGGKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKKPEWD
Ga0137385_1124395323300012359Vadose Zone SoilMKRILQLTAGIFLLAVFTAGAGEKRVNLLPKLRSGQTITYLIRYRSDKSVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKKPDWDRQRVDPQGKSIEFTISPDGSVEKVQ
Ga0137360_1057492223300012361Vadose Zone SoilLPESRTEKAPDTLETAASAMKRILQLTAGMFLLAVFTAGAGDKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDK
Ga0137361_1002139543300012362Vadose Zone SoilMKRKLQRLAGVFLLAAATLGAGDKRVHLLPKLQCGQIIIYLIRFQADKNVKSESKVVTPLAPNAAQIDAHGLLRIEVLDVKQIDAKAEIRARAKFLNLDSGVWLKKPGQKKPAWDAQLVDPDGKTIEFTISPDGSVNDLK
Ga0137361_1070816223300012362Vadose Zone SoilMKRILQLTAGIFLLAVFTAGAGEKRVNLLPKLRLGQTITYLIRYRSDKSVKTESNVVAPMVLNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKNP
Ga0137390_1149510023300012363Vadose Zone SoilMKRILQLTAGIFLLAVFTAGAGDKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPAMHARAEFLTLDSGVWLKRPGDKNPNWDRQRVDPQGKSIEF
Ga0137390_1181187423300012363Vadose Zone SoilMKRKLQLTAGMFLLGVFTAGAGDKRVNLLPKLHTGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPVMHARAEFLTLDSGVWLKRPGDKNPNWDRQRVDPQGKSIEF
Ga0137358_1052892623300012582Vadose Zone SoilMTCKLQWIGGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLNTGVWIKGPRDKNPNWDKQRVDPEGKSIEFTISPD
Ga0137396_1003000613300012918Vadose Zone SoilMRRQLPRIATFFLLAVATLGAADKRINLLPKLQPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKTAIHARGRFLTLDSGVWLKKPGDKKPNRDKQRVDPSGKSIEFTISSDGSVNEVKG
Ga0137396_1041378623300012918Vadose Zone SoilMFLLAAAALCAGDTRVHLLPKLRPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQPAGSKAAIHARGQFLTLDSGVWLKAPGGKKPD
Ga0137396_1043989023300012918Vadose Zone SoilMRRQLQRISGLLLLAVATLGAGDKRINLLPRLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILEVQQAGTRAAIHARGRFLTLDSGVWLKRPGDKKPDWDKQRVDPHGKSIDFTISPDGSVNEV
Ga0137394_1010296813300012922Vadose Zone SoilMFLLAAATLCAGDRRVHLLPKLHPGQTITYLIRFQSDKTVKTESNVVAPMAPNAAQIDAHGLLRVEILDAQELGSKAAIRARGQFLTLDSGVRLKAPGDKKSDGDKQRVDP
Ga0137416_1001780213300012927Vadose Zone SoilMTHKLQRTAGLLLLAAATLWAGGRRVHLLPKLQPGQAITYLIRFRSEKTVKTESKVVAPMGPNAAQLDSSGLLRVEILDVQETGSKVAIHARAQFLPLDSGVSRKMNGDTKLSGEKQSAERAGKFVDFTISPDG
Ga0137404_1181273913300012929Vadose Zone SoilMRRQLPRIATFFLLAVATLGAADKRINLLPKLQPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKTAIHARGRFLTLDSGVWLKKPGDKKPNRDKQRVDHSGKSIEFTISSD
Ga0137407_1221416913300012930Vadose Zone SoilMANEMILKLQRTTGLLLLAAISLFAGDRRVHLLPQLQPGQVLTYLIRFQSEKNIKTESRVVAPMAPDASQMDAHGLLRVEILDVQQSSGKSAIHARGQFLTLDSGVWLKPPGDKNTDRDKQR
Ga0137411_129608463300015052Vadose Zone SoilMTRKLQRIAGLLLLAATTLSAGGRRVHLLPKLQPGQTITYLIRFQSDKAVKTESKVVAPMAPNAAQIDAHGLLQVEVLDVQQTGAKPMIHARGQFLTLDSGVWLKRPGDKKPDWIDSGWTPMARVSSSQFLRTAL*
Ga0066655_1119719713300018431Grasslands SoilMRRKLQRVAGLLLLAPTTLGAGDRHVHLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLLVEVLDVGETGAKPMIHARGQFLTLDSGVWLKKPGDKKPDWDRQRVDPHGKSIEFSISPDGSVNEAK
Ga0193730_117366813300020002SoilLFLLAFATLAAGDKRTNLLPRLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQLGSKAAIHARGRFLTLDSGVWLKRPGDKKPDWDKQRVDPYGKSID
Ga0179594_1016172023300020170Vadose Zone SoilLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPRDKNPNWDKQRVDPKGKSIEFTISPDGSANDVRGL
Ga0179592_1038120913300020199Vadose Zone SoilMTCKLQWIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLGTGVWIKGPRDKKPDWDKQRVDPEGKSIEFTISPD
Ga0210399_1107570923300020581SoilMDQKFPWNKRASASLRIDALTDGAAANAMTRKLSRTAALFLLASATLSAGDKRVQLLPKLQPGETITYLIRFQSDKTVKTESRVVAPMAPNDAQIDAHGLLRVEILDVQETGGMAAIHARGQFLTPNSGVWLKGRGEKKTDSDKQRVDTN
Ga0210408_1142695113300021178SoilMRKLQPIGGLFLLAAATLCAGDKRVHFLPQLQSGQTITYLVRFQSDKNVKTESNVVTPLAPNAAQVDAHGLLRIEILDVQQTAARAAIHMRGQFLTLDSGVWLKGQDEKKPNWDRQRVDPQGKSIEFTISPDGSVND
Ga0210394_1109797613300021420SoilMFFLAAAALCAGDKHVNLLPKLQPGQTITYLIRFQSDKNVKTKSNVVAPMAPNEGQTDAQGLLRVEVLDVQQTGSKTTIHARGQFLTVNSGAGLKKLDDNKPDEDKQRVDPGGKSIEFTISPD
Ga0210402_1036253623300021478SoilMKRKLIWIAGLFLLAGAGLGAVDKRVHLLPKLQRGQVIYYLIRFQSDKNVKTQSKVVAPMAPDAAQLDAHGLLCLEILEVQQSANKSSIHARGRFLSLASGVWVKKPGD
Ga0209238_119965823300026301Grasslands SoilLKIHVFKMITRGEASKTREALADVMKRKRQRIAGLLLLAATTVWARDRHVHLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNATQIDAHGLLLVEVLDVRETGARPMIHARGQFLTLDSGVWLKKPGDKKP
Ga0209240_114508013300026304Grasslands SoilMTHKLQRTAGLLLLAGTTLCAGDRRVNLIPKLQPGQTITYLIRFQSDKIVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQRVDREGKSIEFTISPDGSV
Ga0209761_127547623300026313Grasslands SoilLKRNLVPIAGLSLLAAATLCAGDKRVHFLPRLQPGQTITYLIRFQSDKNVKTESNVVAPMAPDAAQIDAHGLLRLEILGVQQSSSSAAIHLRGQFLTFEPNVQPRTPEE
Ga0209155_100251213300026316SoilMQVKREPQRTAGLLLLAAVTLCAGDRRVSLLPKLRPGQTLIYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLLVEVLDVQPAGAKAMIHARGQFLTLDSGVWLKKPGDNKSGWDRQHVDPH
Ga0209131_107946913300026320Grasslands SoilMTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQRVDPEGKSIEF
Ga0209377_116168323300026334SoilMRRNLLPIAGLLLVAAATLCAGDRRVHLLPQLQPGQTVIYLIRFQSDKTVKTESRVIAPMVPNAAQMDAHGLLRVEILGVQQTATRPAIHARGQFLTLDTGVWIKEPRDKKPDWDKQRVDPEGKTIEFTISP
Ga0257157_100395323300026496SoilMRRQLQRIAGMFLLAAAALCAGDTRVHLLPKLRPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKAAIHARGQFLTLDSGVWLKAPGD
Ga0209807_123671213300026530SoilMRRNLLPIIVLLHFAPATLCAADRPARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPK
Ga0209807_123734113300026530SoilMRRNLLPIVVLLLLAAAPLPAADRRTRFLPQLQPGQTITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRVEILDVQQTATRPAIHARGQFLTLDTGVWIKGPK
Ga0209157_131045213300026537SoilMTRRLKQIAGLLLLPAATLCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMGPNAAQLDANGLLRVEILDVQETGSKAAIHARAQFLTLDSGVSSKVK
Ga0209056_1026692823300026538SoilMRRNLLPIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKD
Ga0209161_1002648733300026548SoilMRRNLLPIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKE
Ga0209161_1040413913300026548SoilMRRNLLPIVVLLLLAAAPLPAADRRTRFLPQLQPGQTITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRVEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKE
Ga0209474_1022889623300026550SoilMRRNLLAIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRVEILDVQQTATRPAIHARGQFLTLDTG
Ga0209648_1056766413300026551Grasslands SoilMTRKLQWIAGLILLAATLCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPHDKKPNWDKLRVDPEG
Ga0209577_1008499123300026552SoilMRRNLLAIIVLLHFAPATLCAADRRARFLPQLQPGQIITYLIRFQSDKTVKTESRVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPAIHARGQFLTLDTGVWIKGPKDKKPNWDKERVD
Ga0179593_121910913300026555Vadose Zone SoilMRRQLQRIAGMFLLVAAALCAGDTRVHLLPKLRPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQPAGSKAAIHARGQFLTLDSGVW
Ga0179587_1014043213300026557Vadose Zone SoilMSGEVSEDLRSSSSAMRRQLQRIAGMFLLAAATLCAGDRRVHLLPKLHPGQTITYLIRFQSDKTVKTESNVVAPMAPNAAQIDAHGLLRVEILDAQEMGSKAAIHARGQFLTLDSGVRLKAPGDKKSDGDKQRVDPDGKSIEFTIS
Ga0208575_101966313300026920SoilMRRQLQRIATFFLLAVATLGAADKRINLLPKLQPGQTITYLIRFQSDKNVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKTAVHARGRFLILDSGVWLKKPGDKKPNWDKQRVDPSGKSIEFTIS
Ga0209179_111924213300027512Vadose Zone SoilMKHKLQPAAVFLLLAAVSLFAGDRRIHLLPQLQPGQVLTYLIRFQSEKNIKTESRVVAPMAPGASQMDAHGLLRVEILDVQQAAGKSAIHARGQFLTLDSGV
Ga0209735_108788723300027562Forest SoilMFLLAVAALAAGDKRINLLPKLHPGQTITYLIRFQSDKTVKTESNVVAPMAPNAAQIDAHGLLRVEILDVQPSGNKAAIHARGQFLTLDSGVWVKSPADKKPDW
Ga0209076_103345613300027643Vadose Zone SoilMTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQ
Ga0209076_117977823300027643Vadose Zone SoilMRRQLQRIAGLFLLAFATLAAGDKRINLLPRLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQLGSKAAIHARGRFLTLDSGVWLKRPGDKKPDWDKQ
Ga0209588_111839813300027671Vadose Zone SoilMTSRLQRILALLLLPGATLCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIHARGQFLTLDTGVWIKGPRDKKPNWDKQRVD
Ga0209588_114474213300027671Vadose Zone SoilMRRKLQRTAGLLLLAVATLCAGDKRINLIPKLQPRQTITYLIRFQSDKTIKTESKVVAPMAPDAAQIDAHGLLRVEILDVQEIGGKVGIHARAQFLTLDTGVW
Ga0209073_1050106223300027765Agricultural SoilMKHKLQRTITLLLLAAVSLFAGDRRVHLLPQLQPGQVLTYLIRFQSEKNIKTESRVVAPMAPDASQIDAHGLLRVEILDVQQTSGKSAIHARGQFLTLDSGVWLKQPGEKNPNWD
Ga0209180_1007274613300027846Vadose Zone SoilMKRKLQRLAGVFLLAAATLGAGDKRVHLLPKLQCGQIIIYLIRFQADKNVKSESKVVAPLAPNAAQIDAHGLLRIEVLDVKQIDAKAEIRARAKFLNLDSGVWLKKPGQKKPAW
Ga0209180_1051420213300027846Vadose Zone SoilMLLFLAAASLCASGGRVHLLPQLQPGQTITYLIRFQSDKSVKTESNVVAPMTPDAAQIDAHGLLRVEILDLQEIGSKTAIHARGQFLTLDSGVWLRGPNDKKPDGNKE
Ga0209701_1001106513300027862Vadose Zone SoilMRRQLQRIAGLLLLAVATLCAGDKRINLMPNLRPGQTITYLIRFQSDKTVKTESNVVAPMAPNAAQIDAHGLLRVEILDVQPTGSKITIHARGQFLVLDSGASL
Ga0209701_1048974413300027862Vadose Zone SoilMMRKLQPIGGLLLFAAATLCAGDKRVRFLPQLQSGQTITYLVRFQSDKNVKTESSVVTPMAPNAAQVDAHGLLRIEILDVQQTATRAAIHMRGQFLTLDSGVWLKGQDEKKPNWDKQRVDPKGKSIEFTISPDGSV
Ga0209283_1027054223300027875Vadose Zone SoilMKRILQLTAGIFLLAVFTAGAGDKRVNLLPKLHSGQTITYLIRYRSDKTVKTESNVVAPMVPNAAQMDAHGLLRIEILDVQQQGAKPAMHARAEFLTLDSGVWLKRPGDKKPDW
Ga0209283_1031906313300027875Vadose Zone SoilMTRKLQRIAGLLLLAAATVCAGDRRVHLIPKLQPGQTITYLIRFQSDKTVKTESKVVAPMVPNAAQIDAHGLLRIEILDVQQTATRPVIRARGQFLTLDTGVWIEGPRDKKPNWDKQRVDPEGKSIEFTISPNGS
Ga0209283_1094013323300027875Vadose Zone SoilMRRKLQRSAGLLLLAAATLCAGEKRINLIPKLQFGQTITYLIRFQSDKTIKTESKVVAPMAPNAAQIDAHGLLRVEILDVQEIGGKAVVHARAQFLTLDSGVWVKGPGDNKPEWDKQRVD
Ga0307469_1101771723300031720Hardwood Forest SoilMFLLAVATLAAGDKRINLLPKLQPGQTITYLIRFQSDKTVKTESKVVAPMAPNAAQIDAHGLLRVEILDVQQAGSKTAIHALGRFLTLDSGVWLKKPGDKKPNWDKQRVDPSGKSVEFTISSDGSVNQVKG
Ga0307475_1039546513300031754Hardwood Forest SoilMTRNVLPIAALVLLAAATLCAADRRVHLLPRLQPGQTIIYLIRFQSDKTIKTESRVVAPMVPNAAQIDAHGLLRVEILDVQQTASRPAIHARGQFLTLDTGVWIKGP
Ga0307475_1131900713300031754Hardwood Forest SoilMARKLSRIIAFLILPVASTCAGDKRVNFLPQLHSGQTITYLVRFQSDKNVKTESSVVTPLAPNAAQVDAHGLLRIEILDVRQTATRAAIHMRGQFLTLDSGVWVK
Ga0307479_1026529623300031962Hardwood Forest SoilMARKLSRIIAFLILPVASVCVGDKRVNFLPQLHSGQTITYLVRFQSDKNVKTESSVVTPLAPNAAQVDAHGLLRIEILDVRQTATRAAIHMRGQFLTLDSGVWLK
Ga0307479_1105541513300031962Hardwood Forest SoilMDQKFPWNKRASASLRIDALTDGAAANAMTRKLSRTAALFLLASATLSAGDKRVQLLPKLQPGETITYLIRFQSDKTVKTESRVVAPMAPNDAQIDAHGLLRVEILDVQETGGMAAIHARGQFLTPNSGVWLKGRGEKKTDSDKQRV
Ga0307479_1141705823300031962Hardwood Forest SoilMKCKLQRIGPLLLLASATLCAGDKRINLFPKLQAGQTITYLIRFQSDKTVKTESRVVAPMAPNDAQIDAHGLLRVEILDVRETGSEPVIHARGRFLTLDSGAWAKGPGEKKPNVEKQSEDSDGKSIEFTISSD
Ga0307472_10018959823300032205Hardwood Forest SoilMGRKPTRIASFLLLAASILCAGDGRVHLLPKLHPGQTITYLIRFQSDKAVKTESRVVAPMAPNAAQLDVHGLLQVEILDVQEAGNKSAIHARGRFLNPDSGARLKGPPDEKPDGYKQRMD
Ga0307472_10090995213300032205Hardwood Forest SoilMVRKLSSIVAFLILPAAALCGGDKRVHFLPQLQSGQTITYLVRFQSDKNVKTESSVVTPLAPNAAQVDAHGLLRIEILDVQQTGTRAAIRMRGQFLTLDSGVWVKAPG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.