NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F075214

Metagenome / Metatranscriptome Family F075214

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F075214
Family Type Metagenome / Metatranscriptome
Number of Sequences 119
Average Sequence Length 208 residues
Representative Sequence MMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Number of Associated Samples 97
Number of Associated Scaffolds 119

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 74.58 %
% of genes near scaffold ends (potentially truncated) 31.93 %
% of genes from short scaffolds (< 2000 bps) 57.14 %
Associated GOLD sequencing projects 79
AlphaFold2 3D model prediction Yes
3D model pTM-score0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (99.160 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(36.135 % of family members)
Environment Ontology (ENVO) Unclassified
(60.504 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.067 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.130.132.134.136.138.140.142.144.146.148.150.152.154.156.158.160.162.164.166.168.170.172.174.176.178.180.182.184.186.188.190.192.194.196.198.200.202.204.206.208.210.212.214.216.218.220.222.224.226.228.230.232.234.236.238.240.242.244.246.248.250.252.254.256.258.260.262.264
1JGI25385J37094_100050713
2JGI25383J37093_100767022
3JGI25384J37096_100000874
4JGI25384J37096_101337801
5JGI25382J37095_100627152
6JGI25382J43887_100116044
7JGI25386J43895_100175492
8Ga0063454_1000162083
9Ga0063455_1000202282
10Ga0066677_100018963
11Ga0066677_101110532
12Ga0066680_100318514
13Ga0066680_102486861
14Ga0066690_100936992
15Ga0066690_106653471
16Ga0066688_105858991
17Ga0066684_101277982
18Ga0066676_102899372
19Ga0066675_100631483
20Ga0066388_1068158481
21Ga0070708_1001061753
22Ga0066689_104293882
23Ga0066681_103376581
24Ga0066687_100385611
25Ga0070681_103266001
26Ga0070706_1001544151
27Ga0070697_1004094042
28Ga0070697_1005558962
29Ga0066701_100250234
30Ga0066661_100295461
31Ga0066692_100169914
32Ga0066704_105040381
33Ga0066670_102305031
34Ga0066703_101968462
35Ga0066705_100055842
36Ga0066705_100222672
37Ga0066691_104051752
38Ga0066691_105028121
39Ga0066654_103651171
40Ga0066706_102484152
41Ga0066696_101007742
42Ga0066652_1005541622
43Ga0070716_1013520811
44Ga0066659_101400911
45Ga0099791_100266992
46Ga0099793_100275601
47Ga0066710_1005239583
48Ga0066710_1014654972
49Ga0066710_1030054301
50Ga0099828_111733021
51Ga0099827_100016576
52Ga0066709_1036715031
53Ga0099792_106626101
54Ga0126374_100092673
55Ga0126384_103388172
56Ga0134109_101708671
57Ga0134067_100741681
58Ga0134066_100256492
59Ga0134124_108675451
60Ga0134122_120558381
61Ga0134121_113013951
62Ga0137388_104230361
63Ga0137388_120068021
64Ga0137377_104119792
65Ga0137387_107066921
66Ga0137360_100717824
67Ga0137360_111673871
68Ga0137361_100242713
69Ga0137397_100096743
70Ga0137396_100354214
71Ga0137396_101346053
72Ga0137419_108096502
73Ga0137416_100043577
74Ga0137416_100561953
75Ga0137407_100301774
76Ga0134073_102490711
77Ga0066662_102453382
78Ga0066669_104362032
79Ga0213851_143011123
80Ga0207662_102641621
81Ga0209438_10601342
82Ga0209235_10073353
83Ga0209235_12644751
84Ga0209236_10056804
85Ga0209238_10791332
86Ga0209761_10002581
87Ga0209761_11349091
88Ga0209761_12048791
89Ga0209686_10335183
90Ga0209687_10205802
91Ga0209802_10596032
92Ga0209473_12716201
93Ga0209267_100094719
94Ga0209267_10150835
95Ga0209803_10292213
96Ga0209804_10350173
97Ga0209808_10480502
98Ga0209690_10335781
99Ga0209059_10390953
100Ga0209059_10397563
101Ga0209806_10199575
102Ga0209056_100695943
103Ga0209805_10014948
104Ga0209161_100749993
105Ga0209474_100098853
106Ga0209648_100697092
107Ga0209648_101660222
108Ga0209577_101552302
109Ga0209588_10132893
110Ga0209590_101874281
111Ga0209488_102022952
112Ga0137415_100094687
113Ga0137415_101678912
114Ga0137415_102424762
115Ga0307469_111827461
116Ga0307479_1000871312
117Ga0307470_105957832
118Ga0307471_1000765742
119Ga0307471_1002940892
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 76.23%    β-sheet: 0.00%    Coil/Unstructured: 23.77%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions
20406080100120140160180200Cytopl.Cytopl.Sequenceα-helicesβ-strandsCoilSS Conf. scoreTM segmentsTopol. domains
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.66
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
99.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Watersheds
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Grasslands Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Soil
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
21.0%3.4%36.1%19.3%4.2%4.2%4.2%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1000507133300002558Grasslands SoilMMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
JGI25383J37093_1007670223300002560Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGXLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
JGI25384J37096_1000008743300002561Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
JGI25384J37096_1013378013300002561Grasslands SoilMMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIALSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARAGRRRAQASGRSSQRSSG*
JGI25382J37095_1006271523300002562Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGGSSQRNSG*
JGI25382J43887_1001160443300002908Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
JGI25386J43895_1001754923300002912Grasslands SoilMMDRLDAIMXKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
Ga0063454_10001620833300004081SoilMQWVDQVVERARDRARFRVAALRATHPGEDQVALGRRLIGSSALRAGLSGAATGTLALIAFPIGLPAGVAVSLYLEAELIFGLLEVYESDTAVEQGRLKLYALWAGAGFAGAARSAGLRAGARVIGRVLEGSLPVRIIRRLSPALLKAILRRLGLGWLPRAVKLWPIIGAPISFLLDRAALRTLGEATLATLDDDARKRRRQPAEGRPKRYRRVKLAAAPAR*
Ga0063455_10002022823300004153SoilMEWVDQVVERARDRARFRVAALRATHPGEDQVALGRRLIGSSALRAGLSGAATGTLALIAFPIGLPAGVAVSLYLEAELIFGLLEVYDSDTAGEQGRLKLYALWAGAGFAGAARSAGLRAGARVIGRVLEGSLPVRIIRRLSPALLKAILRRLGLGWLPRAVKLWPIIGAPISFLLDRAALRTLGEATLATLDDDARKRRRQPAEGRPKRYRRVKLAAAPAR*
Ga0066677_1000189633300005171SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG*
Ga0066677_1011105323300005171SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR*
Ga0066680_1003185143300005174SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR*
Ga0066680_1024868613300005174SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
Ga0066690_1009369923300005177SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRVGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG*
Ga0066690_1066534713300005177SoilFRIDALRPVPGQVAPGDPARRYNPWWVTMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRR
Ga0066688_1058589913300005178SoilLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRAARKRQRRKRTVRLKIARA*
Ga0066684_1012779823300005179SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRVLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADR*
Ga0066676_1028993723300005186SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALR
Ga0066675_1006314833300005187SoilVTMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADR*
Ga0066388_10681584813300005332Tropical Forest SoilLDGIVEKARDRARFRIAALRLEHPGQDEVTLGRRLIGSMALKAGFAGAATGTLSLVALPVGLPAGIAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFVDAAKSVGMRAGAGAVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDD
Ga0070708_10010617533300005445Corn, Switchgrass And Miscanthus RhizosphereMMERFDAIVEKARQRARFRVAALRLERPGQDEVALGSALAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAMDRALRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADESVRTSRGKSG*
Ga0066689_1042938823300005447SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLG
Ga0066681_1033765813300005451SoilMTPEWFTKVVDDARTRARGRVAALRAEHAGEHEIELGRRLIKSAANRAGWYGAATGTLALITLPVGLPAGIAVSLFLEAELIFALLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGARAIGKVLWESLPGQIIRKLNPVLLKAILKRLGLGWLPRALKLWPILGAPISFVLDRAALRTLGEATLATLDDASRARQKKPHGRKRTIRLKTA*
Ga0066687_1003856113300005454SoilEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGQVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR*
Ga0070681_1032660013300005458Corn RhizosphereMDRLEAIVENARQRARFRVAALRLEHPGQDQVTLGQKLVGAVALRAGFAGAATGALSLVALPLGLPAGIAVSLLLEAELIFALLDLYEVDTGGEQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRILRGSLPARIIRKLNPALVRAILRRLGLGWLPRAAKFWPVLGAPIGFALDRAALRSLGSATLATLDDDARQRRRRAPAVARSPRPPRSRRVK
Ga0070706_10015441513300005467Corn, Switchgrass And Miscanthus RhizosphereMMERLDAIVEKARQRARFRVAALRLERPGQDEVALGRALAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAMDRALRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADESVRTSRGNSG*
Ga0070697_10040940423300005536Corn, Switchgrass And Miscanthus RhizosphereMMERLDAIVEKARQRARFRVAALRLERPGQDEVALGRALAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAMDRALRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADESVRTSRGKSG*
Ga0070697_10055589623300005536Corn, Switchgrass And Miscanthus RhizosphereMERLDAIVEKARQRARFRVAALRLEHPGQDQVALGRTLAAEMALRAGFAGAATGTLSLLALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAVKNIGMRAGATAMGQVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPVLGAPVGFLLDRAAMRTLGEATLATLDDAARARRRRADPSGRSSQQSTG*
Ga0066701_1002502343300005552SoilRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
Ga0066661_1002954613300005554SoilAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRVGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSHRSSG*
Ga0066692_1001699143300005555SoilMMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
Ga0066704_1050403813300005557SoilPRRRTRPRRPRTMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG*
Ga0066670_1023050313300005560SoilDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRAARKRQRRKRTVRLKIARA*
Ga0066703_1019684623300005568SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADR*
Ga0066705_1000558423300005569SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADR*
Ga0066705_1002226723300005569SoilMDAPAWFTDVVEKARSRARARVAALRAEYPGEHDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRAARNRQRRKRTVRLKIARA*
Ga0066691_1040517523300005586SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGAIAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPV
Ga0066691_1050281213300005586SoilARRAPARRYNHRSHSGVTLDAPVWFADVVEKARSRARARVAALKAEYPGEHDVDLGQRLVKSSANRAGWTGAATGTLALITLPIGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGARAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWIPKAMKLWPLLGAPIAFVLDRAALQALGDATLATLDDASRASRKRQRRKRTVRLKIARA*
Ga0066654_1036511713300005587SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRVLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARVRRR
Ga0066706_1024841523300005598SoilMDAPAWFTDVVEKARSRARARVAALRAEYPGEHDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRVKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPKALKLWPLLGAPIAFVLDKAALQTL
Ga0066696_1010077423300006032SoilMDAPAWFTDVVEKARSRARARVAALRAEYPGEHDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRAARKRQRRKRTVRLKIARA*
Ga0066652_10055416223300006046SoilMDRLDGMVEKARERARFRIAALRLEYPGQDEVALGRRLVRSMALRAGLAGAATGTLSLVALPLGLPAGIAISLLLEAELIFALLEVYGVDTEGEQGRVKLYALWAGAGFVDAAKNVGLRAGADAVGRILRGSLPVRIIRRLNPALLKAILRRLGLGWIPRAARFWPVLGAPIAFALDRAALRALGEATLATLDDAARARRRAPSPARTGRRRSIKIRAVKA*
Ga0070716_10135208113300006173Corn, Switchgrass And Miscanthus RhizosphereMERIDAIVEKARERARFRIAALRLEYPGQDEVALGRRLIGSMALKAGFAGAATGTLSLVALPLGLPAGVAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSVGLRAGAGAVGRVLRGSLPVRIIRRLNPALLKAILRRLGLAWIPRAAKFWPVLGAPIGFALDRAALR
Ga0066659_1014009113300006797SoilMMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLFGAPVGFL
Ga0099791_1002669923300007255Vadose Zone SoilMDRFEALAEKARERARFRIAALRLEHPGEDEVSLARRLIASMSLRAGVAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFSLLELYGMETGGEQGRLKLYALWAGAGFADAAKSVGLRAGATALGRVLRGSLPARIIRRLNPALLKAILRRLGLGWLPRAAKLWPVLGAPIAFALDRAALRTLGEATLATLDDSARARRRAT
Ga0099793_1002756013300007258Vadose Zone SoilMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIALSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLIGAPVGFLLNRAVCWTRGKAQLANLDDADCSGRCRA
Ga0066710_10052395833300009012Grasslands SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0066710_10146549723300009012Grasslands SoilVTPEFVNDLVEKARVKARARVAALRALHPKEDEVDLGRRLIKSAATRAGLWGAATGTVALVALPIGLPAGVAAMLAVEAGLIFALLDLYGVDTEGEQGRLRLYALWLRAGFAYAAKSAGMNLGARALGKVLWESLPGQLIRRINPILLKAILKRLGLGWLPRAFKLWPILGAPIAFAIDRAAGKTLGDAALATLDDEARVERRRSSGARAHRS
Ga0066710_10300543013300009012Grasslands SoilFRIDALRPVPGQVAPGDPARRYNPWWVTMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLAT
Ga0099828_1117330213300009089Vadose Zone SoilMQLFDDIVEQARTKARARVAAVRVEHPGEDEVDLGRRLIRSAATRAGLWGAATGTVALVAMPVGLPAGVAISLAVEAQLVFALLELYGLDTEGEHGRLRLYALWLGAGFADAAKSAGLRTGARALGGVLWESLPGQIIRKLNPELVKAILRRLGLGWLPRALKMWPVLGAPIAFALDRAAVRALGEATLATLDDEARARRRRSRKVVHLR
Ga0099827_1000165763300009090Vadose Zone SoilMQLFDDIVEQARNRARARVAAVRVEHPGEDEVDLGRRLIRSAATRAGLWGAATGTVALVAMPVGLPAGVAISLAVEAQLVFALLELYGLDTEGEHGRLRLYALWLGAGFADAAKSAGLRTGARALGRVLWESLPGQIIRRLNPELLKAILRRLGLGWLPRALKMWPVLGAPIAFALDRAAVRALGEATLATLDDEARARRRRSRKVVHLRSSGARTRRS*
Ga0066709_10367150313300009137Grasslands SoilQDEVALGRRLVQSMALRAGVAGAATGTLALVALPLGLPAGIAISLLLEAELIFALLELYGVETEGEPGRLKLYALWAGAGFADAAKGVGLRAGANAVGRVLRGSLPVRIIRRLNPALIRAILRRLGLEWIPRAARFWPLLGAPIAFALDRAALQTLGDATLAILDDAARARRRGARARSV
Ga0099792_1066261013300009143Vadose Zone SoilLRLEHPGEDEVSLARRLIASMSLRAGVAGAATGTLSLVALPLGLPAGIALSLLLEAELIFSLLELYGMETEGEQGRLKLYALWAGAGFADAAKSVGMRAGANVVGRVLRGSLPVRIIRRLNPALLKAILRRLGLGWLPRAVKLWPVLGAPIAFALDRAALRTLGEATLATLDDSARARRRATQARRAAGSTG*
Ga0126374_1000926733300009792Tropical Forest SoilLGTPSWFNDVVEKARERARKRVERLRADHPGEHDVDLGQRLVKSSATRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLGLYGLETEGESGRLRLYALWAGAGFADAAKSVGLRAGARAIGQILRESLPGQIIRRLNPALLKAILRRIGLGWVPSAMKLWPLLGAPIAFVLDRAALQTLGDATLATLDDAARAARKRQRRKRTVRVKIARA*
Ga0126384_1033881723300010046Tropical Forest SoilMERLNGIVEKARDRARFRIAALRLEHPGQDEVTLGRKLIGAMALKAGFAGAATGTLSLVALPVGLPAGIAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGASAVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDDAARARRRAKSAARSRAQKA*
Ga0134109_1017086713300010320Grasslands SoilMDAPAWFTDVVEKARSRARARVAALRAEYPGEHDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRSARNRQRRKRTVRLKIARA*
Ga0134067_1007416813300010321Grasslands SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGF
Ga0134066_1002564923300010364Grasslands SoilVIMDRLDGMVEKARERARFRIAALRLEYPGQDEVALGRRLVRSMALRAGLAGAATGTLSLVALPLGLPAGIAISLLLEAELIFALLEVYGVDTEGEQGRVKLYALWAGAGFVDAAKNVGLRAGADAVGRILRGSLPVRIIRRLNPALLKAILRRLGLGWIPRAARFWPVLGAPIAFALDRAALRALGEATLATLDDAARARRRAPSPARTGRRRSIKIRAVKA*
Ga0134124_1086754513300010397Terrestrial SoilMNRLEAVVEKARERARFRVAALRLEHPGQDEVALGRRLIASMALRAGLAGAATGTLALVALPLGLPAGVAVSLLLEAELIFALLELYELDTEGEQGRLKLYALWAGAGFADAAKSVGLHAGAGAVGRVMRGSLPGRIIRRLNPALVRAILKRLGLEWLPRAVKFWPVLGAPIAFALDRAALRALGDATLATLMRPERSSGRGWGTGTAPRARRRRAR*
Ga0134122_1205583813300010400Terrestrial SoilALRLEHPGQDEVTLGRRLIASMALRAGLAGAATGTLALVALPLGLPAGVAVSLLLEAELIFALLELYELDTEGEQGRLKLYALWAGAGFADAAKSVGLHAGAGAVGRVLRGSLPGRIIRRLNPALVRAILKRLGLEWLPRAVKFWPVLGAPIAFALDRAALRALGDATLATLMRPERSSGRGWGTGTAPRAGRRHAR*
Ga0134121_1130139513300010401Terrestrial SoilDEHEVDVAKRLVKSAAARAGWTGAATGTLALITLPVGLPAGIAASMYLEAELIFALLEVYELDTEGERGRLRLYALWAGAGFADAAKSVGLHAGAAAVGRVLRGSLPGRIIRRLNPALVRAILKRLGLEWLPRAVKFWPVLGAPIAFALDRAALRALGDATLATLMRPERSSGRGWGTGTAPRARRRRAR*
Ga0137388_1042303613300012189Vadose Zone SoilVIMQLFDDIVEQARTKARARVAAVRVEHPGEDEVDLGRRLIRSAATRAGLWGAATGTVALVAMPVGLPAGVAISLAVEAQLVFALLELYGLDTEGEQGRLRLYALWLGAGFADAAKSAGLRTGARALGRVLWESLPGQIIRKLNPELLKAILRRLGLGWLPRALKMWPVLGAPIAFALDRAAVRALGEATLATLDDEARARRRRSRKVVHLRSTGARARRS*
Ga0137388_1200680213300012189Vadose Zone SoilEHEVDLGRRLVRSTALRAGLAGAATGTLALVTLPLGLPAGVAVSLFFEAEIIFALLELYGLETEGEAGRLRLYALWAGAGFADAAKSVGMRTGAEVIGRILTESLPGQIIRRLNPALVKAILRRLGLGWLPRAMKLWPLLGAPISFALDRAALRTLGDAALATLDDAA
Ga0137377_1041197923300012211Vadose Zone SoilMDRLEGIVEKARERARFRIAALRLEYPGQDEVALGRRLVQSMALRAGVAGAATGTLALVALPLGLPAGIAISLLLEAELIFALLELYGVETEGEPGRLKLYALWAGAGFADAAKGVGLRAGANAVGRVLRGSLPVRIIRRLNPALIRAILRRLGLEWIPRAARFWPLLGAPIAFALDRAALQTLGDATLAILDDAARARRRGARARSV*
Ga0137387_1070669213300012349Vadose Zone SoilMMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDD
Ga0137360_1007178243300012361Vadose Zone SoilMDRFEALAEKARERARFRIAALRLEHPGEDEVSLARRLIASMSLRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFSLLELYGMETEGEQGRLKLYALWVGAGFADAAKSVGLRAGATALGRVLRGSLPARIIRRLNPALLKAILRRLGLGWLPRAAKLWPVLGAPIAFALDRAALRTLGEATLATLDDSARARRRATQARRAAGSTG*
Ga0137360_1116738713300012361Vadose Zone SoilMDRLDRLVEKARERARFRIAALRLEHPGEDEVGLGRRLIESMSLRAGVAGAATGTLSLVALPLGLPAGIAISLLLEAELIFSLLELYGIETEGEQGRLKLYALWAGAGFADAAKSVGLRAGASALGRVLRGSLPVRIIRRLNPALLKAILRRLGLGWIPRAAKLWPVLGAPIAFALDRAALRTLGEATL
Ga0137361_1002427133300012362Vadose Zone SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERTALRTLGEATLATLDDAARARRRRADPGSPG*
Ga0137397_1000967433300012685Vadose Zone SoilMKRLDGIVEKARDRARFRIAALRLEHPGQDEVSLGRRLIATMALKAGLAGAATGTLSLVALPLGLPAGVAISLLLEAELIFSLLELYEVDTEGDQGLLKLYALWAGAGFADAAKSVGMRAGAGVVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDDAARSRRRARSAARSR*
Ga0137396_1003542143300012918Vadose Zone SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPALVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADPGSPG*
Ga0137396_1013460533300012918Vadose Zone SoilLRLEHPGQDEVALGGRLVASMALRAGLLGAATGTLSLVALPVDHPAGVAISLLLEAELIFALLELYGVDTEGEQGQLKLYALWAGAGFADAAKSVGLRAGATAVGRVLRGSLPGRIIRRLNPVLWKAILRRLGLEWLPRAAKFWPVLGAPIAFALDRAALRALGDATLATLDDAARARRRARPTVRHTGRRTIRIRAADR*
Ga0137419_1080965023300012925Vadose Zone SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIVRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAACARRRRADPGSPG*
Ga0137416_1000435773300012927Vadose Zone SoilMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIALSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARAGRRRAQASGRSSQRSSG*
Ga0137416_1005619533300012927Vadose Zone SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADPGSPG*
Ga0137407_1003017743300012930Vadose Zone SoilMDRLDAIMEKARQRARFRVAALGLELPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYAVWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG*
Ga0134073_1024907113300015356Grasslands SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALR
Ga0066662_1024533823300018468Grasslands SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLASEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRAGPGSPG
Ga0066669_1043620323300018482Grasslands SoilMTPEWFTKVVDDARSRARGRVAALRAEHPGEHEIELGRRLIKSAANRAGWYGAATGTLALITLPVGLPAGIAVSLFLEAELIFALLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGARAIGRILWESLPGQIIRKLNPVLLRAILKRLGLGWLPRALKLWPILGAPISFVLDRAALRTLGEATLATLDDAARARQKKPHSRKRTIRLKTA
Ga0213851_1430111233300021860WatershedsMPDAPAWFANVVDDARERARGRVAALRAEFPDERETDLGSRLVRSAATRAGWYGAATGTLALIALPIGLPAGIAVSLLLEAELIFALLELYGLETEGDAGRLKLYALWAGAGFADAAKSVGLRSGARALGRVLWESLPGQIIRKLNPALLKAILRRLGLGWLPRAIKLWPLLGAPIAFAMDRAALRTLGDATLATLHDAARAHRRAAAKSAGARKPGRKRGVWRATSDHGKLNT
Ga0207662_1026416213300025918Switchgrass RhizosphereMNRLEAVVEKARERARFRVAALRLEHPGQDEVALGRRLIASMALRAGLAGAATGTLALVALPLGLPAGVAVSLLLEAELIFALLELYELDTGGEQGRLKLYALWAGAGFADAAKSVGLHAGAGAVGRVLRGSLPGRIIRRLNPALVRAILKRLGLEWLPRAVKFWPVLGAPIAFALDRAALRALGDATLATLMRPERSSGRGWGTGTAPRARRRRAR
Ga0209438_106013423300026285Grasslands SoilMKRLDGIVEKARDRARFRIAALRLEHPGQDEVSLGRRLIATMALKAGLAGAATGTLSLVALPLGLPAGVAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGAEVVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDDAARSRRRARSAARSRH
Ga0209235_100733533300026296Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Ga0209235_126447513300026296Grasslands SoilGQDEVALGRRLVQSMALRAGVAGAATGTLALVALPLGLPAGIAISLLLEAELIFALLELYGVETEGEPGRLKLYALWAGAGFADAAKGVGLRAGANAVGRVLRGSLPVRIIRRLNPALIRAILRRLGLEWIPRAARFWPLLGAPIAFALDRAALQTLGDATLAILDDAARARRRG
Ga0209236_100568043300026298Grasslands SoilMMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Ga0209238_107913323300026301Grasslands SoilMMDRLDAIMGKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGNQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSHRSSG
Ga0209761_100025813300026313Grasslands SoilQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Ga0209761_113490913300026313Grasslands SoilQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARAGRRRAQASGRSSQRSSG
Ga0209761_120487913300026313Grasslands SoilMDRLEGIVEKARERARFRIAALRLEYPGQDEVALGRRLVQSMALRAGVAGAATGTLALVALPLGLPAGIAISLLLEAELIFALLELYGVETEGEQGRLKLYALWAGAGFADAAKGVGLRAGANAVGRVLRGSLPVRIIRRLNPALIRAILRRLGLEWIPRAARFWPLLGAPIAFALDRAALQTLG
Ga0209686_103351833300026315SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFAAAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209687_102058023300026322SoilMMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSLGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALHTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209802_105960323300026328SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0209473_127162013300026330SoilMFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRVLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARVRRRRADE
Ga0209267_1000947193300026331SoilMMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALHTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209267_101508353300026331SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0209803_102922133300026332SoilMMDRLDAIMEKARQRARFRVAALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGMLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Ga0209804_103501733300026335SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0209808_104805023300026523SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADR
Ga0209690_103357813300026524SoilMMDRLDAIMEKARQRARFRVVALRLEHPGQDEVALGRRLVAEMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDSARARRRRAQASGRSSQRNSG
Ga0209059_103909533300026527SoilMMDRLDAIMGHARQRARFRVAALRLEHPGRDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209059_103975633300026527SoilMERLDAIVERARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0209806_101995753300026529SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMSAGATAIDRVLRGSLPGRIIGRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209056_1006959433300026538SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRVLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARVRRRRADESGRSSRRNSR
Ga0209805_100149483300026542SoilMMDRLDAIMGHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIAVSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQASGRSSQRSSG
Ga0209161_1007499933300026548SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSIGMRAGATAIDRMLRGSLPGRIIRRLNPELIKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARVRRRRADESGRSSRRNSR
Ga0209474_1000988533300026550SoilMDAPAWFTDVVEKARSRARARVAALRAEYPGEHDVDLGQRLVKSAANRAGWTGAATGTLALITLPVGLPAGIAASLFLEAELIFGLLELYGLETEGEAGRLKLYALWAGAGFADAAKSVGLRTGAKAIGRVLRESLPGQIIRRLNPALLKAILRRLGLGWVPRALKLWPLLGAPIAFMLDKAALQTLGDATLATLDDASRAARNRQRRKRTVRLKIARA
Ga0209648_1006970923300026551Grasslands SoilMEWFDQLVEKARRRARARVTAARAAHPGEHDVDLGRRLVRSAALRAGLAGAATGTLALVTLPLGLPAGVAVSLFFEAEIIFVLLELYGLETEGEAGRLRLYALWAGAGFADAAKSVGMRTGAEVIGRILTESLPGQIIRRLNPALVKAILRRLGLGWLPRAMKLWPLLGAPLSFALDRAALRTLGDAALATLDDAARARRKAARSGRRRTVRVRPKPVSA
Ga0209648_1016602223300026551Grasslands SoilMDAFAELVDKARARARGRVAALRAEHPAEDEVDLGRRLVRSAANRAGWWGAATGTVALIALPVGLPAGVAVSLFLEAELILSLLELYGLETEGEQGRLKLYALWAGAGFADAAKSVGLRSGAQLIGRVLWESLPGQIIRRLNPVLLKAILRRLGLGWLPRALKLWPVLGAPIAYALDRAALRTLGE
Ga0209577_1015523023300026552SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFADAAKSIGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFFLERAALRTLGEATLATLDDAARARRRRADESGRSSRRNSR
Ga0209588_101328933300027671Vadose Zone SoilMQRLDAIVEKARERARFRIAALRLDHPGQDEVALGGRLVASMALRAGLLGAATGTLSLVALPVGLPAGVAISLLLEAELIFALLELYGVDTEGEQGQLKLYALWAGAGFADAAKSVGLRAGATAVGRVLRGSLPGRIIRRLNPVLWKAILRRLGLEWLPRAAKFWPVLGAPIAFALDRAALRALGDATLATLDDAARARRRARPTVRRTGRRTIRIRAAGR
Ga0209590_1018742813300027882Vadose Zone SoilMQLFDDIVEQARNRARARVAAVRVEHPGEDEVDLGRRLIRSAATRAGLWGAATGTVALVAMPVGLPAGVAISLAVEAQLVFALLELYGLDTEGEHGRLRLYALWLGAGFADAAKSAGLRTGARALGRVLWESLPGQIIRRLNPELLKAILRRLGLGWLPRALKMWPVLGAPIAFALDRAAVRALGEATLATLDDEARARRRRSRKVVHLRSSGARTRRS
Ga0209488_1020229523300027903Vadose Zone SoilMDRLEALTEKARERARFRIAALRLEHPGEDEVSLARRLIASMSLRAGVAGAATGTLSLVALPLGLPAGIALSLLLEAELIFSLLELYGMETEGEQGRLKLYALWAGAGFADAAKSVGLRAGANVVGRVLRGSLPVRIIRRLNPALLKAILRRLGLGWLPRAAKLWPVLGAPIAFALDRAALRTLGEATLATLDDSARARRRAAQARRSPGPTG
Ga0137415_1000946873300028536Vadose Zone SoilMMDRLDAIMEHARQRARFRVAALRLEHPGQDEVALGRTLVGAMALRAGFAGAATGTLSLVALPLGLPAGIALSLLLEAELIFALLELYGLDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGATAIGRVLRGSLPGRIIRRLHPELIKAILRRLGLGWLPRAARFWPLLGAPVGFLLDRAALRTLGEATLATLDDAARAGRRRAQASGRSSQRSSG
Ga0137415_1016789123300028536Vadose Zone SoilMERLDAIVEKARQRARFRVAALRLEHPGQDEVALGRTLAGEMALRAGFAGAATGTLSLVALPLGLPAGIAASLFLEAELIFALLELYGLDTEGDQGRLKLYALWTGAGFAGAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPLLGAPVGFLLERAALRTLGEATLATLDDAARARRRRADPGSPG
Ga0137415_1024247623300028536Vadose Zone SoilMQRLDAIVEKARERARFRIAALRLEHPGQDEVALGGRLVASMALRAGLLGAATGTLSLVALPVGLPAGVAISLLLEAELIFALLELYGVDTEGEQGQLKLYALWAGAGFADAAKSVGLRAGATAVGRVLRGSLPGRIIRRLNPVLWKAILRRLGLEWLPRAAKFWPVLGAPIAFALDRAALRALGDATLATLDDAARARRRARPTVRRTGRRTIRIRAADR
Ga0307469_1118274613300031720Hardwood Forest SoilALRLEHPGQDEVSLGRRLIATMALKAGLAGAATGTLSLVALPLGLPAGVAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGAGAVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPILGAPIGFALDRAALRTLGEATLATLDDAARSRRRARSAARSR
Ga0307479_10008713123300031962Hardwood Forest SoilMEWFDQLVSTARERGRARVAQLRAAHPEEHEVDLGCRLVRAAALRAGLAGAATGTLALITLPLGLPAGVAVSLFLEAELIFALLELYGLSTSGESGRLRLYALWAGAGFADAAKSVGLRTGAQALGRILTESLPGQLIRRLNPALVKLILRRLGLGWLPKAMKLWPLLGAPIAFALDHAALRSLGDATLATLDDAARARRKSARTGRRRSLKLARA
Ga0307470_1059578323300032174Hardwood Forest SoilMKRLDGIVEKARDRARFRIAALRLEHPGQDEVSLGRRLIATMALKAGLAGAATGTLSLVALPLGLPAGVAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSVGMRAGAGAVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDDAARSRRRARSAARSR
Ga0307471_10007657423300032180Hardwood Forest SoilMKRLDGIVEKARDRARFRIAALRLEHPGQDEVSLGRRLIATMALKAGLAGAATGTLSLVALPLGLPAGIAISLLLEAELIFSLLELYEVDTEGDQGRLKLYALWAGAGFADAAKSAGMRAGAGAVGRVLRGSLPVRLIRRLNPALLKAILRRLGLEWIPRAAKFWPVLGAPIGFALDRAALRTLGEATLATLDDAARSRRRARSAARSR
Ga0307471_10029408923300032180Hardwood Forest SoilMTDRFDGIVEQARRRAGFRVAALRLELPGQDEVAVGRKLVEAMALRAGFAGAATGTLALVALPLGLPAGIAVSLLLEAELIFALLELYGVDTGGDQGRLKLYALWAGAGFADAAKSVGMRAGATAMGRVLRGSLPGRIIRRLNPELVKAILRRLGLGWLPRAARFWPVLGAPVGFLLDRAALRTLGEATLATLDDAARARRRRAQSSGRSSPRRSD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.