NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F025072


Overview

Basic Information
Family ID F025072
Family Type Metagenome / Metatranscriptome
Number of Sequences 203
Average Sequence Length 84 residues
Representative Sequence MPDAPNTLPLLRLGRLALDPGLRALQPGHDASGLVVTATVEVPDSVSEPQYAWDVALADAAREAGTLGADERTAQALAAGAGN
Number of Associated Samples 165
Number of Associated Scaffolds 203

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 62.57 %
% of genes near scaffold ends (potentially truncated) 91.63 %
% of genes from short scaffolds (< 2000 bps) 84.24 %
Associated GOLD sequencing projects 153
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.38


Most Common Taxonomy
Group Bacteria (70.443 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (43.842 % of family members)
Environment Ontology (ENVO) Unclassified (40.887 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified (46.305 % of family members)
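
The "% of family members" values above can be turned back into approximate member counts. The sketch below assumes each percentage is computed over the 203 sequences reported under Basic Information; the page does not state the denominator, so this is an assumption, and the function name is illustrative.

```python
FAMILY_SIZE = 203  # "Number of Sequences" from Basic Information


def members_from_percent(percent, total_sequences=FAMILY_SIZE):
    """Approximate member count behind a '% of family members' value.

    Assumes the percentage was taken over `total_sequences` (an assumption).
    """
    return round(percent / 100 * total_sequences)


reported = [
    ("Most common taxonomy (Bacteria)", 70.443),
    ("GOLD ecosystem (Forest Soil -> Soil)", 43.842),
    ("ENVO (Unclassified)", 40.887),
    ("EMPO (Unclassified)", 46.305),
]

for label, percent in reported:
    print(f"{label}: ~{members_from_percent(percent)} of {FAMILY_SIZE}")
```

Under that assumption the four values correspond to roughly 143, 89, 83 and 94 sequences respectively.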




Multiple Sequence Alignments

Full Alignment
Alignment of all the sequences in the family.



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 32.43%, β-sheet 0.00%, Coil/Unstructured 67.57%

Predicted 3D Structure

pTM-score: 0.38

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
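
The low-quality flag above is a simple threshold on the pTM-score. A minimal sketch of that rule, assuming only the 0.7 cutoff quoted in the note (the function and constant names are hypothetical and not part of NMPFamsDB):

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score, threshold=LOW_CONFIDENCE_PTM):
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# The family's reported pTM-score is 0.38, so it is flagged as low confidence.
print(is_low_confidence(0.38))
```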



Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)




Phylogeny

NCBI Taxonomy

Pie chart of family members by NCBI taxonomy. Labels: All Organisms, Unclassified. Slice values: 70.4%, 29.6%.

Associated Scaffolds






Environmental Properties

Associated Habitat Types

Pie chart of associated habitat types. Habitat labels: Bog Forest Soil; Freshwater Sediment; Watersheds; Soil; Vadose Zone Soil; Terrestrial Soil; Tropical Forest Soil; Surface Soil; Peatlands Soil; Agricultural Soil; Grasslands Soil; Hardwood Forest Soil; Tropical Peatland; Bog; Forest Soil; Corn, Switchgrass And Miscanthus Rhizosphere; Switchgrass Rhizosphere; Palsa; Arabidopsis Rhizosphere; Populus Rhizosphere; Corn Rhizosphere; Miscanthus Rhizosphere; Boreal Forest Soil. Labelled slice values: 6.4%, 3.4%, 43.8%, 3.9%, 4.4%, 4.4%, 5.4%.



Associated Samples


Geographical Distribution




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI20277J16323_1004841 3300001647 Forest Soil MPDHSNITHPLLRLGRLPLGPGLRDLVPGTAGSDLVVTATVAVPDSSVSQAYAWDVAWTDAVREASQLGADPDTAQALERGAGTA
JGIcombinedJ26739_1007456222 3300002245 Forest Soil MADAPMITLPLRRLGRLTLDPGLRAFQPSSNGSGLVVTAMVAAADSPANRSHEWDVAWADAAREASRLGADAETAGVLVGGAG
Ga0062389_1023307802 3300004092 Bog Forest Soil MPDKPMITLPLLRLGRLQLSPGLRALQPGSAAASGLVVTATVLVPDSPADRQYAWDVAWADAVREAGQLGADQGTARALSGGAGQPVTTGTRVVV
Ga0066672_108175991 3300005167 Soil MPDAPNTLPLLRLGRLALDPGLRALQPGHDASGLVATATVEVPDSVSEPEYAWDVALADAAREAGTLGADERTA
Ga0066388_1039054281 3300005332 Tropical Forest Soil MPTVPTTRPLLRLGRLEPDPGLRALQPGSAASGLVVTATVEVPESVSEQPYAWEVAWADAVREAGQLGADERTAQALA
Ga0070660_1013317392 3300005339 Corn Rhizosphere VEAIERPLLRLGRLELDAGLRALRPGPAVSDLVVTAAVEVPVSASEPQYAWEVAWADAVREAEALGADQR
Ga0070689_1012464832 3300005340 Switchgrass Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRT
Ga0070687_1015421011 3300005343 Switchgrass Rhizosphere VEAIERPLLRLGRLELDAGLRALRPGPAVSDLVATAAVEVPVSGSEPQYAWEVAWADAVREAEALGAGPRTAQVLAAGSGVVPADGSRV
Ga0070692_104077991 3300005345 Corn, Switchgrass And Miscanthus Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAG
Ga0066682_107288091 3300005450 Soil MPDAPNTLPLLRLGRLALDPGLRALQPGHDASGLVVTATVEVPDSVSEPQYAWDVALADAAREAGTLGADERTAQALAAGAGN
Ga0070731_111827361 3300005538 Surface Soil MPHSPNTLALLRLGQLELSPALRAFRPGPAASDLVVTATVEVPDSVSEPGHAWDVAWADAMREASALGADERTARVLPAGAGNAVAGGTWVVVAAHGQVLLARWLPNGAA
Ga0070732_103159991 3300005542 Surface Soil MPDAPNTLPLLRLGRLELDPGLRALQPGQDASDLVATATVEVPDSVSEPQYAWDVALADAAREAGKLGADERTAQALA
Ga0070672_1018136822 3300005543 Miscanthus Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQR
Ga0070762_108278421 3300005602 Soil MPDAPNTLPLLRLGRLVLDPELRALQPGHDASGLVATATVEVPDSVSEPQYAWDVALADAAREAGTLGADERTAQALAAGAGNAL
Ga0070763_104424121 3300005610 Soil MPHDPNTLPLLRLGRLELDPGLRALQPGQDASDLVATATVEVPDSESEPQHAWDVALADAAREAGKLGADERTAQALA
Ga0070764_111093362 3300005712 Soil MPHSPNNLPLLRLGRLELGPELRAFQPGSSASDLVVTATVEVPDSPSEQQNAWDVAWADAIREASGLGADERTARALAAGAGDSVAGGTWVVVAAHGQVLLAR
Ga0066903_1003306461 3300005764 Tropical Forest Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLAQPADQPQAWDVTWADAVREAGQLGADQATVQALAGS
Ga0068858_1024780561 3300005842 Switchgrass Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRTAQ
Ga0070766_102698122 3300005921 Soil MPDAPLAHPLLKLGRLELDPALRAFQPGPAVSDPVVTATVEVPDSASERQQAWDVAWADAGREAIELGAGQR
Ga0070717_114567931 3300006028 Corn, Switchgrass And Miscanthus Rhizosphere VEAIDRPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWTDAGREAEALGAGQRTAQALAAGSGAAP
Ga0075028_1009238782 3300006050 Watersheds MPDAPNTLPLLRLGRLALDPGLRALQPGHDASSLVVTATVEVPDSVSEPQYAWDVALADAAREAGTLGADERTAQALAAGAGNALAGGT
Ga0075029_1006870451 3300006052 Watersheds MPDKPMITLPLLRLGRLQLSPGLRALQPGSAAASGLVVTATVLVPDSPADRQYAWDVAWADAVREAGQLGADQGTARALSGGAGQP
Ga0075017_1011131702 3300006059 Watersheds MITLPLLRLGRLQLDPALRVLQPGSAASGLVVTATVAVPDTPADRSYAWDVAWADAAREAGRLGADQGTVQALAGGAG
Ga0070765_1013014921 3300006176 Soil MPHDPNTLPLLRLGRLELDPGLRALQPGQDASDLVATAMVEVPDSVSEPQEAWDVALADAAREAGKLGADERTAQALA
Ga0075433_100271231 3300006852 Populus Rhizosphere MPEGQSALPLLKLGRLELGPGLRALQPGAASGLVVTAMIEVPDSASDRQYAWEVAWSDAAREASGLGAGQRTAE
Ga0099828_105791151 3300009089 Vadose Zone Soil MPDKPMITFPLLRLGRLQLDPGLRALQPGSAASGLVVTATVPVPDSSADRQYAWDVAWADAAREAGQLGADQRTVRALPSGAGKALMSGTRVVV
Ga0066709_1044221371 3300009137 Grasslands Soil VEAIERPLLRLGRLELDAGLRALRPGPAASDLVVTAAVEVPVSGSEPQYAWEVAWADAVREAEALGAGQRTAQVLAAGSGVV
Ga0105237_103784823 3300009545 Corn Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWDDAGREAEALGAGQRTAQALAAGSGAAPADG
Ga0116216_102209402 3300009698 Peatlands Soil MAGKPMITLPLLRLGRLQLDPALRVLQPGSAASGLVVTATVAVPDTPADRSYAWDVAWADAAREAGRLGAD
Ga0126382_120966991 3300010047 Tropical Forest Soil MPEGRSAFPLLRLGRLELGPGLRALQPGTSASGLVVTAMVQVPDSGSEQQYAWDVAWSDAAREATGLGADQRTAEALPVAAPTAPAPPTSAVPAPAG
Ga0126373_102334074 3300010048 Tropical Forest Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPEAWDVTWADAVREAGGLGADQATVQALAGSAGDALGSGPTRAVGG
Ga0126372_125054472 3300010360 Tropical Forest Soil MPDSTTITHPLLRLGRLTLGPGLRGLAPGSSASGLMVTATVTVPESPVSRPYAWDVAWADAVREASQQGAHPVTA
Ga0126372_127666502 3300010360 Tropical Forest Soil MSDELIAFPLLRLGRLQLEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAARQAGELGADQRTAQALAAGAGRMRAA
Ga0126378_120066771 3300010361 Tropical Forest Soil MSDASIAFPLLRIGRLELGRGLRALQPGSAPAGLVVTAMVEVPESVSEPQYSWDVAWADAVRQAGELGADQRTAQA
Ga0126381_1050291851 3300010376 Tropical Forest Soil MSNAPIAIPLLRLGRLELGPGLRALQLDPAASDLVVTATVETPESASEPQYAWDVAWADAVRAAETLGADQRTAQALA
Ga0136449_1038387352 3300010379 Peatlands Soil MVGKPMITLPLLRLGRLQPDPALRVLQPGSAASGLVVTVTVAVPDTPADRSYAWDVAWADAAREAGRLGADQGTVQALAGGA
Ga0134126_123653282 3300010396 Terrestrial Soil VESIERPLLRLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRSAQALAA
Ga0126383_131567761 3300010398 Tropical Forest Soil MSDASIAFPLLRIGRLELGRELRALQPGSAPAGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALATGAGRMRAGGTRVVVAAHGAVLLAR
Ga0134121_113709382 3300010401 Terrestrial Soil MPDAPNSLPLLRLGRLVLDAGLRALQPGHDASGLVATATVEVPDSTSEPEYAWDVALADAAREAGTLGADERTAQALAAGAG
Ga0126350_120106025 3300010880 Boreal Forest Soil MPDPLNITHPLLRLGRLPLGPGLHDLAPGSAAPDLVVTATVAVPDSSVSQAYAWDVAWADAVREAGQRGADPDTAQALERGAG
Ga0137383_100692411 3300012199 Vadose Zone Soil MPDAPNPLPLLRLGRLELDPGLRALQPGQDASGLVTTATVEVPDSVSEPQHAWDVALADAVREAGKLGADERTAQALAVGAGNAL
Ga0137382_105422872 3300012200 Vadose Zone Soil MPDASNPLPLLRLGRLELDPGLRALRPGQGASDLVATATVEVPDSVSEPQNAWDVALADAAREAGKLGVDERTAQAIVAGAGNALAGGTWIVVAAHGQVLL
Ga0137376_100762951 3300012208 Vadose Zone Soil MPDASNPLPLLRLGRLELDPGLRALRPGQGASDLVATATVEVPDSVSEPQNAWDVALADAAREAGKLGVDERTAQAIVAG
Ga0137378_106400551 3300012210 Vadose Zone Soil MPDASNPLPLLRLGRLELDPGLRALRPGQGVSDLVATATVEVPDSVSEPQNAWDVALADAAREAGKLGVDERTAQAIVAGAGNALAGGTWIVV
Ga0137386_100147321 3300012351 Vadose Zone Soil MPDAPNPLPLLRLGRLELDPGLRALQPGQDASGLVTTATVEVPDSVSEPQHAWDVALADAVREAGKLGADERTAQALAVGAGNALAGGTWIVVAAHGQVLLARRLPHGAAAPEVRVGPLP
Ga0137384_100467971 3300012357 Vadose Zone Soil MPDAPNPLPLLRLGRLELDPGLRALQPGQDASGLVTTATVEVPDSVSEPQHAWEFALADAVREAGKLGADERTAQALAVGAGNALAGGTWIVVAAHGQVLLARR
Ga0157320_10083062 3300012481 Arabidopsis Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGADQ
Ga0137398_111490442 3300012683 Vadose Zone Soil MPEGRSALPLLRLGRLELGPGLRALQPGPAASGLVVTAMIQVPDSGSERQYAWDVAWSDAAREASRLGADQRTAEALAAG
Ga0137413_102579951 3300012924 Vadose Zone Soil MPDAPLAHPLLKLGRLELDPALRAFQPGPAVSDLVVTATVEVPDSASERQQAWDVAWADAGREAIELGAGRRTAEALAAG
Ga0182016_107886182 3300014493 Bog MASKTTITHPLLRLGRLQLDPALRALQPGSAAPSLVVTATVAVPDTSGDRSYAWDVAWADAVREAGRLGADQGTAQALAVGAGQAQPGGTRVVV
Ga0182036_103859551 3300016270 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQA
Ga0182033_112283002 3300016319 Soil MSSASIALPLSRLGRLELDPGLRALQSGPAGSDLVVTATVELPESASERQHAWDVAWADAVRQAGRLGADQRTAQALSTGAGNAL
Ga0182039_109843772 3300016422 Soil MPDSTTITHPLLRLGRLTLGPGLRGLSPGSSESGLMVTATVAVPDSAVSRSYAWDVAWADAVREAGQLGADPATAQALERGAGWVPGGDTQVVVAAHGEGADLADAA
Ga0187802_104037762 3300017822 Freshwater Sediment MPHAPDNVALLRLGRLELDPGLRALQPGSAASGLVVTATVEVPESVSEPQYAWDVAWADAVREASELGADEPTARALAAGAG
Ga0187807_10340661 3300017926 Freshwater Sediment MPHAPDNVALLRLGRLELDPGLRALQPGSAASDLVVTATVEVPESGSEPGYAWDVAWADAMREASELGADERTARALIAGAGNVVAGGTWVVVAAHGQVLLARWLPHGSAAAS
Ga0187809_100427271 3300017937 Freshwater Sediment MADAPTITLPLLRLGRLELGPGLRDLQPDSAATGLVVTATVPVPDQPADQPQTWGVAWADAVREAGQLGADQATAQALSGGAGDALAGGTRA
Ga0187808_101051882 3300017942 Freshwater Sediment MPDSPKITHPLLRLGRLALGPGLHGLPPGSAESGLVVTATVAAPDAPVSRSYAWDVAWADAVREARQLGADPATAQALERGAGAVSGG
Ga0187817_100293341 3300017955 Freshwater Sediment MPDHPNITHPLLRLGRLPLGPGLRDLAPGSGESGRVVTAAVAVPDSPSSRAYAWDVAWADAVREASLQGADPATAQALER
Ga0187781_106425542 3300017972 Tropical Peatland MPDSPIITHPLLRLGRLALGPGLHGLTPGSAGSGPVVTATVAAPDAPVSRPYAWDVAWADAVREASQLGADPATAQALERGAGAVSG
Ga0210399_106449372 3300020581 Soil MEAIAVPLLRLGRLELDARLRGLRPGLAASDLVVTATVEVPGSGSERQYAWGVAWADAAREAEALGAGQGTAQTLATGAGTAPPDG
Ga0210395_102666661 3300020582 Soil MPDHSNITHPLLRLGRLPLGPGLRDLVPGTAGSDLVVTATVAVPDSSVSQAYAWDVAWTDAVREASQLGADPDTAQALERGAGTASAGVAQV
Ga0210405_102171401 3300021171 Soil MPDHPNITHPFLRLGRLPLGPGLRGLAPGLPASGLMVTATVSVPDSATSRGYAWDVAWADAVREASQQGADPATAQ
Ga0210408_106050182 3300021178 Soil MPDHPNITHPFLRLGRLPLGPGLRGLAPGLPASGLMVTATVSVPDSATSRGYAWDVAWADAVREASQQGADPATAQALEGGAGRVTGGDT
Ga0210396_104564481 3300021180 Soil MPASPNSTHPLQRLGRLSLSPALRGLEPDPAGSDLAVTATVAVPDAPVSRSSAWEVAWADAAREAARLGADPD
Ga0210396_115946892 3300021180 Soil MAGKTTITLPLLRLGRLQLDTALRALPLGPAASSPVVTATVSVPDTPVDRSYEWDVAWADAARQASELGADQATA
Ga0210393_111639472 3300021401 Soil MPDHSNITHPLLRLGRLPLGPGLRDLTPGSTGTDPVVTATVAVPDSSVSQAYAWDVAWADAVREAGQRGADPDTAQALERGAGTASAGGAQ
Ga0210393_115332271 3300021401 Soil MAGKTTITVPLLRLGRLQLDPALRALPLGSASSGLVVTATVAVPDAPVDRSYEWDVAWADAARQASELGADQATAEALPTGAGQPVAGG
Ga0210389_105843532 3300021404 Soil MEAIAVPLLRLGRLELDARLRGLRPGPAASDLVVTAAVEVPESESERQYAWDVAWADATREAEVLGAGQRTARALATGAGTAPPDGSRV
Ga0210391_110472932 3300021433 Soil MPDHSNITHPLLRLGRLPLGPGLRDLVPGTAGSDLVVTATVAVPDSSVSQAYAWDVAWTDAVREASQLGADPDTAQALEH
Ga0210392_104920472 3300021475 Soil MAGKTTITVPLLRLGRLQLDPALRALPLGAASSGLVVTATVAVPDAPVDRSYEWDVAWDDAAREAAQLGADQDTAQVLATGAG
Ga0210392_108651211 3300021475 Soil MPDHSNITHPLLRLGRLPLGPGLRDLVPGTAGSDLVVTATVAVPDSSVSQAYAWDVAWADAVREAGQRGADPGTAQALERGAGTASAGGVQVV
Ga0210402_101571043 3300021478 Soil MPDAPNALPLLRLGRLALDPGLRALQPGHDASDLVATATVEVPDSVSEPQYAWDVALADAAREAGTLGADERT
Ga0247667_10400941 3300024290 Soil VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRNA
Ga0179591_10470373 3300024347 Vadose Zone Soil MEAIERPLLRLGRLELDAGLRALRPGPAASDLVVTAAVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRTAQVLAAGSGGRA
Ga0207685_100108175 3300025905 Corn, Switchgrass And Miscanthus Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRTAQALAAGS
Ga0207684_117387731 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MPDAPNTLPLLRLGRLVLDPELRALQPGHDASGLVATATVEVPDSTSEPEYAWDVALADAAREAGTLGADERT
Ga0207664_105152151 3300025929 Agricultural Soil MPASTNSTHALQRLGRLALGPELHGLVPSGPDVVVTATVTVPDAPVSRGNPWEVAWADAAREATQLGADPDTAKALERGAGAAAAGGTRVVVALHGVVGLAWWLPPGTAESSV
Ga0207644_101761603 3300025931 Switchgrass Rhizosphere VEAIERPLLLLGRLELDAGLRALRPGPAASDLVVTATVEVPVSDSEPQYAWEVAWADAGREAEALGAGQRTAQALAAGSGA
Ga0208369_10128991 3300026998 Forest Soil MPDHSNITHPLLRLGRLPLGPGLRDLAPNPAGSDLVVTATVAVPDSSVSQAYAWDVAWADAVREAGQRGADPDTAQALERGAGTASAG
Ga0207762_10646521 3300027063 Tropical Forest Soil MPHALDTLPLLRLGRLELGPELRAFAPGSAASDLVVTATVEVPDSVSEPQYAWDVAWADAVREASGLGADEGTARALAAGAG
Ga0208860_10392981 3300027076 Forest Soil MPDSPNPLPLLRLGRLELDPGLRALQPGQDASDLVATAMVEVPDSVSEPQEAWDVALADAAREAGKLGADERTAQALVAGAGK
Ga0208099_10639801 3300027096 Forest Soil MPDAPNTLPLLRLGRLELDPGLRALQPGQDASDLVATATVEVPDSESEPQHAWDVALADAAREAGKLGADERTAQALAAGAGNALTGGTW
Ga0208608_1155132 3300027165 Forest Soil MAGKTTITHPLLRLGRLQLDPALRALQPGSAGSGLVVTATVAVPDTPDQGSYTWDVAWADAGREAGRLGADQGTAQ
Ga0208992_10379801 3300027176 Forest Soil VEAIERPLLGLGRLELDAGLRALRPGLAASDLVVTATVEVPVSDSEPQYAWDVAWADAGREAEALGAGQRTAQVLAAGSGTVPADGSR
Ga0208241_10756551 3300027297 Forest Soil MPDSPNPLPLLRLGRLELDPGLRALQPGQDASGLVATATVEVPDSESEPQHAWDVALADAAREAGKLGADERTAQALAVGAGNSLAGGTW
Ga0209074_101653112 3300027787 Agricultural Soil MPDAPNPLPLLRLGRLVLDPGLRGLQPGHDASGLVATATVEVPDSTSEPEYAWDVALADAAREAGTLGADERTVQALAAGAGNALAGGTSI
Ga0209580_103744211 3300027842 Surface Soil LLRLGRLELDPGLRALRPGPAASDLVVTATVEVPESVSEPQYAWDVAWADAMREARELGVDERTARALV
Ga0209693_103099863 3300027855 Soil MPDAPNTLPLLRLGRLELDPGLRALQPGQDASDPVATATVEVPDSVSEPQYAWDVALADAVREAGKLGADERTAQALAVGAGKALAGGTWIVVAAHGQVLL
Ga0209579_105405221 3300027869 Surface Soil MPHSPNTLALLRLGQLELSPALRAFRPGPAASDLVVTATVEVPDSVSEPGHAWDVAWADAMREASALGADERTARVLPAGAGNAVAGGTWVVVAAHGQVLLARWLPNGAAA
Ga0209283_104544751 3300027875 Vadose Zone Soil MPDKPMITFPLLRLGRLQLDPGLRALQPGSAASGLVVTATVPVPDSSADRQYAWDVAWADAAREAGQLGADQRTVRALPSGAGKALMSGT
Ga0209590_109786811 3300027882 Vadose Zone Soil MITLPLLRLGRLQLDPGLRALQPGSAASGLVVTATVPVPDSSADRQYAWDVAWADAAREASKLGADQGTAQALAG
Ga0209275_102514762 3300027884 Soil MPDHSNITHPLLRLGQLPLGPGLRDLTPGSAPSDLVVTAAVAVPDSPVSQSYAWDVAWTDAVREAGEQGADPATAQALEGGAGAAAL
Ga0209275_109220031 3300027884 Soil MEAIAVPLLRLGRLELDARLRGLRPGPAAPDLVVTAAVEVPESESERQYAWDVAWADATREAEVLGAGQRTARALATGAGTAPPDGSRVV
Ga0209380_102238941 3300027889 Soil MPDAPLAHPLLKLGRLELDPALRAFQPGPAVSDPVVTATVEVPDSASERQRAWDVAWADAGREAIELGAGQR
Ga0209488_108734432 3300027903 Vadose Zone Soil MSVLPAGQGVRCDGGMPDAPLAHPLLKLGRLELDPALRAFQPGPAVSDLVVTATVEVPDSASERQQAWDVAWADAGREAIELGAGRRTAEALAAGAGNASADGSRVVVA
Ga0209006_112247751 3300027908 Forest Soil MPDPLNTTHPLLRLGRLPLGPGLRDLAPGSAAPDLVVTATVAVPDSSVSRAYAWDVAWADAVREAGQRGADPDTAQALERGAGKAS
Ga0307289_103637721 3300028875 Soil VVAIERPLLRLGRLELDAGLRALRPGPAVSDLVVTAAVEVPVSGSEPQYAWEVAWADAVREAEALGTDQRTAQALAAGSGVVPADGSRVVVAARGEVLLTRWLPPGTV
Ga0302235_102682282 3300028877 Palsa MPANPNITHPLQRLGHLSLTPGLRELAPDPAQSGLAVTATVAVPGTPVSRGTAWEVAWADAAREAERL
Ga0308309_106258892 3300028906 Soil MPHDPNTLPLLRLGRLELDPGLRALQPGQDASDLVATAMVEVPDSVSEPQEAWDVALADAAREAGKLGADERTAQALAAGAGNALAGG
Ga0308309_111291821 3300028906 Soil MPDDPNTLPLLRLGRLELDPGLRGLQPGHGASGLVATATVEVPDSVSEPQHAWDVALADAAREAGKLGADERTAQALAAGAGN
Ga0302181_100679581 3300030056 Palsa MTGKTTITHPLLRLGRLQLDPALRALQPGSAGLVVTATVAVPDTSAERSYAWDVAWADAAREAGRLGADQG
Ga0302184_103997052 3300030490 Palsa MPDHSNITHPLLRLGQLPLGPGLRDLTPGSAPSDLVVTATVAVPDSSVSQSYGWDVAWTDAVREAGEQGADPATAQALEG
Ga0310037_100648801 3300030494 Peatlands Soil MPDHPNITHPLLRLGRLPLGPGLRALAPGSPASGLMVTATVSVPDSATSRGYAWDVAWTDAVREASQQGADPATAQALEGGAGRV
Ga0302324_1005640414 3300031236 Palsa MTGKTTITHPLLRLGRLQLDPALRALQPGSAGLVVTATVAVPDTSAERSYAWDVAWADAAREAGRLGADQGTAQALAAG
Ga0302324_1032603832 3300031236 Palsa MPDHSNITHPLLRLGQLPLGPGLRDLTPGSAPSDLVVTAAVAVPDSSVSQSYAWDVAWTDAVREAGEQGAD
Ga0318534_102497381 3300031544 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGAD
Ga0318534_104156432 3300031544 Soil MSSASIALPLSRLGRLELDPGLRALQSGPAGSDLVVTATVELPESSSERQHAWDVAWADAVQQAGRLGADQRTAQALATG
Ga0318541_101089861 3300031545 Soil MPEGRNALPVLRLGRLELGPGLRALQPGQAASGLVATAILQMPDSGSERQYAWDVAWSDAAREAGTLGADQRT
Ga0318538_103680062 3300031546 Soil MPEGRNALPVLRLGRLELGPGLRALQPGQAASGLVATAILQMPDSGSERQYAWDVAWSDAAREASGLGADQRTAEAL
Ga0318573_104816361 3300031564 Soil MPSVPVTRPLLRLGRLELDPGLRAFQPGSAASGLVVTATVEVPESVSEQQYAWDVAWADAVRQAGQLGADER
Ga0318574_100417295 3300031680 Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPGAWDVTWADAVREADGLGADQATVQALAGSAGDA
Ga0318574_101495092 3300031680 Soil MPSVPITLPLLRLGRLELGPGLRAFQPGPAASDLVVTAAVEVPESVSEPQYAWDVAWADAVREASQLGADQRTAQALAGGAGRVLAGGTRVVVAADGE
Ga0318574_107066581 3300031680 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDGGSRVVVA
Ga0310686_1112474121 3300031708 Soil MTGKTEITLPLLKLGRLSLDPALRARPPGSAASGLVVTATVPVPDAPTERQYAWEVAWADAVRQAGQLGADADTAEALAVGADKPLAGGT
Ga0310686_1156023211 3300031708 Soil MPHAPDNVALLRLGRLELGPALHDLQLGPAASDLVVTATVEVPDSESEPEYAWDVAWADAVREASELGADERTAQA
Ga0318496_103539851 3300031713 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQA
Ga0318496_103800901 3300031713 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALA
Ga0310813_121172431 3300031716 Soil VEAIERPLLRLGRLELDAGLRALRPGPASDLVVTATVEVPVSDSEPQYAWDVAWADAGREAEALGAGQRTAQVVAAGAGTVPADG
Ga0318500_101536342 3300031724 Soil MTRPLLRLGRLELDPGLRAVQPDSAASGPVVTATVEVPESVSERQYAWEVAWADAVREAGRLGADERTAQALAT
Ga0318500_102382772 3300031724 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDGGSRVVVAAHGEVLLARWLPA
Ga0318500_105233151 3300031724 Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPGAWDVTWADAVREADGLGADQATVQALAG
Ga0318501_103717961 3300031736 Soil MPERQSALPLLRLGRLELGPGLRALEPGPAASGLVVTAMVQVPDSGSERQYAWDVAWSDAAREAKRLGADQRTAEALAAGAGTALADRTRAV
Ga0318501_107623731 3300031736 Soil MSSASIALPLSRLGRLELDPGLRALRSGPAGSDLVVTATVELPESSSERQHAWDVAWADAVRQAGRLGADQRTAQALATGAGNALAGGSRVVVAAHGEVLLSRWLPA
Ga0318502_100294811 3300031747 Soil LPETGAAGWDGVMPDSTTITHPLLRLGRLTLGPGLRALAPGTAASDLVATATVAAPESPVSRPYAWDVAWADAVREASQQGADPTTAQALER
Ga0318502_110261291 3300031747 Soil MAEAMITLPLLRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQRGAEQ
Ga0318492_100096211 3300031748 Soil MPEGRNALPVLRLGRLELGPGLRALQPGQAASGLVATAILQMPDSGSERQYAWDVAWSDAAREAGTLGADQRTAEALAAGAG
Ga0318492_102354063 3300031748 Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPGAWDVTWADAVREADGLGADQATVQALAGSAGDALS
Ga0318494_101047001 3300031751 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAAREAETLGADQRTAQALAT
Ga0307475_115409442 3300031754 Hardwood Forest Soil LLRLGRLELDPGLRALQPGPAASDLVVTATVEVPESVSEPQYAWDVAWADAMREARELGVDERTARALVAGAGNAVAGGTWVVVAAHGQVL
Ga0318554_101311241 3300031765 Soil MPRVPITLPLLRLGRLELGPGLRAFQPGPAASDLVVTAAVEVPESVSEPQYAWDVAWADAVREASQLGADQRTAQALAGEY
Ga0318554_102729363 3300031765 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDGGSRVVV
Ga0318509_101087163 3300031768 Soil MADAPTITLPLLRLGRLELGPGLRDLQPGSAATGLVVTATVPVPDQPADQPQAWGVAWADAAREAARLGADQDTAQVLVTGAGDALAGGT
Ga0318509_104835791 3300031768 Soil MPSVPITLPLLRLGRLELGPGLRAFQPGPAASDLVVTAAVEVPESVSEPQYAWDVAWADAVREASQLGADQRTAQALAGGAGRVLAGGTRVVVAADGEV
Ga0318521_100719224 3300031770 Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPGAWDVTWADAVREADGLGADQATVQALAGSAGDAL
Ga0318521_103601293 3300031770 Soil MAEAMITLPLLRLGRLQLDPGLRTLQPGSARSGLVVTATVPVPDQPADQPQAWDVAWSEAVREAGQLGADQATVQATS
Ga0318521_106535082 3300031770 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDEG
Ga0318546_101732773 3300031771 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDGGSR
Ga0318498_103649392 3300031778 Soil MPDSTTITHPLLRLGRLALGPGLRALAPGSAASGLVATATVAAPESPVSRPYAWDVAWADAVREASQQGADPA
Ga0318498_104099562 3300031778 Soil MSSASIALPLSRLGRLELDPGLRALQSGPAGSDLVVTATVELPESASERQHAWDVAWADAVRQAGRLGADQRTAQA
Ga0318566_102860982 3300031779 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGADQRTAQALATGSGNALDGGSRVV
Ga0318547_106347272 3300031781 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQA
Ga0318547_107422712 3300031781 Soil MPSVPMTRPLLRLGRLELDPGLRAVQPDSAASGPVVTATVEVPESVSERQYAWEVAWADAVREAGRLGADERTA
Ga0318557_103719671 3300031795 Soil MPERQSALPLLRLGRLELGPGLRALEPGPAASGLVVTAMVQVPDSGSERQYAWDVAWSDAAREASRLGADQRTAEALTAG
Ga0318557_104606202 3300031795 Soil MSNAPIAIPLLRLGRLELGPGLRALRLDPAASDLVVTATVEAPESASEPQYAWDVAWADAVREAETLGAD
Ga0318550_102811681 3300031797 Soil MPYAQGTNPLLRLGRLELDPELRAFQPGSAASDLVVTATVEVPDSVSEPQYAWDVAWADAVREAGELGADEGTARALAAGAGNAVAGGTWVVV
Ga0318497_100932791 3300031805 Soil MPETPIITHPLLRLGRLPLGPGLRGLSPGSGPADLVVTATVAVPDAPVSRSYAWDVAWADAVREASQLGTDPATAQALERGAG
Ga0318568_101556301 3300031819 Soil MAEAMITLPLLRLGRLQLDPGLRTLQPGSARSGLVVTATVPVPDQPADQPQAWDVAWSEAVREAGQLGADQATVQALAG
Ga0318567_101235304 3300031821 Soil MPETPIITHPLLRLGRLPLGPGLRGLSPGSGPADLVVTATVAVPDAPVSRSYAWDVAWADAVREASQLGADPATAQA
Ga0307478_101276773 3300031823 Hardwood Forest Soil MAGKTTITLPLLRLGRLQLDHALRALPLGPAASGPVVTATVSVPDTPADRSYEWDVAWADAARQASELGADQATAEALPTGAGQP
Ga0307478_103356602 3300031823 Hardwood Forest Soil MPDHSNITHPLLRLGRLPLSPGLRDLAPGSAGSDPVVTATVAVPDSSVSQAYAWDVAWADAVREAGQRGA
Ga0318564_102465093 3300031831 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVMTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALSGGAGDALAG
Ga0318564_103403391 3300031831 Soil MSAALIAFPLLRLGRLELGRGLRALRPGPAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQACAADVST
Ga0318499_101002863 3300031832 Soil MSDALIAFPLLRIGRLELGRGLRALQPGSASAGPVVTAMVEVPASVSEPQYSWDVAWADAVRQAGELGADQRTAQALAVGAGRT
Ga0310917_103819361 3300031833 Soil LPETGAAGWDGVMPDSTTITHPLLRLGRLTLGPGLRALAPGTAASGLVATATVAAPESPVSRPYAWDVAWADAVREASQQ
Ga0318517_104382572 3300031835 Soil MITHPLLRLGRLPLGPGLRGFSPGSSSADLVITATVAVPDAPVSRSYAWDVAWADAVREASQQGADPATAQALE
Ga0318511_100875821 3300031845 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKVVVAAHAAVL
Ga0318512_101295592 3300031846 Soil MPSVPVTRPLLRLGRLELGPGLRAFRPGSAASGLVVTATVEVPESVSEQRYAWDVAWADAVREAGQLGADERTAQALATGAGKAA
Ga0318512_105404302 3300031846 Soil MPERQSALPLLRLGRLELGPGLRALEPGPAASGLVVTAMVQVPDSGSERQYAWDVAWSDAAREAKRLGADQRTAEALAAGAGTA
Ga0318536_104236861 3300031893 Soil MPEGRNALPVLRLGRLELGPGLRALQPGQAASGLVATAILQMPDSGSERQYAWDVAWSDAAREASGRGADQRT
Ga0318536_105972592 3300031893 Soil MPYAQGTNPLLRLGRLELDPELRAFQPGSAASDLVVTATVEVPDSVSEPQYAWDVAWADAVREAGELGADEGTARALAAGAGNAVAGGTWVV
Ga0318522_100666942 3300031894 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRM
Ga0318522_103856831 3300031894 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSARSGLVVTATVPVPDQPQAWDVAWSEAVREAGQLGADQATVQALAGGAGDALAGG
Ga0318551_107801071 3300031896 Soil MITHPLLRLGRLPLGPGLRGFSPGSSSADLVITATVAVPDAPVSRSYAWDVAWADAVREASQQGADPATAQALERGAGA
Ga0318551_109241531 3300031896 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALSGGAGDALAGGTRAAGGTRV
Ga0318520_101168193 3300031897 Soil MPEGRNALPVLRLGRLELGPGLRALQPGQAASGLVATAILQMPDSGSERQYAWDVAWSDAAREAGTLGADQRTAEALAAGA
Ga0306921_108172742 3300031912 Soil MKGGGVAAGGRAGWDGSMPDSANITHPLLRLGRLTLGPGLRGLSPGSAESGLVVTATVAVPEAPVSRSYAWDVAWADAVREASQLGADPATAQALERGAGTVSG
Ga0310912_111143572 3300031941 Soil MSAALIAFPLLRLGRLELGRGLRALRPGPAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKVVVAAHA
Ga0310916_115982761 3300031942 Soil MPERQSALPLLRLGRLELGPGLRALEPGPAASGLVVTAMVQVPDSGSERQYAWDVAWSDAAREAKRLGADQRTAEALA
Ga0310910_103583221 3300031946 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGA
Ga0310909_111934822 3300031947 Soil MADAPTITLPLLRLGRLELGPGLRALQPGSAATGLVVTATVPMLGQPADQPGAWDVTWADAVREADGLGADQATVQALAGSAGEA
Ga0310909_116061182 3300031947 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKVVVAAHAAVLLGR
Ga0306926_105769653 3300031954 Soil MSDALIAFPLLRIGRLELGRGLRALQPGSASAGLVVTAMVEVPASVSEPQYSWDVAWADAVRQAGELGADQRTAQ
Ga0318530_102921941 3300031959 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKVVVAAHAAVLLGRWLP
Ga0318563_100951132 3300032009 Soil MPEGQGALPLLRLGRLELGPGLRAFQPAPPASGLVVTAMVQVPDSASERQYAWDVAWSDAAREASGLGVDHRTAEALVSAAG
Ga0318563_103809302 3300032009 Soil MSSASIALPLSRLGRLELDPGLRALQSGPAGSDLVVTATVELPESASERQHAWDVAWADAVRQAGRLGADQRTAQATSQACC
Ga0318549_103834111 3300032041 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALSGGAGD
Ga0318549_104943721 3300032041 Soil MPETPIITHPLLRLGRLPLGPGLRGLSPGSGPADLVVTATVAVPDAPVSRSYAWDVAWADAVREASQLGADPATAQALERGAGAVSAGDTQVV
Ga0318556_100338901 3300032043 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALSGGAGDALAGGTRAAGQGIS
Ga0318556_101443921 3300032043 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKVVVAAHA
Ga0318558_103360011 3300032044 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTARTHPP
Ga0318570_104274942 3300032054 Soil MPSVPITLPLLRLGRLELGPGLRAFQPGPAASDLVVTAAVEVPESVSEPQYAWDVAWADAVREASQLGADQRTAQA
Ga0318575_104867211 3300032055 Soil MSDALIAFPLLRIGRLELGRGLRALQPGSASAGPVVTAMVEVPASVSEPQYSWDVAWADAVRQAGELGADQRTAQALAV
Ga0318514_101170932 3300032066 Soil MPEGQGALPLLRLGRLELGPGLRAFQPAPPASGLVVTAMVQVPDSASERQYAWDVAWSDAAREASGLGVDHRTAEALVSAAGTA
Ga0318553_100231041 3300032068 Soil MADAMITLPLRRLGRLQLDPGLRTLQPGSAATGLVVTATVPVPDQPADQPQAWDVAWADAVREAGQLGADQATVQALSGCRR
Ga0306924_126065941 3300032076 Soil MPSVPMTRPLLRLGRLELDPALRAVQPDSAASGPVVTATVEVPESVSERQYAREVAWADAVREAGRLGA
Ga0318525_105001862 3300032089 Soil MDRDSVRTDRSIPLLKLGRLQLGPGLRALQPGPAPSGLVVTATVPVPQSSANRQYAWDVAWADAAREAGRLGADEATAQALPGGAGDAFVGGT
Ga0318577_102659881 3300032091 Soil MSDELIAFPLLRLGRLELEGGLRALRPGSAPSGLVVTAMVEVPESVSEPQYAWDVAWADAVRQAGELGADQRTAQALAAGAGRMRPGGTKV
Ga0311301_109834162 3300032160 Peatlands Soil MITHPLLRLGRLQLDPALRVLQPGSATSGLVVTATVAVPDTPADRSYAWDVAWADAAREAGRLGADQGTVQALAG
Ga0307471_1016736402 3300032180 Hardwood Forest Soil VEAIERPLLRLGRLELDAGLRALRPGPAVSDLVVTAAVEVPESESEPQYAWDVAWADAVREAEGLGAGQRTAQALASGAGTVPADGSR
Ga0306920_1023395723 3300032261 Soil MAEAMITLPLLRLGRLQLDPGLRTLQPGSARSGLVVTATVPVPDQPADQPQAWDVAWSEAVREAGQLGADQATVQALAGGAGDALAG
Ga0306920_1042482362 3300032261 Soil MPDIPIITHPLLRLGRLTLGPGLRGLAPGSAKSGLVVTATVGVPDSAVSRSYAWDVAWADAVREASQLGADPA
Ga0335079_108044801 3300032783 Soil MTDAPITLPLLRLGRVHLGQGLRALKPGSAASGLVVTATVELPESTSERQYAWNVAWADAVREASRLGADERTAQALATGAGKAIA
Ga0335078_101922995 3300032805 Soil MPHTPDTLPLLRLGRLELGPELRALQPSPAASDLVVTATVEVPDSPSQQQDAWDVAWADAMREAVELGADERTARALAVGAGDSVAGGTWLV
Ga0335080_104791594 3300032828 Soil MTRPLLRLGRLELDPGLRAVQPGSAASGPVVTATVEVPESVSERQYAWEVAWADAVREAGRLGADERTAQAL
Ga0335080_108037361 3300032828 Soil MADAITLPLLRLGRLQLDSGLRTLQPGSAKADLVVTATVPVPDQPADRPQAWGVAWADAVREADQLGADQATVQALSG
Ga0335070_110280972 3300032829 Soil MSDASIAFPLLKIGRLELEHGLRALRPGSAPSGLAVTATVEVPESVSEPQYAWDIAWADAVRQAGELGADQRTAQALAAGAG
Ga0335076_102226334 3300032955 Soil MPDTPITAPLLRLGRLELDPGLRALEPAPAGSDPVVTATVETPVSESEPQYAWDVAWADAAREAGQLGAD
Ga0335077_101898523 3300033158 Soil MRAAERVAGGTGAGWDGGMPENKNVTHPLLRLGRLPLGPGLRGLAPGSTESGLVVTARVTVPDSAVSRSYAWDVAWADAVREASQQGADPATAQALE
Ga0335077_102491995 3300033158 Soil MTRPLLRLGRLELDPGLRAVQPGSAASGPVVTATVEVPESVSERQYAWEVAWADAVREAGRLGADERTAQA
Ga0335077_107523121 3300033158 Soil MPTNPLLRLGRLALSPELRALQPGPDLVVTATVGVPDSASERQHAWDVAWADAAREAGQLGADPDTAQALPAGAGDA
Ga0310914_108981261 3300033289 Soil MPYAQGTNPLLRLGRLELDPELRAFQPGSAASDLVVTATVEVPDSVSEPQYAWDVAWADAVREAGELGADEDTAR
Ga0318519_107115181 3300033290 Soil MPSVPITLPLLRLGRLELGPGLRAFQPGPAASDLVVTAAVEVPESVSEPQYAWDVAWADAVREASQLGADQRTAQALAGGAGWVLA
Ga0334854_111227_435_656 3300033829 Soil MPDHSNITHPLLRLGQLPLGPGLRDLTPGSAPSDLVVTAAVAVPDSSVSQSYAWDVAWTDAVREASEQGADPAT
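
Rows of the Family Sequences table can be split into their four fields programmatically. The sketch below is not part of NMPFamsDB; it assumes that the sample taxon ID is always a 10-digit number, that habitat names start with a capital letter, contain no digits and end with a lowercase letter, and that sequences are written in uppercase only. It tolerates rows with or without whitespace between fields. The names `ROW_PATTERN` and `parse_family_row` are hypothetical.

```python
import re

# Hypothetical pattern: protein ID, 10-digit sample taxon ID, habitat, sequence.
# Whitespace between fields is optional, so run-together rows also match.
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)\s*"          # shortest prefix before the taxon ID
    r"(?P<taxon_id>\d{10})\s*"           # assumed: always exactly 10 digits
    r"(?P<habitat>[A-Z][^\d]*[a-z])\s*"  # assumed: no digits, ends lowercase
    r"(?P<sequence>[A-Z]+)$"             # uppercase amino-acid letters
)


def parse_family_row(line):
    """Split one table row into its fields; return None if it does not match."""
    match = ROW_PATTERN.match(line.strip())
    return match.groupdict() if match else None


row = parse_family_row(
    "Ga0066682_1072880913300005450SoilMPDAPNTLPLLRLGRLALDPGLRALQPGHDASGLVVTATV"
    "EVPDSVSEPQYAWDVALADAAREAGTLGADERTAQALAAGAGN"
)
print(row["protein_id"], row["taxon_id"], row["habitat"], len(row["sequence"]))
```

The habitat/sequence boundary is found from the last lowercase letter in the row, which is why the uppercase-only assumption for sequences matters; a habitat name ending in a capital letter would need a different rule.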




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice