NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F021631

Metagenome / Metatranscriptome Family F021631

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F021631
Family Type Metagenome / Metatranscriptome
Number of Sequences 218
Average Sequence Length 79 residues
Representative Sequence MKSLKLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGEYTVKWEGNGPNVELNILQGKKVVATMPA
Number of Associated Samples 196
Number of Associated Scaffolds 218

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 98.61 %
% of genes near scaffold ends (potentially truncated) 95.41 %
% of genes from short scaffolds (< 2000 bps) 86.70 %
Associated GOLD sequencing projects 188
AlphaFold2 3D model prediction Yes
3D model pTM-score0.49

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (84.404 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(13.303 % of family members)
Environment Ontology (ENVO) Unclassified
(22.477 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(52.294 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128.
12NP_03206750
2JGI12270J11330_101864141
3JGIcombinedJ26739_1011088061
4JGI25383J37093_100980962
5Ga0005483J37271_1106071
6Ga0058900_14305491
7Ga0058904_13764792
8Ga0058904_14211141
9Ga0058896_14269691
10Ga0058885_12499641
11Ga0058897_111699001
12Ga0068919_14186431
13Ga0068938_11414121
14Ga0066815_101194831
15Ga0066683_108749622
16Ga0066673_100166093
17Ga0066690_100625953
18Ga0066688_106145052
19Ga0066684_100273971
20Ga0066671_110806481
21Ga0070670_1008910391
22Ga0070671_1002879432
23Ga0070710_102413041
24Ga0070705_1011646851
25Ga0070694_1002697691
26Ga0070708_10001018710
27Ga0070699_1001447053
28Ga0070695_1001707691
29Ga0070695_1008148731
30Ga0070704_1014852191
31Ga0066704_101753653
32Ga0066702_101339311
33Ga0066708_101226401
34Ga0066708_103085893
35Ga0066706_103267691
36Ga0068863_1021390231
37Ga0068858_1013108653
38Ga0070717_112962681
39Ga0066651_105413011
40Ga0066696_100212301
41Ga0066652_1010919461
42Ga0075028_1001641443
43Ga0075028_1005704871
44Ga0075026_1001451802
45Ga0075019_103522332
46Ga0075018_102646182
47Ga0070716_1007616432
48Ga0070716_1007741032
49Ga0070712_1011326921
50Ga0066653_104523301
51Ga0066665_100793953
52Ga0066659_106377971
53Ga0066660_102881262
54Ga0079221_104513671
55Ga0079221_116376521
56Ga0075434_1012768312
57Ga0073928_104890941
58Ga0075436_1008067041
59Ga0075436_1011037771
60Ga0079219_108755051
61Ga0075435_1014471871
62Ga0075435_1016483021
63Ga0099791_101579622
64Ga0066710_1030844261
65Ga0099829_108507971
66Ga0099830_101121551
67Ga0099827_113791041
68Ga0099792_105980081
69Ga0105248_101710143
70Ga0116214_11594872
71Ga0116125_10995461
72Ga0116217_100129028
73Ga0116217_102106792
74Ga0099796_102367861
75Ga0134082_103625781
76Ga0134084_103564381
77Ga0134111_101752361
78Ga0126372_107774682
79Ga0126381_1016365581
80Ga0134126_102824511
81Ga0134121_116821101
82Ga0138554_1388341
83Ga0138555_10266771
84Ga0138570_10079361
85Ga0150983_112131921
86Ga0150983_119414861
87Ga0137392_108274161
88Ga0153952_11291391
89Ga0137388_111156921
90Ga0137382_103537982
91Ga0137376_101429931
92Ga0137376_113329821
93Ga0137387_105854481
94Ga0137387_107593121
95Ga0137371_114298751
96Ga0137384_101298073
97Ga0137358_100663513
98Ga0137394_101689471
99Ga0137419_104911381
100Ga0137407_102639821
101Ga0134077_100708861
102Ga0164304_106412881
103Ga0157371_100142417
104Ga0157374_100962703
105Ga0163163_103430532
106Ga0157377_106044491
107Ga0167668_11058461
108Ga0132258_131505321
109Ga0181507_11012841
110Ga0134083_103168062
111Ga0187825_102613481
112Ga0187847_106953671
113Ga0184608_102687601
114Ga0187883_103400131
115Ga0187851_100874262
116Ga0184621_103466921
117Ga0066662_100465733
118Ga0184595_1221941
119Ga0184588_1349391
120Ga0184644_13119271
121Ga0184642_15992401
122Ga0193715_10198031
123Ga0193727_11077521
124Ga0193751_10222373
125Ga0193731_11063671
126Ga0193755_10805771
127Ga0193734_10643791
128Ga0179594_103423791
129Ga0210401_102979451
130Ga0210404_102737351
131Ga0210400_115328311
132Ga0210408_113244731
133Ga0210408_114641241
134Ga0210388_103853761
135Ga0210397_113405251
136Ga0210392_106470432
137Ga0210409_102647381
138Ga0210409_104879482
139Ga0242647_10429351
140Ga0242667_10347961
141Ga0242659_10799652
142Ga0242663_10444201
143Ga0242669_11027531
144Ga0242668_10765961
145Ga0242671_11068121
146Ga0242673_10471891
147Ga0242665_101356073
148Ga0242665_103685451
149Ga0247546_1063811
150Ga0247543_1057581
151Ga0207707_113408511
152Ga0207660_116271071
153Ga0207646_117084521
154Ga0207668_103808862
155Ga0207677_103069682
156Ga0207702_122086281
157Ga0209234_11507282
158Ga0209236_10280761
159Ga0209236_12921441
160Ga0209055_10190405
161Ga0209155_11267102
162Ga0209687_10059105
163Ga0209152_101187152
164Ga0209801_11081961
165Ga0209473_10119851
166Ga0209804_10082265
167Ga0257180_10112811
168Ga0257180_10375501
169Ga0257171_10733101
170Ga0257169_10532891
171Ga0257165_10897111
172Ga0209160_10719032
173Ga0209160_10759201
174Ga0209056_105581241
175Ga0209219_11174751
176Ga0208827_11102131
177Ga0209217_11195031
178Ga0209009_10951661
179Ga0209446_10806791
180Ga0209178_14168391
181Ga0209689_10292275
182Ga0209167_108212111
183Ga0209579_101972002
184Ga0209283_108561021
185Ga0209488_103033631
186Ga0209583_100717681
187Ga0209069_105715491
188Ga0268265_100357234
189Ga0137415_102191981
190Ga0302278_102528361
191Ga0308309_104242952
192Ga0308309_109668421
193Ga0310037_100411732
194Ga0210272_12210961
195Ga0310038_103304111
196Ga0265462_115580251
197Ga0265462_123936891
198Ga0265461_132211421
199Ga0265770_10237502
200Ga0265779_1088991
201Ga0308193_10878422
202Ga0170823_158255531
203Ga0308179_10438542
204Ga0170819_109596892
205Ga0310686_11669948012
206Ga0307474_112847891
207Ga0307469_124522702
208Ga0307478_103147132
209Ga0306926_127838331
210Ga0307479_105022631
211Ga0316040_1201741
212Ga0307471_1001251494
213Ga0307471_1028152561
214Ga0335069_108978411
215Ga0335084_106077942
216Ga0335084_111300561
217Ga0335077_104453123
218Ga0326728_103403512
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 18.45%    β-sheet: 32.04%    Coil/Unstructured: 49.51%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

10203040506070MKSLKLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGEYTVKWEGNGPNVELNILQGKKVVATMPASequenceα-helicesβ-strandsCoilSS Conf. scoreSignal Peptide
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.49
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
84.4%15.6%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds





 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Groundwater Sediment
Peatland
Bog Forest Soil
Peatland
Freshwater Sediment
Iron-Sulfur Acid Spring
Groundwater Sediment
Watersheds
Soil
Soil
Vadose Zone Soil
Terrestrial Soil
Tropical Forest Soil
Glacier Forefield Soil
Grasslands Soil
Surface Soil
Peatlands Soil
Agricultural Soil
Soil
Grasslands Soil
Soil
Forest Soil
Soil
Hardwood Forest Soil
Soil
Soil
Forest Soil
Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Corn Rhizosphere
Bog
Peat Soil
Arabidopsis Rhizosphere
Corn Rhizosphere
Switchgrass Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Populus Rhizosphere
Switchgrass Rhizosphere
Corn Rhizosphere
Miscanthus Rhizosphere
Switchgrass Rhizosphere
Attine Ant Fungus Gardens
Switchgrass, Maize And Mischanthus Litter
3.2%7.8%10.6%5.5%12.4%13.3%6.0%6.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
2NP_032067502170459020Switchgrass, Maize And Mischanthus LitterMKALKISKGLLLGLALLLATSAFAANKGNLQVSDPVTVNGKQIGAGDYTVKWDGNGPNVELNILHGKNVVATV
JGI12270J11330_1018641413300000567Peatlands SoilMKFANVSKGLLLGLALLLATSAFAVSNRGSMELLDPVTVSGKQLPAGEYSVKWDGSGPNVELNILKGNKVVATTPA
JGIcombinedJ26739_10110880613300002245Forest SoilMKFAKFSKGLLLGLALVLATGAFAASNRGSVQIVDPVTVSGKQLRPGDYSVQWDGSGPNVELSIMQGKKVVATTPARLIDLSK
JGI25383J37093_1009809623300002560Grasslands SoilMKVSQMTKRLLLGLALLLATSAFASNKGSLQLNEAVNVIGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGS
Ga0005483J37271_11060713300002680Forest SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTVNGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAH
Ga0058900_143054913300004099Forest SoilMKFQSISKSLLVGLALLLATSAFAAAANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDAAV
Ga0058904_137647923300004100Forest SoilMKVSKISKGLLLGLTLLLATSVFAANKGQLQLNDPLTINGKQLAAGEYRLQWEGTGSSVELSI
Ga0058904_142111413300004100Forest SoilMKFQSVSKSLLLGLALLLATSAFAAANKGSLELPSAVTVSGKQLSPGDYSVKWDGNGPNVELSILQGSKVV
Ga0058896_142696913300004101Forest SoilMKLQSISKSLLVGLALLLATSVFAAAANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKK
Ga0058885_124996413300004116Forest SoilMKFANISKGLLLGLALLLATSAFAAANQGSMQLQDPVTVSGKQLRAGDYSVKWDGNGPNVQLSILKGNKVVATTPARLIDLNQKANNDAAVVKSNDDGSR
Ga0058897_1116990013300004139Forest SoilMKFQSISKSLLLGLALLLATSAFAATNKGSLQLANPVTVSGTQLSAGDYSVKWEGNGPSVELSILQGNKVVATAPPAWSI*
Ga0068919_141864313300004473Peatlands SoilMKFQGFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGSKVVATTPARLVDLSDKSSSDAALVKKN
Ga0068938_114141213300004592Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSDKSNS
Ga0066815_1011948313300005164SoilMSVSKVSKGLLLGLALLMATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVRWEGAGPNVELNILKGKNVVATVP
Ga0066683_1087496223300005172SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNK
Ga0066673_1001660933300005175SoilMKVSKMTKGLWLGLALLLATSAFASKKGSLQLSQAVNVNGKQLPAGDYTVKWHGSGTNVQASIMKGKNVVAT
Ga0066690_1006259533300005177SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNKVVATVPARLVDINQSASSDAAVVRKNED
Ga0066688_1061450523300005178SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNKVVATVPARLV
Ga0066684_1002739713300005179SoilMKVSKMTKGLWLGLALLLATSAFASKKGSLQLSQAVNVNGKQLPAGDYTVKWHGSGTNVQASIMKGKNVVATVPARLVDLDRTPGRDASV
Ga0066671_1108064813300005184SoilMKVSKMTKGLWLGLALLLATSAFASKKGSLQLSQAVNVNGKQLPAGDYTVKWHGSGTNVQASIMKGKNVVATVPARLVDLDRTPGRDASVITGNADGSRS
Ga0070670_10089103913300005331Switchgrass RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDSVTVNGKQLAAGDYTVKWEGAGPNVELNIL
Ga0070671_10028794323300005355Switchgrass RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYVVKWEGAGPNVELNILK
Ga0070710_1024130413300005437Corn, Switchgrass And Miscanthus RhizosphereMSVSKISKGLLLGLALLLATSVFAANKGTLQVNDPVTVNGKQLGSGEYTVRWDGAGPNVELNILKGKNVVATVPARMLELEQSPNRDAVVTSTNS
Ga0070705_10116468513300005440Corn, Switchgrass And Miscanthus RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYVVKWEGAGPNVELNILKGKNVVATVPARMVDLSRSPDRDSAVTVVNSDGRK
Ga0070694_10026976913300005444Corn, Switchgrass And Miscanthus RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELNILQGKNVVATVPARMVDLARSPDRDSAVTVVNSDGRK
Ga0070708_100010187103300005445Corn, Switchgrass And Miscanthus RhizosphereMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGAYTVKWEGNGPNVE
Ga0070699_10014470533300005518Corn, Switchgrass And Miscanthus RhizosphereMKASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIDAGEYTVKWDGNGPNVELNILRGKNVVATVPARMVDLDRTPSRDSSVTVVNE
Ga0070695_10017076913300005545Corn, Switchgrass And Miscanthus RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELNILKGKNVVATVPARMVDLARSPDRDSAVTVVNSDGRK
Ga0070695_10081487313300005545Corn, Switchgrass And Miscanthus RhizosphereMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTVNGKPLAAGEYTVKWDGAGPNVEVNIMKGKNV
Ga0070704_10148521913300005549Corn, Switchgrass And Miscanthus RhizosphereMKLSKVSKGLLLGLALLLATSAFAANKGSLMVSDPVTVSGKSLAAGEYTVKWEGNGPNVELNI
Ga0066704_1017536533300005557SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNKVVATVPARLVDINQSASSDAAVVRKNEDGSRS
Ga0066702_1013393113300005575SoilMKSSKLSKGLLLGLALLLATSVFAANKGSLQVSDEVTVSGKQLARGEYTVKWEGNGPNVELNILQGKKVVA
Ga0066708_1012264013300005576SoilMRISKLSKGLLLSLAVLLATSAFAANKGSLQISDTVNLAGKQLAPGNYTVKWEGSGPSVQASILQGKNVVATVPARLVDLDRAPGHDAAVTRRGENGSKSI
Ga0066708_1030858933300005576SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNKVVATVPAR
Ga0066706_1032676913300005598SoilMKASKISKGLLLGLALLLATSVFAINKGSLQVSDPVTVNGKQIGAGEYTVKWEGNGPD
Ga0068863_10213902313300005841Switchgrass RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELN
Ga0068858_10131086533300005842Switchgrass RhizosphereMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTVNGKPLAAGEYTVKWDGAGPNVEVNIMK
Ga0070717_1129626813300006028Corn, Switchgrass And Miscanthus RhizosphereMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGAYTVKWEGNGPNVELNI
Ga0066651_1054130113300006031SoilMKSSKLSKGLLLGLALLLATSAFAANKGSLQVSDEVTVSGKQLARGEYTVKWEGNGPNVELNILQGKKVVATTPARLIDLNRTADGDSA
Ga0066696_1002123013300006032SoilMTASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIPAGEYTVKWEGTGSNVELNILRGKSVVATVPARMI
Ga0066652_10109194613300006046SoilMTFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGNGPNVELNILQGKKVVATIPARLIDLDRSAPGNTSVIKRNEDGSK
Ga0075028_10016414433300006050WatershedsMTTSKISKGLLLGLALLLATSVFAATNKGSLQVTDPLTVNGKQLPAGDYTVKWDGAGPNVEL
Ga0075028_10057048713300006050WatershedsMSVSKISKGLLLGLALLLATSGFAANKGSLQVDHPVTVNGKPLAAGEYTVKWDGAGPNVELNIMKGKNVVATVPAHMLDLEQ
Ga0075026_10014518023300006057WatershedsMKVSKISKGLLLGLALLLATSVFAANKASLELNDPLTVNGKQLAAGSYSLKWQGTG
Ga0075019_1035223323300006086WatershedsMSVSKISKGLLLGLALLLATSGFAANKGSLQVDHLVTINGKQLAAGDYTVKWDGAGPNVELNILKGKNVVATVPAHMLDLE
Ga0075018_1026461823300006172WatershedsMSVSKISKGLLLGLALLLATSGFAANKGSLQVDHPVTINGKQLAAGDYTVKWDGAGPNVELNILKGKNVVAIVPAHMLD
Ga0070716_10076164323300006173Corn, Switchgrass And Miscanthus RhizosphereMKASKISKGLLLGLALLLATSAFAANKGNLQVSDPVTVNGKQIGAGDYTVKWDGNGPNVELNILHGKNVVATVPARMVDLDQTP
Ga0070716_10077410323300006173Corn, Switchgrass And Miscanthus RhizosphereMKVSKMTKSLLLGLALLLATSAFAANKGSLQLSNAANISGKQLAAGDYTVKWDGNGPNVQASIMKGKNVVATVPARLVDLDRAPGSDAAVITNNAPTVAAH*
Ga0070712_10113269213300006175Corn, Switchgrass And Miscanthus RhizosphereMSVSKISKGLLLGLALLLATSVFAANNKGSMQVTDSVTVNGKQLPAGEYTIKWDGAGPNVELNILRGKN
Ga0066653_1045233013300006791SoilMTFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGNGPNVELNILQGKKMVATIPARLIDL
Ga0066665_1007939533300006796SoilMKASKISKGLLLGLALLLATSVFAINKGSLQVSDPVTVNGKQIAAGEYTVKWEGNGPDVE
Ga0066659_1063779713300006797SoilMKASRMYKGLLLGLALLLATSAFAANKGSLKVNDPVTINGKQLAAGEYKVSWDGSGPSVELHIMQGKNVVATVPAKMVDLPRAASDDG
Ga0066660_1028812623300006800SoilMKASKISKGLLLGLALLLATSVFAINKGSLQVSDPVTVNGKQIGAGEYTVKWEGNGPDVELNILHGKNIVATVPAR
Ga0079221_1045136713300006804Agricultural SoilMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELNILKGKNVVATVPARMVDLARSPDR
Ga0079221_1163765213300006804Agricultural SoilMNASKMSKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIAPGEYTVKWEGNGPNVELNILSGKNVVAT
Ga0075434_10127683123300006871Populus RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELNILKGK
Ga0073928_1048909413300006893Iron-Sulfur Acid SpringMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNV
Ga0075436_10080670413300006914Populus RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAGPNVELNILKGKNVVATVPARMVDLARSPDRDSAVTVVN
Ga0075436_10110377713300006914Populus RhizosphereMKSNVSKCLALGAMLLLAVSAFASNKGSLTVPDAFTVNGKQLAAGEYTVKWEGSGPNV
Ga0079219_1087550513300006954Agricultural SoilMKASKISKGLLLGLALLLATSAFAANKGSLQVSDPVTVNGKQIGAGDYTVKWDGNGPNVELNILRGKNVV
Ga0075435_10144718713300007076Populus RhizosphereMKSNVSKCLALGAMLLLAVSAFASNKGSLTVPDAFTVNGKQLAAGEYTVKWEGSGPNVELSIEQ
Ga0075435_10164830213300007076Populus RhizosphereMKASKISKGLLLGLALLLATSSFAANKGSLQVTDPVTVNGKQIGAGNYTVKWDGNGPNVELNILRGRNVVATVPAR
Ga0099791_1015796223300007255Vadose Zone SoilMKSSKLSKGWLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLARGEYTVKWEGNGPNVELNILQGKKVVATAPARLI
Ga0066710_10308442613300009012Grasslands SoilMKVSKMFKGLPLGLALLLATSALAANQGSLQVSDPVTVSGKQLKTGDYTVKWEGNG
Ga0099829_1085079713300009038Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVELNIMKGKNV
Ga0099830_1011215513300009088Vadose Zone SoilMKVSKMYKGLLLGLALLLATNAFAANKGSLQVSDPVTVSGKQLAAGDYTVKWEGAGPNVELNILQGKNIVATVPARLIDLN
Ga0099827_1137910413300009090Vadose Zone SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYNLKWEGNGPNVELNILQGKKV
Ga0099792_1059800813300009143Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNIL
Ga0105248_1017101433300009177Switchgrass RhizosphereMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTVNGKPLAAGEYTVKWDGAGPNVEVNIMKGKNVVATVPARMLALDQSPNRDSVVT
Ga0116214_115948723300009520Peatlands SoilMKFANVSKGLLLVGLALLLATSAFAAANKGSMQLVDQVTVSGKQLPAGDYSVKLDGSGPNVELSIL
Ga0116125_109954613300009628PeatlandMKFANISKGLLLGLALLLATSALAATNKGSVQLQDSVTVSGKQLRAGEYSVKWDGSGPNVELSILKGNKV
Ga0116217_1001290283300009700Peatlands SoilMKFANVSKGLLLGLALLLATSAFAVSNRGSMELLDPVTVSGKQLPAGEYSVKWDGSGPNVELNILKGNKVVAT
Ga0116217_1021067923300009700Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSDKSNSDAALVKKNDDGSKSL*
Ga0099796_1023678613300010159Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAHMLDLEQSPTRDSVV
Ga0134082_1036257813300010303Grasslands SoilMTASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIPAGEYTVKWEGTGSNVELNILRGKS
Ga0134084_1035643813300010322Grasslands SoilMKFSNTSKGLVLGLALLLATSAFAANKGSLQVSDEVTVSGKQLARGEYTVKWEGNGPHVA
Ga0134111_1017523613300010329Grasslands SoilMKSSQLSKGLLLGLALLLATSAFGANKGSLQVSDPVTVSGKSLAAGEYTVKWEGNGPNVELNILQGKKMVATIPARLIDL
Ga0126372_1077746823300010360Tropical Forest SoilMTKSLWLGLALLLTTSAFAANKGSLQLREAVNLSGRQLAAGDYTVRWDGNGPNVQASIMKGKNVVATVPARLVDLDSKAVSDSVVVTGNADGSRTL
Ga0126381_10163655813300010376Tropical Forest SoilMKFQSISKSLVLGLALLLAGSAFAAANKGTLQLPNTVTVSGKQLSAGEYSVKWDGNGPNVEINILQGNKVVATAPARLVDLSQKQTADTAVVKNNADGTRS
Ga0134126_1028245113300010396Terrestrial SoilMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVKWEGAG
Ga0134121_1168211013300010401Terrestrial SoilMNASKFSKGLLLGLALLLATSAFAASKGPLQLTAPASVAGKQLAAGDYTVKWDGNGPS
Ga0138554_13883413300011049Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSDKSNSDAALVKK
Ga0138555_102667713300011075Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSD
Ga0138570_100793613300011087Peatlands SoilMKFQSISKSLLVGLALLLATSVFAAAANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDAAVVKNNDDGSRS
Ga0150983_1121319213300011120Forest SoilMKFQSISKSLLVGLALLLATSAFAAAANKGSMQLLDPVTVSGKQLPAGDYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDAAVVKNNDDGSRSLA
Ga0150983_1194148613300011120Forest SoilMKSQSILKSLVLGAALLLATGAFADANKGSMQLGNTVSVAGKQLSAGDYSVKWEGSGSNVQVSFLQGKKVVATASARLI
Ga0137392_1082741613300011269Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTVNGKQLASGTYSVTWDGAGPNVELNILKGKS
Ga0153952_112913913300012176Attine Ant Fungus GardensVEAVQAAQNRFPTFSGKKKELMKFAKFSKGLLLGLALVLATGAFAASNKGSLEVVDPVTVSGKQLRPGDYSVKWDGSGPNVELSIMQGKKVVATTPARLIDLNQTPN
Ga0137388_1111569213300012189Vadose Zone SoilMKVSKMFQGLLLGSALLLTTSAFAANKGSLQVLDPVTVSGKQLKAGDYAVKWEGNGPNV
Ga0137382_1035379823300012200Vadose Zone SoilMKSSKLSKGLLLGLALLLATSAFAANKGSLQVSDEVTVSGKQLARGEYTVKWEGNGPNVELNILQGKKVVATTPARLIDLNRTADGDSAVVRKNDDGSRTLA*
Ga0137376_1014299313300012208Vadose Zone SoilMKSSKLSKGLLLGLALLLATSAFAANKGSLQVSDTVNISGKSLAAGEYNVKWEGSGPN
Ga0137376_1133298213300012208Vadose Zone SoilMKSLKLSKGLLLGLALLLETSAFAANKGSLQVSDPVTVSGKQLAPGEYTVKWEGNGPNVELNILQGKK
Ga0137387_1058544813300012349Vadose Zone SoilMKVSKMFKGLLLGLALLLATSAFAASKGSLQVSDPGTVSGKQLAAGNYTVKWQGKGPNVELNILQGKNVVATVPARLIDLDRSSDSNAAVTKLNGD
Ga0137387_1075931213300012349Vadose Zone SoilMKLSTISKSLLLGMALLLATGAFAASKGSLQVQDPVTVSGKQLPAGDYQLKWNGKGPNVELNILKDNKVVATVPARLVDINQ
Ga0137371_1142987513300012356Vadose Zone SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGAYTVKWEGNGPNVELNILQGKKV
Ga0137384_1012980733300012357Vadose Zone SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSDKQLAPGAYTVKWEGNGPNVE
Ga0137358_1006635133300012582Vadose Zone SoilMKASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVYGKQIDAGEYTVKWDGNGPN
Ga0137394_1016894713300012922Vadose Zone SoilMKSSKISKGLLLGLALLLATSAFAANKGSLQVSDPVMVSGKQLAAGDYTVKWEGNGPNVELNILQGKKVVATIPARLIDLNRSADGNSAVVKRNDDGSRTL
Ga0137419_1049113813300012925Vadose Zone SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGAYTVEWEGNGPNVELNILQGKKVVATMPARLIDLNRS
Ga0137407_1026398213300012930Vadose Zone SoilMKSSKLSKGLLLGLALLLATSAFAANKGSLQVSDAVTVSGKSLAAGEYNLKWEGNGPNVELNI
Ga0134077_1007088613300012972Grasslands SoilMRVSKLSKGLLLSLAVLLATSAFAANKGSLQISDTVNLAGKQLAPGNYTVKWEGNGPSVQAS
Ga0164304_1064128813300012986SoilMSVSKISKGLLLGLALLLATSVFAANNKGSMQVTDSVTVNGKQLPAGEYTIKWDGAGPNVELNIMRGKNVVATVPARMVDLNQSPNRDSLITTVNSDGRKS
Ga0157371_1001424173300013102Corn RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYVVKWEGAGPNVELNILKGKNVVATVPARMVDLSRSPDRDSAVTVVNSDGR
Ga0157374_1009627033300013296Miscanthus RhizosphereMSVSKISKGLLLGLALLLATSVFAANNKGSMQVTDSVTVNGKQLPAGEYTIKWDGAGPNVELNILRGKNVVATVPARMVDLEQSPNRDSVITNVNSDGRKSL
Ga0163163_1034305323300014325Switchgrass RhizosphereMSVSKVSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLPAGDYVVKWEG
Ga0157377_1060444913300014745Miscanthus RhizosphereMSVSKISKGLLLGLTLLLATSVFAANKGSLQVSDPVTVNGKQIGAGDYTVKWEGNGPNVELN
Ga0167668_110584613300015193Glacier Forefield SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTISGKQLAAGAYTVKWDGAGPNVELNIL
Ga0132258_1315053213300015371Arabidopsis RhizosphereMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTVNGKPLAAGEYTVKWDGAGPNVEVNIMKGKNVVATVPARMLALDQSPNRDSVVTNTNSD
Ga0181507_110128413300016705PeatlandMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQDPVTISGKQLRAGDYSVKWDGNGPNVELS
Ga0134083_1031680623300017659Grasslands SoilMKVSQMTKRLLLGLALLLATSAFASNKGSLQLNEAVNVIGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDAAVTTNNADGS
Ga0187825_1026134813300017930Freshwater SedimentMKFANVFKGVLVGLALLLATSAFAASNKGSMQLLDPVTVSGKQLPTGEYSVKWDGNGPNVELSILRGNKVVATTPARLIDLSQKSNGDSAIVQQNGDG
Ga0187847_1069536713300017948PeatlandMKFANISKGLLLGLALLLATSALAATNKGSVQLQDSVTVSGKQLRAGEYSVKWDGSGPNVELSILKGNKVVATTPARLIDLNEKSNRDAAVV
Ga0184608_1026876013300018028Groundwater SedimentMKSSKTFKGLLLGLALLLATSAFAANKGSLMVSDPVTVSGKSLAAGEYSVKWEGNGPNVE
Ga0187883_1034001313300018037PeatlandMKFANTSKGLLLGLALLLATSAFAAANKGSMQLQDPVTVSGKQLHAGDYSVKWDGNGPNVELSIMKGNKVVATAPA
Ga0187851_1008742623300018046PeatlandMKFANTSKGLLLGLALLLATSAFAAANKGSMQLQDPVTVSGKQLHAGDYSVKWDGNGPNVELSIMKGN
Ga0184621_1034669213300018054Groundwater SedimentMSVSKISKGLLLGLALLLATSVFAANKGTLQVSDSVTVNGKQLAAGDYTVKWDGAGPNV
Ga0066662_1004657333300018468Grasslands SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGRQLAPGAYTVEWEGNGPNVE
Ga0184595_12219413300019166SoilMKFQSFSKSMLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGSKVVATTPARLVDLSDKSNNDAALVKKNDDG
Ga0184588_13493913300019186SoilMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQDPVTISGKQLRAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNHDAAVVQNNGDGTKS
Ga0184644_131192713300019269Groundwater SedimentMKFSNTSKGLLLGLALLLATSAFAANKGSLQVSDTVTVSGKSLAAGEYSVKWEGNGPNVELNILQGKKVVATIPARLIDLDRSATGNTSVIKRNGDGSKT
Ga0184642_159924013300019279Groundwater SedimentMTFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTISGKSLAAGEYSVKWEGNGPNVELNILQGKKVVATTPARLIDLDRSATGNTAVVKRNG
Ga0193715_101980313300019878SoilMTFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGNGPN
Ga0193727_110775213300019886SoilMRFSNTSKGLLLGLALLLATSAFAANKGSLQVSDTVTVSGKSLAAGEYSVKWEGNGPNVELNILQGKKVVATIPA
Ga0193751_102223733300019888SoilMKFSNTSKGLLLGLALLLATSAFAANKGSLQVSDTVNVSGKSLAAGEYNVKWEGNGPNVELNILQGKKVVATIPARLIDLDRSAPGNTSVVKRNEDGSK
Ga0193731_110636713300020001SoilMRFSNTSKGLLLGLALLLATSAFAANKGSLQVSDTVTVSGKSLAAGEYSVKWEGNGPNVELNILQGKKVVATIPARLIDLDRSATGNTSVIKRNGD
Ga0193755_108057713300020004SoilMKSLKLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGEYTVKWEGNGPNVELNILQGKKVVATMPA
Ga0193734_106437913300020015SoilMKVSKVSKGLLLGLALLLATSAFAANKGSLQVSDAVTVNGKQIAPGEYTVKWEGNGPNVELNILRGKN
Ga0179594_1034237913300020170Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAHMLD
Ga0210401_1029794513300020583SoilMKFQSISRSLLVGLALLLATSAFAAAANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPAR
Ga0210404_1027373513300021088SoilMKTSKMYKGLLLGLALLLATSAFAANKGSLKVNDPVTINGKQLAAGEYKVSWDGSGPSVELHIMQGKNVVATVPAKMVDLPRPAADDGAVVNTNGDG
Ga0210400_1153283113300021170SoilMKFANTSKGLLLGLALLLATSAFAAANKGSMQLQDPVTVSGKQLHAGDYSVKWDGNGPNVELSIMKGHKVVATAPARLIDLNEKSN
Ga0210408_1132447313300021178SoilMKFQSISKSLLLGLALLLATSAFAAANKGSLELPSAVTVSGKQLSAGEYSVKWEGNGP
Ga0210408_1146412413300021178SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVELNIMKGKNLVATVP
Ga0210388_1038537613300021181SoilMKFANISKGLLLGLALLLATSALAANSGSMQLQGPVTVSGKQLRAGEYSVKWDGNGPNVELSILKGNKVVATTPARLIDLNE
Ga0210397_1134052513300021403SoilMKFQSVSKSLLLGLALLLATSAFAAANKGSLELPSAVTVSGKQLSPGDYSVKWDGNGPNVELSILQGSKVVAT
Ga0210392_1064704323300021475SoilMKLQSISKSLLVGLALLLATTAFAATANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLS
Ga0210409_1026473813300021559SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVELNIMKGKNVVATVPARMLDLEQSPARDSIITSVNSEGH
Ga0210409_1048794823300021559SoilMSASKISKGLLLGLALLLATSVFAANKGSMEVIDPLTVNGKQLPAGDYTVKWEGT
Ga0242647_104293513300022505SoilMKFENVSKGLLLGLAVLLATSAFAAANQGSMQLQDPVTVSGKQLRAGDYSVKWDGTGPNVELSILKGN
Ga0242667_103479613300022513SoilMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQDPVTVSGKQLRAGDYSVKWDGNGPNVQLSILKGNKVVATTPARLIDLNEKSNNDAAV
Ga0242659_107996523300022522SoilMKFQSASKSLLLGLALLLATSAFAAANKGSLELPSAVTVSGKQLSPGDYSVKWDGNGP
Ga0242663_104442013300022523SoilMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQDPVTISGKQLRAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNHDAAVVQNNGD
Ga0242669_110275313300022528SoilMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQNPVTVSGKQLGAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNHDAAVVQNNGDGTK
Ga0242668_107659613300022529SoilMKLQSISKSLLVGLALLLATTAFAATANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGK
Ga0242671_110681213300022714SoilMKLQSISKSLLVGLALLLATSAFAATTNKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDAAVVK
Ga0242673_104718913300022716SoilMKLQSISKSLLVGLALLLATTAFAATANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDA
Ga0242665_1013560733300022724SoilMSESKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGLARTSN
Ga0242665_1036854513300022724SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVELNIMKGKNVV
Ga0247546_10638113300023551SoilMKFQSFSKSMLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGSKVVATTPARLVDLSDKSNNDAALVKKNDDGSKSL
Ga0247543_10575813300023677SoilMKFANTSKGLLLGLALLLATSAFAAANQGSMQLQDPVTISGKQLRAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNH
Ga0207707_1134085113300025912Corn RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDCVVKWEGAGPNVELN
Ga0207660_1162710713300025917Corn RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYTVRWEGAGPNVELNILKGKNVVATVPARMVDLSRSPDRDSAVTV
Ga0207646_1170845213300025922Corn, Switchgrass And Miscanthus RhizosphereMTKSLLLGLALLLATSAFAANKGSLQLSNAANISGKQLAAGDYTVKWDGNGPN
Ga0207668_1038088623300025972Switchgrass RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYVVKWEGAGPNVELNILKGKNVVATVPARMVDLSRSPDRDSAVTV
Ga0207677_1030696823300026023Miscanthus RhizosphereMSVSKISKGLLLGLALLLATSVFAANNKGSMQVTDSVTVNGKQLPAGEYTIKWDGAGPNVELNILRGKNVVATVPARMVDLEQSPNRDSVITN
Ga0207702_1220862813300026078Corn RhizosphereMSVSKISKGLLLGLALLLATSVFAANNKGSMQVTDSVTVNGKQLPAGEYTVKWDGAGPNVELNILRGKNVVATVPARMVDLEQSPNRDSVITN
Ga0209234_115072823300026295Grasslands SoilMTFSNTFKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGNGPNVELNILQGKKVVATMPARLI
Ga0209236_102807613300026298Grasslands SoilMKVSKMTKSLLLGLALLLATNAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKG
Ga0209236_129214413300026298Grasslands SoilMKLSVISKGLLLGMALLLATSAFASNKGSMNVQENLTVSGKQLSAGDYQLQWEGSGPNVEVNILRGKKVVATVPARLVDINQSPSSNASIVRKNADG
Ga0209055_101904053300026309SoilMKVSKMTKSLLLGLALLLATNAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDAAVTT
Ga0209155_112671023300026316SoilMTASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIPAGEYTVKWEGTGSNVELNILRGKSVVATVPARMIDLNQ
Ga0209687_100591053300026322SoilMKVSKMTKSLLLGLALLLATSAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDAAVT
Ga0209152_1011871523300026325SoilMKLSVISKGLLLGMALLLATSAFASNKGSMNVQENLTVSGKQLSAGDYQLQWEGSGPNVEVNILRGK
Ga0209801_110819613300026326SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKQLAPGAYTVEWEGNGPNVELNILQG
Ga0209473_101198513300026330SoilMKVSKMTKRLLLGLALLLATSAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDAAVTT
Ga0209804_100822653300026335SoilMKVSKMTKSLLLGLALLLATNAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLV
Ga0257180_101128113300026354SoilMKASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIAAGEYTVKWDGNGPNVELNILRGKNVVATVPARMVDLESTPSRDSAVTVV
Ga0257180_103755013300026354SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVNDPVTINGKQLASGTYSVTWD
Ga0257171_107331013300026377SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAHMLDLEQSPTRDSV
Ga0257169_105328913300026469SoilMKSSQLSKGLLLGLALLLATSAFAANKGSLQVSDPVMVSGKQLAPGAYTVEWEGNGPNVELNILQ
Ga0257165_108971113300026507SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTVNGKQLASGTYSVTWDGAGPNVELNILKGKSVVATVPAHMLDLEQSPARDSVVTNTNSDGHKS
Ga0209160_107190323300026532SoilMKVSKMTKSLLLGLALLLATNAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDAAVTTNNADGSRSLT
Ga0209160_107592013300026532SoilMKRSTISKSLLLGMALLLATGAFAANKGSLQVQDPVTVIGKQLPAGDYQLKWDGKGPNVELSILKGNKVVATVPARLVDINQSASSDAAVVR
Ga0209056_1055812413300026538SoilMTKSLFLGLALLLATSAFASNKASLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVV
Ga0209219_111747513300027565Forest SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDDPVTISGKQLAAGAYTVKWDGAGPNVELNILRGKNVVATVPARM
Ga0208827_111021313300027641Peatlands SoilMKFANVSKGLLLVGLALLLATSAFAASNKGSMQLLDTVTVSGKPLPAGDYSVKWDGTGPNVELNILQGSKVVATT
Ga0209217_111950313300027651Forest SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVE
Ga0209009_109516613300027667Forest SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAHMLDLEQSPTRDSVVTNTNSDGHKSL
Ga0209446_108067913300027698Bog Forest SoilMKFQSISKSLLLGLALLMATSAFAAGNKGSMQLLDPVSVSGKQLPAGEYSVKWDGSGPNVEVNIMKGNKVVATTPARLIDLSQKPDRDAAVVKNNDDG
Ga0209178_141683913300027725Agricultural SoilMNASKMSKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIAPGEYTVKWEGNGPNVELNILSGKNVVATVPAR
Ga0209689_102922753300027748SoilMKVSKMTKSLLLGLALLLATSAFASNKGSLQLNEAVNVSGKQLAAGDYTVKWDGAGPNVQASIMKGRNVVATVPAHLVDLDRAPGSDA
Ga0209167_1082121113300027867Surface SoilMKFQSASKSLLLGLALLLATSAFAAANKGSLELPSAVTVSGKQLSPGDYSVKWDGNGPTVELSILQGSKVVATTQARMVDLSQKQ
Ga0209579_1019720023300027869Surface SoilMKLQSISKSLLVGLALLLATSVFAAAANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMK
Ga0209283_1085610213300027875Vadose Zone SoilMKVSKMYKGLLLGLALLLATNAFAANKGSLQVSDPVTVSGKQLAAGDYTVKWEGAGPNVELNILQGKNIVATVPARLIDLNRSSDNNAAVTTLNGDG
Ga0209488_1030336313300027903Vadose Zone SoilMKASKISKGLLLGLALLLATSVFAANKGSLQVSDPVTVNGKQIDAGEYTVKWDGNGPNVELNILRGKNVVATVPARMVDLDRTPSRDSSVTVV
Ga0209583_1007176813300027910WatershedsMTTSKISKGLLLGLALLLATSVFAATNKGSLQVTDPLTVNGKQLPAGDYTVKWDGAGP
Ga0209069_1057154913300027915WatershedsMKVSKISKGLLLGLALLLATSVFAANKASLELNDPLTVNGKQLAAGSYSLKWQGTGPGVELSILQGKNVVATAPARLI
Ga0268265_1003572343300028380Switchgrass RhizosphereMSVSKLSKGLLLGLALLLATSVFAANKGTLQVSDPVTVNGKQLAAGDYVVKWEGAGPNVELNILKGKNVVATVPARMVDLSRSPDRDSAVTVVNSDG
Ga0137415_1021919813300028536Vadose Zone SoilMSVSKISKGLLLGLALLLATSGFAANKGSIQVNDPVTINGKQLASGTYSVTWDGAGPNVELNILKGKNVVATVPAHMLDLEQS
Ga0302278_1025283613300028866BogMKSTKNLILTLGLAVLTATSAFAAPNKGSLQITSPVKINGTQLKPGDYSVKWEGTGSNVQLSILQGRSVVT
Ga0308309_1042429523300028906SoilMSVSKISKGLLLGLALLLATSGFAANKGSLQVDNPVTINGKPLAAGEYTVKWDGAGPNVELNI
Ga0308309_1096684213300028906SoilMKASKWSKGLLLGLALLLATSAFASNKGSLAVTDNCMVAGKQLTKGDYKVSWEG
Ga0310037_1004117323300030494Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSDKSNSDAALVKKNDD
Ga0210272_122109613300030573SoilMKFTNSSKGLLLGLALLLATSAFAAANQGSMQLQDPVTISGKQLRAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNH
Ga0310038_1033041113300030707Peatlands SoilMKFQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGNKVVATTPARLVDLSDKSS
Ga0265462_1155802513300030738SoilMKFTNSSKGLLLGLALLLATSAFAAANQGSMQLQNPVTVSGKQLRAGDYSVKWDGNGPNVELSIMKGNKVVATAPARLIDLNEKSNHDMKTKKGNEMKG
Ga0265462_1239368913300030738SoilMKLSISKSVLLGLAVLLATSAFAAANKGSLELSNPVVVSGTQLSPGDYSVKWDGNGPNVELNILKGSKVVATTPARLVDLSQKQSVDNAVVKNNADGTNSLA
Ga0265461_1322114213300030743SoilMKFQNISKGLLLGLAFVLATGAFAATANKGSVQLMDSVTISGKQLAAGSYQVKWDGSGPNVEVNFLQKNQVVATTSAHLVDLNQKQDN
Ga0265770_102375023300030878SoilMKFQSISKSLLVGLALLLATTAFAAANKGTMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIIKGKKVVATTPARLIDLSQKPARDAAVVKNNDDGSRS
Ga0265779_10889913300031043SoilLLLATSAFAATANKGSMQLLDPVTVSGKQLPAGEYSVQWDGSGPNVEVNIMKGKKVVATTPARLIDLSQKPARDSAVVRNNDDG
Ga0308193_108784223300031096SoilMTFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGSGPNVELNILQGKNVVA
Ga0170823_1582555313300031128Forest SoilMKASKMSKGLLLGLALLLATSAFASNKGSLAVTDNCMIAGKQLTKGDYKVSWEGNGPD
Ga0308179_104385423300031424SoilMRFSNTSKGLLLGLALLLATSAFAANKGSLQVSDPVTVSGKSLAAGEYTVKWEGTGPKDEINNLQNNKYDYTTSARMI
Ga0170819_1095968923300031469Forest SoilMSKSKISKGLLLGLALLLATSVFAATNKGSLQVLDPLTVNGKQLPAGDYTVTWDGAGPNVELNIMRGKNVVASVPAHMVDLDKSPNRDSLITNVNSDGHKALNE
Ga0310686_116699480123300031708SoilMKFQSFSKSMLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGSKVVATTPARLVDLSDKSN
Ga0307474_1128478913300031718Hardwood Forest SoilMKASKLSKGLLLGLALLLATSAFASNKGSLAVTDNCVVAGKQLTKGDYKVSWEGNGPDVQLSIMKGMDVVATVPAH
Ga0307469_1245227023300031720Hardwood Forest SoilMKASKISKGLLLGLALLLATSSFAANKGSLQVTDPVTVNGKQIGAGDYTVKWDGNGPNVELNI
Ga0307478_1031471323300031823Hardwood Forest SoilMKFANVSKGLLVGLALLLATTAFAASNRGSMQLLDSVTVSGKQLPAGEYSVKWDGSGPNVELNILRGNKVVATTPARLIDLNQKPSSDSAVVQRNGDGSNSL
Ga0306926_1278383313300031954SoilMKASKTSKGLLLGLALLLATTAFASNKGTLAVTDNCTVAGKALAKGDYKVSWDGNGPDVQLNIMKGKEVVATVPAHMTEL
Ga0307479_1050226313300031962Hardwood Forest SoilMKITSVYKGLVLSLVLLLAATAFASNKGSMQISTPVMVNGRQLAPGDYSVKWEGNGP
Ga0316040_12017413300032121SoilMKSQSFSKSLLMGLALLLATSAFAAANKGSVQFLDPVTISGKQVPAGDYSVKWDGNGPNVELNILKGSKVV
Ga0307471_10012514943300032180Hardwood Forest SoilMKASKISKGLLLGLALLLATSAFAANKGNLQVSDPVTVNGKQIGAGDYTVKWDGNGPNVELNILHGKNVVATVPARMVDLDQTPNRDSAVTVLSPDGHKSLNE
Ga0307471_10281525613300032180Hardwood Forest SoilMKFSNTSKGLVLGLALLLATSAFAANKGSLQVSDTVNVSGKSLAAGEYNVKWEGNGPNVELNILQGKKV
Ga0335069_1089784113300032893SoilMSVSKISKGLLLGLALLLATSVFAANKGTLQVNDPVTVNGKQLGAGEYTVKWDGAGPNVELNILKGKNVVATVPARMLELEQS
Ga0335084_1060779423300033004SoilMKMWKLSILAFAILLATLAFAANKAPMQVLNPVSVSGKQLAAGDYTVSWEGNGPAVELSILKGKNVVAKVPAKMVDLPAAPDRNSIVTTN
Ga0335084_1113005613300033004SoilMKVSRISKGLLLGLALLLATSAFAANKGSLSVVDPVTIAGKQLAAGQYKVTWDGSG
Ga0335077_1044531233300033158SoilMKLVKTYKGLLLGLALLLATSAFASSKGTLQVTDNLSVSGTQLAVGDYTVKWDGTGPSVELNILQGNKVVATVPARLI
Ga0326728_1034035123300033402Peat SoilMKFANFSKSLMLGLALLLATSAFATANKGSVQLMDPVTVSGTQLPAGEYSVKWDGSGPNVEVNFLKGNKVVATTPARLIDLSQKPYSD


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.