NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F029893

Metagenome / Metatranscriptome Family F029893

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F029893
Family Type Metagenome / Metatranscriptome
Number of Sequences 187
Average Sequence Length 103 residues
Representative Sequence MPTMQDSVSVAANAVSANVLAGQLYEFVPTGLNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTARNTTAGALTFFWRVDL
Number of Associated Samples 137
Number of Associated Scaffolds 187

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 79.68 %
% of genes near scaffold ends (potentially truncated) 20.86 %
% of genes from short scaffolds (< 2000 bps) 49.20 %
Associated GOLD sequencing projects 121
AlphaFold2 3D model prediction Yes
3D model pTM-score0.77

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (98.930 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand
(8.556 % of family members)
Environment Ontology (ENVO) Unclassified
(18.717 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(21.925 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      
Full Alignment
Alignment of all the sequences in the family.
Sorting
Filter
Selection
Vis.elements
Color scheme
Extras
Export
Help

IDLabel
.2.4.6.8.10.12.14.16.18.20.22.24.26.28.30.32.34.36.38.40.42.44.46.48.50.52.54.56.58.60.62.64.66.68.70.72.74.76.78.80.82.84.86.88.90.92.94.96.98.100.102.104.106.108.110.112.114.116.118.120.122.124.126.128
1EF_395120
2EF_na_1334280
3EF_01858300
4EF_00093220
5GPICI_01785730
6ICChiseqgaiiDRAFT_05589701
7TB_LI09_3DRAFT_10065583
8JGI11643J11755_116575952
9RCM34_10237922
10C687J26616_100217692
11C687J26623_101967352
12JGI25385J37094_100202072
13Ga0068515_1175862
14Ga0066680_100526952
15Ga0066680_107583991
16Ga0066688_101758972
17Ga0070667_1000995853
18Ga0070705_1000583872
19Ga0070697_1000927263
20Ga0070697_1002811232
21Ga0070704_1003503503
22Ga0066692_106876112
23Ga0066700_104983931
24Ga0066905_1004970591
25Ga0066903_1003880212
26Ga0074472_101427003
27Ga0074472_105635571
28Ga0074472_107467662
29Ga0074472_114034672
30Ga0075023_1000097422
31Ga0075417_102459622
32Ga0075432_100151752
33Ga0082029_11189632
34Ga0066659_100567605
35Ga0066659_100588276
36Ga0075428_1002556351
37Ga0075421_1002888004
38Ga0075431_1009396622
39Ga0075425_1001687384
40Ga0075425_1005136291
41Ga0079218_101040523
42Ga0105044_101582593
43Ga0110935_10098345
44Ga0115888_1240581
45Ga0066710_1002625876
46Ga0066710_1002639914
47Ga0066710_1005980163
48Ga0066710_1013622552
49Ga0066710_1037806551
50Ga0111539_101583414
51Ga0114129_108037881
52Ga0111538_113291451
53Ga0075423_101539303
54Ga0075423_101566634
55Ga0103857_100112022
56Ga0116190_10313934
57Ga0105059_10067032
58Ga0105069_10003372
59Ga0105069_10009752
60Ga0105081_10025523
61Ga0105087_10175931
62Ga0105085_10133962
63Ga0105085_10277992
64Ga0126376_105925651
65Ga0126376_132025542
66Ga0126377_105558662
67Ga0126377_107881352
68Ga0126377_125921661
69Ga0126379_105699372
70Ga0126381_1002270091
71Ga0136847_117623944
72Ga0126383_101237953
73Ga0126383_109024501
74Ga0137451_10121264
75Ga0137334_10989152
76Ga0157136_10007493
77Ga0157209_1010632
78Ga0157209_1010852
79Ga0126369_114093692
80Ga0126369_122531591
81Ga0163202_10053133
82Ga0163202_10054072
83Ga0163202_10058842
84Ga0163200_10772741
85Ga0163203_11265342
86Ga0180066_10396142
87Ga0180086_10053434
88Ga0180086_10356232
89Ga0180063_11057471
90Ga0157379_101001052
91Ga0163144_113360941
92Ga0132256_1023160452
93Ga0187778_112588081
94Ga0187787_103154041
95Ga0184634_100563952
96Ga0184634_102754972
97Ga0184626_100243415
98Ga0184637_100515103
99Ga0184637_100615453
100Ga0184637_101172783
101Ga0184637_101329154
102Ga0184637_102828751
103Ga0187773_105384151
104Ga0184640_100778633
105Ga0184627_100382804
106Ga0184627_104370522
107Ga0193754_10007351
108Ga0193755_10179741
109Ga0163150_100681066
110Ga0163150_100696662
111Ga0213920_10088434
112Ga0213919_10087282
113Ga0213919_10713131
114Ga0163145_10219312
115Ga0213922_10794701
116Ga0224505_100543622
117Ga0212124_100475873
118Ga0212124_100518722
119Ga0212124_100525093
120Ga0209431_100836793
121Ga0209640_101115812
122Ga0209341_112529611
123Ga0208461_10239693
124Ga0208325_10110201
125Ga0207658_100727964
126Ga0209235_10362596
127Ga0209237_10392217
128Ga0209236_10382037
129Ga0209801_10319483
130Ga0209377_10325787
131Ga0209160_11137411
132Ga0209896_10405261
133Ga0209876_10004462
134Ga0209876_10015273
135Ga0209897_10018552
136Ga0209869_10028754
137Ga0209845_10038065
138Ga0209845_10670771
139Ga0209842_10680341
140Ga0233416_101113382
141Ga0209591_101097791
142Ga0209023_100721662
143Ga0209814_100243375
144Ga0209481_100298532
145Ga0209486_100655733
146Ga0209254_107239151
147Ga0209253_100918852
148Ga0209382_104466653
149Ga0209583_100132212
150Ga0209853_10129542
151Ga0272412_10286103
152Ga0272412_13009102
153Ga0168034_1013591
154Ga0168034_1024052
155Ga0168034_1138252
156Ga0120082_10021296
157Ga0255311_10052915
158Ga0307408_1011783501
159Ga0247727_101262392
160Ga0247727_101291172
161Ga0247727_111371962
162Ga0307469_100533557
163Ga0307469_100554462
164Ga0307469_100562602
165Ga0307468_1000289597
166Ga0315293_100978442
167Ga0315293_100984405
168Ga0315293_100984442
169Ga0315293_100994862
170Ga0307473_100251295
171Ga0307473_100289865
172Ga0307473_100332114
173Ga0307413_103607011
174Ga0315290_100965992
175Ga0315290_107848101
176Ga0307410_101172923
177Ga0315297_100855961
178Ga0315274_101948173
179Ga0315284_114759871
180Ga0315284_125118592
181Ga0315277_101939655
182Ga0315292_108844671
183Ga0307470_100313526
184Ga0315276_101419505
185Ga0307471_1032099322
186Ga0307472_1000635585
187Ga0364942_0030851_5_268
Powered by MSAViewer


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 0.00%    β-sheet: 43.61%    Coil/Unstructured: 56.39%
Feature Viewer
Position :
0
Zoom :
x 1
+ Add Multiple Variants

Enter the variants

Position

Original

Variant

Get Predictions
Get Predictions

Enter the variants

Position

Original

Variant

102030405060708090100MPTMQDSVSVAANAVSANVLAGQLYEFVPTGLNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTARNTTAGALTFFWRVDLSequenceα-helicesβ-strandsCoilSS Conf. score
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.77
Powered by PDBe Molstar

Structural matches with SCOPe domains



 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains


Neighboring Clusters of Orthologous Genes (COGs)



 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:

Visualization
All Organisms
Unclassified
98.9%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts

Associated Scaffolds



Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:

Visualization
Freshwater And Sediment
Freshwater Lake Sediment
Freshwater
Freshwater Microbial Mat
Freshwater
Sediment
Freshwater Sediment
Marine Plankton
Freshwater
Groundwater
River Water
Freshwater
Freshwater
Marine Water
Sediment
Soil
Sediment (Intertidal)
Groundwater Sediment
Watersheds
Biofilm
Aquarium Water
Soil
Tropical Forest Soil
Termite Nest
Agricultural Soil
Soil
Grasslands Soil
Hardwood Forest Soil
Tropical Peatland
Tropical Forest Soil
Corn, Switchgrass And Miscanthus Rhizosphere
Soil
Groundwater Sand
Sandy Soil
Biofilm
Sediment
Populus Rhizosphere
Rhizosphere
Switchgrass Rhizosphere
Switchgrass Rhizosphere
Arabidopsis Rhizosphere
Activated Sludge
Anaerobic Digestor Sludge
Wastewater
Wastewater
Sbr_Wastewater
4.8%7.5%3.2%5.9%5.9%7.0%4.8%5.3%8.6%8.0%
Download SVG
Download PNG
Download CSV
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).


Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
EF_3951202020627000WastewaterMPVMQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSNTGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
EF_na_13342802027040001WastewaterQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSNTGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
EF_018583002070309007WastewaterDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSNTGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
EF_000932202070309007WastewaterMQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSNTGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFW
GPICI_017857302088090015SoilMQDSVSVSANSVSSNQLSGQLYEFVPQGANVTVSCTGSATGLRVSFICGVPLIEDQAIGLQNRFPLIPDDVIHSGPVPGGRMVLKFRNSTGGALTAFWRVDV
ICChiseqgaiiDRAFT_055897013300000033SoilMPTMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAINLQNRFPLVPDDVLHGGPVPGGRCVLTFRNTTGGALTAFWRVDV*
TB_LI09_3DRAFT_100655833300000229GroundwaterMPTMQDSVVVAANSVSANVLAGQLYEFVPEGQPVTISVTGSAIGLRTTYICGAPVINDQAIGLQNRFPLIPDDIIQSGAVPGGRQVLTFRNTTAGPLTAFWRVDL*
JGI11643J11755_1165759523300000787SoilMPTMQDSVSVSANSVSSNQLSGQLYEFVPQGANVTVSCTGSATGLRVSFICGVPLIEDQAIGLQNRFPLIPDDVIHSGPVPGGRMVLKFRNSTGGALTAFWRVDV*
RCM34_102379223300001843Marine PlanktonMPVMQDSVSVAANSISSNVLAGQLYEFVPTGTKVTLSCSGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGQVRACRLVLTARNTTAGALTFFWRVDVN*
C687J26616_1002176923300002120SoilMPTMQDSVSVAANAVSANVLAGQLYEFVPTGLNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTARNTTAGALTFFWRVDL*
C687J26623_1019673523300002122SoilMPTMQDSVSVAANSVSANVLAGQLYEFVAPGTNVTISCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIQNGQVPGGRMVLTARNTTAGALTFFWRVDL*
JGI25385J37094_1002020723300002558Grasslands SoilMPTMQDSVAVAANAVTTNQIAGQLYEFVRRGTLVVLSATGSVTGLRITFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLTFRNTTAGAITAFWRVDVTI*
Ga0068515_11758623300004829Marine WaterMPTMQDSVSVAANSVSSNVLAGQLYEFVPNGTRVALSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRQCRLVLTARNTTAGALTFFWRVDLS*
Ga0066680_1005269523300005174SoilMPTMQDSVSVAANGVSTNQLAGQLYEFLPRGTLVVLSCAGSATGLRATLIANIPVLNDQAINLQNRFPIIPDDIIYTGRVTACRLVLTFRNTTGGALTAFWRVDVSR*
Ga0066680_1075839913300005174SoilMPTMQDSVSVAANAVSTNVLAGQLYEFIETGTLVVLSCTGSVTGLRTTFIINNPVVNDQAIGLQNRFPIIPDDLLFTGRVRRGRLVLTFRNTTAGAITAFWRVDVSR*
Ga0066688_1017589723300005178SoilMPTMQDSVSVAANAVSTNQIAGQLYEFVRRGTLVVLSATGSATGLRVTFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLTFRNTTAGAITAFWRVDVTL*
Ga0070667_10009958533300005367Switchgrass RhizosphereMPLMQDSVSVAANAVSANVLAGQLYEFVAPGTNVTVSVTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIITSGPMPGGRMVLTARNTTAGALTFFWRVDL*
Ga0070705_10005838723300005440Corn, Switchgrass And Miscanthus RhizosphereMPTMQDSVSVAANAVSANVLAGQLYEFVPNGTRVALSCTGSATGLRATLIANIPVLNDQAISLNNRFPLIPDDILYTGTVRQCRLVLTSRNTTAGALTFFWRVDLS*
Ga0070697_10009272633300005536Corn, Switchgrass And Miscanthus RhizosphereMPTMQDSVSVAANAVSANVLAGQLYEFVPNGTRVALSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRQCRLVLTARNTTAGALTFFWRVDLS*
Ga0070697_10028112323300005536Corn, Switchgrass And Miscanthus RhizosphereMQDSISVAANAVSANVLAGQLYEFVPNGTRVALSCTGSATGLRATLIANIPVLNDQAISLNNRFPLIPDDILYTGTVRQCRLVLTSRNTTAGALTFFWRVDLS*
Ga0070704_10035035033300005549Corn, Switchgrass And Miscanthus RhizosphereMPTMQDSVSVAANAVSANVLAGQLYEFVPRGTLVTLSCTGSATGLRSTLIANIPVLNDQAIGLQNRFPIIPDDIAYSGRVAACRLVLTSRNSTGGALTFFWRVDVSR*
Ga0066692_1068761123300005555SoilMPTMQDSISVAANAVSTNQLAGQLYEFVQSGTQVVLSCAGSAPGLRTTLIARIPVVNDQAINLQNRFPIIPDDIIFTGRVRACRLVLTFRNTTGGALTAFWRVDVS*
Ga0066700_1049839313300005559SoilMPTMQDSVSVAANAVSTNQLAGQLYEFVQRGTLVVLSSTGSATGLRITFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVAAGRLFLTFRNTTAGAITAFWRV
Ga0066905_10049705913300005713Tropical Forest SoilMPTMQDSVSVAANSVSANVLSGQLYEFVDMGTIVTLSATGSATGLRCSFTCAIPLVLDQAIGLQNRFPLVPDDLLHSGQVLGGRMVLTARNTTAGALTFFWRVDL*
Ga0066903_10038802123300005764Tropical Forest SoilMPTMQDSVSVAANSVSANVLAGQLYEFVPSGTRIVLSCTGSATGLRTTLIANIPVMNDQAINLQNRFPVIPDDIVYTGTVRACRLVLTSRNTTAGALTFFWRVDVS*
Ga0074472_1014270033300005833Sediment (Intertidal)MGMMQDSLSVAANAVSANVLNGQLYEFQPAGAPVQLLCTGSATGLRVSLIAAIPVVNDQAIGLLNRFPIVPDDRVWQGRVRAACRLVLTFRNSTAGALTAFWRVDTMD*
Ga0074472_1056355713300005833Sediment (Intertidal)SGELYEFAEEGQDVTLSVAGSAVGLRTTFICGVPLVNDQAINTQNRFPLIPDDVLHSGQVPGGRMILTFRNTTGAPLTAFWRADL*
Ga0074472_1074676623300005833Sediment (Intertidal)MPTMQDSISVAANSVSANVLAGQLYEFVGQVPVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGVVPGGRMVLTARNTTAGALTYFWRVDL*
Ga0074472_1140346723300005833Sediment (Intertidal)RPDVVHVRNLPVVVKGVIMPTMQDSVSVAANSVSSNVLSGQLYEFVGAIPVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGVVPGGRMVLTARNTTAGALTFFWRLDL*
Ga0075023_10000974223300006041WatershedsMGLMQDSISVPANAVTLNQLNGQLYEFQPAGAPVQLLATGSGTGLRVTLLAATAVVNDQAIGLQNRFPIIPDDRVWMGRVKANCRLVLTFRNSTAGVLIAFWRVDTQD*
Ga0075417_1024596223300006049Populus RhizosphereMPTMQDSISVAANSVSANVLAGQLYEFFAPGSNITLSVAGSATGLRLTFINGVPLINDQAMNLQNRFPLIPDDVVHAGPVPGGRAVLTFRNTTAGALTAFWRVDL*
Ga0075432_1001517523300006058Populus RhizosphereVPTMQDSVSVAANSVSANVLSGQLYEFLPQGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGQVPGGRAILTFRNTTAGALTAFWRIDL*
Ga0082029_111896323300006169Termite NestMPTMQDSVSVAANSVSTNQLAGQLYEFVPQGANVTLSATGSATGLRCTFICGVPLVNDQAIGLQNRFPLVPDDVLHGGPVPGGRMVLTALN
Ga0066659_1005676053300006797SoilMPTMQDSVSVAANAVSTNQIAGQLYEFVRRGTLVVLSATGSATGLRVTFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLT
Ga0066659_1005882763300006797SoilMPTMQDSVSVAANAVSTNVLAGQLYEFIETGTLVVLSCTGSVTGLRTTFIINNPVVNDQAIGLQNRFPIIPDDLLFTGRVRRGRLVLTFRN
Ga0075428_10025563513300006844Populus RhizosphereMPTMQDSVSVAANSVSSNQLSGQLYEFVPQGANVTVSCTGSATGLRVSFICGVPLIEDQAIGLQNRFPLIPDDVIHSGPVPGGRMVLKFRNSTGGALTAFWRVDV*
Ga0075421_10028880043300006845Populus RhizosphereMPTMQDSVSVAANAVSANVLAGQLYEFVDPGTQVTVSVTGSATGLRTTFICGIPLINDQAINLQNRFPLIPDDIVHSGAVPGGRMVLTSRNTTAGALTFFWRVDL*
Ga0075431_10093966223300006847Populus RhizosphereMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAINLQNRFPLVPDDVLHGGPVPGGRCVLTFRNTTGGALTAFWRVDV*
Ga0075425_10016873843300006854Populus RhizosphereMPLMQDSVSVAANSVSTNVLAGQLYEFVAPGTNVTVSVTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIITSGPMPGGRMVLTSRNTTAGALTFFWRVDL*
Ga0075425_10051362913300006854Populus RhizosphereMPTMQDSVSVAANAVSINVLAGQLYEFVDPGTQVTVSVTGSATGLRTTFICGIPLINDQAINLQNRFPLIPDDIVHSGAVPGGRMVLTARNTTAGALTFFWRVDL*
Ga0079218_1010405233300007004Agricultural SoilMPTMQDSVSVAANSVSANVLSGQLYEFVGQVPVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGVVPGGRMVLTARNTTGGALTFFWRVDL*
Ga0105044_1015825933300007521FreshwaterMQDSIAVLANSVSANVLAGQLYEFVENGTQVTVSVTGSLTGLRCSYISGIPLINDQAINLQNRFPLIPDDIIHSGSVPGGRQVLTFRNTTAGTVTAFWRVDL*
Ga0110935_100983453300008065WastewaterMQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSNTGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN*
Ga0115888_12405813300008807Sbr_WastewaterMMQDSLTVAANSVSANVLAGQLYEFVRAGQPIRLLATGSATGLRLSMLIGMAIINDQALNLQNRFPLVPDDVIHTGRAPVSGRMVLTFRNTTAGALTAFWRIDL*
Ga0066710_10026258763300009012Grasslands SoilMPTMQDSVSVGANAVSTNVLAGQLYEFVPNGTRVALAVTGSATGLRATLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRQCRLVLTARNTTAGALTFFWRVDLS
Ga0066710_10026399143300009012Grasslands SoilMPTMQDSLSVAANGVSANVLAGQLYEFVETGTQVVLSATGAATGMRCTLIARIPVINDQAIGLQNRFPIIPDDIIFTGRVRRCRLVLTFRNNTGSAIVTFWRVDVA
Ga0066710_10059801633300009012Grasslands SoilMPTMQDSVSVGANAVSTNVLAGQLYEFVPNGTRVALAVTGSATGLRATLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRQCRLVLTARNTTAGALVFFWRVDLS
Ga0066710_10136225523300009012Grasslands SoilMPTMQDSVSVAANATSLNQLSGQLYEFVQTGTQVVLSCTGSATGLRVTLIARIPVILDQAINLQNRFPVIPDDIIFTGKVRACRLFLTARNSTGGALTFFWRVDVS
Ga0066710_10378065513300009012Grasslands SoilMPTMQDSISVAANAVSANQLSGQLYEFVQTGTQVVLSCTGSATGLRVTLIARIPVINDQAIGLQNRFPVIPDDIIFTGRVRAARLFLTFRNTTGGALTAFWRVDIS
Ga0111539_1015834143300009094Populus RhizosphereMQDSVSVAANSVSANVLSGQLYEFLPQGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGQVPGGRAILTFRNTTAGALTAFWRIDL*
Ga0114129_1080378813300009147Populus RhizosphereLAGQLYEFFAPGSNITLSVAGSATGLRLTFINGVPLINDQAMNLQNRFPLIPDDVVHAGPVPGGRAVLTFRNTTAGALTAFWRVDL*
Ga0111538_1132914513300009156Populus RhizosphereDMPTMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAINLQNRFPLVPDDVLHGGPVPGGRCVLTFRNTTGGALTAFWRVDV*
Ga0075423_1015393033300009162Populus RhizosphereMQDSVSVAANAVSINVLAGQLYEFVDPGTQVTVSVTGSATGLRTTFICGIPLINDQAINLQNRFPLIPDDIVHSGAVPGGRMVLTARNTTAGALTFFWRVDL*
Ga0075423_1015666343300009162Populus RhizosphereMPTMQDSVSVGANAVSSNVLSGQLYEFVPNGTRVSLSVTGSATGLRCTLIANIPVLNDQAMNLQNRFPLIPDDIIYTGTVRACRLVLTARNTTAGALTFFWRVDLS*
Ga0103857_1001120223300009235River WaterMQDSVSVAANSISSNVLAGQLYEFVPTGTKVTLSCSGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGQVRACRLVLTARNTTAGALTFFWRVDVN*
Ga0116190_103139343300009655Anaerobic Digestor SludgeMQDSVSVGANSVSANVVAGQLYEFVPTGTKVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN*
Ga0105059_100670323300009795Groundwater SandMQDSVSVAANSVSTNQLTGQMYEVVARGTPVILAVAGSLTGLRVSFTCTIPLILDQAMNLQNRFPLIPDDIMYRGRVPGGRMILTFRNTT
Ga0105069_100033723300009800Groundwater SandMPTMQDSVSVGANAVSANVLSGQLYEFVPGGTLVTLACTGSATGLRTTLICNIPVVNDQAISLQNRFPIIPDDIVFSGRVRACRLVLTARNSTGGALTFFWRVDVS*
Ga0105069_100097523300009800Groundwater SandMQDSLSVAANSVSANVLAGQLYEFVDAGTNVTVSVTGSATGLRTTFICAVPIINDQAINLQNRFPLVPDDVIQSGTVLGGRMVLTFRNSTAGALTAFWRVDL*
Ga0105081_100255233300009806Groundwater SandMQDSVSVGANAVSTNQLAGQLYEFVPAGTLVVLSATGGATGLRCTLIANIPVLNDQAIGLQNRFPIIPDDIVFTGRVRNCRLVLTFRNTTGAAVIAFWRVDVSR*
Ga0105087_101759313300009819Groundwater SandMQDSISVAANAVSANVLNGQLYENAFPGQVVTLSCTGGATGLRATYICGMPLINDQAINLQNRFPLIPDDVLHAGPVPGGRQVLTFRNTTGAPITAFWRVDL*
Ga0105085_101339623300009820Groundwater SandMQDSVSVAANSVSTNQLTGQMYEVVARGTPVILAVAGSLTGLRVSFTCTIPLILDQAMNLQNRFPLIPDDIMYRGRVPGGRMILTFRNTTAGAITAFWRVDVA*
Ga0105085_102779923300009820Groundwater SandMQDSVSVGANAVSTNQLAGQLYEFVPAGTLVVLSATGGATGLRCTLIANIPVLNDQAIGLQNRFPIIPDDIVFTGRVRNCRLVLTFRNT
Ga0126376_1059256513300010359Tropical Forest SoilMQDSVSVAANAVSANVLAGQLYEFVPNGTRVVLSCTGSATGLRSTLIANIPVMNDQAINLQNRFPIIPDDIIYTGTVRACRLVLTSRNTTAGALTFFWRVDVS*
Ga0126376_1320255423300010359Tropical Forest SoilVPTMQDSVSVAANAVSSNVLAGQLYEFVPNGTRVVLSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRACRLVLTARNTTAGALTFFWRVDVS*
Ga0126377_1055586623300010362Tropical Forest SoilMQDSVSVAANAVSTNVLAGQLYEFVPQGTLVTLSCSGSATGLRTTLICNIPVVNDQAINLQNRFPIIPDDIVFSGRVRACRLVLTARNTTAGALTFFWRVDVS*
Ga0126377_1078813523300010362Tropical Forest SoilMPTMQDSVSVAANSVSSNVLAGQLYEFVPNGTRVALAVTGSATGLRCTLIANIPVLNDQALNLQNRFPLIPDDILYTGTVRQCRLVLTARNTTAGALTFFWRVDLS*
Ga0126377_1259216613300010362Tropical Forest SoilMQDSVSVAANAVSANVLAGQLYEFVPNGTRVVLSCTGSATGLRSTLIANIPVMNDQAINLQNRFPLIPDDILYTGTVRQCRLVLTARNTTAGALTFFWRVDLS*
Ga0126379_1056993723300010366Tropical Forest SoilMPTMQDSVSVAANAVSTNVLAGQLYEFVPQGTLVTLSCSGSATGLRTTLICNIPVVNDQAINLQNRFPIIPDDIVFSGRVRACRLVLTARNTTAGALTFFWRVDVS*
Ga0126381_10022700913300010376Tropical Forest SoilMPTMQDSVSVAANGVSANVLAGQLYEFVPSGTRVVLSCTGSATGLRVTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRACRLVLTSRNTTGGAVTFFWRVDVS*
Ga0136847_1176239443300010391Freshwater SedimentMQDSVLVVANTVSTNVLAGQLYEFVPRGTLVTLSCAGSATGLRSTLIANIPVLNDQAINLQNRFPLIPDDILYSGRVAACRLVLTFRNGTAGNLTAFWRVDVSR*
Ga0126383_1012379533300010398Tropical Forest SoilMPTMQDSVSVAANAVSANVLAGQLYEFVPNGTRVVLSCTGSATGLRSTLIANIPVMNDQAINLQNRFPIIPDDIIYTGTVRACRLVLTSRNTTAGALTFFWRVDVS*
Ga0126383_1090245013300010398Tropical Forest SoilLAGQLYEFVPNGTRLVLSCTGSATGLRATLIANIPVMNDQAINLQNRFPVVPDDIIYTGTIRACRLVLTSRNTTAGALTFFWRVDVS*
Ga0137451_101212643300011438SoilMQDSVSVAANSVSANVLAGQLYEFIPPGANVTLSCTGSATGLRTTFICGIPLINDQAFNFQNRFPLIPDDVIHSGGTPGGRMVLTARNTTAGALTFFWRVDV*
Ga0137334_109891523300012179SoilVLSGQLYEFVPPGINVTVSCTGAVTGLRATFICGVPLINDQAIGLQNRFPLIPDDIIQNGPVPGGRMVLSFRNTTGAAIIVFWRVDV*
Ga0157136_100074933300012282FreshwaterMMQDSLSVAANGVSANVLAGQLYEFVRAGQAIRLSGTGSAMGLRVTLLIGMAIINDQAINLQNRFPLIPDDVIHTGRAPVSGRMVLTFRNTTAGALTAFWRIDL*
Ga0157209_10106323300012630FreshwaterMQDSVSVAANSVSANVLSGQLYEFVDPGTQVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGQVPGGRMVLTARNTTAGALTYFWRVDL*
Ga0157209_10108523300012630FreshwaterVPTMQDSVSVAANSVSTNVLAGQLYEFVDPGTQVTVSVTGSATGLRTTFICGIPLINDQAINLQNRFPLVPDDIIHSGPVPGGRMVLTARNTTAGALTYFWRVDV*
Ga0126369_1140936923300012971Tropical Forest SoilMPTMQDSVSVAANAVSTNVLAGQLYEFVPTGTKLVLSCTGSATGLRATLIANIPVMNDQAINLQNRFPVIPDDIIYTGTVRSCRLVLTSRNTTAGALTFFWRVDVS*
Ga0126369_1225315913300012971Tropical Forest SoilMPTMQDSVSVAANGVSTNVLTGQLYEFVPSGTRVVLSCTGSATGLRTTLIANIPVLNDQAIGLQNRFPVIPDDIIYTGTVRACRLVLTARNTTGGALTFFWRVDVS*
Ga0163202_100531333300013086FreshwaterMQDSVSVAANSVSVNVLSGQLWEFADEGQTVSVSVTGSATGLRTTFIAGVPLINDQAINLQNRFPLIPDDVLHSGEVPGGRMVLTFRNTTAGALTAFWRVDV*
Ga0163202_100540723300013086FreshwaterMQDSIAVAANSVSVNVLSGQLYEFVEDGANVTVSLTGSATGLRTTFISGVPMINDQAINLQNRFPLIPDDVLHSGEVPGGRMVLTFRNTTAGALTAFWRVDV*
Ga0163202_100588423300013086FreshwaterMQDSISVAANSVSVNVLSGLLFEFTDGGELGVSCCGSATGLRATFIVGVPLCDDIAINLQNRFPILPDDIIFSGEVPPGRMILRFRNTTAGALTAFWRVDS*
Ga0163200_107727413300013088FreshwaterTMQDSISVAANSVSVNVLSGLLFEFTDGGELGVSCCGSATGLRATFIVGVPLCDDIAINLQNRFPILPDDIIFSGEVPPGRMILRFRNTTAGALTAFWRVDS*
Ga0163203_112653423300013089FreshwaterMQDSIAVAANSVSVNVLSGQLYEFVEDGANVTVSLTGSATGLRTTFISGVPMINDQAINLQNRFPLIPDDVLHSGEVPGGRMVLTFRN
Ga0180066_103961423300014873SoilSANVLSGQLYEFVPPGINVTVSCTGAVTGLRATFICGVPLINDQAIGLQNRFPLIPDDIIQNGPVPGGRMVLSFRNTTGAAIIVFWRVDV*
Ga0180086_100534343300014883SoilMQDSVSVAANSVSANVLAGQLYEFVPQGVNVTVSCTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIIQNGQVPGGRMVLTARNTTAGALTYFWRVDL*
Ga0180086_103562323300014883SoilMQDSVSVAANSVSANVLAGQLYEFVPQGVNVTVSCTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIIQNGQVPGGRMVLTARNTTAGALTFFWRVDL*
Ga0180063_110574713300014885SoilVAANSVSANVLAGQLYEFVPQGVNVTVSCTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIIQNGQVPGGRMVLTARNTTAGALTYFWRVDL*
Ga0157379_1010010523300014968Switchgrass RhizosphereMQDSVSVAANAVSANVLAGQLYEFVAPGTNVTVSVTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIITSGPMPGGRMVLTARNTTAGALTFFWRVDL*
Ga0163144_1133609413300015360Freshwater Microbial MatMQDSLSVAANAVSANVLAGQLYEFVPAGINVTVSCTGSATGLRTTFICGMPLINDQAISLQNRFPLIPDDIIHSGDVPGGRMVLTARNSTAGALIFFWRVDV*
Ga0132256_10231604523300015372Arabidopsis RhizosphereMQDSVSVAANAVSSNVLAGQLYEFVAPGTNVTVSVTGSATGLRATFICGVPLINDQAINLQNRFPLIPDDIIHSGGVPGGRMVLNARNSTAGALTFFWRVDL*
Ga0187778_1125880813300017961Tropical PeatlandMPVMQDSVSVAANTVSANVLSGQLYEFVPTGTRVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPVIPDDIVYQGTVRQCRLVLTSRNTTAGALTFFWRVDVN
Ga0187787_1031540413300018029Tropical PeatlandMATMQDSVSVAANSVSANVLAGQLYEFVPTGTRVTLSCTGSATGLRTTLIANVPVLNDQAINLQNRFPIIPDDIIYAGRVRACRLVLTARNTTGGALTFFWRVDIN
Ga0184634_1005639523300018031Groundwater SedimentMPTMQDSVVVLLNSVSANVLAGQLYEFVPAINVTVSCTGSVTGLRTTFICGIPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTFRNTTGGSITAFWRVDV
Ga0184634_1027549723300018031Groundwater SedimentMPTMQDSVLVLSNSVSLNVLAGQLYEFVPAGLNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTSRNTSGGNLTFFWRVDV
Ga0184626_1002434153300018053Groundwater SedimentMPTMQDSVTVLLNSVSLNVLAGQLYEFADPGWVVTLSLTGQATGLRTTFICGVPLINDQAINLQNRFPLVPDDVIHSGPVPGGRMVLTFRNTTGGSLIAFWRVDI
Ga0184637_1005151033300018063Groundwater SedimentMPTMQDSVSVAANALSANVLAGQLYEFVPSGTQVVMSATGAATGLRTTLICNIPVVNDQAIGLQNRFPIIPDDIVFTGRVRACRLVLTFRNSTGGAIIVFWRVDVSR
Ga0184637_1006154533300018063Groundwater SedimentMPTMQDSLSVAANAVSLNVLAGQLYEFVDVGTLVTVSVTGSATGLRTSFICGIPLINDQAINLQNRFPLIPDDIIHSGQVPGGRMVLTFRNGTAGALTAFWRVDV
Ga0184637_1011727833300018063Groundwater SedimentMPTMQDSIVVLLNSVSVNVLSGQLYEFVDIGQLVTVSVTGSVTGLRSTFICGVPLINDQAINLQNRFPLIPDDIIHSGQVPGGRMVLTFRNTTGASITAFWRVDI
Ga0184637_1013291543300018063Groundwater SedimentMPTMQDSLSVAANALSANVLAGQLYEFVDPGTLVTVSVTGSVTGLRTSFICGVPMINDQAINLQNRFPLIPDDIIHSGQVPGGRMVLTF
Ga0184637_1028287513300018063Groundwater SedimentMPTMQDSVVVVLNSVSLNVLAGQLYEFVDAGQLVTVSVTGSVTGLRATFICGVPLINDQAINLQNRFPLIPDDIIHSGQVPGGRMVLTFRNTTGGSITGFWRVDL
Ga0187773_1053841513300018064Tropical PeatlandMPLMQDSVSVAANSVSANQLSGQIYEFIPPGAPVTLSVTGSATGLRVTFVVGGVTIINDQAINLQNRFPIIPDDVITTGRMPGGRMVLTARNTTGGALTFFWRVDI
Ga0184640_1007786333300018074Groundwater SedimentMPTMQDSVAVLTNTKSVNVLAGQLYEFADPGALVTVSCTGSLTGLRVNFICGVPLINDQAINLQNRFPLIPDDIIHSGIVPGGRMVLEFINRTAGTVSAFWRLDL
Ga0184627_1003828043300018079Groundwater SedimentMPTMQDSLSVAANAVSLNVLAGQLYEFVDVGTLVTVSVTGSSTGLRTSFICGIPLINDQAINLQNRFPLIPDDIIHSGQVPGGRMVLTFRNGTAGALTAFWRVDV
Ga0184627_1043705223300018079Groundwater SedimentNVLAGQLYEFVTSINVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHAGSVPGGRMVLTFRNTTGASITAFWRVDV
Ga0193754_100073513300019872SoilMPTMQDSVSVGANAVSTNVLAGQLYEFVPNGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRACRLVLTSRNTTGGALTFFWRVDLS
Ga0193755_101797413300020004SoilVSTNVLAGQLYEFVPNGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRACRLVLTSRNTTGGALTFFWRVDLS
Ga0163150_1006810663300020195Freshwater Microbial MatMPTMQDSVSVAANALSANVLAGQLYEFVPAGIGVTVSCTGSATGLRTTFICGIPLINDQAINLQNRFPLIPDDIIHQGEVPGGRMVLTARNSTAGALTFFWRIDV
Ga0163150_1006966623300020195Freshwater Microbial MatMPTMQDSLSVAANAVSANVLAGQLYEFVPAGINVTVSCTGSATGLRTTFICGMPLINDQAISLQNRFPLIPDDIIHSGDVPGGRMVLTARNSTAGALIFFWRVDV
Ga0213920_100884343300021438FreshwaterMPVMQDSVSVGANAVSANVLAGQLYEFVGTGTQVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGRVRACRLVLTARNTTAGALTFFWRVDVN
Ga0213919_100872823300021440FreshwaterMATMQDSVSVAANSISANVLSGLLYEFVDDGTMVTVSCTGSATGLRATFIAGVPIADDVAINLQNRFALIPDDVLLSTEVPGGRLILRARNTTAGALTFFWRVDL
Ga0213919_107131313300021440FreshwaterMPTMQDSVSVAANSVSTNQLSGQLYEFVEEGTSVTVSCTGSATGLRTTFICGVPLVNDQAIGLQNRFPLIPDDIIMNGEVPGGRMVLTARNTTAGALTFFWRVDL
Ga0163145_102193123300021515Freshwater Microbial MatMPTMQDSLSVAANAVSANVLAGQLYEFVPAGINVTVSCTGSATGLRTTFICGMPLINDQAISLQNRFPLIPDDIIHSGEVPGGRMVLTARNSTAGTLIFFWRVDV
Ga0213922_107947013300021956FreshwaterMPTMQDSVSVAANGVSANVLSGQLYEFVPNGANVTLACTGSATGLRTTLICNIPVINDQAINLQNRFPIIPDDIIFSGRVRQCRLVLTARNTTAGALTFFWRVDVN
Ga0224505_1005436223300022214SedimentMPMMQDSVSVAANAVSANVLAGQLYEFVQAGRPVRLSATGSATGLRVSLLIGMAVINDQALNLQNRFPLMPDDVVHVGRAPVSGRLVLTFRNTTAGALTAFWRIDL
Ga0212124_1004758733300022553FreshwaterMSTMQDSISVAANSVSVNVLSGLLFEFTDGGELGVSCCGSATGLRATFIVGVPLCDDIAINLQNRFPILPDDIIFSGEVPPGRMILRFRNTTAGALTAFWRVDS
Ga0212124_1005187223300022553FreshwaterMPTMQDSVSVAANSVSVNVLSGQLWEFADEGQTVSVSVTGSATGLRTTFIAGVPLINDQAINLQNRFPLIPDDVLHSGEVPGGRMVLTFRNTTAGALTAFWRVDV
Ga0212124_1005250933300022553FreshwaterMPTMQDSIAVAANSVSVNVLSGQLYEFVEDGANVTVSLTGSATGLRTTFISGVPMINDQAINLQNRFPLIPDDVLHSGEVPGGRMVLTFRNTTAGALTAFWRVDV
Ga0209431_1008367933300025313SoilMPTMQDSVSVAANAVSANVLAGQLYEFVPTGLNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTARNTTAGALTFFWRVDL
Ga0209640_1011158123300025324SoilMPTMQDSVSVAANSVSANVLAGQLYEFVAPGTNVTISCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIQNGQVPGGRMVLTARNTTAGALTFFWRVDL
Ga0209341_1125296113300025325SoilMPTMQDSVSVAANAVSTNQLAGLLHEFLQQRARVTVSATGSATGLRCTFLVLGVALVNDQAIGLQNRFPLIPDDMLTSEGVPGGRMILTFRNSTGGALTAFWRVDVDYF
Ga0208461_102396933300025613Anaerobic Digestor SludgeMPVMQDSVSVGANSVSANVVAGQLYEFVPTGTKVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
Ga0208325_101102013300025824FreshwaterMSTMQDSISVAANSVSVNVLSGLLFEFTDGGELGVSCCGSATGLRATFIVGVPLCDDIAINLQNRFPILPDDIIFSGEVPPGRMILRFRNTTAGALTAFW
Ga0207658_1007279643300025986Switchgrass RhizosphereMPLMQDSVSVAANAVSANVLAGQLYEFVAPGTNVTVSVTGSATGLRTTFICGVPIINDQAINLQNRFPLIPDDIITSGPMPGGRMVLTARNTTAGALTFFWRVDL
Ga0209235_103625963300026296Grasslands SoilMPTMQDSVAVAANAVTTNQIAGQLYEFVRRGTLVVLSATGSVTGLRITFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLTFRNTTAGAITAFWRVDVTI
Ga0209237_103922173300026297Grasslands SoilMPTMQDSVAVAANAVTTNQIAGQLYEFVRRGTLVVLSATGSVTGLRITFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLTFRNTTAG
Ga0209236_103820373300026298Grasslands SoilMPTMQDSVAVAANAVTTNQIAGQLYEFVRRGTLVVLSATGSVTGLRITFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFL
Ga0209801_103194833300026326SoilMPTMQDSVSVAANAVSTNQIAGQLYEFVRRGTLVVLSATGSATGLRVTFIINVPVVNDQAIGLQNRFPIIPDDIVFTGRVTAGRLFLTFRNTTAGAITAFWRVDVTL
Ga0209377_103257873300026334SoilAGQLYEFLPRGTLVVLSCAGSATGLRATLIANIPVLNDQAINLQNRFPIIPDDIIYTGRVTACRLVLTFRNTTGGALTAFWRVDVSR
Ga0209160_111374113300026532SoilMPTMQDSVSVAANGVSTNQLAGQLYEFLPRGTLVVLSCAGSATGLRATLIANIPVLNDQAINLQNRFPIIPDDIIYTGRVTACRLVLTFRNTTGGALTAFWRVDVSR
Ga0209896_104052613300027006Groundwater SandVPTMQDSVSVAANAVSTNVLSGQLYEFVDGGTNVTVSVTGSATGLRCTFICGVPLINDQAINLQNRFPLVPDDIIHGGEVPGGRMVL
Ga0209876_100044623300027041Groundwater SandMPTMQDSVSVGANAVSANVLSGQLYEFVPGGTLVTLACTGSATGLRTTLICNIPVVNDQAISLQNRFPIIPDDIVFSGRVRACRLVLTARNSTGGALTFFWRVDVS
Ga0209876_100152733300027041Groundwater SandMATMQDSLSVAANSVSANVLAGQLYEFVDAGTNVTVSVTGSATGLRTTFICAVPIINDQAINLQNRFPLVPDDVIQSGTVLGGRMVLTFRNSTAGALTAFWRVDL
Ga0209897_100185523300027169Groundwater SandMPTMQDSVSVAANAVSANVLAGQLYEFVPRGTQVTLAVTGSATGLRCTLIANIPLVNDQAINLQNRFPLIPDDILFSGRVSAARLVLTARNSTAGALTFFWRVDVSR
Ga0209869_100287543300027187Groundwater SandMPTMQDSVSVGANAVSTNQLAGQLYEFVPAGTLVVLSATGGATGLRCTLIANIPVLNDQAIGLQNRFPIIPDDIVFTGRVRNCRLVLTFRNTTGAAVIAFWRVDVSR
Ga0209845_100380653300027324Groundwater SandMPTMQDSVSVAANSVSTNQLTGQMYEVVARGTPVILAVAGSLTGLRVSFTCTIPLILDQAMNLQNRFPLIPDDIMYRGRVPGGRMILTFRNTTAGAITAFWRVDVA
Ga0209845_106707713300027324Groundwater SandMPTMQDSISVAANAVSANVLNGQLYENAFPGQVVTLSCTGGATGLRATYICGMPLINDQAINLQNRFPLIPDDVLHAGPVPGGRQVLTFRNTTGA
Ga0209842_106803413300027379Groundwater SandMGVMQDSVSVAANSKSTNVLAGMMEEFVSQPSIVRLSATGSATGLRATLIIGGAVVIDDQAISLQNRFPLVPDDVLT
(restricted) Ga0233416_1011133823300027799SedimentMPTMQDSVSVGANSVSSNVLAGQLYEFVPAGTRVTLSATGSATGLRTTLICSVPLVNDQAIGLQNRFPLIPDDIVSSEIVPGGRLVLTARNTTGGALTFFWRVDIG
Ga0209591_1010977913300027850FreshwaterMPTMQDSVAVLANSVSANVLAGQLYEFVENGTQVTVSVTGSLTGLRCSYISGIPLINDQAINLQNRFPLIPDDIIHSGSVPGGRQVLTFRNTTAGTVTAFWRVDL
Ga0209023_1007216623300027870Freshwater And SedimentMPTMQDSVSVAANAVSANVLAGQLYEFVPNGTRLALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRACRLVLTARNTTAGALTFFWRVDLS
Ga0209814_1002433753300027873Populus RhizosphereVPTMQDSVSVAANSVSANVLSGQLYEFLPQGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGQVPGGRAILTFRNTTAGALTAFWRIDL
Ga0209481_1002985323300027880Populus RhizosphereMPTMQDSVSVAANSVSSNQLSGQLYEFVPQGANVTVSCTGSATGLRVSFICGVPLIEDQAIGLQNRFPLIPDDVIHSGPVPGGRMVLKFRNSTGGALTAFWRVDV
Ga0209486_1006557333300027886Agricultural SoilMPTMQDSVSVAANSVSANVLSGQLYEFVGQVPVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGVVPGGRMVLTARNTTGGALTFFWRVDL
Ga0209254_1072391513300027897Freshwater Lake SedimentMPTMQDSVSVAANSVSANVLTGQLYEFVQQGQPVTISCTGSATGLRTSFVCGVPLIDDQAISLQNRFPLVPDDIIHSGQVPGGRMILRARNTTAGALTFFWRVDI
Ga0209253_1009188523300027900Freshwater Lake SedimentMPTMQDSVSVAANSVSANVLAGQLYEFVPNGTRVSLAVTGSATGLRTTLIANIPVLNDQAINLQNRFPLIPDDILFTGTVRACRLVLTARNTTAGALTYFWRVDLS
Ga0209382_1044666533300027909Populus RhizosphereMPTMQDSVSVAANAVSANVLAGQLYEFVDPGTQVTVSVTGSATGLRTTFICGIPLINDQAINLQNRFPLIPDDIVHSGAVPGGRMVLTSRNTTAGALTFFWRVDL
Ga0209583_1001322123300027910WatershedsMGLMQDSISVPANAVTLNQLNGQLYEFQPAGAPVQLLATGSGTGLRVTLLAATAVVNDQAIGLQNRFPIIPDDRVWMGRVKANCRLVLTFRNSTAGVLIAFWRVDTQD
Ga0209853_101295423300027961Groundwater SandMPLMQDSVSVGANAVSANQLAGQLYEVVARGTPVVLSVTGSATGLRLSFTCTIPLILDQAMNLQNRFPLIPDDIIFRGRVPGGRMVLTFRNTTAGALTAFWRVDVG
Ga0272412_102861033300028647Activated SludgeMPVMQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
Ga0272412_130091023300028647Activated SludgeMPVMQDSVSVAANSVSSNVVAGQLYEFVPTGTKVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRIVLTARNTTAGALTFFWRIDVN
Ga0168034_10135913300029171Aquarium WaterMPTMQDSVSVAGNSVSANVLSGQLYEFVDPGTQVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGQVPGGRMVLTARNTTAGALTYFWRVDI
Ga0168034_10240523300029171Aquarium WaterMPTMQDSLSVAANSVSTNVLSGQLYEFVDPGTNVTVSVTGSGTGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGPVPGGRMVLTARNATGGALTYFWRVDL
Ga0168034_11382523300029171Aquarium WaterMPTMQDSLSVAANSVSTNVLSGQLYEFVDPGTNVTVSVTGSATGLRTTFICGVPLINDQAINLQNRFPLVPDDIIHSGPVPGGRMVLTARNATGGALTYFWRVDL
Ga0120082_100212963300029200BiofilmMPVMQDSVSVAANSVSANVVAGQLYEFVPTGTKVTLSCTGSATGLRATLIANIPVMNDQAINLQNRFPIIPDDIVFQGAVRACRLVLTARNTTGGALTFFWRIDVN
(restricted) Ga0255311_100529153300031150Sandy SoilMPVMQDSVSVAANSVSANVLAGQLYEFVGNGANVTLSCTGSATGLRSTLIANIPVMNDQAINLQNRFPIIPDDIVFQGRVRACRLVLTARNTTAGALTFFWRVDVN
Ga0307408_10117835013300031548RhizosphereVPTMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGPVPGGRAVLTFRNTTAGALTAFWRIDL
Ga0247727_1012623923300031576BiofilmMPTMQDSVSVAANSVSTNQLAGLLHEFLQGAARVSVSATGSATGLRCTLLVMSVSLIQDTAIGLQNRFPLVPDDLLTTEAVPGGRMILIFRNTTVGALTAFWRVDVDYS
Ga0247727_1012911723300031576BiofilmMPTMQDSVSVAANAVSTNQLSGQLYEFVPRGTLVTLSCTGSATGLRTTLIANIPVLNDQAINLQNRFPVIPDDIIYSGRVSACRLVLTSRNTTGGALTFFWRVDVSR
Ga0247727_1113719623300031576BiofilmMPTMQDSVSVGANAISTNVLAGQLYEFVPRGTMVVLSATGAATGMRSTLIANIPVVNDQAISFQNRFPLIPDDIVFTGRVAACRLVLTFRNTTGAPILTFWRVDVAR
Ga0307469_1005335573300031720Hardwood Forest SoilMPTMQDSVSVAANSVSANVLAGQLYEFVPTGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRSCRLVLTSRNTTGGA
Ga0307469_1005544623300031720Hardwood Forest SoilMPTMQDSVSVAANAVSANVLAGQLYEFVPNGTRVALSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRQCRLVLTARNTTAGALTFFWRVDLS
Ga0307469_1005626023300031720Hardwood Forest SoilMPTMQDSVSVGANAVSANVLAGQLYEFVPNGTRVALSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGTVRQCRLVLTSRNTTAGALTFFWRVDLS
Ga0307468_10002895973300031740Hardwood Forest SoilMPTMQDSVSVAANSVSANVLAGQLYEFVPTGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRSCRLVLTSRNTTGGALTFFWRVDLS
Ga0315293_1009784423300031746SedimentMQDSISVAANAVSSNVLAGQLYEFVDAGTAATVSVTGSATGLRTSFICGIPLINDQAINLQNRFPLVPDDIIHSGHLPGGRMVLTFRNTTAGALTAFWRVDL
Ga0315293_1009844053300031746SedimentMPTMQDSISVAANAVSANVLAGQLYEFVDGGTQATVSCTGSATGLRVSFICGIPLINDQAIGLQARFPLIPDDIIHSGFVPGGRMVLTFRNTTAGALTAFWRVDL
Ga0315293_1009844423300031746SedimentMPTMQDSLSVAANSVSANVLAGQLYEFVDAGTQATISCTGSATGLRTSFICGIPLINDQAINLQNRFPLIPDDIIHSGFVPGGRMVLTFRNTTAGALTAFWRVDL
Ga0315293_1009948623300031746SedimentMPTMQDSVSVAANAVSVNVLAGQLYEFVDAGTQATISCTGSATGLRTSFICGIPLINDQAINLQNRFPLIPDDIIHSGFVAGGRMVLTFRNTTAGALTAFWRVDL
Ga0307473_1002512953300031820Hardwood Forest SoilMPTMQDSVSVAANSVSTNVLAGQLYEFVPSGTRVALSCTGSATGLRATLIANIPVLNDQAISLNNRFPLIPDDILYTGVVRACRLVLTSRNGTGGALTFFWRVDLS
Ga0307473_1002898653300031820Hardwood Forest SoilMPTMQDSVSVAANAVSANVLAGQLYEFVPSGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRQCRLVLTSRNTTGGALTFFWRVDLS
Ga0307473_1003321143300031820Hardwood Forest SoilVPTMQDSVSVAANSVSANVLAGQLYEFVPSGTRVALAVTGSATGLRCTLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRQCRLVLTSRNTTGGALTFFWRVDLS
Ga0307413_1036070113300031824RhizosphereVPTMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGPVPGGRAVLTFRNTTAG
Ga0315290_1009659923300031834SedimentMPTMQDSVSVAANATSSNQIAGQLYEFVPNGTNITLSCTGSATGLRTTLICNIPVILDQAISLQNRFPLIPDDVIYQGRVRACRLFLTARNTTAGALTFFWRIDVN
Ga0315290_1078481013300031834SedimentMQDSVSVAANATSTNQIAGQLYEFVPNGTNITLSCTGSAVGLRTTLICNIPVILDQAISLQNRFPLIPDDVIYQGRVRACRLFLTARNTTAGALTYFWRIDVN
Ga0307410_1011729233300031852RhizosphereVPTMQDSVSVAANSVSANVLSGQLYEFLPPGANVTLSVAGSATGLRCTFINGVPLINDQAMNLQNRFPIVPDDVMHGGPVPGGRAVLTFRNTTAGA
Ga0315297_1008559613300031873SedimentMQDSVSVAANATSTNQIAGQLYEFVPNGTNITLSCTGSATGLRTTLICNIPVILDQAIGLQARFPIIPDDVIYQGRVRACRLFLTARNTTAGALTFFWRIDVN
Ga0315274_1019481733300031999SedimentMPTMQDSVSVAANSVSSNQIAGQLYEFVPNGTNITLSCTGSAIGLRSTLICNIPVILDQAISLQNRFPLIPDDVIYQGRVRACRLFLTARNTTAGALTFFWRVDVN
Ga0315284_1147598713300032053SedimentMPTMQDSVSVAANAVSANVLAGQLYEFVPPGANVTVSCTGSATGLRTTYICGVPLINDQAINLQNRFPLIPDDIIQSGEVPGGRMVLTARNTTA
Ga0315284_1251185923300032053SedimentAANAVSVNVLAGQLYEFVDAGTQATISCTGSATGLRTSFICGIPLINDQAINLQNRFPLIPDDIIHSGFVAGGRMVLTFRNTTAGALTAFWRVDL
Ga0315277_1019396553300032118SedimentNAVSVNQLAGQLYEFVEEGTELALSCTGSATGLRVTFICQIPLLLDQAIGLLNRFPVIPDDTIMTGEVPGGRLVLTFRNSTAGALTAFWRVDL
Ga0315292_1088446713300032143SedimentMPTMQDSVSVAANATSTNQIAGQLYEFVPNGTNITLSCTGSATGLRTTLICNIPVILDQAIGLQARFPIIPDDVIYQGRVRACRLFLTARNTTAGALTYFWRIDVN
Ga0307470_1003135263300032174Hardwood Forest SoilMTMQDSVSVAANGVSANVLSGQLYEFVPNGAAIQLAATGSATGLRCTLIANIPVVNDQAIGLQNRFPLIPDDVMFAGRVRSCRLVLTARNTTGGALTFFWRIDVN
Ga0315276_1014195053300032177SedimentMPTMQDSVSIAANATSSNQIAGQLYEFVPNGTNITLSCTGSATGLRTTLICNIPVILDQAISLQNRFPLIPDDVIYQGRVRACRLFLTARNTTAGALTFFWRIDVN
Ga0307471_10320993223300032180Hardwood Forest SoilMPTMQDSVAVAANAVSANVLAGQLYEFVRTGTKVILSCTGSATGLRTTLIANIPVLNDQAINLQNRFPIIPDDIIYTGVVRACRLVLTARNTTAGALTFFWRVDVS
Ga0307472_10006355853300032205Hardwood Forest SoilMPTMQDSVSVGANSVSTNVLSGQLYEFVPNGTRVALAVTGSATGLRATLIANIPVLNDQAINLQNRFPLIPDDILYTGTVRACRLVLTARNTTAGALTFFWRVDLS
Ga0364942_0030851_5_2683300034165SedimentVLAGQLYEFVPEGYNVTVSCTGSATGLRTTFICGVPLINDQAINLQNRFPLIPDDIIHSGEVPGGRMVLTFRNTTGGSITAFWRLDL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.