NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F049881

Metagenome Family F049881

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F049881
Family Type Metagenome
Number of Sequences 146
Average Sequence Length 207 residues
Representative Sequence ENNKDGIAYIQQAVFSGAYQAKDPAKRAGLLTRFAQIFPDSPYVNQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVTDEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPY
Number of Associated Samples 115
Number of Associated Scaffolds 146

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.70 %
% of genes near scaffold ends (potentially truncated) 97.26 %
% of genes from short scaffolds (< 2000 bps) 91.78 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score0.85

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (96.575 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(28.082 % of family members)
Environment Ontology (ENVO) Unclassified
(30.822 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.315 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 69.40%    β-sheet: 0.00%    Coil/Unstructured: 30.60%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.85
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
a.118.8.9: TPR-like repeats from PCI (proteasome / COP9 signalosome / eIF3) domainsd3txna13txn0.74767
a.118.8.2: Transcription factor MalT domain IIId1hz4a_1hz40.74387
a.118.8.1: Tetratricopeptide repeat (TPR)d1w3ba_1w3b0.72607
a.118.8.9: TPR-like repeats from PCI (proteasome / COP9 signalosome / eIF3) domainsd4cr2s14cr20.71101
a.118.8.0: automated matchesd5mjza_5mjz0.68758


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 146 Family Scaffolds
PF01625PMSR 30.82
PF13181TPR_8 2.05
PF07719TPR_2 2.05
PF02397Bac_transf 0.68
PF03712Cu2_monoox_C 0.68
PF13620CarboxypepD_reg 0.68
PF00006ATP-synt_ab 0.68
PF13432TPR_16 0.68
PF11154DUF2934 0.68
PF02517Rce1-like 0.68
PF00515TPR_1 0.68
PF12704MacB_PCD 0.68

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 146 Family Scaffolds
COG0225Peptide methionine sulfoxide reductase MsrAPosttranslational modification, protein turnover, chaperones [O] 30.82
COG1266Membrane protease YdiL, CAAX protease familyPosttranslational modification, protein turnover, chaperones [O] 0.68
COG2148Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid)Cell wall/membrane/envelope biogenesis [M] 0.68
COG4449Predicted protease, Abi (CAAX) familyGeneral function prediction only [R] 0.68


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms97.95 %
UnclassifiedrootN/A2.05 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000789|JGI1027J11758_12297355All Organisms → cellular organisms → Bacteria → Acidobacteria794Open in IMG/M
3300001431|F14TB_101571520All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium695Open in IMG/M
3300002916|JGI25389J43894_1072686All Organisms → cellular organisms → Bacteria → Acidobacteria595Open in IMG/M
3300003505|JGIcombinedJ51221_10224268All Organisms → cellular organisms → Bacteria → Acidobacteria763Open in IMG/M
3300004479|Ga0062595_102620711All Organisms → cellular organisms → Bacteria → Acidobacteria506Open in IMG/M
3300004643|Ga0062591_100001801All Organisms → cellular organisms → Bacteria6743Open in IMG/M
3300005174|Ga0066680_10960604All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Methanomicrobia → Methanomicrobiales → Methanomicrobiaceae → Methanolacinia → Methanolacinia petrolearia503Open in IMG/M
3300005177|Ga0066690_10719918All Organisms → cellular organisms → Bacteria → Acidobacteria658Open in IMG/M
3300005179|Ga0066684_10139694All Organisms → cellular organisms → Bacteria → Acidobacteria1527Open in IMG/M
3300005332|Ga0066388_105151920All Organisms → cellular organisms → Bacteria664Open in IMG/M
3300005446|Ga0066686_10486136All Organisms → cellular organisms → Bacteria841Open in IMG/M
3300005451|Ga0066681_10077215All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium1871Open in IMG/M
3300005536|Ga0070697_100855008All Organisms → cellular organisms → Bacteria806Open in IMG/M
3300005537|Ga0070730_10977549All Organisms → cellular organisms → Bacteria528Open in IMG/M
3300005541|Ga0070733_10548608All Organisms → cellular organisms → Bacteria774Open in IMG/M
3300005578|Ga0068854_102281691All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Methanomicrobia → Methanomicrobiales → Methanomicrobiaceae → Methanolacinia → Methanolacinia petrolearia501Open in IMG/M
3300005587|Ga0066654_10309631All Organisms → cellular organisms → Bacteria → Acidobacteria849Open in IMG/M
3300005602|Ga0070762_10117856All Organisms → cellular organisms → Bacteria1557Open in IMG/M
3300005602|Ga0070762_10666564All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae695Open in IMG/M
3300005602|Ga0070762_11149962All Organisms → cellular organisms → Bacteria → Acidobacteria536Open in IMG/M
3300005610|Ga0070763_10691377All Organisms → cellular organisms → Bacteria597Open in IMG/M
3300006031|Ga0066651_10692908All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae547Open in IMG/M
3300006034|Ga0066656_10872049All Organisms → cellular organisms → Bacteria576Open in IMG/M
3300006034|Ga0066656_10877047All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae575Open in IMG/M
3300006050|Ga0075028_100367902All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae815Open in IMG/M
3300006050|Ga0075028_100469542All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Bryobacteraceae → Bryobacter → Bryobacter aggregatus730Open in IMG/M
3300006059|Ga0075017_100924788All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae677Open in IMG/M
3300006173|Ga0070716_100154668All Organisms → cellular organisms → Bacteria1479Open in IMG/M
3300006806|Ga0079220_10197055All Organisms → cellular organisms → Bacteria → Acidobacteria1159Open in IMG/M
3300006806|Ga0079220_11693384All Organisms → cellular organisms → Bacteria552Open in IMG/M
3300006854|Ga0075425_100640436All Organisms → cellular organisms → Bacteria → Acidobacteria1222Open in IMG/M
3300006903|Ga0075426_11492835All Organisms → cellular organisms → Bacteria → Acidobacteria514Open in IMG/M
3300006904|Ga0075424_101243038All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium792Open in IMG/M
3300006954|Ga0079219_10597709All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium807Open in IMG/M
3300007265|Ga0099794_10716578All Organisms → cellular organisms → Bacteria → Acidobacteria533Open in IMG/M
3300009012|Ga0066710_101744554All Organisms → cellular organisms → Bacteria → Acidobacteria944Open in IMG/M
3300010046|Ga0126384_11183345All Organisms → cellular organisms → Bacteria704Open in IMG/M
3300010048|Ga0126373_13088315All Organisms → cellular organisms → Bacteria → Acidobacteria519Open in IMG/M
3300010358|Ga0126370_11219666All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium700Open in IMG/M
3300010359|Ga0126376_10624465All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1023Open in IMG/M
3300010359|Ga0126376_11406232All Organisms → cellular organisms → Bacteria → Acidobacteria723Open in IMG/M
3300010362|Ga0126377_13477676All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium509Open in IMG/M
3300010364|Ga0134066_10064765All Organisms → cellular organisms → Bacteria → Acidobacteria978Open in IMG/M
3300010364|Ga0134066_10322650All Organisms → cellular organisms → Bacteria → Acidobacteria563Open in IMG/M
3300011120|Ga0150983_11416399All Organisms → cellular organisms → Bacteria1063Open in IMG/M
3300011269|Ga0137392_10441683All Organisms → cellular organisms → Bacteria → Acidobacteria1080Open in IMG/M
3300011269|Ga0137392_11517198All Organisms → cellular organisms → Bacteria529Open in IMG/M
3300011270|Ga0137391_11473403All Organisms → cellular organisms → Bacteria → Acidobacteria526Open in IMG/M
3300011270|Ga0137391_11524884All Organisms → cellular organisms → Bacteria → Acidobacteria513Open in IMG/M
3300012096|Ga0137389_11438576All Organisms → cellular organisms → Bacteria585Open in IMG/M
3300012200|Ga0137382_10174905All Organisms → cellular organisms → Bacteria → Acidobacteria1466Open in IMG/M
3300012201|Ga0137365_10242676All Organisms → cellular organisms → Bacteria → Acidobacteria1341Open in IMG/M
3300012202|Ga0137363_10992807All Organisms → cellular organisms → Bacteria → Acidobacteria713Open in IMG/M
3300012203|Ga0137399_10040821All Organisms → cellular organisms → Bacteria3295Open in IMG/M
3300012203|Ga0137399_10259884All Organisms → cellular organisms → Bacteria1426Open in IMG/M
3300012205|Ga0137362_10605794All Organisms → cellular organisms → Bacteria → Acidobacteria944Open in IMG/M
3300012206|Ga0137380_10540744All Organisms → cellular organisms → Bacteria → Acidobacteria1023Open in IMG/M
3300012207|Ga0137381_10202384All Organisms → cellular organisms → Bacteria → Acidobacteria1721Open in IMG/M
3300012207|Ga0137381_11098808All Organisms → cellular organisms → Bacteria → Acidobacteria684Open in IMG/M
3300012208|Ga0137376_11704916All Organisms → cellular organisms → Bacteria → Acidobacteria521Open in IMG/M
3300012209|Ga0137379_11069534All Organisms → cellular organisms → Bacteria → Acidobacteria712Open in IMG/M
3300012210|Ga0137378_10400046All Organisms → cellular organisms → Bacteria → Acidobacteria1274Open in IMG/M
3300012210|Ga0137378_10871517All Organisms → cellular organisms → Bacteria → Acidobacteria813Open in IMG/M
3300012357|Ga0137384_11172295All Organisms → cellular organisms → Bacteria → Acidobacteria612Open in IMG/M
3300012357|Ga0137384_11512328All Organisms → cellular organisms → Bacteria → Acidobacteria521Open in IMG/M
3300012362|Ga0137361_10027500All Organisms → cellular organisms → Bacteria → Acidobacteria4451Open in IMG/M
3300012362|Ga0137361_11609988All Organisms → cellular organisms → Bacteria → Acidobacteria570Open in IMG/M
3300012685|Ga0137397_10448018All Organisms → cellular organisms → Bacteria → Acidobacteria962Open in IMG/M
3300012918|Ga0137396_11289843All Organisms → cellular organisms → Bacteria → Acidobacteria508Open in IMG/M
3300012922|Ga0137394_10314188All Organisms → cellular organisms → Bacteria → Acidobacteria1338Open in IMG/M
3300012922|Ga0137394_10873677All Organisms → cellular organisms → Bacteria → Acidobacteria751Open in IMG/M
3300012925|Ga0137419_10868632All Organisms → cellular organisms → Bacteria → Acidobacteria741Open in IMG/M
3300012927|Ga0137416_10327870All Organisms → cellular organisms → Bacteria → Acidobacteria1277Open in IMG/M
3300012927|Ga0137416_10546028All Organisms → cellular organisms → Bacteria → Acidobacteria1003Open in IMG/M
3300012929|Ga0137404_11408342All Organisms → cellular organisms → Bacteria → Acidobacteria644Open in IMG/M
3300012944|Ga0137410_10448653All Organisms → cellular organisms → Bacteria → Acidobacteria1047Open in IMG/M
3300012944|Ga0137410_10798133All Organisms → cellular organisms → Bacteria → Acidobacteria792Open in IMG/M
3300012948|Ga0126375_10940048All Organisms → cellular organisms → Bacteria → Acidobacteria698Open in IMG/M
3300012977|Ga0134087_10113956All Organisms → cellular organisms → Bacteria → Acidobacteria1145Open in IMG/M
3300014154|Ga0134075_10177732All Organisms → cellular organisms → Bacteria → Acidobacteria913Open in IMG/M
3300015245|Ga0137409_10980237All Organisms → cellular organisms → Bacteria → Acidobacteria682Open in IMG/M
3300015358|Ga0134089_10201574All Organisms → cellular organisms → Bacteria → Acidobacteria801Open in IMG/M
3300016294|Ga0182041_11861311All Organisms → cellular organisms → Bacteria → Acidobacteria559Open in IMG/M
3300016341|Ga0182035_11463423All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium614Open in IMG/M
3300016404|Ga0182037_10079135All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2302Open in IMG/M
3300017927|Ga0187824_10117943All Organisms → cellular organisms → Bacteria863Open in IMG/M
3300017930|Ga0187825_10239771All Organisms → cellular organisms → Bacteria662Open in IMG/M
3300017943|Ga0187819_10148151All Organisms → cellular organisms → Bacteria1398Open in IMG/M
3300017955|Ga0187817_10399461All Organisms → cellular organisms → Bacteria → Acidobacteria877Open in IMG/M
3300017955|Ga0187817_10517177All Organisms → cellular organisms → Bacteria → Acidobacteria762Open in IMG/M
3300017955|Ga0187817_11107590All Organisms → cellular organisms → Bacteria → Acidobacteria508Open in IMG/M
3300017961|Ga0187778_10559801All Organisms → cellular organisms → Bacteria → Acidobacteria763Open in IMG/M
3300017995|Ga0187816_10473101All Organisms → cellular organisms → Bacteria → Acidobacteria562Open in IMG/M
3300018012|Ga0187810_10217829All Organisms → cellular organisms → Bacteria → Acidobacteria778Open in IMG/M
3300018085|Ga0187772_10161042All Organisms → cellular organisms → Bacteria1490Open in IMG/M
3300018468|Ga0066662_12883108All Organisms → cellular organisms → Bacteria510Open in IMG/M
3300018482|Ga0066669_11496039All Organisms → cellular organisms → Bacteria → Acidobacteria613Open in IMG/M
3300019361|Ga0173482_10067999All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1211Open in IMG/M
3300020199|Ga0179592_10038816All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2156Open in IMG/M
3300020580|Ga0210403_10112403All Organisms → cellular organisms → Bacteria → Acidobacteria2213Open in IMG/M
3300020580|Ga0210403_10973942All Organisms → cellular organisms → Bacteria665Open in IMG/M
3300021088|Ga0210404_10667941All Organisms → cellular organisms → Bacteria → Acidobacteria592Open in IMG/M
3300021168|Ga0210406_10880142All Organisms → cellular organisms → Bacteria → Acidobacteria675Open in IMG/M
3300021420|Ga0210394_10937028All Organisms → cellular organisms → Bacteria → Acidobacteria752Open in IMG/M
3300021559|Ga0210409_10245163All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1623Open in IMG/M
3300021559|Ga0210409_10543028All Organisms → cellular organisms → Bacteria → Acidobacteria1028Open in IMG/M
3300021559|Ga0210409_11084245All Organisms → cellular organisms → Bacteria676Open in IMG/M
3300021559|Ga0210409_11632274All Organisms → cellular organisms → Bacteria521Open in IMG/M
3300021559|Ga0210409_11684390All Organisms → cellular organisms → Bacteria → Acidobacteria510Open in IMG/M
3300021560|Ga0126371_13636865All Organisms → cellular organisms → Bacteria → Acidobacteria520Open in IMG/M
3300024330|Ga0137417_1136544All Organisms → cellular organisms → Bacteria → Acidobacteria618Open in IMG/M
3300024330|Ga0137417_1369080All Organisms → cellular organisms → Bacteria → Acidobacteria2181Open in IMG/M
3300025899|Ga0207642_10552241All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium711Open in IMG/M
3300026294|Ga0209839_10287904All Organisms → cellular organisms → Bacteria507Open in IMG/M
3300026310|Ga0209239_1182944All Organisms → cellular organisms → Bacteria → Acidobacteria786Open in IMG/M
3300026323|Ga0209472_1081517All Organisms → cellular organisms → Bacteria → Acidobacteria1315Open in IMG/M
3300026330|Ga0209473_1283273All Organisms → cellular organisms → Bacteria → Acidobacteria557Open in IMG/M
3300026342|Ga0209057_1212614All Organisms → cellular organisms → Bacteria → Acidobacteria549Open in IMG/M
3300026557|Ga0179587_10388477All Organisms → cellular organisms → Bacteria → Acidobacteria909Open in IMG/M
3300027069|Ga0208859_1042851All Organisms → cellular organisms → Bacteria → Acidobacteria529Open in IMG/M
3300027174|Ga0207948_1012025All Organisms → cellular organisms → Bacteria → Acidobacteria999Open in IMG/M
3300027388|Ga0208995_1056025All Organisms → cellular organisms → Bacteria → Acidobacteria693Open in IMG/M
3300027671|Ga0209588_1007604All Organisms → cellular organisms → Bacteria → Acidobacteria3193Open in IMG/M
3300027884|Ga0209275_10494904All Organisms → cellular organisms → Bacteria → Acidobacteria696Open in IMG/M
3300027910|Ga0209583_10805060All Organisms → cellular organisms → Bacteria → Acidobacteria500Open in IMG/M
3300028145|Ga0247663_1019382All Organisms → cellular organisms → Bacteria1017Open in IMG/M
3300028536|Ga0137415_10405220All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1168Open in IMG/M
3300031715|Ga0307476_11056405All Organisms → cellular organisms → Bacteria → Acidobacteria597Open in IMG/M
3300031720|Ga0307469_10197696All Organisms → cellular organisms → Bacteria → Acidobacteria1566Open in IMG/M
3300031720|Ga0307469_10669143All Organisms → cellular organisms → Bacteria → Acidobacteria937Open in IMG/M
3300031720|Ga0307469_11034767All Organisms → cellular organisms → Bacteria → Acidobacteria768Open in IMG/M
3300031720|Ga0307469_11653490All Organisms → cellular organisms → Bacteria → Acidobacteria616Open in IMG/M
3300031753|Ga0307477_10206029All Organisms → cellular organisms → Bacteria1367Open in IMG/M
3300031754|Ga0307475_10131616All Organisms → cellular organisms → Bacteria1976Open in IMG/M
3300031820|Ga0307473_11163530All Organisms → cellular organisms → Bacteria571Open in IMG/M
3300031823|Ga0307478_10108266All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2162Open in IMG/M
3300031823|Ga0307478_10725804All Organisms → cellular organisms → Bacteria → Acidobacteria832Open in IMG/M
3300031823|Ga0307478_11101768All Organisms → cellular organisms → Bacteria → Acidobacteria662Open in IMG/M
3300031954|Ga0306926_11078048All Organisms → cellular organisms → Bacteria → Acidobacteria951Open in IMG/M
3300032035|Ga0310911_10379408All Organisms → cellular organisms → Bacteria → Acidobacteria817Open in IMG/M
3300032076|Ga0306924_10317203All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1788Open in IMG/M
3300032261|Ga0306920_102874137All Organisms → cellular organisms → Bacteria → Acidobacteria654Open in IMG/M
3300032782|Ga0335082_10921038All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium737Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil28.08%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil9.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil8.22%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil7.53%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment5.48%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil5.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.11%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.42%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.42%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.42%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil3.42%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.74%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.05%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.05%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.37%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.37%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.37%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.37%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.68%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil0.68%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.68%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.68%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.68%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.68%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.68%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.68%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005541Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1EnvironmentalOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006050Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014EnvironmentalOpen in IMG/M
3300006059Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016341Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170EnvironmentalOpen in IMG/M
3300016404Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082EnvironmentalOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017930Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017995Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_1EnvironmentalOpen in IMG/M
3300018012Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5EnvironmentalOpen in IMG/M
3300018085Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_20_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019361Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025899Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026294Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-050 (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026342Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027069Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes)EnvironmentalOpen in IMG/M
3300027174Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes)EnvironmentalOpen in IMG/M
3300027388Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028145Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK04EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300032035Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032782Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10479324523300000364SoilDPNNMGMLLLVSDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKXNDTSXARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKTLPATTAAARKKPS*
JGI1027J11758_1229735513300000789SoilKDGITYIQNAVFSGVYQAKDAGKRAVLLVRFAQIFSDSPYASQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNVGMLLLLSDYYGEKGEQLDKXETYAKKAVGVLETTKKPNEMTDDQWKQQSQLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAMEAFTQAASVNSPYKGPAQEKLKAMAAPVRKKAS*
F14TB_10157152013300001431SoilLLRFAKIFPDSPYALPAMGVAATSYQQAQNTPKMLEVANAILAKDANNLGMLLLVSDYYGEKGEQLDKAESYAQKAVALADSAPRPANLTDEQWSQQTALQRGLALSALGQANLQKKNNLQAVQNFQAAARLLKSNDASYARNAYRMGFALINLKKIPEARAAFTEAASVNSPYKGPAQDKLKTLPARAAAPRKPG*
JGI25389J43894_107268613300002916Grasslands SoilQTLENNKDGIAYIQQAVFSGAYQAKDVAKRAGLLTRFVQIFPDSPYANQALGVAATAYQQAQNASKMLEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKRAVAALQTAGKPESVTDEQWVQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDXGSYARNQYRLGFALLNLKRNAEAKEAFT
JGIcombinedJ51221_1022426813300003505Forest SoilEQVWADQKVHTLESNKDAITYIQQAVFSGVYQVKDPGKRADLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNVGMLLLVSDYYSEKGEQLDKAEAYAKKAIAALDTAKKPDEMTDEQWKQQSGLQKGLALSSLGEVDIQKKDNVTAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNPEAKEAFTKAASVNSPYKGPAQEKLKGLEGASAAPARKKAS
Ga0062595_10262071113300004479SoilTHFAELFPDSPYANQALGVAATSYQQAQNIPKMLEVANGVLAKDPNNIGMLLLLSDYYSEKGEHLDKAEASAKKAISLLGTAQKPEGVKDEQWQKQLSLQKGLALSSLGQVNLQKKDNVQAVESFQSAAPLLKADEGSFGRNQYWLGFALLNLKKNAEAKEAFTQAAS
Ga0062591_10000180193300004643SoilQNAPKMMEVANAILAKDPNNMGMLLLVSDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKRIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS*
Ga0066680_1096060413300005174SoilAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGVTDEQWTQQKGLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQDKLKGLAVPPRRKAS*
Ga0066690_1071991813300005177SoilFRGVYQVKDPARRAAFLVRFAQAFPDSTFANQALGVAATAYQQAQNTAKMLEVANGLLVKDPNNIGMLLLLSDYYGEKGEQLDKAEGYAKKAISTLETAAKPEGVTGEQWAQQKGLQKGLALSSLGQINIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKRSAEAKEAFTQAASVNSPYKALAQEKLRGLAGPARRKAS*
Ga0066684_1013969413300005179SoilEKAGGIMQRYKAAPPPEGTSAEMWKEQQTHTLEANKDSYTYVQQLVFRGVYQVKDPARRAAFLVRFAQAFPDSTFANQALGVAATAYQQAQNTAKMLEVANGLLVKDPNNIGMLLLLSDYYGEKGEQLDKAEGYAKKAISTLETAAKPEGVTGEQWAQQKGLQKGLALSSLGQINIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKRSAEAKEAFTQAASVNSPYKALAQEKLRGLAGPARRKAS*
Ga0066388_10515192013300005332Tropical Forest SoilAPSGMSADAWQQNRSQTLAENKDTIAYLQQLVYTGLFQSPILVKDPGKRASLLARFAQGFPDSPYANPALGVAATSYQQAQNPAKMLEVANGLLAKDPENLGMLLLVSDYYSEKNEQLDKAESYAKKAIAVLAAAKKPDGVADDQWQQQTSLQRGLALSSLGQVNMEKKDNAQAVQNFRAAAPLVKSDAITYARNQYRLGFALVNLKRMPEAKEAFTQAA
Ga0066686_1048613623300005446SoilAYQARDVAKRAGLLTRFAQIFPDSPYANQALGVAATAFQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKKAVAVLQTAGKPEGVTDEQWVQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNVEAKEAFTQAASVNSPYKALAQEKLKASAGASAARQKTP*
Ga0066681_1007721533300005451SoilIIQRFKAAPAPEGTAAAAWEEQKARTLEANKDTLVYVQQLLFSGVYQAKEPSKRAALLVKFAQAFPDSPLANQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVISALESAAKPEGLTDEQWTQQKGLQQGLALSSLGQINIEKKDNAQAVENLKAAAPLVRPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLRGLAGPAAAKRKPS*
Ga0070697_10085500813300005536Corn, Switchgrass And Miscanthus RhizosphereEYGDKLFALDPDNFANAVNMVRAANEKGDNDKLMGYGEKASGILQRFKAAPPPSGTEADAWQRQKDATLESNKDNVAYVEQAMYMGAYRTTNAPKRAEMFTHFAELFPDSPYANQALGVAATSYQQAQNIPKMLEVANGVLAKDPNNIGMLLLLSDYYSEKGEHLDKAEASAKKAISLLGTAQKPEGVKDEQWQKQLSLQKGLALSSLGQVNLQKKDNVQAVESFQSAAPLLKADEGSFGRNQYWLGFALLNLKKNAEAKEAFTQAAS
Ga0070730_1097754913300005537Surface SoilATLESNKDNIAYVEQAMYMGAYRTTNAPKRAEMFTHFAALFPDSPYANQALGVAATSYQQAQNIPKMLEVANGVLAKDPNNIGMLLLLSDYYSEKGEHLDKAEASAKKVISLLGTAQKPEGVTDEQWQKQLSLQKGLALSSLGQVNLQKKDNAQAVESFKSAAPLLKADEGSFGRN
Ga0070733_1054860813300005541Surface SoilGMNMVRAASEKGDADKLMAYGEKTGGILQRYKAASAPDGADKDSWTRQRAQTLDSNKDNITYVEQVVYGAAYQTQNPAKRAQLLTGFAQWFPDSQYSNQALVVAAGIYEQLQNGTKMLEVANGLLTKDPNNIGMLLLLSDYYSEKGEQLDKAESSAKKAITLLGTAQKPEGVTDEQWTQQVSLQKGLALSSLGQVNIQKKDNATAVDNFKSAAPLLKADENSYGRNQYRLGFALLNLKKNAEAKQAFTEAASVNSPY
Ga0068854_10228169113300005578Corn RhizosphereNAPKMMEVANAILAKDPNNMGMLLLVSDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS
Ga0066654_1030963123300005587SoilGVAATAYQQAQNTAKMLEVANGLLVKDPNNIGMLLLLSDYYGEKGEQLDKAEGYAKKAISTLETAAKPEGVTGEQWAQQKGLQKGLALSSLGQINIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKRSAEAKEAFTQAASVNSPYKALAQEKLRGLAGPARRKAS*
Ga0070762_1011785633300005602SoilDNWERQKTATLESNKDTTTYVEQAIFLGVYRTQNPAKRAAQLTHFAELFPASPFAIQALGVAATSYQQAQNTSKMLEVANGVLAKDPNNIGMLLLLSDYYSEKPDQLDKAEASAKKVIAVLPTATKPEGVTDEQWQAQVSLEKGLALSSLGQVNISKKDNAAAVENLKAAAPLLKADENSYGKNQYRLGFALLNLKRNAEAKDAFTQSASVNSAYKSLAQAKLKTFDTAAAKKKS*
Ga0070762_1066656413300005602SoilLGVAATAYQQAQSAPKMLEVANGLLAKDPNNVGMLLLVSDYYGEKGEQLDKAEAYAKKAIAVLDGAKKPDEMTDDQWKQQSGLQKGLALSSLGEVDIQKKDNATAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGPAQEKLKGLAGASAAPARKKAS*
Ga0070762_1114996213300005602SoilDTWERQKAATLESNKENITYIEQAVFLGVYRTTNPAKRAALLTHFAELFPASPFANQALGVAATSYQQAQNVPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEASAKKAIALLGTAPKPEGVTDEQWQRQVSLEKGLALSSLGQVSIQKKDNAQAAENFKAAAPLLKTDE
Ga0070763_1069137713300005610SoilAPSGMEADSWERQKTTTLDSNKDNITYVEQAIFLGVYRTQNPAKRASQLTHFAELFPASPFAIQALGVAATSYQQAQNTSKMLEVANGVLAKDPNNIGMLLLLSDYYSEKPDQLDKAEASAKKVIAVLPTATKPEGVTDEQWQAQVSLEKGLALSSLGQVNISKKDNAAAVENLKAAAPLLKADENSYGKNQYRLGFA
Ga0066651_1069290813300006031SoilASDSERQKAQTLENNKDGIAYIQQAVFSGAYQAKDPAKRAGLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKVVAVLQTAGKPEGVTDEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDG
Ga0066656_1087204913300006034SoilAPAATLEANKDGITYIQNAVFSAVYQTKDAGKRASLLVRFAQIFADSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNFRAAARLLKPDDGSYARNQYRLGFALLNLK
Ga0066656_1087704713300006034SoilLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKKAVAVLQTAGKPEGVTDEQWVQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNVEAKEAFTQAASVNSPYKALAQEKLKASAGASAARQKTP*
Ga0075028_10036790213300006050WatershedsEGDYPILANGLYVDYYLAQKNYDKALEYGDKLFALDQDNFQNAMNMVRAASEKGDTDRLISYGEKAQGILERYKKAPAPEGASAEAWNQQKAQTLEANKESITYVQNAVFSGVYQAKDAGKRAALLVRFAQIFPDSPYSNQALGVAATAYQQAQNASKMLEVANGLLAKDPNNVGMLLLLSDYYGEKGEQLDKAEAFAKKAVTVLETAKKPDEMTDDQWKQQSELQKGLALSSLGQVNIQKKDNAQAVTNLRAAAPLLKRDDGSYARNQYR
Ga0075028_10046954213300006050WatershedsLDPDSFQNAMNMIRVASEKGDPERVVGYGEKAQGILKRFKDAPAPAGTEPKVWEEQKAKTLESNKDGVAYIQQAVYNGALRASDAGKRASLLTRFAQAFPDSPYTNQALGVAATSYLQAQNAPRMLEVANGLLAKDPNNLGMLLVLSDYYCDKPDQFAKAETYAKKAISVLDAAPRPEGLTDAQWAQQKGLQKGLALSSLGQVNIEKKDNAQAVENLRAAAPLLKPDDGSYGRNQYRLGFALA
Ga0075017_10092478813300006059WatershedsRFQVAPAPAGTAEPVWVDQKARTLEANKDGMTYIQQAVFGGVYQAKDPAKRAASLAKFAQIFPDSPYANQALGVAATSYQQAQNGPKMLEVANGLLAKDPSNLGMLLLISDYFSEKGEQLDKAEAYAKKAITVLETAQKPEGLADDQWAQQKSLQKGLALSSLGQVNIERKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVN
Ga0070716_10015466813300006173Corn, Switchgrass And Miscanthus RhizosphereVSEQTWKDRKEQALADLKDSITYVEQLLFNGAYQVRDVTKRAGLLVRFAQLFPDSPYAGQALGVAAASYRQMQNTPKMLEVANGLLAKDPNNLGMLILLADYYSEKGEQLEKAEAEAKKAVSLLNSAAKPEGMTDEQWQAQNALQKGLALSALGQINIQKKDNATAVQNFKTAAPLLKSDAGSYARNQYRLGFALLNLKKMPEAKAALTEAASLNTPYKALAQDKLKSLPATTAGKN*
Ga0079220_1019705523300006806Agricultural SoilFQNALNMVRAAAEKGDVDKLSSYGEKAGGIIQRYKAAPAPEGTAAAVWDEQKARTLEANKDSFVYVQQLVFSGVYQAKEPAKRAALLVKFAQAFPDSALGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGLTDEQWAHQKGLQKGLALSSLGQVNIEKKDNTQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALALEKLKGLAGPATARRKPA*
Ga0079220_1169338413300006806Agricultural SoilYQTPEPAKRAPLLVRFAKAFPDSPYAVPALGVAATAYQQAQDTPRMLEVADGLLAKDPNNLGMLLLLSDYYGEKGEQLAKAEEFARKAVSLADTAPKPDGVSDDQWNHEKALQKGLALSALGQINLQKKDNAQAVQNFVAAAPLLKGNEASYGRNQYRLGFAYLNLKKTAEARAALSEAAAAKG
Ga0075425_10064043613300006854Populus RhizosphereRTLEANKDSFVYVQQLVFSGVYQAKEPAKRAALLVKFAQAFPDSALGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGLTDEQWAHQKGLQKGLALSSLGQVNIEKKDNTQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALALEKLKGLAGPATARRKPA*
Ga0075426_1149283513300006903Populus RhizosphereRTLEANKDSFVYVQQLVFSGVYQAKEPAKRAALLVKFAQAFPDSALGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGLTDEQWAHQKGLQKGLALSSLGQVNIEKKDNTQAVENLKAAAPLVKPDD
Ga0075424_10124303823300006904Populus RhizosphereQDPAKRADQLLRFAKIFPDSPYALPAMGVAATSYQQAQNTPKMLEVANAILAKDANNLGMLLLVSDYYGEKGEQLDKAESYAQKAVALADSAPRPANLTDEQWSQQTALQRGLALSALGQVNLQKKNNLQAVQNFQAAARLLKSNDASYARNAYRMGFALINLKKIPEARAAFTEAASVNSPYKGPAQDKLKTLPARAAAPRKPS*
Ga0079219_1059770923300006954Agricultural SoilSAYAVPAMGVAATAYQQAQDTPKMLEVADGLLAKDPNNLGMLLLLSDYYGEKGEQLAKAEEFAKKAVGLADTAPKPDGVPDDQWNREKTLQKGLALSALGQINLQKKDNAQAVQNFVAAAPLLKGNEASYGRNQYRLGFAYLNLKKTAEARAAFSEAAAAKGPYQTLAQEKLKSLPAAPSRRRRAS*
Ga0099794_1071657813300007265Vadose Zone SoilAGAAASDWERQRSQTLESNKDAIAYIQQAVFSGAYQAKDVVKRAGLLTRFAQVFPDSPYASQALGVAATSYQQAQNVPKMLEVANGLLAKDANNLGMLLLLSDYYSEKGEQLDKAETYAKKAAAVLQTAQKPEGVTDEQWALQKALQKGLALSSLGQVNIQKKDNAQAVENLKTAAP
Ga0066710_10174455413300009012Grasslands SoilDKLSGYGEKASGILKRFKEAPAPSGMSPDAWQQNRAQTLADNKDTILYLQQVLYNGFYQSPVLTKDPGKRATLLARFAQQFPDSPYANPALGVAATSYQQAQNPSKMLEVANGLLAKDPENLGMLLLVSDYYSEKGEQLEKAEGYAKKAIAALAATKKPEGIADDQWLQQTSLQKGLALSALGQVNMEKKDNAQAVQNFRAAAPLVKSDAVTYARNQYRLGFALVNLKRMPEAKEAFTQAASVNSPYKSLAQEKLKSFAATAKKGAS
Ga0126384_1118334513300010046Tropical Forest SoilKDANKRAAYLVHFAQSFPDSPYADQAMGVAAATYQQTQNAPKMLEAANGLLAKDPNNLGMLILLADYYSEKGEQLDKAEASAKKAISVLETAKKPEGVTDDRWAQQTGLQKGLALSALGQVNIQKKNNAGEVESFRSASSLLKPDPGSYARNQYRLAYALLNLKKIPEAKAALTEAASVNSPYKQPAQEKLKALNAAVAAKAKS*
Ga0126373_1308831513300010048Tropical Forest SoilPDSAFANQALGVAATAYQQAQNTAKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLETAAKPEGVTNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEK
Ga0126370_1121966613300010358Tropical Forest SoilAKAFPDSTYAAQAMGVAASSYQQAQNSSKMLEVANSILAKDPSNLGMLLLVSDYYGEKGEQLDKAESYAQKAATLAASAPRPENVSEDQWKQQTNLQKGLALSTLGQINLQKKNNTQAVQNFQAAAPLLKSDDTSYARNQYRLGFAYINLKNAPGARTAFTECASVNSPYKQYALEKLKGIPATAAAKRTR*
Ga0126376_1062446513300010359Tropical Forest SoilGIAATAYQQALNTAKMLEVANGLLAKDPNNIGMLLLLSDYYSDKGEQIDKAGTYAKKAISLLETAQKPEGITDEQWAKQKSLQKGLALSSLGQVNIEKKDNAQAADNLRAAAPLVKSDNMSYAHNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKDRALEKLKGLAAPARRKPS*
Ga0126376_1140623223300010359Tropical Forest SoilYQQAQNTPKMLEVANGLLTKNPSDIGMLLLLSDYFSEKGEQLDKDESYAKKAISTLETAPKPEGATDEQWTQQKALQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAGPAKRKAS*
Ga0126377_1347767613300010362Tropical Forest SoilLVYNGAFQTPDAAKRAAYLTRFAQAFPDSPYSAPAMGVAATAWQQAQNPAKMLEVANTLLAKDPNNLGMLLLLSDYYGQKSEQLPKAEEFAKRAATIVDGLKKPDDITEDQWKQQTSLQKGLALSSLGQINMQKKDNATAVTSFQAAAPLLKSESASYGRNQYLLGFAL
Ga0134066_1006476523300010364Grasslands SoilDRLSSYGEKAGGIIQRYKAAPAPEGTSEAGWQEQKTRTLEANKDSYTYVQQLVFSGVYQAKDPGKRAALLVKFAQAFPDSAYANQALGVAATAYQQAQNTSKMLEVANGLLAKAPNDIGMLLLLSDYYGEKGEQLDKAESYAKKAIATLETAAKPEGVTDEQWTQQKALQKGLALSSLGQVNIAKKDNAQAVENLRTAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGLAQDKLKGLAAPARRKAS*
Ga0134066_1032265013300010364Grasslands SoilPAPEGTAAAAWEEQKARTLEANKDTLVYVQQLLFSGVYQAKEPSKRAALLLKFAQAFPDSPLANQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVISALESAAKPEGLTDEQWTQQKGLQQGLALSSLGQINIEKKDNAQAVENLKAAAPLVRPDDG
Ga0150983_1141639913300011120Forest SoilAKFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKPDQVAKAETFAKKAISALDTAPKPEGLTDDQWAQQKNLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLVKVDDGSYARNQYRLGFALVNLKRNAEAKDAFTQAASVNSPYKALAQDKLKGLAAPATARKKAS*
Ga0137392_1044168313300011269Vadose Zone SoilDKLFALDPDNFQNALNMIRAASEKGDADRLISYGEKAQGILKRYKDAPAPAGAAASDWERQRSQTLESNKDAIAYIQQAVFSGAYQAKDAAKRAALLTRFAQTYPDSPYANQALGVAATSYQQAQNVPKMQEVANGLLAKDPNNPGMLLLLSDYYSEKGEQLDKAEAYAKKAAAVLQTAQKPEGVTDEQWALQKALQKGLALSSLGQVNIQKKDNAQAVENLKTAAPLLKPDDGSYARNQYRLGFALLNLKKNADAKEAFTQAASVNSPYKALAQEKLKASAGAAQARRKSP*
Ga0137392_1151719813300011269Vadose Zone SoilQQAVYNGALQARDAGKRAGLLARFAQAYPNSPYTSQALGVAATSYLQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKTDQLAKAEAYAKKTISLLDAAPKPEGLTDDQWGKQKGLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLLKPDDGSYGRNQYRLGFALLNLKR
Ga0137391_1147340313300011270Vadose Zone SoilAASDWERQRSQTLESNKDAIAYIQQAVFSGAYQAKDAAKRAALLTRFAQIYPDSPYANQALGVAAASYQQAQNVPKMQEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKKAAAVLQTAQKPEGVTDEQWALQKALQKGLALSSLGQVNIQKKDNAQAVENLKTAAP
Ga0137391_1152488413300011270Vadose Zone SoilGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0137389_1143857613300012096Vadose Zone SoilDAGKRAGLLTRFAQIFPDSPYANQALGVAAASYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKTDQLAKAETYAKKAISLLDGAPKPEGVTDDQWAQQKSLQKGLALSSLGQVNIEKKDNAQAVENLRAAAPLLKTDDGSYGRNQYRLGFALLNLKKTAEAKKAISLLDAAPKPEGVTDEQWSQQKS
Ga0137382_1017490513300012200Vadose Zone SoilAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKVVAVLQTAGKPEGVTDEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEKLRASGAAPARRKTP*
Ga0137365_1024267613300012201Vadose Zone SoilNKDSIVYIQQAIFSGAYQAKDAPKRAALLTKFAQIFPDSPYANQALGVAATAYQQAQNGPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0137363_1099280723300012202Vadose Zone SoilLLAKDPSNLGMLLLLSDYYCDKADQLAKAEGYAKKATSVVDTSAKPEGATDEQWTQQKSLQKGLALSSLGQVNIGKKDNAQAVENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLKASAGAAPTRRKAP*
Ga0137399_1004082113300012203Vadose Zone SoilVANGLLAKDPNNLGMLLVLSDYYCDKPDQFAKAEPYAKKAISVVDSAPKPEGLTDEQWARQKGLQKGLALSSLGQVNIEKKDNAQAVGNLKAAAPLVKVDDASYGRNQYRLGFALANLKRTADAKEAFTQAASVNGPYKALAQDKLKAAAGPATARKKTP*
Ga0137399_1025988413300012203Vadose Zone SoilLFALDQDNFQNAMNMVRAASEKGDADRLISDGEKAEGILQRYKNAPAPRGSSAEAWTQQKAQTLEANKDGIAYIQNAVFSGVYQTKDTGKRAALLVRFAQVFADSPYASQALGVAATAYQQAQNASRMLEVANGLLGKDPNNVGMLLLLSDYYGEKGEQLDKAEAYAKKAASVLEAAKKPDEMTDEQWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137362_1060579413300012205Vadose Zone SoilVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS*
Ga0137380_1054074423300012206Vadose Zone SoilAINMVRAAAEKGDVDKLSSYGEKAGGIIQRFKAAPAPEGTPAAGWDEQKTRTLEANKDSFVYVQQLVFTGMYQVKEPAKRAALLVKFAQAFPDSPLGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGVTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKTAEAKEAFTQAASVNSPYKGLAQDKLKGLAVPPRRKAS*
Ga0137381_1020238413300012207Vadose Zone SoilAEKGDVDKLSSYGEKAGGIIQRFKAAPAPEGTPAAGWDEQKTRTLEANKDSFVYVQQLVFTGMYQVKEPAKRAALLVKFAQAFPDSPLGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGVTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKTAEAKEAFTQAASVNSPYKGLAQDKLKGLAVPPRRKAS*
Ga0137381_1109880813300012207Vadose Zone SoilKTQGILQRYKDSPAPAGTSAETWADQKARTLESNKDSIVYIQQAIFSGAYQAKDAPKRAALLTKFAQIFPDSPYANQALGVAATAYQQAQNGPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALGSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQA
Ga0137376_1170491613300012208Vadose Zone SoilPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKVVAVLQTAGKPEGVTDEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEK
Ga0137379_1106953413300012209Vadose Zone SoilLISYGEKTQGILQRYKDSPAPAGTSAETWADQKARTLESNKDSIVYIQQAIFSGAYQAKDAPKRAALLTKFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKTISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKEAFAQAAS
Ga0137378_1040004613300012210Vadose Zone SoilTWADQKARTLESNKDSIVYIQQAIFSGAYQAKDAPKRAALLTKFAQIFPDSPYANQALGVAATAYQQAQNGPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0137378_1087151713300012210Vadose Zone SoilPDSFQNAINMVRAAAEKGDVDKLSSYGEKAGGIIQRFKAAPAPEGTPAAGWDEQKTRTLEANKDSFVYVQQLVFTGMYQVKEPAKRAALLVKFAQAFPDSPLGNQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVIAALEGAPKPEGVTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLVKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQDK
Ga0137384_1117229513300012357Vadose Zone SoilIQRYKAAPAPAGTQEANWQEQKARALEANKDSYTYVQQLVFSGVYQAKDPAKRATLLVKFAQAFPDSAFANQALGVAATAYQQAQNTAKMLDVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAEAYAKKAISALETVAKPESVTDEQWTLQKALQKGLALSSLGQVNIEKKENAQAVENLKAAAALVKPDDVSYARNQYRL
Ga0137384_1151232813300012357Vadose Zone SoilQPAVWEEQKSKTLESNKDGMTYIQQAVFSGAYQAKDAAKRATLLVRFAQTFADSPYAIQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAVTVLGTAPKPEGVADEQWTQQKSLQKGLALSSLGQVNIEKKDNAQAVTNLKAAA
Ga0137361_1002750053300012362Vadose Zone SoilVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS*
Ga0137361_1160998813300012362Vadose Zone SoilGSSAEAWTQQKAQTLEANKDGITYIQNGVFSGVYQTKDAGKRAALLVRFAQIFPDSPYANSALGVAATAYQQAQNAPKMLEVANGLLTKDPNNLGMLLLLSDYYGERGEQLDKAEAYAKKAVSVLETAKKPDEMTDDLWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQY
Ga0137397_1044801813300012685Vadose Zone SoilALGVAATAYQQAQNVPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMTDDQWKQQSELQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPPRRKPS*
Ga0137396_1128984313300012918Vadose Zone SoilEAWTQQKAQTLEANKDGITYIQNAVFSGVYRTKDAGKRAALLVLFAQIFADSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNFR
Ga0137394_1031418813300012922Vadose Zone SoilSKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNAPKMLEVANGLLTKDPNNVGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMADDQWKTQSGLQKGLALSSLGQINIQKKDNAQAVTNFRTAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAREAFTQAASVNSPYKGPAQEKLKAIAAPPRRKAS*
Ga0137394_1087367713300012922Vadose Zone SoilILQRYKSAPAPAGSSAEAWTQQKAQTLEANKDGIAYIQNAVFSSVYQTKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNVPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAESYAKKAVSVLETAKKPDEMGDDQWKQQSELQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPPRRK
Ga0137419_1030806613300012925Vadose Zone SoilVSDYYGEKGEQLDKAEAYAQKAATLADSAPRPENVTEDQWKQQTALQKGLALSTLGQVNMQKKNNAQAIQNFQAAAPLLKSSDVGYARNEYRLGFAFINLRKIPEAKTAFTQAASVNSPYKQPALDKIKALPASPPTRRKAS*
Ga0137419_1086863213300012925Vadose Zone SoilEYGDKLFALDQDNFQNAMNMVRAASEKGDADRLISYCEKAQGILQRYKNAPAPVGSSAEAWTQQKAQTLEANKDGITYIQNAVFSGVYQTKDAGKRAALLVRFAQMFPDSPYASQALGVAATAYQQAQNAPKMLEVANGLLTKDPNNVGMLLLLSDYYGEKGEQLDKAETYAKKAVSVLETAKKPDEMTDEQWKLQSGLQKGLALSALGQINLQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLG
Ga0137416_1032787013300012927Vadose Zone SoilAYQQAQNAPKMLEVANGLLVKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVTDERWAQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPARRKTP*
Ga0137416_1054602813300012927Vadose Zone SoilASEKGDADRLISYGEKAQGILQRYKNAPAPAGSSSEAWTQQKAQTLEANKDGIAYIQNAVFSGVYQTNDVGKRAALLVRFAQIFPDSPYANQALGVAATAYQQAQNGPKMLEVANGLLAKDHNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAASVLETAKKPDEMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137404_1140834213300012929Vadose Zone SoilAPAPAGSSAEAWTQQKAQTLEANKDGIAYIWNAVFSGVYQTKDAAKRAALLVRFAQIFPDSPYANSALGVAATAYQQAQNAPKMLEVANGLLTKDPNNLGMLLLLSDYYGERGEQLDKAEAYAKKAVSVLETAKKPDEMTDDQWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAF
Ga0137410_1044865313300012944Vadose Zone SoilEYGDKLFALDQDNFQNAMNMVRAASEKGDADRLISYGEKAQGILQRYKNAPAPAGSSSEAWTQQKAQTLEANKDGIAYIQNAVFSGVYQTKDVGKRAALLVRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAASVLETAKKPDEMTDEQWKTQSGLQKGLALSSLGQINIRKKDDAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRSPEAKEAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS*
Ga0137410_1079813313300012944Vadose Zone SoilEANKDGITYIQNAVFSGVYQSKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNAPKMLEVANGLLTKDPNNVGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMADDQWKTQSGLQKGLALSSLGQINIQKKDNAQAVTNFRTAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPPRRKAS*
Ga0126375_1094004813300012948Tropical Forest SoilKRAGFLVKFAQVFPDSPYSTQALGVAATAYQEAQNTAKMLEVANGLLNKDPNNIGMLLLLSDYYGERGEQLDKAEAFAKKVISALDAAPKPEGVTDEQWTQQKSLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLVKADNTSYAHNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKDRALEKLKGLAAPARRKPS*
Ga0134087_1011395613300012977Grasslands SoilKDPAKRAALLTQFAQIFPDSPYVNQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVADEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLVKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPRKTP*
Ga0134075_1017773213300014154Grasslands SoilAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAEKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0137409_1098023713300015245Vadose Zone SoilDGITYIQNAVFSGVYQSKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNAPKMLEVANGLLTKDPNNVGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMADDQWKTQSGLQKGLALSSLGQINIQKKDNAQAVTNFRTAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPPRRKAS*
Ga0134089_1020157413300015358Grasslands SoilKTQGILQRYKDSPAPAGTSAETWADQKARTLESNKDSIVYIQQAIFSGAYQAKDAAKRAALLTKFAQIFPDSPYANQALGVAATAYQQAQNGPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAQKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS*
Ga0132255_10280766013300015374Arabidopsis RhizosphereDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS*
Ga0182041_1186131113300016294SoilLVFSGVYQAKDPAKRAALLVRFAQAFPDSAFANQALGVAATAYQQAQNTTKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQA
Ga0182035_1146342313300016341SoilANGLLTKDPNNLGMLLLLAGYYGEKGEQLDKAEAYAKKATGLADTAPKPEGVADDQWKQQTTLQKGLAFSALGQVNLQKKDNASAVQNLQTAAPLVKSDNFSYAKNQYRLAFAFLNLKKLPEAKQAFTEAASVNTPYKQPALDKLKGLPAKAPATGRKPS
Ga0182037_1007913513300016404SoilDRLISYGEKAGAIVQRYKAAAAPEGTSAAAWDEQKTRALEANKDSYTYVQQLVFSGVYQAKDPAKRAALLVRFAQAFPDSAFANQALGVAATAYQQAQNTTKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0187824_1011794313300017927Freshwater SedimentGVAAASYQQAQNIPKMLEVADGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAETSANKAIALLGSAQKPASVTDEQWQQEISLQKGLALSSLGQVSIQKKDNAHAVENFEAAAPLLKPDEGSYGRNQYWLGFALLNLKRNAEAKEAFTQSASVNSAYKSLAQAKLKTFAGASRH
Ga0187825_1023977113300017930Freshwater SedimentAATLESNKDNINYIEEAVYMVDYRTPSAAKRAAQMTHFAQIFPDSPYANQALGVAAASYQQAQNIPKMLEVADGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAETSANKAIALLGSAQKPASVTDEQWQQEISLQKGLALSSLGQVSIQKKDNAHAVENFEAAAPLLKPDEGSYGRNQYWLGFALLNLKRNAEAKEAFTQSASVNSAYKSLAQAKLKTFA
Ga0187819_1014815133300017943Freshwater SedimentQALGMAAAAYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYAKKIPPLLETAPKPEGMTDDQWAKQKALQKGLALSALGQVNIQKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLALDKLKDMAKPVHHKAS
Ga0187817_1039946113300017955Freshwater SedimentRASYLARFGQAFPDSPYANQALGMAAAGYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYAKKVPPLLEAAQKPEGMTDEQWAKEKALQKGLALSALGQVNIEKKDNAQAAENLKAAAPLLKSDDVSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKALAQDKLKDMAKPVHHKPSS
Ga0187817_1051717713300017955Freshwater SedimentSEKGDAEKLAAYGEKAAAILKRYKGSPAPTGTSSESWEGQKAQALANNKDGITYVEQLVFNGAYQTKDPAKRASYLARFGQAFPDSPYANQALGMAAAAYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYARKIPPLLDTAPKPEGMTDDQWAKQKALQKGLALSALGQVNIQKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLAL
Ga0187817_1110759013300017955Freshwater SedimentSPYANQALGMAAAAYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYAKKIPPLLETAPKPEGMTDDQWAKQKALQKGLALSALGQVNIQKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLAL
Ga0187778_1055980113300017961Tropical PeatlandAQNAPKMLEAANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAETYAKKVGPLLDAAQKPEGMADDQWAKEKALQKGLALSALGQVNIQKKDNAQAADNLKAAAPLLKADDGSYARNQYRLGFAYLNLKKNPEAKEAFTQAASVNSPYKQLALDKLKAMATPAHHKAS
Ga0187816_1047310113300017995Freshwater SedimentAYGEKAAAILKRYKGSPAPTGTSPESWEGQKAQALANNKDGITYVEQLVFNGAYQTKDPAKRALYLARFGQAFPDSPYANQALGMAAAAYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYAKKIPPLLETAPKPEGMTDDQWAKQKALQKGLALSALGQVNIQKKDNAQA
Ga0187810_1021782913300018012Freshwater SedimentPESWEGQKAQTLANNKDGITYVQQLVFNGAYQTKDAAKRASYLARFGQAFPDSPYANQALGMAAAGYQQAQNAPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEAYARKIPPMLDTAPKPEGMTDDQWAKQKALQKGLALSALGQVNIEKKDNAQAAENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKQLALDKLKDMAKPVHHKAS
Ga0187772_1016104213300018085Tropical PeatlandAGTSDAVWAEQKTRTLEANKDSYTYIQQLVLNGVYQAKDAAKRAALLVKFATIFPDSPYAVQALGVAATAYQQAQNAPKMLEVANGLLAKDPDNLGMLLLLSDYYCDKPDQLDKAATYAKKAISLLDTAQKPEGVTDEQWTQQKALQKGLALSSLGQVNIEKKDNAQAVDNLKTAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKAMALDKLKSLSGAPAKKKA
Ga0066662_1288310813300018468Grasslands SoilKSQTLENNKDGIAYIQQAVYNGAIQAKDAGKRAAYLARFAQAFPDSPYANQALGVAATSYLQAQNAPKMLEVANGLLAKDPNNVGMLLLLSDYYCDKADQLTKAETYAKKAISVLDAAPKPEGLAGEQWTQQKNLQKGLALSSLGQVNIGKKDNAQAVDNLKAAGPLLKA
Ga0066669_1149603913300018482Grasslands SoilENNKDGIAYIQQAVFSGAYQAKDPAKRAGLLTRFAQIFPDSPYVNQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVTDEQWTQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPY
Ga0173482_1006799913300019361SoilQAQNAPKMMEVANAILAKDPNNMGMLLLVSDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPGDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKRIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS
Ga0179592_1003881643300020199Vadose Zone SoilSDSERQKAQTLENNKDGIAYIQQAVFSGAYQTKDPAKRAGLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLVKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVTDERWAQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPARRKTP
Ga0210403_1011240343300020580SoilDKAFEFGDKLFALDPDSFTNAMNMIRAASEKGDADRLISYGEKAQAILKRYKEAPAPAGMDATQWAQQKTQTLEANKDGIAYMQQAVFSGAYQVKDAGKRAALLTKFAQIFPDSPYANQALGVAATSYQQAQNAPKMLEVASGLLAKDPNNLGMLLLLSDYYSEKGEQLDKADAYAKKVIAVLPAAKKPEGLNDEQWEQQKALQKGLALSSLGQVNIQKKDNAQAVENLKAAAPLLKSDDGSFARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYKGPAQDKLKAMAAPAKRKAS
Ga0210403_1097394213300020580SoilPAKRAAMLAKFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKPDQVAKAETFAKKAISALDTAPKPEGLTDDQWAQQKNLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLVKVDDGSYARNQYRLGFALVNLKRNAEAKDAFTQAASVNSPYKALAQDKLKGLAAPATARKKAS
Ga0210404_1066794113300021088SoilVHTLESNKDAITYIQQAVFSGVYQVKDPGKRADLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNVGMLLLISDYYSEKGEQLDKAEAYAKKAIAALDTAKKPDEMTDEQWKQQSGLQKGLALSSLGEVDIQKKDNVTAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNPEAKE
Ga0210406_1088014213300021168SoilNAVNMVRAASEKGDIDKLMAYGEKVPGILQRFKASPAPSGMETENWERQKAGTLESNKDNILYIQQAMFLGFYRTTNAAKRAALLTHFAELFPDSPYTNQALGVAATSYQQAQNVPKMLEVANGVLAKDPNNIGMLLLLSDYYSEKGEQLDKADASAKKVIALLGTVQKPEGVPDEQWRQQVSLQKGLALSALGQINIQKKDNAQAAENFKAASPLLKSDDGSYG
Ga0210394_1093702813300021420SoilAAGILKRYKDAPAPAGTSEQVWADQKVHTLESNKDAITYIQQAVFSGVYQVKDPGKRADLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNVGMLLLVSDYYSEKGEQLDKAEAYAKKAIAALDTAKKPDEMTDEQWKQQSGLQKGLALSSLGEVDIQKKDNVTAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNPEAKEAFTKAASVNSPYKGPAQEKLKGLEGASA
Ga0210409_1024516313300021559SoilQKARTLEANKDGMTYIQQAVFGGVYQAKDPAKRAAMLAKFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKPDQVAKAETFAKKAISALDTAPKPEGLTDDQWAQQKNLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLVKVDDGSYARNQYRLGFALVNLKRNAEAKDAFTQAASVNSPYKALAQDKLKGLAAPATARKKAS
Ga0210409_1054302823300021559SoilAYQTKEAGKRAALLVRFAQAFPDSPYANQALGIAAAAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKGEAYAKKAIVVLGKATKTEGMTDEQWKQQSALQKGLALSSLGQVNIQKKDNAQAVANLSSAAPLLKSDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNTPYKALAQDKLKAMAAPAARKKAS
Ga0210409_1108424513300021559SoilENITYIEQAVFLGVYRTTNPAKRAALLTHFAELFPASPFANQALGVAATSYQQAQNVPKMLEVANGLLAKDPNNIGMLLLLSDYYSEKGEQLDKAEASAKKAIALLGTAPKPEGVTDEQWQRQVSLEKGLALSSLGQVSIQKKDNAQAAENFKAAAPLLKTDEGSYGRNQYRLGFALLNLKRNAEAKDAFAQSASVNSAYKTLAQAKLKTFDAAATKKKS
Ga0210409_1163227413300021559SoilEQQWKQNKALALEANKDGYTYVQQLVFSGVYQVKEPGKRAMLLARFGNTFPDSPYANQALGIAATSYAQAQNTPKMLEVANQLLAKDPDNLGMLLVLSDYYCDKADQLAKAETYAKKAISLLDTAPKPEGATDEQWSQQKNLQKGLALSSLGQVNIEKKANAQAVDNLKAAAP
Ga0210409_1168439013300021559SoilLEANKDGMTYIQQAVFGGVYQAKDPAKRAAMLAKFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKPDQVAKAETFAKKAISALDTAPKPEGLTDDQWAQQKALQKGLALSSLGQVNIEKKDNAQAVDDLKAAAPLVKVDDG
Ga0126371_1363686513300021560Tropical Forest SoilQKTRVLEANKDSYTYVQQLVFSGVYQAKDPAKRAALLVRFAKAFPDSAFANQALGVAATAYQQAQNTAKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLETAAKPEGVTNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADD
Ga0137417_113654413300024330Vadose Zone SoilLKRYKDAPAPAGAAASDWERQRSQTLESNKDAIAYIQQAVFSGAYQAKDAAKRAALLTRFAQTYPDSPYANQALGVAATSYQQAQNVPKMQEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKKAAAVLQTAQKPEGVTDEQWALQKALQKGLALSSLGQVNIQKKDNAQAVENLKTAAPLLKPDDGSYARNQYRLGFA
Ga0137417_136908023300024330Vadose Zone SoilVRFAQVFADSPYASQALGVAATAYQQAQNASRMLEVANGLLGKDPNNVGMLLLLSDYYGEKGEQLDKAEAYAKKAASVLEAAKKPDEMTDEQWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSLTKGPRKKN
Ga0207642_1055224123300025899Miscanthus RhizosphereASSYQQAQNAPKMMEVANAILAKDPNNMGMLLLVSDYYGEKGEQLDKAEGYAQKAVTLAGSAQKPSDVPEDQWKQQTALQKGLALSTLGQINLQKKNNSGAMQNFQAAAPLLKTNDTSYARNQYRLGFALLNLKKIPEAKAAFTEAASVNSPYKSYAQDKLKALPATTAARKKPS
Ga0209839_1028790413300026294SoilAYRVNDPSKRAGLLQRFATLFPDSPTAEQALGMSAFSYQAAQNRPKMLEVANGLLVKNPDNIGMLVLLADDYSEKNDQLDKAEADAKKAIALCDTAKKPEGVTDADWQNQLTLQKGLALSALGQVGIEKKDNLGAVKNLTAAAPLLKANANPYARNQYRLGFAYLNLKK
Ga0209239_118294413300026310Grasslands SoilKSQTLENNKDGIAYIQQAVFSGAYQAKDVAKRAGLLTRFVQIFPDSPYANQALGVAATAYQQAQNASKMLEVANGLLAKDPNNLGMLLLLSDYYSEKGEQLDKAEAYAKRAVAALQTAGKPESVTDEQWVQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKEAFTQAASVNSPYRALAQEKLKASAGGAAPARRKTP
Ga0209472_108151713300026323SoilIIQRFKAAPAPEGTAAAAWEEQKARTLEANKDTLVYVQQLLFSGVYQAKEPSKRAALLVKFAQAFPDSPLANQALGVAATAYQQAQNTAKMLEVANGLLAKDANNVGMLLLLSDYYGEKGEQLDKAEGYAKKVISALESAAKPEGLTDEQWTQQKGLQQGLALSSLGQINIEKKDNAQAVENLKAAAPLVRPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGLAQEKLRGLAGPAAAKRKPS
Ga0209473_128327313300026330SoilPPPEGTSAEMWKEQQTHTLEANKDSYTYVQQLVFRGVYQVKDPARRAAFLVRFAQAFPDSTFANQALGVAATAYQQAQNTAKMLEVANGLLVKDPNNIGMLLLLSDYYGEKGEQLDKAEGYAKKAISTLETAAKPEGVTGEQWAQQKGLQKGLALSSLGQINIEKKENAQAVENLKAAAPLVKAD
Ga0209057_121261413300026342SoilYQQAQNGPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAISVLEKAEKPEGMTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAAENLKAAAPLLKADDGSYARNQYRLGFALLNLKKNAEAKDAFAQAASVNSPYKVLAQEKLKGLAGAAPARRKPS
Ga0179587_1038847723300026557Vadose Zone SoilQTKDAGKRASLLVRFAQIFADSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPPRRKAS
Ga0208859_104285113300027069Forest SoilAGTAEQAWADQKAHTLESNKDAITYIQQAVFSGVYQVKDPGKRADLLTRFAQIFPDSPYAIQALGVAATAYQQAQSAPKMLEVANGLLAKDPNNVGMLLLVSDYYGEKGEQLDKAEAYAKKAIAVLDGAKKPDEMTDDQWKQQSGLQKGLALSSLGEVDIQKKDNATAVVNFRAAA
Ga0207948_101202513300027174Forest SoilKDAITYIQQAVFSGVYQVKDPGKRADLLTRFAQIFPDSPYAIQALGVAATAYQQAQSAPKMLEVANGLLAKDPNNVGMLLLVSDYYGEKGEQLDKAEAYAKKAIAVLDGAKKPDEMTDDQWKQQSGLQKGLALSSLGEVDIQKKDNVTAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNPEAKEAFTKAASVNSPYKGPAQEKLKGLEGASAAPARKKAS
Ga0208995_105602513300027388Forest SoilRLISYGEKAQGALQRYKNAPAPAGSSAEVWTQQKAQTLEANKDGIAYIQNAVFSGVYQTKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMTDEQWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNLRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKD
Ga0209588_100760443300027671Vadose Zone SoilSLLVRFAQIFADSPYANQALGVAATAYQQAQNAPKMLEVANGLLAKDPSNVGMLLLLSDYYGEKGEQLDKAEAYSKKAMTVLETAKKPEEMTDDQWKQQSDLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYRGPAQEKLKAMAAPPRRKAS
Ga0209275_1049490413300027884SoilALGVAATAYQQAQSAPKMLEVANGLLAKDPNNVGMLLLVSDYYGEKGEQLDKAEAYAKKAIAVLDGAKKPDEMTDDQWKQQSGLQKGLALSSLGEVDIQKKDNATAVVNFRAAAPLLKPDDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKGPAQEKLKGLEGASAAPARKKAS
Ga0209583_1080506013300027910WatershedsIQQAVFSGVYQAKDAAKRAGLLTRFAQIYPESPYANQALGVAATSYQQAQNAPKMLEVANGLLTKDPNNLGMLLLLSDYYCDKTDQLGKAEASAKKAISLLDSAPKPEGATDEQWTQQKSLQKGLALSSLGQVNIEKKDNAQAVENLKAAAPLLKPDDGSYARNQY
Ga0247663_101938223300028145SoilRQKDATLESNKDNIAYVEQAMYMGAYRTTNPSKRAAMFTHFAQLFQDSPYANQALGVAATSYQQAQNIPKMLEVANGLLAKDPNNIAMLLLLSDYYSEKGEQLDKAEASAKKAISLLATVQKPERVTDEQWQKQLSLQKGLALSSLGQVNIQKKDNAQAVESFKSATPLLKADEGSFGRNQYWLGFALLNLKRNAEAKEAFTQAASVNSAYKGLAQAKLKSFDAASRKKP
Ga0137415_1040522013300028536Vadose Zone SoilVASDSERQKAQTLENNKDGIAYIQQAVFSGAYQTKDPAKRAGLLTRFAQIFPDSPYANQALGVAATAYQQAQNAPKMLEVANGLLVKDPNNLGMLLLFSDYYSEKGEQLDKAEAYAKKAVAVLQTAEKPEGVTDERWAQQKALQKGLALSSLGQVNIQKKDNAQAAENLKAAAPLLKPDDGSYARNQYRLGFALLNLKRTAEAKEAFTQAASVNSPYKALAQEKLRASAGAAPARRKTP
Ga0307476_1105640513300031715Hardwood Forest SoilQKTQTLEANKDGIAYMQQAVFSGAYQVKDAGKRAALLTKFAQIFPDSPYANQALGVAATSYQQAQNAPKMLEVASGLLAKDPNNLGMLLLLSDYYSEKGEQLDKADAYAKKVIAVLPTAPKPEALSNEQWEQQKALQKGLALSSLGQVNIQKKDNAQAVENLKAAAPLLKSDDGSFARNQYRLGFALLNLKRNVEAKEA
Ga0307469_1019769623300031720Hardwood Forest SoilAALLARFAQQFPDSPYANPALGVAATSYQQAQNPAKMLEVANGLLAKDPENLGMLLLVSDYYSEKGEQLDKAEAYAKKTIAVLAAAKKPDGVADDQWQQQTSLQKGLALSALGQVNMEKKDNAQAVQNFRAAAPLVKSDAVTYARNQYRLGFALVNLKRMPEAKEAFTQAASVNSPYRSLAQDKLKSFAASKKGSS
Ga0307469_1066914323300031720Hardwood Forest SoilNAVFSGVYQTKDAGKRAALLVRFAQIFADSPYASQALGVAATAYQQAQNAPKMLEVANGLLAKDPSNLGMLLLLSDYYGEKGEQLDKAEAYAKKAVSVLETAKKPDEMTDEQWKTQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRDPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPPRRKAS
Ga0307469_1103476713300031720Hardwood Forest SoilVRAASEKGDVDRLITYGEKAGGIWKRFQDAPAPADASASAWEDQKKRTLESNKDVMTYIQQLVFSGVYQAKDPGKRAALLVRFAQAFADSPYAVQALGVAATSYQQAQNAPKMLEVANGVLAKDPNNLGMLLLLSDYYGEKGEQLDKAEAYAKKAVAGLETAPKPEGVTDEQWAKQKSIQKGLALSSLGQVNIQKKDNAQAVTNLKAAAPLLKPDDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKAL
Ga0307469_1165349013300031720Hardwood Forest SoilFSGVYQTKDAGKRAALLVRFAQIFPDSPYANSALGVAATAYQQAQNAPKMLEVANGLLTKDPNNLGMLLLLSDYYGERGEQLDKAEAYAKKAVSVLETAKKPDEMTDDQWKLQSGLQKGLALSSLGQINIQKKDNAQAVTNFRAAAPLLKPDDGSYARNQYRLGFALLNLKRNPEAKDAFTQAASVNSPYKGPAQEKLKAMAAPP
Ga0307477_1020602933300031753Hardwood Forest SoilNQALGVAAASYRQTQSTAKMLSVANDLLAKDPDNLGMVLLLADYYSEKGEQLDKAEAYAKKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTTANKS
Ga0307475_1013161633300031754Hardwood Forest SoilAYRTTDAAKRAGLLVRFAQLFPDSPFANQALGVAAASYRQTQSTAKMLSVANDLLAKDPDNLGMVLLLADYYSEKGEQLDKAEAYAKKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTTANKS
Ga0307473_1116353013300031820Hardwood Forest SoilDVDRLSGYGEKASGILKRFQEAPPPSGMSPEVWQQHRTQALAENKDTIAYLQQVLYNGFYQSPVLAKDPGKRAALLARFAQQFPDSPYANPALGVAATSYQQAQNPAKMLEVANGLLAKDPENLGMLLLVSDYYSEKGEQLDKAEAYAKKTIAVLAAAKKPDGVADDQWQQQTSLQKGLALSALGQVNME
Ga0307478_1010826633300031823Hardwood Forest SoilKRAAYLARFGQAFPDSPYAIRALGMAAAAYQQAQNVPKMLEVANGLLAKDPNNVGMLLLLSDYYSEKGEQLDKAEAYAKKAIPLLEAGQKPEGMTDEQWVKQKALQKGLALSSLGQVNIQKKDNAQATENFKAAAPLLKSDDGSYARNQYRLGFALLNLKKNPEAKEAFTQAASVNSPYKALAQDKLKAMATPAHRKPS
Ga0307478_1072580413300031823Hardwood Forest SoilQNALNMIRAASEKGDPDRLMTYAEKAGGIWTRYKASPAPAGTAESAWADQKARTLDANKDSMTYMQQAVFGGVYQAKDPAKCAAMLAKFAQIFPDSPYANQALGVAATAYQQAQNVPKMLEVANGLLAKDPNNLGMLLLLTDYYCDKPDQLAKAETYAKKVISLVDAAPKPEGLTDDQWAQQKGLQKGLALSSLGQVNIEKKDNAQAVDNLKAAAPLVRVDDGSYARNQYRLGFALVNLKRNAEAKDAFTQAASVNSPYKALAQDKLKGLAAPATAR
Ga0307478_1110176823300031823Hardwood Forest SoilMLSVANDLLAKDPDNLGMVLLLADYYSEKGEQLDKAEAYAKKSVALLETAKKPDGVTDEQWKQQSALQKGLALSALGQVNIEKKNNAQAVDNFKAAAPLLKSDEGSYARNQYRLGFALLNLKRVPEAKAAFTDAASVNSPYKALAQEKLKALPGTTANKS
Ga0306926_1107804813300031954SoilSGVYQAKDPAKRAALLVRFAQAFPDSAFANQALGVAATAYQQAQNTTKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0310911_1037940813300032035SoilEKAGAIVQRYKAAAAPEGTSAAAWDEQKTRALEANKDSYTYVQQLVFSGVYQAKDPAKRAALLVRFAQAFPDSAFANQALGVAATAYQQAQNTTKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0306924_1031720333300032076SoilYDKAFEYGERVFALDRDSFQNAINMVRASAEKGDVDRLISYGEKAGAIVQRYKAAAAPEGTSAAAWDEQKTRALEANKDSYTYVQQLVFSGVYQAKDPAKRAALLVRFAQAFPDSAFANQALGVAATAYQQAQNTTKMLEVANGLLAKDPNDVGMLLLLSDYYGEKGEQLDKAETYAKKAISTLERAAKPEGVSNEQWTQQKALQKGLALSSLGQIDIEKKENAQAVENLKAAAPLVKADDGSYARNQYRLGFALLNLKKNAEAKEAFTQAASVNSPYKALAQEKLKGLAAPARRKAS
Ga0306920_10287413713300032261SoilKTRTLEANKENYTYIQQLVFNGALQAKDPAKRAALFVKFSELFPDSPYAVQAVGVAATSYQQAQNAPKMLEVANGLLAKDPNNLGMLLLLSDYYCDKPDQLDKAATYAKKAITFLDTAPKPEGATDEQWTQQKALQKGLALSSLGQVNIEKKDNAQAVDNLKTAAPLVKADDGSYARNQYRLGFALLNLKRNAEAKDAFTQAASVNSPYKAMALDKL
Ga0335082_1092103813300032782SoilAGYLTRFAQAFPDSPYAAPAMGVAATAYQQAQNPAKMLEVANSLLAKDPNNLGMLLLLSDYFGQKGEQLPKAEEYAKRAATVVDGLKKPDDVPEDQWQKQTSLQKGLALSSLGQVNLQKKDNASAIQNFRAAAPLLKSESSSFGRNQYLLGFALLNLKKIAEARAALTEAAAAKGPYQSLAQDKLKTLPPAPSTKKRTS


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.